3643 Commits

Author SHA1 Message Date
Jakub Nowacki
a23647da15 feature: expose event_query_timestamps extension
Related-To: NEO-11561

Signed-off-by: Jakub Nowacki <jakub.nowacki@intel.com>
2025-07-31 10:51:26 +02:00
Jakub Nowacki
781b6bbee2 feature: expose sub_allocations extension
Related-To: NEO-11561

Signed-off-by: Jakub Nowacki <jakub.nowacki@intel.com>
2025-07-30 14:31:09 +02:00
Joshua Santosh Ranjan
280239ebec feature: add support to query metrics hw buffer size
Related-To: NEO-13439

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2025-07-30 13:19:19 +02:00
Compute-Runtime-Validation
705dc02f06 Revert "feature: Use zesInit on windows always"
This reverts commit 99ee605410.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-07-30 04:38:02 +02:00
Bartosz Dunajski
d693daf339 fix: fallback copy offload to compute if max copy fill pattern exceeded
Related-To: GSD-11436

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-07-29 17:00:50 +02:00
Jakub Nowacki
b522da6532 feature: expose eu_count l0 extension
Related-To: NEO-11561

Signed-off-by: Jakub Nowacki <jakub.nowacki@intel.com>
2025-07-29 14:08:45 +02:00
shubham kumar
99ee605410 feature: Use zesInit on windows always
Related-To: NEO-15060

Signed-off-by: shubham kumar <shubham.kumar@intel.com>
2025-07-29 12:34:02 +02:00
Matias Cabral
b7f9d55118 feature: support excluded metrics from calc op
Resolves: NEO-15492

Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>
2025-07-29 05:16:29 +02:00
Mateusz Jablonski
1c1028a57c refactor: remove not needed constant
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-07-28 17:31:46 +02:00
Kindracki, Jakub Tomasz
a73d5724e3 feature: Return insufficient permissions when driver device cannot be loaded
Return ZE_RESULT_ERROR_INSUFFICIENT_PERMISSIONS when user doesn't have
permissions to access the GPU driver device file.

Related-To: NEO-8682
Signed-off-by: Kindracki, Jakub Tomasz <jakub.tomasz.kindracki@intel.com>
2025-07-28 16:31:07 +02:00
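A minimal sketch of the behavior described in a73d5724e3, assuming the permission problem is detected from the errno left by a failed open of the device file. The commit does not say how detection works, so that part is an assumption. The enum and function names below are hypothetical and are not the driver's API; the real ZE_RESULT_ERROR_INSUFFICIENT_PERMISSIONS value comes from the Level Zero headers.

```cpp
#include <cerrno>

// Hypothetical result codes for illustration only; the real ze_result_t
// values are defined in the Level Zero headers.
enum class OpenResultSketch { success, insufficientPermissions, otherFailure };

// Maps the errno of an attempted open of the driver device file to a result.
// Assumption: EACCES and EPERM are the errno values that signal missing
// permissions; every other failure is reported generically.
inline OpenResultSketch mapOpenErrno(int err) {
    switch (err) {
    case 0:
        return OpenResultSketch::success;
    case EACCES:
    case EPERM:
        return OpenResultSketch::insufficientPermissions;
    default:
        return OpenResultSketch::otherFailure;
    }
}
```

A caller would pass the errno captured right after the failed open, so a later library call cannot overwrite it.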
Mateusz Jablonski
05cf05e10e refactor: remove redundant tracking of media sampler programming
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-07-28 15:17:46 +02:00
Bartosz Dunajski
42fbe388fb refactor: simplify bcs split code
Related-To: NEO-14557

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-07-28 13:25:56 +02:00
Maciej Bielski
c396367411 feature: Record&Replay support for appendLaunchKernel
Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-07-28 12:19:30 +02:00
Bartosz Dunajski
8156c9d5aa feature: adjust required engines count for aggregated split
Related-To: NEO-14557

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-07-25 15:51:30 +02:00
Mateusz Hoppe
be483491bf feature: extract indirect access buffer major version from ELF's notes
- use version from ELF to create correct layout of implicit args
Related-To: NEO-15574

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-07-25 14:50:49 +02:00
Maciej Bielski
a755ad131d feature: implement KernelMutableState operator=
Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-07-25 14:37:04 +02:00
Chandio, Bibrak Qamar
aeb85c202c fix: Add more return types to parseUSMType
Related-To: NEO-15604, GSD-11403

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-07-24 21:56:21 +02:00
Mateusz Jablonski
0a347a2d8b refactor: remove not needed code related to vme usage
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-07-24 17:54:51 +02:00
Zbigniew Zdanowicz
cdc7be1518 performance: limit number of queries when getting usm allocation data
Related-To: NEO-13916

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-24 17:35:33 +02:00
Zbigniew Zdanowicz
f4d238b281 performance: move main mutation object instead of copy and destruct
Related-To: NEO-13916

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-24 15:54:55 +02:00
Dominik Dabek
a8553a55c3 fix: disable usm pooling on non-hw
Usm pooling does not work with saving/downloading whole allocations on
non-hw csrs.

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-07-24 15:04:57 +02:00
Zbigniew Zdanowicz
7bcce38ae4 performance: split mutation data into event and kernel for better locality
Related-To: NEO-13916

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-24 11:51:13 +02:00
Mateusz Hoppe
575fe7fe33 fix: remove local dispatch support
Related-To: NEO-14561

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-07-24 10:31:18 +02:00
Bartosz Dunajski
e3e1ae0def refactor: helper function to set bcs split transfer direction
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-07-24 10:31:03 +02:00
Matias Cabral
93195e3501 feature: support for Metric Scopes
Resolves: NEO-15136

Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>
2025-07-23 21:42:48 +02:00
Szymon Morek
6526a7fb66 Revert "performance: enable staging copy in L0"
This reverts commit 57e4cc025f.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-07-23 17:32:05 +02:00
Mateusz Jablonski
df7e114d54 fix: correct global device synchronization
skip uninitialized command stream receivers

Additionally, add L0 black box for kernel with arg slm

Test verifies new simplified L0 API

Functions:
- zerDriverGetDefaultContext
- zeDeviceSynchronize
- zeCommandListAppendLaunchKernelWithArguments
- zerIdentifierTranslateToDeviceHandle
- zerDeviceTranslateToIdentifier
- zerDriverGetLastErrorDescription

Definitions:
- defaultCommandQueueDesc
- defaultHostMemDesc
- defaultDeviceMemDesc

Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-07-23 12:39:29 +02:00
Compute-Runtime-Validation
357fd3108e Revert "refactor: move surface state programming method to base class"
This reverts commit 5893fb08fb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-07-23 04:37:46 +02:00
Compute-Runtime-Validation
321f285fd9 Revert "fix: correct global device synchronization"
This reverts commit 8bdc479fe7.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-07-22 21:34:43 +02:00
Zbigniew Zdanowicz
5893fb08fb refactor: move surface state programming method to base class
Related-To: NEO-15374

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-22 19:09:03 +02:00
Oskar Hubert Weber
9055ae8e54 fix: skip teardown clean up when terminating process on Windows
Per https://learn.microsoft.com/en-us/windows/win32/dlls/dllmain,
it's not safe to clean up resources in DllMain when
fdwReason == DLL_PROCESS_DETACH and lpvReserved != NULL (the process is
terminating); doing so can result in a hang in some cases.

Related-To: NEO-14121

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2025-07-22 15:13:11 +02:00
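The rule cited in 9055ae8e54 can be sketched as a small decision function. This is a portable illustration, not the driver's code: the constant and all names are local stand-ins, and a real DllMain would take DLL_PROCESS_DETACH and the lpvReserved argument from <windows.h>.

```cpp
#include <cstdint>

// Local stand-in so the sketch compiles on any platform; it has the same
// value as DLL_PROCESS_DETACH in the Windows headers.
constexpr uint32_t kProcessDetachSketch = 0;

// Hypothetical outcome of the detach check.
enum class DetachAction { none, fullCleanup, skipCleanup };

// `reserved` mirrors DllMain's lpvReserved argument. On process detach it is
// non-null when the process is terminating and null when the DLL is being
// unloaded (for example through FreeLibrary).
inline DetachAction decideDetachAction(uint32_t reason, const void *reserved) {
    if (reason != kProcessDetachSketch) {
        return DetachAction::none; // attach or thread notifications: nothing to tear down
    }
    // Process is terminating: skip teardown cleanup. Otherwise the DLL is
    // being unloaded, so resources can be released normally.
    return reserved != nullptr ? DetachAction::skipCleanup : DetachAction::fullCleanup;
}
```

Keeping the decision in a plain function means it can be unit tested without loading a DLL.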
Mateusz Hoppe
b17ed79618 fix: update implicitArgs versions
- fix layout of implicit args
- add enqueued local size and sync buffer ptr

Related-To: NEO-15160

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-07-22 15:12:55 +02:00
Mateusz Jablonski
09ee9bf093 feature: expose DDI Handles extension by default
Related-To: NEO-13917, NEO-13121
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-07-22 13:13:57 +02:00
Bartosz Dunajski
1f734d6f6a refactor: add helper to set default split mode
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-07-22 11:51:48 +02:00
Tomasz Biernacik
2da77209dc refactor: encapsulate condition
Related-To: NEO-14844

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-07-22 10:43:39 +02:00
Szymon Morek
57e4cc025f performance: enable staging copy in L0
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-07-22 07:47:38 +02:00
Mateusz Jablonski
8bdc479fe7 fix: correct global device synchronization
skip uninitialized command stream receivers

Additionally, add L0 black box for kernel with arg slm

Test verifies new simplified L0 API

Functions:
- zerDriverGetDefaultContext
- zeDeviceSynchronize
- zeCommandListAppendLaunchKernelWithArguments
- zerIdentifierTranslateToDeviceHandle
- zerDeviceTranslateToIdentifier
- zerDriverGetLastErrorDescription

Definitions:
- defaultCommandQueueDesc
- defaultHostMemDesc
- defaultDeviceMemDesc

Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-07-21 19:51:49 +02:00
Zbigniew Zdanowicz
86528a10a3 performance: change mutable structures for better browse and access patterns
Related-To: NEO-13916

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-21 17:21:02 +02:00
Michal Mrozek
773248dc45 performance: remove redundant vector in mcl
Related-To: NEO-15498

Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-07-21 14:43:03 +02:00
Michal Mrozek
aa8ab38156 performance: remove redundant vector
Related-To: NEO-15498
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-07-21 12:54:22 +02:00
Szymon Morek
86b5660c23 performance: introduce staging copy in L0
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-07-21 08:50:43 +02:00
Zbigniew Zdanowicz
259883c9c7 performance: do not capture and mutate stateful arguments in default mode
Related-To: NEO-13916

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-18 13:25:47 +02:00
Zbigniew Zdanowicz
0af70b2e18 performance: do not copy host view per-thread data at kernel isa mutation
- per-thread data will be recreated at mandatory group size mutation

Related-To: NEO-13916

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-18 12:48:46 +02:00
Zbigniew Zdanowicz
95dedbbcd8 performance: limit copying compute walker from host view at kernel mutation
- at kernel mutation copy post sync and inline data
- at mandatory group count/size mutation copy dispatch data

Related-To: NEO-13916

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-18 12:42:04 +02:00
Eric Mortensen
69f2cda2fa fix: add error checks & return to unMapVirtualMem
Related-To: NEO-8566

Signed-off-by: Eric Mortensen <eric.mortensen@intel.com>
2025-07-17 23:53:19 +02:00
Maciej Bielski
35ecde70df refactor: extract parts of KernelImp data for easier capturing
`KernelImp` is a virtual class, so capturing the current kernel state
cannot be done by a simple copy-instance of the class. However, this can
be done by extracting the relevant data to a separate class and
capturing a copy of its instance.

Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-07-17 23:00:22 +02:00
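The approach described in 35ecde70df can be sketched as below. All class and member names are hypothetical, chosen for illustration; they are not the actual `KernelImp` members. The idea is only that the state to capture lives in a plain, copyable struct, so a snapshot is an ordinary value copy even though the owning class is polymorphic.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical extracted state: a plain copyable struct with no virtual members.
struct KernelStateSketch {
    std::vector<uint8_t> crossThreadData;
    uint32_t groupSize[3] = {1, 1, 1};
};

// Polymorphic owner of the state; copying an instance through a base
// reference would slice it, so the state is captured separately.
class KernelBaseSketch {
  public:
    virtual ~KernelBaseSketch() = default;
    virtual void setGroupSize(uint32_t x, uint32_t y, uint32_t z) = 0;

    // Capturing is a value copy of the extracted data.
    KernelStateSketch captureState() const { return state; }
    void restoreState(const KernelStateSketch &captured) { state = captured; }
    const KernelStateSketch &currentState() const { return state; }

  protected:
    KernelStateSketch state;
};

class KernelImplSketch : public KernelBaseSketch {
  public:
    void setGroupSize(uint32_t x, uint32_t y, uint32_t z) override {
        state.groupSize[0] = x;
        state.groupSize[1] = y;
        state.groupSize[2] = z;
    }
};

// Returns true when a snapshot is unaffected by later changes to the kernel
// and can be restored afterwards.
inline bool snapshotRoundTrips() {
    KernelImplSketch kernel;
    kernel.setGroupSize(8, 4, 1);
    const KernelStateSketch snapshot = kernel.captureState();
    kernel.setGroupSize(16, 1, 1);
    const bool isolated = snapshot.groupSize[0] == 8 &&
                          kernel.currentState().groupSize[0] == 16;
    kernel.restoreState(snapshot);
    return isolated && kernel.currentState().groupSize[0] == 8;
}
```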
Neil R. Spruit
2f6e4d36da fix: Correct all L0 handle definitions to properly handle the DDI ext
Related-To: NEO-15570

- Fixes missing handle definitions required for DDI Ext compliance and
incorrect definitions of several handles that require handle
translation.

Signed-off-by: Neil R. Spruit <neil.r.spruit@intel.com>
2025-07-17 19:13:51 +02:00
Grochowski, Stanislaw
e652679b95 fix: correct pollForCompletion in aub on program termination
Related-To: NEO-14867

Signed-off-by: Grochowski, Stanislaw <stanislaw.grochowski@intel.com>
2025-07-17 12:59:00 +02:00
Compute-Runtime-Validation
4c929fcc84 Revert "feature: support for Metric Scopes"
This reverts commit e92ac1c8ab.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-07-17 04:29:44 +02:00
Chodor, Jaroslaw
fcc2038def refactor: modernizing NEO::Range->std::span
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-17 01:27:11 +02:00