Zbigniew Zdanowicz
841267ecbd
feature: save command buffer gpu address for front end command in command list
...
Related-To: NEO-15376
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-08-05 09:59:08 +02:00
Mateusz Jablonski
ee5cd678ea
fix: correct naming of zer functions
...
zerDriverGetDefaultContext -> zerGetDefaultContext
zerDriverGetLastErrorDescription -> zerGetLastErrorDesription
zerDeviceTranslateToIdentifier -> zerTranslateDeviceHandleToIdentifier
zerIdentifierTranslateToDeviceHandle -> zerTranslateIdentifierToDeviceHandle
Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-08-04 15:17:04 +02:00
Mateusz Hoppe
b7a73653a2
fix: set sync buffer ptr in implicit args
...
Related-To: NEO-15737
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-08-04 14:22:46 +02:00
Mateusz Jablonski
dbc00d623a
refactor: allow mocking event completion timeout in ULT
...
fix timing issue in L0 ULT
Related-To: NEO-15354
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-08-04 13:00:21 +02:00
Matias Cabral
1b7a26996e
refactor: renaming of metrics calcOp related structures
...
Resolves: NEO-15139
Signed-off-by: Matias Cabral <matias.a.cabral@intel.com >
2025-08-04 11:09:28 +02:00
Compute-Runtime-Validation
566b6551a5
Revert "fix: correct naming of zer functions"
...
This reverts commit 91a9ccaebe .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-08-02 02:09:08 +02:00
Zbigniew Zdanowicz
01889c97a5
feature: add patch preamble to level zero queue handling bb_start commands
...
Related-To: NEO-15376
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-08-01 14:50:56 +02:00
Mateusz Jablonski
91a9ccaebe
fix: correct naming of zer functions
...
zerDriverGetDefaultContext -> zerGetDefaultContext
zerDriverGetLastErrorDescription -> zerGetLastErrorDesription
zerDeviceTranslateToIdentifier -> zerTranslateDeviceHandleToIdentifier
zerIdentifierTranslateToDeviceHandle -> zerTranslateIdentifierToDeviceHandle
Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-08-01 14:25:20 +02:00
Maciej Bielski
a0231c2d22
feature: allow for cloning KernelImp instances
...
From the `cloneWithStateOverride()` perspecitve the `KernelImp` has
two logical parts handled differently:
- `KernelMutableState`: separate for both regular and cloned instances,
may be optionally overriden when cloning
- rest: separate for regular instances but shared with all clones
(shallow copy), the original instance owns these resources
Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2025-08-01 11:51:45 +02:00
Joshua Santosh Ranjan
2fbe207328
refactor: move subdevice handling to metrics
...
Related-To: NEO-15516
Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com >
2025-08-01 11:03:23 +02:00
Michal Mrozek
9522320275
performance: improve driver getter
...
- make it final to optimize virtual calls
- move to header to allow to inline
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2025-08-01 08:33:18 +02:00
Mateusz Jablonski
abb00a5ce3
fix: ensure thread-safety in zeDeviceSynchronize
...
get task count and flush stamp within single critical section
Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-31 15:59:57 +02:00
Jakub Nowacki
a23647da15
feature: expose event_query_timestamps extension
...
Related-To: NEO-11561
Signed-off-by: Jakub Nowacki <jakub.nowacki@intel.com >
2025-07-31 10:51:26 +02:00
Jakub Nowacki
781b6bbee2
feature: expose sub_allocations extension
...
Related-To: NEO-11561
Signed-off-by: Jakub Nowacki <jakub.nowacki@intel.com >
2025-07-30 14:31:09 +02:00
Joshua Santosh Ranjan
280239ebec
feature: add support to query metrics hw buffer size
...
Related-To: NEO-13439
Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com >
2025-07-30 13:19:19 +02:00
Compute-Runtime-Validation
705dc02f06
Revert "feature: Use zesInit on windows always"
...
This reverts commit 99ee605410 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-07-30 04:38:02 +02:00
Bartosz Dunajski
d693daf339
fix: fallback copy offload to compute if max copy fill pattern exceeded
...
Related-To: GSD-11436
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-07-29 17:00:50 +02:00
Jakub Nowacki
b522da6532
feature: expose eu_count l0 extension
...
Related-To: NEO-11561
Signed-off-by: Jakub Nowacki <jakub.nowacki@intel.com >
2025-07-29 14:08:45 +02:00
shubham kumar
99ee605410
feature: Use zesInit on windows always
...
Related-To: NEO-15060
Signed-off-by: shubham kumar <shubham.kumar@intel.com >
2025-07-29 12:34:02 +02:00
Matias Cabral
b7f9d55118
feature: support excluded metrics from calc op
...
Resolves: NEO-15492
Signed-off-by: Matias Cabral <matias.a.cabral@intel.com >
2025-07-29 05:16:29 +02:00
Mateusz Jablonski
1c1028a57c
refactor: remove not needed constant
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-28 17:31:46 +02:00
Kindracki, Jakub Tomasz
a73d5724e3
feature: Return insufficient permissions when driver device cannot be loaded
...
Return ZE_RESULT_ERROR_INSUFFICIENT_PERMISSIONS when user doesn't have
permissions to access the GPU driver device file.
Related-To: NEO-8682
Signed-off-by: Kindracki, Jakub Tomasz <jakub.tomasz.kindracki@intel.com >
2025-07-28 16:31:07 +02:00
Mateusz Jablonski
05cf05e10e
refactor: remove redundant tracking of media sampler programming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-28 15:17:46 +02:00
Bartosz Dunajski
42fbe388fb
refactor: simplify bcs split code
...
Related-To: NEO-14557
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-07-28 13:25:56 +02:00
Maciej Bielski
c396367411
feature: Record&Replay support for appendLaunchKernel
...
Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2025-07-28 12:19:30 +02:00
Bartosz Dunajski
8156c9d5aa
feature: adjust required engines count for aggregated split
...
Related-To: NEO-14557
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-07-25 15:51:30 +02:00
Mateusz Hoppe
be483491bf
feature: extract indirect access buffer major version from ELF's notes
...
- use version from ELF to create correct layout of implcit args
Related-To: NEO-15574
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-07-25 14:50:49 +02:00
Maciej Bielski
a755ad131d
feature: implement KernelMutableState operator=
...
Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2025-07-25 14:37:04 +02:00
Chandio, Bibrak Qamar
aeb85c202c
fix: Add more return types to parseUSMType
...
Related-To: NEO-15604, GSD-11403
Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com >
2025-07-24 21:56:21 +02:00
Mateusz Jablonski
0a347a2d8b
refactor: remove not needed code related to vme usage
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-24 17:54:51 +02:00
Zbigniew Zdanowicz
cdc7be1518
performance: limit number of queries when getting usm allocation data
...
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-24 17:35:33 +02:00
Zbigniew Zdanowicz
f4d238b281
performance: move main mutation object instead of copy and destruct
...
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-24 15:54:55 +02:00
Dominik Dabek
a8553a55c3
fix: disable usm pooling on non-hw
...
Usm pooling does not work with saving/downloading whole allocations on
non-hw csrs.
Related-To: NEO-6893
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2025-07-24 15:04:57 +02:00
Zbigniew Zdanowicz
7bcce38ae4
performance: split mutation data into event and kernel for better locality
...
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-24 11:51:13 +02:00
Mateusz Hoppe
575fe7fe33
fix: remove local dispatch support
...
Related-To: NEO-14561
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-07-24 10:31:18 +02:00
Bartosz Dunajski
e3e1ae0def
refactor: helper function to set bcs split transfer direction
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-07-24 10:31:03 +02:00
Matias Cabral
93195e3501
feature: support for Metric Scopes
...
Resolves: NEO-15136
Signed-off-by: Matias Cabral <matias.a.cabral@intel.com >
2025-07-23 21:42:48 +02:00
Szymon Morek
6526a7fb66
Revert "performance: enable staging copy in L0"
...
This reverts commit 57e4cc025f .
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-07-23 17:32:05 +02:00
Mateusz Jablonski
df7e114d54
fix: correct global device synchronization
...
skip uninitialized command stream receivers
Additionally, add L0 black box for kernel with arg slm
Test verifies new simplified L0 API
Functions:
- zerDriverGetDefaultContext
- zeDeviceSynchronize
- zeCommandListAppendLaunchKernelWithArguments
- zerIdentifierTranslateToDeviceHandle
- zerDeviceTranslateToIdentifier
- zerDriverGetLastErrorDescription
Definitions:
- defaultCommandQueueDesc
- defaultHostMemDesc
- defaultDeviceMemDesc
Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-23 12:39:29 +02:00
Compute-Runtime-Validation
357fd3108e
Revert "refactor: move surface state programming method to base class"
...
This reverts commit 5893fb08fb .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-07-23 04:37:46 +02:00
Compute-Runtime-Validation
321f285fd9
Revert "fix: correct global device synchronization"
...
This reverts commit 8bdc479fe7 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-07-22 21:34:43 +02:00
Zbigniew Zdanowicz
5893fb08fb
refactor: move surface state programming method to base class
...
Related-To: NEO-15374
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-22 19:09:03 +02:00
Oskar Hubert Weber
9055ae8e54
fix: skip teardown clean up when terminating process on Windows
...
Per https://learn.microsoft.com/en-us/windows/win32/dlls/dllmain ,
it's not safe to clean up resources in DllMain when
fdwReason == DLL_PROCESS_DETACH and lpvReserved != NULL (the process is
terminating), e.g. can result in a hang in some cases.
Related-To: NEO-14121
Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com >
2025-07-22 15:13:11 +02:00
Mateusz Hoppe
b17ed79618
fix: update implicitArgs versions
...
- fix layout of implicit args
- add enqueued local size and sync buffer ptr
Related-To: NEO-15160
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-07-22 15:12:55 +02:00
Mateusz Jablonski
09ee9bf093
feature: expose DDI Handles extension by default
...
Related-To: NEO-13917, NEO-13121
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-22 13:13:57 +02:00
Bartosz Dunajski
1f734d6f6a
refactor: add helper to set default split mode
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-07-22 11:51:48 +02:00
Tomasz Biernacik
2da77209dc
refactor: encapsulate condition
...
Related-To: NEO-14844
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com >
2025-07-22 10:43:39 +02:00
Szymon Morek
57e4cc025f
performance: enable staging copy in L0
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-07-22 07:47:38 +02:00
Mateusz Jablonski
8bdc479fe7
fix: correct global device synchronization
...
skip uninitialized command stream receivers
Additionally, add L0 black box for kernel with arg slm
Test verifies new simplified L0 API
Functions:
- zerDriverGetDefaultContext
- zeDeviceSynchronize
- zeCommandListAppendLaunchKernelWithArguments
- zerIdentifierTranslateToDeviceHandle
- zerDeviceTranslateToIdentifier
- zerDriverGetLastErrorDescription
Definitions:
- defaultCommandQueueDesc
- defaultHostMemDesc
- defaultDeviceMemDesc
Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-21 19:51:49 +02:00
Zbigniew Zdanowicz
86528a10a3
performance: change mutable structures for better browse and access patterns
...
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-21 17:21:02 +02:00