Jakub Nowacki
a23647da15
feature: expose event_query_timestamps extension
...
Related-To: NEO-11561
Signed-off-by: Jakub Nowacki <jakub.nowacki@intel.com >
2025-07-31 10:51:26 +02:00
Jakub Nowacki
781b6bbee2
feature: expose sub_allocations extension
...
Related-To: NEO-11561
Signed-off-by: Jakub Nowacki <jakub.nowacki@intel.com >
2025-07-30 14:31:09 +02:00
Joshua Santosh Ranjan
280239ebec
feature: add support to query metrics hw buffer size
...
Related-To: NEO-13439
Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com >
2025-07-30 13:19:19 +02:00
Compute-Runtime-Validation
705dc02f06
Revert "feature: Use zesInit on windows always"
...
This reverts commit 99ee605410 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-07-30 04:38:02 +02:00
Bartosz Dunajski
d693daf339
fix: fallback copy offload to compute if max copy fill pattern exceeded
...
Related-To: GSD-11436
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-07-29 17:00:50 +02:00
Jakub Nowacki
b522da6532
feature: expose eu_count l0 extension
...
Related-To: NEO-11561
Signed-off-by: Jakub Nowacki <jakub.nowacki@intel.com >
2025-07-29 14:08:45 +02:00
shubham kumar
99ee605410
feature: Use zesInit on windows always
...
Related-To: NEO-15060
Signed-off-by: shubham kumar <shubham.kumar@intel.com >
2025-07-29 12:34:02 +02:00
Matias Cabral
b7f9d55118
feature: support excluded metrics from calc op
...
Resolves: NEO-15492
Signed-off-by: Matias Cabral <matias.a.cabral@intel.com >
2025-07-29 05:16:29 +02:00
Mateusz Jablonski
1c1028a57c
refactor: remove not needed constant
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-28 17:31:46 +02:00
Kindracki, Jakub Tomasz
a73d5724e3
feature: Return insufficient permissions when driver device cannot be loaded
...
Return ZE_RESULT_ERROR_INSUFFICIENT_PERMISSIONS when user doesn't have
permissions to access the GPU driver device file.
Related-To: NEO-8682
Signed-off-by: Kindracki, Jakub Tomasz <jakub.tomasz.kindracki@intel.com >
2025-07-28 16:31:07 +02:00
Mateusz Jablonski
05cf05e10e
refactor: remove redundant tracking of media sampler programming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-28 15:17:46 +02:00
Bartosz Dunajski
42fbe388fb
refactor: simplify bcs split code
...
Related-To: NEO-14557
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-07-28 13:25:56 +02:00
Maciej Bielski
c396367411
feature: Record&Replay support for appendLaunchKernel
...
Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2025-07-28 12:19:30 +02:00
Bartosz Dunajski
8156c9d5aa
feature: adjust required engines count for aggregated split
...
Related-To: NEO-14557
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-07-25 15:51:30 +02:00
Mateusz Hoppe
be483491bf
feature: extract indirect access buffer major version from ELF's notes
...
- use version from ELF to create correct layout of implcit args
Related-To: NEO-15574
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-07-25 14:50:49 +02:00
Maciej Bielski
a755ad131d
feature: implement KernelMutableState operator=
...
Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2025-07-25 14:37:04 +02:00
Chandio, Bibrak Qamar
aeb85c202c
fix: Add more return types to parseUSMType
...
Related-To: NEO-15604, GSD-11403
Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com >
2025-07-24 21:56:21 +02:00
Mateusz Jablonski
0a347a2d8b
refactor: remove not needed code related to vme usage
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-24 17:54:51 +02:00
Zbigniew Zdanowicz
cdc7be1518
performance: limit number of queries when getting usm allocation data
...
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-24 17:35:33 +02:00
Zbigniew Zdanowicz
f4d238b281
performance: move main mutation object instead of copy and destruct
...
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-24 15:54:55 +02:00
Dominik Dabek
a8553a55c3
fix: disable usm pooling on non-hw
...
Usm pooling does not work with saving/downloading whole allocations on
non-hw csrs.
Related-To: NEO-6893
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2025-07-24 15:04:57 +02:00
Zbigniew Zdanowicz
7bcce38ae4
performance: split mutation data into event and kernel for better locality
...
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-24 11:51:13 +02:00
Mateusz Hoppe
575fe7fe33
fix: remove local dispatch support
...
Related-To: NEO-14561
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-07-24 10:31:18 +02:00
Bartosz Dunajski
e3e1ae0def
refactor: helper function to set bcs split transfer direction
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-07-24 10:31:03 +02:00
Matias Cabral
93195e3501
feature: support for Metric Scopes
...
Resolves: NEO-15136
Signed-off-by: Matias Cabral <matias.a.cabral@intel.com >
2025-07-23 21:42:48 +02:00
Szymon Morek
6526a7fb66
Revert "performance: enable staging copy in L0"
...
This reverts commit 57e4cc025f .
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-07-23 17:32:05 +02:00
Mateusz Jablonski
df7e114d54
fix: correct global device synchronization
...
skip uninitialized command stream receivers
Additionally, add L0 black box for kernel with arg slm
Test verifies new simplified L0 API
Functions:
- zerDriverGetDefaultContext
- zeDeviceSynchronize
- zeCommandListAppendLaunchKernelWithArguments
- zerIdentifierTranslateToDeviceHandle
- zerDeviceTranslateToIdentifier
- zerDriverGetLastErrorDescription
Definitions:
- defaultCommandQueueDesc
- defaultHostMemDesc
- defaultDeviceMemDesc
Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-23 12:39:29 +02:00
Compute-Runtime-Validation
357fd3108e
Revert "refactor: move surface state programming method to base class"
...
This reverts commit 5893fb08fb .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-07-23 04:37:46 +02:00
Compute-Runtime-Validation
321f285fd9
Revert "fix: correct global device synchronization"
...
This reverts commit 8bdc479fe7 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-07-22 21:34:43 +02:00
Zbigniew Zdanowicz
5893fb08fb
refactor: move surface state programming method to base class
...
Related-To: NEO-15374
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-22 19:09:03 +02:00
Oskar Hubert Weber
9055ae8e54
fix: skip teardown clean up when terminating process on Windows
...
Per https://learn.microsoft.com/en-us/windows/win32/dlls/dllmain ,
it's not safe to clean up resources in DllMain when
fdwReason == DLL_PROCESS_DETACH and lpvReserved != NULL (the process is
terminating), e.g. can result in a hang in some cases.
Related-To: NEO-14121
Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com >
2025-07-22 15:13:11 +02:00
Mateusz Hoppe
b17ed79618
fix: update implicitArgs versions
...
- fix layout of implicit args
- add enqueued local size and sync buffer ptr
Related-To: NEO-15160
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-07-22 15:12:55 +02:00
Mateusz Jablonski
09ee9bf093
feature: expose DDI Handles extension by default
...
Related-To: NEO-13917, NEO-13121
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-22 13:13:57 +02:00
Bartosz Dunajski
1f734d6f6a
refactor: add helper to set default split mode
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-07-22 11:51:48 +02:00
Tomasz Biernacik
2da77209dc
refactor: encapsulate condition
...
Related-To: NEO-14844
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com >
2025-07-22 10:43:39 +02:00
Szymon Morek
57e4cc025f
performance: enable staging copy in L0
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-07-22 07:47:38 +02:00
Mateusz Jablonski
8bdc479fe7
fix: correct global device synchronization
...
skip uninitialized command stream receivers
Additionally, add L0 black box for kernel with arg slm
Test verifies new simplified L0 API
Functions:
- zerDriverGetDefaultContext
- zeDeviceSynchronize
- zeCommandListAppendLaunchKernelWithArguments
- zerIdentifierTranslateToDeviceHandle
- zerDeviceTranslateToIdentifier
- zerDriverGetLastErrorDescription
Definitions:
- defaultCommandQueueDesc
- defaultHostMemDesc
- defaultDeviceMemDesc
Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-21 19:51:49 +02:00
Zbigniew Zdanowicz
86528a10a3
performance: change mutable structures for better browse and access patterns
...
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-21 17:21:02 +02:00
Michal Mrozek
773248dc45
performance: remove redundant vector in mcl
...
Related-To: NEO-15498
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2025-07-21 14:43:03 +02:00
Michal Mrozek
aa8ab38156
performance: remove redundant vector
...
Related-To: NEO-15498
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2025-07-21 12:54:22 +02:00
Szymon Morek
86b5660c23
performance: introduce staging copy in L0
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-07-21 08:50:43 +02:00
Zbigniew Zdanowicz
259883c9c7
performance: do not capture and mutate stateful arguments in default mode
...
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-18 13:25:47 +02:00
Zbigniew Zdanowicz
0af70b2e18
performance: do not copy host view per-thread data at kernel isa mutation
...
- per-thread data will be recreated at mandatory group size mutation
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-18 12:48:46 +02:00
Zbigniew Zdanowicz
95dedbbcd8
performance: limit copying compute walker from host view at kernel mutation
...
- at kernel mutation copy post sync and inline data
- at mandatory group count/size mutation copy dispatch data
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-18 12:42:04 +02:00
Eric Mortensen
69f2cda2fa
fix: add error checks & return to unMapVirtualMem
...
Related-To: NEO-8566
Signed-off-by: Eric Mortensen <eric.mortensen@intel.com >
2025-07-17 23:53:19 +02:00
Maciej Bielski
35ecde70df
refactor: extract parts of KernelImp data for easier capturing
...
`KernelImp` is virtual class so capturing the current kernel state
cannot be done by a simple copy-instance of the class. However, this can
be done by extracting the relevant data to a separate class and
capturing a copy of its instance.
Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2025-07-17 23:00:22 +02:00
Neil R. Spruit
2f6e4d36da
fix: Correct all L0 handle definitions to properly handle the DDI ext
...
Related-To: NEO-15570
- Fixes for missing handle definitions for DDi Ext compliance and
incorrect definitions for several handle definitions requiring handle
translation.
Signed-off-by: Neil R. Spruit <neil.r.spruit@intel.com >
2025-07-17 19:13:51 +02:00
Grochowski, Stanislaw
e652679b95
fix: correct pollForCompletion in aub on program termination
...
Related-To: NEO-14867
Signed-off-by: Grochowski, Stanislaw <stanislaw.grochowski@intel.com >
2025-07-17 12:59:00 +02:00
Compute-Runtime-Validation
4c929fcc84
Revert "feature: support for Metric Scopes"
...
This reverts commit e92ac1c8ab .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-07-17 04:29:44 +02:00
Chodor, Jaroslaw
fcc2038def
refactor: modernizing NEO::Range->std::span
...
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com >
2025-07-17 01:27:11 +02:00