Zbigniew Zdanowicz
7bcce38ae4
performance: split mutation data into event and kernel for better locality
...
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-24 11:51:13 +02:00
Mateusz Hoppe
575fe7fe33
fix: remove local dispatch support
...
Related-To: NEO-14561
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-07-24 10:31:18 +02:00
Bartosz Dunajski
e3e1ae0def
refactor: helper function to set bcs split transfer direction
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-07-24 10:31:03 +02:00
Matias Cabral
93195e3501
feature: support for Metric Scopes
...
Resolves: NEO-15136
Signed-off-by: Matias Cabral <matias.a.cabral@intel.com >
2025-07-23 21:42:48 +02:00
Szymon Morek
6526a7fb66
Revert "performance: enable staging copy in L0"
...
This reverts commit 57e4cc025f .
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-07-23 17:32:05 +02:00
Mateusz Jablonski
df7e114d54
fix: correct global device synchronization
...
skip uninitialized command stream receivers
Additionally, add L0 black box for kernel with arg slm
Test verifies new simplified L0 API
Functions:
- zerDriverGetDefaultContext
- zeDeviceSynchronize
- zeCommandListAppendLaunchKernelWithArguments
- zerIdentifierTranslateToDeviceHandle
- zerDeviceTranslateToIdentifier
- zerDriverGetLastErrorDescription
Definitions:
- defaultCommandQueueDesc
- defaultHostMemDesc
- defaultDeviceMemDesc
Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-23 12:39:29 +02:00
Compute-Runtime-Validation
357fd3108e
Revert "refactor: move surface state programming method to base class"
...
This reverts commit 5893fb08fb .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-07-23 04:37:46 +02:00
Compute-Runtime-Validation
321f285fd9
Revert "fix: correct global device synchronization"
...
This reverts commit 8bdc479fe7 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-07-22 21:34:43 +02:00
Zbigniew Zdanowicz
5893fb08fb
refactor: move surface state programming method to base class
...
Related-To: NEO-15374
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-22 19:09:03 +02:00
Oskar Hubert Weber
9055ae8e54
fix: skip teardown clean up when terminating process on Windows
...
Per https://learn.microsoft.com/en-us/windows/win32/dlls/dllmain ,
it's not safe to clean up resources in DllMain when
fdwReason == DLL_PROCESS_DETACH and lpvReserved != NULL (the process is
terminating), e.g. can result in a hang in some cases.
Related-To: NEO-14121
Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com >
2025-07-22 15:13:11 +02:00
Mateusz Hoppe
b17ed79618
fix: update implicitArgs versions
...
- fix layout of implicit args
- add enqueued local size and sync buffer ptr
Related-To: NEO-15160
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-07-22 15:12:55 +02:00
Mateusz Jablonski
09ee9bf093
feature: expose DDI Handles extension by default
...
Related-To: NEO-13917, NEO-13121
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-22 13:13:57 +02:00
Bartosz Dunajski
1f734d6f6a
refactor: add helper to set default split mode
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-07-22 11:51:48 +02:00
Tomasz Biernacik
2da77209dc
refactor: encapsulate condition
...
Related-To: NEO-14844
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com >
2025-07-22 10:43:39 +02:00
Szymon Morek
57e4cc025f
performance: enable staging copy in L0
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-07-22 07:47:38 +02:00
Mateusz Jablonski
8bdc479fe7
fix: correct global device synchronization
...
skip uninitialized command stream receivers
Additionally, add L0 black box for kernel with arg slm
Test verifies new simplified L0 API
Functions:
- zerDriverGetDefaultContext
- zeDeviceSynchronize
- zeCommandListAppendLaunchKernelWithArguments
- zerIdentifierTranslateToDeviceHandle
- zerDeviceTranslateToIdentifier
- zerDriverGetLastErrorDescription
Definitions:
- defaultCommandQueueDesc
- defaultHostMemDesc
- defaultDeviceMemDesc
Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-21 19:51:49 +02:00
Zbigniew Zdanowicz
86528a10a3
performance: change mutable structures for better browse and access patterns
...
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-21 17:21:02 +02:00
Michal Mrozek
773248dc45
performance: remove redundant vector in mcl
...
Related-To: NEO-15498
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2025-07-21 14:43:03 +02:00
Michal Mrozek
aa8ab38156
performance: remove redundant vector
...
Related-To: NEO-15498
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2025-07-21 12:54:22 +02:00
Szymon Morek
86b5660c23
performance: introduce staging copy in L0
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-07-21 08:50:43 +02:00
Zbigniew Zdanowicz
259883c9c7
performance: do not capture and mutate stateful arguments in default mode
...
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-18 13:25:47 +02:00
Zbigniew Zdanowicz
0af70b2e18
performance: do not copy host view per-thread data at kernel isa mutation
...
- per-thread data will be recreated at mandatory group size mutation
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-18 12:48:46 +02:00
Zbigniew Zdanowicz
95dedbbcd8
performance: limit copying compute walker from host view at kernel mutation
...
- at kernel mutation copy post sync and inline data
- at mandatory group count/size mutation copy dispatch data
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-18 12:42:04 +02:00
Eric Mortensen
69f2cda2fa
fix: add error checks & return to unMapVirtualMem
...
Related-To: NEO-8566
Signed-off-by: Eric Mortensen <eric.mortensen@intel.com >
2025-07-17 23:53:19 +02:00
Maciej Bielski
35ecde70df
refactor: extract parts of KernelImp data for easier capturing
...
`KernelImp` is virtual class so capturing the current kernel state
cannot be done by a simple copy-instance of the class. However, this can
be done by extracting the relevant data to a separate class and
capturing a copy of its instance.
Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2025-07-17 23:00:22 +02:00
Neil R. Spruit
2f6e4d36da
fix: Correct all L0 handle definitions to properly handle the DDI ext
...
Related-To: NEO-15570
- Fixes for missing handle definitions for DDi Ext compliance and
incorrect definitions for several handle definitions requiring handle
translation.
Signed-off-by: Neil R. Spruit <neil.r.spruit@intel.com >
2025-07-17 19:13:51 +02:00
Grochowski, Stanislaw
e652679b95
fix: correct pollForCompletion in aub on program termination
...
Related-To: NEO-14867
Signed-off-by: Grochowski, Stanislaw <stanislaw.grochowski@intel.com >
2025-07-17 12:59:00 +02:00
Compute-Runtime-Validation
4c929fcc84
Revert "feature: support for Metric Scopes"
...
This reverts commit e92ac1c8ab .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-07-17 04:29:44 +02:00
Chodor, Jaroslaw
fcc2038def
refactor: modernizing NEO::Range->std::span
...
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com >
2025-07-17 01:27:11 +02:00
Matias Cabral
e92ac1c8ab
feature: support for Metric Scopes
...
Resolves: NEO-14753
Signed-off-by: Matias Cabral <matias.a.cabral@intel.com >
2025-07-16 19:30:14 +02:00
Chodor, Jaroslaw
7acb9585af
feature: Adding fork/join support
...
Related-To: NEO-15373
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com >
2025-07-16 17:28:26 +02:00
Mateusz Jablonski
4dc4c45bbb
fix: correct signature of zeCommandListAppendLaunchKernelWithArguments
...
add const to pArguments and pNext types
Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-16 16:25:12 +02:00
Mateusz Jablonski
10dc8a52a8
fix: correct shared cross device alloc caps for sub devices
...
in case of single exposed device, if there are subdevice, then
subdevices may access between each other
Related-To: HSD-18042639532
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-16 15:47:25 +02:00
Compute-Runtime-Validation
a6149fca1a
Revert "fix: Add -emit-lib-compile-errors flag when library compilation is en...
...
This reverts commit c9ebf91271 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-07-16 14:36:11 +02:00
Zbigniew Zdanowicz
ef5efeac0c
refactor: add kernel properties and move implementations to imp class
...
Related-To: NEO-15374
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-15 15:08:44 +02:00
Shreyas Kunder
e4158208ae
feature: Update Metrics Driver Code to use runtime enable & Disable APIs
...
Related-To: NEO-15339
Signed-off-by: Shreyas Kunder <shreyas.kunder@intel.com >
2025-07-15 14:28:49 +02:00
Tomasz Biernacik
2c5cbec033
feature: control post sync completion check
...
Related-To: NEO-14844
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com >
2025-07-15 13:51:19 +02:00
Zbigniew Zdanowicz
087d1ecea4
refactor: use encoders instead direct dispatch commands in mcl experimental
...
Related-To: NEO-10492
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-15 12:52:13 +02:00
Radoslaw Jablonski
be6fe9294c
fix: stop Direct Submission on device teardown
...
This prevents a race condition, causing page faults, where device related
resources are released, but are still in use by commands (e.g. cache flushes)
submitted by the Direct Submission Controller.
Related-To: NEO-15017
Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com >
2025-07-15 12:12:59 +02:00
Zbigniew Zdanowicz
b62813297d
refactor: move add kernel arg stateful buffer to experimental implementation
...
Related-To: NEO-10492
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-15 01:02:26 +02:00
Mateusz Jablonski
8dc24d9e2c
feature: add default command queue descriptor to ze_intel_gpu.h
...
Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-14 16:24:18 +02:00
Zbigniew Zdanowicz
1aefbd709b
performance: reserve space for internal vectors
...
- reserved space avoids vector memory relocations
Related-To: NEO-13916
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-14 13:13:55 +02:00
Zbigniew Zdanowicz
d31366794a
fix: skip and continue kernel isa mutation when same kernel is provided
...
Related-To: NEO-10492
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-11 20:28:51 +02:00
Mateusz Jablonski
762b04cf77
feature: add command queue flag to pass copy offload hint
...
Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-11 19:42:48 +02:00
Mateusz Jablonski
645de5add8
fix: correct shared cross device alloc capabilities
...
allow for access in case of multiple exposed devices when
there is only one root device
Related-To: HSD-18042639532
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-11 18:32:06 +02:00
Zbigniew Zdanowicz
1599e5bafe
feature: add missing experimental mcl api calls
...
Related-To: NEO-10492
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-11 18:23:37 +02:00
Mateusz Jablonski
8f1903c7dd
feature: add zerDriverGetLastErrorDescription API
...
this function retrieves last error from default driver
Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-07-11 13:37:50 +02:00
Aravind Gopalakrishnan
c1c8eb59c0
refactor: Cache context settings during init
...
- cache context related settings once to reuse in subsequent calls
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com >
2025-07-10 12:06:20 +02:00
Aleksandra Nizio
c9ebf91271
fix: Add -emit-lib-compile-errors flag when library compilation is enabled
...
Related-To: NEO-8608
Signed-off-by: Aleksandra Nizio <aleksandra.nizio@intel.com >
2025-07-10 10:48:21 +02:00
Mateusz Hoppe
703497b067
fix: read ONEAPI_PVC_SEND_WAR_WA env
...
- disable optimization with compiler internal option when env is set to
zero
Related-To: NEO-15378, GSD-10884
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-07-09 16:22:18 +02:00