Commit Graph

6111 Commits

Author SHA1 Message Date
Maciej Bielski
c396367411 feature: Record&Replay support for appendLaunchKernel
Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-07-28 12:19:30 +02:00
Compute-Runtime-Validation
4d4a5cc8b3 Revert "fix: create temporary main.cl with kernel source when -g debug flag i...
This reverts commit dfc97c016e.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-07-26 02:45:54 +02:00
Bartosz Dunajski
8156c9d5aa feature: adjust required engines count for aggregated split
Related-To: NEO-14557

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-07-25 15:51:30 +02:00
Mateusz Hoppe
be483491bf feature: extract indirect access buffer major version from ELF's notes
- use version from ELF to create correct layout of implcit args
Related-To: NEO-15574

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-07-25 14:50:49 +02:00
Maciej Bielski
a755ad131d feature: implement KernelMutableState operator=
Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-07-25 14:37:04 +02:00
Brandon Yates
05d57ce28a fix: set EventRead reserved field to zero in debugger
Related-to: HSD-18042688323

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-07-25 13:40:47 +02:00
Pratik Bari
3f241b9fc5 refactor(sysman): Refine Sysman Engine Module
Related-To: NEO-14756

Signed-off-by: Pratik Bari <pratik.bari@intel.com>
2025-07-25 13:07:31 +02:00
Aleksandra Nizio
dfc97c016e fix: create temporary main.cl with kernel source when -g debug flag is used
Implements mechanism in ocloc to generate a temporary main.cl file
containing kernel source code when compiling with the debug (-g) flag
and the source file does not exist. This enables proper source code
annotations in generated assembly for SYCL/online compilation workflows.

Related-To: NEO-11900
Signed-off-by: Aleksandra Nizio <aleksandra.nizio@intel.com>
2025-07-25 12:42:26 +02:00
Compute-Runtime-Validation
51b5eeda09 Revert "refactor: Refine Sysman Engine Module"
This reverts commit b240f92e82.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-07-25 02:40:25 +02:00
Chandio, Bibrak Qamar
aeb85c202c fix: Add more return types to parseUSMType
Related-To: NEO-15604, GSD-11403

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-07-24 21:56:21 +02:00
Mateusz Jablonski
0a347a2d8b refactor: remove not needed code related to vme usage
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-07-24 17:54:51 +02:00
Zbigniew Zdanowicz
cdc7be1518 performance: limit number of queries when getting usm allocation data
Related-To: NEO-13916

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-24 17:35:33 +02:00
shubham kumar
4011006a3a fix: Handle multiple device interface paths for windows PMT init
Related-To: NEO-15620

Signed-off-by: shubham kumar <shubham.kumar@intel.com>
2025-07-24 16:57:35 +02:00
Zbigniew Zdanowicz
f4d238b281 performance: move main mutation object instead of copy and destruct
Related-To: NEO-13916

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-24 15:54:55 +02:00
Dominik Dabek
a8553a55c3 fix: disable usm pooling on non-hw
Usm pooling does not work with saving/downloading whole allocations on
non-hw csrs.

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-07-24 15:04:57 +02:00
Young Jin Yoon
14ea40f85d test: fix staging buffer fixture
Fix the test fixture for staging buffer logic to respect
additional blit properties setting, as implemented in the
isValidForStagingTransfer function of cmdlist_hw_immediate.inl.

Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-07-24 14:52:32 +02:00
Young Jin Yoon
4e090ad562 test: modify offload tests to use copy csr
Fixed some of copy offload tests that are designed with copy
command stream receiver to not to use additional blit
properties.

Related-To: NEO-7067
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-07-24 14:35:07 +02:00
Chodor, Jaroslaw
656a4b1b38 test: Refactoring Record&Replay mocks
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-24 13:22:43 +02:00
Pratik Bari
b240f92e82 refactor: Refine Sysman Engine Module
Related-To: NEO-14756

Signed-off-by: Pratik Bari <pratik.bari@intel.com>
2025-07-24 12:48:10 +02:00
Mateusz Jablonski
87ed4728f2 fix: add if guards for API definition to avoid future conflicts
Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-07-24 12:38:48 +02:00
Mateusz Jablonski
bbafd20b6a test: add immediate arg to zello_arg_slm black box test
the test shows example of setting immediate arg
using zeCommandListAppendLaunchKernelWithArguments

Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-07-24 12:31:50 +02:00
Zbigniew Zdanowicz
7bcce38ae4 performance: split mutation data into event and kernel for better locality
Related-To: NEO-13916

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-24 11:51:13 +02:00
Mateusz Hoppe
575fe7fe33 fix: remove local dispatch support
Related-To: NEO-14561

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-07-24 10:31:18 +02:00
Bartosz Dunajski
e3e1ae0def refactor: helper function to set bcs split transfer direction
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-07-24 10:31:03 +02:00
Matias Cabral
93195e3501 feature: support for Metric Scopes
Resolves: NEO-15136

Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>
2025-07-23 21:42:48 +02:00
Szymon Morek
6526a7fb66 Revert "performance: enable staging copy in L0"
This reverts commit 57e4cc025f.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-07-23 17:32:05 +02:00
shubham kumar
28dd2d36ff fix: Send proper buffer size to EUSS open stream escape call
Related-To: NEO-15590

Signed-off-by: shubham kumar <shubham.kumar@intel.com>
2025-07-23 13:00:59 +02:00
Mateusz Jablonski
df7e114d54 fix: correct global device synchronization
skip uninitialized command stream receivers

Additionally, add L0 black box for kernel with arg slm

Test verifies new simplified L0 API

Functions:
- zerDriverGetDefaultContext
- zeDeviceSynchronize
- zeCommandListAppendLaunchKernelWithArguments
- zerIdentifierTranslateToDeviceHandle
- zerDeviceTranslateToIdentifier
- zerDriverGetLastErrorDescription

Definitions:
- defaultCommandQueueDesc
- defaultHostMemDesc
- defaultDeviceMemDesc

Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-07-23 12:39:29 +02:00
Compute-Runtime-Validation
357fd3108e Revert "refactor: move surface state programming method to base class"
This reverts commit 5893fb08fb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-07-23 04:37:46 +02:00
Compute-Runtime-Validation
321f285fd9 Revert "fix: correct global device synchronization"
This reverts commit 8bdc479fe7.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-07-22 21:34:43 +02:00
Zbigniew Zdanowicz
5893fb08fb refactor: move surface state programming method to base class
Related-To: NEO-15374

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-22 19:09:03 +02:00
Mateusz Jablonski
dedfcae377 test: enable output validation in zello_printf
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-07-22 15:13:27 +02:00
Oskar Hubert Weber
9055ae8e54 fix: skip teardown clean up when terminating process on Windows
Per https://learn.microsoft.com/en-us/windows/win32/dlls/dllmain,
it's not safe to clean up resources in DllMain when
fdwReason == DLL_PROCESS_DETACH and lpvReserved != NULL (the process is
terminating), e.g. can result in a hang in some cases.

Related-To: NEO-14121

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2025-07-22 15:13:11 +02:00
Mateusz Hoppe
b17ed79618 fix: update implicitArgs versions
- fix layout of implicit args
- add enqueued local size and sync buffer ptr

Related-To: NEO-15160

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-07-22 15:12:55 +02:00
Mateusz Jablonski
09ee9bf093 feature: expose DDI Handles extension by default
Related-To: NEO-13917, NEO-13121
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-07-22 13:13:57 +02:00
Bartosz Dunajski
e000167b91 documentation: update regular cmd list with CB Events usage pattern
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-07-22 12:14:19 +02:00
Bartosz Dunajski
1f734d6f6a refactor: add helper to set default split mode
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-07-22 11:51:48 +02:00
Chodor, Jaroslaw
42ab9b5829 feature: Record&Replay support for queries
Related-To: NEO-15377

Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-22 10:48:55 +02:00
Tomasz Biernacik
2da77209dc refactor: encapsulate condition
Related-To: NEO-14844

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-07-22 10:43:39 +02:00
Szymon Morek
57e4cc025f performance: enable staging copy in L0
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-07-22 07:47:38 +02:00
Mateusz Jablonski
8bdc479fe7 fix: correct global device synchronization
skip uninitialized command stream receivers

Additionally, add L0 black box for kernel with arg slm

Test verifies new simplified L0 API

Functions:
- zerDriverGetDefaultContext
- zeDeviceSynchronize
- zeCommandListAppendLaunchKernelWithArguments
- zerIdentifierTranslateToDeviceHandle
- zerDeviceTranslateToIdentifier
- zerDriverGetLastErrorDescription

Definitions:
- defaultCommandQueueDesc
- defaultHostMemDesc
- defaultDeviceMemDesc

Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-07-21 19:51:49 +02:00
Zbigniew Zdanowicz
86528a10a3 performance: change mutable structures for better browse and access patterns
Related-To: NEO-13916

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-21 17:21:02 +02:00
Michal Mrozek
773248dc45 performance: remove redundant vector in mcl
Related-To: NEO-15498

Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-07-21 14:43:03 +02:00
Michal Mrozek
aa8ab38156 performance: remove redundant vector
Related-To: NEO-15498
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-07-21 12:54:22 +02:00
Szymon Morek
86b5660c23 performance: introduce staging copy in L0
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-07-21 08:50:43 +02:00
Chodor, Jaroslaw
74f04acb88 feature: Record&Replay support for additional API calls
Related-To: NEO-15373

Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-19 02:56:47 +02:00
Zbigniew Zdanowicz
259883c9c7 performance: do not capture and mutate stateful arguments in default mode
Related-To: NEO-13916

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-18 13:25:47 +02:00
Zbigniew Zdanowicz
0af70b2e18 performance: do not copy host view per-thread data at kernel isa mutation
- per-thread data will be recreated at mandatory group size mutation

Related-To: NEO-13916

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-18 12:48:46 +02:00
Zbigniew Zdanowicz
95dedbbcd8 performance: limit copying compute walker from host view at kernel mutation
- at kernel mutation copy post sync and inline data
- at mandatory group count/size mutation copy dispatch data

Related-To: NEO-13916

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-18 12:42:04 +02:00
Eric Mortensen
69f2cda2fa fix: add error checks & return to unMapVirtualMem
Related-To: NEO-8566

Signed-off-by: Eric Mortensen <eric.mortensen@intel.com>
2025-07-17 23:53:19 +02:00