Commit Graph

6888 Commits

Author SHA1 Message Date
Compute-Runtime-Validation
a6b4e61d34 Revert "performance: Remove global fence from command stream on BMG"
This reverts commit f5301ac379.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-29 05:17:21 +02:00
Dominik Dabek
75e313ce28 feature: prepare for l0 usm device pooling
Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-04-28 14:55:03 +02:00
Bartosz Dunajski
c2266fc69e refactor: remove not used CB event TS logic
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-04-28 14:31:34 +02:00
Lukasz Jobczyk
6f4a56d440 refactor: pass product helper to isFenceAllocationRequired
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-28 14:09:02 +02:00
Lukasz Jobczyk
f5301ac379 performance: Remove global fence from command stream on BMG
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-28 14:08:47 +02:00
Michal Mrozek
c41ac0ad4c refactor: remove not needed code
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-04-25 16:44:01 +02:00
Lukasz Jobczyk
02444ac0e6 refactor: Use global fence helper for additional synchronization
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-25 14:44:13 +02:00
Lukasz Jobczyk
97358acabe fix: use mfence instead of sfence on discrete devices
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-25 13:32:25 +02:00
Mateusz Hoppe
41efee1e7c feature: add mode to expose single root device
- simplify API to expose and use only a single root device. Disallow
returning subdevices. This is an experimental mode.

Related-To: NEO-14559

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-04-25 13:20:56 +02:00
Lukasz Jobczyk
dfcdbb20cc fix: Move ULLS semaphore to smem on xe2
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-25 12:54:27 +02:00
Lukasz Jobczyk
a75f1434b0 performance: Store vm bind available in drmCsr
Resolves: GSD-10986

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-24 15:45:05 +02:00
Slawomir Milczarek
a97a66506f refactor: Add ioctl helper to get primary context properties
Related-To: NEO-12952

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-04-24 14:27:25 +02:00
Bartosz Dunajski
71f81c786f fix: correct sysman engine query
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-04-24 13:00:32 +02:00
Lukasz Jobczyk
c1df81c9be refactor: Simplify handles destroy
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-24 11:13:18 +02:00
Szymon Morek
8ec5688ca1 Revert "performance: extend usage of staging buffers"
This reverts commit 6a4ddf5dfb.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-24 11:02:27 +02:00
Slawomir Milczarek
ab1f39f07f refactor: Remove ioctl helper for context destruction
Related-To: NEO-12952

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-04-24 10:53:04 +02:00
Compute-Runtime-Validation
d28fcdb55b Revert "refactor: Simplify handles destroy"
This reverts commit 8776bdba2f.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-24 05:51:18 +02:00
Lukasz Jobczyk
f36e3c7d23 refactor: Disable gem close worker in ULTs
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-23 17:58:18 +02:00
Mateusz Jablonski
a81c1f38d2 refactor: cleanup duplicated NOLINT comments
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-23 15:15:20 +02:00
Lukasz Jobczyk
8776bdba2f refactor: Simplify handles destroy
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-23 14:30:37 +02:00
Maciej Plewka
6d085ed0d4 fix: Return max wg count equal one on platforms without dispatch all support
Second push of a change that was reverted due to errors in tests
Related-To: NEO-14125, GSD-10725

Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-04-23 13:48:11 +02:00
Lukasz Jobczyk
53f0832614 fix: Register shared handle size on windows
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-23 13:00:27 +02:00
Lukasz Jobczyk
5d2b673ef6 fix: Adjust waitpkg threshold to 20 us
Resolves: HSD-13012958900

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-23 10:46:22 +02:00
Compute-Runtime-Validation
30f6ddb6c7 Revert "refactor: deallocate via gmm"
This reverts commit bf861d8d49.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-23 03:50:43 +02:00
Szymon Morek
6a4ddf5dfb performance: extend usage of staging buffers
Related-To: NEO-14026

Move enabling from xe2 to xe2_and_later

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-22 16:46:52 +02:00
Lukasz Jobczyk
bf861d8d49 refactor: deallocate via gmm
Related-To: NEO-13051

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-22 16:46:36 +02:00
Lukasz Jobczyk
d510f742da fix: Pass Deallocate2 callback to gmm
Resolves: HSD-18041541320

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-22 12:08:56 +02:00
Slawomir Milczarek
297d4c99e8 refactor: Add missing const identifier
Related-To: NEO-12952

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-04-22 10:41:28 +02:00
Aravind Gopalakrishnan
b4f95a1a26 feature: Add pidfd option for IPC handle import
Related-To: NEO-10380

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-04-18 13:01:27 +02:00
Fabian Zwoliński
204577a91f fix: disable TimestampPoolAllocator
Related-To: NEO-12287, HSD-13012945876
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-18 11:30:35 +02:00
Compute-Runtime-Validation
fe1db94454 Revert "performance: Remove global fence from command stream on BMG"
This reverts commit 0c778ef6f4.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-18 09:23:16 +02:00
Young Jin Yoon
05c6612386 refactor: add EncodePostSyncArgs to EncodeDispatchKernelArgs
Refactored various member variables in EncodeDispatchKernel to directly
include EncodePostSync
Changed command encoder and command list to use the modified
EncodeDispatchKernel.

Related-To: NEO-13003
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-04-17 20:55:40 +02:00
Slawomir Milczarek
a2d25ca31f feature: Add syscall wrappers for pidfdopen and pidfdgetfd
Related-To: NEO-12952

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-04-17 19:50:39 +02:00
Szymon Morek
3596522637 refactor: remove unused logic in ULLS controller
Related-To: NEO-13843

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-17 18:35:20 +02:00
Oskar Hubert Weber
1816c8eac4 fix: avoid joining thread in deferred deleter
Joining the worker thread in DllMain (which we are not aware of) can
cause a hang in DeferredDeleter: if the library is freed before the
worker thread calls FreeLibraryAndExitThread, the worker thread gets
stuck, so the main thread stays blocked on worker->join().

Related-To: NEO-14121

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2025-04-17 16:17:42 +02:00
Fabian Zwoliński
ee769f5983 fix: Add texture cache flush after kernels writing to images on imm cmdlists
- add `hasImageWriteArg` flag to KernelDescriptor,
based on image access qualifier
- add `isPostImageWriteFlushRequired` method to productHelper,
to require cache flush only on selected platforms
- if possible, add the `textureCacheInvalidationEnable` arg to the
existing PC, otherwise add a new PC

Related-To: NEO-13427, HSD-18041321008
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-17 16:04:31 +02:00
Young Jin Yoon
5a2a792c34 Revert "refactor: add EncodePostSyncArgs to EncodeDispatchKernelArgs"
This reverts commit 40aef1555e.

Related-To: NEO-13003
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-04-17 15:37:05 +02:00
Compute-Runtime-Validation
b96cf55985 Revert "fix: Return max wg count equal one on platforms without dispatch all ...
This reverts commit b8be602bfb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-17 12:10:11 +02:00
Chodor, Jaroslaw
a855ce019e refactor: pass underlying allocator by ref
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-17 10:43:21 +02:00
Lukasz Jobczyk
d9a40734ae performance: Remove global fence from CW post sync on BMG
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-17 10:23:40 +02:00
Chodor, Jaroslaw
eb58e1f964 feature: Allow NEOReadDebugKeys and PrintDebugMessages envs in ocloc
This enables NEO's NEOReadDebugKeys and PrintDebugMessages in ocloc
in the same scenarios as in NEO OCL/L0 RT

Note:
* NEOReadDebugKeys=1 should not be prefixed with NEO_OCLOC_
* PrintDebugMessages=1 should be prefixed with NEO_OCLOC_
(to distinguish from OCL/L0 RT usage), so the full env
is NEO_OCLOC_PrintDebugMessages=1

Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-16 18:17:15 +02:00
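
The naming rule in the commit message above (eb58e1f964) can be sketched as a small lookup. This is only an illustration, assuming the two keys behave exactly as the note describes; the helper names `ocloc_env_name` and `ocloc_debug_env` are hypothetical and are not part of NEO or ocloc.

```python
# Environment variable names as described in the commit message:
# NEOReadDebugKeys keeps its name, PrintDebugMessages takes the NEO_OCLOC_ prefix.
# The mapping and helpers below are hypothetical, for illustration only.
OCLOC_ENV_NAMES = {
    "NEOReadDebugKeys": "NEOReadDebugKeys",
    "PrintDebugMessages": "NEO_OCLOC_PrintDebugMessages",
}


def ocloc_env_name(key):
    """Return the environment variable name to set for a debug key when running ocloc."""
    return OCLOC_ENV_NAMES[key]


def ocloc_debug_env(base_env):
    """Return a copy of base_env with both debug keys set to "1" for ocloc."""
    env = dict(base_env)
    for key in OCLOC_ENV_NAMES:
        env[ocloc_env_name(key)] = "1"
    return env
```

The resulting dictionary could be passed as the `env` argument of `subprocess.run` when launching ocloc; the ocloc command line itself is left out here because the commit message does not specify one.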
Young Jin Yoon
40aef1555e refactor: add EncodePostSyncArgs to EncodeDispatchKernelArgs
Refactored various member variables in EncodeDispatchKernel to directly
include EncodePostSync
Changed command encoder and command list to use the modified
EncodeDispatchKernel.

Related-To: NEO-13003
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-04-16 17:46:07 +02:00
Lukasz Jobczyk
0c778ef6f4 performance: Remove global fence from command stream on BMG
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-16 16:01:05 +02:00
Compute-Runtime-Validation
331b515478 Revert "fix: avoid joining thread in deferred deleter"
This reverts commit 38a3667304.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-16 15:18:27 +02:00
Szymon Morek
4d60465a9e fix: don't flush texture cache if not required
Related-To: NEO-14645

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-16 10:38:24 +02:00
Chodor, Jaroslaw
b423084861 refactor: Change FinalizerInputType debug var to string
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-16 10:33:19 +02:00
Chodor, Jaroslaw
7744251995 refactor: explicit copy/move operations in OpaqueElementAllocator
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-16 10:10:05 +02:00
Oskar Hubert Weber
38a3667304 fix: avoid joining thread in deferred deleter
Joining the worker thread in DllMain (which we are not aware of) can
cause a hang in DeferredDeleter: if the library is freed before the
worker thread calls FreeLibraryAndExitThread, the worker thread gets
stuck, so the main thread stays blocked on worker->join().

Related-To: NEO-14121

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2025-04-16 10:02:10 +02:00
Mateusz Jablonski
f6b92aa5d5 build: update default PTL revision to B0
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-16 09:44:58 +02:00
Tomasz Biernacik
eff2d1cde8 performance: change usage for semaphoreBuffer on integrated devices
Related-To: NEO-9421

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-04-16 09:07:05 +02:00