Commit Graph

4195 Commits

Author SHA1 Message Date
Young Jin Yoon 713df81dbf feature: enable recoverable pagefault
Enabled recoverable pagefault and kmd migration support by
default for bmg and lnl, but disabled kmd migration for XeKMD
in order to disable kmd migration for all platforms using
XeKMD for now.

Related-To: NEO-13116
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-01-21 12:17:04 +01:00
Radoslaw Jablonski be0b2bd06d performance: enable ResolveDependenciesViaPipeControls on PTL Windows
Related-To: NEO-10526

Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com>
2025-01-21 11:46:19 +01:00
Fabian Zwoliński 285414e852 fix: cl_cache - add cm header includes to whitelist
This patch adds following includes to the whitelist:
<cm/cm.h>
<cm/cmtl.h>

Related-To: NEO-13641
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-01-21 10:22:03 +01:00
Dominik Dabek 474b91aa36 fix: move device usm reuse max size to device
Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-01-20 18:05:37 +01:00
Chodor, Jaroslaw 5619da6238 feature: Adding hook for applying OpenVino-related AIL
Related-To: NEO-12747

Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-01-20 17:38:49 +01:00
Andrzej Koska b1bcfe668c feature: Enable Bindless on PTL L0
Related-To: NEO-13791

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
`
2025-01-20 16:18:48 +01:00
Mateusz Hoppe 850e6c02eb test: allocate sip allocation for all contexts in initSipKernel
- some tests initialize sip kernel but do not free it
- this is causing memory leaks
- as a solution mock sip listener is added that will release allocated
sip kernel

Related-To: NEO-13789

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-01-20 15:46:05 +01:00
Fabian Zwoliński bbaca5d8e2 feature: cl_cache - add support for whitelisted includes
Related-To: NEO-13641
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-01-20 15:07:14 +01:00
Zbigniew Zdanowicz 897c890d03 refactor: add thread group count parameter to implicit scaling functions
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-01-20 13:12:45 +01:00
Jitendra Sharma 8f5a0995e0 fix: Dont directly use gtId as TileId
gtId is not same as tileId. instead use gtIdToTileId
to retrieve tileId based on gtId.

Related-To: NEO-11104
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2025-01-20 10:23:53 +01:00
Mateusz Hoppe 1ce795c265 refactor: fixes in ults
Related-To: NEO-13789

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-01-20 09:57:05 +01:00
Lukasz Jobczyk af0bb7964a performance: Set dispatch all for small TG
Resolves: NEO-11814

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-01-20 08:49:59 +01:00
Raiyan Latif 12e3204cab fix: Process OpaqueWin32 external semaphore properly
Related-To: NEO-11488

Signed-off-by: Raiyan Latif <raiyan.latif@intel.com>
2025-01-20 00:22:14 +01:00
Compute-Runtime-Validation 71aee41ca3 Revert "feature: enable WMTP on BMG"
This reverts commit d4256475fb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-19 11:57:40 +01:00
Jack Myers d62122a656 fix: exceptions to TBX faultable types
This commit addresses a bug in the previous implementation where almost all once
writable types, except `gpuTimestampBuffers`, were incorrectly enabled for TBX
faultable checks. The fix ensures that only the subset of once writable
types that are also lockable are considered TBX faultable, using the lockable
check to avoid manual exceptions and re-inventing the wheel.

Changes:

- Updated `isAllocTbxFaultable` method to check if the allocation type is
lockable in addition to being once writable.
- Refactored unit tests to include separate checks for lockable and non-lockable
allocation types.

Performance optimization:

- Removed unnecessary memory data erasure in `handlePageFault` to avoid constant
erase/insert operations, leveraging the O(1) search time of unordered maps.

Related-To: NEO-12319
Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-01-17 00:52:49 +01:00
Naklicki, Mateusz 118c5afc47 test: unify scratch pages product helper tests
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2025-01-16 20:19:47 +01:00
Jaroslaw Warchulski 4dd1c80362 refactor: use enum class instead of enum for DeviceHierarchyMode
Related-To: GSD-9560
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-01-16 15:49:44 +01:00
Kamil Kopryk 7d8e08f00b test: adjust code to compile with c++20 2/n
Related-To: NEO-10767
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-01-16 15:24:25 +01:00
Mateusz Jablonski d4256475fb feature: enable WMTP on BMG
Related-To: NEO-12814
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-01-16 14:56:12 +01:00
Naklicki, Mateusz 218122c46b test: make sure scratch pages are disabled on pvc+
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2025-01-16 14:23:24 +01:00
Naklicki, Mateusz 3e29ca9057 fix: explicitly disable scratch pages on xekmd platforms
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2025-01-16 12:09:27 +01:00
Kamil Kopryk 0278d2e652 test: adjust code to compile with c++20
Related-To: NEO-10767
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-01-16 12:04:50 +01:00
Kamil Kopryk c5ba3dd575 test: remove not needed volatile
Related-To: NEO-10767
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-01-16 11:36:45 +01:00
Compute-Runtime-Validation 945c4a3ccc Revert "feature: Enable page faults for xe driver with debugger"
This reverts commit 4300f0bdbe.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-16 11:16:14 +01:00
Jack Myers 0b2ac4d331 feature: Tbx faults for all once writable types
Patch #34223 introduced the TbxPageFaultManager for handling
uploads/downloads of host buffers to the Tbx server, ensuring
host memory is kept consistent between the host and device,
even after multiple alternating writes from the host and gpu.

This patch enable fault handling for all `isAubOnceWritable`
types.

Minor exception for gpuTimestampBuffers as enabling this type
seems to break things in real-world use cases outside of ULTs.

Related-To: NEO-12319
Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-01-16 01:43:19 +01:00
Kamil Kopryk 99a7b5a4fb refactor: remove not needed volatile
Related-To: NEO-10767
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-01-16 00:43:10 +01:00
Krzysztof Sprzaczkowski a17745532c performance: Move preemption allocation init to the first submission
Related-To: NEO-12323
Signed-off-by: Krzysztof Sprzaczkowski <krzysztof.sprzaczkowski@intel.com>
2025-01-15 20:22:50 +01:00
Lukasz Jobczyk 2dd9940f60 Revert "fix: count active modules for enabling per-dispatch private memory"
This reverts commit a483b361f9.

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-01-15 15:03:37 +01:00
Szymon Morek 0b4fe9a0df performance: add staging transfers for cl buffers
Related-To: NEO-13529

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-15 12:24:43 +01:00
Vysochyn, Illia 4270e3c356 feature: Add dynamic stack management on Xe3
Related-To: NEO-13147

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-01-15 10:12:48 +01:00
Vysochyn, Illia ca72dff1ab feature: Add missing pipelined EU thread arbitration on Xe3
Related-To: NEO-13682

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-01-15 08:24:43 +01:00
Jaroslaw Warchulski 6a7bee67e9 refactor: unify device hierarchy mode
Related-To: GSD-9560
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-01-14 21:08:44 +01:00
Maciej Bielski e95c7aac92 fix: set memoryBanks correctly for single memory bank
Resolves: HSD-18040585222

Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-01-14 20:38:10 +01:00
Vysochyn, Illia 7903086db3 refactor: Update thread scheduling mode naming
Renames the EU_THREAD_SCHEDULING_MODE_OVERRIDE member of
STATE_COMPUTE_MODE structure to EU_THREAD_SCHEDULING_MODE.

This modification is introduced in order to align with the latest specification.

Related-To: NEO-13147

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-01-14 19:00:46 +01:00
Brandon Yates 4300f0bdbe feature: Enable page faults for xe driver with debugger
Related-to: NEO-10134

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-01-14 09:32:53 +01:00
Zbigniew Zdanowicz 575ae7f6f7 fix: get minimal offset size for region barrier
Related-To: NEO-13350

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-01-13 22:56:25 +01:00
Jaroslaw Warchulski a180afa2e6 fix: respect affinity mask with COMBINED hierarchy in OCL
Related-To: GSD-9560
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-01-13 15:38:39 +01:00
Compute-Runtime-Validation dc49d09e64 Revert "fix: set memoryBanks correctly for single memory bank"
This reverts commit ec5477e3ee.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-11 01:56:37 +01:00
Wenbin Lu a483b361f9 fix: count active modules for enabling per-dispatch private memory
Related-To: NEO-13086

Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2025-01-10 15:03:34 +01:00
Mateusz Jablonski 112abeeeef fix: don't adjust programmed per thread scratch size
when adjusting scratch space size then adjust only allocation size

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-01-10 11:35:50 +01:00
Maciej Bielski ec5477e3ee fix: set memoryBanks correctly for single memory bank
Resolves: HSD-18040585222

Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-01-10 10:01:33 +01:00
Mateusz Jablonski a3b6c1fa6d fix: correct thread/eu ratio for scratch to Xe2
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-01-09 22:42:36 +01:00
Szymon Morek f3c9362fc5 fix: check for gpu hang during wait for ring completion
Related-To: NEO-13490

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-09 18:44:25 +01:00
Compute-Runtime-Validation 9e5a52f689 Revert "fix: correct thread/eu ratio for scratch to Xe2"
This reverts commit b192893738.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-09 11:06:26 +01:00
Jack Myers 7f9fadc314 fix: regression caused by tbx fault mngr
Addresses regressions from the reverted merge
of the tbx fault manager for host memory.

Recursive locking of mutex caused deadlock.

To fix, separate tbx fault data from base
cpu fault data, allowing separate mutexes
for each, eliminating recursive locks on
the same mutex.

By separating, we also help ensure that tbx-related
changes don't affect the original cpu fault manager code
paths.

As an added safe guard preventing critical regressions
and avoiding another auto-revert, the tbx fault manager
is hidden behind a new debug flag which is disabled by default.

Related-To: NEO-12268
Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-01-09 07:48:53 +01:00
Compute-Runtime-Validation 22cebedcd1 Revert "refactor: Update thread scheduling mode naming"
This reverts commit d50beb5df4.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-09 05:03:36 +01:00
Vysochyn, Illia d50beb5df4 refactor: Update thread scheduling mode naming
Renames the EU_THREAD_SCHEDULING_MODE_OVERRIDE member of
STATE_COMPUTE_MODE structure to EU_THREAD_SCHEDULING_MODE.

This modification is introduced in order to align with the latest specification.

Related-To: NEO-13147

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-01-08 16:52:45 +01:00
Vysochyn, Illia a2aa645404 refactor: Update MI_MEM_FENCE
Refactors the MI_MEM_FENCE to align with the latest specification.

Related-To: NEO-13147

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-01-08 16:46:58 +01:00
Mateusz Jablonski b192893738 fix: correct thread/eu ratio for scratch to Xe2
Xe2 platforms don't need overallocation there

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-01-08 15:34:50 +01:00
Szymon Morek 2815d4167d fix: don't use staging for map allocations
Related-To: NEO-13572

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-07 13:30:11 +01:00