Commit Graph

7556 Commits

Author SHA1 Message Date
Mateusz Jablonski
f3e4ba9ff6 refactor: remove not needed code
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-07 13:44:30 +02:00
Dominik Dabek
17199556ca performance: lazy start usm reuse cleaner thread
Delay starting usm reuse cleaner thread to first allocation reuse.

Related-To: HSD-16028754950

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-10-07 13:29:10 +02:00
Kamil Kopryk
0eeb1899d4 refactor: correct debug flag description
Specify debug flag size unit

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-10-07 11:59:13 +02:00
Lukasz Jobczyk
db5924221d refactor: remove unused enum value
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-10-07 11:54:21 +02:00
Zbigniew Zdanowicz
24db09e3e9 fix: make pipeline thread arbitration one-time switch property
- detect switch happens by making "is dirty" property
- make sure to call the property setter
- setting to always true will make "is dirty" true only once

Related-To: NEO-16330

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-10-07 11:27:22 +02:00
Damian Tomczak
bb742a88a4 feature: fail stateful kernel when stateless is required
Resolves: NEO-11859

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-10-07 10:49:26 +02:00
Mateusz Hoppe
f411dc2e65 fix: debug key for using temp memory when writing to aub
Related-To: NEO-16276

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-10-07 09:43:02 +02:00
Bartosz Dunajski
d1a2ec4590 fix: add missing lock in bcs split path 2
Related-To: NEO-14557

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-10-07 08:00:18 +02:00
Compute-Runtime-Validation
bafb847c73 Revert "fix: use condition variables instead of busy waits in worker threads"
This reverts commit 9d1da44e08.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-10-06 20:38:46 +02:00
Szymon Morek
4df9783448 performance: use 2MB heaps for 2MB-aligned allocations
Related-To: NEO-16165

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-10-06 18:11:01 +02:00
Kamil Kopryk
d415c3eb11 refactor: mark noexcept move operations in nonCopyableClass
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-10-06 16:23:08 +02:00
Radoslaw Jablonski
54de14a9dc refactor: apply clang-format's InsertBraces rule
Formats code before InsertBraces rule is enabled.

Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com>
2025-10-06 15:32:46 +02:00
Igor Venevtsev
9d1da44e08 fix: use condition variables instead of busy waits in worker threads
Resolves: NEO-16085, GSD-11678, HSD-14025819208

Signed-off-by: Igor Venevtsev <igor.venevtsev@intel.com>
2025-10-06 13:27:34 +02:00
Bartosz Dunajski
42371ee7bd fix: decanonize prefetch gpu va
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-10-06 12:54:42 +02:00
Compute-Runtime-Validation
d3ac2f871b Revert "fix: add missing lock in bcs split path"
This reverts commit 8ea8e78471.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-10-06 11:38:28 +02:00
Dominik Dabek
8575573dbd fix: l0, bindless image from pooled alloc
Make sure pool offset is applied when growing pools are used.

Related-To: NEO-16317, GSD-11804

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-10-06 10:21:05 +02:00
Kamil Kopryk
f17ac35108 refactor: improve compilation time
Limit bitset includes,
stl bitset is slow to compile

Related-To: NEO-15390

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-10-03 21:21:25 +02:00
Bartosz Dunajski
8ea8e78471 fix: add missing lock in bcs split path
Related-To: NEO-14557

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-10-03 21:06:28 +02:00
Mateusz Jablonski
6c0c229c82 refactor: remove not needed flags to force OCL version support
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-03 16:57:08 +02:00
Katarzyna Cencelewska
0f866e24e6 fix: enable setting timestamp via submission on tbx mode
Resolves: NEO-16293
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2025-10-03 08:43:18 +02:00
Mateusz Jablonski
69eeaee0eb refactor: remove not needed function - isDotIntegerProductExtensionSupported
the function always returns true

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-02 16:20:59 +02:00
Mateusz Jablonski
e9af7aee15 refactor: remove always true variable - supportsOcl21Features
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-02 16:03:37 +02:00
Szymon Morek
ee032982a6 refactor: print faulted address when waiting for tag
getDeviceState on Windows prints faulted GPU VA
if OOB access happened.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-10-02 14:59:02 +02:00
Slawomir Milczarek
98b6259129 fix: Switch compute-engine idle check to try-lock
Related-To: NEO-13225

Replace blocking obtainUniqueOwnership with non-blocking
tryObtainUniqueOwnership in isDirectSubmissionIdle.
Treat compute engine as not idle when lock is contended
(conservative; prevents false stops).
Avoid deadlock scenarios caused by holding
directSubmissionsMutex while waiting on a CSR lock.

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-10-02 14:41:24 +02:00
Mateusz Jablonski
710d70a787 refactor: remove not needed code
Related-To: NEO-16154

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-02 11:41:08 +02:00
Mateusz Jablonski
165305090b refactor: remove always true variable - ftrSupportsInteger64BitAtomics
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-01 16:45:16 +02:00
Jack Myers
7b18a66710 fix: ensure proper heap alignment
Noticed an issue on machine with following `sycl-ls`:

When running `sycl-ls` with a recent debug build of NEO, it exposed a
heap alignment problem:

```shell
Assert was called at 324 line in file:
../../src/neo/shared/source/memory_manager/gfx_partition.cpp
```

Related-To: NEO-16287
Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-10-01 14:46:52 +02:00
Jack Myers
518e044cc4 fix: correctly report pvc ondemand paging capabilities
Related-To: NEO-15938
Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-10-01 14:46:05 +02:00
Mateusz Jablonski
5b8e8b8b1a refactor: unify GmmMemory definition
Related-To: NEO-11080
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-01 14:23:35 +02:00
Mateusz Jablonski
0caa8d5297 refactor: remove always true variable - supportsFloatAtomics
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-01 14:06:44 +02:00
Kamil Kopryk
2192e0b90e refactor: improve compilation time
Limit chrono includes,
chrono is very slow to compile

Related-To: NEO-15390
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-10-01 13:50:29 +02:00
Slawomir Milczarek
393c4d0985 Revert "fix: Fix DirectSubmissionController deadlock with try_lock pattern"
Related-To: NEO-13325

This reverts commit 40ce5eb55e.

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-10-01 11:23:16 +02:00
Filip Hazubski
728bdc2178 fix: Correct dummy blit pitch size for xe lpg and xe hpg
Resolves: HSD-18043456303

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-10-01 07:41:43 +02:00
Compute-Runtime-Validation
eed28d6552 Revert "fix: remove spir support from xe2 and later"
This reverts commit a744ce6fbb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-10-01 04:41:59 +02:00
Compute-Runtime-Validation
a0d55768a0 Revert "feature: fail stateful kernel when stateless is required"
This reverts commit 2a8c0d867f.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-09-30 18:54:40 +02:00
Grochowski, Stanislaw
a744ce6fbb fix: remove spir support from xe2 and later
Related-To: NEO-15215

Signed-off-by: Grochowski, Stanislaw <stanislaw.grochowski@intel.com>
2025-09-30 18:23:24 +02:00
Radoslaw Jablonski
1f3ca4e2fa fix: use move instead of copy in DeviceCapsReaderDrm constructor
Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com>
2025-09-30 18:06:26 +02:00
Marcel Skierkowski
047f6851f7 refactor: correct programming of SLM size
Unify reading hw avaialble slm size:
Pre-Xe2 SLMSizeInKb stores total SLM size across all DSS
Xe2+ SLMSizeInKb stores SLM size per SS

apply restrictions for preferred/programmable SLM size

Related-To: NEO-12949
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-09-30 17:57:18 +02:00
Mateusz Jablonski
3c13fb81f7 refactor: remove not needed logic
Related-To: NEO-11080
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-09-30 14:50:49 +02:00
Filip Hazubski
b73e85de0d refactor: Refactor direct submission logic
Remove ReleaseHelper::isDirectSubmissionSupported function.
Update ProductHelper::isDirectSubmissionSupported to return correct values.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-09-30 13:41:03 +02:00
Damian Tomczak
2a8c0d867f feature: fail stateful kernel when stateless is required
Resolves: NEO-11859

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-09-30 12:48:48 +02:00
Mateusz Hoppe
d4ef7dae26 fix: set IGC as default generator prior decoding zebin
Related-To: NEO-16231, NEO-16167

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-09-30 08:22:33 +02:00
Slawomir Milczarek
7334aee8a8 fix: Switch copy-engine idle check to try-lock
Related-To: NEO-13325

Replace blocking obtainUniqueOwnership with non-blocking
tryObtainUniqueOwnership in isCopyEngineOnDeviceIdle.
Treat copy engine as not idle when lock is contended
(conservative; prevents false stops).
Avoid deadlock scenarios caused by holding
directSubmissionsMutex while waiting on a CSR lock.

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-09-29 17:28:32 +02:00
Mateusz Jablonski
15d0feeda8 fix: improve propagating external functions info to kernel
When relocation points to symbol that is not defined within module mark
it as optional. When symbol is available at dynamic linking time then
info from the function is retrieved but when the symbol is not available
then ignore the dependency.

Any unresolved symbol needed for module linking is already handled
in a separate place.

Related-To: NEO-16243, NEO-16263, NEO-16262, NEO-16268
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-09-29 15:34:41 +02:00
Maciej Bielski
a440a3e8ea refactor: use Clonable type for pImplicitArgs
Clonable is a helper type which eases proper management of
dynamically-allocated resources. It allows to reduce the amount of
manual handling.

Related-To: NEO-16017
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-09-29 15:34:24 +02:00
Mateusz Hoppe
4b5f206058 feature: add ForceCompatibilityMode debug flag
- allows using device binary without validating device against target
device

Related-To: HSD-13013893426

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-09-29 15:22:06 +02:00
Filip Hazubski
07abb404e0 refactor: Refactor release_helper_common_xe_lpg.inl
Apply common file to release 12.74.

Move isMatrixMultiplyAccumulateSupported
specialization outside of the common file.

Move common specializations from releases: 12.70, 12.71, 12.74
to release_helper_common_xe_lpg.inl file.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-09-29 14:44:08 +02:00
Slawomir Milczarek
40ce5eb55e fix: Fix DirectSubmissionController deadlock with try_lock pattern
Related-To: NEO-13325

Replace blocking obtainUniqueOwnership() with tryObtainUniqueOwnership()
in critical paths to prevent deadlock between controller thread holding
directSubmissionsMutex and submission threads holding CSR locks.

Changes:
- Add CommandStreamReceiver::tryObtainUniqueOwnership() method
- Use try_lock in checkNewSubmissions() and context group idle detection
- Skip contended CSRs instead of blocking (conservative approach)
- Add comprehensive unit tests for try_lock functionality

Fixes deadlock scenario where registerDirectSubmission() waits for
directSubmissionsMutex while checkNewSubmissions() holds it and waits
for CSR ownership locks.

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-09-29 11:38:26 +02:00
Mateusz Jablonski
39f10f9c5e refactor: stop using DEFAULT_TEST_FAMILY_NAME
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-09-29 10:15:13 +02:00
Compute-Runtime-Validation
aaa9342141 Revert "fix: improve parsing relocations in external usage"
This reverts commit 65d883bc9d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-09-29 10:05:58 +02:00