7540 Commits

Author SHA1 Message Date
Kamil Kopryk
f17ac35108 refactor: improve compilation time
Limit bitset includes,
stl bitset is slow to compile

Related-To: NEO-15390

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-10-03 21:21:25 +02:00
Bartosz Dunajski
8ea8e78471 fix: add missing lock in bcs split path
Related-To: NEO-14557

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-10-03 21:06:28 +02:00
Mateusz Jablonski
6c0c229c82 refactor: remove not needed flags to force OCL version support
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-03 16:57:08 +02:00
Katarzyna Cencelewska
0f866e24e6 fix: enable setting timestamp via submission on tbx mode
Resolves: NEO-16293
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2025-10-03 08:43:18 +02:00
Mateusz Jablonski
69eeaee0eb refactor: remove not needed function - isDotIntegerProductExtensionSupported
the function always returns true

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-02 16:20:59 +02:00
Mateusz Jablonski
e9af7aee15 refactor: remove always true variable - supportsOcl21Features
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-02 16:03:37 +02:00
Szymon Morek
ee032982a6 refactor: print faulted address when waiting for tag
getDeviceState on Windows prints faulted GPU VA
if OOB access happened.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-10-02 14:59:02 +02:00
Slawomir Milczarek
98b6259129 fix: Switch compute-engine idle check to try-lock
Related-To: NEO-13225

Replace blocking obtainUniqueOwnership with non-blocking
tryObtainUniqueOwnership in isDirectSubmissionIdle.
Treat compute engine as not idle when lock is contended
(conservative; prevents false stops).
Avoid deadlock scenarios caused by holding
directSubmissionsMutex while waiting on a CSR lock.

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-10-02 14:41:24 +02:00
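The try-lock idle check described in 98b6259129 (and in 7334aee8a8 for the copy engine) might look roughly like the sketch below. It is only an illustration under stated assumptions: `FakeCsr`, `isEngineIdle` and `idleSeenWhileContended` are hypothetical names, and the real `CommandStreamReceiver` interface in NEO may differ. The point shown is that a contended lock is reported as "not idle" instead of blocking the caller.

```cpp
#include <cassert>
#include <mutex>
#include <thread>

// Hypothetical stand-in for a command stream receiver (CSR); not NEO's class.
struct FakeCsr {
    std::mutex ownershipMutex;
    bool busy = false;

    // Non-blocking counterpart of a blocking "obtain ownership" call.
    // The returned lock owns the mutex only if it was free at the time.
    std::unique_lock<std::mutex> tryObtainUniqueOwnership() {
        return std::unique_lock<std::mutex>(ownershipMutex, std::try_to_lock);
    }
};

// Conservative idle check: a contended lock is treated as "not idle".
bool isEngineIdle(FakeCsr &csr) {
    auto lock = csr.tryObtainUniqueOwnership();
    if (!lock.owns_lock()) {
        return false; // another thread is using the CSR; do not wait for it
    }
    return !csr.busy;
}

// Usage sketch: one thread holds the CSR lock while another checks idleness.
bool idleSeenWhileContended() {
    FakeCsr csr;
    std::unique_lock<std::mutex> held(csr.ownershipMutex); // acts as the submitting thread
    bool idle = true;
    std::thread controller([&] { idle = isEngineIdle(csr); });
    controller.join();
    return idle;
}
```

Because the checking thread never waits on the CSR lock, it cannot end up holding one mutex while blocked on another, which is the deadlock shape the commit message describes.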
Mateusz Jablonski
710d70a787 refactor: remove not needed code
Related-To: NEO-16154

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-02 11:41:08 +02:00
Mateusz Jablonski
165305090b refactor: remove always true variable - ftrSupportsInteger64BitAtomics
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-01 16:45:16 +02:00
Jack Myers
7b18a66710 fix: ensure proper heap alignment
Running `sycl-ls` with a recent debug build of NEO exposed a
heap alignment problem:

```shell
Assert was called at 324 line in file:
../../src/neo/shared/source/memory_manager/gfx_partition.cpp
```

Related-To: NEO-16287
Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-10-01 14:46:52 +02:00
Jack Myers
518e044cc4 fix: correctly report pvc ondemand paging capabilities
Related-To: NEO-15938
Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-10-01 14:46:05 +02:00
Mateusz Jablonski
5b8e8b8b1a refactor: unify GmmMemory definition
Related-To: NEO-11080
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-01 14:23:35 +02:00
Mateusz Jablonski
0caa8d5297 refactor: remove always true variable - supportsFloatAtomics
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-01 14:06:44 +02:00
Kamil Kopryk
2192e0b90e refactor: improve compilation time
Limit chrono includes,
chrono is very slow to compile

Related-To: NEO-15390
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-10-01 13:50:29 +02:00
Slawomir Milczarek
393c4d0985 Revert "fix: Fix DirectSubmissionController deadlock with try_lock pattern"
Related-To: NEO-13325

This reverts commit 40ce5eb55e.

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-10-01 11:23:16 +02:00
Filip Hazubski
728bdc2178 fix: Correct dummy blit pitch size for xe lpg and xe hpg
Resolves: HSD-18043456303

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-10-01 07:41:43 +02:00
Compute-Runtime-Validation
eed28d6552 Revert "fix: remove spir support from xe2 and later"
This reverts commit a744ce6fbb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-10-01 04:41:59 +02:00
Compute-Runtime-Validation
a0d55768a0 Revert "feature: fail stateful kernel when stateless is required"
This reverts commit 2a8c0d867f.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-09-30 18:54:40 +02:00
Grochowski, Stanislaw
a744ce6fbb fix: remove spir support from xe2 and later
Related-To: NEO-15215

Signed-off-by: Grochowski, Stanislaw <stanislaw.grochowski@intel.com>
2025-09-30 18:23:24 +02:00
Radoslaw Jablonski
1f3ca4e2fa fix: use move instead of copy in DeviceCapsReaderDrm constructor
Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com>
2025-09-30 18:06:26 +02:00
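As a generic illustration of the copy-to-move change named in 1f3ca4e2fa, a constructor can take its argument by value and move it into the member. The class below is hypothetical (`CapsReaderSketch` and its members are assumptions); the actual `DeviceCapsReaderDrm` signature is not shown in this log.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical reader class; only the constructor pattern matters here.
class CapsReaderSketch {
  public:
    // Take the buffer by value, then move it into the member.
    // A caller that passes an rvalue pays for no element copy at all.
    explicit CapsReaderSketch(std::vector<uint32_t> capsIn) : caps(std::move(capsIn)) {}

    std::size_t size() const { return caps.size(); }

  private:
    std::vector<uint32_t> caps;
};
```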
Marcel Skierkowski
047f6851f7 refactor: correct programming of SLM size
Unify reading hw available slm size:
Pre-Xe2 SLMSizeInKb stores total SLM size across all DSS
Xe2+ SLMSizeInKb stores SLM size per SS

apply restrictions for preferred/programmable SLM size

Related-To: NEO-12949
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-09-30 17:57:18 +02:00
Mateusz Jablonski
3c13fb81f7 refactor: remove not needed logic
Related-To: NEO-11080
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-09-30 14:50:49 +02:00
Filip Hazubski
b73e85de0d refactor: Refactor direct submission logic
Remove ReleaseHelper::isDirectSubmissionSupported function.
Update ProductHelper::isDirectSubmissionSupported to return correct values.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-09-30 13:41:03 +02:00
Damian Tomczak
2a8c0d867f feature: fail stateful kernel when stateless is required
Resolves: NEO-11859

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-09-30 12:48:48 +02:00
Mateusz Hoppe
d4ef7dae26 fix: set IGC as default generator prior decoding zebin
Related-To: NEO-16231, NEO-16167

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-09-30 08:22:33 +02:00
Slawomir Milczarek
7334aee8a8 fix: Switch copy-engine idle check to try-lock
Related-To: NEO-13325

Replace blocking obtainUniqueOwnership with non-blocking
tryObtainUniqueOwnership in isCopyEngineOnDeviceIdle.
Treat copy engine as not idle when lock is contended
(conservative; prevents false stops).
Avoid deadlock scenarios caused by holding
directSubmissionsMutex while waiting on a CSR lock.

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-09-29 17:28:32 +02:00
Mateusz Jablonski
15d0feeda8 fix: improve propagating external functions info to kernel
When a relocation points to a symbol that is not defined within the
module, mark it as optional. If the symbol is available at dynamic
linking time, info from the function is retrieved; if the symbol is not
available, the dependency is ignored.

Any unresolved symbol needed for module linking is already handled
in a separate place.

Related-To: NEO-16243, NEO-16263, NEO-16262, NEO-16268
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-09-29 15:34:41 +02:00
Maciej Bielski
a440a3e8ea refactor: use Clonable type for pImplicitArgs
Clonable is a helper type which eases proper management of
dynamically-allocated resources. It reduces the amount of
manual handling.

Related-To: NEO-16017
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-09-29 15:34:24 +02:00
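One plausible shape for a helper like the one described in a440a3e8ea is an owning wrapper that deep-copies its resource on copy, so the enclosing class needs no hand-written copy logic. The sketch below is an assumption, not NEO's `Clonable`: the name `ClonableSketch` and every member are hypothetical.

```cpp
#include <cassert>
#include <memory>
#include <utility>

// Hypothetical owning wrapper: copying it clones the pointee.
template <typename T>
class ClonableSketch {
  public:
    ClonableSketch() = default;
    explicit ClonableSketch(T value) : ptr(std::make_unique<T>(std::move(value))) {}

    // Copy operations allocate a fresh copy of the pointee (deep copy).
    ClonableSketch(const ClonableSketch &other) : ptr(cloneOf(other.ptr)) {}
    ClonableSketch &operator=(const ClonableSketch &other) {
        if (this != &other) {
            ptr = cloneOf(other.ptr);
        }
        return *this;
    }

    // Move operations just transfer ownership.
    ClonableSketch(ClonableSketch &&) noexcept = default;
    ClonableSketch &operator=(ClonableSketch &&) noexcept = default;

    T *get() const { return ptr.get(); }
    explicit operator bool() const { return ptr != nullptr; }

  private:
    static std::unique_ptr<T> cloneOf(const std::unique_ptr<T> &src) {
        if (!src) {
            return nullptr;
        }
        return std::make_unique<T>(*src);
    }

    std::unique_ptr<T> ptr;
};
```

A class holding such a member can rely on compiler-generated copy and move operations, which is the kind of reduction in manual handling the commit message mentions.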
Mateusz Hoppe
4b5f206058 feature: add ForceCompatibilityMode debug flag
- allows using device binary without validating device against target
device

Related-To: HSD-13013893426

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-09-29 15:22:06 +02:00
Filip Hazubski
07abb404e0 refactor: Refactor release_helper_common_xe_lpg.inl
Apply common file to release 12.74.

Move isMatrixMultiplyAccumulateSupported
specialization outside of the common file.

Move common specializations from releases: 12.70, 12.71, 12.74
to release_helper_common_xe_lpg.inl file.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-09-29 14:44:08 +02:00
Slawomir Milczarek
40ce5eb55e fix: Fix DirectSubmissionController deadlock with try_lock pattern
Related-To: NEO-13325

Replace blocking obtainUniqueOwnership() with tryObtainUniqueOwnership()
in critical paths to prevent deadlock between controller thread holding
directSubmissionsMutex and submission threads holding CSR locks.

Changes:
- Add CommandStreamReceiver::tryObtainUniqueOwnership() method
- Use try_lock in checkNewSubmissions() and context group idle detection
- Skip contended CSRs instead of blocking (conservative approach)
- Add comprehensive unit tests for try_lock functionality

Fixes deadlock scenario where registerDirectSubmission() waits for
directSubmissionsMutex while checkNewSubmissions() holds it and waits
for CSR ownership locks.

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-09-29 11:38:26 +02:00
Mateusz Jablonski
39f10f9c5e refactor: stop using DEFAULT_TEST_FAMILY_NAME
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-09-29 10:15:13 +02:00
Compute-Runtime-Validation
aaa9342141 Revert "fix: improve parsing relocations in external usage"
This reverts commit 65d883bc9d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-09-29 10:05:58 +02:00
Mateusz Hoppe
98fcaf4ceb feature: support for context group on Windows
Related-To: NEO-14289


Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-09-28 13:38:53 +02:00
Mateusz Jablonski
65d883bc9d fix: improve parsing relocations in external usage
ignore relocations for symbols with undefined section

Related-To: NEO-16243
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-09-26 16:33:11 +02:00
Compute-Runtime-Validation
ce65d34a8f Revert "fix: set proper gmm usage type for misaligned user ptr"
This reverts commit a665f27676.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-09-26 16:24:45 +02:00
Bartosz Dunajski
4df478c513 fix: change default lnl rev id
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-09-25 16:44:04 +02:00
Szymon Morek
d429f0d8a0 fix: respect uncacheable flag for USM host allocations
Related-To: NEO-14360

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-09-25 16:31:19 +02:00
Bartosz Dunajski
72a5bcbfa1 feature: extend copy offload hint support
Related-To: NEO-14557

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-09-25 12:56:40 +02:00
Szymon Morek
a665f27676 fix: set proper gmm usage type for misaligned user ptr
Related-To: NEO-15295

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-09-25 12:06:43 +02:00
Mateusz Hoppe
cfb3dade30 refactor: replace isContextGroup with contextGroupCount in osContext
Related-To: NEO-14289

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-09-25 11:33:38 +02:00
Kamil Kopryk
8c7fb0dde8 fix: data race in host function data initialization
Make sure hostFunctionInitialized bool is set after actual operations
are done

Related-To: NEO-14577
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-09-25 11:29:01 +02:00
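The ordering fix described in 8c7fb0dde8 (set the initialized flag only after the actual work is done) can be sketched as below. This is a minimal illustration under assumptions: the struct, its members and the use of `std::atomic<bool>` with release/acquire ordering are hypothetical choices, and the commit itself only states that the `hostFunctionInitialized` bool is set last.

```cpp
#include <atomic>
#include <cassert>
#include <cstdint>
#include <mutex>

// Hypothetical holder for lazily initialized host function data.
struct HostFunctionDataSketch {
    std::mutex initMutex;
    std::atomic<bool> initialized{false};
    uint64_t payload = 0;

    void ensureInitialized() {
        if (initialized.load(std::memory_order_acquire)) {
            return; // fast path: already published
        }
        std::lock_guard<std::mutex> lock(initMutex);
        if (initialized.load(std::memory_order_relaxed)) {
            return; // another thread finished while we waited for the mutex
        }
        payload = 0xC0FFEE; // the actual operations come first
        initialized.store(true, std::memory_order_release); // the flag is set last
    }

    // A reader that observes the flag as set is guaranteed to see the payload.
    bool tryRead(uint64_t &out) const {
        if (!initialized.load(std::memory_order_acquire)) {
            return false;
        }
        out = payload;
        return true;
    }
};
```

If the flag were set before the payload write, a concurrent reader could pass the check and read uninitialized data, which is the race the commit subject refers to.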
Tomasz Biernacik
bc104a7b0d fix: move headers to definitions folder
Related-To: NEO-15440

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-09-25 07:35:55 +02:00
Compute-Runtime-Validation
79cdf69efe Revert "fix: disable support for deprecated patchtokens format"
This reverts commit 506d5837df.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-09-25 03:09:03 +02:00
Maciej Plewka
cf35f8b40e fix: wa for multi ccs execution on bmg
Related-To: NEO-15771
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-09-24 18:15:07 +02:00
Kamil Kopryk
4133827e6f fix: set host function allocation as uncached
The usage of host function allocation is similar to tagBuffer.
Ensure it's uncached and it follows tag buffer's cache policy.
Host function allocation will be readable and writable
by both: cpu and gpu.

Related-To: NEO-14577
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-09-24 18:00:50 +02:00
Szymon Morek
804b8f5db6 fix: do not cache tag buffer allocation on GPU
Related-To: NEO-14360

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-09-24 16:28:35 +02:00
Dominik Dabek
f54e3fda41 feature: adapt usm pool manager
Change behavior to continue allocating usm pools as needed.
Intended to replace singular usm pools.

Related-To: NEO-16084

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-09-24 15:16:10 +02:00
Mateusz Jablonski
506d5837df fix: disable support for deprecated patchtokens format
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-09-24 12:44:12 +02:00