Compute-Runtime-Validation
6736378c4d
Revert "fix: use condition variables instead of busy waits in worker threads"
...
This reverts commit 452475a0b9 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-09-23 17:06:18 +02:00
Igor Venevtsev
452475a0b9
fix: use condition variables instead of busy waits in worker threads
...
Resolves: NEO-16085, GSD-11678, HSD-14025819208
Signed-off-by: Igor Venevtsev <igor.venevtsev@intel.com >
2025-09-23 14:44:39 +02:00
Kamil Kopryk
8ab463f47c
refactor: allow designated initialization directSubmissionPropertiesPerEngine
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-09-19 11:08:34 +02:00
Compute-Runtime-Validation
ac225e204d
Revert "performance: set 1ms ulls timeout on linux"
...
This reverts commit 8ae49b17ae .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-09-19 06:59:27 +02:00
Szymon Morek
8ae49b17ae
performance: set 1ms ulls timeout on linux
...
Related-To: NEO-13843
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-09-18 11:48:12 +02:00
Szymon Morek
9647322cbf
performance: remove cache flush before stopping ULLS
...
Related-To: NEO-16153
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-09-17 13:22:13 +02:00
Slawomir Milczarek
b169105da0
feature: Don't terminate ULLS contexts if busy context exists (default)
...
Controlled with the knob DirectSubmissionControllerContextGroupIdleDetection
(note: the feature is enabled by default)
Related-To: NEO-13325
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com >
2025-09-16 13:42:32 +02:00
Lukasz Jobczyk
3f1e6a6092
refactor: Remove unused ulls functionalities
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-09-12 13:54:44 +02:00
Slawomir Milczarek
e80b84e699
feature: Don't terminate ULLS contexts if busy context exists
...
Enhanced direct submission idle detection to ensure that
ULLS contexts are not terminated if any context in the same group
is still busy or has pending work.
Idle detection now accurately considers the state of all CSRs
in a context group before terminating any direct submission.
Controlled with DirectSubmissionControllerContextGroupIdleDetection
(note: the feature is disabled by default in first step).
Related-To: NEO-13325
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com >
2025-09-12 09:32:59 +02:00
Compute-Runtime-Validation
7610d7c90a
Revert "refactor: Remove unused ulls functionalities"
...
This reverts commit 8ab4e1bcb8 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-09-12 02:54:13 +02:00
Lukasz Jobczyk
8ab4e1bcb8
refactor: Remove unused ulls functionalities
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-09-11 15:33:23 +02:00
Michal Mrozek
3f9c49f216
refactor: remove redundant code
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2025-09-03 13:43:37 +02:00
Michal Mrozek
3ee33d26d9
refactor: remove not needed code
...
- low priority never uses direct submission
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2025-09-02 16:52:53 +02:00
Michal Mrozek
d6bd0262e9
refactor: remove redundant code
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2025-08-21 07:29:00 +02:00
Szymon Morek
c771674ffb
performance: dont wait for BCS to check if CCS is busy
...
Related-To: NEO-15813
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-08-20 16:26:43 +02:00
Michal Mrozek
3b017d41a7
refactor: remove not needed debug variable
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2025-08-20 15:03:08 +02:00
Compute-Runtime-Validation
93b2e68707
Revert "performance: set 1ms ulls timeout on linux"
...
This reverts commit 0ea4b8cf9c .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-08-20 14:55:30 +02:00
Szymon Morek
0ea4b8cf9c
performance: set 1ms ulls timeout on linux
...
Related-To: NEO-13843
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-08-19 14:05:31 +02:00
Jakub Nowacki
c4ab767797
fix: add missing NonCopyableAndNonMovable asserts
...
Related-To: NEO-15630
Signed-off-by: Jakub Nowacki <jakub.nowacki@intel.com >
2025-08-13 13:13:05 +02:00
Szymon Morek
1d842c58bf
performance: don't terminate ULLS if BCS is busy
...
Related-To: NEO-15452
If CCS is idle, but BCS is busy, keep CCS ULLS
context running. BMG only.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-07-17 17:46:23 +02:00
Lukasz Jobczyk
a1ec6bc243
fix: Do not update ulls tag when ring not started
...
When ring is not started ulls tag update is not dispatched. Counter
should reflect that.
Resolves: GSD-11312
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-07-02 15:43:06 +02:00
Jaroslaw Warchulski
2375ff89bb
refactor: cleanup headers
...
Related-To: NEO-5548
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2025-06-23 11:23:49 +02:00
Szymon Morek
558752288f
performance: set 1ms ULLS timeout on Windows since Xe2
...
Related-To: NEO-13843
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-06-17 08:09:24 +02:00
Szymon Morek
6a7f6f1ad6
fix: add noexcept to move operators
...
Without it, copy operator is selected, even if move is explicitly defined.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-06-09 10:32:23 +02:00
Mateusz Jablonski
1b2db22238
refactor: simplify compare operator for RuntimeCapabilityTable
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-06-03 20:20:22 +02:00
Lukasz Jobczyk
332340b02b
refactor: Prework for release fence removal leaving acquire fence
...
Related-To: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-30 12:01:28 +02:00
Lukasz Jobczyk
19361031fe
refactor: Make SemaphoreFenceHelper compliant with rule of three
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-29 15:56:43 +02:00
Maciej Plewka
e6f3ebce5d
fix: add pc with stall before barrier with post sync on bmg
...
Related-To: NEO-14491
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2025-05-28 09:24:21 +02:00
Lukasz Jobczyk
86e7d5b276
performance: Skip fence after PC when no post sync write
...
Related-To: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-23 10:33:28 +02:00
Lukasz Jobczyk
680fa1ec1d
refactor: Split direct_submission_hw.inl
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-22 11:27:52 +02:00
Szymon Morek
bff8f74de7
refactor: remove unused parameter
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-05-15 12:23:27 +02:00
Lukasz Jobczyk
e964ccdf57
refactor: remove unused ulls diagnostic mode
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-09 10:01:51 +02:00
Krzysztof Sprzaczkowski
de72e91269
feature: Native GPU fence synchronization object implementation
...
Related-To: NEO-10811
Signed-off-by: Krzysztof Sprzaczkowski <krzysztof.sprzaczkowski@intel.com >
2025-05-07 11:00:34 +02:00
Lukasz Jobczyk
1f9cfb8916
fix: Add missing fences when unblock residency semaphore
...
Related-To: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-06 08:54:30 +02:00
Lukasz Jobczyk
6c2ddf1120
fix: program system fence address when global fence passed to ULLS
...
Related-To: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-04-30 12:57:04 +02:00
Lukasz Jobczyk
97358acabe
fix: use mfence instead of sfence on discrete devices
...
Related-To: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-04-25 13:32:25 +02:00
Szymon Morek
3596522637
refactor: remove unused logic in ULLS controller
...
Related-To: NEO-13843
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-04-17 18:35:20 +02:00
Lukasz Jobczyk
455209aadc
refactor: Simplify fence selection in ULLS
...
Related-To: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-04-15 14:04:47 +02:00
Jaroslaw Warchulski
3e1aa33924
refactor: cleanup headers
...
Related-To: NEO-5548
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2025-04-14 14:59:40 +02:00
Kamil Kopryk
dd3d294f87
performance: cache MOCS values
...
This change caches the most used MOCS values:
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CONST);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CACHELINE_MISALIGNED);
inside gmmHelper class during initialization to avoid repeated
calls of virtual functions, branches and/or gmm lib access.
and adds more readably corresponding getters:
* getL1EnabledMOCS
* getL3EnabledMOCS
* getUncachedMOCS
If force all resources uncached is called,
these 3 cached mocs values are reinitialized
It also changes the order of gmmHelper members, to avoid
not needed padding after addressWidth
and simplifies logic in getMocsIndex function
for xehp and later products.
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-04-14 14:12:48 +02:00
Maciej Plewka
286acf30a3
fix: Dispatch residency fence when stopping ulls on windows
...
Related-To: NEO-14396, HSD-18041496023, HSD-13012953666, NEO-14611
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2025-04-09 19:51:16 +02:00
Maciej Plewka
bb36614567
fix: add separate counter for tracking ring buffer completion
...
Related-To: NEO-14396, NEO-9548
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2025-04-08 16:30:44 +02:00
Lukasz Jobczyk
ca45573dd3
fix: Handle new ring buffer residency when switch ULLS light ring
...
Related-To: NEO-14406, NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-04-08 10:12:57 +02:00
Filip Hazubski
504440fc4d
feature: Add ftrHeaplessMode flag
...
Pass hwInfo to isHeaplessModeEnabled and isForceBindlessRequired functions.
Related-To: NEO-14526
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-04-02 21:06:05 +02:00
Lukasz Jobczyk
deca36fd32
fix: Stop ULLS light when evict resource
...
Related-To: NEO-14406
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-04-02 16:37:43 +02:00
Katarzyna Cencelewska
92e40afc49
feature: update debug flag DirectSubmissionPrintSemaphoreUsage
...
instead of printf use makro that make flush after printf
Related-To: HSD-14024170600
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2025-03-28 13:36:15 +01:00
Maciej Plewka
1b7bf690d2
fix: update fence values when residency list passed to switch ring buffer
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2025-03-21 12:26:08 +01:00
Lukasz Jobczyk
8a85a96ed2
feature: Add 3-level wait scheme with tpause intrinsic
...
Related-To: NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-03-21 12:12:57 +01:00
Bartosz Dunajski
6e17df4fb3
refactor: improve EncodeAluHelper
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-03-19 15:44:34 +01:00
Lukasz Jobczyk
6cb52f71b4
fix: Avoid mutex deadlock when switch ulls light ring buffer
...
Related-To: NEO-14406
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-03-19 11:47:40 +01:00