Compute-Runtime-Validation
8839d62c79
Revert "performance: use RESOURCE_BARRIER as stalling barrier"
...
This reverts commit 2a63853349.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-05-16 20:19:06 +02:00
Szymon Morek
2a63853349
performance: use RESOURCE_BARRIER as stalling barrier
...
Related-To: NEO-14943
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-16 14:12:18 +02:00
Szymon Morek
bff8f74de7
refactor: remove unused parameter
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-15 12:23:27 +02:00
Compute-Runtime-Validation
4ab91eab1e
Revert "refactor: move empty sba programming in heapless prolog to base class"
...
This reverts commit 809feca620.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-05-12 14:00:19 +02:00
Kamil Kopryk
809feca620
refactor: move empty sba programming in heapless prolog to base class
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-05-09 16:46:02 +02:00
Lukasz Jobczyk
5cd5bbafc5
refactor: Split waitpkg params for ulls light and default
...
Related-To: NEO-14866
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-09 12:40:36 +02:00
Lukasz Jobczyk
1d1414febc
refactor: remove unused dc flush mitigation
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-09 08:26:26 +02:00
Brandon Yates
a48d66ad75
feature: Add programExceptions stub to CSR
...
Related-to: NEO-12967
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-04-01 18:33:40 +02:00
Szymon Morek
62964a0b08
fix: invalidate caches when heap is placed into reuse list
...
Related-To: NEO-9004
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-31 12:30:29 +02:00
Lukasz Jobczyk
60b551758c
performance: Adjust waitpkg threshold for discrete devices
...
Related-To: NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-26 14:59:19 +01:00
Kamil Kopryk
73795ced64
refactor: add setupTimestampPacketFlushL3 function
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-24 13:10:03 +01:00
Lukasz Jobczyk
54cb0e24f8
performance: Switch waitpkg use to tpause for ULLS light
...
Related-To: NEO-13922, NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-21 18:09:37 +01:00
Lukasz Jobczyk
8a85a96ed2
feature: Add 3-level wait scheme with tpause intrinsic
...
Related-To: NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-21 12:12:57 +01:00
Compute-Runtime-Validation
5f7f0dd785
Revert "performance: Enable waitpkg"
...
This reverts commit 8ec5434460.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-18 13:28:29 +01:00
Maciej Plewka
36fa6d66ae
fix: lock csr in stopDirectSubmission if needed
...
Related-To: NEO-13875, NEO-14143, HSD-16026538384, HSD-16026780358
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-18 09:30:38 +01:00
Zbigniew Zdanowicz
ddc0b0d03b
feature: disable flat ring buffer for command list append operation
...
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-13 06:06:55 +01:00
Lukasz Jobczyk
8ec5434460
performance: Enable waitpkg
...
Resolves: NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-11 14:43:01 +01:00
Zbigniew Zdanowicz
cd904269ed
fix: request for task count should enable monitor fence dispatch
...
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-10 18:25:35 +01:00
Lukasz Jobczyk
53062056ec
performance: Enable wait pkg for ULLS light
...
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-07 12:18:38 +01:00
Zbigniew Zdanowicz
ae1eb076b7
feature: add optional epilogue to flush task method
...
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-05 15:15:59 +01:00
Zbigniew Zdanowicz
27d7d72033
feature: add pipeline state management for append command list operation
...
- command list append state is managed from internal queue and can be skipped
- initial state configuration should be processed by both kernel and non-kernel
- only kernel operation can process required state, as non-kernel cannot change
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-04 20:43:22 +01:00
Zbigniew Zdanowicz
db99c25c79
feature: add support to dispatch epilogue commands into dedicated stream
...
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-03 14:39:34 +01:00
Zbigniew Zdanowicz
08b13750a1
fix: set stall cmd flag for bcs flush task count flag
...
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-02-28 12:24:51 +01:00
Zbigniew Zdanowicz
cae3bb1d0a
feature: add internal interfaces to manage all dispatch models of command lists
...
- add new enum type for command list flush from immediate
- add new argument for flushing immediate command list - regular command list
- add capability to provide additional stream for epilogue commands
- add pointer to provide external csr mutex to lock both execution and flush
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-02-28 12:02:22 +01:00
Lukasz Jobczyk
be946ae56c
performance: Optimize make resident for ULLS light
...
Do not check if ULLS light is active during every Csr::makeResident
call. Store that information once during ULLS init.
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-24 12:59:56 +01:00
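The optimization described in be946ae56c (query the ULLS light state once at init, then reuse the stored result on every makeResident call) can be sketched roughly as below. This is only an illustration under stated assumptions: `DirectSubmissionStub`, `CsrSketch`, `initDirectSubmission`, `isUllsLight` and the counters are hypothetical names invented for the example, not the actual NEO classes or signatures.

```cpp
// Hypothetical stand-in for the direct submission (ULLS) object; not NEO code.
struct DirectSubmissionStub {
    int queries = 0;   // counts how often the state is queried
    bool light = true; // assumed: ULLS light is active

    bool isUllsLight() {
        ++queries;
        return light;
    }
};

// Hypothetical stand-in for a command stream receiver.
class CsrSketch {
  public:
    explicit CsrSketch(DirectSubmissionStub &ds) : ds(ds) {}

    // Called once when ULLS is initialized: store the answer.
    void initDirectSubmission() { ullsLightActive = ds.isUllsLight(); }

    // Hot path: reads the cached flag instead of querying every call.
    void makeResident() {
        if (ullsLightActive) {
            ++lightPathCalls; // placeholder for ULLS-light-specific handling
        }
    }

    int lightPathCalls = 0;

  private:
    DirectSubmissionStub &ds;
    bool ullsLightActive = false;
};
```

With this shape, the cost of the query is paid once at init regardless of how many times makeResident runs afterwards.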
Szymon Morek
254e7c5c6a
fix: set notify enable flag when flushing monitor fence
...
Related-To: NEO-13848
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-24 22:19:24 +01:00
Bartosz Dunajski
c2dbdb6797
refactor: move blit post sync data to BlitProperties
...
Related-To: NEO-13003
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-01-22 18:35:21 +01:00
Krzysztof Sprzaczkowski
a17745532c
performance: Move preemption allocation init to the first submission
...
Related-To: NEO-12323
Signed-off-by: Krzysztof Sprzaczkowski <krzysztof.sprzaczkowski@intel.com>
2025-01-15 20:22:50 +01:00
Damian Tomczak
9a149b6da5
refactor: useGlobalHeaps naming unification
...
Related-to: NEO-12737
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-01-14 11:01:07 +01:00
Bartosz Dunajski
e8cfb38db4
performance: improve relaxed ordering task count tracking
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-18 11:36:22 +01:00
Filip Hazubski
a0cc124b2e
performance: Pass RootDeviceIndicesContainer by reference
...
Additionally pass std::map by reference in UsmMemAllocPoolsManager c-tor.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-12-17 14:18:30 +01:00
Dunajski, Bartosz
37e81d2a11
feature: new heuristic to enable relaxed ordering 2
...
Related-To: NEO-13431
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-12-09 11:58:42 +01:00
Compute-Runtime-Validation
af8ad3aa7a
Revert "feature: new heuristic to enable relaxed ordering"
...
This reverts commit 526f9c5e81.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-08 16:01:32 +01:00
Bartosz Dunajski
526f9c5e81
feature: new heuristic to enable relaxed ordering
...
Related-To: GSD-10308
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-06 17:04:39 +01:00
Mateusz Jablonski
fa58073095
refactor: remove not used usings/typedefs/variables
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-28 16:19:39 +01:00
Zbigniew Zdanowicz
6453a5ec31
fix: correct sequence of estimates to get correct size for start command
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-27 09:38:39 +01:00
Wojciech Konior
6b40f9bc5a
refactor: engineInstancedType removed
...
Related-To: NEO-12594
Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2024-10-09 16:30:48 +02:00
Mateusz Hoppe
4a068c8eab
fix: correctly program StateBaseAddress in global bindless mode
...
- prepare bindful ssh when kernel requires ssh heap and
SurfaceStateBaseAddress
- remove lastAppendedKernelBindlesMode - local ssh heap may be needed
for bindless kernels with scratch or misaligned buffer args
- use ssh heap gpu address to program SurfaceStateBaseAddress, global base is
used for BindlessSurfaceState and DynamicState
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-20 11:57:05 +02:00
Kamil Kopryk
ec5beaf616
refactor: reduce csr class size
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-09-18 13:33:55 +02:00
Bartosz Dunajski
d3d8b5fcc1
fix: inherit work partition allocation from primary root csr
...
Related-To: NEO-8171
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-16 18:45:16 +02:00
Mateusz Jablonski
78604bd475
refactor: remove not needed code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-16 12:12:43 +02:00
Andrzej Koska
b0e7a11e9a
refactor: Improving information transfer about the copy engine
...
Related-To: NEO-11934
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-09-05 16:11:52 +02:00
Aravind Gopalakrishnan
cb8063f71d
feature: Append recorded command list into immediate (3/N)
...
- Use correct stream for dispatch
- Add capability to append signal event
- Check available space globally in immediate append call
Related-To: NEO-10356
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-16 17:40:28 +02:00
Compute-Runtime-Validation
9b652f4a34
Revert "feature: Improving information transfer about the copy engine"
...
This reverts commit 17ffdff4f1.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-15 22:06:31 +02:00
Andrzej Koska
17ffdff4f1
feature: Improving information transfer about the copy engine
...
Related-To: NEO-11934
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-08-14 11:28:29 +02:00
Bartosz Dunajski
d76ac1d1de
fix: scratch controller residency
...
Related-To: HSD-18039519400
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-09 14:47:56 +02:00
Szymon Morek
d7d6996464
performance: initialize timeout params once
...
Currently this is done per each enqueue
which is not really needed
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-08-07 14:35:12 +02:00
Szymon Morek
d4c1631ac7
performance: don't wait for paging fence on user thread
...
Related-To: NEO-12197
Currently, for new resources, the user thread must wait before submitting
the actual workload. With this commit, instead of waiting on the user thread,
a request is sent to the background ULLS controller thread and an additional
semaphore is programmed. The ULLS controller performs the actual wait
and signals the semaphore when the paging fence reaches the required value.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-08-07 08:30:51 +02:00
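The hand-off described in d4c1631ac7 can be pictured with the minimal sketch below. It is not the driver's implementation: `PagingFenceSketch`, `requestWait` and `controllerStep` are hypothetical names, the paging fence is modeled as a plain atomic counter, and the semaphore (which the commit message says is programmed alongside the workload) is modeled as an atomic flag.

```cpp
#include <atomic>
#include <cstdint>

// Hypothetical model of the paging fence hand-off; not NEO code.
struct PagingFenceSketch {
    std::atomic<uint64_t> currentFence{0};   // assumed to be advanced externally
    std::atomic<uint64_t> requestedFence{0}; // 0 means "no pending request"
    std::atomic<bool> semaphore{false};      // stands in for the programmed semaphore

    // User thread: record the required fence value and return immediately,
    // so the workload can be submitted without blocking here.
    void requestWait(uint64_t value) {
        semaphore.store(false);
        requestedFence.store(value);
    }

    // One iteration of the background controller loop: if a request is
    // pending and the fence has reached it, signal the semaphore.
    // Returns true when the semaphore was signaled in this step.
    bool controllerStep() {
        const uint64_t required = requestedFence.load();
        if (required != 0 && currentFence.load() >= required) {
            semaphore.store(true);
            requestedFence.store(0);
            return true;
        }
        return false;
    }
};
```

In a real setup `controllerStep` would run in a loop on a dedicated background thread; it is exposed as a single step here so the behavior can be checked deterministically.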
Bartosz Dunajski
24cfd203ab
fix: don't download tbx allocations on heapless first device submission
...
Related-To: HSD-18039476929
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-06 14:03:42 +02:00
Fabian Zwoliński
674c4a15ad
fix: use correct gpu address when bindless heaps helper is enabled
...
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-05 15:09:57 +02:00