Zbigniew Zdanowicz
3759ab279f
feature: enable dispatch primary batch buffer from immediate
...
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-13 12:40:22 +01:00
Compute-Runtime-Validation
b2b3b55b19
Revert "feature: enable dispatch primary batch buffer from immediate"
...
This reverts commit fc770cacf2
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-11 03:46:02 +01:00
Zbigniew Zdanowicz
fc770cacf2
feature: enable dispatch primary batch buffer from immediate
...
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-10 17:33:34 +01:00
Zbigniew Zdanowicz
f8be8414ac
feature: add support for wait event preamble in append command list
...
- add mechanism in queue to trigger start command from queue to regular
- add detection in immediate command list need to dispatch extra start in queue
- fix secondary linear stream in immediate case as it should not use container
- modify tests for primary batch buffer dispatch as default mode
- remove invalid or obsolete tests
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-07 20:31:09 +01:00
Zbigniew Zdanowicz
566e52f72b
feature: add main lock for both execute and flush
...
- both execute and flush have their own locks
- spliting these locks makes race condition
- obtain lock at higher level
- do not lock in queue when calling execute from immediate
- pass main lock to flush so can be unlocked before synchronize
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-05 18:40:49 +01:00
Zbigniew Zdanowicz
f4f779f5b2
feature: save parent immediate cmdlist stream as starting stream
...
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-03 12:06:55 +01:00
Compute-Runtime-Validation
84eafda460
Revert "feature: add main lock for both execute and flush"
...
This reverts commit a8ea63d638
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-01 03:44:32 +01:00
Zbigniew Zdanowicz
a8ea63d638
feature: add main lock for both execute and flush
...
- both execute and flush have their locks, but split makes race condition
- save start stream processed by queue, so immediate can use it in flush
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-02-28 15:29:33 +01:00
Zbigniew Zdanowicz
b3e10a3d16
fix: correct estimation for command queue stream in heapless mode
...
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-02-28 11:46:00 +01:00
Compute-Runtime-Validation
9785e5c11e
Revert "feature: update base layout of L0 handles to match ze_handle_t layout"
...
This reverts commit 276c606329
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-26 02:37:14 +01:00
Mateusz Jablonski
276c606329
feature: update base layout of L0 handles to match ze_handle_t layout
...
this layout is required by L0 DDI Handle extension
Related-To: NEO-13121, NEO-13917
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-25 01:23:43 +01:00
Filip Hazubski
4be1153253
fix: Remove pragma once from inl files
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 20:19:15 +01:00
Compute-Runtime-Validation
b3807e7a36
Revert "feature: Append recorded commandlist into immediate (8/N)"
...
This reverts commit 9db77efd03
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-13 16:35:40 +01:00
Aravind Gopalakrishnan
9db77efd03
feature: Append recorded commandlist into immediate (8/N)
...
- Add primary dispatch capability for immediate command list
- turn off usage of parent linear stream for copy engine
Related-To: NEO-10356
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-02-12 12:39:38 +01:00
Michał Pryba
75bc74089b
refactor: adjust file names after pre-gen12 removal 2/3
...
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-02-03 15:31:51 +01:00
Krzysztof Sprzaczkowski
a17745532c
performance: Move preemption allocation init to the first submission
...
Related-To: NEO-12323
Signed-off-by: Krzysztof Sprzaczkowski <krzysztof.sprzaczkowski@intel.com>
2025-01-15 20:22:50 +01:00
Mateusz Hoppe
60495a5b20
performance: optimize memory used for scratch programming
...
- reserve exact number of slots for scratch surface states in surface
state heaps
- do not use offsets for contexts depending on engine type executing
cmdlists
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-01-14 15:47:08 +01:00
Damian Tomczak
9a149b6da5
refactor: useGlobalHeaps naming unification
...
Related-to: NEO-12737
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-01-14 11:01:07 +01:00
Filip Hazubski
3997d2a6b3
Revert "feature: Append recorded commandlist into immediate (8/N)"
...
This reverts commit acfeffe274
.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-01-03 15:50:52 +01:00
Aravind Gopalakrishnan
acfeffe274
feature: Append recorded commandlist into immediate (8/N)
...
Add primary dispatch capability for immediate command list
Related-To: NEO-10356
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-12-20 18:52:33 +01:00
Compute-Runtime-Validation
f35d620e06
Revert "feature: Append recorded commandlist into immediate (8/N)"
...
This reverts commit d8d7419aee
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-20 03:35:27 +01:00
Aravind Gopalakrishnan
d8d7419aee
feature: Append recorded commandlist into immediate (8/N)
...
Add primary dispatch capability for immediate command list
Related-To: NEO-10356
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-12-19 19:02:14 +01:00
Bartosz Dunajski
e8cfb38db4
performance: improve relaxed ordering task count tracking
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-18 11:36:22 +01:00
Maciej Plewka
8151224501
fix: add microsecond resolution for timeout
...
Related-To: NEO-13445
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-12-13 09:32:18 +01:00
Zbigniew Zdanowicz
75139d2322
fix: zero sync and region barrier buffers before use on regular command lists
...
Related-To: NEO-13350
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-12-06 15:54:50 +01:00
Mateusz Jablonski
fa58073095
refactor: remove not used usings/typedefs/variables
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-28 16:19:39 +01:00
Aravind Gopalakrishnan
c617048f1a
fix: Enqueue blocking support for L0
...
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-10-19 10:28:48 +02:00
Artur Harasimiuk
00aedcaed4
refactor: add NOLINT for pending problems
...
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-10-10 11:46:35 +02:00
Wojciech Konior
6b40f9bc5a
refactor: engineInstancedType removed
...
Related-To: NEO-12594
Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2024-10-09 16:30:48 +02:00
Mateusz Jablonski
552930a75f
fix: don't setup preemption surface when debugger is active
...
Related-To: NEO-12878
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-08 13:58:11 +02:00
Compute-Runtime-Validation
abcb462ac3
Revert "performance: optimize memory used for scratch programming"
...
This reverts commit 9b2696a942
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-09-24 06:52:19 +02:00
Mateusz Hoppe
9b2696a942
performance: optimize memory used for scratch programming
...
- reserve exact number of slots for scratch surface states in surface
state heaps
- do not use offsets for contexts depending on engine type executing
cmdlists
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-23 14:15:47 +02:00
Mateusz Hoppe
4a068c8eab
fix: correclty program StateBaseAddress in global bindless mode
...
- prepare bindful ssh when kernel requires ssh heap and
SurfaceStateBaseAddress
- remove lastAppendedKernelBindlesMode - local ssh heap may be needed
for bindless kernels with scratch or misaligned buffer args
- use ssh heap gpu address to program SurfaceStateBaseAddress, global base is
used for BindlessSurfaceState and DynamicState
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-20 11:57:05 +02:00
Bartosz Dunajski
7a56424c2c
fix: program partition offset for secondary csr
...
Related-To: HSD-18039998563
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-18 17:04:57 +02:00
Aravind Gopalakrishnan
2a6faa64de
feature: Append recorded commandlist into immediate (6/N)
...
- fix premature call to make non resident
Related-To: NEO-10356
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-09-11 07:41:15 +02:00
Bartosz Dunajski
9bdd7b784a
fix: remove not needed cooperative kernel checks
...
Related-To: HSD-18039871488
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-10 18:32:20 +02:00
Mateusz Jablonski
14fd9f3f24
fix: correct using L0 loader functions
...
use zelLoaderTranslateHandle for translating handle to internal handle
get pointer to zelSetDriverTeardown during global ctor
don't load loader library by name
get loader function pointers directly from current process
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-04 17:30:25 +02:00
Tomasz Biernacik
c5e174da00
refactor: remove queue extended methods and files
...
Related-To: NEO-12103
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2024-09-04 11:11:33 +02:00
Aravind Gopalakrishnan
20aa853369
feature: Append recorded commandlist into immediate (5/N)
...
- add support for heapless mode
Related-To: NEO-10356
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-30 09:23:59 +02:00
Mateusz Jablonski
dec502e11b
fix: add magic identifier to L0 command queue object
...
add tests for toInternalType function
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-29 16:39:29 +02:00
Aravind Gopalakrishnan
a9cb7876a5
feature: Append recorded commandlist into immediate (4/N)
...
- add changes for copy queue support for immediate cmdlist submit
Related-To: NEO-10356
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-29 09:35:31 +02:00
Aravind Gopalakrishnan
cb8063f71d
feature: Append recorded command list into immediate (3/N)
...
- Use correct stream for dispatch
- Add capability to append signal event
- Check available space globally in immediate append call
Related-To: NEO-10356
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-16 17:40:28 +02:00
Compute-Runtime-Validation
1cca5e24c1
Revert "performance: disable dispatch task count update when direct submissio...
...
This reverts commit 6d486459fe
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-13 16:37:31 +02:00
Zbigniew Zdanowicz
6d486459fe
performance: disable dispatch task count update when direct submission active
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-13 13:19:26 +02:00
Bartosz Dunajski
d76ac1d1de
fix: scratch controller residency
...
Related-To: HSD-18039519400
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-09 14:47:56 +02:00
Bartosz Dunajski
24cfd203ab
fix: dont download tbx allocations on heapless first device submission
...
Related-To: HSD-18039476929
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-06 14:03:42 +02:00
Fabian Zwoliński
674c4a15ad
fix: use correct gpu address when bindless heaps helper is enabled
...
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-05 15:09:57 +02:00
Bartosz Dunajski
ec34656e0e
fix: debug flag to defer first device submission
...
Related-To: HSD-18039343751
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-02 09:31:25 +02:00
Zbigniew Zdanowicz
7ca0210ef2
fix: patch compute walker scratch address when scratch controller is changed
...
- scenario when command list is changed from normal to low priority queue
- save only scratch patch address, when changed, then enable patching
- kernels will re-use scratch patch address after update of compute walker
Related-To: NEO-11972
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-01 13:28:53 +02:00
Kamil Kopryk
4008ccea05
fix: add scratch ptr in implicit args patching for L0 regular cmdlists
...
Related-To: NEO-11874
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-07-18 14:07:07 +02:00