491 Commits

Author SHA1 Message Date
Zbigniew Zdanowicz
fc770cacf2 feature: enable dispatch primary batch buffer from immediate
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-10 17:33:34 +01:00
Zbigniew Zdanowicz
f8be8414ac feature: add support for wait event preamble in append command list
- add mechanism in queue to trigger start command from queue to regular
- add detection in immediate command list of the need to dispatch an extra start in queue
- fix secondary linear stream in immediate case as it should not use container
- modify tests for primary batch buffer dispatch as default mode
- remove invalid or obsolete tests

Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-07 20:31:09 +01:00
Zbigniew Zdanowicz
566e52f72b feature: add main lock for both execute and flush
- both execute and flush have their own locks
- splitting these locks creates a race condition
- obtain lock at higher level
- do not lock in queue when calling execute from immediate
- pass main lock to flush so it can be unlocked before synchronize

Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-05 18:40:49 +01:00
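
The locking change described in commit 566e52f72b can be illustrated roughly as follows. This is a minimal sketch under stated assumptions, not the driver's code: `SketchQueue`, `executeFromImmediate`, `executeLocked`, `flush`, and `synchronize` are hypothetical names, and a plain `std::mutex` stands in for whatever lock type the runtime actually uses. It shows one lock taken at the higher level, execute running without re-locking, and the same lock handed to flush so it can be released before the blocking wait.

```cpp
#include <atomic>
#include <cassert>
#include <mutex>
#include <string>
#include <vector>

// Hypothetical stand-in for a command queue; not the driver's real class.
class SketchQueue {
  public:
    // Regular path: the queue takes the main lock itself.
    void execute() {
        std::unique_lock<std::mutex> lock(mainMutex);
        executeLocked();
    }

    // Immediate path: lock once at the higher level, run execute without
    // locking again, then pass the same lock to flush.
    void executeFromImmediate() {
        std::unique_lock<std::mutex> lock(mainMutex);
        executeLocked();
        flush(lock);
    }

    std::vector<std::string> log;       // only modified while mainMutex is held
    std::atomic<int> unlockedSyncs{0};  // counts waits done without the lock

  private:
    // Assumes the caller already holds mainMutex.
    void executeLocked() { log.push_back("execute"); }

    // Receives the caller's lock so it can drop it before synchronizing.
    void flush(std::unique_lock<std::mutex> &lock) {
        assert(lock.owns_lock());
        log.push_back("flush");
        lock.unlock();  // other threads may submit while this one waits
        synchronize(lock.owns_lock());
    }

    // Placeholder for a blocking wait on the device.
    void synchronize(bool lockStillHeld) {
        if (!lockStillHeld) {
            ++unlockedSyncs;
        }
    }

    std::mutex mainMutex;
};
```

With a single main lock, execute and flush form one critical section, which is the point the commit message makes about the split locks; releasing before the wait keeps a long synchronize from blocking other submitters.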
Zbigniew Zdanowicz
f4f779f5b2 feature: save parent immediate cmdlist stream as starting stream
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-03 12:06:55 +01:00
Compute-Runtime-Validation
84eafda460 Revert "feature: add main lock for both execute and flush"
This reverts commit a8ea63d638.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-01 03:44:32 +01:00
Zbigniew Zdanowicz
a8ea63d638 feature: add main lock for both execute and flush
- both execute and flush have their own locks, but the split creates a race condition
- save start stream processed by queue, so immediate can use it in flush

Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-02-28 15:29:33 +01:00
Zbigniew Zdanowicz
b3e10a3d16 fix: correct estimation for command queue stream in heapless mode
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-02-28 11:46:00 +01:00
Compute-Runtime-Validation
9785e5c11e Revert "feature: update base layout of L0 handles to match ze_handle_t layout"
This reverts commit 276c606329.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-26 02:37:14 +01:00
Mateusz Jablonski
276c606329 feature: update base layout of L0 handles to match ze_handle_t layout
this layout is required by L0 DDI Handle extension

Related-To: NEO-13121, NEO-13917
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-25 01:23:43 +01:00
Filip Hazubski
4be1153253 fix: Remove pragma once from inl files
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 20:19:15 +01:00
Compute-Runtime-Validation
b3807e7a36 Revert "feature: Append recorded commandlist into immediate (8/N)"
This reverts commit 9db77efd03.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-13 16:35:40 +01:00
Aravind Gopalakrishnan
9db77efd03 feature: Append recorded commandlist into immediate (8/N)
- Add primary dispatch capability for immediate command list
- turn off usage of parent linear stream for copy engine

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-02-12 12:39:38 +01:00
Michał Pryba
75bc74089b refactor: adjust file names after pre-gen12 removal 2/3
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-02-03 15:31:51 +01:00
Krzysztof Sprzaczkowski
a17745532c performance: Move preemption allocation init to the first submission
Related-To: NEO-12323
Signed-off-by: Krzysztof Sprzaczkowski <krzysztof.sprzaczkowski@intel.com>
2025-01-15 20:22:50 +01:00
Mateusz Hoppe
60495a5b20 performance: optimize memory used for scratch programming
- reserve exact number of slots for scratch surface states in surface
state heaps
- do not use offsets for contexts depending on engine type executing
cmdlists

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-01-14 15:47:08 +01:00
Damian Tomczak
9a149b6da5 refactor: useGlobalHeaps naming unification
Related-to: NEO-12737

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-01-14 11:01:07 +01:00
Filip Hazubski
3997d2a6b3 Revert "feature: Append recorded commandlist into immediate (8/N)"
This reverts commit acfeffe274.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-01-03 15:50:52 +01:00
Aravind Gopalakrishnan
acfeffe274 feature: Append recorded commandlist into immediate (8/N)
Add primary dispatch capability for immediate command list

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-12-20 18:52:33 +01:00
Compute-Runtime-Validation
f35d620e06 Revert "feature: Append recorded commandlist into immediate (8/N)"
This reverts commit d8d7419aee.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-20 03:35:27 +01:00
Aravind Gopalakrishnan
d8d7419aee feature: Append recorded commandlist into immediate (8/N)
Add primary dispatch capability for immediate command list

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-12-19 19:02:14 +01:00
Bartosz Dunajski
e8cfb38db4 performance: improve relaxed ordering task count tracking
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-18 11:36:22 +01:00
Maciej Plewka
8151224501 fix: add microsecond resolution for timeout
Related-To: NEO-13445
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-12-13 09:32:18 +01:00
Zbigniew Zdanowicz
75139d2322 fix: zero sync and region barrier buffers before use on regular command lists
Related-To: NEO-13350

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-12-06 15:54:50 +01:00
Mateusz Jablonski
fa58073095 refactor: remove not used usings/typedefs/variables
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-28 16:19:39 +01:00
Aravind Gopalakrishnan
c617048f1a fix: Enqueue blocking support for L0
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-10-19 10:28:48 +02:00
Artur Harasimiuk
00aedcaed4 refactor: add NOLINT for pending problems
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-10-10 11:46:35 +02:00
Wojciech Konior
6b40f9bc5a refactor: engineInstancedType removed
Related-To: NEO-12594

Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2024-10-09 16:30:48 +02:00
Mateusz Jablonski
552930a75f fix: don't setup preemption surface when debugger is active
Related-To: NEO-12878
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-08 13:58:11 +02:00
Compute-Runtime-Validation
abcb462ac3 Revert "performance: optimize memory used for scratch programming"
This reverts commit 9b2696a942.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-09-24 06:52:19 +02:00
Mateusz Hoppe
9b2696a942 performance: optimize memory used for scratch programming
- reserve exact number of slots for scratch surface states in surface
state heaps
- do not use offsets for contexts depending on engine type executing
cmdlists

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-23 14:15:47 +02:00
Mateusz Hoppe
4a068c8eab fix: correctly program StateBaseAddress in global bindless mode
- prepare bindful ssh when kernel requires ssh heap and
SurfaceStateBaseAddress
- remove lastAppendedKernelBindlesMode - local ssh heap may be needed
for bindless kernels with scratch or misaligned buffer args
- use ssh heap gpu address to program SurfaceStateBaseAddress, global base is
used for BindlessSurfaceState and DynamicState

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-20 11:57:05 +02:00
Bartosz Dunajski
7a56424c2c fix: program partition offset for secondary csr
Related-To: HSD-18039998563

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-18 17:04:57 +02:00
Aravind Gopalakrishnan
2a6faa64de feature: Append recorded commandlist into immediate (6/N)
- fix premature call to make non resident

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-09-11 07:41:15 +02:00
Bartosz Dunajski
9bdd7b784a fix: remove not needed cooperative kernel checks
Related-To: HSD-18039871488

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-10 18:32:20 +02:00
Mateusz Jablonski
14fd9f3f24 fix: correct using L0 loader functions
use zelLoaderTranslateHandle for translating handle to internal handle
get pointer to zelSetDriverTeardown during global ctor
don't load loader library by name
get loader function pointers directly from current process

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-04 17:30:25 +02:00
Tomasz Biernacik
c5e174da00 refactor: remove queue extended methods and files
Related-To: NEO-12103

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2024-09-04 11:11:33 +02:00
Aravind Gopalakrishnan
20aa853369 feature: Append recorded commandlist into immediate (5/N)
- add support for heapless mode

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-30 09:23:59 +02:00
Mateusz Jablonski
dec502e11b fix: add magic identifier to L0 command queue object
add tests for toInternalType function

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-29 16:39:29 +02:00
Aravind Gopalakrishnan
a9cb7876a5 feature: Append recorded commandlist into immediate (4/N)
- add changes for copy queue support for immediate cmdlist submit

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-29 09:35:31 +02:00
Aravind Gopalakrishnan
cb8063f71d feature: Append recorded command list into immediate (3/N)
- Use correct stream for dispatch
- Add capability to append signal event
- Check available space globally in immediate append call

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-16 17:40:28 +02:00
Compute-Runtime-Validation
1cca5e24c1 Revert "performance: disable dispatch task count update when direct submissio...
This reverts commit 6d486459fe.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-13 16:37:31 +02:00
Zbigniew Zdanowicz
6d486459fe performance: disable dispatch task count update when direct submission active
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-13 13:19:26 +02:00
Bartosz Dunajski
d76ac1d1de fix: scratch controller residency
Related-To: HSD-18039519400

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-09 14:47:56 +02:00
Bartosz Dunajski
24cfd203ab fix: don't download tbx allocations on heapless first device submission
Related-To: HSD-18039476929

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-06 14:03:42 +02:00
Fabian Zwoliński
674c4a15ad fix: use correct gpu address when bindless heaps helper is enabled
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-05 15:09:57 +02:00
Bartosz Dunajski
ec34656e0e fix: debug flag to defer first device submission
Related-To: HSD-18039343751

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-02 09:31:25 +02:00
Zbigniew Zdanowicz
7ca0210ef2 fix: patch compute walker scratch address when scratch controller is changed
- scenario when command list is changed from normal to low priority queue
- save only scratch patch address, when changed, then enable patching
- kernels will re-use scratch patch address after update of compute walker

Related-To: NEO-11972

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-01 13:28:53 +02:00
Kamil Kopryk
4008ccea05 fix: add scratch ptr in implicit args patching for L0 regular cmdlists
Related-To: NEO-11874
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-07-18 14:07:07 +02:00
Mateusz Hoppe
52f5a9d6f0 refactor: remove SourceLevelDebugger from test and function name
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-10 16:32:26 +02:00
Bartosz Dunajski
6ccd14838e feature: set CSR for interruptable Events during CmdList Execute
Related-To: NEO-8179

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-01 19:35:24 +02:00