Commit Graph

493 Commits

Author SHA1 Message Date
Zbigniew Zdanowicz 3759ab279f feature: enable dispatch primary batch buffer from immediate
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-13 12:40:22 +01:00
Compute-Runtime-Validation b2b3b55b19 Revert "feature: enable dispatch primary batch buffer from immediate"
This reverts commit fc770cacf2.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-11 03:46:02 +01:00
Zbigniew Zdanowicz fc770cacf2 feature: enable dispatch primary batch buffer from immediate
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-10 17:33:34 +01:00
Zbigniew Zdanowicz f8be8414ac feature: add support for wait event preamble in append command list
- add mechanism in queue to trigger start command from queue to regular
- add detection in immediate command list need to dispatch extra start in queue
- fix secondary linear stream in immediate case as it should not use container
- modify tests for primary batch buffer dispatch as default mode
- remove invalid or obsolete tests

Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-07 20:31:09 +01:00
Zbigniew Zdanowicz 566e52f72b feature: add main lock for both execute and flush
- both execute and flush have their own locks
- spliting these locks makes race condition
- obtain lock at higher level
- do not lock in queue when calling execute from immediate
- pass main lock to flush so can be unlocked before synchronize

Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-05 18:40:49 +01:00
Zbigniew Zdanowicz f4f779f5b2 feature: save parent immediate cmdlist stream as starting stream
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-03 12:06:55 +01:00
Compute-Runtime-Validation 84eafda460 Revert "feature: add main lock for both execute and flush"
This reverts commit a8ea63d638.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-01 03:44:32 +01:00
Zbigniew Zdanowicz a8ea63d638 feature: add main lock for both execute and flush
- both execute and flush have their locks, but split makes race condition
- save start stream processed by queue, so immediate can use it in flush

Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-02-28 15:29:33 +01:00
Zbigniew Zdanowicz b3e10a3d16 fix: correct estimation for command queue stream in heapless mode
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-02-28 11:46:00 +01:00
Compute-Runtime-Validation 9785e5c11e Revert "feature: update base layout of L0 handles to match ze_handle_t layout"
This reverts commit 276c606329.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-26 02:37:14 +01:00
Mateusz Jablonski 276c606329 feature: update base layout of L0 handles to match ze_handle_t layout
this layout is required by L0 DDI Handle extension

Related-To: NEO-13121, NEO-13917
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-25 01:23:43 +01:00
Filip Hazubski 4be1153253 fix: Remove pragma once from inl files
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 20:19:15 +01:00
Compute-Runtime-Validation b3807e7a36 Revert "feature: Append recorded commandlist into immediate (8/N)"
This reverts commit 9db77efd03.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-13 16:35:40 +01:00
Aravind Gopalakrishnan 9db77efd03 feature: Append recorded commandlist into immediate (8/N)
- Add primary dispatch capability for immediate command list
- turn off usage of parent linear stream for copy engine

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-02-12 12:39:38 +01:00
Michał Pryba 75bc74089b refactor: adjust file names after pre-gen12 removal 2/3
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-02-03 15:31:51 +01:00
Krzysztof Sprzaczkowski a17745532c performance: Move preemption allocation init to the first submission
Related-To: NEO-12323
Signed-off-by: Krzysztof Sprzaczkowski <krzysztof.sprzaczkowski@intel.com>
2025-01-15 20:22:50 +01:00
Mateusz Hoppe 60495a5b20 performance: optimize memory used for scratch programming
- reserve exact number of slots for scratch surface states in surface
state heaps
- do not use offsets for contexts depending on engine type executing
cmdlists

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-01-14 15:47:08 +01:00
Damian Tomczak 9a149b6da5 refactor: useGlobalHeaps naming unification
Related-to: NEO-12737

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-01-14 11:01:07 +01:00
Filip Hazubski 3997d2a6b3 Revert "feature: Append recorded commandlist into immediate (8/N)"
This reverts commit acfeffe274.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-01-03 15:50:52 +01:00
Aravind Gopalakrishnan acfeffe274 feature: Append recorded commandlist into immediate (8/N)
Add primary dispatch capability for immediate command list

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-12-20 18:52:33 +01:00
Compute-Runtime-Validation f35d620e06 Revert "feature: Append recorded commandlist into immediate (8/N)"
This reverts commit d8d7419aee.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-20 03:35:27 +01:00
Aravind Gopalakrishnan d8d7419aee feature: Append recorded commandlist into immediate (8/N)
Add primary dispatch capability for immediate command list

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-12-19 19:02:14 +01:00
Bartosz Dunajski e8cfb38db4 performance: improve relaxed ordering task count tracking
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-18 11:36:22 +01:00
Maciej Plewka 8151224501 fix: add microsecond resolution for timeout
Related-To: NEO-13445
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-12-13 09:32:18 +01:00
Zbigniew Zdanowicz 75139d2322 fix: zero sync and region barrier buffers before use on regular command lists
Related-To: NEO-13350

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-12-06 15:54:50 +01:00
Mateusz Jablonski fa58073095 refactor: remove not used usings/typedefs/variables
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-28 16:19:39 +01:00
Aravind Gopalakrishnan c617048f1a fix: Enqueue blocking support for L0
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-10-19 10:28:48 +02:00
Artur Harasimiuk 00aedcaed4 refactor: add NOLINT for pending problems
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-10-10 11:46:35 +02:00
Wojciech Konior 6b40f9bc5a refactor: engineInstancedType removed
Related-To: NEO-12594

Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2024-10-09 16:30:48 +02:00
Mateusz Jablonski 552930a75f fix: don't setup preemption surface when debugger is active
Related-To: NEO-12878
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-08 13:58:11 +02:00
Compute-Runtime-Validation abcb462ac3 Revert "performance: optimize memory used for scratch programming"
This reverts commit 9b2696a942.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-09-24 06:52:19 +02:00
Mateusz Hoppe 9b2696a942 performance: optimize memory used for scratch programming
- reserve exact number of slots for scratch surface states in surface
state heaps
- do not use offsets for contexts depending on engine type executing
cmdlists

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-23 14:15:47 +02:00
Mateusz Hoppe 4a068c8eab fix: correclty program StateBaseAddress in global bindless mode
- prepare bindful ssh when kernel requires ssh heap and
SurfaceStateBaseAddress
- remove lastAppendedKernelBindlesMode - local ssh heap may be needed
for bindless kernels with scratch or misaligned buffer args
- use ssh heap gpu address to program SurfaceStateBaseAddress, global base is
used for BindlessSurfaceState and DynamicState

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-20 11:57:05 +02:00
Bartosz Dunajski 7a56424c2c fix: program partition offset for secondary csr
Related-To: HSD-18039998563

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-18 17:04:57 +02:00
Aravind Gopalakrishnan 2a6faa64de feature: Append recorded commandlist into immediate (6/N)
- fix premature call to make non resident

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-09-11 07:41:15 +02:00
Bartosz Dunajski 9bdd7b784a fix: remove not needed cooperative kernel checks
Related-To: HSD-18039871488

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-10 18:32:20 +02:00
Mateusz Jablonski 14fd9f3f24 fix: correct using L0 loader functions
use zelLoaderTranslateHandle for translating handle to internal handle
get pointer to zelSetDriverTeardown during global ctor
don't load loader library by name
get loader function pointers directly from current process

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-04 17:30:25 +02:00
Tomasz Biernacik c5e174da00 refactor: remove queue extended methods and files
Related-To: NEO-12103

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2024-09-04 11:11:33 +02:00
Aravind Gopalakrishnan 20aa853369 feature: Append recorded commandlist into immediate (5/N)
- add support for heapless mode

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-30 09:23:59 +02:00
Mateusz Jablonski dec502e11b fix: add magic identifier to L0 command queue object
add tests for toInternalType function

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-29 16:39:29 +02:00
Aravind Gopalakrishnan a9cb7876a5 feature: Append recorded commandlist into immediate (4/N)
- add changes for copy queue support for immediate cmdlist submit

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-29 09:35:31 +02:00
Aravind Gopalakrishnan cb8063f71d feature: Append recorded command list into immediate (3/N)
- Use correct stream for dispatch
- Add capability to append signal event
- Check available space globally in immediate append call

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-16 17:40:28 +02:00
Compute-Runtime-Validation 1cca5e24c1 Revert "performance: disable dispatch task count update when direct submissio...
This reverts commit 6d486459fe.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-13 16:37:31 +02:00
Zbigniew Zdanowicz 6d486459fe performance: disable dispatch task count update when direct submission active
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-13 13:19:26 +02:00
Bartosz Dunajski d76ac1d1de fix: scratch controller residency
Related-To: HSD-18039519400

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-09 14:47:56 +02:00
Bartosz Dunajski 24cfd203ab fix: dont download tbx allocations on heapless first device submission
Related-To: HSD-18039476929

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-06 14:03:42 +02:00
Fabian Zwoliński 674c4a15ad fix: use correct gpu address when bindless heaps helper is enabled
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-05 15:09:57 +02:00
Bartosz Dunajski ec34656e0e fix: debug flag to defer first device submission
Related-To: HSD-18039343751

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-02 09:31:25 +02:00
Zbigniew Zdanowicz 7ca0210ef2 fix: patch compute walker scratch address when scratch controller is changed
- scenario when command list is changed from normal to low priority queue
- save only scratch patch address, when changed, then enable patching
- kernels will re-use scratch patch address after update of compute walker

Related-To: NEO-11972

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-01 13:28:53 +02:00
Kamil Kopryk 4008ccea05 fix: add scratch ptr in implicit args patching for L0 regular cmdlists
Related-To: NEO-11874
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-07-18 14:07:07 +02:00