Commit Graph

485 Commits

Author SHA1 Message Date
Zbigniew Zdanowicz
b3e10a3d16 fix: correct estimation for command queue stream in heapless mode
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-02-28 11:46:00 +01:00
Compute-Runtime-Validation
9785e5c11e Revert "feature: update base layout of L0 handles to match ze_handle_t layout"
This reverts commit 276c606329.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-26 02:37:14 +01:00
Mateusz Jablonski
276c606329 feature: update base layout of L0 handles to match ze_handle_t layout
this layout is required by L0 DDI Handle extension

Related-To: NEO-13121, NEO-13917
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-25 01:23:43 +01:00
Filip Hazubski
4be1153253 fix: Remove pragma once from inl files
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 20:19:15 +01:00
Compute-Runtime-Validation
b3807e7a36 Revert "feature: Append recorded commandlist into immediate (8/N)"
This reverts commit 9db77efd03.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-13 16:35:40 +01:00
Aravind Gopalakrishnan
9db77efd03 feature: Append recorded commandlist into immediate (8/N)
- Add primary dispatch capability for immediate command list
- turn off usage of parent linear stream for copy engine

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-02-12 12:39:38 +01:00
Michał Pryba
75bc74089b refactor: adjust file names after pre-gen12 removal 2/3
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-02-03 15:31:51 +01:00
Krzysztof Sprzaczkowski
a17745532c performance: Move preemption allocation init to the first submission
Related-To: NEO-12323
Signed-off-by: Krzysztof Sprzaczkowski <krzysztof.sprzaczkowski@intel.com>
2025-01-15 20:22:50 +01:00
Mateusz Hoppe
60495a5b20 performance: optimize memory used for scratch programming
- reserve exact number of slots for scratch surface states in surface
state heaps
- do not use offsets for contexts depending on engine type executing
cmdlists

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-01-14 15:47:08 +01:00
Damian Tomczak
9a149b6da5 refactor: useGlobalHeaps naming unification
Related-to: NEO-12737

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-01-14 11:01:07 +01:00
Filip Hazubski
3997d2a6b3 Revert "feature: Append recorded commandlist into immediate (8/N)"
This reverts commit acfeffe274.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-01-03 15:50:52 +01:00
Aravind Gopalakrishnan
acfeffe274 feature: Append recorded commandlist into immediate (8/N)
Add primary dispatch capability for immediate command list

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-12-20 18:52:33 +01:00
Compute-Runtime-Validation
f35d620e06 Revert "feature: Append recorded commandlist into immediate (8/N)"
This reverts commit d8d7419aee.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-20 03:35:27 +01:00
Aravind Gopalakrishnan
d8d7419aee feature: Append recorded commandlist into immediate (8/N)
Add primary dispatch capability for immediate command list

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-12-19 19:02:14 +01:00
Bartosz Dunajski
e8cfb38db4 performance: improve relaxed ordering task count tracking
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-18 11:36:22 +01:00
Maciej Plewka
8151224501 fix: add microsecond resolution for timeout
Related-To: NEO-13445
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-12-13 09:32:18 +01:00
Zbigniew Zdanowicz
75139d2322 fix: zero sync and region barrier buffers before use on regular command lists
Related-To: NEO-13350

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-12-06 15:54:50 +01:00
Mateusz Jablonski
fa58073095 refactor: remove not used usings/typedefs/variables
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-28 16:19:39 +01:00
Aravind Gopalakrishnan
c617048f1a fix: Enqueue blocking support for L0
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-10-19 10:28:48 +02:00
Artur Harasimiuk
00aedcaed4 refactor: add NOLINT for pending problems
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-10-10 11:46:35 +02:00
Wojciech Konior
6b40f9bc5a refactor: engineInstancedType removed
Related-To: NEO-12594

Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2024-10-09 16:30:48 +02:00
Mateusz Jablonski
552930a75f fix: don't setup preemption surface when debugger is active
Related-To: NEO-12878
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-08 13:58:11 +02:00
Compute-Runtime-Validation
abcb462ac3 Revert "performance: optimize memory used for scratch programming"
This reverts commit 9b2696a942.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-09-24 06:52:19 +02:00
Mateusz Hoppe
9b2696a942 performance: optimize memory used for scratch programming
- reserve exact number of slots for scratch surface states in surface
state heaps
- do not use offsets for contexts depending on engine type executing
cmdlists

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-23 14:15:47 +02:00
Mateusz Hoppe
4a068c8eab fix: correclty program StateBaseAddress in global bindless mode
- prepare bindful ssh when kernel requires ssh heap and
SurfaceStateBaseAddress
- remove lastAppendedKernelBindlesMode - local ssh heap may be needed
for bindless kernels with scratch or misaligned buffer args
- use ssh heap gpu address to program SurfaceStateBaseAddress, global base is
used for BindlessSurfaceState and DynamicState

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-20 11:57:05 +02:00
Bartosz Dunajski
7a56424c2c fix: program partition offset for secondary csr
Related-To: HSD-18039998563

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-18 17:04:57 +02:00
Aravind Gopalakrishnan
2a6faa64de feature: Append recorded commandlist into immediate (6/N)
- fix premature call to make non resident

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-09-11 07:41:15 +02:00
Bartosz Dunajski
9bdd7b784a fix: remove not needed cooperative kernel checks
Related-To: HSD-18039871488

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-10 18:32:20 +02:00
Mateusz Jablonski
14fd9f3f24 fix: correct using L0 loader functions
use zelLoaderTranslateHandle for translating handle to internal handle
get pointer to zelSetDriverTeardown during global ctor
don't load loader library by name
get loader function pointers directly from current process

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-04 17:30:25 +02:00
Tomasz Biernacik
c5e174da00 refactor: remove queue extended methods and files
Related-To: NEO-12103

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2024-09-04 11:11:33 +02:00
Aravind Gopalakrishnan
20aa853369 feature: Append recorded commandlist into immediate (5/N)
- add support for heapless mode

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-30 09:23:59 +02:00
Mateusz Jablonski
dec502e11b fix: add magic identifier to L0 command queue object
add tests for toInternalType function

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-29 16:39:29 +02:00
Aravind Gopalakrishnan
a9cb7876a5 feature: Append recorded commandlist into immediate (4/N)
- add changes for copy queue support for immediate cmdlist submit

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-29 09:35:31 +02:00
Aravind Gopalakrishnan
cb8063f71d feature: Append recorded command list into immediate (3/N)
- Use correct stream for dispatch
- Add capability to append signal event
- Check available space globally in immediate append call

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-16 17:40:28 +02:00
Compute-Runtime-Validation
1cca5e24c1 Revert "performance: disable dispatch task count update when direct submissio...
This reverts commit 6d486459fe.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-13 16:37:31 +02:00
Zbigniew Zdanowicz
6d486459fe performance: disable dispatch task count update when direct submission active
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-13 13:19:26 +02:00
Bartosz Dunajski
d76ac1d1de fix: scratch controller residency
Related-To: HSD-18039519400

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-09 14:47:56 +02:00
Bartosz Dunajski
24cfd203ab fix: dont download tbx allocations on heapless first device submission
Related-To: HSD-18039476929

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-06 14:03:42 +02:00
Fabian Zwoliński
674c4a15ad fix: use correct gpu address when bindless heaps helper is enabled
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-05 15:09:57 +02:00
Bartosz Dunajski
ec34656e0e fix: debug flag to defer first device submission
Related-To: HSD-18039343751

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-02 09:31:25 +02:00
Zbigniew Zdanowicz
7ca0210ef2 fix: patch compute walker scratch address when scratch controller is changed
- scenario when command list is changed from normal to low priority queue
- save only scratch patch address, when changed, then enable patching
- kernels will re-use scratch patch address after update of compute walker

Related-To: NEO-11972

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-01 13:28:53 +02:00
Kamil Kopryk
4008ccea05 fix: add scratch ptr in implicit args patching for L0 regular cmdlists
Related-To: NEO-11874
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-07-18 14:07:07 +02:00
Mateusz Hoppe
52f5a9d6f0 refactor: remove SourceLevelDebugger from test and function name
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-10 16:32:26 +02:00
Bartosz Dunajski
6ccd14838e feature: set CSR for interruptable Events during CmdList Execute
Related-To: NEO-8179

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-01 19:35:24 +02:00
Jitendra Sharma
51fc9059c2 feature: program debug surface address
Related-To: NEO-7986
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-07-01 16:30:14 +02:00
Lukasz Jobczyk
fed90f5c8e fix: Add infrastructure to force dc flush when mitigate dc
-force dc on next tag update after RT kernel
-force dc when release shared object

Related-To: NEO-10556

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-24 17:15:03 +02:00
Dominik Dabek
296d2bb148 performance: adjust ulls timeout on power saving
Use l0 power saving setting to also adjust ulls timeout.

Related-To: NEO-10800

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-06-17 16:51:55 +02:00
Bartosz Dunajski
54e07def28 refactor: add const suffix
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-12 15:45:23 +02:00
Bartosz Dunajski
2e9bb26708 feature: enable copy offload api
Related-To: NEO-11376

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-12 12:02:23 +02:00
Bartosz Dunajski
bb00193205 refactor: remove csr from cmd list class
Related-To: NEO-11376

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-11 15:43:13 +02:00