Commit Graph

2776 Commits

Author SHA1 Message Date
Mateusz Hoppe
37b7caa137 fix: correct program header generation for shared isa allocation
- when kernels share single allocation, LOAD address in program headers
should point to correct virtual address including kernel offset

Related-To: NEO-7788, GSD-9836

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-03 17:50:29 +02:00
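A minimal sketch of the address calculation described in commit 37b7caa137, not the driver's actual code. It assumes that the LOAD address for a kernel in a shared ISA allocation is the allocation's base virtual address plus the kernel's offset. The `KernelPlacement` struct and `loadSegmentVa` function are hypothetical names introduced only for illustration.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical description of one kernel placed inside a shared ISA allocation.
struct KernelPlacement {
    uint64_t allocationGpuVa;    // base virtual address of the shared allocation
    uint64_t allocationSize;     // total size of the shared allocation in bytes
    uint64_t offsetInAllocation; // byte offset of this kernel's ISA in the allocation
    uint64_t isaSize;            // size of this kernel's ISA in bytes
};

// Virtual address a LOAD program header for this kernel would carry under the
// assumption above: base address plus the kernel's offset, not the base alone.
inline uint64_t loadSegmentVa(const KernelPlacement &k) {
    // The kernel must lie fully inside the shared allocation.
    assert(k.offsetInAllocation + k.isaSize <= k.allocationSize);
    return k.allocationGpuVa + k.offsetInAllocation;
}
```

With a base of 0x10000 and an offset of 0x400, the helper yields 0x10400, whereas using the base alone would point every kernel's header at the start of the allocation.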
Mateusz Jablonski
43e3957e66 fix: add nullptr check in cache reservation API
in case of AUB/TBX mode there is no os interface

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-03 15:43:35 +02:00
Mateusz Jablonski
7adfa344c0 performance: get reference from getBindlessInfo
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-03 12:50:52 +02:00
Wenbin Lu
93cde3ee12 feature: support SVM heap in reserveVirtualMem
Related-To: NEO-11981

Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-09-03 11:38:51 +02:00
Mateusz Jablonski
c0cec096a5 fix: don't return gtpin init error from L0 API call
gtpin should be transparent for application

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-02 17:41:08 +02:00
Mateusz Hoppe
d5e275cb49 performance: optimize surface state programming
- eliminate read-modify-write on gfx memory when global bindless mode
enabled

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-02 15:31:30 +02:00
Aravind Gopalakrishnan
20aa853369 feature: Append recorded commandlist into immediate (5/N)
- add support for heapless mode

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-30 09:23:59 +02:00
Mateusz Jablonski
dec502e11b fix: add magic identifier to L0 command queue object
add tests for toInternalType function

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-29 16:39:29 +02:00
Aravind Gopalakrishnan
a9cb7876a5 feature: Append recorded commandlist into immediate (4/N)
- add changes for copy queue support for immediate cmdlist submit

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-29 09:35:31 +02:00
Fabian Zwoliński
6a50320f3c fix: use gpuAddress when patching sync buffer
Related-To: NEO-11533
Related-To: HSD-18039788811
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-28 17:47:35 +02:00
Mateusz Jablonski
d45c16dfc2 fix: add fallback for invalid handles in extension functions
handle context, commandlist, driver, device, event, image and kernel handles

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-28 17:22:35 +02:00
Bartosz Dunajski
db611962f7 fix: improve task count handling in tbx download path
Related-To: HSD-18039789178

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-28 15:32:15 +02:00
Mateusz Hoppe
482e077252 feature: add regular and hp contexts in group without dedicated hp engine
- if no hp copy engine available, create group with regular and hp
contexts

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-08-28 14:01:10 +02:00
Mateusz Hoppe
ce07b86d87 fix: return valid LP csr from root device
Related-To: NEO-12257

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-08-27 23:33:57 +02:00
Mateusz Jablonski
b7cedb99a4 fix: move gtpin initialization to first zeDriverGet call
Currently gtpin is initialized during the zeInit call. This is not
correct, because gtpin calls other L0 API functions, such as zeDriverGet,
which should not be called until zeInit finishes.

With this commit gtpin initialization is performed during the first
zeDriverGet call. If initialization fails,
ZE_RESULT_ERROR_DEPENDENCY_UNAVAILABLE is returned from every
zeDriverGet call.

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-27 15:09:04 +02:00
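The deferred initialization described in commit b7cedb99a4 can be sketched roughly as follows. This is an illustrative pattern under stated assumptions, not the driver's implementation: `Result`, `InitFn`, and `DeferredToolInit` are hypothetical names, and the sketch is single-threaded, so a real driver would need synchronization around the first call.

```cpp
#include <cassert>

// Hypothetical result codes standing in for the real ze_result_t values.
enum class Result { Success, DependencyUnavailable };

// Hypothetical signature for the tool's init routine: returns true on success.
using InitFn = bool (*)();

class DeferredToolInit {
  public:
    explicit DeferredToolInit(InitFn fn) : initFn(fn) { assert(fn != nullptr); }

    // Called from the (hypothetical) driver-get path. The init routine runs
    // only on the first call; later calls reuse the cached outcome, so a
    // failure is reported on every call.
    Result onDriverGet() {
        if (!attempted) {
            initOk = initFn();
            attempted = true;
        }
        return initOk ? Result::Success : Result::DependencyUnavailable;
    }

  private:
    InitFn initFn;
    bool attempted = false; // set once the init routine has been run
    bool initOk = false;    // cached outcome of the init routine
};
```

Caching the outcome keeps the behavior stable across calls: a failed initialization is not retried, and the same error is returned each time.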
Bartosz Dunajski
85359331ce fix: ensure counter-based timestamp writes before access
Related-To: HSD-18039452525

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-27 15:04:32 +02:00
Bartosz Dunajski
1795642da9 fix: disallow enabling synchronized dispatch on non-ioq queues
Related-To: HSD-18039265269

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-23 19:30:24 +02:00
Mateusz Hoppe
c9e8931a18 fix: global bindless offsets for implicit args
- do not patch bindless offsets for implicit args when the global bindless
allocator is used; the offsets are patched at kernel initialization time
and do not change

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-08-23 13:21:27 +02:00
Bartosz Dunajski
696b02bfd3 fix: improve TBX downloading after L0 Event sync
Related-To: HSD-18038498579

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-23 10:42:17 +02:00
Mateusz Jablonski
5b51146673 fix: fail L0 device creation when invalid state save area header was delivered
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-22 16:24:40 +02:00
Bartosz Dunajski
716c557cdb feature: update region barrier size allocation
Related-To: NEO-11031

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-22 12:21:11 +02:00
Winston Zhang
0590b34cfa feature: refactor and rewrite setErrorDescription
Related-To: NEO-8379

Signed-off-by: Winston Zhang <winston.zhang@intel.com>
2024-08-21 17:26:25 +02:00
Szymon Morek
f2fd669011 fix: call wait after post-processing
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-08-21 12:46:18 +02:00
Mateusz Jablonski
579af57161 refactor: don't call OsLibrary::load directly, use function pointer
this allows mocking this call in ULT

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-20 08:59:26 +02:00
Bartosz Dunajski
ce0ecf5939 fix: download L0 Event TBX allocation on related subdevice if needed
Related-To: HSD-18038498579

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-19 18:34:58 +02:00
Aravind Gopalakrishnan
cb8063f71d feature: Append recorded command list into immediate (3/N)
- Use correct stream for dispatch
- Add capability to append signal event
- Check available space globally in immediate append call

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-16 17:40:28 +02:00
Compute-Runtime-Validation
9b652f4a34 Revert "feature: Improving information transfer about the copy engine"
This reverts commit 17ffdff4f1.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-15 22:06:31 +02:00
Szymon Morek
26769205a9 fix: unlock mutex before wait
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-08-14 14:27:43 +02:00
Andrzej Koska
17ffdff4f1 feature: Improving information transfer about the copy engine
Related-To: NEO-11934

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-08-14 11:28:29 +02:00
Compute-Runtime-Validation
1cca5e24c1 Revert "performance: disable dispatch task count update when direct submissio...
This reverts commit 6d486459fe.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-13 16:37:31 +02:00
Kamil Kopryk
c4850dbd4b refactor: set containsAnyKernel also in heapless
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-08-13 14:44:36 +02:00
Zbigniew Zdanowicz
6d486459fe performance: disable dispatch task count update when direct submission active
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-13 13:19:26 +02:00
Bartosz Dunajski
d76ac1d1de fix: scratch controller residency
Related-To: HSD-18039519400

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-09 14:47:56 +02:00
Dominik Dabek
ad229377b9 fix: disable indirect detection if any stack calls
Don't know if kernels will be initialized in the order needed to check
for indirect accesses in stack calls.

Remove now unused functionPointerWithIndirectAccessExists and reading
this value from zebin.

Related-To: NEO-12235

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-08-07 14:48:58 +02:00
Bartosz Dunajski
24cfd203ab fix: don't download tbx allocations on heapless first device submission
Related-To: HSD-18039476929

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-06 14:03:42 +02:00
Filip Hazubski
f4f6bc799e fix: Move Event resource cleanup to destroy function
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-08-06 11:39:53 +02:00
Kamil Kopryk
d3541a8149 fix: don't abort in clearCommandsToPatch with CwImplicitArgsScratch
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>

Related-To: NEO-11874
Related-To: HSD-18039321798
Related-To: HSD-18039443985
Related-To: HSD-18039321794
2024-08-06 08:59:03 +02:00
Kamil Kopryk
2a9bcdeb83 refactor: pass outImplicitArgs to patchImplicitArgs function
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-08-05 17:31:47 +02:00
Zbigniew Zdanowicz
5f2646daa6 feature: add external required scratch space
Related-To: NEO-11972

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-05 16:12:42 +02:00
Fabian Zwoliński
674c4a15ad fix: use correct gpu address when bindless heaps helper is enabled
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-05 15:09:57 +02:00
Zbigniew Zdanowicz
ae022c1113 feature: add kernel command view flag at command list level
Related-To: NEO-11972

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-05 12:58:35 +02:00
Bartosz Dunajski
6103ab1dae fix: don't set needsFlush for Copy operation if it's not programmed
Related-To: HSD-18039452525

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-02 17:54:19 +02:00
Bartosz Dunajski
ec34656e0e fix: debug flag to defer first device submission
Related-To: HSD-18039343751

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-02 09:31:25 +02:00
Bartosz Dunajski
5ccfd6f2be fix: add missing AlignLocalMemoryVaTo2MB flag support
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-01 13:54:51 +02:00
Zbigniew Zdanowicz
7ca0210ef2 fix: patch compute walker scratch address when scratch controller is changed
- scenario when command list is changed from normal to low priority queue
- save only scratch patch address, when changed, then enable patching
- kernels will re-use scratch patch address after update of compute walker

Related-To: NEO-11972

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-01 13:28:53 +02:00
Jack Myers
f5d00b2616 feature: 2d-block-load-transpose query
Implemented a device property query API for determining
support for 2d-block-load-transpose features,
which not all Intel devices support.

Related-To: NEO-11592
Signed-off-by: Jack Myers <jack.myers@intel.com>
2024-07-30 18:21:07 +02:00
Fabian Zwoliński
b1a50104a8 fix: include dynamic SLM in clGetKernelWorkGroupInfo and zeKernelGetProperties
The current implementation only takes the static slmInlineSize into account.
With this change we also include dynamic SLM passed as kernel arguments.

Related-To: NEO-5761
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-07-30 17:59:45 +02:00
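The accounting change in commit b1a50104a8 can be illustrated with a small sketch. It assumes the reported size is the static `slmInlineSize` plus the sizes of all dynamic SLM kernel arguments, and it ignores any alignment or rounding the driver may apply. `totalSlmSize` and its parameters are hypothetical names, not the driver's API.

```cpp
#include <cstdint>
#include <numeric>
#include <vector>

// Hypothetical helper: total SLM usage reported for a kernel, assuming it is
// the static inline size plus the sum of the dynamic SLM argument sizes.
inline uint32_t totalSlmSize(uint32_t slmInlineSize,
                             const std::vector<uint32_t> &dynamicSlmArgSizes) {
    // Start the accumulation from the static size and add each dynamic size.
    return std::accumulate(dynamicSlmArgSizes.begin(), dynamicSlmArgSizes.end(),
                           slmInlineSize);
}
```

With no dynamic arguments the helper returns the static size alone, which corresponds to the behavior before the change.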
Zbigniew Zdanowicz
7d1b59f008 feature: add option to make compute walker command view
Related-To: NEO-11972

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-07-30 11:23:45 +02:00
Mateusz Hoppe
a734a738fa fix: use regular engine for HP copy engine if no HP engine
- map HIGH PRIORITY queues to the regular engine for the copy engine if
there is no HP copy engine available

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-26 17:09:31 +02:00
Bartosz Dunajski
ebcbbcf8fe fix: add missing api exports
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-26 12:15:28 +02:00