Mateusz Hoppe
37b7caa137
fix: correct program header generation for shared isa allocation
...
- when kernels share single allocation, LOAD address in program headers
should point to correct virtual address including kernel offset
Related-To: NEO-7788, GSD-9836
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-03 17:50:29 +02:00
Mateusz Jablonski
43e3957e66
fix: add nullptr check in cache reservation API
...
in case of AUB/TBX mode there is no os interface
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-03 15:43:35 +02:00
Mateusz Jablonski
7adfa344c0
performance: get reference from getBindlessInfo
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-03 12:50:52 +02:00
Wenbin Lu
93cde3ee12
feature: support SVM heap in reserveVirtualMem
...
Related-To: NEO-11981
Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-09-03 11:38:51 +02:00
Mateusz Jablonski
c0cec096a5
fix: don't return gtpin init error from L0 API call
...
gtpin should be transparent for application
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-02 17:41:08 +02:00
Mateusz Hoppe
d5e275cb49
performance: optimize surface state programming
...
- eliminate read-modify-write on gfx memory when global bindless mode
enabled
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-02 15:31:30 +02:00
Aravind Gopalakrishnan
20aa853369
feature: Append recorded commandlist into immediate (5/N)
...
- add support for heapless mode
Related-To: NEO-10356
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-30 09:23:59 +02:00
Mateusz Jablonski
dec502e11b
fix: add magic identifier to L0 command queue object
...
add tests for toInternalType function
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-29 16:39:29 +02:00
Aravind Gopalakrishnan
a9cb7876a5
feature: Append recorded commandlist into immediate (4/N)
...
- add changes for copy queue support for immediate cmdlist submit
Related-To: NEO-10356
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-29 09:35:31 +02:00
Fabian Zwoliński
6a50320f3c
fix: use gpuAddress when patching sync buffer
...
Related-To: NEO-11533
Related-To: HSD-18039788811
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-28 17:47:35 +02:00
Mateusz Jablonski
d45c16dfc2
fix: add fallback for invalid handles in extension functions
...
handle context, commandlist, driver, device, event, image and kernel handles
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-28 17:22:35 +02:00
Bartosz Dunajski
db611962f7
fix: improve task count handling in tbx download path
...
Related-To: HSD-18039789178
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-28 15:32:15 +02:00
Mateusz Hoppe
482e077252
feature: add regular and hp contexts in group without dedicated hp engine
...
- if no hp copy engine available, create group with regular and hp
contexts
Related-To: NEO-11983
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-08-28 14:01:10 +02:00
Mateusz Hoppe
ce07b86d87
fix: return valid LP csr from root device
...
Related-To: NEO-12257
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-08-27 23:33:57 +02:00
Mateusz Jablonski
b7cedb99a4
fix: move gtpin initialization to first zeDriverGet call
...
Currently gtpin is initialized during zeInit call. However, it is not
correct as gtpin is calling other L0 API functions, like zeDriverGet
which should not be called until zeInit function finishes.
With this commit gtpin initialization is performed during first
zeDriverGet function call. In case of initialization failure
ZE_RESULT_ERROR_DEPENDENCY_UNAVAILABLE will be returned on every
zeDriverGet call.
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-27 15:09:04 +02:00
Bartosz Dunajski
85359331ce
fix: ensure counterbased timestamps writes before access
...
Related-To: HSD-18039452525
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-27 15:04:32 +02:00
Bartosz Dunajski
1795642da9
fix: disallow enabling synchronized dispatch on non-ioq queues
...
Related-To: HSD-18039265269
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-23 19:30:24 +02:00
Mateusz Hoppe
c9e8931a18
fix: global bindless offsets for implicit args
...
- do not patch bindless offsets for implicit args with global bindless
allocator, the offsets are patched at kernel initialization time and
do not change
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-08-23 13:21:27 +02:00
Bartosz Dunajski
696b02bfd3
fix: improve TBX downloading after L0 Event sync
...
Related-To: HSD-18038498579
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-23 10:42:17 +02:00
Mateusz Jablonski
5b51146673
fix: fail L0 device creation when invalid state save area header was delivered
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-22 16:24:40 +02:00
Bartosz Dunajski
716c557cdb
feature: update region barrier size allocation
...
Related-To: NEO-11031
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-22 12:21:11 +02:00
Winston Zhang
0590b34cfa
feature: refactor and rewrite setErrorDescription
...
Related-To: NEO-8379
Signed-off-by: Winston Zhang <winston.zhang@intel.com>
2024-08-21 17:26:25 +02:00
Szymon Morek
f2fd669011
fix: call wait after post-processing
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-08-21 12:46:18 +02:00
Mateusz Jablonski
579af57161
refactor: don't call OsLibrary::load directly, use function pointer
...
this allows mocking this call in ULT
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-20 08:59:26 +02:00
Bartosz Dunajski
ce0ecf5939
fix: download L0 Event TBX allocation on related subdevice if needed
...
Related-To: HSD-18038498579
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-19 18:34:58 +02:00
Aravind Gopalakrishnan
cb8063f71d
feature: Append recorded command list into immediate (3/N)
...
- Use correct stream for dispatch
- Add capability to append signal event
- Check available space globally in immediate append call
Related-To: NEO-10356
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-16 17:40:28 +02:00
Compute-Runtime-Validation
9b652f4a34
Revert "feature: Improving information transfer about the copy engine"
...
This reverts commit 17ffdff4f1.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-15 22:06:31 +02:00
Szymon Morek
26769205a9
fix: unlock mutex before wait
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-08-14 14:27:43 +02:00
Andrzej Koska
17ffdff4f1
feature: Improving information transfer about the copy engine
...
Related-To: NEO-11934
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-08-14 11:28:29 +02:00
Compute-Runtime-Validation
1cca5e24c1
Revert "performance: disable dispatch task count update when direct submissio...
...
This reverts commit 6d486459fe.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-13 16:37:31 +02:00
Kamil Kopryk
c4850dbd4b
refactor: set containsAnyKernel also in heapless
...
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-08-13 14:44:36 +02:00
Zbigniew Zdanowicz
6d486459fe
performance: disable dispatch task count update when direct submission active
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-13 13:19:26 +02:00
Bartosz Dunajski
d76ac1d1de
fix: scratch controller residency
...
Related-To: HSD-18039519400
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-09 14:47:56 +02:00
Dominik Dabek
ad229377b9
fix: disable indirect detection if any stack calls
...
Don't know if kernels will be initialized in the order needed to check
for indirect accesses in stack calls.
Remove now unused functionPointerWithIndirectAccessExists and reading
this value from zebin.
Related-To: NEO-12235
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-08-07 14:48:58 +02:00
Bartosz Dunajski
24cfd203ab
fix: don't download tbx allocations on heapless first device submission
...
Related-To: HSD-18039476929
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-06 14:03:42 +02:00
Filip Hazubski
f4f6bc799e
fix: Move Event resource cleanup to destroy function
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-08-06 11:39:53 +02:00
Kamil Kopryk
d3541a8149
fix: don't abort in clearCommandsToPatch with CwImplicitArgsScratch
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-11874
Related-To: HSD-18039321798
Related-To: HSD-18039443985
Related-To: HSD-18039321794
2024-08-06 08:59:03 +02:00
Kamil Kopryk
2a9bcdeb83
refactor: pass outImplicitArgs to patchImplicitArgs function
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-08-05 17:31:47 +02:00
Zbigniew Zdanowicz
5f2646daa6
feature: add external required scratch space
...
Related-To: NEO-11972
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-05 16:12:42 +02:00
Fabian Zwoliński
674c4a15ad
fix: use correct gpu address when bindless heaps helper is enabled
...
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-05 15:09:57 +02:00
Zbigniew Zdanowicz
ae022c1113
feature: add kernel command view flag at command list level
...
Related-To: NEO-11972
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-05 12:58:35 +02:00
Bartosz Dunajski
6103ab1dae
fix: don't set needsFlush for Copy operation if it's not programmed
...
Related-To: HSD-18039452525
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-02 17:54:19 +02:00
Bartosz Dunajski
ec34656e0e
fix: debug flag to defer first device submission
...
Related-To: HSD-18039343751
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-02 09:31:25 +02:00
Bartosz Dunajski
5ccfd6f2be
fix: add missing AlignLocalMemoryVaTo2MB flag support
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-01 13:54:51 +02:00
Zbigniew Zdanowicz
7ca0210ef2
fix: patch compute walker scratch address when scratch controller is changed
...
- scenario when command list is changed from normal to low priority queue
- save only scratch patch address, when changed, then enable patching
- kernels will re-use scratch patch address after update of compute walker
Related-To: NEO-11972
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-08-01 13:28:53 +02:00
Jack Myers
f5d00b2616
feature: 2d-block-load-transpose query
...
Implemented device property query API for determining
support capabilities regarding 2d-block-load-transpose
features, which not all Intel devices support.
Related-To: NEO-11592
Signed-off-by: Jack Myers <jack.myers@intel.com>
2024-07-30 18:21:07 +02:00
Fabian Zwoliński
b1a50104a8
fix: include dynamic SLM in clGetKernelWorkGroupInfo and zeKernelGetProperties
...
Current implementation only takes static slmInlineSize into account.
With this change we also include dynamic SLM passed as kernel arguments.
Related-To: NEO-5761
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-07-30 17:59:45 +02:00
Zbigniew Zdanowicz
7d1b59f008
feature: add option to make compute walker command view
...
Related-To: NEO-11972
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-07-30 11:23:45 +02:00
Mateusz Hoppe
a734a738fa
fix: use regular engine for HP copy engine if no HP engine
...
- map HIGH PRIORITY queues to regular engine for copy engine if there is
no HP copy engine available
Related-To: NEO-11983
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-26 17:09:31 +02:00
Bartosz Dunajski
ebcbbcf8fe
fix: add missing api exports
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-26 12:15:28 +02:00