Jaroslaw Warchulski
05250e7460
fix: Remove patchtokens fallback AIL
...
Related-To: NEO-12358
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-09-12 18:26:19 +02:00
Zbigniew Zdanowicz
8bb92ff445
performance: avoid reading from gfx memory when modifying walker command
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-09-11 18:58:33 +02:00
Lukasz Jobczyk
48dc0a823d
fix: Check dst pitch in copy rect misaligned case
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-09-11 11:45:10 +02:00
Zbigniew Zdanowicz
7ce4a8adc2
performance: replace virtual calls with native class methods
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-09-11 11:10:40 +02:00
Dominik Dabek
b2fc7345cf
performance: redesign usm alloc reuse mechanism
...
Dedicated pools for different allocation size ranges.
Additional reused allocations will create their own pools.
Do not reuse allocations larger than 256MB.
Related-To: NEO-6893, NEO-12299, NEO-12349
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-09-10 18:12:14 +02:00
Bartosz Dunajski
4f1262645b
refactor: pass extra walker params
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-10 15:30:03 +02:00
Filip Hazubski
beaddb55e0
fix: Minor code changes
...
Remove needless checks for module and cmdList.
Add explicit check for surfaceState in setArgImageWithMipLevel.
Explicitly initialize WddmResidencyController::csr.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-09-10 13:31:15 +02:00
Mateusz Jablonski
54bda0e986
fix: In Linux CL/GL sharing
...
- always issue flush request before export
Apparently it's expected to flush the object before export (the flush
might convert it from one format to another, or remove aux buffer uses
or anything else not supported by export).
- use modifier to select tiling mode
Previously we just assumed that whatever tiling mode was picked by Mesa
would match the one picked by GMMLIB, but that's not always the case.
In particular, it doesn't work on Arc and Xe: Mesa picks Tile4
and GMMLIB picks Tile64.
Fixes: #761
Fixes: #736
Signed-off-by: Sylvain Munaut <tnt@246tNt.com>
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-06 21:40:18 +02:00
Mateusz Hoppe
9d5f8eb587
refactor: save hpCopyEngine in Device
...
- remove loop iteration to get hpCopyEngine
Related-To: NEO-11983
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-06 17:56:53 +02:00
Lukasz Jobczyk
03690e9b83
fix: Set special queue after its setup
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-09-06 12:05:24 +02:00
Kamil Kopryk
95b035a071
fix: global stateless heap creation in heapless path
...
Related-To: NEO-10681
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-09-05 16:56:00 +02:00
Andrzej Koska
b0e7a11e9a
refactor: Improving information transfer about the copy engine
...
Related-To: NEO-11934
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-09-05 16:11:52 +02:00
Compute-Runtime-Validation
d842f65cf1
Revert "fix: submit dummy exec to pin memory during zeContextMakeMemoryReside...
...
This reverts commit f9b87d53e6.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-09-05 03:28:03 +02:00
Maciej Plewka
f9b87d53e6
fix: submit dummy exec to pin memory during zeContextMakeMemoryResident call
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
Related-To: NEO-11879
2024-09-04 14:07:29 +02:00
Mateusz Hoppe
37b7caa137
fix: correct program header generation for shared isa allocation
...
- when kernels share single allocation, LOAD address in program headers
should point to correct virtual address including kernel offset
Related-To: NEO-7788, GSD-9836
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-03 17:50:29 +02:00
Mateusz Jablonski
7adfa344c0
performance: get reference from getBindlessInfo
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-03 12:50:52 +02:00
Lukasz Jobczyk
8b760f8528
performance: Copy hostptr on cpu for host buffer when mitigate dc flush
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-30 19:55:45 +02:00
Bartosz Dunajski
847180dc9a
refactor: add debug prints for timestamp waits
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-30 19:55:15 +02:00
Mateusz Jablonski
c934877790
refactor: remove not needed function
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-30 12:18:14 +02:00
Dominik Dabek
4552960e1f
performance(ocl): limit flushes on resolve via pc
...
When resolving dependencies via pipecontrol, skip hdc and untyped data
port cache flushes. Leave only the command streamer stall.
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-08-29 17:19:19 +02:00
Lukasz Jobczyk
7550e55d64
fix: Always copy hostptr on gpu when mitigate dc flush
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-29 14:53:57 +02:00
Lukasz Jobczyk
496012d82f
performance: Use copy buffer rect middle builtin
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-28 15:32:00 +02:00
Lukasz Jobczyk
5aa5d40937
performance: Mitigate dc flush on LNL Windows
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-28 13:35:17 +02:00
Lukasz Jobczyk
0b848a5fdb
fix: Don't use copy buffer rect middle builtin
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-27 12:01:58 +02:00
Fabian Zwoliński
90bc6c10ea
fix(ocl): use correct surface state index for bindless offset in setArgImage
...
Related-To: NEO-11156
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-27 09:42:46 +02:00
Compute-Runtime-Validation
ad0d6f5435
Revert "refactor: Add dc flush mitigation infrastructure"
...
This reverts commit e4412e385a.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-27 02:35:06 +02:00
Dominik Dabek
a47ca96a42
fix(ocl): allocate small buffer pool uncompressed
...
Related-To: HSD-15016054429
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-08-26 13:00:47 +02:00
Lukasz Jobczyk
e4412e385a
refactor: Add dc flush mitigation infrastructure
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-26 10:38:56 +02:00
Compute-Runtime-Validation
5dbbaa39b9
Revert "fix: ulls controller sleep, windows"
...
This reverts commit 6455d4648c.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-24 06:37:58 +02:00
Compute-Runtime-Validation
956dd8e17d
Revert "fix: set properly resource params when setAllocationType"
...
This reverts commit 2e0884a301.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-24 05:01:38 +02:00
Katarzyna Cencelewska
2e0884a301
fix: set properly resource params when setAllocationType
...
GMM params (usage, cacheable and resource info)
should be set properly when overriding the allocation type
Resolves: HSD-22020344331
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-08-23 16:57:23 +02:00
Lukasz Jobczyk
9152b6ac04
performance: Defer special queue init to first use
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-23 14:22:51 +02:00
Mateusz Hoppe
c9e8931a18
fix: global bindless offsets for implicit args
...
- do not patch bindless offsets for implicit args with the global bindless
allocator; the offsets are patched at kernel initialization time and
do not change
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-08-23 13:21:27 +02:00
Dominik Dabek
6455d4648c
fix: ulls controller sleep, windows
...
Request higher resolution for Windows periodic timers for ulls
controller sleep.
Allows the controller thread to sleep with a granularity of 1ms.
Related-To: NEO-10800
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-08-23 12:05:26 +02:00
Bartosz Dunajski
696b02bfd3
fix: improve TBX downloading after L0 Event sync
...
Related-To: HSD-18038498579
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-23 10:42:17 +02:00
Compute-Runtime-Validation
4b01058706
Revert "performance: Defer special queue init to first use"
...
This reverts commit 25bb3c87ad.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-23 09:09:17 +02:00
Lukasz Jobczyk
c1a5fb089b
performance: Add copy buffer rect middle builtin
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-22 10:30:17 +02:00
Lukasz Jobczyk
25bb3c87ad
performance: Defer special queue init to first use
...
Resolves: NEO-12332
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-22 10:07:44 +02:00
Compute-Runtime-Validation
d8ea5516b2
Revert "performance: Add copy buffer rect middle builtin"
...
This reverts commit bbb44c7a4d.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-22 04:41:42 +02:00
Mateusz Jablonski
7ac41615cd
fix: create thread with function pointer
...
don't create async thread in neo shared tests
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-21 18:02:37 +02:00
Lukasz Jobczyk
bbb44c7a4d
performance: Add copy buffer rect middle builtin
...
Resolves: NEO-12132
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-21 14:41:44 +02:00
Mateusz Jablonski
f617093a6a
fix: add missing nullptr check before accessing ail helper
...
Fixes: #755
Fixes: #754
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-20 16:58:53 +02:00
Mateusz Jablonski
579af57161
refactor: don't call OsLibrary::load directly, use function pointer
...
this allows mocking this call in ULT
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-20 08:59:26 +02:00
Compute-Runtime-Validation
9b652f4a34
Revert "feature: Improving information transfer about the copy engine"
...
This reverts commit 17ffdff4f1.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-15 22:06:31 +02:00
Andrzej Koska
17ffdff4f1
feature: Improving information transfer about the copy engine
...
Related-To: NEO-11934
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-08-14 11:28:29 +02:00
Maciej Plewka
27488a8315
fix: Add option to use divergent paths for barriers
...
Related-To: NEO-12159, IGC-9846, HSD-16024235164, HSD-15014782386
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-08-12 12:15:54 +02:00
Bartosz Dunajski
d76ac1d1de
fix: scratch controller residency
...
Related-To: HSD-18039519400
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-09 14:47:56 +02:00
Kamil Kopryk
775b14a7f6
fix: add ioh alignment in heapless
...
Related-To: NEO-11871
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-08-09 12:20:00 +02:00
Dominik Dabek
ad229377b9
fix: disable indirect detection if any stack calls
...
It is not known whether kernels will be initialized in the order needed
to check for indirect accesses in stack calls.
Remove the now unused functionPointerWithIndirectAccessExists and the
reading of this value from zebin.
Related-To: NEO-12235
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-08-07 14:48:58 +02:00
Kamil Kopryk
38a194eee6
fix: scratch address from implicit args in ocl
...
Related-To: NEO-12237
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-08-07 09:40:27 +02:00