Commit Graph

2511 Commits

Author SHA1 Message Date
Szymon Morek
53da277bc1 fix: assign CSR once for staging image write
Related-To: NEO-13330

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-28 11:25:38 +01:00
Szymon Morek
81ba52aac4 performance: stop ULLS for BCS during migration
Related-To: NEO-13340

When regular copy CSR has enabled direct submission,
stop it before migration on internal CSR.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-27 20:06:50 +01:00
Zbigniew Zdanowicz
6b7235cd6c refactor: change parameter names into more meaningful
- change additional size into local region size
- change walk order into dispatch walk order to distinguish for local id walk

Related-To: NEO-13350

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-27 16:15:02 +01:00
Bartosz Dunajski
1ddcb92f1d refactor: pass device to encoder functions
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-11-25 16:25:04 +01:00
Lukasz Jobczyk
7832b115a4 fix: Select csr once for staging buffer memcpy
Resolves: NEO-13083

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-11-22 15:42:09 +01:00
Wenbin Lu
47383d752f fix: report correct number of XeCore per Cluster
Related-To: NEO-10060

Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-11-22 09:40:28 +01:00
Jaroslaw Warchulski
723e1e7d29 fix: expose tiles as devices in OCL with combined hierarchy
Related-To: GSD-8708
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-11-21 15:23:06 +01:00
Maciej Plewka
46c345789d refactor: move depth limitation from release helper to image_hw
Related-To: NEO-8390, HSD-16021488507
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-11-21 11:24:06 +01:00
Jaroslaw Warchulski
051cada78b refactor: add patchtokens fallback AIL
Related-To: HSD-14023878700

Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-11-20 12:58:22 +01:00
Chodor, Jaroslaw
3a0c2005d4 refactor: Cleanup around options files
* Adding options*.inl to sources list
* Improving robustness of missing IGC library log

Related-To: NEO-12747

Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2024-11-17 17:13:03 +01:00
Damian Tomczak
d07d54b8f0 Revert "refactor: stateless enqueueCopyBufferRect"
This reverts commit d8f0b1c17d.

Related-To: NEO-12964

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-11-13 01:33:22 +01:00
Mateusz Jablonski
1350441a72 build: fix multiple definitions issue
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-12 15:54:12 +01:00
Dominik Dabek
0a12817664 performance: flag, force zero copy for host ptr
When debug flag ForceZeroCopyForUseHostPtr is set, add
CL_MEM_FORCE_HOST_MEMORY_INTEL flag to buffers created with
CL_MEM_USE_HOST_PTR.
This makes the buffers use zero copy.

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-11-12 15:43:17 +01:00
Szymon Morek
a25e973205 performance: write image through staging chunks
Related-To: NEO-12968

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-06 16:45:22 +01:00
Maciej Plewka
df7e5191ed fix: check if tracing_exit was called before return from api call
Related-To: GSD-9415, NEO-12719
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-11-06 11:31:31 +01:00
Maciej Plewka
75595e687e fix: add missing tracing_exit before return in api.cpp
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>

Related-To: GSD-9415, NEO-12719
2024-11-05 11:59:05 +01:00
Zbigniew Zdanowicz
89b40b7ec4 refactor: change additional walker fields encoder 6/n
- change order of members of additional walker struct

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-05 08:28:57 +01:00
Zbigniew Zdanowicz
2e2b7a473a refactor: change additional walker fields encoder 5/n
- move compute dispatch all walker into dedicated encoder
- group same implementations into single file

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-04 13:11:59 +01:00
Zbigniew Zdanowicz
32fd00e150 refactor: change additional walker fields encoder 4/n
- move post sync system fence into dedicated encoder

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-31 14:41:55 +01:00
Lukasz Jobczyk
b050a83242 performance: Use lock pointer copy for dc flush mitigation
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-31 10:53:39 +01:00
Compute-Runtime-Validation
3fcb9b18ee Revert "performance: Use lock pointer copy for dc flush mitigation"
This reverts commit b8be102455.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-31 05:15:07 +01:00
Szymon Morek
cf58be4142 performance: use staging buffer when writing to an image
Related-To: NEO-12968

Also, don't import usm/mapped allocations for image
operations

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-10-30 17:02:13 +01:00
Lukasz Jobczyk
b8be102455 performance: Use lock pointer copy for dc flush mitigation
Resolves: NEO-12898

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-29 21:37:53 +01:00
Zbigniew Zdanowicz
6f4994c269 refactor: change additional walker fields encoder 1/n
- move encoding l3 prefetch field into dedicated function

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-29 21:37:38 +01:00
Oskar Hubert Weber
fd45b2883d fix: neo ULT build with optimizations disabled 1/n
Related-To: NEO-8116

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2024-10-28 15:38:28 +01:00
Compute-Runtime-Validation
f5c433c8f8 Revert "performance: Use lock pointer copy with sfence for dc flush mitigation"
This reverts commit 8c3c703ec0.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-26 14:28:26 +02:00
Kamil Kopryk
3891e887c1 feature: use heapless builtins for images
Related-To: NEO-12744
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-10-25 16:42:27 +02:00
Zbigniew Zdanowicz
bbdf1ac7b6 refactor: change encoder for thread group over dispatch 1/n
- change method name to more meaningful
- add all inputs of the algorithm as explicit function arguments
- position all implementations accordingly
- rename unit test names to fit new method name
- fix unit test to have correct initial command values
- fix unit test to have consistent input values with command values
- fix unit test to change input values together with command values

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-25 16:31:39 +02:00
Lukasz Jobczyk
8c3c703ec0 performance: Use lock pointer copy with sfence for dc flush mitigation
Resolves: NEO-12898

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-25 16:24:40 +02:00
Dominik Dabek
22a3ec948b fix: ulls controller sleep, windows
Request higher resolution for windows periodic timers for ulls
controller sleep.

Allows for controller thread to sleep with granularity of 1ms.

Related-To: NEO-10800

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-10-24 18:07:04 +02:00
Bartosz Dunajski
4d6265a46c refactor: pass additional enqueue params
Related-To: NEO-12591

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-10-21 17:55:41 +02:00
Damian Tomczak
e495b2238a refactor: isStateless on init
Related-to: NEO-12938

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-18 10:53:17 +02:00
Mateusz Jablonski
4154e6666b fix: correct loading L0 loader functions
- don't load ze_loader.dll from file system
- to perform self-open on Windows use getModuleHandleA with proper module name
- don't free library loaded with getModuleHandleA
- as loader may be not available during runtime teardown:
- load translate handle function during global setup
- load setDriverTeardown function during global teardown
- when loader is not available during teardown, unset translate handle
function

Related-To: GSD-10147

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-17 15:39:20 +02:00
Compute-Runtime-Validation
f117b5fb68 Revert "fix: correct loading L0 loader functions"
This reverts commit 9c7b3c5e19.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-17 08:20:19 +02:00
Mateusz Jablonski
9c7b3c5e19 fix: correct loading L0 loader functions
on Windows use getModuleHandleA with proper module name
don't load ze_loader.dll from file system

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-16 19:53:05 +02:00
Compute-Runtime-Validation
f195e977ff Revert "feature: stateless enqueueCopyBufferToImage"
This reverts commit 9dbace61eb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-15 08:54:24 +02:00
Damian Tomczak
bd500b5b85 feature: stateless enqueueCopyImageToBuffer
Related-to: NEO-11859

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-14 10:33:46 +02:00
Damian Tomczak
9dbace61eb feature: stateless enqueueCopyBufferToImage
Related-to: NEO-11859

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-14 10:21:02 +02:00
Damian Tomczak
d8f0b1c17d refactor: stateless enqueueCopyBufferRect
Related-to: NEO-11859

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-14 10:14:15 +02:00
Damian Tomczak
0c8af82d1b fix: stateless suffix for stateless kernels
Resolves: NEO-12847

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-11 17:40:10 +02:00
Wojciech Konior
6b40f9bc5a refactor: engineInstancedType removed
Related-To: NEO-12594

Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2024-10-09 16:30:48 +02:00
Zbigniew Zdanowicz
49371cb13e refactor: unify programming of preferred slm size 1/n
- rename function to reflect actual task function does

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-03 18:52:01 +02:00
Compute-Runtime-Validation
680e62d333 Revert "performance: Set dispatch all for small TG"
This reverts commit 0dc2870513.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-02 05:41:13 +02:00
Szymon Morek
b2fd1972a4 fix: add cpu alloc to eviction list only once
Related-To: NEO-12572

Also, before migration to GPU domain, remove it from this list

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-10-01 11:47:32 +02:00
Lukasz Jobczyk
0dc2870513 performance: Set dispatch all for small TG
Resolves: NEO-11814

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-01 09:28:55 +02:00
Maciej Plewka
33a346838e fix: return error when flush in waitForEvents return error
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-09-27 09:17:07 +02:00
Maciej Plewka
1dafb66463 fix: return success from release calls after platform teardown
Related-To: NEO-11282
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-09-26 19:30:26 +02:00
Radoslaw Jablonski
c53393152a fix: remove Systolic field from Xe2 Compute Walker
The field has been removed from Xe2.

Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com>
2024-09-24 13:40:03 +02:00
Maciej Plewka
80f75ceace fix: submit dummy exec to pin memory during zeContextMakeMemoryResident call
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-09-23 14:43:59 +02:00
Mateusz Hoppe
4a068c8eab fix: correclty program StateBaseAddress in global bindless mode
- prepare bindful ssh when kernel requires ssh heap and
SurfaceStateBaseAddress
- remove lastAppendedKernelBindlesMode - local ssh heap may be needed
for bindless kernels with scratch or misaligned buffer args
- use ssh heap gpu address to program SurfaceStateBaseAddress, global base is
used for BindlessSurfaceState and DynamicState

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-20 11:57:05 +02:00