Lukasz Jobczyk
7832b115a4
fix: Select csr once for staging buffer memcpy
...
Resolves: NEO-13083
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-11-22 15:42:09 +01:00
Damian Tomczak
d07d54b8f0
Revert "refactor: stateless enqueueCopyBufferRect"
...
This reverts commit d8f0b1c17d
.
Related-To: NEO-12964
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-11-13 01:33:22 +01:00
Szymon Morek
a25e973205
performance: write image through staging chunks
...
Related-To: NEO-12968
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-06 16:45:22 +01:00
Zbigniew Zdanowicz
89b40b7ec4
refactor: change additional walker fields encoder 6/n
...
- change order of members of additional walker struct
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-05 08:28:57 +01:00
Zbigniew Zdanowicz
2e2b7a473a
refactor: change additional walker fields encoder 5/n
...
- move compute dispatch all walker into dedicated encoder
- group same implementations into single file
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-04 13:11:59 +01:00
Zbigniew Zdanowicz
32fd00e150
refactor: change additional walker fields encoder 4/n
...
- move post sync system fence into dedicated encoder
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-31 14:41:55 +01:00
Szymon Morek
cf58be4142
performance: use staging buffer when writing to an image
...
Related-To: NEO-12968
Also, don't import usm/mapped allocations for image
operations
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-10-30 17:02:13 +01:00
Zbigniew Zdanowicz
6f4994c269
refactor: change additional walker fields encoder 1/n
...
- move encoding l3 prefetch field into dedicated function
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-29 21:37:38 +01:00
Oskar Hubert Weber
fd45b2883d
fix: neo ULT build with optimizations disabled 1/n
...
Related-To: NEO-8116
Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2024-10-28 15:38:28 +01:00
Kamil Kopryk
3891e887c1
feature: use heapless builtins for images
...
Related-To: NEO-12744
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-10-25 16:42:27 +02:00
Bartosz Dunajski
4d6265a46c
refactor: pass additional enqueue params
...
Related-To: NEO-12591
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-10-21 17:55:41 +02:00
Damian Tomczak
e495b2238a
refactor: isStateless on init
...
Related-to: NEO-12938
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-18 10:53:17 +02:00
Compute-Runtime-Validation
f195e977ff
Revert "feature: stateless enqueueCopyBufferToImage"
...
This reverts commit 9dbace61eb
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-15 08:54:24 +02:00
Damian Tomczak
bd500b5b85
feature: stateless enqueueCopyImageToBuffer
...
Related-to: NEO-11859
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-14 10:33:46 +02:00
Damian Tomczak
9dbace61eb
feature: stateless enqueueCopyBufferToImage
...
Related-to: NEO-11859
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-14 10:21:02 +02:00
Damian Tomczak
d8f0b1c17d
refactor: stateless enqueueCopyBufferRect
...
Related-to: NEO-11859
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-14 10:14:15 +02:00
Compute-Runtime-Validation
680e62d333
Revert "performance: Set dispatch all for small TG"
...
This reverts commit 0dc2870513
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-02 05:41:13 +02:00
Lukasz Jobczyk
0dc2870513
performance: Set dispatch all for small TG
...
Resolves: NEO-11814
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-01 09:28:55 +02:00
Radoslaw Jablonski
c53393152a
fix: remove Systolic field from Xe2 Compute Walker
...
The field has been removed from Xe2.
Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com>
2024-09-24 13:40:03 +02:00
Mateusz Hoppe
4a068c8eab
fix: correclty program StateBaseAddress in global bindless mode
...
- prepare bindful ssh when kernel requires ssh heap and
SurfaceStateBaseAddress
- remove lastAppendedKernelBindlesMode - local ssh heap may be needed
for bindless kernels with scratch or misaligned buffer args
- use ssh heap gpu address to program SurfaceStateBaseAddress, global base is
used for BindlessSurfaceState and DynamicState
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-20 11:57:05 +02:00
Mateusz Jablonski
8e7959b243
refactor: remove not needed code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-16 14:55:55 +02:00
Mateusz Jablonski
ab090b515c
refactor: remove not needed code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-16 14:20:35 +02:00
Maciej Bielski
5bdc4a55cc
fix: avoid nullptr dereference
...
Related-To: NEO-12651
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-09-13 19:59:04 +02:00
Zbigniew Zdanowicz
8bb92ff445
performance: avoid reading from gfx memory when modifying walker command
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-09-11 18:58:33 +02:00
Bartosz Dunajski
4f1262645b
refactor: pass extra walker params
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-10 15:30:03 +02:00
Mateusz Hoppe
9d5f8eb587
refactor: save hpCopyEngine in Device
...
- remove loop iteration to get hpCopyEngine
Related-To: NEO-11983
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-06 17:56:53 +02:00
Andrzej Koska
b0e7a11e9a
refactor: Improving information transfer about the copy engine
...
Related-To: NEO-11934
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-09-05 16:11:52 +02:00
Bartosz Dunajski
847180dc9a
refactor: add debug prints for timestamp waits
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-30 19:55:15 +02:00
Dominik Dabek
4552960e1f
performance(ocl): limit flushes on resolve via pc
...
When resolving dependencies via pipecontrol skip hdc and untyped data
port cache flushes. Leave only command streamer stall.
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-08-29 17:19:19 +02:00
Bartosz Dunajski
696b02bfd3
fix: improve TBX downloading after L0 Event sync
...
Related-To: HSD-18038498579
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-23 10:42:17 +02:00
Compute-Runtime-Validation
9b652f4a34
Revert "feature: Improving information transfer about the copy engine"
...
This reverts commit 17ffdff4f1
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-15 22:06:31 +02:00
Andrzej Koska
17ffdff4f1
feature: Improving information transfer about the copy engine
...
Related-To: NEO-11934
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-08-14 11:28:29 +02:00
Bartosz Dunajski
d76ac1d1de
fix: scratch controller residency
...
Related-To: HSD-18039519400
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-09 14:47:56 +02:00
Fabian Zwoliński
674c4a15ad
fix: use correct gpu address when bindless heaps helper is enabled
...
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-05 15:09:57 +02:00
Bartosz Dunajski
ec34656e0e
fix: debug flag to defer first device submission
...
Related-To: HSD-18039343751
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-02 09:31:25 +02:00
Michal Mrozek
47009cec90
refactor: remove not needed code
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-31 17:07:56 +02:00
Zbigniew Zdanowicz
b33fe6ccf1
feature: adding flag to block dispatch implicit scaling commands
...
- this feature is part of making compute walker command view
- compute walker is programed for implicit scaling but not dispatched
- together with new flag, comes the refactor to reduce number of arguments
Related-To: NEO-11972
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-07-31 14:24:27 +02:00
Michal Mrozek
e668b4965c
performance: demote unrecoverable to debug_break
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-30 16:51:00 +02:00
Kamil Kopryk
65fcbff55c
refactor: Simplify code
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-07-29 14:26:51 +02:00
Szymon Morek
ace883ca55
performance: don't flush gpgpu if not required
...
Related-To: NEO-12124
If queue is OOQ and there are no cross-engine dependencies,
don't flush CCS before submitting copy on BCS.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-26 06:49:45 +02:00
Maciej Plewka
1cd00b5b89
fix: use per product cache line size to align heaps
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-07-24 17:29:20 +02:00
Szymon Morek
a7fbc90ebd
fix: re-enable staging buffer copy when ccs is busy
...
Related-To: NEO-11501
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-22 18:14:46 +02:00
Szymon Morek
39ec7facee
performance: use BCS for transfers if CCS is busy
...
Related-To: NEO-11501
Also, if device is iGPU, don't use staging buffers
in that case.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-22 15:36:26 +02:00
Mateusz Hoppe
b6299b8a21
feature: add support for HP copy engine context
...
- add support for contect group with HP copy engine
- choose HP copy engine when available
Related-To: NEO-11983
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-19 12:23:03 +02:00
Filip Hazubski
f8867ac3ac
fix: Minor code improvements
...
Add explicit pointer checks to CommandQueue::blitEnqueueAllowed.
Explicitly check result of getDeviceArgValueIdx during ocloc compile.
Explicitly remove unused StagingBufferManager functions.
Move chunkCopyFunc by reference.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-07-19 09:24:37 +02:00
Compute-Runtime-Validation
7ad15639fc
Revert "feature: add support for HP copy engine context"
...
This reverts commit 3fbcbcaef2
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-18 21:02:14 +02:00
Mateusz Hoppe
3fbcbcaef2
feature: add support for HP copy engine context
...
- add support for contect group with HP copy engine
- choose HP copy engine when available
Related-To: NEO-11983
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-18 16:07:07 +02:00
Dominik Dabek
fc9de71feb
fix(ocl): finish in release ogl object if needed
...
Finish cache flushes before exiting api call if releasing displayable
ogl object or dcflush is mitigated.
Related-To: NEO-11694
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-07-16 11:21:32 +02:00
Bartosz Dunajski
e188de2489
fix: initialize page tables before access for TSP allocation in TBX mode
...
Related-To: NEO-8340
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-11 16:03:19 +02:00
Dominik Dabek
bdeccab7aa
fix: bcs enqueue after marker properly waits
...
For an example sequence of:
IOQ_1 -> enqueue copy, enqueue marker with waitlist (out event)
IOQ_2 -> enqueue marker with waitlist (event), enqueue copy
Add missing synchronization between the enqueue copies
Related-To: NEO-11694
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-07-11 10:36:18 +02:00