Compute-Runtime-Validation
58707e0cf7
Revert "refactor: migration to std::make_unique C++17"
...
This reverts commit 4cf685cb7d .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-01-09 01:41:36 +01:00
Semenov Herman (Семенов Герман)
4cf685cb7d
refactor: migration to std::make_unique C++17
...
Signed-off-by: Semenov Herman (Семенов Герман) <GermanAizek@yandex.ru >
2025-01-08 17:20:00 +01:00
Szymon Morek
bd5ca4fed5
fix: flush ocl queue after staging transfer
...
This is required for scenarios without direct submission
Otherwise, staging transfers are enqueued but not actually
flushed. If that happens, subsequent staging transfers can't
reuse previous staging buffers. It results in increased memory usage.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-08 14:02:33 +01:00
Szymon Morek
2815d4167d
fix: don't use staging for map allocations
...
Related-To: NEO-13572
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-07 13:30:11 +01:00
Szymon Morek
c5d541dd81
fix: initialize pointers to events
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-12-18 16:25:56 +01:00
Lukasz Jobczyk
093d987e33
performance: Enable timestamp wait for queues on Xe2
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-12-13 09:48:12 +01:00
Szymon Morek
6c4eb322b1
performance: introduce staging reads from image
...
Related-To: NEO-12968
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-12-11 14:43:45 +01:00
Filip Hazubski
43c199185a
fix: Add explicit nullptr checks
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2024-12-10 10:06:56 +01:00
Robert Krzemień
d8a4f92856
fix: don't include perf counters in timestamp profiling
...
Resolves: GSD-10387
Signed-off-by: Robert Krzemień <robert.krzemien@intel.com >
2024-12-02 12:54:06 +01:00
Zbigniew Zdanowicz
c5ed6bf73c
refactor: split sync buffer and region allocation creation code
...
- split the allocation code from command list or kernel
- allow to call allocation code in all parts of the driver
Related-To: NEO-13350
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-11-29 11:41:39 +01:00
Vysochyn, Illia
89c3aab321
refactor: Extract PostSyncType to outer abstract layer
...
This modification serves to simplify the integration of generated
compute walker structures.
Related-To: NEO-13147
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com >
2024-11-28 18:09:21 +01:00
Mateusz Jablonski
fa58073095
refactor: remove not used usings/typedefs/variables
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-11-28 16:19:39 +01:00
Szymon Morek
53da277bc1
fix: assign CSR once for staging image write
...
Related-To: NEO-13330
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-11-28 11:25:38 +01:00
Zbigniew Zdanowicz
6b7235cd6c
refactor: change parameter names into more meaningful
...
- change additional size into local region size
- change walk order into dispatch walk order to distinguish for local id walk
Related-To: NEO-13350
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-11-27 16:15:02 +01:00
Bartosz Dunajski
1ddcb92f1d
refactor: pass device to encoder functions
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-11-25 16:25:04 +01:00
Lukasz Jobczyk
7832b115a4
fix: Select csr once for staging buffer memcpy
...
Resolves: NEO-13083
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-11-22 15:42:09 +01:00
Damian Tomczak
d07d54b8f0
Revert "refactor: stateless enqueueCopyBufferRect"
...
This reverts commit d8f0b1c17d .
Related-To: NEO-12964
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2024-11-13 01:33:22 +01:00
Szymon Morek
a25e973205
performance: write image through staging chunks
...
Related-To: NEO-12968
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-11-06 16:45:22 +01:00
Zbigniew Zdanowicz
89b40b7ec4
refactor: change additional walker fields encoder 6/n
...
- change order of members of additional walker struct
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-11-05 08:28:57 +01:00
Zbigniew Zdanowicz
2e2b7a473a
refactor: change additional walker fields encoder 5/n
...
- move compute dispatch all walker into dedicated encoder
- group same implementations into single file
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-11-04 13:11:59 +01:00
Zbigniew Zdanowicz
32fd00e150
refactor: change additional walker fields encoder 4/n
...
- move post sync system fence into dedicated encoder
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-10-31 14:41:55 +01:00
Szymon Morek
cf58be4142
performance: use staging buffer when writing to an image
...
Related-To: NEO-12968
Also, don't import usm/mapped allocations for image
operations
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-10-30 17:02:13 +01:00
Zbigniew Zdanowicz
6f4994c269
refactor: change additional walker fields encoder 1/n
...
- move encoding l3 prefetch field into dedicated function
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-10-29 21:37:38 +01:00
Oskar Hubert Weber
fd45b2883d
fix: neo ULT build with optimizations disabled 1/n
...
Related-To: NEO-8116
Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com >
2024-10-28 15:38:28 +01:00
Kamil Kopryk
3891e887c1
feature: use heapless builtins for images
...
Related-To: NEO-12744
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-10-25 16:42:27 +02:00
Bartosz Dunajski
4d6265a46c
refactor: pass additional enqueue params
...
Related-To: NEO-12591
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-10-21 17:55:41 +02:00
Damian Tomczak
e495b2238a
refactor: isStateless on init
...
Related-to: NEO-12938
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2024-10-18 10:53:17 +02:00
Compute-Runtime-Validation
f195e977ff
Revert "feature: stateless enqueueCopyBufferToImage"
...
This reverts commit 9dbace61eb .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-10-15 08:54:24 +02:00
Damian Tomczak
bd500b5b85
feature: stateless enqueueCopyImageToBuffer
...
Related-to: NEO-11859
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2024-10-14 10:33:46 +02:00
Damian Tomczak
9dbace61eb
feature: stateless enqueueCopyBufferToImage
...
Related-to: NEO-11859
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2024-10-14 10:21:02 +02:00
Damian Tomczak
d8f0b1c17d
refactor: stateless enqueueCopyBufferRect
...
Related-to: NEO-11859
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2024-10-14 10:14:15 +02:00
Compute-Runtime-Validation
680e62d333
Revert "performance: Set dispatch all for small TG"
...
This reverts commit 0dc2870513 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-10-02 05:41:13 +02:00
Lukasz Jobczyk
0dc2870513
performance: Set dispatch all for small TG
...
Resolves: NEO-11814
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-10-01 09:28:55 +02:00
Radoslaw Jablonski
c53393152a
fix: remove Systolic field from Xe2 Compute Walker
...
The field has been removed from Xe2.
Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com >
2024-09-24 13:40:03 +02:00
Mateusz Hoppe
4a068c8eab
fix: correclty program StateBaseAddress in global bindless mode
...
- prepare bindful ssh when kernel requires ssh heap and
SurfaceStateBaseAddress
- remove lastAppendedKernelBindlesMode - local ssh heap may be needed
for bindless kernels with scratch or misaligned buffer args
- use ssh heap gpu address to program SurfaceStateBaseAddress, global base is
used for BindlessSurfaceState and DynamicState
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2024-09-20 11:57:05 +02:00
Mateusz Jablonski
8e7959b243
refactor: remove not needed code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-09-16 14:55:55 +02:00
Mateusz Jablonski
ab090b515c
refactor: remove not needed code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-09-16 14:20:35 +02:00
Maciej Bielski
5bdc4a55cc
fix: avoid nullptr dereference
...
Related-To: NEO-12651
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2024-09-13 19:59:04 +02:00
Zbigniew Zdanowicz
8bb92ff445
performance: avoid reading from gfx memory when modifying walker command
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-09-11 18:58:33 +02:00
Bartosz Dunajski
4f1262645b
refactor: pass extra walker params
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-09-10 15:30:03 +02:00
Mateusz Hoppe
9d5f8eb587
refactor: save hpCopyEngine in Device
...
- remove loop iteration to get hpCopyEngine
Related-To: NEO-11983
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2024-09-06 17:56:53 +02:00
Andrzej Koska
b0e7a11e9a
refactor: Improving information transfer about the copy engine
...
Related-To: NEO-11934
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com >
2024-09-05 16:11:52 +02:00
Bartosz Dunajski
847180dc9a
refactor: add debug prints for timestamp waits
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-08-30 19:55:15 +02:00
Dominik Dabek
4552960e1f
performance(ocl): limit flushes on resolve via pc
...
When resolving dependencies via pipecontrol skip hdc and untyped data
port cache flushes. Leave only command streamer stall.
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-08-29 17:19:19 +02:00
Bartosz Dunajski
696b02bfd3
fix: improve TBX downloading after L0 Event sync
...
Related-To: HSD-18038498579
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-08-23 10:42:17 +02:00
Compute-Runtime-Validation
9b652f4a34
Revert "feature: Improving information transfer about the copy engine"
...
This reverts commit 17ffdff4f1 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-08-15 22:06:31 +02:00
Andrzej Koska
17ffdff4f1
feature: Improving information transfer about the copy engine
...
Related-To: NEO-11934
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com >
2024-08-14 11:28:29 +02:00
Bartosz Dunajski
d76ac1d1de
fix: scratch controller residency
...
Related-To: HSD-18039519400
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-08-09 14:47:56 +02:00
Fabian Zwoliński
674c4a15ad
fix: use correct gpu address when bindless heaps helper is enabled
...
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com >
2024-08-05 15:09:57 +02:00
Bartosz Dunajski
ec34656e0e
fix: debug flag to defer first device submission
...
Related-To: HSD-18039343751
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-08-02 09:31:25 +02:00