766 Commits

Author SHA1 Message Date
Semenov Herman (Семенов Герман)
9f07f56f7f performance: align structures for 64-bit platforms
Signed-off-by: Semenov Herman (Семенов Герман) <GermanAizek@yandex.ru>
2025-01-09 06:03:39 +01:00
Compute-Runtime-Validation
58707e0cf7 Revert "refactor: migration to std::make_unique C++17"
This reverts commit 4cf685cb7d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-09 01:41:36 +01:00
Semenov Herman (Семенов Герман)
4cf685cb7d refactor: migration to std::make_unique C++17
Signed-off-by: Semenov Herman (Семенов Герман) <GermanAizek@yandex.ru>
2025-01-08 17:20:00 +01:00
Szymon Morek
bd5ca4fed5 fix: flush ocl queue after staging transfer
This is required for scenarios without direct submission.
Otherwise, staging transfers are enqueued but not actually
flushed. If that happens, subsequent staging transfers can't
reuse previous staging buffers. It results in increased memory usage.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-08 14:02:33 +01:00
Szymon Morek
2815d4167d fix: don't use staging for map allocations
Related-To: NEO-13572

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-07 13:30:11 +01:00
Szymon Morek
c5d541dd81 fix: initialize pointers to events
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-12-18 16:25:56 +01:00
Lukasz Jobczyk
093d987e33 performance: Enable timestamp wait for queues on Xe2
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-12-13 09:48:12 +01:00
Szymon Morek
6c4eb322b1 performance: introduce staging reads from image
Related-To: NEO-12968

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-12-11 14:43:45 +01:00
Filip Hazubski
43c199185a fix: Add explicit nullptr checks
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-12-10 10:06:56 +01:00
Robert Krzemień
d8a4f92856 fix: don't include perf counters in timestamp profiling
Resolves: GSD-10387
Signed-off-by: Robert Krzemień <robert.krzemien@intel.com>
2024-12-02 12:54:06 +01:00
Zbigniew Zdanowicz
c5ed6bf73c refactor: split sync buffer and region allocation creation code
- split the allocation code from command list or kernel
- allow to call allocation code in all parts of the driver

Related-To: NEO-13350

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-29 11:41:39 +01:00
Vysochyn, Illia
89c3aab321 refactor: Extract PostSyncType to outer abstract layer
This modification serves to simplify the integration of generated
compute walker structures.

Related-To: NEO-13147

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2024-11-28 18:09:21 +01:00
Mateusz Jablonski
fa58073095 refactor: remove not used usings/typedefs/variables
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-28 16:19:39 +01:00
Szymon Morek
53da277bc1 fix: assign CSR once for staging image write
Related-To: NEO-13330

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-28 11:25:38 +01:00
Zbigniew Zdanowicz
6b7235cd6c refactor: change parameter names into more meaningful
- change additional size into local region size
- change walk order into dispatch walk order to distinguish for local id walk

Related-To: NEO-13350

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-27 16:15:02 +01:00
Bartosz Dunajski
1ddcb92f1d refactor: pass device to encoder functions
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-11-25 16:25:04 +01:00
Lukasz Jobczyk
7832b115a4 fix: Select csr once for staging buffer memcpy
Resolves: NEO-13083

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-11-22 15:42:09 +01:00
Damian Tomczak
d07d54b8f0 Revert "refactor: stateless enqueueCopyBufferRect"
This reverts commit d8f0b1c17d.

Related-To: NEO-12964

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-11-13 01:33:22 +01:00
Szymon Morek
a25e973205 performance: write image through staging chunks
Related-To: NEO-12968

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-06 16:45:22 +01:00
Zbigniew Zdanowicz
89b40b7ec4 refactor: change additional walker fields encoder 6/n
- change order of members of additional walker struct

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-05 08:28:57 +01:00
Zbigniew Zdanowicz
2e2b7a473a refactor: change additional walker fields encoder 5/n
- move compute dispatch all walker into dedicated encoder
- group same implementations into single file

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-04 13:11:59 +01:00
Zbigniew Zdanowicz
32fd00e150 refactor: change additional walker fields encoder 4/n
- move post sync system fence into dedicated encoder

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-31 14:41:55 +01:00
Szymon Morek
cf58be4142 performance: use staging buffer when writing to an image
Related-To: NEO-12968

Also, don't import usm/mapped allocations for image
operations

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-10-30 17:02:13 +01:00
Zbigniew Zdanowicz
6f4994c269 refactor: change additional walker fields encoder 1/n
- move encoding l3 prefetch field into dedicated function

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-29 21:37:38 +01:00
Oskar Hubert Weber
fd45b2883d fix: neo ULT build with optimizations disabled 1/n
Related-To: NEO-8116

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2024-10-28 15:38:28 +01:00
Kamil Kopryk
3891e887c1 feature: use heapless builtins for images
Related-To: NEO-12744
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-10-25 16:42:27 +02:00
Bartosz Dunajski
4d6265a46c refactor: pass additional enqueue params
Related-To: NEO-12591

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-10-21 17:55:41 +02:00
Damian Tomczak
e495b2238a refactor: isStateless on init
Related-to: NEO-12938

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-18 10:53:17 +02:00
Compute-Runtime-Validation
f195e977ff Revert "feature: stateless enqueueCopyBufferToImage"
This reverts commit 9dbace61eb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-15 08:54:24 +02:00
Damian Tomczak
bd500b5b85 feature: stateless enqueueCopyImageToBuffer
Related-to: NEO-11859

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-14 10:33:46 +02:00
Damian Tomczak
9dbace61eb feature: stateless enqueueCopyBufferToImage
Related-to: NEO-11859

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-14 10:21:02 +02:00
Damian Tomczak
d8f0b1c17d refactor: stateless enqueueCopyBufferRect
Related-to: NEO-11859

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-14 10:14:15 +02:00
Compute-Runtime-Validation
680e62d333 Revert "performance: Set dispatch all for small TG"
This reverts commit 0dc2870513.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-02 05:41:13 +02:00
Lukasz Jobczyk
0dc2870513 performance: Set dispatch all for small TG
Resolves: NEO-11814

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-01 09:28:55 +02:00
Radoslaw Jablonski
c53393152a fix: remove Systolic field from Xe2 Compute Walker
The field has been removed from Xe2.

Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com>
2024-09-24 13:40:03 +02:00
Mateusz Hoppe
4a068c8eab fix: correctly program StateBaseAddress in global bindless mode
- prepare bindful ssh when kernel requires ssh heap and
SurfaceStateBaseAddress
- remove lastAppendedKernelBindlesMode - local ssh heap may be needed
for bindless kernels with scratch or misaligned buffer args
- use ssh heap gpu address to program SurfaceStateBaseAddress, global base is
used for BindlessSurfaceState and DynamicState

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-20 11:57:05 +02:00
Mateusz Jablonski
8e7959b243 refactor: remove not needed code
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-16 14:55:55 +02:00
Mateusz Jablonski
ab090b515c refactor: remove not needed code
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-16 14:20:35 +02:00
Maciej Bielski
5bdc4a55cc fix: avoid nullptr dereference
Related-To: NEO-12651

Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-09-13 19:59:04 +02:00
Zbigniew Zdanowicz
8bb92ff445 performance: avoid reading from gfx memory when modifying walker command
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-09-11 18:58:33 +02:00
Bartosz Dunajski
4f1262645b refactor: pass extra walker params
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-10 15:30:03 +02:00
Mateusz Hoppe
9d5f8eb587 refactor: save hpCopyEngine in Device
- remove loop iteration to get hpCopyEngine

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-06 17:56:53 +02:00
Andrzej Koska
b0e7a11e9a refactor: Improving information transfer about the copy engine
Related-To: NEO-11934

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-09-05 16:11:52 +02:00
Bartosz Dunajski
847180dc9a refactor: add debug prints for timestamp waits
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-30 19:55:15 +02:00
Dominik Dabek
4552960e1f performance(ocl): limit flushes on resolve via pc
When resolving dependencies via pipecontrol skip hdc and untyped data
port cache flushes. Leave only command streamer stall.

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-08-29 17:19:19 +02:00
Bartosz Dunajski
696b02bfd3 fix: improve TBX downloading after L0 Event sync
Related-To: HSD-18038498579

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-23 10:42:17 +02:00
Compute-Runtime-Validation
9b652f4a34 Revert "feature: Improving information transfer about the copy engine"
This reverts commit 17ffdff4f1.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-15 22:06:31 +02:00
Andrzej Koska
17ffdff4f1 feature: Improving information transfer about the copy engine
Related-To: NEO-11934

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-08-14 11:28:29 +02:00
Bartosz Dunajski
d76ac1d1de fix: scratch controller residency
Related-To: HSD-18039519400

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-09 14:47:56 +02:00
Fabian Zwoliński
674c4a15ad fix: use correct gpu address when bindless heaps helper is enabled
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-05 15:09:57 +02:00