Commit Graph

785 Commits

Author SHA1 Message Date
Szymon Morek
8152241693 fix: don't override user data during staging read
Related-To: NEO-13574

User can read image with row pitch larger than region width.
In such cases, don't override memory which is out of region
but within row pitch.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-12 16:22:48 +01:00
Szymon Morek
b11322332c performance: enable staging write for cl buffers
Related-To: NEO-13529

Also, add size threshold on iGPU on Linux,
and disable staging if imported host ptr could
be reused

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-05 09:41:17 +01:00
Mateusz Jablonski
0a068ce96a fix: add pipe control before scratch register write
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-03 18:13:18 +01:00
Michał Pryba
75bc74089b refactor: adjust file names after pre-gen12 removal 2/3
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-02-03 15:31:51 +01:00
Szymon Morek
d7c8b42d01 Revert "performance: enable staging write for cl buffers"
This reverts commit a9a2f7577f.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-24 23:41:12 +01:00
Bartosz Dunajski
c75bcba1e6 fix: ensure profiling enabled for OCL Blit Split path
Related-To: NEO-13842

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-01-23 13:59:15 +01:00
Michał Pryba
53831f61df refactor: adjust file names after pre-gen12 removal 1/3
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-01-22 22:47:24 +01:00
Bartosz Dunajski
c2dbdb6797 refactor: move blit post sync data to BlitProperties
Related-To: NEO-13003

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-01-22 18:35:21 +01:00
Damian Tomczak
4af92c20e4 feature: enable heapless for enqueueCopyBufferToImage
Related-to: NEO-12737

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-01-22 12:06:16 +01:00
Kamil Kopryk
47fc1ce34b refactor: use designated initializer for EncodeWalkerArgs
also make EncodeWalkerArgs an aggregate

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-01-22 11:44:57 +01:00
Zbigniew Zdanowicz
1e89058232 refactor: change data fields for encoder of additional walker fields
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-01-21 16:49:38 +01:00
Zbigniew Zdanowicz
897c890d03 refactor: add thread group count parameter to implicit scaling functions
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-01-20 13:12:45 +01:00
Lukasz Jobczyk
af0bb7964a performance: Set dispatch all for small TG
Resolves: NEO-11814

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-01-20 08:49:59 +01:00
Zbigniew Zdanowicz
446f9c1964 refactor: change interfaces of walker region encoder
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-01-17 19:18:24 +01:00
Szymon Morek
ac78c07899 refactor: unify duplicated code for staging images
Pass command type instead of maintaining
two methods for reads and writes

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-17 09:52:35 +01:00
Szymon Morek
a9a2f7577f performance: enable staging write for cl buffers
Related-To: NEO-13529

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-16 12:04:28 +01:00
Krzysztof Sprzaczkowski
a17745532c performance: Move preemption allocation init to the first submission
Related-To: NEO-12323
Signed-off-by: Krzysztof Sprzaczkowski <krzysztof.sprzaczkowski@intel.com>
2025-01-15 20:22:50 +01:00
Szymon Morek
0b4fe9a0df performance: add staging transfers for cl buffers
Related-To: NEO-13529

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-15 12:24:43 +01:00
Compute-Runtime-Validation
af031ee0e3 Revert "performance: align structures for 64-bit platforms"
This reverts commit 9f07f56f7f.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-15 09:02:01 +01:00
Semenov Herman (Семенов Герман)
9f07f56f7f performance: align structures for 64-bit platforms
Signed-off-by: Semenov Herman (Семенов Герман) <GermanAizek@yandex.ru>
2025-01-09 06:03:39 +01:00
Compute-Runtime-Validation
58707e0cf7 Revert "refactor: migration to std::make_unique C++17"
This reverts commit 4cf685cb7d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-09 01:41:36 +01:00
Semenov Herman (Семенов Герман)
4cf685cb7d refactor: migration to std::make_unique C++17
Signed-off-by: Semenov Herman (Семенов Герман) <GermanAizek@yandex.ru>
2025-01-08 17:20:00 +01:00
Szymon Morek
bd5ca4fed5 fix: flush ocl queue after staging transfer
This is required for scenarios without direct submission
Otherwise, staging transfers are enqueued but not actually
flushed. If that happens, subsequent staging transfers can't
reuse previous staging buffers. It results in increased memory usage.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-08 14:02:33 +01:00
Szymon Morek
2815d4167d fix: don't use staging for map allocations
Related-To: NEO-13572

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-07 13:30:11 +01:00
Szymon Morek
c5d541dd81 fix: initialize pointers to events
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-12-18 16:25:56 +01:00
Lukasz Jobczyk
093d987e33 performance: Enable timestamp wait for queues on Xe2
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-12-13 09:48:12 +01:00
Szymon Morek
6c4eb322b1 performance: introduce staging reads from image
Related-To: NEO-12968

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-12-11 14:43:45 +01:00
Filip Hazubski
43c199185a fix: Add explicit nullptr checks
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-12-10 10:06:56 +01:00
Robert Krzemień
d8a4f92856 fix: don't include perf counters in timestamp profiling
Resolves: GSD-10387
Signed-off-by: Robert Krzemień <robert.krzemien@intel.com>
2024-12-02 12:54:06 +01:00
Zbigniew Zdanowicz
c5ed6bf73c refactor: split sync buffer and region allocation creation code
- split the allocation code from command list or kernel
- allow to call allocation code in all parts of the driver

Related-To: NEO-13350

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-29 11:41:39 +01:00
Vysochyn, Illia
89c3aab321 refactor: Extract PostSyncType to outer abstract layer
This modification serves to simplify the integration of generated
compute walker structures.

Related-To: NEO-13147

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2024-11-28 18:09:21 +01:00
Mateusz Jablonski
fa58073095 refactor: remove not used usings/typedefs/variables
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-28 16:19:39 +01:00
Szymon Morek
53da277bc1 fix: assign CSR once for staging image write
Related-To: NEO-13330

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-28 11:25:38 +01:00
Zbigniew Zdanowicz
6b7235cd6c refactor: change parameter names into more meaningful
- change additional size into local region size
- change walk order into dispatch walk order to distinguish for local id walk

Related-To: NEO-13350

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-27 16:15:02 +01:00
Bartosz Dunajski
1ddcb92f1d refactor: pass device to encoder functions
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-11-25 16:25:04 +01:00
Lukasz Jobczyk
7832b115a4 fix: Select csr once for staging buffer memcpy
Resolves: NEO-13083

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-11-22 15:42:09 +01:00
Damian Tomczak
d07d54b8f0 Revert "refactor: stateless enqueueCopyBufferRect"
This reverts commit d8f0b1c17d.

Related-To: NEO-12964

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-11-13 01:33:22 +01:00
Szymon Morek
a25e973205 performance: write image through staging chunks
Related-To: NEO-12968

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-06 16:45:22 +01:00
Zbigniew Zdanowicz
89b40b7ec4 refactor: change additional walker fields encoder 6/n
- change order of members of additional walker struct

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-05 08:28:57 +01:00
Zbigniew Zdanowicz
2e2b7a473a refactor: change additional walker fields encoder 5/n
- move compute dispatch all walker into dedicated encoder
- group same implementations into single file

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-04 13:11:59 +01:00
Zbigniew Zdanowicz
32fd00e150 refactor: change additional walker fields encoder 4/n
- move post sync system fence into dedicated encoder

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-31 14:41:55 +01:00
Szymon Morek
cf58be4142 performance: use staging buffer when writing to an image
Related-To: NEO-12968

Also, don't import usm/mapped allocations for image
operations

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-10-30 17:02:13 +01:00
Zbigniew Zdanowicz
6f4994c269 refactor: change additional walker fields encoder 1/n
- move encoding l3 prefetch field into dedicated function

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-29 21:37:38 +01:00
Oskar Hubert Weber
fd45b2883d fix: neo ULT build with optimizations disabled 1/n
Related-To: NEO-8116

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2024-10-28 15:38:28 +01:00
Kamil Kopryk
3891e887c1 feature: use heapless builtins for images
Related-To: NEO-12744
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-10-25 16:42:27 +02:00
Bartosz Dunajski
4d6265a46c refactor: pass additional enqueue params
Related-To: NEO-12591

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-10-21 17:55:41 +02:00
Damian Tomczak
e495b2238a refactor: isStateless on init
Related-to: NEO-12938

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-18 10:53:17 +02:00
Compute-Runtime-Validation
f195e977ff Revert "feature: stateless enqueueCopyBufferToImage"
This reverts commit 9dbace61eb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-15 08:54:24 +02:00
Damian Tomczak
bd500b5b85 feature: stateless enqueueCopyImageToBuffer
Related-to: NEO-11859

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-14 10:33:46 +02:00
Damian Tomczak
9dbace61eb feature: stateless enqueueCopyBufferToImage
Related-to: NEO-11859

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-14 10:21:02 +02:00