Szymon Morek
8152241693
fix: don't override user data during staging read
...
Related-To: NEO-13574
User can read image with row pitch larger than region width.
In such cases, don't override memory which is out of region
but within row pitch.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-02-12 16:22:48 +01:00
Szymon Morek
b11322332c
performance: enable staging write for cl buffers
...
Related-To: NEO-13529
Also, add size threshold on iGPU on Linux,
and disable staging if imported host ptr could
be reused
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-02-05 09:41:17 +01:00
Mateusz Jablonski
0a068ce96a
fix: add pipe control before scratch register write
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-02-03 18:13:18 +01:00
Michał Pryba
75bc74089b
refactor: adjust file names after pre-gen12 removal 2/3
...
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com >
2025-02-03 15:31:51 +01:00
Szymon Morek
d7c8b42d01
Revert "performance: enable staging write for cl buffers"
...
This reverts commit a9a2f7577f .
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-24 23:41:12 +01:00
Bartosz Dunajski
c75bcba1e6
fix: ensure profiling enabled for OCL Blit Split path
...
Related-To: NEO-13842
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-01-23 13:59:15 +01:00
Michał Pryba
53831f61df
refactor: adjust file names after pre-gen12 removal 1/3
...
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com >
2025-01-22 22:47:24 +01:00
Bartosz Dunajski
c2dbdb6797
refactor: move blit post sync data to BlitProperties
...
Related-To: NEO-13003
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-01-22 18:35:21 +01:00
Damian Tomczak
4af92c20e4
feature: enable heapless for enqueueCopyBufferToImage
...
Related-to: NEO-12737
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2025-01-22 12:06:16 +01:00
Kamil Kopryk
47fc1ce34b
refactor: use designated initializer for EncodeWalkerArgs
...
also make EncodeWalkerArgs an aggregate
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-01-22 11:44:57 +01:00
Zbigniew Zdanowicz
1e89058232
refactor: change data fields for encoder of additional walker fields
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-01-21 16:49:38 +01:00
Zbigniew Zdanowicz
897c890d03
refactor: add thread group count parameter to implicit scaling functions
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-01-20 13:12:45 +01:00
Lukasz Jobczyk
af0bb7964a
performance: Set dispatch all for small TG
...
Resolves: NEO-11814
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-01-20 08:49:59 +01:00
Zbigniew Zdanowicz
446f9c1964
refactor: change interfaces of walker region encoder
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-01-17 19:18:24 +01:00
Szymon Morek
ac78c07899
refactor: unify duplicated code for staging images
...
Pass command type instead of maintaining
two methods for reads and writes
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-17 09:52:35 +01:00
Szymon Morek
a9a2f7577f
performance: enable staging write for cl buffers
...
Related-To: NEO-13529
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-16 12:04:28 +01:00
Krzysztof Sprzaczkowski
a17745532c
performance: Move preemption allocation init to the first submission
...
Related-To: NEO-12323
Signed-off-by: Krzysztof Sprzaczkowski <krzysztof.sprzaczkowski@intel.com >
2025-01-15 20:22:50 +01:00
Szymon Morek
0b4fe9a0df
performance: add staging transfers for cl buffers
...
Related-To: NEO-13529
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-15 12:24:43 +01:00
Compute-Runtime-Validation
af031ee0e3
Revert "performance: align structures for 64-bit platforms"
...
This reverts commit 9f07f56f7f .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-01-15 09:02:01 +01:00
Semenov Herman (Семенов Герман)
9f07f56f7f
performance: align structures for 64-bit platforms
...
Signed-off-by: Semenov Herman (Семенов Герман) <GermanAizek@yandex.ru >
2025-01-09 06:03:39 +01:00
Compute-Runtime-Validation
58707e0cf7
Revert "refactor: migration to std::make_unique C++17"
...
This reverts commit 4cf685cb7d .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-01-09 01:41:36 +01:00
Semenov Herman (Семенов Герман)
4cf685cb7d
refactor: migration to std::make_unique C++17
...
Signed-off-by: Semenov Herman (Семенов Герман) <GermanAizek@yandex.ru >
2025-01-08 17:20:00 +01:00
Szymon Morek
bd5ca4fed5
fix: flush ocl queue after staging transfer
...
This is required for scenarios without direct submission
Otherwise, staging transfers are enqueued but not actually
flushed. If that happens, subsequent staging transfers can't
reuse previous staging buffers. It results in increased memory usage.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-08 14:02:33 +01:00
Szymon Morek
2815d4167d
fix: don't use staging for map allocations
...
Related-To: NEO-13572
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-07 13:30:11 +01:00
Szymon Morek
c5d541dd81
fix: initialize pointers to events
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-12-18 16:25:56 +01:00
Lukasz Jobczyk
093d987e33
performance: Enable timestamp wait for queues on Xe2
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-12-13 09:48:12 +01:00
Szymon Morek
6c4eb322b1
performance: introduce staging reads from image
...
Related-To: NEO-12968
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-12-11 14:43:45 +01:00
Filip Hazubski
43c199185a
fix: Add explicit nullptr checks
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2024-12-10 10:06:56 +01:00
Robert Krzemień
d8a4f92856
fix: don't include perf counters in timestamp profiling
...
Resolves: GSD-10387
Signed-off-by: Robert Krzemień <robert.krzemien@intel.com >
2024-12-02 12:54:06 +01:00
Zbigniew Zdanowicz
c5ed6bf73c
refactor: split sync buffer and region allocation creation code
...
- split the allocation code from command list or kernel
- allow to call allocation code in all parts of the driver
Related-To: NEO-13350
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-11-29 11:41:39 +01:00
Vysochyn, Illia
89c3aab321
refactor: Extract PostSyncType to outer abstract layer
...
This modification serves to simplify the integration of generated
compute walker structures.
Related-To: NEO-13147
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com >
2024-11-28 18:09:21 +01:00
Mateusz Jablonski
fa58073095
refactor: remove not used usings/typedefs/variables
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-11-28 16:19:39 +01:00
Szymon Morek
53da277bc1
fix: assign CSR once for staging image write
...
Related-To: NEO-13330
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-11-28 11:25:38 +01:00
Zbigniew Zdanowicz
6b7235cd6c
refactor: change parameter names into more meaningful
...
- change additional size into local region size
- change walk order into dispatch walk order to distinguish for local id walk
Related-To: NEO-13350
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-11-27 16:15:02 +01:00
Bartosz Dunajski
1ddcb92f1d
refactor: pass device to encoder functions
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-11-25 16:25:04 +01:00
Lukasz Jobczyk
7832b115a4
fix: Select csr once for staging buffer memcpy
...
Resolves: NEO-13083
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-11-22 15:42:09 +01:00
Damian Tomczak
d07d54b8f0
Revert "refactor: stateless enqueueCopyBufferRect"
...
This reverts commit d8f0b1c17d .
Related-To: NEO-12964
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2024-11-13 01:33:22 +01:00
Szymon Morek
a25e973205
performance: write image through staging chunks
...
Related-To: NEO-12968
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-11-06 16:45:22 +01:00
Zbigniew Zdanowicz
89b40b7ec4
refactor: change additional walker fields encoder 6/n
...
- change order of members of additional walker struct
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-11-05 08:28:57 +01:00
Zbigniew Zdanowicz
2e2b7a473a
refactor: change additional walker fields encoder 5/n
...
- move compute dispatch all walker into dedicated encoder
- group same implementations into single file
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-11-04 13:11:59 +01:00
Zbigniew Zdanowicz
32fd00e150
refactor: change additional walker fields encoder 4/n
...
- move post sync system fence into dedicated encoder
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-10-31 14:41:55 +01:00
Szymon Morek
cf58be4142
performance: use staging buffer when writing to an image
...
Related-To: NEO-12968
Also, don't import usm/mapped allocations for image
operations
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-10-30 17:02:13 +01:00
Zbigniew Zdanowicz
6f4994c269
refactor: change additional walker fields encoder 1/n
...
- move encoding l3 prefetch field into dedicated function
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-10-29 21:37:38 +01:00
Oskar Hubert Weber
fd45b2883d
fix: neo ULT build with optimizations disabled 1/n
...
Related-To: NEO-8116
Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com >
2024-10-28 15:38:28 +01:00
Kamil Kopryk
3891e887c1
feature: use heapless builtins for images
...
Related-To: NEO-12744
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-10-25 16:42:27 +02:00
Bartosz Dunajski
4d6265a46c
refactor: pass additional enqueue params
...
Related-To: NEO-12591
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-10-21 17:55:41 +02:00
Damian Tomczak
e495b2238a
refactor: isStateless on init
...
Related-to: NEO-12938
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2024-10-18 10:53:17 +02:00
Compute-Runtime-Validation
f195e977ff
Revert "feature: stateless enqueueCopyBufferToImage"
...
This reverts commit 9dbace61eb .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-10-15 08:54:24 +02:00
Damian Tomczak
bd500b5b85
feature: stateless enqueueCopyImageToBuffer
...
Related-to: NEO-11859
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2024-10-14 10:33:46 +02:00
Damian Tomczak
9dbace61eb
feature: stateless enqueueCopyBufferToImage
...
Related-to: NEO-11859
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2024-10-14 10:21:02 +02:00