Chodor, Jaroslaw
fcc2038def
refactor: modernizing NEO::Range->std::span
...
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-17 01:27:11 +02:00
Tomasz Biernacik
2c5cbec033
feature: control post sync completion check
...
Related-To: NEO-14844
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-07-15 13:51:19 +02:00
Kamil Kopryk
ecb411108a
fix: don't skip wait for task count - cpu read + L3FlushAfterPostSync
...
Related-To: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-07-01 12:09:43 +02:00
Kamil Kopryk
f0f89836e0
fix: l3 flush after post sync logic in OCL
...
Related-To: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-06-23 18:03:21 +02:00
Szymon Morek
ac7d936942
fix: invalidate texture cache for BCS image write
...
Related-To: NEO-15029
If BCS is writing new data to an image, texture cache
should be invalidated as it might contain stale data for
that image.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-06-13 13:28:08 +02:00
Lukasz Jobczyk
df2c776aab
refactor: Remove unused cmdq round robin engine assign
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-09 11:03:57 +02:00
Szymon Morek
91ff22f255
fix: correctly calculate host ptr size for images
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-04 17:54:42 +02:00
Szymon Morek
dacbce7f01
performance: introduce staging read for cl_buffer
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-11 08:19:28 +01:00
Compute-Runtime-Validation
571e0f2ba3
Revert "performance: enable staging read for cl_buffer"
...
This reverts commit 04b6e87869.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-07 20:30:54 +01:00
Szymon Morek
04b6e87869
performance: enable staging read for cl_buffer
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-07 18:56:06 +01:00
Filip Hazubski
6b2b42972a
fix: Add asserts to ensure NonCopyable and NonMovable 1/n
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 09:41:20 +01:00
Szymon Morek
d477efeb40
fix: disable staging when cpu copy is allowed
...
Related-To: NEO-14027
If buffer cpu copy is allowed, the whole
transfer can be done with a cpu memcpy.
In that case, disable staging, as it is less performant.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-14 14:05:06 +01:00
Szymon Morek
be2ac667fc
performance: enable staging for reads from image
...
Related-To: NEO-14026
Also, provide a performance hint if the ptr from
the user does not meet the restrictions.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-13 18:15:59 +01:00
Szymon Morek
ac78c07899
refactor: unify duplicated code for staging images
...
Pass command type instead of maintaining
two methods for reads and writes
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-17 09:52:35 +01:00
Szymon Morek
0b4fe9a0df
performance: add staging transfers for cl buffers
...
Related-To: NEO-13529
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-15 12:24:43 +01:00
Szymon Morek
2815d4167d
fix: don't use staging for map allocations
...
Related-To: NEO-13572
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-07 13:30:11 +01:00
Szymon Morek
6c4eb322b1
performance: introduce staging reads from image
...
Related-To: NEO-12968
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-12-11 14:43:45 +01:00
Szymon Morek
53da277bc1
fix: assign CSR once for staging image write
...
Related-To: NEO-13330
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-28 11:25:38 +01:00
Lukasz Jobczyk
7832b115a4
fix: Select csr once for staging buffer memcpy
...
Resolves: NEO-13083
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-11-22 15:42:09 +01:00
Szymon Morek
a25e973205
performance: write image through staging chunks
...
Related-To: NEO-12968
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-06 16:45:22 +01:00
Szymon Morek
cf58be4142
performance: use staging buffer when writing to an image
...
Related-To: NEO-12968
Also, don't import usm/mapped allocations for image
operations
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-10-30 17:02:13 +01:00
Damian Tomczak
e495b2238a
refactor: isStateless on init
...
Related-to: NEO-12938
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-10-18 10:53:17 +02:00
Szymon Morek
dbd96372be
performance: adjust staging buffer usage
...
Related-To: NEO-11928
Don't copy through staging buffer if dst usm allocation
was not used before and transfer would be split.
Also, don't use staging buffers for mapped ocl buffers.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-10 10:19:18 +02:00
Szymon Morek
29e3eb512c
performance: non-usm copy through staging buffers
...
Related-To: NEO-11501
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-06-25 07:18:53 +02:00
Andrzej Koska
ae139aeffd
refactor: Passing information about the engine
...
Extension of the interface with information
about the engine type passed to the function
Related-To: NEO-10678
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-04-22 13:33:56 +02:00
Kamil Kopryk
168445784e
feature: introduce states programming at driver init heapless ocl
...
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-03-08 12:29:44 +01:00
Mateusz Jablonski
a6458433dc
refactor: correct naming of enum class constants 9/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 10:36:25 +01:00
Mateusz Jablonski
dd1b9d6abc
refactor: correct naming of enum class constants 8/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 08:18:18 +01:00
Mateusz Jablonski
27fbdde4c5
refactor: correct naming of unified memory enums
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 15:58:21 +01:00
Milczarek, Slawomir
bec9874487
fix: Improved event profiling setup for bcs split enqueue
...
Fixes OpenCL Khronos failures in test profiling with bcs split
Related-To: NEO-8927
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2023-12-07 21:29:55 +01:00
Kamil Kopryk
ce7298d512
feature: Add heapless mode programming in ocl
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-11-24 12:53:39 +01:00
Dominik Dabek
1b7e178b25
performance(ocl): program barrier pc in taskStream
...
Program barrier to task stream, before next enqueue kernel.
This will reduce the number of batch buffer starts for sequences of
enqueue, barrier, enqueue, and so on.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-09-19 11:48:02 +02:00
Dominik Dabek
ee767822b8
performance(ocl): add bcsEngineCount to cmdq
...
Related-To: NEO-8155
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-09-15 15:49:56 +02:00
Mrozek, Michal
15a0bf9d7b
refactor: remove not needed code.
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2023-09-12 13:52:47 +02:00
Compute-Runtime-Validation
b5e9c10f64
Revert "performance(ocl): program barrier pc in taskStream"
...
This reverts commit 839c2d6737.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-12 01:32:28 +02:00
Dominik Dabek
839c2d6737
performance(ocl): program barrier pc in taskStream
...
Program barrier immediately to task stream.
This will reduce the number of batch buffer starts.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-09-11 13:23:26 +02:00
Dunajski, Bartosz
3cf1f5c462
fix: unregister CSR client on OCL sync points
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-08-16 13:35:07 +02:00
Dunajski, Bartosz
6656e23b86
refactor: unify completion check methods
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-07-25 14:17:52 +02:00
Dominik Dabek
622a3ed89c
performance(ocl): flag to not dcFlush on no event
...
If waitForBarrier is not passed an outEvent, then do
dcFlush on the next synchronize call.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-07-18 15:38:54 +02:00
Dunajski, Bartosz
815b37bf3a
performance: allow waiting for OOQ timestamps in clEnqueueWaitForEvents
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-07-14 11:33:10 +02:00
Dunajski, Bartosz
712e059ace
performance: check completion alloc only once when waiting for Event
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-07-14 08:32:50 +02:00
Dunajski, Bartosz
40c7b2842f
fix: check engines completion before releasing deferred TSP nodes
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
Related-To: NEO-8146
2023-07-11 16:11:51 +02:00
Dunajski, Bartosz
d96cf5846a
fix: don't allocate TSP for OOQ without Event
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-07-06 15:12:22 +02:00
Lukasz Jobczyk
0e758e4bb5
performance: Add debug flag to set BCS split minimal size
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-05-17 08:07:43 +02:00
Lukasz Jobczyk
9f755b54ad
performance: Bidirectional BCS split implementation for OCL
...
Related-To: NEO-7877
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-05-16 13:34:18 +02:00
Cencelewska, Katarzyna
ac8ac62c92
fix: add bcs engine type only if engine exist
...
- check bcs availability before inserting into bcsEnginesTypes
Resolves: NEO-7886
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-04-20 09:02:38 +02:00
Mateusz Jablonski
d014a731c5
fix multi device ocl: add missing tag update flush after blit
...
When memory is migrated, we need to flush the tag update.
Related-To: NEO-7552
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-04-04 11:06:57 +02:00
Lukasz Jobczyk
320deec53b
Wait for barrier on every OCL split blitter
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-03-15 09:34:41 +01:00
Maciej Plewka
52d322e738
Move barrier flush property from csr to cmdQueue
...
Related-To: NEO-6982, HSD-15010621906
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-03-03 14:33:53 +01:00
Dunajski, Bartosz
72ada8b41f
Add missing pollForCompletion on clEnqueueVerifyMemoryINTEL
...
Related-To: NEO-7618
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-03-03 10:55:33 +01:00