Vysochyn, Illia
c74e9af84c
performance: Pass std::span by value
...
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com >
2025-09-10 21:05:25 +02:00
Kamil Kopryk
8a1dc7e8a9
refactor: remove not used command queue member
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-09-10 09:52:06 +02:00
Kamil Kopryk
1f862b3ee7
refactor: flush L3 after post sync
...
Related-To: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-09-09 12:56:50 +02:00
Lukasz Jobczyk
9d91e85f5c
fix: Submit every marker with dependencies
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-08-28 17:21:27 +02:00
Compute-Runtime-Validation
c279647cb2
Revert "fix: Submit every marker with dependencies"
...
This reverts commit d498f16fc3 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-08-28 05:04:18 +02:00
Lukasz Jobczyk
d498f16fc3
fix: Submit every marker with dependencies
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-08-27 15:00:54 +02:00
Maciej Bielski
9a82e3439a
fix: serialize constructBcsEngine
...
Related-To: NEO-15630
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2025-08-22 19:23:53 +02:00
Kamil Kopryk
2a3bd7cd03
fix: add deferred l3 flush tag in kernels
...
Related-To: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-08-22 12:56:47 +02:00
Compute-Runtime-Validation
f763974ec5
Revert "fix: serialize constructBcsEngine"
...
This reverts commit 9c92733829 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-08-20 03:21:02 +02:00
Maciej Bielski
9c92733829
fix: serialize constructBcsEngine
...
Related-To: NEO-15630
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2025-08-19 16:13:05 +02:00
Kamil Kopryk
d86cc127ea
fix: remove no longer needed flush l3 after cpu read
...
Related-To: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-08-14 09:12:04 +02:00
Bartosz Dunajski
21076b57df
fix: ensure queue completion on destruction
...
Related-To: HSD-22021388724
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-08-06 19:06:51 +02:00
Naklicki, Mateusz
dade66307e
refactor: cleanup header includes 4/n
...
cleanup of opencl/source directory
Related-To: NEO-15383
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com >
2025-07-22 19:36:34 +02:00
Tomasz Biernacik
2da77209dc
refactor: encapsulate condition
...
Related-To: NEO-14844
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com >
2025-07-22 10:43:39 +02:00
Chodor, Jaroslaw
fcc2038def
refactor: modernizing NEO::Range->std::span
...
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com >
2025-07-17 01:27:11 +02:00
Tomasz Biernacik
2c5cbec033
feature: control post sync completion check
...
Related-To: NEO-14844
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com >
2025-07-15 13:51:19 +02:00
Kamil Kopryk
ecb411108a
fix: don't skip wait for task count - cpu read + L3FlushAfterPostSync
...
Related-To: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-07-01 12:09:43 +02:00
Kamil Kopryk
f0f89836e0
fix: l3 flush after post sync logic in OCL
...
Related-To: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-06-23 18:03:21 +02:00
Szymon Morek
ac7d936942
fix: invalidate texture cache for BCS image write
...
Related-To: NEO-15029
If BCS is writing new data to an image, texture cache
should be invalidated as it might contain stale data for
that image.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-06-13 13:28:08 +02:00
Lukasz Jobczyk
df2c776aab
refactor: Remove unused cmdq round robin engine assign
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-09 11:03:57 +02:00
Szymon Morek
91ff22f255
fix: correctly calculate host ptr size for images
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-04-04 17:54:42 +02:00
Szymon Morek
dacbce7f01
performance: introduce staging read for cl_buffer
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-03-11 08:19:28 +01:00
Compute-Runtime-Validation
571e0f2ba3
Revert "performance: enable staging read for cl_buffer"
...
This reverts commit 04b6e87869 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-03-07 20:30:54 +01:00
Szymon Morek
04b6e87869
performance: enable staging read for cl_buffer
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-03-07 18:56:06 +01:00
Filip Hazubski
6b2b42972a
fix: Add asserts to ensure NonCopyable and NonMovable 1/n
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-02-18 09:41:20 +01:00
Szymon Morek
d477efeb40
fix: disable staging when cpu copy is allowed
...
Related-To: NEO-14027
If buffer cpu copy is allowed, it means that whole
transfer can be done on cpu memcpy.
In that case, disable staging as it's less performant.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-02-14 14:05:06 +01:00
Szymon Morek
be2ac667fc
performance: enable staging for reads from image
...
Related-To: NEO-14026
Also, provide performance hint if ptr from
user does not meet restrictions.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-02-13 18:15:59 +01:00
Szymon Morek
ac78c07899
refactor: unify duplicated code for staging images
...
Pass command type instead of maintaining
two methods for reads and writes
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-17 09:52:35 +01:00
Szymon Morek
0b4fe9a0df
performance: add staging transfers for cl buffers
...
Related-To: NEO-13529
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-15 12:24:43 +01:00
Szymon Morek
2815d4167d
fix: don't use staging for map allocations
...
Related-To: NEO-13572
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-07 13:30:11 +01:00
Szymon Morek
6c4eb322b1
performance: introduce staging reads from image
...
Related-To: NEO-12968
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-12-11 14:43:45 +01:00
Szymon Morek
53da277bc1
fix: assign CSR once for staging image write
...
Related-To: NEO-13330
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-11-28 11:25:38 +01:00
Lukasz Jobczyk
7832b115a4
fix: Select csr once for staging buffer memcpy
...
Resolves: NEO-13083
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-11-22 15:42:09 +01:00
Szymon Morek
a25e973205
performance: write image through staging chunks
...
Related-To: NEO-12968
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-11-06 16:45:22 +01:00
Szymon Morek
cf58be4142
performance: use staging buffer when writing to an image
...
Related-To: NEO-12968
Also, don't import usm/mapped allocations for image
operations
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-10-30 17:02:13 +01:00
Damian Tomczak
e495b2238a
refactor: isStateless on init
...
Related-to: NEO-12938
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2024-10-18 10:53:17 +02:00
Szymon Morek
dbd96372be
performance: adjust staging buffer usage
...
Related-To: NEO-11928
Don't copy through staging buffer if dst usm allocation
was not used before and transfer would be splitted.
Also, don't use staging buffers for mapped ocl buffers.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-07-10 10:19:18 +02:00
Szymon Morek
29e3eb512c
performance: non-usm copy through staging buffers
...
Related-To: NEO-11501
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-06-25 07:18:53 +02:00
Andrzej Koska
ae139aeffd
refactor: Passing information about the engine
...
Extension of the interface with information
about the engine type passed to the function
Related-To: NEO-10678
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com >
2024-04-22 13:33:56 +02:00
Kamil Kopryk
168445784e
feature: introduce states programming at driver init heapless ocl
...
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-03-08 12:29:44 +01:00
Mateusz Jablonski
a6458433dc
refactor: correct naming of enum class constants 9/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-19 10:36:25 +01:00
Mateusz Jablonski
dd1b9d6abc
refactor: correct naming of enum class constants 8/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-19 08:18:18 +01:00
Mateusz Jablonski
27fbdde4c5
refactor: correct naming of unified memory enums
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-13 15:58:21 +01:00
Milczarek, Slawomir
bec9874487
fix: Improved event profiling setup for bcs split enqueue
...
Fixes OpenCL Khronos failures in test profiling with bcs split
Related-To: NEO-8927
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com >
2023-12-07 21:29:55 +01:00
Kamil Kopryk
ce7298d512
feature: Add heapless mode programming in ocl
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2023-11-24 12:53:39 +01:00
Dominik Dabek
1b7e178b25
performance(ocl): program barrier pc in taskStream
...
Program barrier to task stream, before next enqueue kernel.
This will reduce the number of batch buffer starts for sequences of
enqueue, barrier, enqueue, ... .
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-19 11:48:02 +02:00
Dominik Dabek
ee767822b8
performance(ocl): add bcsEngineCount to cmdq
...
Related-To: NEO-8155
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-15 15:49:56 +02:00
Mrozek, Michal
15a0bf9d7b
refactor: remove not needed code.
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com >
2023-09-12 13:52:47 +02:00
Compute-Runtime-Validation
b5e9c10f64
Revert "performance(ocl): program barrier pc in taskStream"
...
This reverts commit 839c2d6737 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-12 01:32:28 +02:00
Dominik Dabek
839c2d6737
performance(ocl): program barrier pc in taskStream
...
Program barrier immediately to task stream.
This will reduce the number of batch buffer starts.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-11 13:23:26 +02:00