Szymon Morek
0b4fe9a0df
performance: add staging transfers for cl buffers
...
Related-To: NEO-13529
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-15 12:24:43 +01:00
Szymon Morek
2815d4167d
fix: don't use staging for map allocations
...
Related-To: NEO-13572
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-07 13:30:11 +01:00
Szymon Morek
6c4eb322b1
performance: introduce staging reads from image
...
Related-To: NEO-12968
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-12-11 14:43:45 +01:00
Szymon Morek
53da277bc1
fix: assign CSR once for staging image write
...
Related-To: NEO-13330
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-11-28 11:25:38 +01:00
Lukasz Jobczyk
7832b115a4
fix: Select csr once for staging buffer memcpy
...
Resolves: NEO-13083
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-11-22 15:42:09 +01:00
Szymon Morek
a25e973205
performance: write image through staging chunks
...
Related-To: NEO-12968
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-11-06 16:45:22 +01:00
Szymon Morek
cf58be4142
performance: use staging buffer when writing to an image
...
Related-To: NEO-12968
Also, don't import usm/mapped allocations for image
operations
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-10-30 17:02:13 +01:00
Damian Tomczak
e495b2238a
refactor: isStateless on init
...
Related-to: NEO-12938
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2024-10-18 10:53:17 +02:00
Szymon Morek
dbd96372be
performance: adjust staging buffer usage
...
Related-To: NEO-11928
Don't copy through staging buffer if dst usm allocation
was not used before and transfer would be splitted.
Also, don't use staging buffers for mapped ocl buffers.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-07-10 10:19:18 +02:00
Szymon Morek
29e3eb512c
performance: non-usm copy through staging buffers
...
Related-To: NEO-11501
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-06-25 07:18:53 +02:00
Andrzej Koska
ae139aeffd
refactor: Passing information about the engine
...
Extension of the interface with information
about the engine type passed to the function
Related-To: NEO-10678
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com >
2024-04-22 13:33:56 +02:00
Kamil Kopryk
168445784e
feature: introduce states programming at driver init heapless ocl
...
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-03-08 12:29:44 +01:00
Mateusz Jablonski
a6458433dc
refactor: correct naming of enum class constants 9/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-19 10:36:25 +01:00
Mateusz Jablonski
dd1b9d6abc
refactor: correct naming of enum class constants 8/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-19 08:18:18 +01:00
Mateusz Jablonski
27fbdde4c5
refactor: correct naming of unified memory enums
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-13 15:58:21 +01:00
Milczarek, Slawomir
bec9874487
fix: Improved event profiling setup for bcs split enqueue
...
Fixes OpenCL Khronos failures in test profiling with bcs split
Related-To: NEO-8927
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com >
2023-12-07 21:29:55 +01:00
Kamil Kopryk
ce7298d512
feature: Add heapless mode programming in ocl
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2023-11-24 12:53:39 +01:00
Dominik Dabek
1b7e178b25
performance(ocl): program barrier pc in taskStream
...
Program barrier to task stream, before next enqueue kernel.
This will reduce the number of batch buffer starts for sequences of
enqueue, barrier, enqueue, ... .
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-19 11:48:02 +02:00
Dominik Dabek
ee767822b8
performance(ocl): add bcsEngineCount to cmdq
...
Related-To: NEO-8155
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-15 15:49:56 +02:00
Mrozek, Michal
15a0bf9d7b
refactor: remove not needed code.
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com >
2023-09-12 13:52:47 +02:00
Compute-Runtime-Validation
b5e9c10f64
Revert "performance(ocl): program barrier pc in taskStream"
...
This reverts commit 839c2d6737 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-12 01:32:28 +02:00
Dominik Dabek
839c2d6737
performance(ocl): program barrier pc in taskStream
...
Program barrier immediately to task stream.
This will reduce the number of batch buffer starts.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-11 13:23:26 +02:00
Dunajski, Bartosz
3cf1f5c462
fix: unregister CSR client on OCL sync points
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-08-16 13:35:07 +02:00
Dunajski, Bartosz
6656e23b86
refactor: unify completion check methods
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-25 14:17:52 +02:00
Dominik Dabek
622a3ed89c
performance(ocl): flag to not dcFlush on no event
...
If waitForBarrier is not passed outEvent then do
dcFlush on the next synchronize call.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-07-18 15:38:54 +02:00
Dunajski, Bartosz
815b37bf3a
performance: allow waiting for OOQ timestamps in clEnqueueWaitForEvents
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-14 11:33:10 +02:00
Dunajski, Bartosz
712e059ace
performance: check completion alloc only once when waiting for Event
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-14 08:32:50 +02:00
Dunajski, Bartosz
40c7b2842f
fix: check engines completion before releasing deferred TSP nodes
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
Related-To: NEO-8146
2023-07-11 16:11:51 +02:00
Dunajski, Bartosz
d96cf5846a
fix: dont allocate TSP for OOQ without Event
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-06 15:12:22 +02:00
Lukasz Jobczyk
0e758e4bb5
performance: Add debug flag to set BCS split minimal size
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-05-17 08:07:43 +02:00
Lukasz Jobczyk
9f755b54ad
performance: Bidirectional BCS split implementation for OCL
...
Related-To: NEO-7877
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-05-16 13:34:18 +02:00
Cencelewska, Katarzyna
ac8ac62c92
fix: add bcs engine type only if engine exist
...
- check bcs availability before insert to bcsEnginesTypes
Resolves: NEO-7886
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-04-20 09:02:38 +02:00
Mateusz Jablonski
d014a731c5
fix multi device ocl: add missing tag update flush after blit
...
when memory is migrated then we need to flush tag update
Related-To: NEO-7552
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-04-04 11:06:57 +02:00
Lukasz Jobczyk
320deec53b
Wait for barrier on every OCL split blitter
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-03-15 09:34:41 +01:00
Maciej Plewka
52d322e738
Move barrier flush property from csr to cmdQueue
...
Related-To: NEO-6982, HSD-15010621906
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2023-03-03 14:33:53 +01:00
Dunajski, Bartosz
72ada8b41f
Add missing pollForCompletion on clEnqueueVerifyMemoryINTEL
...
Related-To: NEO-7618
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-03-03 10:55:33 +01:00
Warchulski, Jaroslaw
e21cf516d3
Cleanup includes 44
...
Cleaned up files:
level_zero/tools/source/sysman/windows/os_sysman_imp.h
opencl/source/command_queue/command_queue.h
shared/source/command_container/cmdcontainer.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com >
2023-01-25 16:05:22 +01:00
Maciej Plewka
fa4830036a
feature(ocl) use tags to synchronize multi root device events
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2023-01-23 10:28:01 +01:00
Warchulski, Jaroslaw
8cc4cc1612
Cleanup includes 38
...
Cleaned up files:
opencl/source/cl_device/cl_device.h
opencl/source/command_queue/command_queue.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com >
2023-01-20 14:55:17 +01:00
Warchulski, Jaroslaw
191d4de033
Cleanup includes 28
...
Cleaned up files:
opencl/source/command_queue/command_queue.h
opencl/source/command_queue/gpgpu_walker.h
shared/source/helpers/memory_properties_helpers.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com >
2023-01-11 17:47:33 +01:00
Warchulski, Jaroslaw
3d59dce80c
Cleanup includes 27
...
Cleaned up files:
opencl/source/command_queue/command_queue.h
shared/source/built_ins/registry/built_ins_registry.h
shared/source/kernel/kernel_descriptor.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com >
2023-01-11 16:10:28 +01:00
Warchulski, Jaroslaw
b7ee3486d9
Cleanup includes 21
...
Cleaned up files:
opencl/source/kernel/kernel.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com >
2023-01-04 10:40:26 +01:00
Milczarek, Slawomir
1e8169ca3d
Add bcs support for cross-device migrations of multi-graphics allocations
...
This commit adds a support for cross-device migration to bcs enqueue path
exercised for CL enqueue rea/write, map, unmap calls, controlled with
the regkey AllocateBuffersInLocalMemoryForMultiRootDeviceContexts.
Related-To: NEO-7092
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com >
2023-01-03 14:09:07 +01:00
Maciej Plewka
4b42b066f8
Use dedicated using type for TaskCount
...
Related-To: NEO-7155
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2022-11-28 16:44:44 +01:00
Mateusz Jablonski
1c3d5c3892
Prepare mechanism for returning GPU execution error on OCL API
...
translate task count value to OCL error
Related-To: NEO-7412
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-11-04 15:54:19 +01:00
Lukasz Jobczyk
1fedaee18c
Wait on timestamps in Event::wait
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-10-10 13:04:21 +02:00
Compute-Runtime-Validation
1c73031a9b
Revert "Wait on timestamps in Event::wait"
...
This reverts commit f91047f2ee .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2022-10-08 06:17:33 +02:00
Lukasz Jobczyk
f91047f2ee
Wait on timestamps in Event::wait
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-10-07 12:04:45 +02:00
Milczarek, Slawomir
0192e8038f
Check for GPU hang in path with wait for timestamps
...
Related-To: NEO-6868
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com >
2022-09-19 15:01:46 +02:00
Mateusz Hoppe
6ad16c9e72
Missing support for OCL debugging with L0
...
- make resident debugSurface
- allocate debugSurface with correct allocation type and size
- notify cmdQ create/destroy
Related-To: NEO-7075
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2022-09-07 13:30:05 +02:00