Warchulski, Jaroslaw
7ae44268cb
Cleanup includes 17
...
Cleaned up files:
shared/source/built_ins/sip.h
shared/source/compiler_interface/compiler_cache.h
shared/source/compiler_interface/compiler_interface.h
shared/source/device_binary_format/device_binary_formats.h
shared/source/helpers/timestamp_packet.h
shared/source/kernel/debug_data.h
shared/source/utilities/tag_allocator.h
shared/test/common/mocks/mock_device.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-02 10:59:08 +01:00
Kamil Kopryk
da80d9906e
Refactor: don't use global GfxCoreHelper getter in shared files 5/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-02 10:54:14 +01:00
Mateusz Hoppe
a2138e2ab9
perf: fix performance in enqueue path in OCL
...
add constexpr to if to only add extra if fo NDRange command
Related-To: NEO-7603
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-12-30 13:32:15 +01:00
Kamil Kopryk
7c23ea3928
Refactor: don't use global ProductHelper getter in ocl files 2/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-29 09:41:39 +01:00
Compute-Runtime-Validation
876de37b92
Revert "Feature(OCL) Use tag nodes for root device synchronization"
...
This reverts commit 547d1c37b3
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-12-24 19:14:33 +01:00
Maciej Plewka
547d1c37b3
Feature(OCL) Use tag nodes for root device synchronization
...
With this commit events created on multi root device contexts will
synchronize using signaled TagNodes instead of using taskCounts.
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
Related-To: NEO-7105
2022-12-23 15:48:54 +01:00
Kamil Kopryk
93deff0698
Refactor: don't use global ProductHelper getter in L0 3/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-23 12:19:03 +01:00
Warchulski, Jaroslaw
f275eea6ec
Cleanup includes 14
...
Cleaned up files:
shared/source/device/device.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2022-12-23 10:46:34 +01:00
Kamil Kopryk
47b6801b0b
Refactor: pass rootDeviceEnvironment to dispatchProfilingCommands
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-22 15:05:12 +01:00
Mateusz Jablonski
c8b55e38a0
Refactor: dont use global getters for Product/GfxCore helper in engine helpers
...
Related-To: NEO-6853
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-12-20 17:16:21 +01:00
Naklicki, Mateusz
810241e2a8
Remove unused useSingleSubdevice flag
...
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2022-12-16 12:15:12 +01:00
Kamil Kopryk
232b886056
Rename HwInfoConfig to ProductHelper
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-14 14:39:52 +01:00
Mateusz Jablonski
10dbfc0d19
Reduce usage of global gfx core helper getter [3/n]
...
Related-To: NEO-6853
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-12-13 11:13:11 +01:00
Dunajski, Bartosz
bcecd069b4
Add additional kernel flag + capability to pull extra patch tokens
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-12-13 09:04:39 +01:00
Kamil Kopryk
03b687881f
Rename HwHelper -> GfxCoreHelper
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-09 10:29:06 +01:00
Lukasz Jobczyk
573ac33e1d
Enqueue marker before split when OOQ barrier have to be handled
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-12-07 17:15:41 +01:00
Warchulski, Jaroslaw
be647d42d9
Cleanup includes 12
...
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2022-12-07 13:14:15 +01:00
Warchulski, Jaroslaw
c10aa90815
Cleanup includes 11
...
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2022-12-06 12:25:30 +01:00
Dominik Dabek
4adba15dbb
Update for resolving dependencies by pipecontrol
...
Flag ResolveDependenciesViaPipeControls now removes only same csr
dependencies. Still enables pipe controls.
Works through hwInfoConfig method isResolveDependenciesByPipeControlsSupported
Related-To: NEO-7321
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-12-05 15:48:49 +01:00
Kamil Kopryk
91d39ad5bf
Rename ClHwHelper -> ClGfxCoreHelper
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-05 11:26:20 +01:00
Maciej Plewka
4b42b066f8
Use dedicated using type for TaskCount
...
Related-To: NEO-7155
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2022-11-28 16:44:44 +01:00
Dunajski, Bartosz
3f962bf3e8
RelaxedOrdering: Improve dependencies tracking
...
Avoid not needed scheduler programming
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-11-28 16:00:21 +01:00
Naklicki, Mateusz
914939c377
Fix execution of cooperative kernels on multi-tile device
...
Add flag for forcing execution of kernels on single tile
Force cooperative kernels to use only single tile
Related-to: NEO-6729
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2022-11-28 14:36:21 +01:00
Dunajski, Bartosz
bc619fcbec
Queue stall mode for RelaxedOrdering
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-11-23 12:09:52 +01:00
Mateusz Jablonski
bb308c04ed
Refactor aubstream include interface
...
set include path to third_party/aub_stream
rename third_party/aub_stream/headers -> third_party/aub_stream/aubstream
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-11-23 10:30:13 +01:00
Szymon Morek
a66e69abc9
Prealloc cmd buffer for CSR only when being used
...
Related-To: NEO-7361
Currently additional command buffer is
preallocated for all CSRs, even for those which
won't be used by application. This PR changes that
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2022-11-15 20:06:29 +01:00
Kamil Kopryk
9564af14f5
Move ClCoreHelper ownership to RootDeviceEnvironment 3/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Use RootDeviceEnvironment getHelper<ClCoreHelper> for
- getQueueFamilyName
- getKernelPrivateMemSize
- preferBlitterForLocalToLocalTransfers
2022-11-15 15:45:33 +01:00
Lukasz Jobczyk
0a07ea94c7
Enable bcs split for H2H transfers
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-11-14 14:23:02 +01:00
Milczarek, Slawomir
25a5ed0dca
Allocate buffers in local memory for PVC multi root device platforms (1/n)
...
PVC platform with no support for atomic operations on system memory
must always allocate buffers in local memory to avoid atomic access violation.
Note: the feature is being implemented under the new registry key
AllocateBuffersInLocalMemoryForMultiRootDeviceContexts (disabled by default)
Related-To: NEO-7092
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2022-11-14 13:56:50 +01:00
Mateusz Hoppe
f77f47099c
OCL: Fix error for too big dimensions
...
- cases with null lws should only fail when computed
lws sizes result in too big number of workgroups
Related-To: NEO-6976
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-11-08 16:11:41 +01:00
Warchulski, Jaroslaw
ef95bfb45e
Cleanup includes
...
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2022-11-04 18:04:13 +01:00
Mateusz Jablonski
1c3d5c3892
Prepare mechanism for returning GPU execution error on OCL API
...
translate task count value to OCL error
Related-To: NEO-7412
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-11-04 15:54:19 +01:00
Krystian Chmielewski
69bef975f0
OCL cache per thread data
...
Add caching mechanism for local ids in OCL.
Signed-off-by: Krystian Chmielewski <krystian.chmielewski@intel.com>
2022-11-02 17:07:29 +01:00
Zbigniew Zdanowicz
565d820933
Replace virtual method call for DC flush with stored bool value 3/n
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-10-18 15:04:02 +02:00
Lukasz Jobczyk
4faf1eeb52
Flush tag update while enqueue mem fill
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-10-14 11:46:39 +02:00
Compute-Runtime-Validation
945897cf55
Revert "Replace virtual method call for DC flush with stored bool value 3/n"
...
This reverts commit 9d94089a95
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-10-13 04:46:10 +02:00
Zbigniew Zdanowicz
9d94089a95
Replace virtual method call for DC flush with stored bool value 3/n
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-10-12 14:38:47 +02:00
Zbigniew Zdanowicz
87822f94e2
Replace virtual method call for DC flush with stored bool value 2/n
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-10-12 09:43:01 +02:00
Lukasz Jobczyk
1fedaee18c
Wait on timestamps in Event::wait
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-10-10 13:04:21 +02:00
Compute-Runtime-Validation
1c73031a9b
Revert "Wait on timestamps in Event::wait"
...
This reverts commit f91047f2ee
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-10-08 06:17:33 +02:00
Lukasz Jobczyk
f91047f2ee
Wait on timestamps in Event::wait
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-10-07 12:04:45 +02:00
Mateusz Hoppe
2f24ef6855
Error from clEnqueueNDRangeKernel() for too big group counts
...
Resolves: NEO-6976
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-10-05 12:35:16 +02:00
Mateusz Jablonski
23eff82d0a
OCL: optimize creating printf buffer
...
Dont create printf buffer when kernel doesnt require it
Related-To: HSD-18023825570
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-09-30 12:25:57 +02:00
Dominik Dabek
d8b7d56160
Copy host ptr on cpu if possible in clCreateBuffer
...
use cpu copy with locked pointer if possible
because this is faster than copy on gpu
limit to buffers of size at most 64kb
Related-To: NEO-7332
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-09-27 17:54:06 +02:00
Lukasz Jobczyk
e00a5e0a1e
Release main copy engine from device and sub device
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-09-20 17:12:34 +02:00
Milczarek, Slawomir
0192e8038f
Check for GPU hang in path with wait for timestamps
...
Related-To: NEO-6868
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2022-09-19 15:01:46 +02:00
Lukasz Jobczyk
f65d2aeb87
Split copy along single dimension
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-09-08 12:39:06 +02:00
Mateusz Hoppe
6ad16c9e72
Missing support for OCL debugging with L0
...
- make resident debugSurface
- allocate debugSurface with correct allocation type and size
- notify cmdQ create/destroy
Related-To: NEO-7075
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-09-07 13:30:05 +02:00
Lukasz Jobczyk
a9b3b8137f
Release locks before wait on blocking BCS split calls
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-09-05 11:20:23 +02:00
Lukasz Jobczyk
0d6bef0753
Add BCS split to api specific config
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-09-05 11:13:10 +02:00
Compute-Runtime-Validation
20f49481f2
Revert "Enable BCS split WA in OCL"
...
This reverts commit d672920121
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-09-03 20:25:10 +02:00
Lukasz Jobczyk
d672920121
Enable BCS split WA in OCL
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-09-01 16:26:00 +02:00
Zbigniew Zdanowicz
c3f7e40a8d
Rename special pipeline select mode to systolic
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-08-31 22:16:26 +02:00
Lukasz Jobczyk
399758ef17
Change default engines for BCS split
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-08-31 21:34:36 +02:00
Lukasz Jobczyk
a3dedcc7ee
Add minimal transfer size for BCS split
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-08-31 21:28:40 +02:00
Dunajski, Bartosz
595cfebaef
Refactor PIPE_CONTROL programming
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-08-23 13:55:25 +02:00
Lukasz Jobczyk
82e29fd048
Add bcs split control mask
...
Introduce debug variable to control which engines
the tranfser will be split into
Related-To: NEO-7173
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-08-19 13:21:10 +02:00
Krystian Chmielewski
18adbed233
feat(zebin): add thread scheduling mode support
...
Resolves: NEO-7197
Signed-off-by: Krystian Chmielewski <krystian.chmielewski@intel.com>
2022-08-18 16:10:18 +02:00
Lukasz Jobczyk
b10b3ed9dd
Add initial enqueue bcs split infrastructure
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-08-18 15:02:08 +02:00
Katarzyna Cencelewska
fea9c9aca7
Add test to detect potential race
...
also add lock inside initialGpgpu
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-08-12 09:42:58 +02:00
Rafal Maziejuk
ed0c36117e
Apply heuristics when setting TG dispatch size on XE_HPC_CORE
...
The default TG dispatch size can be changed
to a better value based on number of threads in TG or
currently available amount of threads on GPU.
Decision on what TG dispatch size should be are based on
implemented heuristics.
Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com>
Related-To: NEO-6989
2022-08-08 16:43:10 +02:00
Compute-Runtime-Validation
b38c750cc4
Revert "Add finish before command queue is released"
...
This reverts commit 50fae92ea2
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-08-05 19:25:23 +02:00
Cencelewska, Katarzyna
61510e9a92
Revert optimization of gpgpu csr's mutex lock in the enqueue blit
...
optimization available under flag
ForceCsrLockInBcsEnqueueOnlyForGpgpuSubmission
Related-To: NEO-7011
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2022-08-04 16:42:50 +02:00
Dunajski, Bartosz
a3903c385e
Remove HW types from synchronization interface
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-07-25 13:59:26 +02:00
Kamil Kopryk
d4d54f5093
Cleanup includes
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-07-25 09:58:38 +02:00
Dominik Dabek
e88b24eb37
Remove flush on svmmap
...
No longer needed, problem was with reusing mem obj allocation.
Related-To: NEO-6948
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-07-20 17:26:15 +02:00
Mateusz Hoppe
50fae92ea2
Add finish before command queue is released
...
Related-To: NEO-5279
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-07-20 12:16:27 +02:00
Zbigniew Zdanowicz
3f8c19eec9
Limit system memory flag in builtin kernels to destination argument
...
Related-To: NEO-6959
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-07-19 10:58:51 +02:00
Bartosz Dunajski
52b00a11b0
Remove LSH from CommandQueue
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-07-19 08:47:02 +02:00
Zbigniew Zdanowicz
82a6f9e7b2
Use compute walker system fence for system memory or events in use
...
Related-To: NEO-6959
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-07-13 11:55:32 +02:00
Zbigniew Zdanowicz
e07f9f0698
Add kernel algorithm to check any argument is using system memory
...
Related-To: NEO-6959
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-07-12 13:08:49 +02:00
Zbigniew Zdanowicz
461a2eb8c7
Refactor interface to hardware interface
...
Related-To: NEO-6959
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-07-05 11:54:33 +02:00
Bartosz Dunajski
4fb4a1d77b
Add LogicalStateHelper getter for CommandQueue.
...
Refactor Kernel handling
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-07-04 13:21:15 +02:00
Lukasz Jobczyk
7a1cac0674
Fix EnableCmdQRoundRobindEngineAssign flag
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-06-27 11:38:52 +02:00
Dominik Dabek
39dfaf64bc
Fix: Flush on svmMap on multi device 2/n
...
Previous fix was causing the runtime to get buffer size
without gfx allocation, causing a seg fault.
This commit moves the fix logic to enqueue handler,
only changing the enqueueProperties.
Related-To: NEO-6948
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-06-21 11:38:28 +02:00
Maciej Plewka
6ab6e1abff
Fix mutex order for event task and move args to gpu
...
This commit fixes problem with untransfered shared usm memory to gpu
when there is submit to gpu trigerred by user event. Also there is a fix
for dead lock problem caused by mixed orders of locking mutexes in csr
and in direct submission controller.
Related-To: NEO-6762
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2022-06-21 11:28:25 +02:00
Lukasz Jobczyk
f98c6b1a8b
Disable round robin engine assign on PVC
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-06-20 15:24:18 +02:00
Dominik Dabek
ae61f0368a
Fix: Flush on svmMap on multi device
...
On multi device contexts we allocate svm in system memory.
This caused svmMap calls to assume no copy is needed and
no cache flush was sent to gpu and data modified by earlier
gpu commands was not being visible on host.
This change will add pipecontrol with dcFlush on svmMap
calls when task count from wait is enabled.
Related-To: NEO-6948
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-06-20 12:24:43 +02:00
Bartosz Dunajski
939d109362
Add LogicalStateHelper class
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-06-14 16:57:16 +02:00
Maciej Plewka
213dc2fe24
Make CPU copy for read buffer when host ptr is write combined on DG2
...
With this commit on DG2 32bit driver will check if passed host ptr for
clEnqueueReadBuffer is write combined memory. If check will be true copy
will be make on CPU.
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2022-06-13 21:23:21 +02:00
Dominik Dabek
0a3c960d61
Enable update task count from wait on DG2
...
Related-To: NEO-6948
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-06-06 11:41:27 +02:00
Zbigniew Zdanowicz
8431234845
Change interface to method programing additional fields of command
...
Related-To: NEO-6959
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-05-26 21:32:59 +02:00
Katarzyna Cencelewska
8ff34fea29
Fix typo
...
-engineGroupTyp to engineGroupType
-remove gap between commits to use same desc.ordinal in createCommandQueue
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-05-24 12:15:12 +02:00
Sebastian Luzynski
b8cf0c757a
Notify gtpin onCommandBufferComplete
...
Notify gtpin onContextDestroy before SVM Allocations are deleted.
Resolves: NEO-6985
Signed-off-by: Sebastian Luzynski <sebastian.jozef.luzynski@intel.com>
2022-05-20 16:42:13 +02:00
Michal Mrozek
cbd73d2d9c
Do not switch to batched mode when update task count form wait is enabled.
...
There is not need to enable batching in this case as we do not have
synchronization points between enqueues.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2022-05-19 13:16:20 +02:00
Artur Harasimiuk
3f04769f07
style: configure readability-identifier-naming.FunctionCase
...
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2022-05-17 20:55:56 +02:00
Katarzyna Cencelewska
96e1eb7467
Move variables baseDieRev and baseDieA0Masked from xe_hpc to pvc
...
Pvc specific variables should be located in pvc struct
Related-To: NEO-6738
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-05-17 12:19:16 +02:00
Patryk Wrobel
9b2ad0c5df
Detect GPU hangs in flushBcsTask()
...
This change introduces detection of GPU hangs in flushBcsTask()
function. The new code has been covered with ULTs.
Related-To: NEO-6681
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
2022-05-12 16:17:16 +02:00
Bartosz Dunajski
e722afbefb
Track waitlist TimestampPackets to avoid too early return to the pool
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-05-12 13:54:04 +02:00
Kamil Kopryk
fb4b1cca4f
Use internal blitter for internal memory transfers
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-6621
2022-05-11 19:33:00 +02:00
Artur Harasimiuk
e9be9b64c6
clang-tidy configuration cleanup
...
Define single .clang-tidy configuration with all used checks and use
NOLINT to selectively silence tool. That way cleanup should be easier.
third_part/ has its own configuration that disables clang-tidy for this
folder.
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2022-05-11 14:02:04 +02:00
Dominik Dabek
6e8cabdce5
Split wait for timestamps to queue and event
...
On PVC both enabled.
On DG2 only for events.
Related-To: NEO-6948
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-05-06 15:34:47 +02:00
Lukasz Jobczyk
82d1962aba
Assign BCS at first blit enqueue
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-05-04 17:11:34 +02:00
Lukasz Jobczyk
ba63e3c4b5
Obtain BCS for aux translation only if needed
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-05-04 15:59:11 +02:00
Compute-Runtime-Validation
a3fe50c2e4
Revert "Assign BCS at first blit enqueue"
...
This reverts commit 0469034acb
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-05-04 13:33:25 +02:00
Lukasz Jobczyk
0469034acb
Assign BCS at first blit enqueue
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-05-04 10:19:05 +02:00
Lukasz Jobczyk
73d3d83e60
Assign gpgpu engine at first enqueue
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-04-29 11:33:12 +02:00
Artur Harasimiuk
a6490062a9
fix code issues reported by clang 14
...
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2022-04-29 10:43:34 +02:00
Mateusz Jablonski
ac01f74924
style: fix gcc12 compilation errors
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-04-28 18:07:15 +02:00
Lukasz Jobczyk
72f6ef7d2b
Do not submit to gpgpu during blit in immediate dispatch mode
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-04-28 11:36:42 +02:00
Compute-Runtime-Validation
e8e3355a93
Revert "Assign BCS at first blit enqueue"
...
This reverts commit 5f5a1a6f81
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-04-26 09:40:46 +02:00
Lukasz Jobczyk
5f5a1a6f81
Assign BCS at first blit enqueue
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-04-25 13:48:21 +02:00
Katarzyna Cencelewska
91840e6e07
Fix gap for forcing engine by debug flag NodeOrdinal
...
add scenario when command queue created with properties
and there is also applied NodeOrdinal flag
then value from flag will use to force engine
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-04-22 10:48:04 +02:00
Jobczyk, Lukasz
93fe8a4794
Optimize gpgpu csr's mutex lock in the enqueue blit
...
Signed-off-by: Jobczyk, Lukasz <lukasz.jobczyk@intel.com>
2022-04-20 11:40:59 +02:00
Mateusz Jablonski
b5b8762faf
refactor direct submission: Pass command stream receiver to ctor
...
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-04-19 23:02:55 +02:00
Lukasz Jobczyk
6fdcb83020
Add missing tbx calls to timestamp wait logic
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-04-19 14:24:06 +02:00
Lukasz Jobczyk
7e4262839a
Add timestamp wait support for BCS
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-04-15 13:26:23 +02:00
Jaime Arteaga
7d4d22228f
Check for null allocation in processResources
...
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-04-13 02:36:27 +02:00
Zbigniew Zdanowicz
f4407064a4
Refactor store register mem encoder to include partition parameter
...
Related-To: NEO-6811
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-04-06 14:00:56 +02:00
Dominik Dabek
76289be8c1
Avoid unnecessary allocation in enqueueHandler
...
Related-To: NEO-6837
kernelObjsForAuxTranslation will allocate only if needed
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-04-06 13:03:41 +02:00
Lukasz Jobczyk
9ace9356f8
Use timestamp wait only if cache flush not needed
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-04-04 09:36:03 +02:00
Dominik Dabek
308f9ccfff
Do not allocate dynamic state heap if not needed
...
Dynamic state heap is only used for sampler data.
Related-To: NEO-6821
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-03-31 12:13:13 +02:00
Lukasz Jobczyk
aa46dd9cdf
Assign data only to unused barrier timestamps
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-30 16:56:06 +02:00
Zbigniew Zdanowicz
9858438121
Limit multiple partition count to compute command lists
...
Related-To: NEO-6811
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-29 07:29:08 +02:00
Jobczyk, Lukasz
d77a6cbe4b
Enable task count update from wait
...
Signed-off-by: Jobczyk, Lukasz <lukasz.jobczyk@intel.com>
2022-03-28 11:09:55 +02:00
Mateusz Jablonski
e11eb46bff
Unify logic for programming mocs in post sync struct
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-03-25 17:01:51 +01:00
Lukasz Jobczyk
a230f267e1
Poll task count indefinitely on high throttle command queue
...
Resolves: NEO-6781
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-25 10:06:16 +01:00
Compute-Runtime-Validation
0c064ccf4c
Revert "Enable task count update from wait"
...
This reverts commit 5118a5d3a6
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-25 00:09:52 +01:00
Lukasz Jobczyk
5118a5d3a6
Enable task count update from wait
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-23 11:38:50 +01:00
Patryk Wrobel
e4d56cde21
Detect GPU hangs in blocking enqueue handler calls
...
This change introduces detection of GPU hangs in blocking
calls to enqueueHandler() function. Moreover, usages of
this function template have been revised and adjusted to
check the exit code. Furthermore, enqueueBlit() and
dispatchBcsOrGpgpuEnqueue() functions returns value now.
ULTs have been added to cover new cases.
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
Related-To: NEO-6681
2022-03-22 21:06:19 +01:00
Jitendra Sharma
f52f3df274
Add platform specific getter of debug surface size
...
For different platforms based on number of available threads
and debug surface layout, calculate max debug surface size.
Related-To: NEO-6676
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2022-03-22 12:18:40 +01:00
Lukasz Jobczyk
fec738208a
Assign data to overwritten bcs barrier timestamps
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-22 11:54:11 +01:00
Lukasz Jobczyk
a74ae8f6af
Implement timestamp wait for events
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-17 17:57:30 +01:00
Lukasz Jobczyk
c8ba97e492
Restore gpgpu csr's mutex lock in the enqueue blit
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-10 13:36:46 +01:00
Lukasz Jobczyk
f91ae9d59c
Add multithread enqueue blit OOQ test
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-07 11:17:27 +01:00
Compute-Runtime-Validation
e526cc470b
Revert "Add multithread enqueue blit OOQ test"
...
This reverts commit 0919cad885
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-03 16:06:15 +01:00
Lukasz Jobczyk
999c6424a4
While enqueue blit do not flush gpgpu if already flushed
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-03 13:01:57 +01:00
Patryk Wrobel
f193efec2f
Remove additional memory allocations for surfaces container
...
In constructor of CommandComputeKernel we had been doing multiple allocations
of memory on heap due to lack of call to std::vector copy-constructor or reserve
member function.
Furthermore, in production code there is only one place, where we create objects
of this type and we redundantly copy the local variable, which could be moved.
This change:
- ensures that constructor of CommandComputeKernel performs single allocation
in the worst case; in the best case, it does not allocate memory due to usage
of std::move on input parameter
- steals the memory of the local variable in place of usage of the constructor
to remove redundant copying and memory allocations
- uses reserve() method to reduce the number of allocations during creation
of this local variable
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
2022-03-03 12:07:36 +01:00
Lukasz Jobczyk
0919cad885
Add multithread enqueue blit OOQ test
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-02 14:18:58 +01:00
Michal Mrozek
bfacd14b61
Remove not needed code.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2022-03-02 13:10:15 +01:00
Lukasz Jobczyk
ea574d9b39
Optimize enqueue blit mutex
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-01 14:43:29 +01:00
Lukasz Jobczyk
3c30e1b02b
Add AssignBCSAtEnqueue debug flag
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-01 12:43:36 +01:00
Patryk Wrobel
0ecc7c5e3b
Detect GPU hangs in clFinish
...
This change introduces detection of GPU hangs in
clFinish function as well as unit tests to cover
the new code.
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
2022-02-28 19:07:36 +01:00
Konstanty Misiak
cf1bc3a2ba
Disable EU fusion based on kernel properties from compiler
...
Related-To: NEO-6633
Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>
2022-02-28 18:50:38 +01:00
Patryk Wrobel
7f729b7f89
Detect GPU hang in clWaitForEvents
...
This change:
- moves NEO::WaitStatus to a separate file
- enables detection of GPU hang in clWaitForEvents
- adjusts most of blocking calls in CommandStreamReceiver to return WaitStatus
- adds ULTs to cover the new code
Related-To: NEO-6681
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
2022-02-23 13:33:09 +01:00
Bartosz Dunajski
4b0d986876
Move AllocationType enum out of GraphicsAllocation class
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-02-04 17:49:09 +01:00
Lukasz Jobczyk
9ff1307b4b
Fix optimize timestamp packet dependiencies
...
-program barrier after global fence allocation is programmed
-do not double barrier timestamp in blit enqueue
-flush GPGPU while submitting to BCS when barrier requested
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-02-03 16:27:09 +01:00
Bartosz Dunajski
52c6973e89
Rename blitBuffer method
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-02-03 15:02:55 +01:00
Michal Mrozek
8f85d4b8f8
Add debug variable to override message simd.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2022-02-03 11:07:42 +01:00
Konstanty Misiak
65255ea38f
Allow pointer copy when copying from host to svm
...
Related-To: NEO-5737
Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>
2022-02-02 13:08:35 +01:00
Mateusz Hoppe
3e66f21df6
Remove incorrect Debug break
...
- kernels using debug surface does not need to use
only bindful buffers
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-01-28 13:51:02 +01:00
Maciej Plewka
f8c104feaa
Use fw declaration of IndirectHeap in CommandContainer
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2022-01-26 13:30:26 +01:00
Rafal Maziejuk
f064f7dd67
Check if origin + region doesn't exceed image dimensions
...
Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com>
Related-To: NEO-6137
2022-01-25 15:13:15 +01:00
Katarzyna Cencelewska
3d9e1ea3a5
Remove device enqueue part 11
...
- remove templates from queue functions
Related-To: NEO-6559
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-01-25 12:40:56 +01:00
Michal Mrozek
27c43b27f3
Remove not needed method.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2022-01-20 15:02:19 +01:00
Katarzyna Cencelewska
a06fbd2077
Remove device enqueue part 10
...
- remove DeviceQueue
Related-To: NEO-6559
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-01-19 17:41:06 +01:00
Katarzyna Cencelewska
97765cd39f
Remove device enqueue part 7
...
- mainly remove BlockKernelManager and ReflectionSurfaceHelper
Related-To: NEO-6559
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-01-19 10:41:57 +01:00
Michal Mrozek
a8b8a61c87
Fix marker calls in non profiling path.
...
When waiting , if we guard blit copy operation,
we need to wait for bcs as well.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2022-01-18 15:46:50 +01:00
Michal Mrozek
dd63f96836
Fix dependency resolution when marker with profiling is programmed.
...
We need to make sure it properly follows dependencies.
Before the change, code was not programming semaphores.
Related-to: NEO-6444
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2022-01-18 11:48:38 +01:00
Katarzyna Cencelewska
59683ec491
Remove device enqueue part 6
...
- isParentKernel, peekParentKernel, parentKernel
- structs: AUBParentKernelFixture, MockParentKernel,
ParentKernelCommandQueueFixture
Related-To: NEO-6559
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-01-17 13:56:29 +01:00
Katarzyna Cencelewska
d2818aaea2
Remove device enqueue part 5
...
-remove scheduler and builtin_kernels_simulation
Related-To: NEO-6559
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-01-13 14:15:26 +01:00
Katarzyna Cencelewska
a9ebb49fb5
Remove device enqueue part 1
...
Remove:
-tests with matcher for device enqueue
-classes: MockDeviceQueueHw, DeviceQueueHw, SchedulerSimulation,
DeviceQueueHwTest, KernelArgDevQueueTest
-functions: forceDispatchScheduler, processDeviceEnqueue, dispatchScheduler
Related-To: NEO-6559
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-01-11 15:29:49 +01:00
Dominik Dabek
6c40c712d8
Check workDim when enqueueing kernel
...
Return CL_INVALID_WORK_DIMENSION when workDim is more than
maxWorkItemDimensions.
Related-To: NEO-6159
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2021-12-31 15:42:09 +01:00
Filip Hazubski
5be4d89b73
Rename function
...
Rename MemorySynchronizationCommands::isDcFlushAllowed
to MemorySynchronizationCommands::getDcFlushEnable
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-30 17:03:22 +01:00
Lukasz Jobczyk
418a8854b2
Skip timestamp wait when enqueue non kernel operation
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-30 10:02:07 +01:00
Lukasz Jobczyk
95585a81f7
Optimize timestamp packet dependencies
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-30 09:48:36 +01:00
Lukasz Jobczyk
b59b0b6b36
Download timestamps before checking completion
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-28 08:14:27 +01:00
Lukasz Jobczyk
14e338e669
Revert "Optimize timestamp packet dependencies"
...
This reverts commit c365b422963917e7b882f9db985969c036f0fa3f.
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-23 14:29:29 +01:00
Filip Hazubski
f4c151cce5
Refactor PipeControlArgs struct
...
Remove struct PipeControlArgsBase
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-22 17:13:16 +01:00
Filip Hazubski
9a450d1b74
Pass hwInfo to appendMiFlushDw
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-22 15:22:47 +01:00
Filip Hazubski
0fd685541d
Add isDcFlushAllowed function to HwInfoConfig
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-21 18:29:43 +01:00
Filip Hazubski
6d439f88bb
Explicitly set dcFlushEnable value
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-21 12:21:11 +01:00
Mateusz Jablonski
66bf806018
Remove magic number from set/getBatchBufferStartAddressGraphicsaddress methods
...
rename methods to set/getBatchBufferStartAddress
Related-To: NEO-6466
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-12-16 19:03:01 +01:00
Lukasz Jobczyk
1f0c58d0bf
Refactor timestamp wait mechanism
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-16 13:22:19 +01:00
Bartosz Dunajski
fe8cb190bc
Remove deprecated code
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-12-14 11:36:10 +01:00
Zbigniew Zdanowicz
01348451db
Add multi tile barrier to marker command on multi tile device
...
Related-To: NEO-6262
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-12-13 18:14:39 +01:00
Filip Hazubski
64aec8dc4e
Rename functions
...
Rename getEngines to getAllEngines
Rename engines to allEngines
Rename getEngineGroups to getRegularEngineGroups
Rename engineGroups to regularEngineGroups
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-13 14:27:02 +01:00
Michal Mrozek
ebb16c8b74
Add new debug key to control dependency resolution.
...
Currently only supported scenario is one in order queue.
Instead of resolving dependencies via semaphores, do this with pipe controls.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2021-12-09 17:26:38 +01:00
Filip Hazubski
f137e927c5
Add EngineUsageHint debug variable
...
Related-To: NEO-6219
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-09 16:06:23 +01:00
Rafal Maziejuk
d5f3ac37bf
Add KernelExecutionType argument to encodeAdditionalWalkerFields method
...
Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com>
2021-12-08 12:00:42 +01:00
Lukasz Jobczyk
0c49fa8203
Add missing cache flush for blocked path
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-07 12:36:34 +01:00
Maciej Dziuban
e3bb526067
Optimize timestamp packet dependencies
...
- Clear dependencies even if last engine changed
- Do no program semaphore waiting for blit when blit is submitted with gpgpu
- Track barrier timestamps to correctly synchronize blits in OOQ
Related-To: NEO-6444
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-12-07 11:47:50 +01:00
Filip Hazubski
2098b0c3fa
Refactor Device::engineGroups to store only available engine groups
...
Simplify logic around engine groups.
Remove no longer needed code.
Ensure correct device is used when verifying engine groups.
Related-To: NEO-6219
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-06 18:27:31 +01:00
Lukasz Jobczyk
09d2ffb9ed
Add missing cache flush
...
Resolves: NEO-6505
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-06 17:28:12 +01:00
Lukasz Jobczyk
c455c0f9c6
Remove "Update Device::engineGroups"
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-03 10:16:40 +01:00
Lukasz Jobczyk
63116e4100
Use wait utils while waiting for timestamps
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-02 17:49:05 +01:00
Filip Hazubski
4461b8ea3f
Refactor: Update Device::engineGroups to store only available engine groups
...
Simplify logic around engine groups.
Remove no longer needed code.
Related-To: NEO-6219
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-02 15:10:22 +01:00
Lukasz Jobczyk
36efe72e3e
Add wait on timestamp mechanism
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-01 08:48:00 +01:00
Maciej Dziuban
457ef00abf
Reuse graphics allocations in svmMemcpy
...
Related-To: NEO-6352
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-11-17 14:03:05 +01:00
Lukasz Jobczyk
e8cc34d7b0
Add infrastructure to assign engine to command queue with round robin
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-11-15 13:13:40 +01:00
Maciej Plewka
bdf5a1c39c
Disable deviceEnqueue for BDW
...
Related-To: NEO-6378
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2021-11-15 12:49:59 +01:00
Maciej Dziuban
b8a03cdea1
Remove getAnyBcs() method
...
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-11-10 18:55:37 +01:00
Zbigniew Zdanowicz
61641bb70a
Add implicit barriers capability to opencl
...
Related-To: NEO-6262
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-11-10 08:19:50 +01:00
Compute-Runtime-Validation
908a7721b2
Revert "Assign engine to command queue using round robin algorithm"
...
This reverts commit 2fff0f9059
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2021-11-10 07:25:44 +01:00
Lukasz Jobczyk
2fff0f9059
Assign engine to command queue using round robin algorithm
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-11-09 11:05:16 +01:00
Compute-Runtime-Validation
9058de77cc
Revert "Assign engine to command queue using round robin algorithm"
...
This reverts commit 1c68ac1cbc
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2021-11-08 16:36:18 +01:00
Maciej Dziuban
4c4b37f8d2
Implement BCS selection heuristic for OpenCL CommandQueue
...
Related-To: NEO-6057
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-11-08 12:58:14 +01:00
Bartosz Dunajski
4bf4b170f1
Simplify WALKER_TYPE definition
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-11-04 17:09:06 +01:00
Lukasz Jobczyk
1c68ac1cbc
Assign engine to command queue using round robin algorithm
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-11-03 17:16:54 +01:00
Maciej Dziuban
a582f34c04
Reuse graphics allocations in readBufferRect/writeBufferRect
...
Related-To: NEO-6352
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-11-03 14:58:51 +01:00
Mateusz Jablonski
3a2b018eb6
Enable blitterOperationsSupported on DG1
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-10-28 14:16:19 +02:00
Kamil Diedrich
5fdcb14d57
WSL fallback to staging buffer on blocking reads
...
Signed-off-by: Kamil Diedrich <kamil.diedrich@intel.com>
2021-10-28 03:34:58 +02:00
Maciej Dziuban
f1b6b733f0
Reuse graphics allocation from mapped buffers in OpenCL
...
Related-To: NEO-6352
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-10-27 15:53:10 +02:00
Filip Hazubski
28b37aea72
Correct handling unique_ptr in functions
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-10-21 18:17:14 +02:00
Rafal Maziejuk
811d107ea9
Don't allow blit enqueue for images with 64KB tiling & 3D surface type
...
Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com>
Related-To: NEO-6142
2021-10-19 16:03:48 +02:00
Zbigniew Zdanowicz
24ff191e61
Verify completion on all tiles
...
Related-To: NEO-6244
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-10-15 15:43:28 +02:00
Katarzyna Cencelewska
1c8a6d895a
Use hwInfoConfig to check blitter support for image
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2021-10-13 17:51:00 +02:00
Maciej Dziuban
75b7adbe95
Skip transfer when CL_MAP_INVALIDATE_REGION is specified
...
Resolves: NEO-5490
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-10-13 17:13:33 +02:00
Michal Mrozek
4768be244b
Cache post syncs.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2021-10-13 15:57:13 +02:00
Maciej Dziuban
19a6c9b1d3
Track separate task count for each BCS in OpenCL CommandQueue
...
Related-To: NEO-6057
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-10-13 12:06:27 +02:00
Mateusz Jablonski
b1df167632
Move enable_product.inl to shared
...
extract api agnostic validators to shared
remove not needed opencl includes from neo shared
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-10-12 22:02:16 +02:00
Maciej Dziuban
cdb7287816
Fix direct submission initialization
...
Related-To: NEO-6057
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-10-12 11:51:48 +02:00
Mateusz Jablonski
ec1e39bab8
Move OCL specific part of FileLogger to ClFileLogger
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-10-11 23:55:08 +02:00
Maciej Dziuban
c910a58aa0
Store bcsEngine in an std::vector in OpenCL CommandQUeue
...
Related-To: NEO-6057
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-10-07 21:55:21 +02:00
Rafal Maziejuk
eccebfb49d
Fix typo 2/n
...
Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com>
2021-10-07 21:42:49 +02:00
Zbigniew Zdanowicz
38915c6de9
Refactor performance counters interface
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-10-06 22:53:27 +02:00
Bartosz Dunajski
5560663b01
Revert "Disabled forced cross engine synchronization on single queue"
...
This reverts commit 4b5432967eb9f484a491323b6766efe14929c5e9.
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-10-06 17:49:48 +02:00
Zbigniew Zdanowicz
0363749948
Disjoint local work size code from hw command generation
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-10-06 01:09:56 +02:00
Zbigniew Zdanowicz
0b64ecba3f
Pass active partitions from dispatched kernel to context
...
Related-To: NEO-6244
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-10-05 14:57:47 +02:00
Konstanty Misiak
684b9e5f51
Fix missing submit time calculation
...
Related-To: NEO-5799
Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>
2021-10-05 12:58:04 +02:00
Mateusz Jablonski
5e201f40be
Prevent nullptr dereference in aub dump subcapture mode
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-10-04 14:23:19 +02:00
Zbigniew Zdanowicz
96fef37834
Move shared local work size code to shared directory
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-10-04 14:04:28 +02:00
Bartosz Dunajski
50bd8b5f19
Disabled forced cross engine synchronization on single queue
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-10-01 13:09:33 +02:00
Zbigniew Zdanowicz
317ef3776c
Do not pass active partitions from dispatched kernel to context class
...
Related-To: NEO-6244
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-09-30 13:32:19 +02:00
Zbigniew Zdanowicz
25c71a6c13
Move kernel info to shared directory
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-09-29 23:49:49 +02:00
Zbigniew Zdanowicz
6091861f3e
Enable multi-tile task count post-sync writes
...
Related-To: NEO-6244
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-09-28 16:32:16 +02:00
Artur Harasimiuk
a63b9b1273
move common files to shared/
...
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2021-09-27 11:10:20 +02:00
Maciej Dziuban
c04f8e5e5b
Pass copy engines to waitUntilComplete in OpenCL command queue
...
Related-To: NEO-6057
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-09-24 14:11:44 +02:00
Zbigniew Zdanowicz
3b35ba504f
Adapt command stream receiver to multiple active partitions
...
Related-To: NEO-6244
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-09-23 14:32:20 +02:00
Andrzej Koska
bd9457262e
Removing dependent commandQueue kernels
...
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
Related-To: NEO-6212
2021-09-23 14:04:53 +02:00
Mateusz Jablonski
80b49e4a47
Create ClPreemptionHelper
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-09-23 10:15:58 +02:00
Maciej Dziuban
6d1e048327
Rename an overload of waitUntilComplete to waitForAllEngines
...
- Overloaded functions were quite different, they should have different names
- waitForLatestTaskCount function was redundant and hence its body was moved
inside waitForAllEngines
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-09-23 09:43:11 +02:00
Mateusz Jablonski
d348526941
Simplify checkAndActivateAubSubCapture method
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-09-22 20:23:56 +02:00
Mateusz Hoppe
d4a4c35759
Zero-initialize debug surface
...
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2021-09-22 14:36:36 +02:00
Maciej Dziuban
04b6a604ec
Refactor BCS in OpenCL CommandQueue
...
- use CopyEngineState class to hold task count and engine type
- pass engine type to updateBcsTaskCount
Related-To: NEO-6057
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-09-22 10:39:46 +02:00
Maciej Dziuban
2707618fe3
Pass engine type to getBcsCommandStreamReceiver in OpenCL
...
Related-To: NEO-6057
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-09-21 16:27:07 +02:00
Maciej Dziuban
a4b3dc29e4
Remove unnecessary method
...
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-09-21 11:34:27 +02:00
Maciej Dziuban
c7d86af500
Pass blitter CSR to event
...
Related-To: NEO-6057
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-09-15 12:49:22 +02:00
Maciej Dziuban
858727010f
Add selectCsrForBuiltinOperation method to OpenCL CommandQueue
...
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
Related-To: NEO-6057
2021-09-10 12:26:47 +02:00
Zbigniew Zdanowicz
cd4f3c221a
Synchronize switching command buffers for all partitions
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-09-08 12:12:23 +02:00
Kamil Kopryk
a924b6a304
Code cleanup - avoid copy 5/n
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2021-09-08 08:59:55 +02:00
Maciej Dziuban
3e6b3cd780
Revert "Create single point for selecting engine for builtin ops"
...
This reverts commit 6513bd371a69cb821ea45c7c1964ea1f51dd75b2.
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-09-02 11:58:46 +02:00
Compute-Runtime-Validation
94cd243360
Revert "Correct CSR used for hostptr allocations in builtins"
...
This reverts commit 69ae9dc9c2
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2021-09-02 04:45:06 +02:00
Bartosz Dunajski
0345d9f707
Improve queries with number of SubDevices
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-09-01 14:01:57 +02:00
Maciej Dziuban
69ae9dc9c2
Correct CSR used for hostptr allocations in builtins
...
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-09-01 13:19:49 +02:00
Maciej Dziuban
33ac15dabe
Pass blitter CSR in blocked queue scenarios
...
Related-To: NEO-6057
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-08-31 16:29:17 +02:00
Bartosz Dunajski
92000bd75b
Improve Device getter to support EngineInstanced Device with single Engine
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-08-31 15:50:23 +02:00
Maciej Dziuban
a03b5f8a95
Create single point for selecting engine for builtin ops
...
- selectCsrForBuiltinOperation selects proper CSR
- selected CSR is passed to dispatchBcsOrGpgpuEnqueue
- Notifications such as notifyEnqueueReadBuffer are now made on correct engine
- Temporary allocs for hostptrs are now created on gpgpuCsr
Related-To: NEO-6057
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-08-31 13:29:17 +02:00
Maciej Dziuban
42524b6bef
Limit usage of getBcsCommandStreamReceiver()
...
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-08-27 12:15:30 +02:00
Bartosz Dunajski
cb91474b1a
Improve getDevice methods
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-08-26 14:45:00 +02:00
Filip Hazubski
63f8c9d98b
Add new EngineUsage
...
Simplify verifying EngineUsage in tryGetEngine function.
Remove unused getGpgpuEnginesCount function.
Related-To: NEO-4940
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-08-23 17:41:13 +02:00
Maciej Dziuban
78fa40fac0
Add new parameter to functions
...
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-08-19 17:25:43 +02:00
Maciej Plewka
096373ffe0
Dont use gpu time in submited timestamp for non blocking enqueue
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2021-08-17 19:00:44 +02:00
Szymon Morek
aa5e1780a2
Rename plus in filenames to and_later
...
Related-To: NEO-5920
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2021-08-17 11:26:27 +02:00
Konstanty Misiak
ad19eda689
Fix event profiling for marker commands
...
Related-To: NEO-5799
Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>
2021-08-13 15:02:01 +02:00
Szymon Morek
a85ae676e7
Rename plus in cmake macros to and_later
...
Related-To: NEO-5920
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2021-08-13 11:40:19 +02:00
Milczarek, Slawomir
eb14d8458b
Add helper function to enable stateless compression
...
Related-To: NEO-5107
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2021-08-02 14:34:33 +02:00
Kamil Kopryk
26e24fa872
Rename debug flag to EnableBlitterForEnqueueImageOperations
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-4692
2021-07-30 12:31:12 +02:00
Kamil Kopryk
af4b1afdbd
Add support for copy images by blitter OCL
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-4692
2021-07-30 09:29:45 +02:00
Kamil Kopryk
aed1da7495
Code cleanup - avoid copy 3/n
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2021-07-26 12:25:38 +02:00
Kamil Kopryk
88c6c9def8
Enable support for half_float images on BCS OCL
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-4692
2021-07-26 11:26:49 +02:00
Maciej Dziuban
9bca773b91
Remove command queue constructor
...
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-07-23 14:16:02 +02:00
Kamil Kopryk
30151a8f02
Cleanup images pitch programming OCL
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-4692
2021-07-21 19:47:56 +02:00
Lukasz Jobczyk
9e50138829
Defer ULLS initialization
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-07-21 15:55:16 +02:00
Maciej Plewka
dfe578754b
Fix add pipe controll before marker profiling
...
Resolves: NEO-6065
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2021-07-20 16:23:57 +02:00
Kamil Kopryk
ce92b42f63
Code cleanup - avoid copy 2/n
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2021-07-20 10:44:24 +02:00
Mateusz Jablonski
6f3c89decb
Add mechanism to migrate multi root device memory
...
invalidate TLB cache if kernel requires migration
Related-To: NEO-3691
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-07-19 19:49:22 +02:00
Kamil Kopryk
5a78fe3728
Code cleanup - avoid copy
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2021-07-19 18:01:36 +02:00
Kamil Kopryk
d703cff0bd
Code cleanup - don't copy string
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2021-07-19 17:54:42 +02:00
Kamil Kopryk
f06938c3d0
Enable support for 1Darray images on BCS OCL
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-4692
2021-07-19 17:30:49 +02:00
Zbigniew Zdanowicz
a869921c2a
Remove RMW from gfx allocations
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-07-15 11:28:03 +02:00
Szymon Morek
399dc60bf8
Remove processing extra properties
...
Related-To: NEO-6024
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2021-07-08 16:18:06 +02:00
Maciej Dziuban
6f6ee90aa5
Extend copy engine selector data
...
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-07-06 12:28:55 +02:00
Michal Mrozek
ad3855ceb3
Revert "Fix event profiling for marker commands"
...
This reverts commit 5f0167b477993f3ebc804c53a110bc432638615b.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2021-07-01 08:01:33 +02:00
Konstanty Misiak
1df1094465
Fix event profiling for marker commands
...
Related-To: NEO-5799
Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>
2021-06-30 14:56:39 +02:00
Kamil Kopryk
8582c9cbc7
Don't use 1Darrayed images with BCS OCL
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-4692
2021-06-28 15:04:03 +02:00
Bartosz Dunajski
5fbf084c41
Dont access PrintfHandler after move operation
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-06-28 13:57:42 +02:00
Kamil Kopryk
05cff2501a
Remove not needed check IsNV12
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-4692
2021-06-28 11:39:07 +02:00
Bartosz Dunajski
96d14967ac
Partial support for XE_HP_SDV
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-06-24 18:35:54 +02:00
Bartosz Dunajski
e1f42c2ae1
Remove TSP atomic dependency tracking
...
This reverts commit d17668f023bee409ee68e766bbc7d5f16ce8d52b.
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-06-23 16:59:41 +02:00
Bartosz Dunajski
c4a936d864
Track TSP OOQ ownership
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-06-23 15:59:47 +02:00
Bartosz Dunajski
82f1816aec
Keep implicit TSP dependencies ownership after enqueue
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-06-22 13:37:20 +02:00
Bartosz Dunajski
2e97aeccfd
Revert TSP changes
...
This commit reverts:
a1d2bdc76666059653c79fe39a26113ce47c632a,
71a115129c1698ff15305fd0ea3828cba861be47,
e1a9087a466bfba54d84a64247e6596092034a91.
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-06-21 12:44:12 +02:00
Bartosz Dunajski
8523747f91
Clear Event dependencies for Subcapture
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-06-18 12:58:49 +02:00
Mateusz Jablonski
e027178c37
Correct semaphore wait programming for cross device dependencies
...
when event is blocked by blocked user event then program semaphore wait during
unblocking user event
Related-To: NEO-3691
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-06-18 09:06:53 +02:00
Bartosz Dunajski
5af793ddc6
Remove TSP implicit dependency tracking logic
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-06-16 08:47:00 +02:00
Kamil Kopryk
b2918b5115
Correct enqueueCopyImage params
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-4692
2021-06-15 13:24:57 +02:00
Bartosz Dunajski
06fd238a95
Keep old queue TimestampPacket nodes ownership until completed
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-06-15 08:33:00 +02:00
Maciej Plewka
9fe2dddcd3
Estimate command stream size for marker profiling
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2021-06-14 21:36:07 +02:00
Kamil Kopryk
9c181df6e6
Don't use half_float images with bcs OCL
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-4692
2021-06-14 16:30:17 +02:00
Maciej Plewka
10cff46c22
Fix make resident event alloc for marker dispatch
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2021-06-14 10:44:27 +02:00
Krystian Chmielewski
6fad8b3100
Refactor ocl kernel
...
Removed dispatch traits pointers and added getters. Removed uneeded
tests.
Related-To: NEO-4729
Signed-off-by: Krystian Chmielewski <krystian.chmielewski@intel.com>
2021-06-11 16:52:09 +02:00
Kamil Kopryk
a9fe74c98d
Don't use BCS for NV12 images
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-4692
2021-06-10 13:51:28 +02:00
Kamil Kopryk
6ea3f57849
Don't use bcs for mipmapped images OCL
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-4692
2021-06-10 12:48:47 +02:00
Piotr Zdunowski
7cac94b597
Fix profiling for marker commands.
...
Resolves: NEO-4867
Signed-off-by: Piotr Zdunowski <piotr.zdunowski@intel.com>
2021-05-27 12:15:07 +02:00
Michal Mrozek
6bd6182759
Improve helper.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2021-05-26 11:42:46 +02:00
Michal Mrozek
128c994821
improve lws algorithm
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2021-05-25 15:55:19 +02:00
Michal Mrozek
270cae33fd
Change LWS selection policy for short kernels.
...
-prefer small LWS.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2021-05-20 15:23:06 +02:00
lgotszal
3bd4bca911
Copyright header update
...
Dates corrected in copyright headers to reflect original publication date
(2018 for OpenCL, 2020 for Level Zero).
Signed-off-by: lgotszal <lukasz.gotszald@intel.com>
2021-05-17 20:38:19 +02:00
Matias Cabral
2830073139
Save State Save Area Header in OCL
...
Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>
2021-05-13 17:58:14 +02:00
Lukasz Jobczyk
8e1e213bcf
Disable GEM_WAIT when new residency model available
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-05-10 14:59:21 +02:00
Krystian Chmielewski
ef71915c71
Remove KernelArgInfo and use KernelDescriptor's args instead
...
Related-To: NEO-4729
Signed-off-by: Krystian Chmielewski <krystian.chmielewski@intel.com>
2021-04-23 15:12:09 +02:00
Kamil Kopryk
7a5e0e13a5
Add isBlitterForImagesSupported helper
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2021-04-22 16:10:20 +02:00
Krzysztof Gibala
2b12e9b42f
Program MI_SEMAPHORE_WAIT only on multi device setup
...
Fill and program CsrDependenciesForForTaskCountContainer
only on multi device environment
Related-To: NEO-5508
Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com>
2021-04-21 16:38:06 +02:00
Adam Cetnerowski
33f05a25cd
Fix typo
...
Signed-off-by: Adam Cetnerowski <adam.cetnerowski@intel.com>
2021-04-21 16:25:17 +02:00
Maciej Dziuban
5318ff1872
Defer OsContext initialization
...
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
Related-To: NEO-5610
2021-04-19 11:54:34 +02:00
Krzysztof Gibala
b01b8ba5ac
Use MI_SEMAPHORE_WAIT command for event synchronization
...
Related-To: NEO-5508
Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com>
2021-04-19 11:34:52 +02:00
Mateusz Jablonski
722f693e0f
Set Context in program when using builtins
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-04-06 17:41:19 +02:00
Mateusz Jablonski
8215395401
Simplify Context method
...
return if context has multiple sub devices related to a given root device
Related-To: NEO-3691
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-03-30 10:22:15 +02:00
Krzysztof Gibala
7bfe26a08d
Remove ensureMemoryOnDevice function
...
Remove:
- ensureMemoryOnDevice
- lastUsedRootDeviceIndex
- requiredRootDeviceIndex
- getMigrateableMultiGraphicsAllocation
Related-To: NEO-4589
Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com>
2021-03-29 11:41:07 +02:00