Dominik Dabek
4adba15dbb
Update for resolving dependencies by pipecontrol
...
The ResolveDependenciesViaPipeControls flag now removes only dependencies
on the same CSR. It still enables pipe controls.
It works through the hwInfoConfig method isResolveDependenciesByPipeControlsSupported.
Related-To: NEO-7321
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-12-05 15:48:49 +01:00
Kamil Kopryk
91d39ad5bf
Rename ClHwHelper -> ClGfxCoreHelper
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-05 11:26:20 +01:00
Maciej Plewka
4b42b066f8
Use dedicated using type for TaskCount
...
Related-To: NEO-7155
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2022-11-28 16:44:44 +01:00
Dunajski, Bartosz
3f962bf3e8
RelaxedOrdering: Improve dependencies tracking
...
Avoid unnecessary scheduler programming
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-11-28 16:00:21 +01:00
Naklicki, Mateusz
914939c377
Fix execution of cooperative kernels on multi-tile device
...
Add flag for forcing execution of kernels on single tile
Force cooperative kernels to use only single tile
Related-to: NEO-6729
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2022-11-28 14:36:21 +01:00
Dunajski, Bartosz
bc619fcbec
Queue stall mode for RelaxedOrdering
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-11-23 12:09:52 +01:00
Mateusz Jablonski
bb308c04ed
Refactor aubstream include interface
...
set include path to third_party/aub_stream
rename third_party/aub_stream/headers -> third_party/aub_stream/aubstream
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-11-23 10:30:13 +01:00
Szymon Morek
a66e69abc9
Prealloc cmd buffer for CSR only when being used
...
Related-To: NEO-7361
Currently an additional command buffer is
preallocated for all CSRs, even for those which
won't be used by the application. This PR changes that.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2022-11-15 20:06:29 +01:00
Kamil Kopryk
9564af14f5
Move ClCoreHelper ownership to RootDeviceEnvironment 3/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Use RootDeviceEnvironment getHelper<ClCoreHelper> for
- getQueueFamilyName
- getKernelPrivateMemSize
- preferBlitterForLocalToLocalTransfers
2022-11-15 15:45:33 +01:00
Lukasz Jobczyk
0a07ea94c7
Enable bcs split for H2H transfers
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-11-14 14:23:02 +01:00
Milczarek, Slawomir
25a5ed0dca
Allocate buffers in local memory for PVC multi root device platforms (1/n)
...
PVC platform with no support for atomic operations on system memory
must always allocate buffers in local memory to avoid atomic access violation.
Note: the feature is being implemented under the new registry key
AllocateBuffersInLocalMemoryForMultiRootDeviceContexts (disabled by default)
Related-To: NEO-7092
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2022-11-14 13:56:50 +01:00
Mateusz Hoppe
f77f47099c
OCL: Fix error for too big dimensions
...
- cases with null lws should fail only when the computed
lws sizes result in too many workgroups
Related-To: NEO-6976
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-11-08 16:11:41 +01:00
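To illustrate the check described in the f77f47099c entry above, here is a minimal sketch, not the runtime's actual code. The function name `groupCountsFit` and the limit `kMaxGroupCountPerDim` are hypothetical; the commit message does not state the real limit, so a 32-bit counter per dimension is assumed purely for illustration.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <limits>

// Assumption for illustration only: the number of workgroups per dimension
// must fit in a 32-bit counter. The commit message does not give the limit.
constexpr std::uint64_t kMaxGroupCountPerDim =
    std::numeric_limits<std::uint32_t>::max();

// Hypothetical helper: returns true when the global work size (gws) divided
// by the local work size (lws), rounded up, stays within the assumed limit
// in every dimension. When the application passes a null lws, the lws given
// here would be the one the runtime computed.
inline bool groupCountsFit(const std::array<std::uint64_t, 3> &gws,
                           const std::array<std::uint64_t, 3> &lws) {
    for (std::size_t i = 0; i < 3; ++i) {
        if (lws[i] == 0) {
            return false; // a zero local size cannot form workgroups
        }
        const std::uint64_t groups =
            gws[i] / lws[i] + (gws[i] % lws[i] != 0 ? 1 : 0);
        if (groups > kMaxGroupCountPerDim) {
            return false;
        }
    }
    return true;
}
```

Under these assumptions, a global size of 2^33 in one dimension fails with a local size of 1 but passes with a local size of 256, which is why the check only makes sense after the lws has been computed.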
Warchulski, Jaroslaw
ef95bfb45e
Cleanup includes
...
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2022-11-04 18:04:13 +01:00
Mateusz Jablonski
1c3d5c3892
Prepare mechanism for returning GPU execution error on OCL API
...
translate task count value to OCL error
Related-To: NEO-7412
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-11-04 15:54:19 +01:00
Krystian Chmielewski
69bef975f0
OCL cache per thread data
...
Add caching mechanism for local ids in OCL.
Signed-off-by: Krystian Chmielewski <krystian.chmielewski@intel.com>
2022-11-02 17:07:29 +01:00
Zbigniew Zdanowicz
565d820933
Replace virtual method call for DC flush with stored bool value 3/n
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-10-18 15:04:02 +02:00
Lukasz Jobczyk
4faf1eeb52
Flush tag update while enqueue mem fill
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-10-14 11:46:39 +02:00
Compute-Runtime-Validation
945897cf55
Revert "Replace virtual method call for DC flush with stored bool value 3/n"
...
This reverts commit 9d94089a95.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-10-13 04:46:10 +02:00
Zbigniew Zdanowicz
9d94089a95
Replace virtual method call for DC flush with stored bool value 3/n
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-10-12 14:38:47 +02:00
Zbigniew Zdanowicz
87822f94e2
Replace virtual method call for DC flush with stored bool value 2/n
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-10-12 09:43:01 +02:00
Lukasz Jobczyk
1fedaee18c
Wait on timestamps in Event::wait
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-10-10 13:04:21 +02:00
Compute-Runtime-Validation
1c73031a9b
Revert "Wait on timestamps in Event::wait"
...
This reverts commit f91047f2ee.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-10-08 06:17:33 +02:00
Lukasz Jobczyk
f91047f2ee
Wait on timestamps in Event::wait
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-10-07 12:04:45 +02:00
Mateusz Hoppe
2f24ef6855
Error from clEnqueueNDRangeKernel() for too big group counts
...
Resolves: NEO-6976
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-10-05 12:35:16 +02:00
Mateusz Jablonski
23eff82d0a
OCL: optimize creating printf buffer
...
Don't create the printf buffer when the kernel doesn't require it
Related-To: HSD-18023825570
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-09-30 12:25:57 +02:00
Dominik Dabek
d8b7d56160
Copy host ptr on cpu if possible in clCreateBuffer
...
use a CPU copy with a locked pointer if possible,
because this is faster than a copy on the GPU
limit to buffers of size at most 64 KB
Related-To: NEO-7332
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-09-27 17:54:06 +02:00
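The rule in the d8b7d56160 entry above (copy on the CPU through a locked pointer, but only for buffers of at most 64 KB) could be sketched roughly as follows. This is an illustration under assumptions, not the code from the commit: the names `kMaxCpuCopySize`, `shouldCopyOnCpu` and `tryCpuCopy` are hypothetical, and any further conditions the runtime applies are not shown.

```cpp
#include <cstddef>
#include <cstring>

// Hypothetical constant mirroring the "at most 64kb" limit from the message.
constexpr std::size_t kMaxCpuCopySize = 64 * 1024;

// Hypothetical helper: a CPU copy is considered only when the allocation
// could be locked (non-null CPU pointer) and the buffer is small enough.
inline bool shouldCopyOnCpu(std::size_t bufferSize, const void *lockedPtr) {
    return lockedPtr != nullptr && bufferSize <= kMaxCpuCopySize;
}

// Copies hostPtr into the locked allocation on the CPU when allowed.
// Returns false so the caller can fall back to a GPU copy instead.
inline bool tryCpuCopy(void *lockedPtr, const void *hostPtr, std::size_t size) {
    if (hostPtr == nullptr || !shouldCopyOnCpu(size, lockedPtr)) {
        return false;
    }
    std::memcpy(lockedPtr, hostPtr, size);
    return true;
}
```

A caller would try `tryCpuCopy` first and submit the usual GPU copy only when it returns false, which keeps the fallback path unchanged for large buffers.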
Lukasz Jobczyk
e00a5e0a1e
Release main copy engine from device and sub device
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-09-20 17:12:34 +02:00
Milczarek, Slawomir
0192e8038f
Check for GPU hang in path with wait for timestamps
...
Related-To: NEO-6868
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2022-09-19 15:01:46 +02:00
Lukasz Jobczyk
f65d2aeb87
Split copy along single dimension
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-09-08 12:39:06 +02:00
Mateusz Hoppe
6ad16c9e72
Missing support for OCL debugging with L0
...
- make resident debugSurface
- allocate debugSurface with correct allocation type and size
- notify cmdQ create/destroy
Related-To: NEO-7075
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-09-07 13:30:05 +02:00
Lukasz Jobczyk
a9b3b8137f
Release locks before wait on blocking BCS split calls
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-09-05 11:20:23 +02:00
Lukasz Jobczyk
0d6bef0753
Add BCS split to api specific config
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-09-05 11:13:10 +02:00
Compute-Runtime-Validation
20f49481f2
Revert "Enable BCS split WA in OCL"
...
This reverts commit d672920121.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-09-03 20:25:10 +02:00
Lukasz Jobczyk
d672920121
Enable BCS split WA in OCL
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-09-01 16:26:00 +02:00
Zbigniew Zdanowicz
c3f7e40a8d
Rename special pipeline select mode to systolic
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-08-31 22:16:26 +02:00
Lukasz Jobczyk
399758ef17
Change default engines for BCS split
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-08-31 21:34:36 +02:00
Lukasz Jobczyk
a3dedcc7ee
Add minimal transfer size for BCS split
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-08-31 21:28:40 +02:00
Dunajski, Bartosz
595cfebaef
Refactor PIPE_CONTROL programming
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-08-23 13:55:25 +02:00
Lukasz Jobczyk
82e29fd048
Add bcs split control mask
...
Introduce a debug variable to control which engines
the transfer will be split across
Related-To: NEO-7173
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-08-19 13:21:10 +02:00
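As a rough illustration of the idea in the 82e29fd048 entry above, a bit mask can select which copy engines take part in a split transfer. The sketch below is an assumption about how such a mask might be interpreted, not the runtime's implementation: `Chunk`, `splitTransfer`, the bit-to-engine mapping and the even split are all hypothetical.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical description of one piece of a split transfer.
struct Chunk {
    unsigned engineIndex; // bit position in the mask, assumed to name an engine
    std::size_t offset;   // byte offset of this piece within the transfer
    std::size_t size;     // byte size of this piece
};

// Hypothetical helper: divides totalSize bytes as evenly as possible among
// the engines whose bits are set in engineMask. Returns an empty vector when
// no engine is selected or there is nothing to copy.
inline std::vector<Chunk> splitTransfer(std::size_t totalSize,
                                        std::uint32_t engineMask) {
    std::vector<unsigned> engines;
    for (unsigned bit = 0; bit < 32; ++bit) {
        if (engineMask & (1u << bit)) {
            engines.push_back(bit);
        }
    }

    std::vector<Chunk> chunks;
    if (engines.empty() || totalSize == 0) {
        return chunks;
    }

    const std::size_t base = totalSize / engines.size();
    const std::size_t remainder = totalSize % engines.size();
    std::size_t offset = 0;
    for (std::size_t i = 0; i < engines.size(); ++i) {
        // The first `remainder` engines take one extra byte each.
        const std::size_t size = base + (i < remainder ? 1 : 0);
        if (size == 0) {
            continue; // fewer bytes than engines: skip engines with no work
        }
        chunks.push_back({engines[i], offset, size});
        offset += size;
    }
    return chunks;
}
```

With this sketch, a mask of 0xA would route a transfer to the engines at bit positions 1 and 3 only, so changing the mask changes the set of engines without touching the splitting logic.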
Krystian Chmielewski
18adbed233
feat(zebin): add thread scheduling mode support
...
Resolves: NEO-7197
Signed-off-by: Krystian Chmielewski <krystian.chmielewski@intel.com>
2022-08-18 16:10:18 +02:00
Lukasz Jobczyk
b10b3ed9dd
Add initial enqueue bcs split infrastructure
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-08-18 15:02:08 +02:00
Katarzyna Cencelewska
fea9c9aca7
Add test to detect potential race
...
also add lock inside initialGpgpu
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-08-12 09:42:58 +02:00
Rafal Maziejuk
ed0c36117e
Apply heuristics when setting TG dispatch size on XE_HPC_CORE
...
The default TG dispatch size can be changed
to a better value based on the number of threads in the TG or
the number of threads currently available on the GPU.
The decision on what the TG dispatch size should be is based on
the implemented heuristics.
Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com>
Related-To: NEO-6989
2022-08-08 16:43:10 +02:00
Compute-Runtime-Validation
b38c750cc4
Revert "Add finish before command queue is released"
...
This reverts commit 50fae92ea2.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-08-05 19:25:23 +02:00
Cencelewska, Katarzyna
61510e9a92
Revert optimization of gpgpu csr's mutex lock in the enqueue blit
...
optimization available under flag
ForceCsrLockInBcsEnqueueOnlyForGpgpuSubmission
Related-To: NEO-7011
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2022-08-04 16:42:50 +02:00
Dunajski, Bartosz
a3903c385e
Remove HW types from synchronization interface
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-07-25 13:59:26 +02:00
Kamil Kopryk
d4d54f5093
Cleanup includes
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-07-25 09:58:38 +02:00
Dominik Dabek
e88b24eb37
Remove flush on svmmap
...
No longer needed; the problem was with reusing the mem obj allocation.
Related-To: NEO-6948
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-07-20 17:26:15 +02:00
Mateusz Hoppe
50fae92ea2
Add finish before command queue is released
...
Related-To: NEO-5279
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-07-20 12:16:27 +02:00
Zbigniew Zdanowicz
3f8c19eec9
Limit system memory flag in builtin kernels to destination argument
...
Related-To: NEO-6959
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-07-19 10:58:51 +02:00