Maciej Plewka
6ab9037c35
Use FusedEu Dpas W/A only in ACM
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-03-08 14:14:58 +01:00
Cencelewska, Katarzyna
3e116ea378
refactor: use same paths when add command mi_semaphore_wait
...
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-03-07 10:35:26 +01:00
Maciej Plewka
52d322e738
Move barrier flush property from csr to cmdQueue
...
Related-To: NEO-6982, HSD-15010621906
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-03-03 14:33:53 +01:00
Dunajski, Bartosz
72ada8b41f
Add missing pollForCompletion on clEnqueueVerifyMemoryINTEL
...
Related-To: NEO-7618
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-03-03 10:55:33 +01:00
Dominik Dabek
22d7c2ea28
feat(ocl): move dependencies programming
...
Move out of csr dependencies programming to queue stream.
Correct method fillCsrDependenciesForTimestampPacketContainer.
Related-To: NEO-7321
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-02-20 13:42:56 +01:00
Warchulski, Jaroslaw
0556d543a3
Cleanup includes 56
...
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-02-16 14:42:44 +01:00
Maciej Plewka
75a3f99685
fix(ocl) use correct lws and groupCount to disable eu fusion check
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-02-15 14:43:36 +01:00
Mateusz Jablonski
51887af219
refactor OCL: cleanup after moving multi root device buffers to local mem
...
remove debug flag AllocateBuffersInLocalMemoryForMultiRootDeviceContexts
Related-To: NEO-5735
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-02-15 09:41:20 +01:00
Mateusz Jablonski
5a89a761f5
Fix OCL: always signal usage when migrating memory
...
multi storage memory migration needs to wait until memory is no longer used
Related-To: NEO-5735
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-02-14 15:10:02 +01:00
Maciej Plewka
429be6b4cb
Disable EUFusion for odd work groups with DPAS on DG2
...
Related-To: NEO-7495, HSD-14017007475
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-02-13 15:27:49 +01:00
Lukasz Jobczyk
7eb91e3b04
Split the L0 BCS split into D2H and H2D
...
-use separate pair of engines for D2H and H2D transfers
Related-To: NEO-7716
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-02-13 14:17:39 +01:00
Warchulski, Jaroslaw
d1b2311207
Cleanup includes 49
...
Cleaned up files:
opencl/source/command_queue/command_queue_hw.h
opencl/source/gtpin/gtpin_defs.h
opencl/source/mem_obj/mem_obj_helper.h
opencl/source/memory_manager/mem_obj_surface.h
opencl/source/sharings/unified/unified_sharing.h
opencl/test/unit_test/api/cl_enqueue_migrate_mem_objects_tests.inl
opencl/test/unit_test/aub_tests/fixtures/unified_memory_fixture.h
opencl/test/unit_test/command_queue/command_queue_fixture.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-02-10 17:26:49 +01:00
Warchulski, Jaroslaw
64f735481d
Cleanup includes 48
...
Cleaned up files:
shared/source/command_container/command_encoder.inl
shared/source/os_interface/hw_info_config.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-02-10 17:23:02 +01:00
Mateusz Jablonski
24c5352350
refactor: remove redundant including of compiler_cache.h
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-02-03 11:16:31 +01:00
Compute-Runtime-Validation
606a900080
Revert "Disable EUFusion for odd work groups with DPAS on DG2"
...
This reverts commit 017d66a469
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-02-03 02:45:21 +01:00
Maciej Plewka
017d66a469
Disable EUFusion for odd work groups with DPAS on DG2
...
Related-To: NEO-7495, HSD-14017007475
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-02-02 13:57:42 +01:00
Kamil Kopryk
2484c7ceb2
refactor: rename hw_helper files to gfx_core_helper files
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-02-01 19:37:51 +01:00
Kamil Kopryk
004d3e3416
refactor: don't use global ProductHelper getter 18
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-30 08:15:39 +01:00
Kamil Kopryk
5e059d4b30
refactor: don't use global ProductHelper getter 16
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-27 17:03:46 +01:00
Kamil Kopryk
445706361d
refactor: don't use global ProductHelper 14/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-27 14:51:12 +01:00
Dunajski, Bartosz
d42ec1ad8a
Unify multi regular context selection
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
Related-To: NEO-7618
2023-01-27 14:04:04 +01:00
Kamil Kopryk
026d50c7b9
refactor: don't use global ProductHelper getter 13/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-27 13:52:36 +01:00
Kamil Kopryk
b101f9f2f5
refactor: don't use global ProductHelper getter 12/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-27 12:29:18 +01:00
Kamil Kopryk
27393c76ea
refactor: don't use global ProductHelper getter 11/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-26 08:16:52 +01:00
Warchulski, Jaroslaw
e21cf516d3
Cleanup includes 44
...
Cleaned up files:
level_zero/tools/source/sysman/windows/os_sysman_imp.h
opencl/source/command_queue/command_queue.h
shared/source/command_container/cmdcontainer.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-25 16:05:22 +01:00
Warchulski, Jaroslaw
439aa6c87f
Cleanup includes 43
...
Cleaned up files:
level_zero/core/test/unit_tests/mocks/mock_kernel.h
opencl/source/mem_obj/mem_obj.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-25 11:33:39 +01:00
Kamil Kopryk
0499a72451
refactor: rename cl_gfx_core_helper files
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-24 11:16:03 +01:00
Warchulski, Jaroslaw
49837b7bb5
Cleanup includes 39
...
Cleaned up files:
shared/source/command_container/command_encoder.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-23 11:56:42 +01:00
Maciej Plewka
fa4830036a
feature(ocl) use tags to synchronize multi root device events
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-01-23 10:28:01 +01:00
Warchulski, Jaroslaw
8cc4cc1612
Cleanup includes 38
...
Cleaned up files:
opencl/source/cl_device/cl_device.h
opencl/source/command_queue/command_queue.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-20 14:55:17 +01:00
Dunajski, Bartosz
5a5596957a
Capability to create multiple Regular BCS contexts per engine.
...
Related-To: NEO-7618
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-01-20 13:43:48 +01:00
Kamil Kopryk
a4eba996f7
refactor: don't use global ProductHelper getter 4/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-20 10:27:27 +01:00
Warchulski, Jaroslaw
77501d86ba
Cleanup includes 35
...
Cleaned up files:
shared/source/command_stream/command_stream_receiver.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-17 18:51:40 +01:00
Warchulski, Jaroslaw
16d5a323c7
Cleanup includes 34
...
Cleaned up files:
opencl/source/command_queue/cl_local_work_size.h
opencl/test/unit_test/mocks/mock_buffer.h
shared/source/program/kernel_info.cpp
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-17 14:42:04 +01:00
Maciej Plewka
1421796541
Revert "feature(ocl) use tags to synchronize multi root device events"
...
This reverts commit 353a7510b2bd2d774d0b7ee82ee48eae7f5dc1d3.
Signed-off-by: Maciej Plewka maciej.plewka@intel.com
2023-01-17 11:29:58 +01:00
Warchulski, Jaroslaw
c275008e51
Cleanup includes 32
...
Cleaned up files:
level_zero/core/source/cmdlist/cmdlist_hw.h
level_zero/core/source/cmdqueue/cmdqueue.h
level_zero/core/source/event/event.h
opencl/source/helpers/get_info_status_mapper.h
opencl/source/helpers/hardware_commands_helper.h
shared/source/helpers/per_thread_data.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-16 20:41:37 +01:00
Dunajski, Bartosz
99e0493a39
Capability to create multiple Regular contexts per engine
...
Ralated-To: NEO-7618
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-01-16 12:45:35 +01:00
Cencelewska, Katarzyna
f94528097a
fix: add set stateCacheInvalidationEnable to flush cache
...
when blit operation and dcflush needed
resolves problem with corruptions visible when switch ccs with bcs
on platforms without engines coherency
Resolves: NEO-7577
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-01-13 10:35:54 +01:00
Maciej Plewka
16bc84e27d
feature(ocl) use tags to synchronize multi root device events
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-01-13 08:09:32 +01:00
Warchulski, Jaroslaw
fecb52ac49
Cleanup includes 29
...
Cleaned up files:
opencl/source/helpers/cl_memory_properties_helpers.h
shared/source/memory_manager/surface.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-13 07:53:03 +01:00
Warchulski, Jaroslaw
191d4de033
Cleanup includes 28
...
Cleaned up files:
opencl/source/command_queue/command_queue.h
opencl/source/command_queue/gpgpu_walker.h
shared/source/helpers/memory_properties_helpers.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-11 17:47:33 +01:00
Kamil Kopryk
200734892b
refactor: don't use global gfxCoreHelper getter 7/7
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-11 16:19:32 +01:00
Warchulski, Jaroslaw
3d59dce80c
Cleanup includes 27
...
Cleaned up files:
opencl/source/command_queue/command_queue.h
shared/source/built_ins/registry/built_ins_registry.h
shared/source/kernel/kernel_descriptor.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-11 16:10:28 +01:00
Warchulski, Jaroslaw
4794648978
Cleanup includes 26
...
Cleaned up files:
opencl/source/command_queue/csr_selection_args.h
opencl/source/event/event.h
shared/source/helpers/engine_control.h
shared/source/sku_info/definitions/sku_info.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-11 09:10:45 +01:00
Warchulski, Jaroslaw
77b88f19a1
Cleanup includes 23
...
Cleaned up files:
opencl/source/execution_environment/cl_execution_environment.h
opencl/source/helpers/cl_validators.h
opencl/test/unit_test/mocks/mock_cl_device.h
opencl/test/unit_test/mocks/mock_context.h
shared/source/helpers/cache_policy.h
shared/source/image/image_surface_state.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-09 12:30:30 +01:00
Mateusz Jablonski
43b790957d
style: format code using clang-format 15.0.6
...
Related-To: NEO-7500
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-01-05 10:33:47 +01:00
Kamil Kopryk
0dbf92d401
Refactor: reduce global productHelper getter usage
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-04 16:35:20 +01:00
Warchulski, Jaroslaw
b7ee3486d9
Cleanup includes 21
...
Cleaned up files:
opencl/source/kernel/kernel.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-04 10:40:26 +01:00
Milczarek, Slawomir
1e8169ca3d
Add bcs support for cross-device migrations of multi-graphics allocations
...
This commit adds a support for cross-device migration to bcs enqueue path
exercised for CL enqueue rea/write, map, unmap calls, controlled with
the regkey AllocateBuffersInLocalMemoryForMultiRootDeviceContexts.
Related-To: NEO-7092
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2023-01-03 14:09:07 +01:00
Warchulski, Jaroslaw
a2fe929f0c
Cleanup includes 18
...
Cleaned up files:
shared/source/command_stream/command_stream_receiver_hw.h
shared/source/compiler_interface/compiler_interface.h
shared/source/direct_submission/direct_submission_hw.h
shared/source/helpers/dirty_state_helpers.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-02 13:28:29 +01:00
Warchulski, Jaroslaw
7ae44268cb
Cleanup includes 17
...
Cleaned up files:
shared/source/built_ins/sip.h
shared/source/compiler_interface/compiler_cache.h
shared/source/compiler_interface/compiler_interface.h
shared/source/device_binary_format/device_binary_formats.h
shared/source/helpers/timestamp_packet.h
shared/source/kernel/debug_data.h
shared/source/utilities/tag_allocator.h
shared/test/common/mocks/mock_device.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-02 10:59:08 +01:00
Kamil Kopryk
da80d9906e
Refactor: don't use global GfxCoreHelper getter in shared files 5/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-02 10:54:14 +01:00
Mateusz Hoppe
a2138e2ab9
perf: fix performance in enqueue path in OCL
...
add constexpr to if to only add extra if fo NDRange command
Related-To: NEO-7603
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-12-30 13:32:15 +01:00
Kamil Kopryk
7c23ea3928
Refactor: don't use global ProductHelper getter in ocl files 2/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-29 09:41:39 +01:00
Compute-Runtime-Validation
876de37b92
Revert "Feature(OCL) Use tag nodes for root device synchronization"
...
This reverts commit 547d1c37b3
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-12-24 19:14:33 +01:00
Maciej Plewka
547d1c37b3
Feature(OCL) Use tag nodes for root device synchronization
...
With this commit events created on multi root device contexts will
synchronize using signaled TagNodes instead of using taskCounts.
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
Related-To: NEO-7105
2022-12-23 15:48:54 +01:00
Kamil Kopryk
93deff0698
Refactor: don't use global ProductHelper getter in L0 3/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-23 12:19:03 +01:00
Warchulski, Jaroslaw
f275eea6ec
Cleanup includes 14
...
Cleaned up files:
shared/source/device/device.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2022-12-23 10:46:34 +01:00
Kamil Kopryk
47b6801b0b
Refactor: pass rootDeviceEnvironment to dispatchProfilingCommands
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-22 15:05:12 +01:00
Mateusz Jablonski
c8b55e38a0
Refactor: dont use global getters for Product/GfxCore helper in engine helpers
...
Related-To: NEO-6853
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-12-20 17:16:21 +01:00
Naklicki, Mateusz
810241e2a8
Remove unused useSingleSubdevice flag
...
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2022-12-16 12:15:12 +01:00
Kamil Kopryk
232b886056
Rename HwInfoConfig to ProductHelper
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-14 14:39:52 +01:00
Mateusz Jablonski
10dbfc0d19
Reduce usage of global gfx core helper getter [3/n]
...
Related-To: NEO-6853
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-12-13 11:13:11 +01:00
Dunajski, Bartosz
bcecd069b4
Add additional kernel flag + capability to pull extra patch tokens
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-12-13 09:04:39 +01:00
Kamil Kopryk
03b687881f
Rename HwHelper -> GfxCoreHelper
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-09 10:29:06 +01:00
Lukasz Jobczyk
573ac33e1d
Enqueue marker before split when OOQ barrier have to be handled
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-12-07 17:15:41 +01:00
Warchulski, Jaroslaw
be647d42d9
Cleanup includes 12
...
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2022-12-07 13:14:15 +01:00
Warchulski, Jaroslaw
c10aa90815
Cleanup includes 11
...
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2022-12-06 12:25:30 +01:00
Dominik Dabek
4adba15dbb
Update for resolving dependencies by pipecontrol
...
Flag ResolveDependenciesViaPipeControls now removes only same csr
dependencies. Still enables pipe controls.
Works through hwInfoConfig method isResolveDependenciesByPipeControlsSupported
Related-To: NEO-7321
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-12-05 15:48:49 +01:00
Kamil Kopryk
91d39ad5bf
Rename ClHwHelper -> ClGfxCoreHelper
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-05 11:26:20 +01:00
Maciej Plewka
4b42b066f8
Use dedicated using type for TaskCount
...
Related-To: NEO-7155
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2022-11-28 16:44:44 +01:00
Dunajski, Bartosz
3f962bf3e8
RelaxedOrdering: Improve dependencies tracking
...
Avoid not needed scheduler programming
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-11-28 16:00:21 +01:00
Naklicki, Mateusz
914939c377
Fix execution of cooperative kernels on multi-tile device
...
Add flag for forcing execution of kernels on single tile
Force cooperative kernels to use only single tile
Related-to: NEO-6729
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2022-11-28 14:36:21 +01:00
Dunajski, Bartosz
bc619fcbec
Queue stall mode for RelaxedOrdering
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-11-23 12:09:52 +01:00
Mateusz Jablonski
bb308c04ed
Refactor aubstream include interface
...
set include path to third_party/aub_stream
rename third_party/aub_stream/headers -> third_party/aub_stream/aubstream
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-11-23 10:30:13 +01:00
Szymon Morek
a66e69abc9
Prealloc cmd buffer for CSR only when being used
...
Related-To: NEO-7361
Currently additional command buffer is
preallocated for all CSRs, even for those which
won't be used by application. This PR changes that
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2022-11-15 20:06:29 +01:00
Kamil Kopryk
9564af14f5
Move ClCoreHelper ownership to RootDeviceEnvironment 3/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Use RootDeviceEnvironment getHelper<ClCoreHelper> for
- getQueueFamilyName
- getKernelPrivateMemSize
- preferBlitterForLocalToLocalTransfers
2022-11-15 15:45:33 +01:00
Lukasz Jobczyk
0a07ea94c7
Enable bcs split for H2H transfers
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-11-14 14:23:02 +01:00
Milczarek, Slawomir
25a5ed0dca
Allocate buffers in local memory for PVC multi root device platforms (1/n)
...
PVC platform with no support for atomic operations on system memory
must always allocate buffers in local memory to avoid atomic access violation.
Note: the feature is being implemented under the new registry key
AllocateBuffersInLocalMemoryForMultiRootDeviceContexts (disabled by default)
Related-To: NEO-7092
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2022-11-14 13:56:50 +01:00
Mateusz Hoppe
f77f47099c
OCL: Fix error for too big dimensions
...
- cases with null lws should only fail when computed
lws sizes result in too big number of workgroups
Related-To: NEO-6976
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-11-08 16:11:41 +01:00
Warchulski, Jaroslaw
ef95bfb45e
Cleanup includes
...
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2022-11-04 18:04:13 +01:00
Mateusz Jablonski
1c3d5c3892
Prepare mechanism for returning GPU execution error on OCL API
...
translate task count value to OCL error
Related-To: NEO-7412
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-11-04 15:54:19 +01:00
Krystian Chmielewski
69bef975f0
OCL cache per thread data
...
Add caching mechanism for local ids in OCL.
Signed-off-by: Krystian Chmielewski <krystian.chmielewski@intel.com>
2022-11-02 17:07:29 +01:00
Zbigniew Zdanowicz
565d820933
Replace virtual method call for DC flush with stored bool value 3/n
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-10-18 15:04:02 +02:00
Lukasz Jobczyk
4faf1eeb52
Flush tag update while enqueue mem fill
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-10-14 11:46:39 +02:00
Compute-Runtime-Validation
945897cf55
Revert "Replace virtual method call for DC flush with stored bool value 3/n"
...
This reverts commit 9d94089a95
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-10-13 04:46:10 +02:00
Zbigniew Zdanowicz
9d94089a95
Replace virtual method call for DC flush with stored bool value 3/n
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-10-12 14:38:47 +02:00
Zbigniew Zdanowicz
87822f94e2
Replace virtual method call for DC flush with stored bool value 2/n
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-10-12 09:43:01 +02:00
Lukasz Jobczyk
1fedaee18c
Wait on timestamps in Event::wait
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-10-10 13:04:21 +02:00
Compute-Runtime-Validation
1c73031a9b
Revert "Wait on timestamps in Event::wait"
...
This reverts commit f91047f2ee
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-10-08 06:17:33 +02:00
Lukasz Jobczyk
f91047f2ee
Wait on timestamps in Event::wait
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-10-07 12:04:45 +02:00
Mateusz Hoppe
2f24ef6855
Error from clEnqueueNDRangeKernel() for too big group counts
...
Resolves: NEO-6976
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-10-05 12:35:16 +02:00
Mateusz Jablonski
23eff82d0a
OCL: optimize creating printf buffer
...
Dont create printf buffer when kernel doesnt require it
Related-To: HSD-18023825570
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-09-30 12:25:57 +02:00
Dominik Dabek
d8b7d56160
Copy host ptr on cpu if possible in clCreateBuffer
...
use cpu copy with locked pointer if possible
because this is faster than copy on gpu
limit to buffers of size at most 64kb
Related-To: NEO-7332
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-09-27 17:54:06 +02:00
Lukasz Jobczyk
e00a5e0a1e
Release main copy engine from device and sub device
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-09-20 17:12:34 +02:00
Milczarek, Slawomir
0192e8038f
Check for GPU hang in path with wait for timestamps
...
Related-To: NEO-6868
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2022-09-19 15:01:46 +02:00
Lukasz Jobczyk
f65d2aeb87
Split copy along single dimension
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-09-08 12:39:06 +02:00
Mateusz Hoppe
6ad16c9e72
Missing support for OCL debugging with L0
...
- make resident debugSurface
- allocate debugSurface with correct allocation type and size
- notify cmdQ create/destroy
Related-To: NEO-7075
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-09-07 13:30:05 +02:00
Lukasz Jobczyk
a9b3b8137f
Release locks before wait on blocking BCS split calls
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-09-05 11:20:23 +02:00
Lukasz Jobczyk
0d6bef0753
Add BCS split to api specific config
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-09-05 11:13:10 +02:00
Compute-Runtime-Validation
20f49481f2
Revert "Enable BCS split WA in OCL"
...
This reverts commit d672920121
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-09-03 20:25:10 +02:00
Lukasz Jobczyk
d672920121
Enable BCS split WA in OCL
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-09-01 16:26:00 +02:00
Zbigniew Zdanowicz
c3f7e40a8d
Rename special pipeline select mode to systolic
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-08-31 22:16:26 +02:00
Lukasz Jobczyk
399758ef17
Change default engines for BCS split
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-08-31 21:34:36 +02:00
Lukasz Jobczyk
a3dedcc7ee
Add minimal transfer size for BCS split
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-08-31 21:28:40 +02:00
Dunajski, Bartosz
595cfebaef
Refactor PIPE_CONTROL programming
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-08-23 13:55:25 +02:00
Lukasz Jobczyk
82e29fd048
Add bcs split control mask
...
Introduce debug variable to control which engines
the tranfser will be split into
Related-To: NEO-7173
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-08-19 13:21:10 +02:00
Krystian Chmielewski
18adbed233
feat(zebin): add thread scheduling mode support
...
Resolves: NEO-7197
Signed-off-by: Krystian Chmielewski <krystian.chmielewski@intel.com>
2022-08-18 16:10:18 +02:00
Lukasz Jobczyk
b10b3ed9dd
Add initial enqueue bcs split infrastructure
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-08-18 15:02:08 +02:00
Katarzyna Cencelewska
fea9c9aca7
Add test to detect potential race
...
also add lock inside initialGpgpu
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-08-12 09:42:58 +02:00
Rafal Maziejuk
ed0c36117e
Apply heuristics when setting TG dispatch size on XE_HPC_CORE
...
The default TG dispatch size can be changed
to a better value based on number of threads in TG or
currently available amount of threads on GPU.
Decision on what TG dispatch size should be are based on
implemented heuristics.
Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com>
Related-To: NEO-6989
2022-08-08 16:43:10 +02:00
Compute-Runtime-Validation
b38c750cc4
Revert "Add finish before command queue is released"
...
This reverts commit 50fae92ea2
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-08-05 19:25:23 +02:00
Cencelewska, Katarzyna
61510e9a92
Revert optimization of gpgpu csr's mutex lock in the enqueue blit
...
optimization available under flag
ForceCsrLockInBcsEnqueueOnlyForGpgpuSubmission
Related-To: NEO-7011
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2022-08-04 16:42:50 +02:00
Dunajski, Bartosz
a3903c385e
Remove HW types from synchronization interface
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-07-25 13:59:26 +02:00
Kamil Kopryk
d4d54f5093
Cleanup includes
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-07-25 09:58:38 +02:00
Dominik Dabek
e88b24eb37
Remove flush on svmmap
...
No longer needed, problem was with reusing mem obj allocation.
Related-To: NEO-6948
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-07-20 17:26:15 +02:00
Mateusz Hoppe
50fae92ea2
Add finish before command queue is released
...
Related-To: NEO-5279
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-07-20 12:16:27 +02:00
Zbigniew Zdanowicz
3f8c19eec9
Limit system memory flag in builtin kernels to destination argument
...
Related-To: NEO-6959
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-07-19 10:58:51 +02:00
Bartosz Dunajski
52b00a11b0
Remove LSH from CommandQueue
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-07-19 08:47:02 +02:00
Zbigniew Zdanowicz
82a6f9e7b2
Use compute walker system fence for system memory or events in use
...
Related-To: NEO-6959
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-07-13 11:55:32 +02:00
Zbigniew Zdanowicz
e07f9f0698
Add kernel algorithm to check any argument is using system memory
...
Related-To: NEO-6959
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-07-12 13:08:49 +02:00
Zbigniew Zdanowicz
461a2eb8c7
Refactor interface to hardware interface
...
Related-To: NEO-6959
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-07-05 11:54:33 +02:00
Bartosz Dunajski
4fb4a1d77b
Add LogicalStateHelper getter for CommandQueue.
...
Refactor Kernel handling
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-07-04 13:21:15 +02:00
Lukasz Jobczyk
7a1cac0674
Fix EnableCmdQRoundRobindEngineAssign flag
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-06-27 11:38:52 +02:00
Dominik Dabek
39dfaf64bc
Fix: Flush on svmMap on multi device 2/n
...
Previous fix was causing the runtime to get buffer size
without gfx allocation, causing a seg fault.
This commit moves the fix logic to enqueue handler,
only changing the enqueueProperties.
Related-To: NEO-6948
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-06-21 11:38:28 +02:00
Maciej Plewka
6ab6e1abff
Fix mutex order for event task and move args to gpu
...
This commit fixes problem with untransfered shared usm memory to gpu
when there is submit to gpu trigerred by user event. Also there is a fix
for dead lock problem caused by mixed orders of locking mutexes in csr
and in direct submission controller.
Related-To: NEO-6762
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2022-06-21 11:28:25 +02:00
Lukasz Jobczyk
f98c6b1a8b
Disable round robin engine assign on PVC
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-06-20 15:24:18 +02:00
Dominik Dabek
ae61f0368a
Fix: Flush on svmMap on multi device
...
On multi device contexts we allocate svm in system memory.
This caused svmMap calls to assume no copy is needed and
no cache flush was sent to gpu and data modified by earlier
gpu commands was not being visible on host.
This change will add pipecontrol with dcFlush on svmMap
calls when task count from wait is enabled.
Related-To: NEO-6948
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-06-20 12:24:43 +02:00
Bartosz Dunajski
939d109362
Add LogicalStateHelper class
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-06-14 16:57:16 +02:00
Maciej Plewka
213dc2fe24
Make CPU copy for read buffer when host ptr is write combined on DG2
...
With this commit on DG2 32bit driver will check if passed host ptr for
clEnqueueReadBuffer is write combined memory. If check will be true copy
will be make on CPU.
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2022-06-13 21:23:21 +02:00
Dominik Dabek
0a3c960d61
Enable update task count from wait on DG2
...
Related-To: NEO-6948
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-06-06 11:41:27 +02:00
Zbigniew Zdanowicz
8431234845
Change interface to method programing additional fields of command
...
Related-To: NEO-6959
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-05-26 21:32:59 +02:00
Katarzyna Cencelewska
8ff34fea29
Fix typo
...
-engineGroupTyp to engineGroupType
-remove gap between commits to use same desc.ordinal in createCommandQueue
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-05-24 12:15:12 +02:00
Sebastian Luzynski
b8cf0c757a
Notify gtpin onCommandBufferComplete
...
Notify gtpin onContextDestroy before SVM Allocations are deleted.
Resolves: NEO-6985
Signed-off-by: Sebastian Luzynski <sebastian.jozef.luzynski@intel.com>
2022-05-20 16:42:13 +02:00
Michal Mrozek
cbd73d2d9c
Do not switch to batched mode when update task count form wait is enabled.
...
There is not need to enable batching in this case as we do not have
synchronization points between enqueues.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2022-05-19 13:16:20 +02:00
Artur Harasimiuk
3f04769f07
style: configure readability-identifier-naming.FunctionCase
...
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2022-05-17 20:55:56 +02:00
Katarzyna Cencelewska
96e1eb7467
Move variables baseDieRev and baseDieA0Masked from xe_hpc to pvc
...
Pvc specific variables should be located in pvc struct
Related-To: NEO-6738
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-05-17 12:19:16 +02:00
Patryk Wrobel
9b2ad0c5df
Detect GPU hangs in flushBcsTask()
...
This change introduces detection of GPU hangs in flushBcsTask()
function. The new code has been covered with ULTs.
Related-To: NEO-6681
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
2022-05-12 16:17:16 +02:00
Bartosz Dunajski
e722afbefb
Track waitlist TimestampPackets to avoid too early return to the pool
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-05-12 13:54:04 +02:00
Kamil Kopryk
fb4b1cca4f
Use internal blitter for internal memory transfers
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-6621
2022-05-11 19:33:00 +02:00
Artur Harasimiuk
e9be9b64c6
clang-tidy configuration cleanup
...
Define single .clang-tidy configuration with all used checks and use
NOLINT to selectively silence tool. That way cleanup should be easier.
third_part/ has its own configuration that disables clang-tidy for this
folder.
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2022-05-11 14:02:04 +02:00
Dominik Dabek
6e8cabdce5
Split wait for timestamps to queue and event
...
On PVC both enabled.
On DG2 only for events.
Related-To: NEO-6948
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-05-06 15:34:47 +02:00
Lukasz Jobczyk
82d1962aba
Assign BCS at first blit enqueue
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-05-04 17:11:34 +02:00
Lukasz Jobczyk
ba63e3c4b5
Obtain BCS for aux translation only if needed
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-05-04 15:59:11 +02:00
Compute-Runtime-Validation
a3fe50c2e4
Revert "Assign BCS at first blit enqueue"
...
This reverts commit 0469034acb
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-05-04 13:33:25 +02:00
Lukasz Jobczyk
0469034acb
Assign BCS at first blit enqueue
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-05-04 10:19:05 +02:00
Lukasz Jobczyk
73d3d83e60
Assign gpgpu engine at first enqueue
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-04-29 11:33:12 +02:00
Artur Harasimiuk
a6490062a9
fix code issues reported by clang 14
...
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2022-04-29 10:43:34 +02:00
Mateusz Jablonski
ac01f74924
style: fix gcc12 compilation errors
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-04-28 18:07:15 +02:00
Lukasz Jobczyk
72f6ef7d2b
Do not submit to gpgpu during blit in immediate dispatch mode
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-04-28 11:36:42 +02:00
Compute-Runtime-Validation
e8e3355a93
Revert "Assign BCS at first blit enqueue"
...
This reverts commit 5f5a1a6f81
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-04-26 09:40:46 +02:00
Lukasz Jobczyk
5f5a1a6f81
Assign BCS at first blit enqueue
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-04-25 13:48:21 +02:00
Katarzyna Cencelewska
91840e6e07
Fix gap for forcing engine by debug flag NodeOrdinal
...
add scenario when command queue created with properties
and there is also applied NodeOrdinal flag
then value from flag will use to force engine
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-04-22 10:48:04 +02:00
Jobczyk, Lukasz
93fe8a4794
Optimize gpgpu csr's mutex lock in the enqueue blit
...
Signed-off-by: Jobczyk, Lukasz <lukasz.jobczyk@intel.com>
2022-04-20 11:40:59 +02:00
Mateusz Jablonski
b5b8762faf
refactor direct submission: Pass command stream receiver to ctor
...
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-04-19 23:02:55 +02:00
Lukasz Jobczyk
6fdcb83020
Add missing tbx calls to timestamp wait logic
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-04-19 14:24:06 +02:00
Lukasz Jobczyk
7e4262839a
Add timestamp wait support for BCS
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-04-15 13:26:23 +02:00
Jaime Arteaga
7d4d22228f
Check for null allocation in processResources
...
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-04-13 02:36:27 +02:00
Zbigniew Zdanowicz
f4407064a4
Refactor store register mem encoder to include partition parameter
...
Related-To: NEO-6811
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-04-06 14:00:56 +02:00
Dominik Dabek
76289be8c1
Avoid unnecessary allocation in enqueueHandler
...
Related-To: NEO-6837
kernelObjsForAuxTranslation will allocate only if needed
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-04-06 13:03:41 +02:00
Lukasz Jobczyk
9ace9356f8
Use timestamp wait only if cache flush not needed
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-04-04 09:36:03 +02:00
Dominik Dabek
308f9ccfff
Do not allocate dynamic state heap if not needed
...
Dynamic state heap is only used for sampler data.
Related-To: NEO-6821
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-03-31 12:13:13 +02:00
Lukasz Jobczyk
aa46dd9cdf
Assign data only to unused barrier timestamps
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-30 16:56:06 +02:00
Zbigniew Zdanowicz
9858438121
Limit multiple partition count to compute command lists
...
Related-To: NEO-6811
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-29 07:29:08 +02:00
Jobczyk, Lukasz
d77a6cbe4b
Enable task count update from wait
...
Signed-off-by: Jobczyk, Lukasz <lukasz.jobczyk@intel.com>
2022-03-28 11:09:55 +02:00
Mateusz Jablonski
e11eb46bff
Unify logic for programming mocs in post sync struct
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-03-25 17:01:51 +01:00
Lukasz Jobczyk
a230f267e1
Poll task count indefinitely on high throttle command queue
...
Resolves: NEO-6781
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-25 10:06:16 +01:00
Compute-Runtime-Validation
0c064ccf4c
Revert "Enable task count update from wait"
...
This reverts commit 5118a5d3a6
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-25 00:09:52 +01:00
Lukasz Jobczyk
5118a5d3a6
Enable task count update from wait
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-23 11:38:50 +01:00
Patryk Wrobel
e4d56cde21
Detect GPU hangs in blocking enqueue handler calls
...
This change introduces detection of GPU hangs in blocking
calls to enqueueHandler() function. Moreover, usages of
this function template have been revised and adjusted to
check the exit code. Furthermore, enqueueBlit() and
dispatchBcsOrGpgpuEnqueue() functions returns value now.
ULTs have been added to cover new cases.
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
Related-To: NEO-6681
2022-03-22 21:06:19 +01:00
Jitendra Sharma
f52f3df274
Add platform specific getter of debug surface size
...
For different platforms based on number of available threads
and debug surface layout, calculate max debug surface size.
Related-To: NEO-6676
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2022-03-22 12:18:40 +01:00
Lukasz Jobczyk
fec738208a
Assign data to overwritten bcs barrier timestamps
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-22 11:54:11 +01:00
Lukasz Jobczyk
a74ae8f6af
Implement timestamp wait for events
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-17 17:57:30 +01:00
Lukasz Jobczyk
c8ba97e492
Restore gpgpu csr's mutex lock in the enqueue blit
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-10 13:36:46 +01:00
Lukasz Jobczyk
f91ae9d59c
Add multithread enqueue blit OOQ test
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-07 11:17:27 +01:00
Compute-Runtime-Validation
e526cc470b
Revert "Add multithread enqueue blit OOQ test"
...
This reverts commit 0919cad885
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-03 16:06:15 +01:00
Lukasz Jobczyk
999c6424a4
While enqueue blit do not flush gpgpu if already flushed
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-03 13:01:57 +01:00
Patryk Wrobel
f193efec2f
Remove additional memory allocations for surfaces container
...
In constructor of CommandComputeKernel we had been doing multiple allocations
of memory on heap due to lack of call to std::vector copy-constructor or reserve
member function.
Furthermore, in production code there is only one place, where we create objects
of this type and we redundantly copy the local variable, which could be moved.
This change:
- ensures that constructor of CommandComputeKernel performs single allocation
in the worst case; in the best case, it does not allocate memory due to usage
of std::move on input parameter
- steals the memory of the local variable in place of usage of the constructor
to remove redundant copying and memory allocations
- uses reserve() method to reduce the number of allocations during creation
of this local variable
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
2022-03-03 12:07:36 +01:00
Lukasz Jobczyk
0919cad885
Add multithread enqueue blit OOQ test
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-02 14:18:58 +01:00
Michal Mrozek
bfacd14b61
Remove not needed code.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2022-03-02 13:10:15 +01:00
Lukasz Jobczyk
ea574d9b39
Optimize enqueue blit mutex
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-01 14:43:29 +01:00
Lukasz Jobczyk
3c30e1b02b
Add AssignBCSAtEnqueue debug flag
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-01 12:43:36 +01:00
Patryk Wrobel
0ecc7c5e3b
Detect GPU hangs in clFinish
...
This change introduces detection of GPU hangs in
clFinish function as well as unit tests to cover
the new code.
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
2022-02-28 19:07:36 +01:00
Konstanty Misiak
cf1bc3a2ba
Disable EU fusion based on kernel properties from compiler
...
Related-To: NEO-6633
Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>
2022-02-28 18:50:38 +01:00
Patryk Wrobel
7f729b7f89
Detect GPU hang in clWaitForEvents
...
This change:
- moves NEO::WaitStatus to a separate file
- enables detection of GPU hang in clWaitForEvents
- adjusts most of blocking calls in CommandStreamReceiver to return WaitStatus
- adds ULTs to cover the new code
Related-To: NEO-6681
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
2022-02-23 13:33:09 +01:00
Bartosz Dunajski
4b0d986876
Move AllocationType enum out of GraphicsAllocation class
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-02-04 17:49:09 +01:00
Lukasz Jobczyk
9ff1307b4b
Fix optimize timestamp packet dependiencies
...
-program barrier after global fence allocation is programmed
-do not double barrier timestamp in blit enqueue
-flush GPGPU while submitting to BCS when barrier requested
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-02-03 16:27:09 +01:00
Bartosz Dunajski
52c6973e89
Rename blitBuffer method
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-02-03 15:02:55 +01:00
Michal Mrozek
8f85d4b8f8
Add debug variable to override message simd.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2022-02-03 11:07:42 +01:00
Konstanty Misiak
65255ea38f
Allow pointer copy when copying from host to svm
...
Related-To: NEO-5737
Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>
2022-02-02 13:08:35 +01:00
Mateusz Hoppe
3e66f21df6
Remove incorrect Debug break
...
- kernels using debug surface does not need to use
only bindful buffers
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-01-28 13:51:02 +01:00
Maciej Plewka
f8c104feaa
Use fw declaration of IndirectHeap in CommandContainer
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2022-01-26 13:30:26 +01:00
Rafal Maziejuk
f064f7dd67
Check if origin + region doesn't exceed image dimensions
...
Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com>
Related-To: NEO-6137
2022-01-25 15:13:15 +01:00
Katarzyna Cencelewska
3d9e1ea3a5
Remove device enqueue part 11
...
- remove templates from queue functions
Related-To: NEO-6559
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-01-25 12:40:56 +01:00
Michal Mrozek
27c43b27f3
Remove not needed method.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2022-01-20 15:02:19 +01:00
Katarzyna Cencelewska
a06fbd2077
Remove device enqueue part 10
...
- remove DeviceQueue
Related-To: NEO-6559
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-01-19 17:41:06 +01:00
Katarzyna Cencelewska
97765cd39f
Remove device enqueue part 7
...
- mainly remove BlockKernelManager and ReflectionSurfaceHelper
Related-To: NEO-6559
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-01-19 10:41:57 +01:00
Michal Mrozek
a8b8a61c87
Fix marker calls in non profiling path.
...
When waiting , if we guard blit copy operation,
we need to wait for bcs as well.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2022-01-18 15:46:50 +01:00
Michal Mrozek
dd63f96836
Fix dependency resolution when marker with profiling is programmed.
...
We need to make sure it properly follows dependencies.
Before the change, code was not programming semaphores.
Related-to: NEO-6444
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2022-01-18 11:48:38 +01:00
Katarzyna Cencelewska
59683ec491
Remove device enqueue part 6
...
- isParentKernel, peekParentKernel, parentKernel
- structs: AUBParentKernelFixture, MockParentKernel,
ParentKernelCommandQueueFixture
Related-To: NEO-6559
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-01-17 13:56:29 +01:00
Katarzyna Cencelewska
d2818aaea2
Remove device enqueue part 5
...
-remove scheduler and builtin_kernels_simulation
Related-To: NEO-6559
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-01-13 14:15:26 +01:00
Katarzyna Cencelewska
a9ebb49fb5
Remove device enqueue part 1
...
Remove:
-tests with matcher for device enqueue
-classes: MockDeviceQueueHw, DeviceQueueHw, SchedulerSimulation,
DeviceQueueHwTest, KernelArgDevQueueTest
-functions: forceDispatchScheduler, processDeviceEnqueue, dispatchScheduler
Related-To: NEO-6559
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-01-11 15:29:49 +01:00
Dominik Dabek
6c40c712d8
Check workDim when enqueueing kernel
...
Return CL_INVALID_WORK_DIMENSION when workDim is more than
maxWorkItemDimensions.
Related-To: NEO-6159
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2021-12-31 15:42:09 +01:00
Filip Hazubski
5be4d89b73
Rename function
...
Rename MemorySynchronizationCommands::isDcFlushAllowed
to MemorySynchronizationCommands::getDcFlushEnable
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-30 17:03:22 +01:00
Lukasz Jobczyk
418a8854b2
Skip timestamp wait when enqueue non kernel operation
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-30 10:02:07 +01:00
Lukasz Jobczyk
95585a81f7
Optimize timestamp packet dependencies
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-30 09:48:36 +01:00
Lukasz Jobczyk
b59b0b6b36
Download timestamps before checking completion
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-28 08:14:27 +01:00
Lukasz Jobczyk
14e338e669
Revert "Optimize timestamp packet dependencies"
...
This reverts commit c365b422963917e7b882f9db985969c036f0fa3f.
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-23 14:29:29 +01:00
Filip Hazubski
f4c151cce5
Refactor PipeControlArgs struct
...
Remove struct PipeControlArgsBase
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-22 17:13:16 +01:00
Filip Hazubski
9a450d1b74
Pass hwInfo to appendMiFlushDw
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-22 15:22:47 +01:00
Filip Hazubski
0fd685541d
Add isDcFlushAllowed function to HwInfoConfig
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-21 18:29:43 +01:00
Filip Hazubski
6d439f88bb
Explicitly set dcFlushEnable value
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-21 12:21:11 +01:00
Mateusz Jablonski
66bf806018
Remove magic number from set/getBatchBufferStartAddressGraphicsaddress methods
...
rename methods to set/getBatchBufferStartAddress
Related-To: NEO-6466
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-12-16 19:03:01 +01:00
Lukasz Jobczyk
1f0c58d0bf
Refactor timestamp wait mechanism
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-16 13:22:19 +01:00
Bartosz Dunajski
fe8cb190bc
Remove deprecated code
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-12-14 11:36:10 +01:00
Zbigniew Zdanowicz
01348451db
Add multi tile barrier to marker command on multi tile device
...
Related-To: NEO-6262
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-12-13 18:14:39 +01:00
Filip Hazubski
64aec8dc4e
Rename functions
...
Rename getEngines to getAllEngines
Rename engines to allEngines
Rename getEngineGroups to getRegularEngineGroups
Rename engineGroups to regularEngineGroups
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-13 14:27:02 +01:00
Michal Mrozek
ebb16c8b74
Add new debug key to control dependency resolution.
...
Currently only supported scenario is one in order queue.
Instead of resolving dependencies via semaphores, do this with pipe controls.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2021-12-09 17:26:38 +01:00
Filip Hazubski
f137e927c5
Add EngineUsageHint debug variable
...
Related-To: NEO-6219
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-09 16:06:23 +01:00
Rafal Maziejuk
d5f3ac37bf
Add KernelExecutionType argument to encodeAdditionalWalkerFields method
...
Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com>
2021-12-08 12:00:42 +01:00
Lukasz Jobczyk
0c49fa8203
Add missing cache flush for blocked path
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-07 12:36:34 +01:00
Maciej Dziuban
e3bb526067
Optimize timestamp packet dependencies
...
- Clear dependencies even if last engine changed
- Do no program semaphore waiting for blit when blit is submitted with gpgpu
- Track barrier timestamps to correctly synchronize blits in OOQ
Related-To: NEO-6444
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-12-07 11:47:50 +01:00
Filip Hazubski
2098b0c3fa
Refactor Device::engineGroups to store only available engine groups
...
Simplify logic around engine groups.
Remove no longer needed code.
Ensure correct device is used when verifying engine groups.
Related-To: NEO-6219
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-06 18:27:31 +01:00
Lukasz Jobczyk
09d2ffb9ed
Add missing cache flush
...
Resolves: NEO-6505
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-06 17:28:12 +01:00
Lukasz Jobczyk
c455c0f9c6
Remove "Update Device::engineGroups"
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-03 10:16:40 +01:00
Lukasz Jobczyk
63116e4100
Use wait utils while waiting for timestamps
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-02 17:49:05 +01:00
Filip Hazubski
4461b8ea3f
Refactor: Update Device::engineGroups to store only available engine groups
...
Simplify logic around engine groups.
Remove no longer needed code.
Related-To: NEO-6219
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-02 15:10:22 +01:00
Lukasz Jobczyk
36efe72e3e
Add wait on timestamp mechanism
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-12-01 08:48:00 +01:00
Maciej Dziuban
457ef00abf
Reuse graphics allocations in svmMemcpy
...
Related-To: NEO-6352
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-11-17 14:03:05 +01:00
Lukasz Jobczyk
e8cc34d7b0
Add infrastructure to assign engine to command queue with round robin
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-11-15 13:13:40 +01:00
Maciej Plewka
bdf5a1c39c
Disable deviceEnqueue for BDW
...
Related-To: NEO-6378
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2021-11-15 12:49:59 +01:00
Maciej Dziuban
b8a03cdea1
Remove getAnyBcs() method
...
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-11-10 18:55:37 +01:00
Zbigniew Zdanowicz
61641bb70a
Add implicit barriers capability to opencl
...
Related-To: NEO-6262
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-11-10 08:19:50 +01:00
Compute-Runtime-Validation
908a7721b2
Revert "Assign engine to command queue using round robin algorithm"
...
This reverts commit 2fff0f9059
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2021-11-10 07:25:44 +01:00
Lukasz Jobczyk
2fff0f9059
Assign engine to command queue using round robin algorithm
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-11-09 11:05:16 +01:00
Compute-Runtime-Validation
9058de77cc
Revert "Assign engine to command queue using round robin algorithm"
...
This reverts commit 1c68ac1cbc
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2021-11-08 16:36:18 +01:00
Maciej Dziuban
4c4b37f8d2
Implement BCS selection heuristic for OpenCL CommandQueue
...
Related-To: NEO-6057
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-11-08 12:58:14 +01:00
Bartosz Dunajski
4bf4b170f1
Simplify WALKER_TYPE definition
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-11-04 17:09:06 +01:00
Lukasz Jobczyk
1c68ac1cbc
Assign engine to command queue using round robin algorithm
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-11-03 17:16:54 +01:00
Maciej Dziuban
a582f34c04
Reuse graphics allocations in readBufferRect/writeBufferRect
...
Related-To: NEO-6352
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-11-03 14:58:51 +01:00
Mateusz Jablonski
3a2b018eb6
Enable blitterOperationsSupported on DG1
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-10-28 14:16:19 +02:00
Kamil Diedrich
5fdcb14d57
WSL fallback to staging buffer on blocking reads
...
Signed-off-by: Kamil Diedrich <kamil.diedrich@intel.com>
2021-10-28 03:34:58 +02:00
Maciej Dziuban
f1b6b733f0
Reuse graphics allocation from mapped buffers in OpenCL
...
Related-To: NEO-6352
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-10-27 15:53:10 +02:00
Filip Hazubski
28b37aea72
Correct handling unique_ptr in functions
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-10-21 18:17:14 +02:00
Rafal Maziejuk
811d107ea9
Don't allow blit enqueue for images with 64KB tiling & 3D surface type
...
Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com>
Related-To: NEO-6142
2021-10-19 16:03:48 +02:00
Zbigniew Zdanowicz
24ff191e61
Verify completion on all tiles
...
Related-To: NEO-6244
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-10-15 15:43:28 +02:00
Katarzyna Cencelewska
1c8a6d895a
Use hwInfoConfig to check blitter support for image
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2021-10-13 17:51:00 +02:00
Maciej Dziuban
75b7adbe95
Skip transfer when CL_MAP_INVALIDATE_REGION is specified
...
Resolves: NEO-5490
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-10-13 17:13:33 +02:00
Michal Mrozek
4768be244b
Cache post syncs.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2021-10-13 15:57:13 +02:00
Maciej Dziuban
19a6c9b1d3
Track separate task count for each BCS in OpenCL CommandQueue
...
Related-To: NEO-6057
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
2021-10-13 12:06:27 +02:00