compute-runtime/shared/source/command_stream
Zbigniew Zdanowicz d725d454ea performance: emit fence on immediate copy engine only for blocking calls
- this change handles level zero immediate command lists on copy engine
- monitor fence will be dispatched for blocking calls
- asynchronous mode will dispatch monitor fence only on host synchronization

Related-To: NEO-8395

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-09-22 13:14:37 +02:00
..
definitions performance: add new copy operations to state base address properties 2023-06-07 11:34:28 +02:00
CMakeLists.txt
aub_command_stream_receiver.cpp
aub_command_stream_receiver.h
aub_command_stream_receiver_hw.h fix: decanonize GPU VA during TBX memory read. 2023-07-26 19:44:19 +02:00
aub_command_stream_receiver_hw_base.inl fix: decanonize GPU VA during TBX memory read. 2023-07-26 19:44:19 +02:00
aub_command_stream_receiver_hw_bdw_and_later.inl
aub_command_stream_receiver_hw_xehp_and_later.inl
aub_subcapture_status.h
command_stream_receiver.cpp fix: track registered CSR clients 2023-09-22 09:42:55 +02:00
command_stream_receiver.h fix: track registered CSR clients 2023-09-22 09:42:55 +02:00
command_stream_receiver_hw.h performance(ocl): program barrier pc in taskStream 2023-09-19 11:48:02 +02:00
command_stream_receiver_hw_base.inl performance: emit fence on immediate copy engine only for blocking calls 2023-09-22 13:14:37 +02:00
command_stream_receiver_hw_bdw_and_later.inl performance(ocl): flag to not dcFlush on no event 2023-07-18 15:38:54 +02:00
command_stream_receiver_hw_dg2_and_later.inl fix: improve release helper 2023-05-15 14:30:15 +02:00
command_stream_receiver_hw_xehp_and_later.inl performance(ocl): flag to not dcFlush on no event 2023-07-18 15:38:54 +02:00
command_stream_receiver_simulated_common_hw.h fix: decanonize GPU VA during TBX memory read. 2023-07-26 19:44:19 +02:00
command_stream_receiver_simulated_common_hw_base.inl fix: decanonize GPU VA during TBX memory read. 2023-07-26 19:44:19 +02:00
command_stream_receiver_simulated_common_hw_bdw_and_later.inl
command_stream_receiver_simulated_common_hw_xehp_and_later.inl
command_stream_receiver_simulated_hw.h fix: decanonize GPU VA during TBX memory read. 2023-07-26 19:44:19 +02:00
command_stream_receiver_with_aub_dump.h
command_stream_receiver_with_aub_dump.inl feature: new multitile post sync layout for immediate write [1/n] 2023-06-07 13:11:10 +02:00
create_command_stream_impl.cpp
create_command_stream_impl.h
csr_definitions.h refactor: remove not needed code 2023-09-12 14:25:04 +02:00
csr_deps.cpp
csr_deps.h
csr_properties_flags.h
device_command_stream.h
experimental_command_buffer.cpp
experimental_command_buffer.h
experimental_command_buffer.inl feature: add indirect semaphore mode 2023-09-12 13:15:51 +02:00
linear_stream.cpp
linear_stream.h fix: commandContainer is nullptr in LinearStream for immediate cmdList 2023-07-24 15:06:18 +02:00
memory_compression_state.h
preemption.cpp
preemption.h refactor: remove LogicalStateHelper 2023-09-13 10:29:53 +02:00
preemption.inl refactor: remove LogicalStateHelper 2023-09-13 10:29:53 +02:00
preemption_mode.h
preemption_xehp_and_later.inl refactor: remove LogicalStateHelper 2023-09-13 10:29:53 +02:00
queue_throttle.h
scratch_space_controller.cpp
scratch_space_controller.h
scratch_space_controller_base.cpp
scratch_space_controller_base.h
scratch_space_controller_xehp_and_later.cpp feature: bindless global allocator with scratch 2023-08-03 13:24:59 +02:00
scratch_space_controller_xehp_and_later.h
stream_properties.cpp performance: add new copy operations to state base address properties 2023-06-07 11:34:28 +02:00
stream_properties.h [perf] reduce host overhead in command list reset call 2023-04-05 11:29:39 +02:00
stream_properties_extra.cpp [perf] reduce host overhead in command list reset call 2023-04-05 11:29:39 +02:00
stream_property.h [perf] simplify state transition for size properties 2023-04-04 10:41:36 +02:00
submission_status.h
submissions_aggregator.cpp refactor: add batch buffer attribute for explicit monitor fence dispatch 2023-09-20 12:18:32 +02:00
submissions_aggregator.h refactor: add batch buffer attribute for explicit monitor fence dispatch 2023-09-20 12:18:32 +02:00
tag_allocation_layout.h
task_count_helper.h
tbx_command_stream_receiver.cpp
tbx_command_stream_receiver.h
tbx_command_stream_receiver_hw.h fix: decanonize GPU VA during TBX memory read. 2023-07-26 19:44:19 +02:00
tbx_command_stream_receiver_hw.inl fix: do not download event allocation in TBX mode 2023-08-29 16:27:33 +02:00
tbx_command_stream_receiver_xehp_and_later.inl
tbx_stream.cpp
thread_arbitration_policy.h
transfer_direction.h
wait_status.h