Lukasz Jobczyk
edd6fd5e6c
performance: Optimize ULLS start on submit path
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-12-29 15:21:49 +01:00
Compute-Runtime-Validation
1d02f7fdd9
Revert "performance: Optimize ULLS start on submit path"
...
This reverts commit ea78831e28
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-12-28 01:54:18 +01:00
Lukasz Jobczyk
ea78831e28
performance: Optimize ULLS start on submit path
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-12-27 13:55:30 +01:00
Mateusz Jablonski
a6458433dc
refactor: correct naming of enum class constants 9/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 10:36:25 +01:00
Zbigniew Zdanowicz
4f9fe6bb53
fix: detect gpu reset on windows direct submission
...
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-12-01 14:13:14 +01:00
Zbigniew Zdanowicz
fdcf7f6793
performance: enable new dispatch monitor fence policy in direct submission
...
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-11-28 17:21:46 +01:00
Maciej Plewka
560b588cbe
fix: Update residency fence value after ring buffer switch
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-11-17 20:46:14 +01:00
Katarzyna Cencelewska
fbe61cf547
fix: add kmd wait on cpu when wait on paging fence when needed
...
add storing of throttle hint value for direct submission
Related-To: NEO-9123
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2023-11-09 19:22:15 +01:00
Zbigniew Zdanowicz
20c3f45998
refactor: add ulls diagnostic flag to select monitor fence input
...
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-11-06 15:09:36 +01:00
Zbigniew Zdanowicz
e0ce08bb77
fix: detect gpu hang or page fault at direct submission flush to gpu
...
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-11-06 14:22:02 +01:00
Zbigniew Zdanowicz
7dfd3e5e59
performance: use dedicated flag to dispatch monitor fence
...
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-09-21 13:25:06 +02:00
Zbigniew Zdanowicz
a16d8f7b88
refactor: change direct submission interfaces to accept monitor fence
...
- unify Linux and Windows default settings
- unify override default code
- correct size estimation when fence is required
- call virtual function once for both estimation and dispatch
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-09-20 13:17:12 +02:00
Dunajski, Bartosz
7562842a58
refactor: remove LogicalStateHelper
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-13 10:29:53 +02:00
Lukasz Jobczyk
d3355411b8
performance: Improve dispatch monitor fence from wait
...
-update last submitted value while flush monitor fence
-dispatch monitor fence with workload when stalling cmds present
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-08-30 16:15:32 +02:00
Lukasz Jobczyk
fc1f45b630
feature: Dispatch monitor fence from wait
...
Resolves: NEO-8240
Related-To: NEO-8067
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-08-08 16:42:27 +02:00
Lukasz Jobczyk
e70f441f52
fix: Idle gpu before invalidate aux table
...
Related-To: NEO-8067
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-07-05 13:51:27 +02:00
Lukasz Jobczyk
c294ef48ce
fix: Make tlb flush os agnostic
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-06-30 14:12:59 +02:00
Dunajski, Bartosz
5fe9d70066
feature: new multitile post sync layout for immediate write [1/n]
...
No functional changes in this commit. This is prework.
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-06-07 13:11:10 +02:00
Fabian Zwolinski
cbce863dc2
refactor: Rename member variables to camelCase 3/n
...
Additionally enable clang-tidy check for member variables
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-04-28 16:01:14 +02:00
Lukasz Jobczyk
853a65aae9
Add PCI barrier implementation
...
Resolves: NEO-7850
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-04-24 10:33:03 +02:00
Zhenjie Pan
1ce269a9dd
fix: wrong return value of zeCommandQueueExecuteCommandLists when OOM
...
zeCommandQueueExecuteCommandLists return ZE_RESULT_ERROR_UNKNOWN when OOM
in some scenario of direct submission.
Related-To: NEO-7840
Signed-off-by: Pan Zhenjie <zhenjie.pan@intel.com>
2023-03-28 09:52:17 +02:00
Dunajski, Bartosz
e49e245bec
Revert "Disable RelaxedOrdering if UpdateTagFromWait is disabled"
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-03-27 11:47:10 +02:00
Dunajski, Bartosz
151aecc8bd
Disable RelaxedOrdering if UpdateTagFromWait is disabled
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-03-22 18:15:39 +01:00
Lukasz Jobczyk
2f5be7a48d
Copy command buffer into ring buffer
...
Resolves: NEO-7422
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-02-22 16:37:34 +01:00
Dunajski, Bartosz
6ebdc51fae
Dynamic queue size limit in RelaxedOrdering mode
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-02-06 12:02:02 +01:00
Warchulski, Jaroslaw
e21cf516d3
Cleanup includes 44
...
Cleaned up files:
level_zero/tools/source/sysman/windows/os_sysman_imp.h
opencl/source/command_queue/command_queue.h
shared/source/command_container/cmdcontainer.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-25 16:05:22 +01:00
Kamil Kopryk
a4eba996f7
refactor: don't use global ProductHelper getter 4/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-20 10:27:27 +01:00
Warchulski, Jaroslaw
a2fe929f0c
Cleanup includes 18
...
Cleaned up files:
shared/source/command_stream/command_stream_receiver_hw.h
shared/source/compiler_interface/compiler_interface.h
shared/source/direct_submission/direct_submission_hw.h
shared/source/helpers/dirty_state_helpers.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-02 13:28:29 +01:00
Kamil Kopryk
6993ed5c52
Reduce binaries sizes 2/n
...
Observed about 50MB reduction in overall binaries size (directory build))
when building all targets
with MSVC (Visual Studio 2022 17.3.0 preview 6)
using Debug 64 configuration.
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-08 16:14:38 +01:00
Dunajski, Bartosz
3b91cc1b50
Enable RelaxedOrdering for XE_HPC Immediate cmd lists
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
Related-To: NEO-7458
2022-12-08 10:12:40 +01:00
Dunajski, Bartosz
2c08f2ca8c
RelaxedOrdering: Optimize return ptr programming
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-11-29 13:22:25 +01:00
Maciej Plewka
4b42b066f8
Use dedicated using type for TaskCount
...
Related-To: NEO-7155
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2022-11-28 16:44:44 +01:00
Dunajski, Bartosz
3f962bf3e8
RelaxedOrdering: Improve dependencies tracking
...
Avoid not needed scheduler programming
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-11-28 16:00:21 +01:00
Dunajski, Bartosz
a7d4162ca2
RelaxedOrdering: Preallocate scheduler to optimize dispatch time
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-11-28 12:42:44 +01:00
Dunajski, Bartosz
a969636b6a
RelaxedOrdering: Optimize GPU Queue stall by adding early return
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-11-24 16:41:05 +01:00
Dunajski, Bartosz
bc619fcbec
Queue stall mode for RelaxedOrdering
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-11-23 12:09:52 +01:00
Dunajski, Bartosz
e050d231b9
RelaxedOrdering: Add support for return pointer registers programming
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-11-23 09:58:33 +01:00
Dunajski, Bartosz
e6c1658bae
Relaxed ordering scheduler section for DirectSubmission
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-11-21 10:56:26 +01:00
Warchulski, Jaroslaw
f35f59b573
Cleanup includes 5
...
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2022-11-18 22:46:38 +01:00
Dunajski, Bartosz
89b96e5e8f
Introduce initial implementation of DirectSubmission relaxed ordering mode.
...
Initial implementation of task store section
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2022-11-17 12:23:29 +01:00
Lukasz Jobczyk
5011046a17
Add sfence when stop ring buffer
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-11-09 20:02:03 +01:00
Zbigniew Zdanowicz
87822f94e2
Replace virtual method call for DC flush with stored bool value 2/n
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-10-12 09:43:01 +02:00
Rafal Maziejuk
5e58104f5a
Add flag to control prefetcher disabling behaviour
...
Certain platforms might not require prefetcher to
be disabled in direct submission. This change
provides a way to control that behaviour.
Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com>
Related-To: NEO-7218
2022-08-16 16:01:30 +02:00
Lukasz Jobczyk
880464da77
Apply additional synchronization WA to DG2 ULLS
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-07-06 13:37:56 +02:00
Bartosz Dunajski
61b2ee45cd
Use LogicalStateHelper to encode SystemMemoryFence
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-06-24 13:29:58 +02:00
Lukasz Jobczyk
24ff26c396
Allocate new ring buffer if all are in use
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-06-10 12:45:49 +02:00
Daniel Chabrowski
7463e1970b
Cleanup headers
...
Make TUs and headers self-contained, remove unused headers
Signed-off-by: Daniel Chabrowski <daniel.chabrowski@intel.com>
2022-05-18 11:42:06 +02:00
Mateusz Jablonski
ffd1c430b1
feature: use completion fence value from direct submission when available
...
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-04-28 15:38:11 +02:00
Mateusz Jablonski
03185f7111
feature direct submission: use tag allocation as a completion fence
...
use tag allocation address as a completion address in exec call
wait for completion value before destroying drm direct submission
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-04-25 13:30:55 +02:00
Mateusz Jablonski
4cb46ee15c
Refactor direct submission: remove csr from members
...
create translation layer to obtain needed params
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-04-20 17:20:17 +02:00