Szymon Morek
d4c1631ac7
performance: don't wait for paging fence on user thread
...
Related-To: NEO-12197
Currently for new resources user thread must wait before submitting
actual workload. With this commit, instead of waiting on user thread,
request is sent to background ULLS controller thread and additional
semaphore is programmed. ULLS controller will perform actual wait
and signal semaphore when paging fence reaches required value.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-08-07 08:30:51 +02:00
Dominik Dabek
2b964254d6
performance: debug key for adjust ULLS on battery
...
ULLS controller timeout settings will be adjusted based on ac line
status and lowest queue throttle from submissions.
Lowest queue throttle is reset when controller stops ULLS.
Related-To: NEO-10800
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-03-22 14:24:00 +01:00
Lukasz Jobczyk
edd6fd5e6c
performance: Optimize ULLS start on submit path
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-12-29 15:21:49 +01:00
Compute-Runtime-Validation
1d02f7fdd9
Revert "performance: Optimize ULLS start on submit path"
...
This reverts commit ea78831e28 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-12-28 01:54:18 +01:00
Lukasz Jobczyk
ea78831e28
performance: Optimize ULLS start on submit path
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-12-27 13:55:30 +01:00
Maciej Plewka
560b588cbe
fix: Update residency fence value after ring buffer switch
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2023-11-17 20:46:14 +01:00
Zbigniew Zdanowicz
20c3f45998
refactor: add ulls diagnostic flag to select monitor fence input
...
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-11-06 15:09:36 +01:00
Zbigniew Zdanowicz
e0ce08bb77
fix: detect gpu hang or page fault at direct submission flush to gpu
...
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-11-06 14:22:02 +01:00
Venevtsev, Igor
50b2262339
fix: stop direct submission blocking on platform destruction
...
Resolves: GSD-6341
Signed-off-by: Venevtsev, Igor <igor.venevtsev@intel.com >
2023-10-06 17:03:38 +02:00
Zbigniew Zdanowicz
7dfd3e5e59
performance: use dedicated flag to dispatch monitor fence
...
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-09-21 13:25:06 +02:00
Zbigniew Zdanowicz
a16d8f7b88
refactor: change direct submission interfaces to accept monitor fence
...
- unify Linux and Windows default settings
- unify override default code
- correct size estimation when fence is required
- call virtual function once for both estimation and dispatch
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-09-20 13:17:12 +02:00
Lukasz Jobczyk
d3355411b8
performance: Improve dispatch monitor fence from wait
...
-update last submitted value while flush monitor fence
-dispatch monitor fence with workload when stalling cmds present
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-08-30 16:15:32 +02:00
Lukasz Jobczyk
e70f441f52
fix: Idle gpu before invalidate aux table
...
Related-To: NEO-8067
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-07-05 13:51:27 +02:00
Lukasz Jobczyk
c294ef48ce
fix: Make tlb flush os agnostic
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-06-30 14:12:59 +02:00
Dunajski, Bartosz
5fe9d70066
feature: new multitile post sync layout for immediate write [1/n]
...
No functional changes in this commit. This is prework.
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-07 13:11:10 +02:00
Lukasz Jobczyk
853a65aae9
Add PCI barrier implementation
...
Resolves: NEO-7850
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-04-24 10:33:03 +02:00
Dunajski, Bartosz
97a35debde
RelaxedOrdering dispatch for OCL 1/n
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-03-13 17:37:12 +01:00
Lukasz Jobczyk
2f5be7a48d
Copy command buffer into ring buffer
...
Resolves: NEO-7422
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-02-22 16:37:34 +01:00
Dunajski, Bartosz
6ebdc51fae
Dynamic queue size limit in RelaxedOrdering mode
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-02-06 12:02:02 +01:00
Kamil Kopryk
27393c76ea
refactor: don't use global ProductHelper getter 11/n
...
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2023-01-26 08:16:52 +01:00
Dunajski, Bartosz
3f962bf3e8
RelaxedOrdering: Improve dependencies tracking
...
Avoid not needed scheduler programming
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2022-11-28 16:00:21 +01:00
Dunajski, Bartosz
a7d4162ca2
RelaxedOrdering: Preallocate scheduler to optimize dispatch time
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2022-11-28 12:42:44 +01:00
Dunajski, Bartosz
a969636b6a
RelaxedOrdering: Optimize GPU Queue stall by adding early return
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2022-11-24 16:41:05 +01:00
Dunajski, Bartosz
89b96e5e8f
Introduce initial implementation of DirectSubmission relaxed ordering mode.
...
Initial implementation of task store section
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2022-11-17 12:23:29 +01:00
Zbigniew Zdanowicz
87822f94e2
Replace virtual method call for DC flush with stored bool value 2/n
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2022-10-12 09:43:01 +02:00
Lukasz Jobczyk
24ff26c396
Allocate new ring buffer if all are in use
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-06-10 12:45:49 +02:00
Mateusz Jablonski
03185f7111
feature direct submission: use tag allocation as a completion fence
...
use tag allocation address as a completion address in exec call
wait for completion value before destroying drm direct submission
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-04-25 13:30:55 +02:00
Mateusz Jablonski
4cb46ee15c
Refactor direct submission: remove csr from members
...
create translation layer to obtain needed params
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-04-20 17:20:17 +02:00
Mateusz Jablonski
b5b8762faf
refactor direct submission: Pass command stream receiver to ctor
...
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-04-19 23:02:55 +02:00
Bartosz Dunajski
db9c0d1103
Refactor and enable MI_MEM_FENCE programming for DirectSubmission dispatch
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2022-04-07 12:53:56 +02:00
Mateusz Jablonski
3ac0581f2e
Add debug flags to read back cmd/ring buffer before unblocking semaphore
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-03-25 20:36:29 +01:00
Zbigniew Zdanowicz
b6da6471f2
Add notify parameter to direct submission post sync operations
...
Related-To: NEO-5845
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2021-12-23 17:35:48 +01:00
Zbigniew Zdanowicz
3e1023fa1a
Unify memory layout for all multi tile post sync operations
...
Related-To: NEO-6262
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2021-12-02 18:00:40 +01:00
Zbigniew Zdanowicz
7ea0a11c0a
Unify programming of partition registers
...
Related-To: NEO-6262
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2021-11-18 16:52:51 +01:00
Zbigniew Zdanowicz
60805cdbcf
Fix direct submission wait on multi tile device using single tile context
...
Related-To: NEO-6244
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2021-10-28 14:01:58 +02:00
Zbigniew Zdanowicz
f0d32ed5f7
Fix multi tile synchronization in direct submission
...
Related-To: NEO-6244
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2021-10-13 16:13:20 +02:00
Zbigniew Zdanowicz
7b95dc4be4
Add partitioned post sync for completion fence of direct submission
...
Related-To: NEO-6244
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2021-10-12 14:57:29 +02:00
Mateusz Hoppe
f2eb7f3aea
Fix DirectSubmission residency handling
...
- allocations should be resident within OsContext
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2021-09-03 16:57:42 +02:00
Pawel Cieslak
8a700c5187
Move non-ult shared files to single directory
...
Add SKIP_SHARED_UNIT_TESTS flag
Related-To: NEO-5201
Signed-off-by: Pawel Cieslak <pawel.cieslak@intel.com >
2021-02-10 21:03:13 +01:00