Compute-Runtime-Validation
00a1a14652
Revert "Fix multi thread usage of external host alloc"
...
This reverts commit 54eee2a88b.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-04-28 17:42:07 +02:00
Mateusz Jablonski
ffd1c430b1
feature: use completion fence value from direct submission when available
...
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-04-28 15:38:11 +02:00
Krzysztof Gibala
a3745c28a3
Create peekGmmHelper function in CSR
...
Related-To: NEO-6523
Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com>
2022-04-28 12:54:32 +02:00
Lukasz Jobczyk
72f6ef7d2b
Do not submit to gpgpu during blit in immediate dispatch mode
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-04-28 11:36:42 +02:00
Filip Hazubski
3900c9d24a
Report to StreamProperties whether large grf should be programmed with SCM
...
Add helper method to UnitTestHelper to query programmed grf values.
Related-To: NEO-6659
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-04-27 13:20:14 +02:00
Maciej Plewka
54eee2a88b
Fix multi thread usage of external host alloc
...
With this commit, OpenCL tracks whether external host memory is used from
multiple threads and ensures the task count is updated in all threads before
destroying the allocation.
Resolves: NEO-6807
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2022-04-26 15:31:24 +02:00
Bartosz Dunajski
192ffa4c26
Add support for allocating RT globals
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-04-25 17:56:09 +02:00
Mateusz Jablonski
03185f7111
feature direct submission: use tag allocation as a completion fence
...
use tag allocation address as a completion address in exec call
wait for completion value before destroying drm direct submission
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-04-25 13:30:55 +02:00
Mateusz Hoppe
96c668d595
L0 Debugger - Support for debugging immediate cmd lists
...
- add debugger support to imm cmd lists
- add debugger support to flushTask
Related-To: NEO-6845
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-04-22 14:43:41 +02:00
Mateusz Hoppe
5911515ed0
Refactor debugger code
...
- helper sets all SbaAddresses for debugger in EncodeStateBaseAddress<GfxFamily>::setSbaAddressesForDebugger()
- change DebuggerL0::captureStateBaseAddress() to take LinearStream
- move getSbaTrackingCommandsSize() to Debugger class
Related-To: NEO-6845
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-04-21 13:04:34 +02:00
Mateusz Jablonski
8f38f4ee3c
fix command stream receiver: stop calling virtual methods in dtor
...
make getCompletionAddress and getCompletionValue non-virtual methods
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-04-20 17:41:54 +02:00
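For context on commit 8f38f4ee3c above: a virtual call made from a base-class destructor never reaches a derived override, because the derived part of the object has already been destroyed at that point. The sketch below illustrates the language rule only. The types `Base` and `Derived` and the function `nameSeenByDestructor` are hypothetical and are not taken from the command stream receiver code.

```cpp
#include <string>

// Hypothetical types for illustration; not the NEO class hierarchy.
struct Base {
    explicit Base(std::string *log) : log(log) {}
    virtual ~Base() {
        // The Derived sub-object is already gone when this runs, so the
        // call below resolves to Base::name(), never to Derived::name().
        *log = name();
    }
    virtual std::string name() const { return "base"; }
    std::string *log;
};

struct Derived : Base {
    using Base::Base; // inherit the constructor taking the log pointer
    std::string name() const override { return "derived"; }
};

// Creates and destroys a Derived object, then reports which override
// the base destructor actually called.
std::string nameSeenByDestructor() {
    std::string log;
    {
        Derived d(&log);
    } // ~Derived runs first, then ~Base writes to log
    return log;
}
```

Here `nameSeenByDestructor()` returns "base". Making such accessors non-virtual, as the commit message describes, removes the risk of a destructor silently picking up the base version.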
Mateusz Jablonski
b5b8762faf
refactor direct submission: Pass command stream receiver to ctor
...
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-04-19 23:02:55 +02:00
Lukasz Jobczyk
6fdcb83020
Add missing tbx calls to timestamp wait logic
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-04-19 14:24:06 +02:00
Bartosz Dunajski
5f37aa4ba4
Pass cache settings to aub_stream
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-04-19 11:00:07 +02:00
Compute-Runtime-Validation
f6f5baf23a
Revert "Fix Scratch surface state programming on PVC"
...
This reverts commit 5b175ebb4a.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-04-19 10:53:22 +02:00
Dominik Dabek
8d1ad5a4f3
Refactor: use stack vector for root device indices
...
A stack vector avoids dynamic allocations in most circumstances,
i.e. when the number of root device indices is not more than 16.
Related-To: NEO-6837
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-04-14 14:05:42 +02:00
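The idea behind commit 8d1ad5a4f3 above can be sketched as a container that keeps up to a fixed number of elements in inline storage and only falls back to the heap beyond that. This is a minimal illustration under stated assumptions, not the repository's own stack vector: `SmallVector`, `makeIndices`, and `onHeap` are hypothetical names, and the element type is assumed to be default-constructible and copyable. The inline capacity of 16 mirrors the limit given in the commit message.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical small-buffer container; not the repository's implementation.
template <typename T, std::size_t InlineCapacity>
class SmallVector {
  public:
    void push_back(const T &value) {
        if (usesHeap) {
            heap.push_back(value);
        } else if (count < InlineCapacity) {
            inlineStorage[count] = value; // no dynamic allocation on this path
        } else {
            // Inline storage is full: copy it to the heap once, then append.
            heap.assign(inlineStorage.begin(), inlineStorage.end());
            heap.push_back(value);
            usesHeap = true;
        }
        ++count;
    }
    const T &operator[](std::size_t i) const {
        return usesHeap ? heap[i] : inlineStorage[i];
    }
    std::size_t size() const { return count; }
    bool onHeap() const { return usesHeap; }

  private:
    std::array<T, InlineCapacity> inlineStorage{};
    std::vector<T> heap;
    std::size_t count = 0;
    bool usesHeap = false;
};

// Builds a list of n consecutive indices, standing in for root device indices.
SmallVector<std::uint32_t, 16> makeIndices(std::uint32_t n) {
    SmallVector<std::uint32_t, 16> indices;
    for (std::uint32_t i = 0; i < n; ++i) {
        indices.push_back(i);
    }
    return indices;
}
```

With 16 or fewer indices the container in this sketch never touches the heap; the 17th element triggers the one-time spill.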
Lukasz Jobczyk
b67b1bd6fc
Download tag allocation only if it was submitted
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-04-13 16:30:47 +02:00
Bartosz Dunajski
fc4eaa1894
Refactor passing GlobalFenceAllocation to DirectSubmission
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-04-07 16:08:07 +02:00
Zbigniew Zdanowicz
fd45ac133d
Unify append calls
...
Related-To: NEO-6242
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-04-07 10:18:55 +02:00
Mateusz Hoppe
5b175ebb4a
Fix Scratch surface state programming on PVC
...
- program max num of threads in surface state
Resolves: NEO-6850
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-04-07 07:44:23 +02:00
Dominik Dabek
4044320301
Limit allocations in execute command list
...
Reserve space in ResidencyContainer
Related-To: NEO-6837
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-04-06 15:00:34 +02:00
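Commit 4044320301 above reserves space in ResidencyContainer to limit allocations. Assuming the container behaves like a `std::vector` (an assumption, the commit message does not say), the effect of reserving up front can be sketched as follows. The function `countReallocations` is a hypothetical helper that counts how often the capacity changes while elements are appended.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper: appends `elements` items and counts capacity changes,
// each of which corresponds to a reallocation of the vector's buffer.
std::size_t countReallocations(std::size_t elements, bool reserveFirst) {
    std::vector<int> container;
    if (reserveFirst) {
        container.reserve(elements); // one allocation up front
    }
    std::size_t reallocations = 0;
    std::size_t lastCapacity = container.capacity();
    for (std::size_t i = 0; i < elements; ++i) {
        container.push_back(static_cast<int>(i));
        if (container.capacity() != lastCapacity) {
            ++reallocations;
            lastCapacity = container.capacity();
        }
    }
    return reallocations;
}
```

After `reserve(n)`, appending up to `n` elements is guaranteed not to reallocate, so the reserved variant reports zero capacity changes during the loop, while the unreserved variant grows several times.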
Mateusz Jablonski
9d502dea25
Refactor waitOnCompletionFence method in DrmMemoryManager
...
get completion address and value from command stream receiver
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-03-31 19:30:16 +02:00
Dominik Dabek
308f9ccfff
Do not allocate dynamic state heap if not needed
...
Dynamic state heap is only used for sampler data.
Related-To: NEO-6821
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-03-31 12:13:13 +02:00
Jobczyk, Lukasz
a285712cc4
Add missing download allocation calls
...
Signed-off-by: Jobczyk, Lukasz <lukasz.jobczyk@intel.com>
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-31 09:49:22 +02:00
Bartosz Dunajski
08e3853982
Debug flag to add extra MI_MEM_FENCE for DirectSubmission
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-03-30 16:07:25 +02:00
Mateusz Hoppe
681c09d314
Extend LinearStream with gpuBase
...
- added methods to set and query gpu base address
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-03-29 15:15:53 +02:00
Jobczyk, Lukasz
d77a6cbe4b
Enable task count update from wait
...
Signed-off-by: Jobczyk, Lukasz <lukasz.jobczyk@intel.com>
2022-03-28 11:09:55 +02:00
Filip Hazubski
586e8510de
Remove unneeded include
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-26 09:55:25 +01:00
Lukasz Jobczyk
a230f267e1
Poll task count indefinitely on high throttle command queue
...
Resolves: NEO-6781
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-25 10:06:16 +01:00
Compute-Runtime-Validation
0c064ccf4c
Revert "Enable task count update from wait"
...
This reverts commit 5118a5d3a6.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-25 00:09:52 +01:00
Compute-Runtime-Validation
525470917f
Revert "Clean up resources in final derived csr class"
...
This reverts commit 737e6e4de5.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-24 19:12:20 +01:00
Krzysztof Gibala
ebc006ad53
Move SBA related WAs logic from CSR to EncodeWA
...
Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com>
2022-03-24 12:24:56 +01:00
Lukasz Jobczyk
737e6e4de5
Clean up resources in final derived csr class
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-24 11:41:22 +01:00
Zbigniew Zdanowicz
814b0ecb87
Improve wait function for Fence object
...
Related-To: NEO-6242
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-24 10:48:49 +01:00
Lukasz Jobczyk
5118a5d3a6
Enable task count update from wait
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-23 11:38:50 +01:00
Filip Hazubski
ef3dffca33
Remove unused function parameter
...
Remove parameter requiredThreadArbitrationPolicy
from PreambleHelper::programPreamble function.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-16 13:16:13 +01:00
Filip Hazubski
cd95572443
Reuse common logic of programming SCM fields for gen 9 and gen 11
...
Logic related to programming non-coherent and thread arbitration policy for
gen 9 and gen 11 has been moved to the EncodeComputeMode object, where similar
logic for gen12lp and newer is located.
Functions PreambleHelper::programThreadArbitration and
PreambleHelper::getThreadArbitrationCommandsSize have been removed.
A redundant setForceNonCoherent call has been removed from XE HPG.
Related-To: NEO-6728
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-16 10:04:32 +01:00
Filip Hazubski
32b0f7b014
Remove redundant value CsrSizeRequestFlags::numGrfRequiredChanged
...
Related-To: NEO-5995
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-15 15:08:15 +01:00
Filip Hazubski
3eab7009ac
Move SCM related WAs logic from CSR to EncodeComputeMode
...
This will help with unifying the logic between APIs and GENs.
Related-To: NEO-6728
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-11 14:00:53 +01:00
Bartosz Dunajski
c6e27bdc18
Enable prepatcher DirectSubmission WA
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-03-10 20:23:11 +01:00
Bartosz Dunajski
79fedd59e4
Add BB_START prepatching WA - disabled by default
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-03-10 17:05:53 +01:00
Patryk Wrobel
835b344968
Add debug flag to disable GPU hang detection
...
This change introduces a new flag called DisableGpuHangDetection.
By default the flag is off, so hang detection stays enabled. To disable
hang checking, set this flag to true.
Related-To: NEO-6681
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
2022-03-09 18:07:42 +01:00
Filip Hazubski
43ed1acc63
Update StateComputeModeProperties design
...
Related-To: NEO-6728
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-08 16:30:25 +01:00
Filip Hazubski
dd01cff879
Unify logic determining thread arbitration policy value
...
Related-To: NEO-6728
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-08 13:14:56 +01:00
Filip Hazubski
80b520bc9b
Change ThreadArbitrationPolicy enum type to int32_t
...
Change ThreadArbitrationPolicy::NotPresent value to -1
Update initial values to ThreadArbitrationPolicy::NotPresent
Related-To: NEO-6728
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-07 20:04:24 +01:00
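As a rough illustration of commit 80b520bc9b above, a signed underlying type lets -1 serve as a "not set" sentinel that cannot collide with a valid policy value. Only the names ThreadArbitrationPolicy and NotPresent, the int32_t type, and the value -1 come from the commit message. The other enumerators, the `sketch` namespace, `StreamState`, and `needsProgramming` are hypothetical and chosen for illustration.

```cpp
#include <cstdint>

namespace sketch {

// Signed underlying type so that -1 can act as the "no value" sentinel.
// AgeBased and RoundRobin are placeholder enumerators for this sketch.
enum ThreadArbitrationPolicy : std::int32_t {
    NotPresent = -1,
    AgeBased = 0,
    RoundRobin = 1,
};

// Hypothetical state holder: starts at NotPresent, matching the commit's
// note that initial values are updated to the sentinel.
struct StreamState {
    std::int32_t policy = NotPresent;
};

// Returns true when a concrete policy is requested and differs from what
// the state currently holds. A NotPresent request never needs programming.
inline bool needsProgramming(const StreamState &state, std::int32_t requested) {
    return requested != NotPresent && requested != state.policy;
}

} // namespace sketch
```

In this sketch a freshly constructed `StreamState` reports that any concrete request needs programming, while a `NotPresent` request is ignored.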
Michal Mrozek
f293c9ab25
Avoid ReadModifyWrite problem on devices with coherent L3.
...
Setting one of the bitfields requires a read from local memory, which is very slow.
This is not needed for devices that have a coherent L3.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2022-03-04 15:05:04 +01:00
Lukasz Jobczyk
999c6424a4
During blit enqueue, do not flush gpgpu if already flushed
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-03 13:01:57 +01:00
Michal Mrozek
8e94d568a8
Do not set dcFlush in Immediate dispatch mode.
...
Some devices do not need dcFlush.
Setting it prevents further optimization of pipe controls which
are not needed.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2022-03-02 13:20:28 +01:00
John Falkowski
22ed1be1a3
Add checks for mmap and getSpace
...
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2022-02-28 20:57:29 +01:00
Konstanty Misiak
cf1bc3a2ba
Disable EU fusion based on kernel properties from compiler
...
Related-To: NEO-6633
Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>
2022-02-28 18:50:38 +01:00