Bartosz Dunajski
e722afbefb
Track waitlist TimestampPackets to avoid too early return to the pool
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2022-05-12 13:54:04 +02:00
Lukasz Jobczyk
d63a044e60
Add infrastructure to request ULLS restart
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-05-11 14:33:30 +02:00
Artur Harasimiuk
e9be9b64c6
clang-tidy configuration cleanup
...
Define single .clang-tidy configuration with all used checks and use
NOLINT to selectively silence tool. That way cleanup should be easier.
third_part/ has its own configuration that disables clang-tidy for this
folder.
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com >
2022-05-11 14:02:04 +02:00
Zbigniew Zdanowicz
910871a706
add multi-tile support for completion diagnostic
...
Related-To: NEO-6871
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2022-05-11 13:37:32 +02:00
Krzysztof Gibala
2fcda0a528
Refactor: Change decanonize method accessing point
...
Accessing decanonize method as a member of GmmHelper class object
Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com >
2022-05-11 12:57:02 +02:00
Mateusz Jablonski
943ad0e1eb
style: skip redundant unique_ptr::get function
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-05-10 13:22:40 +02:00
Filip Hazubski
3413039a69
Add debug variable to control whether large grf should be programmed with SCM
...
Add debug variable ForceGrfNumProgrammingWithScm.
Do not update large grf value in StreamProperties when unnecessary.
Related-To: NEO-6659
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2022-05-06 15:45:46 +02:00
Maciej Plewka
0a16dc6c47
Fix multi thread usage of external host alloc
...
This is fixed reupload of this commit after auto revert
With this commit OpenCL will track if external host memory is used from
few threads and will secure to update task count in all threads before
destroing allocation.
Resolves: NEO-6807
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2022-05-05 13:32:15 +02:00
Compute-Runtime-Validation
00a1a14652
Revert "Fix multi thread usage of external host alloc"
...
This reverts commit 54eee2a88b .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2022-04-28 17:42:07 +02:00
Mateusz Jablonski
ffd1c430b1
feature: use completion fence value from direct submission when available
...
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-04-28 15:38:11 +02:00
Krzysztof Gibala
a3745c28a3
Create peekGmmHelper function in CSR
...
Related-To: NEO-6523
Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com >
2022-04-28 12:54:32 +02:00
Lukasz Jobczyk
72f6ef7d2b
Do not submit to gpgpu during blit in immediate dispatch mode
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-04-28 11:36:42 +02:00
Filip Hazubski
3900c9d24a
Report to StreamProperties whether large grf should be programmed with SCM
...
Add helper method to UnitTestHelper to query programmed grf values.
Related-To: NEO-6659
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2022-04-27 13:20:14 +02:00
Maciej Plewka
54eee2a88b
Fix multi thread usage of external host alloc
...
With this commit OpenCL will track if external host memory is used from
few threads and will secure to update task count in all threads before
destroing allocation.
Resolves: NEO-6807
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2022-04-26 15:31:24 +02:00
Bartosz Dunajski
192ffa4c26
Add support for allocating RT globals
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2022-04-25 17:56:09 +02:00
Mateusz Jablonski
03185f7111
feature direct submission: use tag allocation as a completion fence
...
use tag allocation address as a completion address in exec call
wait for completion value before destroying drm direct submission
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-04-25 13:30:55 +02:00
Mateusz Hoppe
96c668d595
L0 Debugger - Support for debugging immediate cmd lists
...
- add debugger support to imm cmd lists
- add debugger support to flushTask
Related-To: NEO-6845
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2022-04-22 14:43:41 +02:00
Mateusz Hoppe
5911515ed0
Refactor debugger code
...
- helper sets all SbaAddresses for debugger in
EncodeStateBaseAddress<GfxFamily>::setSbaAddressesForDebugger()
- change DebuggerL0::captureStateBaseAddress() to take
LinearStream
- move getSbaTrackingCommandsSize() to Debugger class
Related-To: NEO-6845
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2022-04-21 13:04:34 +02:00
Mateusz Jablonski
8f38f4ee3c
fix command stream receiver: stop calling virtual methods in dtor
...
make getCompletionAddress and getCompletionValue non-virtual methods
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-04-20 17:41:54 +02:00
Mateusz Jablonski
b5b8762faf
refactor direct submission: Pass command stream receiver to ctor
...
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-04-19 23:02:55 +02:00
Lukasz Jobczyk
6fdcb83020
Add missing tbx calls to timestamp wait logic
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-04-19 14:24:06 +02:00
Bartosz Dunajski
5f37aa4ba4
Pass cache settings to aub_stream
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2022-04-19 11:00:07 +02:00
Compute-Runtime-Validation
f6f5baf23a
Revert "Fix Scratch surface state programming on PVC"
...
This reverts commit 5b175ebb4a .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2022-04-19 10:53:22 +02:00
Dominik Dabek
8d1ad5a4f3
Refactor: use stack vector for root device indices
...
Stack vector will not cause dynamic allocations in most circumstances
ie. number of root device indices not more than 16
Related-To: NEO-6837
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2022-04-14 14:05:42 +02:00
Lukasz Jobczyk
b67b1bd6fc
Download tag allocation only if was submitted
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-04-13 16:30:47 +02:00
Bartosz Dunajski
fc4eaa1894
Refactor passing GlobalFenceAllocation to DirectSubmission
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2022-04-07 16:08:07 +02:00
Zbigniew Zdanowicz
fd45ac133d
Unify append calls
...
Related-To: NEO-6242
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2022-04-07 10:18:55 +02:00
Mateusz Hoppe
5b175ebb4a
Fix Scratch surface state programming on PVC
...
- program max num of threads in surface state
Resolves: NEO-6850
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2022-04-07 07:44:23 +02:00
Dominik Dabek
4044320301
Limit allocations in execute command list
...
Reserve space in ResidencyContainer
Related-To: NEO-6837
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2022-04-06 15:00:34 +02:00
Mateusz Jablonski
9d502dea25
Refactor waitOnCompletionFence method in DrmMemoryManager
...
get completion address and value from command stream receiver
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-03-31 19:30:16 +02:00
Dominik Dabek
308f9ccfff
Do not allocate dynamic state heap if not needed
...
Dynamic state heap is only used for sampler data.
Related-To: NEO-6821
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2022-03-31 12:13:13 +02:00
Jobczyk, Lukasz
a285712cc4
Add missing download allocation calls
...
Signed-off-by: Jobczyk, Lukasz <lukasz.jobczyk@intel.com >
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-03-31 09:49:22 +02:00
Bartosz Dunajski
08e3853982
Debug flag to add extra MI_MEM_FENCE for DirectSubmission
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2022-03-30 16:07:25 +02:00
Mateusz Hoppe
681c09d314
Extend LinearStream with gpuBase
...
- added methods to set and query gpu base address
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2022-03-29 15:15:53 +02:00
Jobczyk, Lukasz
d77a6cbe4b
Enable task count update from wait
...
Signed-off-by: Jobczyk, Lukasz <lukasz.jobczyk@intel.com >
2022-03-28 11:09:55 +02:00
Filip Hazubski
586e8510de
Remove unneeded include
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2022-03-26 09:55:25 +01:00
Lukasz Jobczyk
a230f267e1
Poll task count indefinitely on high throttle command queue
...
Resolves: NEO-6781
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-03-25 10:06:16 +01:00
Compute-Runtime-Validation
0c064ccf4c
Revert "Enable task count update from wait"
...
This reverts commit 5118a5d3a6 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2022-03-25 00:09:52 +01:00
Compute-Runtime-Validation
525470917f
Revert "Clean up resources in final derived csr class"
...
This reverts commit 737e6e4de5 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2022-03-24 19:12:20 +01:00
Krzysztof Gibala
ebc006ad53
Move SBA related WAs logic from CSR to EncodeWA
...
Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com >
2022-03-24 12:24:56 +01:00
Lukasz Jobczyk
737e6e4de5
Clean up resources in final derived csr class
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-03-24 11:41:22 +01:00
Zbigniew Zdanowicz
814b0ecb87
Improve wait function for Fence object
...
Related-To: NEO-6242
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2022-03-24 10:48:49 +01:00
Lukasz Jobczyk
5118a5d3a6
Enable task count update from wait
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-03-23 11:38:50 +01:00
Filip Hazubski
ef3dffca33
Remove unused function parameter
...
Remove parameter requiredThreadArbitrationPolicy
from PreambleHelper::programPreamble function.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2022-03-16 13:16:13 +01:00
Filip Hazubski
cd95572443
Reuse common logic of programming SCM fields for gen 9 and gen 11
...
Logic related to programming non coherent and thread arbitration policy for
gens 9 and 11 has been moved to EncodeComputeMode object, where similar
logic for gens gen12lp and newer is located.
Functions PreambleHelper::programThreadArbitration and
PreambleHelper::getThreadArbitrationCommandsSize have been removed.
Redundant setForceNonCoherent call has been removed from XE HPG
Related-To: NEO-6728
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2022-03-16 10:04:32 +01:00
Filip Hazubski
32b0f7b014
Remove redundant value CsrSizeRequestFlags::numGrfRequiredChanged
...
Related-To: NEO-5995
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2022-03-15 15:08:15 +01:00
Filip Hazubski
3eab7009ac
Move SCM related WAs logic from CSR to EncodeComputeMode
...
This will help with unifying the logic between APIs and GENs.
Related-To: NEO-6728
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2022-03-11 14:00:53 +01:00
Bartosz Dunajski
c6e27bdc18
Enable prepatcher DirectSubmission WA
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2022-03-10 20:23:11 +01:00
Bartosz Dunajski
79fedd59e4
Add BB_START prepatching WA - disabled by default
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2022-03-10 17:05:53 +01:00
Patryk Wrobel
835b344968
Add debug flag to disable GPU hang detection
...
This change introduces the new flag called DisableGpuHangDetection.
By default it is disabled. When someone wants to disable hang checking,
then this flag can be set to true.
Related-To: NEO-6681
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com >
2022-03-09 18:07:42 +01:00