6ad16c9e72
Missing support for OCL debugging with L0
...
- make resident debugSurface
- allocate debugSurface with correct allocation type and size
- notify cmdQ create/destroy
Related-To: NEO-7075
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2022-09-07 13:30:05 +02:00
98d776867f
Add initial support for KernelArgsBuffer allocation
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2022-08-03 20:28:21 +02:00
d4d54f5093
Cleanup includes
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2022-07-25 09:58:38 +02:00
4ac6e09117
Do not reuse mem obj's allocation
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-07-19 11:42:17 +02:00
76e023b941
Link build option with L1 cache policy helper
...
Related-To: NEO-7003
Add L1CachePolicyHelper struct.
This struct is resposible for L1 cache policy
in build option, Surface State and stateless
caching. Currently default option for all
platforms is WBP (write by-pass)
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2022-07-04 11:49:55 +02:00
f4485ec541
Use LogicalStateHelper for SIP programming
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2022-06-22 13:46:57 +02:00
620bb970f4
Resubmit fix for task count hang
...
Signed-off-by: John Falkowski <john.falkowski@intel.com >
2022-06-07 16:28:17 +02:00
359e848dbe
Revert "Fix for task count hang issue"
...
This reverts commit 340ba8bf56
.
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com >
2022-05-25 20:34:00 +02:00
ddd8a08fac
Add mechanism to avoid calling gdi calls while process exit
...
Signed-off-by: Kamil Diedrich <kamil.diedrich@intel.com >
2022-05-20 11:11:52 +02:00
340ba8bf56
Fix for task count hang issue
...
Signed-off-by: John Falkowski <john.falkowski@intel.com >
2022-05-18 19:35:12 +02:00
910871a706
add multi-tile support for completion diagnostic
...
Related-To: NEO-6871
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2022-05-11 13:37:32 +02:00
943ad0e1eb
style: skip redundant unique_ptr::get function
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-05-10 13:22:40 +02:00
0a16dc6c47
Fix multi thread usage of external host alloc
...
This is fixed reupload of this commit after auto revert
With this commit OpenCL will track if external host memory is used from
few threads and will secure to update task count in all threads before
destroing allocation.
Resolves: NEO-6807
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2022-05-05 13:32:15 +02:00
00a1a14652
Revert "Fix multi thread usage of external host alloc"
...
This reverts commit 54eee2a88b
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2022-04-28 17:42:07 +02:00
ffd1c430b1
feature: use completion fence value from direct submission when available
...
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-04-28 15:38:11 +02:00
a3745c28a3
Create peekGmmHelper function in CSR
...
Related-To: NEO-6523
Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com >
2022-04-28 12:54:32 +02:00
54eee2a88b
Fix multi thread usage of external host alloc
...
With this commit OpenCL will track if external host memory is used from
few threads and will secure to update task count in all threads before
destroing allocation.
Resolves: NEO-6807
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2022-04-26 15:31:24 +02:00
192ffa4c26
Add support for allocating RT globals
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2022-04-25 17:56:09 +02:00
8f38f4ee3c
fix command stream receiver: stop calling virtual methods in dtor
...
make getCompletionAddress and getCompletionValue non-virtual methods
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-04-20 17:41:54 +02:00
b5b8762faf
refactor direct submission: Pass command stream receiver to ctor
...
Related-To: NEO-6643
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-04-19 23:02:55 +02:00
6fdcb83020
Add missing tbx calls to timestamp wait logic
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-04-19 14:24:06 +02:00
8d1ad5a4f3
Refactor: use stack vector for root device indices
...
Stack vector will not cause dynamic allocations in most circumstances
ie. number of root device indices not more than 16
Related-To: NEO-6837
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2022-04-14 14:05:42 +02:00
b67b1bd6fc
Download tag allocation only if was submitted
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-04-13 16:30:47 +02:00
4044320301
Limit allocations in execute command list
...
Reserve space in ResidencyContainer
Related-To: NEO-6837
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2022-04-06 15:00:34 +02:00
a285712cc4
Add missing download allocation calls
...
Signed-off-by: Jobczyk, Lukasz <lukasz.jobczyk@intel.com >
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-03-31 09:49:22 +02:00
a230f267e1
Poll task count indefinitely on high throttle command queue
...
Resolves: NEO-6781
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-03-25 10:06:16 +01:00
814b0ecb87
Improve wait function for Fence object
...
Related-To: NEO-6242
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2022-03-24 10:48:49 +01:00
835b344968
Add debug flag to disable GPU hang detection
...
This change introduces the new flag called DisableGpuHangDetection.
By default it is disabled. When someone wants to disable hang checking,
then this flag can be set to true.
Related-To: NEO-6681
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com >
2022-03-09 18:07:42 +01:00
7f729b7f89
Detect GPU hang in clWaitForEvents
...
This change:
- moves NEO::WaitStatus to a separate file
- enables detection of GPU hang in clWaitForEvents
- adjusts most of blocking calls in CommandStreamReceiver to return WaitStatus
- adds ULTs to cover the new code
Related-To: NEO-6681
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com >
2022-02-23 13:33:09 +01:00
6a111e41ff
Improve submission logging
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2022-02-11 17:36:14 +01:00
18cafd3a52
Implement GPU hang detection on Windows
...
This change uses value of cpuAddress from monitored fence
to detect GPU hang.
Related-To: NEO-5313
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com >
2022-02-09 17:22:52 +01:00
b60d963ff5
Add debug variable to force default heap allocation size
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-02-09 13:27:05 +01:00
538e0aea87
Add debug flag to bind at creation time
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2022-02-09 09:26:02 +01:00
4b0d986876
Move AllocationType enum out of GraphicsAllocation class
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2022-02-04 17:49:09 +01:00
498cf5e871
Implement GPU hang detection
...
This change uses DRM_IOCTL_I915_GET_RESET_STATS to detect
GPU hangs. When such situation is encountered, then
zeCommandQueueSynchronize returns ZE_RESULT_ERROR_DEVICE_LOST.
Related-To: NEO-5313
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com >
2022-01-31 13:48:17 +01:00
f8c104feaa
Use fw declaration of IndirectHeap in CommandContainer
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2022-01-26 13:30:26 +01:00
6df17f5a30
[3/n] Optimize indirect allocations handling.
...
Add new debug variable to trigger new mode.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2022-01-25 16:40:56 +01:00
e29a85ebb3
Use ImmediateDispatch mode for L0 command queues
...
Related-To: LOCI-1988
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com >
2022-01-21 00:23:04 +01:00
4238679078
Refactor implicit scaling device support
...
Related-To: NEO-6589
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2022-01-18 13:08:43 +01:00
394c0e90e1
Return error when failing on submission
...
Signed-off-by: Raiyan Latif <raiyan.latif@intel.com >
2022-01-12 16:42:30 +01:00
34856747b4
Add implementation for reading logical subDevice from builtin
...
Related-To: NEO-6258
Signed-off-by: Kamil Diedrich <kamil.diedrich@intel.com >
2022-01-11 13:06:15 +01:00
2bbe7438ce
Revert "Add implementation for reading logical subDevice from builtin"
...
This reverts commit 901e1e71f95a8cd90611ed7cbd25705a14f38170.
Signed-off-by: Kamil Diedrich <kamil.diedrich@intel.com >
2021-12-27 15:58:51 +01:00
1f0c58d0bf
Refactor timestamp wait mechanism
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2021-12-16 13:22:19 +01:00
3511b4755f
Remove redundant coherency variables for SCM
...
Remove CommandStreamReceiver::lastSentCoherencyRequest
Remove CsrSizeRequestFlags::coherencyRequestChanged
Related-To: NEO-5995
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2021-12-09 14:43:21 +01:00
24c19509b3
Add implementation for reading logical subDevice from builtin
...
- add function for resolving builtins using relocations
- add relocation for __SubDeviceID symbol
Related-To: NEO-6258
Signed-off-by: Kamil Diedrich <kamil.diedrich@intel.com >
2021-12-09 13:20:49 +01:00
8b51358054
In level zero create single tag allocation for csr
...
Multi tag allocation is useful only in openCL to ensure cross root
device synchronization based on tag address.
It is not required in level zero.
Futher multi tag allocation is causing instability in sysman
device reset. So, for level zero instead of multi tag allocation
create single tag allocation.
Related-To: LOCI-2651
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com >
2021-12-06 15:16:34 +01:00
3e1023fa1a
Unify memory layout for all multi tile post sync operations
...
Related-To: NEO-6262
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2021-12-02 18:00:40 +01:00
36efe72e3e
Add wait on timestamp mechanism
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2021-12-01 08:48:00 +01:00
995cb88bfa
Improve ftr/wa flags packing
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2021-11-25 16:05:57 +01:00
f56773d166
Setup partition registers once per context
...
Related-To: NEO-6262
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2021-11-22 22:41:09 +01:00