Commit Graph

5874 Commits

Author SHA1 Message Date
Zbigniew Zdanowicz
db9d0c9a58 fix: remove heap resource leak after command list reset
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-02-20 14:38:19 +01:00
Dunajski, Bartosz
915b19b0be refactor: remove not needed debug_break_if
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-19 16:28:58 +01:00
Michal Mrozek
27f4eab52f fix: restore previous order of variables
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
Resolves: NEO-10439
2024-02-19 14:13:54 +01:00
Dunajski, Bartosz
f31c88d05f fix: size estimation for tlb flush before copy
Related-To: HSD-18036669673

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-19 13:35:35 +01:00
Dominik Dabek
07639401c5 performance: enable pat index, mtl linux
Enable programming pat indexes on mtl linux by default.

Related-To: NEO-7896

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-16 18:31:21 +01:00
Dominik Dabek
0120d8a58d performance: program pat index on mtl linux
Enable programming pat indexes on mtl linux for device buffers.

Change DrmMemoryManager::allocateMemoryByKMD to use gemCreateExt.

Set mmap flags based on coherency.
Map as write back on legacy and coherent.
On non-coherent map as write combined.

Changes currently disabled, to enable use debug keys:
DisableGemCreateExtSetPat=0
UseGemCreateExtInAllocateMemoryByKMD=1

Reorder BufferObject to decrease padding.

Related-To: NEO-7896

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-16 17:33:07 +01:00
Dunajski, Bartosz
be7b80746b fix: update incorrect debug_break
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-16 16:55:59 +01:00
Kamil Kopryk
0c5cba8ebd refactor: use common indirectDataAlignment static constexpr value
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-16 13:55:00 +01:00
Lukasz Jobczyk
cc1732c930 performance: make resident before lock
Resolves: NEO-10369

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-16 12:00:38 +01:00
Compute-Runtime-Validation
4d159bd06c Revert "performance: make resident before lock"
This reverts commit 7e7fac01e3.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-16 09:23:49 +01:00
Dominik Dabek
ed011de03e performance: program pat index on mtl linux
Enable programming pat indexes on mtl linux for device buffers.

Change DrmMemoryManager::allocateMemoryByKMD to use gemCreateExt.

Changes currently disabled, can be enabled with flag
DisableGemCreateExtSetPat=0

Related-To: NEO-7896

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-15 17:15:28 +01:00
Filip Hazubski
405e9d220a fix: Add RPL device ids
Add device IDs: 0xA7AA, 0xA7AB, 0xA7AC, 0xA7AD.

Resolves: NEO-8274

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-02-15 12:34:47 +01:00
Lukasz Jobczyk
7e7fac01e3 performance: make resident before lock
Resolves: NEO-10369

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-15 11:56:36 +01:00
Michal Mrozek
25b3a3dbfa fix: user proper settings for caches.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-02-15 11:38:34 +01:00
Chodor, Jaroslaw
b58717b9e3 feature: Add support for legacy acronyms in ocloc's fatbinary
Expands support for deprecated acronyms to fatbinary. Previously,
these were allowed only in single-target builds.

Related-To: NEO-10190

Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2024-02-15 10:14:49 +01:00
Compute-Runtime-Validation
57c946b61c Revert "fix: align indirect data pointer to cacheline size in heapless mode"
This reverts commit 004e6e647f.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-15 05:49:19 +01:00
Compute-Runtime-Validation
d22b2a8a38 Revert "feature: add support for the cl_khr_extended_bit_ops extension"
This reverts commit 6dec2143df.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-15 05:34:10 +01:00
Compute-Runtime-Validation
9570d02134 Revert "fix: correctly report support for SPIR-V 1.0 through 1.3"
This reverts commit 5120ec2f93.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-15 04:00:31 +01:00
Compute-Runtime-Validation
7b340775c6 Revert "performance: program pat index on mtl linux"
This reverts commit 8e0b23db84.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-15 02:06:03 +01:00
Katarzyna Cencelewska
3d51832498 test: add preemption field to mock objects
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-02-14 21:49:18 +01:00
Dunajski, Bartosz
88c5872682 feature: debug flag to flush tlb before copy
Related-To: HSD-18036669673

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-14 20:05:57 +01:00
Dominik Dabek
8e0b23db84 performance: program pat index on mtl linux
Enable programming pat indexes on mtl linux for device buffers.

Change DrmMemoryManager::allocateMemoryByKMD to use gemCreateExt.

Related-To: NEO-7896

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-14 18:42:04 +01:00
Kacper Nowak
cccd9a3703 refactor: Add method for retrieving extra device info
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-02-14 16:38:15 +01:00
Kamil Kopryk
004e6e647f fix: align indirect data pointer to cacheline size in heapless mode
Align indirect data pointer to cacheline size in heapless mode,
restore debug_break_if if avx2 load/store operation
gets unaligned pointer,
remove fallback to mm256 loadu/storeu unaligned operation

Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-14 10:00:15 +01:00
Ben Ashbaugh
5120ec2f93 fix: correctly report support for SPIR-V 1.0 through 1.3
Related-To: NEO-10336

Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2024-02-14 09:21:06 +01:00
Ben Ashbaugh
6dec2143df feature: add support for the cl_khr_extended_bit_ops extension
Related-To: NEO-10337

Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2024-02-14 09:11:44 +01:00
Maciej Plewka
233f898b59 fix: split freed chunk from right
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>

Related-To: NEO-10280
2024-02-13 16:14:39 +01:00
Jitendra Sharma
ff604bda1b feature: Implement thread control for debugger in XE
Related-To: NEO-9668
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-02-12 19:46:28 +01:00
Mateusz Jablonski
bb5f6d9660 fix: don't query vm bind support on i915 prelim for pre-Xe platforms
Related-To: HSD-18036843571
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-09 12:50:48 +01:00
Brandon Yates
ea7ae7564f feature: Implement read/writeGpuMemory for Xe debugger
- makes prelim read/writeGpuMemory generic
- Implements Xe specific ioctls and fsyncs
- Refactors dbg IoctlHelper to use shared base class
for Xe and i915

Related-to: NEO-9668

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-02-08 22:09:59 +01:00
Kamil Kopryk
59f661356c feature: use heapless builtins in OCL if supported
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-08 15:49:58 +01:00
Lu, Wenbin
5e562ae7b0 fix: store the correct pagesize in SvmAllocationData
Also use the same alignment for both CPU & GPU in shared USM

Related-To: GSD-7103, NEO-9812

Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-02-08 10:10:22 +01:00
Kamil Kopryk
2b91286754 fix: fix load store uint16x16 avx2 for unaligned ptr
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-08 09:35:30 +01:00
Yoon, Young Jin
97ef964bc4 feature: Add keys to override sync mode for immediate command list
Added OverrideImmediateCmdListSynchronousMode to override synchronous
mode for immediate command list

Related-To: NEO-10316
Signed-off-by: Yoon, Young Jin <young.jin.yoon@intel.com>
2024-02-08 08:35:32 +01:00
Compute-Runtime-Validation
0b64240deb Revert "feature: enable dummy blit WA for PVC"
This reverts commit fb9d225495.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-08 04:27:00 +01:00
Lukasz Jobczyk
486cc71b76 refactor: Add GDI profiling
Resolves: NEO-9236
Related-To: NEO-10036

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-07 18:44:11 +01:00
Maciej Plewka
ce17580b28 fix: Use Rcs engine in blender on DG2
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-02-07 18:21:54 +01:00
Dominik Dabek
371788210d performance: limit usm host allocation recycle
Query system total memory size and limit usm host allocation recycle to
use at most x%.
x is read from ExperimentalEnableDeviceAllocationCache for device and
ExperimentalEnableHostAllocationCache for host.

Related-To: GSD-7497

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-07 17:45:41 +01:00
Lukasz Jobczyk
5717a726ff fix: Allow free of zero sized allocation
Related-To: NEO-9236

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-07 17:04:11 +01:00
Dunajski, Bartosz
923707a509 feature: initial support for local dispatch size query
Related-To: NEO-8070

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-07 14:47:05 +01:00
Kacper Nowak
999ec9b2ca refactor: Unify logic for getting atomic FP caps 1/n
- Separate logic for fp16/32/46 caps.
- Add aggregated constexprs for local & global caps of given type
- Pass arguments by reference
- Add hwInfo as argument for future refactors
- Add static_asserts in L0 to ensure there is no mismatch between
internal/external caps
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-02-07 11:39:36 +01:00
Compute-Runtime-Validation
f6ca565323 Revert "feature: enable use of waitpkg intrinsic"
This reverts commit 2d15517fbe.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-06 16:30:26 +01:00
Dominik Dabek
b4a2e243dc performance: accept AOT kernels for indirects
Use indirects detection mechanism for AOT kernels if the detection
version is at least 2.

Related-To: NEO-7712

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-06 15:59:42 +01:00
Naklicki, Mateusz
eb0b0c2c89 refactor: add missing xe logs
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2024-02-06 13:03:46 +01:00
Mateusz Jablonski
fb9d225495 feature: enable dummy blit WA for PVC
cleanup redundant implementation for DG2

Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-06 10:17:12 +01:00
Kacper Nowak
ab1ffea592 refactor: Create function to check if OCL2.1 caps are enabled or enforced
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-02-06 08:27:09 +01:00
Compute-Runtime-Validation
9b649c1582 Revert "feature: enable bindless mode in L0 on dg2"
This reverts commit fbb8c5a98d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-06 01:41:03 +01:00
Dunajski, Bartosz
f31fafb1e2 refactor: improve debug flag to override bcs mocs
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-05 20:23:54 +01:00
Mateusz Jablonski
8708963c4f test: don't call real cpuid function in ULT
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-05 14:56:26 +01:00
Fabian Zwolinski
fbb8c5a98d feature: enable bindless mode in L0 on dg2
Related-To: NEO-7063
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-02-05 09:48:16 +01:00