Michal Mrozek
25b3a3dbfa
fix: user proper settings for caches.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2024-02-15 11:38:34 +01:00
Chodor, Jaroslaw
b58717b9e3
feature: Add support for legacy acronyms in ocloc's fatbinary
...
Expands support for deprecated acronyms to fatbinary. Previously,
these were allowed only in single-target builds.
Related-To: NEO-10190
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com >
2024-02-15 10:14:49 +01:00
Compute-Runtime-Validation
57c946b61c
Revert "fix: align indirect data pointer to cacheline size in heapless mode"
...
This reverts commit 004e6e647f .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-02-15 05:49:19 +01:00
Compute-Runtime-Validation
d22b2a8a38
Revert "feature: add support for the cl_khr_extended_bit_ops extension"
...
This reverts commit 6dec2143df .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-02-15 05:34:10 +01:00
Compute-Runtime-Validation
9570d02134
Revert "fix: correctly report support for SPIR-V 1.0 through 1.3"
...
This reverts commit 5120ec2f93 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-02-15 04:00:31 +01:00
Compute-Runtime-Validation
7b340775c6
Revert "performance: program pat index on mtl linux"
...
This reverts commit 8e0b23db84 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-02-15 02:06:03 +01:00
Dunajski, Bartosz
88c5872682
feature: debug flag to flush tlb before copy
...
Related-To: HSD-18036669673
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2024-02-14 20:05:57 +01:00
Dominik Dabek
8e0b23db84
performance: program pat index on mtl linux
...
Enable programming pat indexes on mtl linux for device buffers.
Change DrmMemoryManager::allocateMemoryByKMD to use gemCreateExt.
Related-To: NEO-7896
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-02-14 18:42:04 +01:00
Kacper Nowak
cccd9a3703
refactor: Add method for retrieving extra device info
...
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com >
2024-02-14 16:38:15 +01:00
Kamil Kopryk
004e6e647f
fix: align indirect data pointer to cacheline size in heapless mode
...
Align indirect data pointer to cacheline size in heapless mode,
restore debug_break_if if avx2 load/store operation
gets unaligned pointer,
remove fallback to mm256 loadu/storeu unaligned operation
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-02-14 10:00:15 +01:00
Ben Ashbaugh
5120ec2f93
fix: correctly report support for SPIR-V 1.0 through 1.3
...
Related-To: NEO-10336
Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com >
2024-02-14 09:21:06 +01:00
Ben Ashbaugh
6dec2143df
feature: add support for the cl_khr_extended_bit_ops extension
...
Related-To: NEO-10337
Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com >
2024-02-14 09:11:44 +01:00
Maciej Plewka
233f898b59
fix: split freed chunk from right
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
Related-To: NEO-10280
2024-02-13 16:14:39 +01:00
Mateusz Jablonski
bb5f6d9660
fix: don't query vm bind support on i915 prelim for pre-Xe platforms
...
Related-To: HSD-18036843571
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-02-09 12:50:48 +01:00
Brandon Yates
ea7ae7564f
feature: Implement read/writeGpuMemory for Xe debugger
...
- makes prelim read/writeGpuMemory generic
- Implements Xe specific ioctls and fsyncs
- Refactors dbg IoctlHelper to use shared base class
for Xe and i915
Related-to: NEO-9668
Signed-off-by: Brandon Yates <brandon.yates@intel.com >
2024-02-08 22:09:59 +01:00
Kamil Kopryk
59f661356c
feature: use heapless builtins in OCL if supported
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-02-08 15:49:58 +01:00
Lu, Wenbin
5e562ae7b0
fix: store the correct pagesize in SvmAllocationData
...
Also use the same alignment for both CPU & GPU in shared USM
Related-To: GSD-7103, NEO-9812
Signed-off-by: Wenbin Lu <wenbin.lu@intel.com >
2024-02-08 10:10:22 +01:00
Kamil Kopryk
2b91286754
fix: fix load store uint16x16 avx2 for unaligned ptr
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-02-08 09:35:30 +01:00
Yoon, Young Jin
97ef964bc4
feature: Add keys to override sync mode for immediate command list
...
Added OverrideImmediateCmdListSynchronousMode to override synchronous
mode for immediate command list
Related-To: NEO-10316
Signed-off-by: Yoon, Young Jin <young.jin.yoon@intel.com >
2024-02-08 08:35:32 +01:00
Compute-Runtime-Validation
0b64240deb
Revert "feature: enable dummy blit WA for PVC"
...
This reverts commit fb9d225495 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-02-08 04:27:00 +01:00
Lukasz Jobczyk
486cc71b76
refactor: Add GDI profiling
...
Resolves: NEO-9236
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-02-07 18:44:11 +01:00
Maciej Plewka
ce17580b28
fix: Use Rcs engine in blender on DG2
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2024-02-07 18:21:54 +01:00
Dominik Dabek
371788210d
performance: limit usm host allocation recycle
...
Query system total memory size and limit usm host allocation recycle to
use at most x%.
x is read from ExperimentalEnableDeviceAllocationCache for device and
ExperimentalEnableHostAllocationCache for host.
Related-To: GSD-7497
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-02-07 17:45:41 +01:00
Lukasz Jobczyk
5717a726ff
fix: Allow free of zero sized allocation
...
Related-To: NEO-9236
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-02-07 17:04:11 +01:00
Dunajski, Bartosz
923707a509
feature: initial support for local dispatch size query
...
Related-To: NEO-8070
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2024-02-07 14:47:05 +01:00
Kacper Nowak
999ec9b2ca
refactor: Unify logic for getting atomic FP caps 1/n
...
- Separate logic for fp16/32/46 caps.
- Add aggregated constexprs for local & global caps of given type
- Pass arguments by reference
- Add hwInfo as argument for future refactors
- Add static_asserts in L0 to ensure there is no mismatch between
internal/external caps
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com >
2024-02-07 11:39:36 +01:00
Compute-Runtime-Validation
f6ca565323
Revert "feature: enable use of waitpkg intrinsic"
...
This reverts commit 2d15517fbe .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-02-06 16:30:26 +01:00
Dominik Dabek
b4a2e243dc
performance: accept AOT kernels for indirects
...
Use indirects detection mechanism for AOT kernels if the detection
version is at least 2.
Related-To: NEO-7712
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-02-06 15:59:42 +01:00
Naklicki, Mateusz
eb0b0c2c89
refactor: add missing xe logs
...
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com >
2024-02-06 13:03:46 +01:00
Mateusz Jablonski
fb9d225495
feature: enable dummy blit WA for PVC
...
cleanup redundant implementation for DG2
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-02-06 10:17:12 +01:00
Kacper Nowak
ab1ffea592
refactor: Create function to check if OCL2.1 caps are enabled or enforced
...
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com >
2024-02-06 08:27:09 +01:00
Compute-Runtime-Validation
9b649c1582
Revert "feature: enable bindless mode in L0 on dg2"
...
This reverts commit fbb8c5a98d .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-02-06 01:41:03 +01:00
Dunajski, Bartosz
f31fafb1e2
refactor: improve debug flag to override bcs mocs
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2024-02-05 20:23:54 +01:00
Fabian Zwolinski
fbb8c5a98d
feature: enable bindless mode in L0 on dg2
...
Related-To: NEO-7063
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com >
2024-02-05 09:48:16 +01:00
Kamil Kopryk
a4f7dda98f
refactor: Add xe print debug key
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-02-02 16:39:51 +01:00
Katarzyna Cencelewska
e6ba9766bd
feature: add debug flags to force pat index
...
for cached recouces: OverridePatIndexForCachedTypes
for uncached resouces: OverridePatIndexForUncachedTypes
Related-To: NEO-10157
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2024-02-02 16:11:34 +01:00
Brandon Yates
27c089d60d
feature: Register ELF for xe debugger
...
Related-to: NEO-9674
Signed-off-by: Brandon Yates <brandon.yates@intel.com >
2024-02-02 09:19:19 +01:00
Jitendra Sharma
00b1f1c5b5
fix: set runalone mode in xe only for render and compute
...
Runalone mode in XE is supported only for RENDER and COMPUTE.
Related-To: NEO-9139
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com >
2024-02-02 09:13:17 +01:00
Dominik Dabek
2cad595a0d
performance: debug flag for usm host alloc recycle
...
set ExperimentalEnableHostAllocationCache=1 to recycle host usm
allocations
Related-To: GSD-7497
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-02-01 16:47:59 +01:00
Mateusz Jablonski
2237961deb
performance: add macro for ULT-only unrecoverable if
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-02-01 15:09:31 +01:00
Francois Dugast
e311ba5597
refactor: Move ownership of engine type to caller of setDefaultEngine
...
Related-To: GSD-7097
Signed-off-by: Francois Dugast <francois.dugast@intel.com >
2024-02-01 14:20:20 +01:00
Compute-Runtime-Validation
e7b7eb06e4
Revert "fix: store the correct pagesize in SvmAllocationData"
...
This reverts commit a104d9199d .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-02-01 05:00:45 +01:00
Katarzyna Cencelewska
8ca9141019
fix: remove wa with override pat index on xe_lpg
...
Related-To: NEO-9657
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2024-01-31 16:30:12 +01:00
Katarzyna Cencelewska
d0b009901c
fix: use proper gmm resource type for uncache resources
...
when new coherency model
Resolves: NEO-9657
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2024-01-31 15:26:11 +01:00
Filip Hazubski
d920753ca6
fix: Disable related logic when EnableHostAllocationMemPolicy is not set
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2024-01-31 13:45:52 +01:00
Kamil Kopryk
6d3a53fe7f
refactor: Improve scratch programming in heapless mode
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-01-31 10:39:39 +01:00
Lu, Wenbin
a104d9199d
fix: store the correct pagesize in SvmAllocationData
...
Related-To: GSD-7103, NEO-9812
Signed-off-by: Lu, Wenbin <wenbin.lu@intel.com >
2024-01-31 09:12:20 +01:00
Compute-Runtime-Validation
fb46066abc
Revert "fix: enable cache env variables for level-zero"
...
This reverts commit 743904d2df .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-01-31 08:33:05 +01:00
Yoon, Young Jin
cbe35d70a5
fix: initialize libnuma only when flag is set
...
Modified in memory_info.cpp to initialize libnuma only when
EnableHostAllocationMemPolicy is set.
Related-To: NEO-8276
Signed-off-by: Yoon, Young Jin <young.jin.yoon@intel.com >
2024-01-30 18:27:43 +01:00
Maciej Plewka
564e0f0319
performance: Align host mem to 2MB when range is not limited
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
Related-To: NEO-10217
2024-01-30 14:43:13 +01:00