Katarzyna Cencelewska
dd1d52259e
refactor: add param rootDeviceEnvironment to calculateNumThreadsPerThreadGroup
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-03-21 22:25:14 +01:00
Young Jin Yoon
ec009cf9e3
fix: abort only when disabling scratch page
...
Modified getResetStatus to abort only when scratch page is disabled.
Removed an incorrect UNRECOVERABLE_IF statement based on the status:
validPageFault can be true when banned flag is not set, if CAT error
does not occur as a result of page fault.
Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-21 21:55:25 +01:00
Mateusz Jablonski
4df0dd7894
fix: remove hardcoded caps reported by ioctl helper xe
...
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-21 21:09:20 +01:00
Mateusz Jablonski
92d37b20a6
fix: setup gpu address space based on config info from xe kmd
...
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-21 18:49:19 +01:00
Mateusz Jablonski
1e343053ba
refactor: remove redundant recreating vector of engines in xe kmd path
...
make ContextParamEngine structure more generic and populate engines
by drm specific methods
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-21 17:55:39 +01:00
Mateusz Jablonski
22d08dabc4
fix: pass FtrTile64Optimization as-is
...
Related-To: NEO-10623
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-21 16:52:27 +01:00
Aravind Gopalakrishnan
04b99de4d6
refactor: Force tlb flush during TC after copy
...
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-03-21 07:25:46 +01:00
Mateusz Hoppe
0cec5ccc32
fix: check if any of devices has BindlessHeapHelper enabled
...
- report Bindless_images extension in driver only when at least one
device has global bindless heaps enabled
Related-To: NEO-10352
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-20 20:50:58 +01:00
Mateusz Jablonski
614ed7ff5f
fix: remove not needed appendBaseAddressOffset method
...
Related-To: NEO-10623
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-20 17:46:14 +01:00
Wenju He
03078541d7
feature: make global bindless heaps resident when created
...
Make bindless heaps resident right after heap allocation.
Motivation is that SYCL bindless image can be passed as a value argument
or through memory. Therefore, we're not able to make its bindless heap
resident during kernel initialization or setting kernel arguments.
This fixes SYCL bindless image read_write_*D.cpp tests on DG2.
Related-To: NEO-7063
Signed-off-by: Wenju He <wenju.he@intel.com>
2024-03-20 14:40:11 +01:00
Mateusz Hoppe
27b930cabc
refactor: allow default setting for UseExternalAllocatorForSshAndDsh
...
- value of -1 selects driver default setting for external allocator
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-20 12:29:56 +01:00
Raiyan Latif
40471da591
fix: properly retrieve sub-device count in allocateRTDispatchGlobals
...
Previous method used for retrieving sub-device count in this path
did not take into account device affinity mask nor device hierarchy
mode, resulting in segmentation faults when attempting to allocate
rtDispatchGlobals structure using improper deviceBitFields.
Related-To: NEO-8422
Signed-off-by: Raiyan Latif <raiyan.latif@intel.com>
2024-03-20 09:23:30 +01:00
Jaroslaw Warchulski
7c5640669c
fix: incorrect check in parseAffinityMask function
...
Resolves: GSD-8333
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-03-19 09:20:00 +01:00
Mateusz Jablonski
a2742492ab
feature: enable xe drm detection by default
...
driver is built with xe drm support by default
added cmake flag to control xe eu debug API support
NEO_ENABLE_XE_EU_DEBUG_SUPPORT
This flag is disabled by default and uapi-eu-debug headers are not
needed for driver compilation as these headers are not a part of
upstream kernel yet.
Related-To: NEO-10780
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-19 08:47:13 +01:00
Zbigniew Zdanowicz
d1041e2335
feature: find cpu base address from all command buffers of container
...
Related-To: NEO-10381
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-18 13:43:08 +01:00
Mateusz Jablonski
19dcc80e44
Revert "build: enable xe drm detection by default"
...
This reverts commit 973757a58d.
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-18 09:04:59 +01:00
Compute-Runtime-Validation
016c234893
Revert "feature: disable scratch page by default"
...
This reverts commit dab5469f81.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-16 01:52:00 +01:00
Mateusz Jablonski
1e1d675606
fix: disable passing FtrTile64Optimization to gmmlib
...
add debug key to control if the value should be passed
Related-To: NEO-10785
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-15 17:42:53 +01:00
Mateusz Jablonski
0270cd6a5b
fix: respect gt id when getting engines for drm context under xe kmd
...
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-15 16:02:47 +01:00
Bellekallu Rajkiran
9736313d10
feature: Support for ccs mode configuration via SysFs
...
Add support for configuring ccs mode for all applicable devices
before KMD is loaded.
Use ZEX_NUMBER_OF_CCS to configure ccs mode.
Format is as follows:
ZEX_NUMBER_OF_CCS=NumberOfCcs, i.e. setting ZEX_NUMBER_OF_CCS
to 4 sets ccs mode to 4 for all devices for which configuration
is supported.
Related-To: NEO-10378
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2024-03-15 15:51:45 +01:00
Zbigniew Zdanowicz
f496233462
refactor: use single vector to store used heap allocations
...
Related-To: NEO-10483
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-15 15:01:36 +01:00
Young Jin Yoon
dab5469f81
feature: disable scratch page by default
...
Modified default values for disableScratch and gpuPageFault
to true and 10 respectively in drm_neo.cpp, in order to
disable scratch pages by default.
Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-15 11:44:10 +01:00
Mateusz Jablonski
470def9be6
test: fix variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-15 10:06:46 +01:00
Young Jin Yoon
82728ff394
feature: add logic to iterate for all contexts to check GPU pagefault
...
Go through all contexts in the process and query the reset status of
each one to check for an unexpected GPU segfault.
Added a new debug variable GpuFaultCheckThreshold to change the checking
frequency for each hang check for performance analysis.
Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-15 07:48:39 +01:00
Compute-Runtime-Validation
94cc48f81b
Revert "fix: don't use fake userptr flag in ioctl helper xe"
...
This reverts commit d3ab256f55.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-15 03:08:01 +01:00
Mateusz Jablonski
d3ab256f55
fix: don't use fake userptr flag in ioctl helper xe
...
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-14 18:41:17 +01:00
Neil R. Spruit
b5f8a38f19
feature: Enable Per IP euStall Functionality
...
Related-To: NEO-10220
Signed-off-by: Neil R. Spruit <neil.r.spruit@intel.com>
2024-03-14 16:49:52 +01:00
Compute-Runtime-Validation
ef7dbc99f1
Revert "fix: don't use fake userptr flag in ioctl helper xe"
...
This reverts commit 98824fdaf6.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-14 14:35:14 +01:00
Mateusz Jablonski
833fa6bce1
fix: correct querying engines from xe kmd
...
we get drm_xe_query_engines, not array of drm_xe_engine_class_instance
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-14 12:06:25 +01:00
Mateusz Jablonski
98824fdaf6
fix: don't use fake userptr flag in ioctl helper xe
...
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-14 10:07:38 +01:00
Young Jin Yoon
7b81c4e08f
feature: abort when unexpected GPU page fault detected
...
If ResetStats from i915 is from the GPU page fault, abort
the entire process instead of disabling engines.
Added a fallback mechanism when prelim_drm_i915_reset_stats
fails.
Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-14 08:14:59 +01:00
Mateusz Jablonski
0210e37f03
fix: respect gt id when finding xe engine info
...
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-13 20:52:36 +01:00
Bartosz Dunajski
e400a5c624
feature: in-order host counter allocation pooling 2
...
Related-To: NEO-10507
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-13 18:59:02 +01:00
Compute-Runtime-Validation
9cce1183cd
Revert "feature: use prelim reset_stats for detailed statisics"
...
This reverts commit 835dc8b594.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-13 14:31:57 +01:00
Aravind Gopalakrishnan
3f20dd3b49
refactor: Add optional user fence during unbind
...
Add optional fence and wait operations after unbind operation.
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-03-13 12:47:44 +01:00
Young Jin Yoon
835dc8b594
feature: use prelim reset_stats for detailed statisics
...
Added getResetStats() in ioctl_helper.h to support extended header for
prelim_drm_i915_reset_stats.
Added new data structure to capture the fault data structure for prelim.
Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-13 11:37:04 +01:00
Francois Dugast
5483e466e8
fix: Align on strings returned for unknown values
...
Related-to: NEO-10321
Signed-off-by: Francois Dugast <francois.dugast@intel.com>
2024-03-13 11:21:51 +01:00
Dominik Dabek
d664995221
performance: enable localPreferred on dg2
...
Use localPreferred for buffer, svmGpu and image allocations on dg2.
Related-To: NEO-9695
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-12 22:47:47 +01:00
Ben Ashbaugh
e00230a6c2
fix: correctly report support for SPIR-V 1.0 through 1.3
...
Related-To: NEO-10336
Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2024-03-12 17:54:09 +01:00
Kamil Kopryk
626b3846b0
refactor: change compute walker to defaultWalkerType
...
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-03-12 17:52:43 +01:00
Lukasz Jobczyk
c3f1eba24a
refactor: Add flag to control DC flush
...
Related-To: NEO-10556
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-12 14:54:16 +01:00
Dominik Dabek
8bc17778c4
test: refactor prealloc cmd buffer tests
...
Related-To: NEO-10526
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-12 09:28:53 +01:00
Zbigniew Zdanowicz
3f740f3460
feature: add new functionality to in order patch helpers
...
Related-To: NEO-10385
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-12 09:17:53 +01:00
Mateusz Jablonski
973757a58d
build: enable xe drm detection by default
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-11 14:29:20 +01:00
Dominik Dabek
5ba9308804
performance: debug flag for localPreferred
...
Add flag for setting localPreferred (implicit when gmm localOnly=0 and
NonLocalOnly=0) when allocating buffer, svmGpu and image.
Related-To: NEO-9695
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-11 10:51:49 +01:00
Compute-Runtime-Validation
4082e9f028
Revert "feature: in-order host counter allocation pooling"
...
This reverts commit 2f03c48c7a.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-09 19:47:10 +01:00
Mateusz Hoppe
cb7ac1ada0
feature: add debug key to generate sip header file
...
- header file can be used with LoadBinarySipFromFile
Related-To: GSD-8253
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-08 19:03:43 +01:00
Mrozek, Michal
f71f6d2b72
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 18:18:55 +01:00
Mrozek, Michal
13a347cbde
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 15:10:06 +01:00
Mateusz Hoppe
5e15543b14
test: improve unit tests - use virtual file system
...
- use virtualFileSystem in ULTs: writeDataToFile() does not create a file
but instead saves the filename. A call to fileExists() checks whether the
file was previously written
Related-To: NEO-7006
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-08 13:34:19 +01:00
Dunajski, Bartosz
2f03c48c7a
feature: in-order host counter allocation pooling
...
Related-To: NEO-10507
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-08 13:13:04 +01:00
Kamil Kopryk
168445784e
feature: introduce states programming at driver init heapless ocl
...
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-03-08 12:29:44 +01:00
Mrozek, Michal
c72b9ec448
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 08:53:13 +01:00
Mrozek, Michal
10313b7b84
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-07 18:50:16 +01:00
Lukasz Jobczyk
6d1a3d404e
refactor: Add helper to control flat ring buffer
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-07 17:52:23 +01:00
Mateusz Jablonski
8e7e5ca58f
fix: return same UUID format independently on pci bus info presence
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-07 16:56:30 +01:00
Michal Mrozek
e08668b982
refactor: remove not needed code.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-03-07 13:02:29 +01:00
Zbigniew Zdanowicz
fe377b6597
feature: add out cmd pointer to store data imm encoder
...
Related-To: NEO-10385
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-07 12:52:08 +01:00
Bartosz Dunajski
79d80047ef
refactor: improve mmap logging logic
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-07 12:15:39 +01:00
Michal Mrozek
660539fe4e
refactor: remove not needed code.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-03-06 20:13:57 +01:00
Mateusz Jablonski
debbf87671
fix: move ARL-U device id to MTL-M release
...
Related-To: NEO-10404
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-06 17:33:00 +01:00
Dominik Dabek
033ff28609
performance: enable cmd buffer preallocate xehp+
...
For L0 immediate cmdlists try to use reusable allocations if available.
Related-To: NEO-10526
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-06 16:53:01 +01:00
Mateusz Jablonski
8ae4a3bc7a
fix: pass Sku/Wa tables for gmm without additional translations on Windows
...
Related-To: NEO-10623
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-06 14:58:58 +01:00
Bartosz Dunajski
fcd57f94cf
refactor: capability to print mmap and munmap calls
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-06 14:29:01 +01:00
Brandon Yates
7a0d2df2fe
fix: Handle Pat Index Ext not supported on Xe
...
Xe does not support VmBindPatIndexExtension. This patch
fixes the handling of this case and prevents corrupting
other extensions
Related-to: NEO-9674
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-03-06 11:18:31 +01:00
Filip Hazubski
d25026b263
refactor: Add getTotalMemBankSize function to ReleaseHelper
...
Minor refactor of ULTs to not use hard coded banks size.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-03-06 09:53:56 +01:00
Dominik Dabek
a04c67ec52
performance(ocl): refactor pool allocators tests
...
add explicit tests for xe hpc
Related-To: NEO-9700
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-05 18:48:55 +01:00
Mateusz Jablonski
8b4bca89bc
refactor: move ray tracing support capability to release helper
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-05 17:30:57 +01:00
Filip Hazubski
5f7e56e78b
refactor: Unify aub config generation
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-03-05 11:09:34 +01:00
Compute-Runtime-Validation
293858320e
Revert "performance: Fill reusable allocation list on xe and later"
...
This reverts commit 15934ceb43.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-02 07:09:45 +01:00
Fabian Zwolinski
52430762ac
fix: cl_cache L0 env vars + refactor code structure
...
Added support for new Compiler Cache
environment variables in Level Zero.
Moved
`opencl/source/compiler_interface/default_cache_config.cpp`
`level_zero/core/source/compiler_interface/default_cache_config.cpp`
to shared directory
`source/compiler_interface/default_cache_config.cpp`
Switched enabling cache by default from per OS to per API.
Changed default state of cl_cache in Level Zero to disabled.
Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-03-01 17:35:08 +01:00
Dunajski, Bartosz
51ae76a25f
refactor: improve handling of in-order atomic signaling
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-01 15:59:25 +01:00
Lukasz Jobczyk
bd6925d51a
refactor: Exclude thread dispatch algorithm for overdispatch to function
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-01 15:04:04 +01:00
Mateusz Jablonski
a8fbed6120
feature: enable dummy blit WA for PVC
...
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-01 14:04:46 +01:00
Dunajski, Bartosz
ea2ad550a1
refactor: improve handling duplicated in-order host storage
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-01 13:54:28 +01:00
Mateusz Jablonski
80aa55a3cb
fix: don't program dummy blit prior to MI_FLUSH_DW without postsync
...
add missing dummy blits before MI_FLUSH_DW with postsync
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-01 12:36:13 +01:00
Kozlowski, Marek
6751d19c19
fix: decanonize pointer to match GPU heap address space
...
* `zeVirtualMemReserve` `pStart` address may be passed in a canonized form.
Resolves: NEO-10086
Signed-off-by: Kozlowski, Marek <marek.kozlowski@intel.com>
2024-03-01 12:18:11 +01:00
Dunajski, Bartosz
bbe1043f08
feature: initial support for pooling in-order counter allocations
...
Related-To: NEO-10507
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-01 12:08:02 +01:00
Zbigniew Zdanowicz
4baee71974
refactor: add out cmd argument to semaphore wait encoder
...
Related-To: NEO-10065
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-01 11:43:36 +01:00
Lukasz Jobczyk
15934ceb43
performance: Fill reusable allocation list on xe and later
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-01 09:41:19 +01:00
Compute-Runtime-Validation
5fce046877
Revert "fix: don't program dummy blit prior to MI_FLUSH_DW without postsync"
...
This reverts commit d796fb559d.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-01 01:43:35 +01:00
Lukasz Jobczyk
e5db84f370
performance: Use GEMCreateExt when allocate by KMD
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-29 18:01:55 +01:00
Lukasz Jobczyk
409e19a832
performance: Enable cmd buffer preallocation per CmdQ on xe and later
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-29 17:41:58 +01:00
Dunajski, Bartosz
00964275fb
fix: create preemption allocation for Root Csr
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-29 16:46:24 +01:00
Mateusz Jablonski
d796fb559d
fix: don't program dummy blit prior to MI_FLUSH_DW without postsync
...
add missing dummy blits before MI_FLUSH_DW with postsync
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-29 13:30:09 +01:00
Fabian Zwolinski
a27dafd2e1
fix: cl_cache store binary and debug data in one file
...
Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-02-29 10:13:15 +01:00
Lukasz Jobczyk
676644bc50
performance: Enable internal heap preallocation on xe and later
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-28 17:58:52 +01:00
Lukasz Jobczyk
0fcc67fba9
refactor: Change scope of product helper test
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-28 15:52:25 +01:00
Lukasz Jobczyk
39a44628a3
performance: Enable timestamp wait for queues on xe and later
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-28 14:26:59 +01:00
Lukasz Jobczyk
d1dd34d0c7
performance: Enable timestamp wait for events on xe and later
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-28 14:18:14 +01:00
Michal Mrozek
64232ec370
fix: choose proper csr for low priority immediate command lists
...
Resolves: NEO-10168
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-02-28 12:45:02 +01:00
Brandon Yates
fa4b737326
feature: Implement metadata attaching for vm_bind in xe
...
Related-to: NEO-9674
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-02-28 01:36:20 +01:00
Dunajski, Bartosz
8483a922a7
feature: simplify in-order allocation overflow
...
Related-To: NEO-10507
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-27 18:25:26 +01:00
Mateusz Jablonski
b6b53daa3f
fix: don't program dummy blit prior to MI ARB CHECK
...
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-27 15:46:46 +01:00
Mateusz Jablonski
9f01a831a2
fix: correct command for dummy blit on PVC
...
use MEM_SET command
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-27 14:27:47 +01:00
Zbigniew Zdanowicz
0bf8e8727e
refactor: add output buffer argument to store register to memory encoder
...
Related-To: NEO-10064
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-02-27 13:31:55 +01:00
Mateusz Jablonski
9cbc33e23d
fix: setup correct render/display core family for gmmlib initialization
...
pass values got from adapter info
Related-To: NEO-10482
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-26 14:49:04 +01:00
Lukasz Jobczyk
cfd3edfb2c
fix: Align IOH entry
...
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-26 14:36:31 +01:00
Mateusz Jablonski
459da6a482
refactor: cleanup xe hpg logic related to dummy blit wa
...
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-26 13:32:18 +01:00
Mateusz Jablonski
3e65e7bdba
fix: correct number of max work group count for concurrent kernel on PVC
...
for single-CCS mode use all EUs
Related-To: NEO-8377
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-26 10:42:50 +01:00
Dunajski, Bartosz
6cdd2d5dca
fix: add missing gt_id when creating XE context
...
Related-To: GSD-8046
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-23 16:50:32 +01:00
Zbigniew Zdanowicz
1995e6aa40
feature: store cpu pointer of command buffer pointing to post sync command
...
Related-To: NEO-10064
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-02-23 13:12:15 +01:00
Michal Mrozek
ef2d3b538b
performance: move timestampPacketTagBuffer to local memory
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
Resolves: NEO-7765
2024-02-23 09:55:48 +01:00
Brandon Yates
0fa730e524
build: Update debugger uapi headers to latest
...
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-02-22 16:07:06 +01:00
Lukasz Jobczyk
b44729771c
fix: Extend MTL's overridePatIndex to XE LPG
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-21 13:21:30 +01:00
Maciej Plewka
964f0166d7
fix: Reuse freed chunk from left even if not aligned
...
Related-To: NEO-10416, NEO-10418, NEO-10437
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-02-21 12:31:54 +01:00
Zbigniew Zdanowicz
db9d0c9a58
fix: remove heap resource leak after command list reset
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-02-20 14:38:19 +01:00
Dominik Dabek
0120d8a58d
performance: program pat index on mtl linux
...
Enable programming pat indexes on mtl linux for device buffers.
Change DrmMemoryManager::allocateMemoryByKMD to use gemCreateExt.
Set mmap flags based on coherency.
Map as write back on legacy and coherent.
On non-coherent map as write combined.
Changes currently disabled, to enable use debug keys:
DisableGemCreateExtSetPat=0
UseGemCreateExtInAllocateMemoryByKMD=1
Reorder BufferObject to decrease padding.
Related-To: NEO-7896
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-16 17:33:07 +01:00
Lukasz Jobczyk
cc1732c930
performance: make resident before lock
...
Resolves: NEO-10369
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-16 12:00:38 +01:00
Compute-Runtime-Validation
4d159bd06c
Revert "performance: make resident before lock"
...
This reverts commit 7e7fac01e3.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-16 09:23:49 +01:00
Dominik Dabek
ed011de03e
performance: program pat index on mtl linux
...
Enable programming pat indexes on mtl linux for device buffers.
Change DrmMemoryManager::allocateMemoryByKMD to use gemCreateExt.
Changes currently disabled, can be enabled with flag
DisableGemCreateExtSetPat=0
Related-To: NEO-7896
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-15 17:15:28 +01:00
Lukasz Jobczyk
7e7fac01e3
performance: make resident before lock
...
Resolves: NEO-10369
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-15 11:56:36 +01:00
Michal Mrozek
25b3a3dbfa
fix: use proper settings for caches.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-02-15 11:38:34 +01:00
Compute-Runtime-Validation
9570d02134
Revert "fix: correctly report support for SPIR-V 1.0 through 1.3"
...
This reverts commit 5120ec2f93.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-15 04:00:31 +01:00
Compute-Runtime-Validation
7b340775c6
Revert "performance: program pat index on mtl linux"
...
This reverts commit 8e0b23db84.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-15 02:06:03 +01:00
Dominik Dabek
8e0b23db84
performance: program pat index on mtl linux
...
Enable programming pat indexes on mtl linux for device buffers.
Change DrmMemoryManager::allocateMemoryByKMD to use gemCreateExt.
Related-To: NEO-7896
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-14 18:42:04 +01:00
Kacper Nowak
cccd9a3703
refactor: Add method for retrieving extra device info
...
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-02-14 16:38:15 +01:00
Ben Ashbaugh
5120ec2f93
fix: correctly report support for SPIR-V 1.0 through 1.3
...
Related-To: NEO-10336
Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2024-02-14 09:21:06 +01:00
Maciej Plewka
233f898b59
fix: split freed chunk from right
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
Related-To: NEO-10280
2024-02-13 16:14:39 +01:00
Mateusz Jablonski
bb5f6d9660
fix: don't query vm bind support on i915 prelim for pre-Xe platforms
...
Related-To: HSD-18036843571
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-09 12:50:48 +01:00
Kamil Kopryk
59f661356c
feature: use heapless builtins in OCL if supported
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-08 15:49:58 +01:00
Lu, Wenbin
5e562ae7b0
fix: store the correct pagesize in SvmAllocationData
...
Also use the same alignment for both CPU & GPU in shared USM
Related-To: GSD-7103, NEO-9812
Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-02-08 10:10:22 +01:00
Compute-Runtime-Validation
0b64240deb
Revert "feature: enable dummy blit WA for PVC"
...
This reverts commit fb9d225495.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-08 04:27:00 +01:00
Lukasz Jobczyk
486cc71b76
refactor: Add GDI profiling
...
Resolves: NEO-9236
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-07 18:44:11 +01:00
Maciej Plewka
ce17580b28
fix: Use Rcs engine in blender on DG2
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-02-07 18:21:54 +01:00
Dominik Dabek
371788210d
performance: limit usm host allocation recycle
...
Query system total memory size and limit usm host allocation recycle to
use at most x%.
x is read from ExperimentalEnableDeviceAllocationCache for device and
ExperimentalEnableHostAllocationCache for host.
Related-To: GSD-7497
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-07 17:45:41 +01:00
Lukasz Jobczyk
5717a726ff
fix: Allow free of zero sized allocation
...
Related-To: NEO-9236
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-07 17:04:11 +01:00
Dunajski, Bartosz
923707a509
feature: initial support for local dispatch size query
...
Related-To: NEO-8070
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-07 14:47:05 +01:00
Compute-Runtime-Validation
f6ca565323
Revert "feature: enable use of waitpkg intrinsic"
...
This reverts commit 2d15517fbe.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-06 16:30:26 +01:00
Dominik Dabek
b4a2e243dc
performance: accept AOT kernels for indirects
...
Use indirects detection mechanism for AOT kernels if the detection
version is at least 2.
Related-To: NEO-7712
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-06 15:59:42 +01:00
Mateusz Jablonski
fb9d225495
feature: enable dummy blit WA for PVC
...
cleanup redundant implementation for DG2
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-06 10:17:12 +01:00
Compute-Runtime-Validation
9b649c1582
Revert "feature: enable bindless mode in L0 on dg2"
...
This reverts commit fbb8c5a98d.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-06 01:41:03 +01:00
Dunajski, Bartosz
f31fafb1e2
refactor: improve debug flag to override bcs mocs
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-05 20:23:54 +01:00
Fabian Zwolinski
fbb8c5a98d
feature: enable bindless mode in L0 on dg2
...
Related-To: NEO-7063
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-02-05 09:48:16 +01:00
Kamil Kopryk
a4f7dda98f
refactor: Add xe print debug key
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-02 16:39:51 +01:00
Katarzyna Cencelewska
e6ba9766bd
feature: add debug flags to force pat index
...
for cached resources: OverridePatIndexForCachedTypes
for uncached resources: OverridePatIndexForUncachedTypes
Related-To: NEO-10157
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-02-02 16:11:34 +01:00
Brandon Yates
27c089d60d
feature: Register ELF for xe debugger
...
Related-to: NEO-9674
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-02-02 09:19:19 +01:00
Jitendra Sharma
00b1f1c5b5
fix: set runalone mode in xe only for render and compute
...
Runalone mode in XE is supported only for RENDER and COMPUTE.
Related-To: NEO-9139
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-02-02 09:13:17 +01:00
Dominik Dabek
2cad595a0d
performance: debug flag for usm host alloc recycle
...
set ExperimentalEnableHostAllocationCache=1 to recycle host usm
allocations
Related-To: GSD-7497
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-01 16:47:59 +01:00
Francois Dugast
e311ba5597
refactor: Move ownership of engine type to caller of setDefaultEngine
...
Related-To: GSD-7097
Signed-off-by: Francois Dugast <francois.dugast@intel.com>
2024-02-01 14:20:20 +01:00
Compute-Runtime-Validation
e7b7eb06e4
Revert "fix: store the correct pagesize in SvmAllocationData"
...
This reverts commit a104d9199d.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-01 05:00:45 +01:00
Katarzyna Cencelewska
8ca9141019
fix: remove wa with override pat index on xe_lpg
...
Related-To: NEO-9657
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-01-31 16:30:12 +01:00
Katarzyna Cencelewska
d0b009901c
fix: use proper gmm resource type for uncache resources
...
when using the new coherency model
Resolves: NEO-9657
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-01-31 15:26:11 +01:00
Kamil Kopryk
6d3a53fe7f
refactor: Improve scratch programming in heapless mode
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-31 10:39:39 +01:00
Lu, Wenbin
a104d9199d
fix: store the correct pagesize in SvmAllocationData
...
Related-To: GSD-7103, NEO-9812
Signed-off-by: Lu, Wenbin <wenbin.lu@intel.com>
2024-01-31 09:12:20 +01:00
Yoon, Young Jin
cbe35d70a5
fix: initialize libnuma only when flag is set
...
Modified memory_info.cpp to initialize libnuma only when
EnableHostAllocationMemPolicy is set.
Related-To: NEO-8276
Signed-off-by: Yoon, Young Jin <young.jin.yoon@intel.com>
2024-01-30 18:27:43 +01:00
Maciej Plewka
564e0f0319
performance: Align host mem to 2MB when range is not limited
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
Related-To: NEO-10217
2024-01-30 14:43:13 +01:00
Francois Dugast
278ced35dc
fix: Use capability table to determine engine type for defaultEngine
...
Related-To: GSD-7097
Signed-off-by: Francois Dugast <francois.dugast@intel.com>
2024-01-30 14:28:09 +01:00
Maciej Plewka
620ad5fa89
fix: change freed chunk size when ptr is aligned
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
Related-To: NEO-9945
2024-01-30 12:22:28 +01:00
Compute-Runtime-Validation
fa9c79fb63
Revert "refactor: Add GDI profiling"
...
This reverts commit 524ae7713a.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-30 10:47:34 +01:00