Commit Graph

3818 Commits

Author SHA1 Message Date
Raiyan Latif
40471da591 fix: properly retrieve sub-device count in allocateRTDispatchGlobals
Previous method used for retrieiving sub-device count in this path
did not take into account device affinity mask nor device hiearchy
mode, resulting in segmentation faults when attempting to allocate
rtDispatchGlobals structure using improper deviceBitFields.

Related-To: NEO-8422

Signed-off-by: Raiyan Latif <raiyan.latif@intel.com>
2024-03-20 09:23:30 +01:00
Joshua Santosh Ranjan
06fcdd28f3 feature: add debug flag for metrics logs
Related-To: NEO-10125

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2024-03-19 12:33:26 +01:00
Mateusz Jablonski
e9e41714b4 test: restore maxOsContextCount after test cleanup
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-19 12:00:37 +01:00
Jaroslaw Warchulski
7c5640669c fix: incorrect check in parseAffinityMask function
Resolves: GSD-8333
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-03-19 09:20:00 +01:00
Mateusz Jablonski
a2742492ab feature: enable xe drm detection by default
driver is built with xe drm support by default

added cmake flag to control xe eu debug API support
NEO_ENABLE_XE_EU_DEBUG_SUPPORT

This flag is disabled by default and uapi-eu-debug headers are not
needed for driver compilation as these headers are not a part of
upstream kernel yet.

Related-To: NEO-10780

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-19 08:47:13 +01:00
Naklicki, Mateusz
e9ee54da24 fix: prepare L3 cache banks getter
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2024-03-18 15:12:16 +01:00
Zbigniew Zdanowicz
d1041e2335 feature: find cpu base address from all command buffers of container
Related-To: NEO-10381

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-18 13:43:08 +01:00
Mateusz Jablonski
19dcc80e44 Revert "build: enable xe drm detection by default"
This reverts commit 973757a58d.

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-18 09:04:59 +01:00
Compute-Runtime-Validation
016c234893 Revert "feature: disable scratch page by default"
This reverts commit dab5469f81.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-16 01:52:00 +01:00
Mateusz Jablonski
1e1d675606 fix: disable passing FtrTile64Optimization to gmmlib
add debug key to control if the value should be passed

Related-To: NEO-10785
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-15 17:42:53 +01:00
Naklicki, Mateusz
adae21b103 fix: prepare L3 cache size getter
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2024-03-15 16:55:45 +01:00
Mateusz Jablonski
0270cd6a5b fix: respect gt id when getting engines for drm context under xe kmd
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-15 16:02:47 +01:00
Bellekallu Rajkiran
9736313d10 feature: Support for ccs mode configuration via SysFs
Add support for configuring ccs mode for all applicable devices
before KMD is loaded.

Use ZEX_NUMBER_OF_CCS to configure ccs mode.

Format is as follows:

ZEX_NUMBER_OF_CCS=NumberOfCcs i,e Setting ZEX_NUMBER_OF_CCS
to 4 sets ccs mode to 4 for all devices for which configuration
is supported.

Related-To: NEO-10378

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2024-03-15 15:51:45 +01:00
Zbigniew Zdanowicz
f496233462 refactor: use single vector to store used heap allocations
Related-To: NEO-10483

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-15 15:01:36 +01:00
Young Jin Yoon
dab5469f81 feature: disable scratch page by default
Modified default values for disableScratch and gpuPageFault
to true and 10 respectively in drm_nep.cpp, in order to
disable scratch pages by default.

Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-15 11:44:10 +01:00
Mateusz Jablonski
470def9be6 test: fix variable naming
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-15 10:06:46 +01:00
Young Jin Yoon
82728ff394 feature: add logic to iterate for all contexts to check GPU pagefault
Implemented to go through entire contexts in the process and then query
reset status to check the unexpected GPU segfault.

Added a new debug variable GpuFaultCheckThreshold to change the checking
frequency for each hang check for performance analysis.

Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-15 07:48:39 +01:00
Compute-Runtime-Validation
94cc48f81b Revert "fix: don't use fake userptr flag in ioctl helper xe"
This reverts commit d3ab256f55.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-15 03:08:01 +01:00
Mateusz Jablonski
d3ab256f55 fix: don't use fake userptr flag in ioctl helper xe
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-14 18:41:17 +01:00
Neil R. Spruit
b5f8a38f19 feature: Enable Per IP euStall Functionality
Related-To: NEO-10220

Signed-off-by: Neil R. Spruit <neil.r.spruit@intel.com>
2024-03-14 16:49:52 +01:00
Compute-Runtime-Validation
ef7dbc99f1 Revert "fix: don't use fake userptr flag in ioctl helper xe"
This reverts commit 98824fdaf6.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-14 14:35:14 +01:00
Mateusz Jablonski
833fa6bce1 fix: correct querying engines from xe kmd
we get drm_xe_query_engines, not array of drm_xe_engine_class_instance

Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-14 12:06:25 +01:00
Mateusz Jablonski
98824fdaf6 fix: don't use fake userptr flag in ioctl helper xe
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-14 10:07:38 +01:00
Young Jin Yoon
7b81c4e08f feature: abort when unexpected GPU page fault detected
If ResetStats from i915 is from the GPU page fault, abort
the entire process instead of disabling engines.
Added a fallback mechanism when prelim_drm_i915_reset_stats
fails.

Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-14 08:14:59 +01:00
Mateusz Jablonski
0210e37f03 fix: respect gt id when finding xe engine info
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-13 20:52:36 +01:00
Bartosz Dunajski
e400a5c624 feature: in-order host counter allocation pooling 2
Related-To: NEO-10507

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-13 18:59:02 +01:00
Compute-Runtime-Validation
9cce1183cd Revert "feature: use prelim reset_stats for detailed statisics"
This reverts commit 835dc8b594.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-13 14:31:57 +01:00
Aravind Gopalakrishnan
3f20dd3b49 refactor: Add optional user fence during unbind
Add optional fence and wait operations after unbind operation.

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-03-13 12:47:44 +01:00
Young Jin Yoon
835dc8b594 feature: use prelim reset_stats for detailed statisics
Added getResetStats() in ioctl_helper.h to support extended header for
prelim_drm_i915_reset_stats.
Added new data structure to capture the fault data structure for prelim.

Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-13 11:37:04 +01:00
Francois Dugast
5483e466e8 fix: Align on strings returned for unknown values
Related-to: NEO-10321

Signed-off-by: Francois Dugast <francois.dugast@intel.com>
2024-03-13 11:21:51 +01:00
Dominik Dabek
d664995221 performance: enable localPreferred on dg2
Use localPreferred for buffer, svmGpu and image allocations on dg2.

Related-To: NEO-9695

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-12 22:47:47 +01:00
Ben Ashbaugh
e00230a6c2 fix: correctly report support for SPIR-V 1.0 through 1.3
Related-To: NEO-10336

Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2024-03-12 17:54:09 +01:00
Kamil Kopryk
626b3846b0 refactor: change compute walker to defaultWalkerType
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-03-12 17:52:43 +01:00
Lukasz Jobczyk
c3f1eba24a refactor: Add flag to control DC flush
Related-To: NEO-10556

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-12 14:54:16 +01:00
Mrozek, Michal
ee1a225a41 refactor: remove not used debug variables
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-12 10:12:23 +01:00
Dominik Dabek
8bc17778c4 test: refactor prealloc cmd buffer tests
Related-To: NEO-10526

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-12 09:28:53 +01:00
Zbigniew Zdanowicz
3f740f3460 feature: add new functionality to in order patch helpers
Related-To: NEO-10385

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-12 09:17:53 +01:00
Mateusz Jablonski
973757a58d build: enable xe drm detection by default
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-11 14:29:20 +01:00
Dominik Dabek
5ba9308804 performance: debug flag for localPreferred
Add flag for setting localPreferred (implicit when gmm localOnly=0 and
NonLocalOnly=0) when allocating buffer, svmGpu and image.

Related-To: NEO-9695

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-11 10:51:49 +01:00
Compute-Runtime-Validation
4082e9f028 Revert "feature: in-order host counter allocation pooling"
This reverts commit 2f03c48c7a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-09 19:47:10 +01:00
Mateusz Hoppe
cb7ac1ada0 feature: add debug key to generate sip header file
- header file can be used with LoadBinarySipFromFile

Related-To: GSD-8253

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-08 19:03:43 +01:00
Mrozek, Michal
f71f6d2b72 refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 18:18:55 +01:00
Mrozek, Michal
13a347cbde refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 15:10:06 +01:00
Mateusz Hoppe
5e15543b14 test: improve unit tests - use virtual file system
- use virtualFileSystem in ULTs, writeDataToFile() does not create file
but instead saves filename. Call to fileExists() checks if file was
previously written

Related-To: NEO-7006

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-08 13:34:19 +01:00
Dunajski, Bartosz
2f03c48c7a feature: in-order host counter allocation pooling
Related-To: NEO-10507

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-08 13:13:04 +01:00
Kamil Kopryk
168445784e feature: introduce states programming at driver init heapless ocl
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-03-08 12:29:44 +01:00
Mrozek, Michal
c72b9ec448 refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 08:53:13 +01:00
Mrozek, Michal
10313b7b84 refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-07 18:50:16 +01:00
Lukasz Jobczyk
6d1a3d404e refactor: Add helper to control flat ring buffer
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-07 17:52:23 +01:00
Mateusz Jablonski
8e7e5ca58f fix: return same UUID format independently on pci bus info presence
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-07 16:56:30 +01:00