Commit Graph

15568 Commits

Author SHA1 Message Date
Raiyan Latif
40471da591 fix: properly retrieve sub-device count in allocateRTDispatchGlobals
Previous method used for retrieiving sub-device count in this path
did not take into account device affinity mask nor device hiearchy
mode, resulting in segmentation faults when attempting to allocate
rtDispatchGlobals structure using improper deviceBitFields.

Related-To: NEO-8422

Signed-off-by: Raiyan Latif <raiyan.latif@intel.com>
2024-03-20 09:23:30 +01:00
ocldev
931eaafa11 build: dependencies update
Signed-off-by: ocldev <ocldev@intel.com>
2024-03-20 09:14:45 +01:00
Brandon Yates
7f8bac7703 fix: Remove Xe Debug stubs for read/writeMemory
The correct implementation is in DebugSessionLinux and
the overrides in DebugSessionLinuxXe are not needed

Related-to: NEO-9669

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-03-20 08:43:28 +01:00
ocldev
5c4082cd1a build: level_zero revision update
Signed-off-by: ocldev <ocldev@intel.com>
2024-03-20 08:34:38 +01:00
Pawel Cieslak
c2a1d263d7 ci: infra revision update
Signed-off-by: Pawel Cieslak <pawel.cieslak@intel.com>
2024-03-19 19:29:26 +01:00
Filip Hazubski
1fb1826fff fix: Add DG2 device ids
Add following device ids: 0x56BE, 0x56BF.

Update device brand string for device ids: 0x56BA, 0x56BB, 0x56BC, 0x56BD.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-03-19 12:52:31 +01:00
Filip Hazubski
e0f67da7f4 fix: Update ATS-M device id
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-03-19 12:37:48 +01:00
Joshua Santosh Ranjan
06fcdd28f3 feature: add debug flag for metrics logs
Related-To: NEO-10125

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2024-03-19 12:33:26 +01:00
Mateusz Jablonski
e9e41714b4 test: restore maxOsContextCount after test cleanup
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-19 12:00:37 +01:00
Jaroslaw Warchulski
7c5640669c fix: incorrect check in parseAffinityMask function
Resolves: GSD-8333
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-03-19 09:20:00 +01:00
Mateusz Jablonski
a2742492ab feature: enable xe drm detection by default
driver is built with xe drm support by default

added cmake flag to control xe eu debug API support
NEO_ENABLE_XE_EU_DEBUG_SUPPORT

This flag is disabled by default and uapi-eu-debug headers are not
needed for driver compilation as these headers are not a part of
upstream kernel yet.

Related-To: NEO-10780

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-19 08:47:13 +01:00
Compute-Runtime-Validation
fb838afe42 Revert "build: level_zero revision update"
This reverts commit 43a8d349ff.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-18 17:21:07 +01:00
Bartlomiej Wolny
7db5f6d5bf fix: add texture cache flush on image dtor
Related-To: NEO-9794
Signed-off-by: Bartlomiej Wolny <bartlomiej.wolny@intel.com>
2024-03-18 17:03:17 +01:00
Naklicki, Mateusz
e9ee54da24 fix: prepare L3 cache banks getter
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2024-03-18 15:12:16 +01:00
Zbigniew Zdanowicz
d1041e2335 feature: find cpu base address from all command buffers of container
Related-To: NEO-10381

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-18 13:43:08 +01:00
Mateusz Jablonski
6b33d91140 fix: remove not needed check for context param engine count
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-18 13:17:05 +01:00
Mateusz Jablonski
19dcc80e44 Revert "build: enable xe drm detection by default"
This reverts commit 973757a58d.

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-18 09:04:59 +01:00
ocldev
a022acbdf3 feature: brand string update
Related-To: NEO-7929

Updating device brand strings
Based on public support in Windows driver 31.0.101.5319
https://www.intel.com/content/www/us/en/download/741626/

Signed-off-by: ocldev <ocldev@intel.com>
2024-03-18 08:41:00 +01:00
ocldev
9bf86d206d build: dependencies update
Signed-off-by: ocldev <ocldev@intel.com>
2024-03-18 06:15:16 +01:00
ocldev
43a8d349ff build: level_zero revision update
Signed-off-by: ocldev <ocldev@intel.com>
2024-03-17 22:47:22 +01:00
Compute-Runtime-Validation
016c234893 Revert "feature: disable scratch page by default"
This reverts commit dab5469f81.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-16 01:52:00 +01:00
Pawel Cieslak
d2cc12ecfb ci: infra revision update
Signed-off-by: Pawel Cieslak <pawel.cieslak@intel.com>
2024-03-15 18:46:22 +01:00
Mateusz Jablonski
1e1d675606 fix: disable passing FtrTile64Optimization to gmmlib
add debug key to control if the value should be passed

Related-To: NEO-10785
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-15 17:42:53 +01:00
Aleksander Czerwionka
f7342ff910 fix: Add additional ADL-N device ids
Related-To: NEO-10610

Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-03-15 17:34:17 +01:00
Naklicki, Mateusz
adae21b103 fix: prepare L3 cache size getter
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2024-03-15 16:55:45 +01:00
Mateusz Jablonski
1319ab4efc refactor: don't setup struct members with designated initializers
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-15 16:36:00 +01:00
Mateusz Jablonski
e21180992f fix: remove not needed check for engine instance count
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-15 16:16:21 +01:00
Mateusz Jablonski
0270cd6a5b fix: respect gt id when getting engines for drm context under xe kmd
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-15 16:02:47 +01:00
Bellekallu Rajkiran
9736313d10 feature: Support for ccs mode configuration via SysFs
Add support for configuring ccs mode for all applicable devices
before KMD is loaded.

Use ZEX_NUMBER_OF_CCS to configure ccs mode.

Format is as follows:

ZEX_NUMBER_OF_CCS=NumberOfCcs i,e Setting ZEX_NUMBER_OF_CCS
to 4 sets ccs mode to 4 for all devices for which configuration
is supported.

Related-To: NEO-10378

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2024-03-15 15:51:45 +01:00
Zbigniew Zdanowicz
f496233462 refactor: use single vector to store used heap allocations
Related-To: NEO-10483

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-15 15:01:36 +01:00
Young Jin Yoon
dab5469f81 feature: disable scratch page by default
Modified default values for disableScratch and gpuPageFault
to true and 10 respectively in drm_nep.cpp, in order to
disable scratch pages by default.

Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-15 11:44:10 +01:00
Young Jin Yoon
9633f49dab fix: make gpuFaultCheckCounter more robust
Modified drm_neo.h and .cpp to check when condition is greater
than and equal to instead of equal, and changed gpuFaultCheckCounter
to be atomic

Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-15 10:40:12 +01:00
Mateusz Jablonski
470def9be6 test: fix variable naming
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-15 10:06:46 +01:00
Young Jin Yoon
82728ff394 feature: add logic to iterate for all contexts to check GPU pagefault
Implemented to go through entire contexts in the process and then query
reset status to check the unexpected GPU segfault.

Added a new debug variable GpuFaultCheckThreshold to change the checking
frequency for each hang check for performance analysis.

Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-15 07:48:39 +01:00
ocldev
5111f30116 build: dependencies update
Signed-off-by: ocldev <ocldev@intel.com>
2024-03-15 06:13:55 +01:00
Compute-Runtime-Validation
94cc48f81b Revert "fix: don't use fake userptr flag in ioctl helper xe"
This reverts commit d3ab256f55.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-15 03:08:01 +01:00
Compute-Runtime-Validation
e11917cfcd Revert "fix: remove not needed checks in ioctl helper xe"
This reverts commit 5a6d0b21ac.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-14 21:38:09 +01:00
Artur Harasimiuk
64d13d2396 ci: infra revision update
Related-To: NEO-10758
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-03-14 19:20:11 +01:00
Mateusz Jablonski
d3ab256f55 fix: don't use fake userptr flag in ioctl helper xe
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-14 18:41:17 +01:00
Mateusz Jablonski
5a6d0b21ac fix: remove not needed checks in ioctl helper xe
pass gt id to contextSetParam

Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-14 18:14:50 +01:00
Dominik Dabek
c27d601c34 refactor: use actual pool address in usm pool
Pass actual pool address to heap allocator. This removes the need to
calculate pooled pointer from pool address and offset.

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-14 17:31:15 +01:00
Neil R. Spruit
b5f8a38f19 feature: Enable Per IP euStall Functionality
Related-To: NEO-10220

Signed-off-by: Neil R. Spruit <neil.r.spruit@intel.com>
2024-03-14 16:49:52 +01:00
Zbigniew Zdanowicz
8fe1a460f8 refactor: simplify isDcFlushAllowed implementation
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-14 15:09:39 +01:00
Compute-Runtime-Validation
ef7dbc99f1 Revert "fix: don't use fake userptr flag in ioctl helper xe"
This reverts commit 98824fdaf6.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-14 14:35:14 +01:00
Mateusz Jablonski
833fa6bce1 fix: correct querying engines from xe kmd
we get drm_xe_query_engines, not array of drm_xe_engine_class_instance

Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-14 12:06:25 +01:00
Fabian Zwolinski
eb259c1f52 fix: set kernel LWS prior to obtaining indirect heaps
When calculating the size of the indirect object heap,
the local work group size from kernel implicit args is taken into account.
If the LWS is not set before this calculation,
it can lead to insufficient ioh allocation size.
Such a problem is seen when local ids are generated by the runtime
and then written to ioh. The write fails due to lack of space in the allocation.

Related-To: IGC-7708
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-03-14 10:38:46 +01:00
Mateusz Jablonski
98824fdaf6 fix: don't use fake userptr flag in ioctl helper xe
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-14 10:07:38 +01:00
Zbigniew Zdanowicz
9815f1e99b refactor: group template implementations and change inl file names
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-14 08:38:05 +01:00
Joshua Santosh Ranjan
a730cf17c0 fix: fix static initialization in zello metrics
Related-To: NEO-10764

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2024-03-14 08:25:42 +01:00
Young Jin Yoon
7b81c4e08f feature: abort when unexpected GPU page fault detected
If ResetStats from i915 is from the GPU page fault, abort
the entire process instead of disabling engines.
Added a fallback mechanism when prelim_drm_i915_reset_stats
fails.

Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-14 08:14:59 +01:00