Commit Graph

4225 Commits

Author SHA1 Message Date
Morek, Szymon 9ca2091725 performance: Reuse GPU timestamp instead of KMD escape
Resolves: NEO-10615

Signed-off-by: Morek, Szymon <szymon.morek@intel.com>
2024-04-17 09:39:29 +02:00
Mateusz Jablonski 15d7a31148 refactor: add separate function to create IoctlHelperXe
unify ioctl helper xe logic

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-17 08:52:30 +02:00
Maciej Bielski 9903009889 refactor: UT cleanup - remove unneeded DrmMockXe2T
Related-To: NEO-9754
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-04-16 18:12:13 +02:00
Jaroslaw Warchulski ca70bb99b2 refactor: reduce the number of DrmAllocation constructors
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-04-16 15:37:09 +02:00
Bartosz Dunajski 4f52fd504a feature: return experimental sync queue extension if supported
Related-To: NEO-8171

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-16 14:48:22 +02:00
Maciej Bielski c2b4699cb6 feature: Add mask of tiles to each memory region in Xe
* Derive that information from GT list
* Use it to compute local memory size given the mask of tiles to account
for
* Leverage IoctlHelper to dispatch execution (i915 vs Xe)

Related-To: NEO-9754
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-04-16 14:22:40 +02:00
Lukasz Jobczyk a230c762e0 feature: Adjust PATs for dc flush mitigation
Related-To: NEO-10556

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-04-16 13:31:02 +02:00
Kamil Kopryk 52b4c75f68 refactor: optimize checking if heapless state init is enabled
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-04-15 19:09:25 +02:00
Katarzyna Cencelewska ce3bb1327e feature: add method to switch enable/disable mid thread preemption
Resolves: NEO-8089

Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-04-15 16:26:52 +02:00
Jemale Lockett dd884a0733 fix: stop using gpr15 in non-debug context
Resolves: NEO-11027

Signed-off-by: Jemale Lockett <jemale.lockett@intel.com>
2024-04-15 12:33:51 +02:00
Mateusz Jablonski 5b6acb4f3f fix: ignore unrecognized topology type in xe path
Related-To: NEO-11125
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-15 12:16:38 +02:00
Mateusz Jablonski ff4a919004 fix: correct gt id in context param engines in xe path
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-15 12:07:48 +02:00
Mateusz Jablonski 62390d3def feature: add number of l3 banks to TopologyData
Related-To: NEO-11125
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-15 11:20:43 +02:00
Mateusz Jablonski f676f0c539 test: don't use drm query mock if i915 prelim detection is disabled
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-15 10:58:11 +02:00
Mateusz Jablonski 635d495767 fix: correct gtId when getting path to frequency files in xe path
Related-To: NEO-11111
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-15 10:44:06 +02:00
Mateusz Jablonski 27398c2b6f fix: get timestamp frequency from default engine in xe path
Related-To: NEO-11111
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-15 10:36:08 +02:00
Aleksander Czerwionka b630f58861 fix: remove incorrectly reported extensions
Related-To: NEO-8349

Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-04-15 10:04:31 +02:00
Lukasz Jobczyk 8a0c425495 feature: Mark selected resources as UC when mitigating dc flush
Related-To: NEO-10556

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-04-12 12:52:13 +02:00
Mateusz Jablonski b5435f7dfe fix: store queried gt list data in ioctl helper xe
with this commit we query the info only once

Related-To: NEO-11111
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-12 12:20:34 +02:00
Mateusz Jablonski 8eb17a843b fix: correct path to frequency files in xe path
Related-To: NEO-11111
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-12 11:49:48 +02:00
Mateusz Jablonski 5e8c71ca3d refactor: remove not needed members from ioctl helper xe
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-12 11:35:54 +02:00
Zbigniew Zdanowicz 3b9e1e4b32 refactor: add getter to in order exec info class
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-04-12 10:17:39 +02:00
Morek, Szymon 016529ac12 performance: Set WB cache mode for sysmem
Related-To: NEO-10867

Signed-off-by: Morek, Szymon <szymon.morek@intel.com>
2024-04-12 09:01:56 +02:00
Bartosz Dunajski ca9df372c1 fix: enable device->host bcs mi_fence on PVC
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-11 20:35:22 +02:00
Mateusz Hoppe a9366fff48 refactor: print message with location when cache is enabled
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-04-11 17:08:39 +02:00
Mateusz Jablonski b24ad6a1b0 fix: allocate in system memory when system memory bitfield is set in properties
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-11 13:47:57 +02:00
shubham kumar 6a55bbe6cd feature(sysman): Add pmt support in windows
Related-To: NEO-8804

Signed-off-by: shubham kumar <shubham.kumar@intel.com>
2024-04-10 12:42:19 +02:00
Compute-Runtime-Validation 5288100d86 Revert "feature(sysman): Add pmt support in windows"
This reverts commit ebf88c92ca.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-04-09 17:37:41 +02:00
Mateusz Jablonski cb2b572e94 feature: add support for null aub mode
In this mode AUB csr will be created, however, no aub file will be created

Related-To: NEO-11097
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-09 16:59:42 +02:00
Compute-Runtime-Validation 9568ee47e7 Revert "fix: remove compiler cache legacy implementation"
This reverts commit 864f42116c.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-04-09 02:06:51 +02:00
Weronika Kapusta 864f42116c fix: remove compiler cache legacy implementation
Related-To: NEO-10679
Signed-off-by: Kapusta, Weronika <weronika.kapusta@intel.com>
2024-04-08 16:40:57 +02:00
Bartosz Dunajski ef35c5cb62 feature: program device to host fence in ocl path
Related-To: NEO-10417

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-08 14:57:16 +02:00
Bartosz Dunajski 1627172e28 feature: add helper to check if device to host copy fence is required
Related-To: NEO-10417

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-08 11:18:44 +02:00
Compute-Runtime-Validation 59849e9e92 Revert "performance: Remove tlb flush on DG2 Linux"
This reverts commit 7412b4823f.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-04-07 05:35:05 +02:00
Mateusz Hoppe d6675b14b3 fix: release secondaryContexts prior to primary context
Related-To: NEO-10920

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-04-05 16:34:39 +02:00
Bartosz Dunajski 6e762eb3bf test: multitile sync dispatch aub test
Related-To: NEO-8171

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-05 14:52:39 +02:00
Lukasz Jobczyk 7412b4823f performance: Remove tlb flush on DG2 Linux
Resolves: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-04-05 14:17:45 +02:00
Bartosz Dunajski a83f3f7bc1 refactor: multicontext aub fixture api agnostic + improve createModuleFromFile
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-04 15:50:40 +02:00
Mateusz Jablonski 9468915768 fix: correct preemption support in xe path
preemption is always supported by xe kmd

Related-To: NEO-10496, HSD-18037744953
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-04 13:29:02 +02:00
Maciej Plewka c6960ac947 feature: Bind ring buffer allocations as read only
Related-To: NEO-10398
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-04-04 11:23:15 +02:00
shubham kumar ebf88c92ca feature(sysman): Add pmt support in windows
Related-To: NEO-8804

Signed-off-by: shubham kumar <shubham.kumar@intel.com>
2024-04-04 11:08:58 +02:00
Young Jin Yoon 907129bb33 feature: disable scratch page by default
Modified default values for disableScratch and gpuPageFault
to true and 10 respectively in drm_neo.cpp, in order to
disable scratch pages by default.
Modified to set gpuPageFault to 0 as a default value when
scratch page is not disabled.

Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-04-04 09:50:02 +02:00
Zbigniew Zdanowicz 73d558058c feature: add heapless and global stateless scratch address patching
Related-To: NEO-10381

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-04-03 17:04:35 +02:00
Dominik Dabek 9c601f8dc2 fix: increase usm pool chunk alignment
Increase chunk alignment from 512 to 4096.
Adjust tests to account for actual allocation size different than
requested.

Related-To: NEO-11005

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-04-03 13:17:49 +02:00
Plewka, Maciej 5c15aaf48a feature: Bind command buffer allocations as read only
Related-To: NEO-10398
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-04-02 16:08:48 +02:00
Kamil Kopryk 4eae28bd64 feature: introduce heapless state init in L0
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-04-02 12:34:53 +02:00
Dominik Dabek b109094e4b fix: move ulls controller stop thread
Move ulls controller thread stopping to execution environment
destructor. This removes the vptr race from ulls controller destructor.

Put tests that are actually running the ulls controller into multi thread
tests.

That way they are compiled with thread sanitizer and removes the
variable test time from traditional ULTS.

Related-To: NEO-10942

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-04-02 08:26:19 +02:00
Mateusz Jablonski 35788e2c7f test: unify per test case timeout in aub tests
setup timeout to 180s

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-29 09:50:25 +01:00
Mateusz Hoppe f86d4220a5 feature: add bindless samplers support to level zero
- samplers using bindless adressing require patching bindless offsets to
sampler states on kernel's cross thread data

Related-To: NEO-10505

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-29 08:07:28 +01:00
Mateusz Jablonski 420e1391b2 fix: handle not aligned gtt size reported by i915
when i915 reports gtt size between 47 and 48 bits we consider
it as 48 bit VA space

Related-To: GSD-8215
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-29 07:51:06 +01:00
Young Jin Yoon d6a14d4ed5 feature: support explicit memory locking
Added lockMemory in context to explicitly locking memory,
Added a boolean flag in graphics_allocation to indicate the allocation
is locked, and modified memory_operations_handler to add lock().

Related-To: NEO-8277
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-29 07:31:22 +01:00
Krzysztof Gibala 2374ffa915 refactor: print errno from mmap and munmap calls
Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com>
2024-03-28 19:05:54 +01:00
Jemale Lockett 6d945ad28b fix: Update topology mapping
Updates topology mapping to handle
case when no dual subslices

Resolves: NEO-10670

Signed-off-by: Jemale Lockett <jemale.lockett@intel.com>
2024-03-28 19:03:05 +01:00
Kacper Nowak 35ca2bbea9 refactor: refactor logic for querying atomic FP capabilities
- Move logic from product helper to compiler product helper
- Add method for adjusting fp16 and extra capabilities using release
helper (if present).


Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-03-28 15:27:45 +01:00
Mateusz Jablonski bcaa62bed3 build: refactor linux kernel headers
Add cmake flags to provide path to headers to drm / i915 /i915 prelim / xe

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-28 14:48:09 +01:00
Dominik Dabek a7479e6332 fix: ulls controller state and tests
Make the fields in direct submission state atomic.
Add thread yields to waiting loop in tests to avoid compiler optimizing
them away.

Related-To: NEO-10942

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-28 14:38:33 +01:00
Compute-Runtime-Validation e3f50e8aa9 Revert "fix: handle not aligned gtt size reported by i915"
This reverts commit dae901c13f.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-28 12:03:23 +01:00
Zbigniew Zdanowicz 489ef2a310 feature: add getter of walker inline data offset
Related-To: NEO-10381

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-28 09:39:49 +01:00
Mateusz Jablonski 3b1e6b61e4 fix: remove setting scratch page flag in xe path
Related-To: HSD-18037680815
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-28 09:16:48 +01:00
Mateusz Jablonski dae901c13f fix: handle not aligned gtt size reported by i915
when i915 reports gtt size between 47 and 48 bits we consider
it as 48 bit VA space

Related-To: GSD-8215
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-28 08:46:53 +01:00
Mateusz Jablonski e435a5cac4 fix: correct timeout for xe user fence
Related-To: HSD-18037675531

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-28 08:34:24 +01:00
Mateusz Jablonski 34a80e915c fix: correct xe kmd flag for bind capture
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-27 19:37:56 +01:00
Bartosz Dunajski c3546f9e62 refactor: enable 64b mi_atomic operands and add cmp/wr operation
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-27 18:56:40 +01:00
Maciej Plewka 3d9e95d177 feature: Bind isa allocations as read only
Related-To: NEO-10398
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-03-27 17:39:46 +01:00
Mateusz Jablonski 2eecf29278 fix: correct setting scratch page in xe path
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-27 16:44:21 +01:00
Maciej Plewka b722f3b579 feature: Add interface to bind resources as readonly
Related-To: NEO-10398
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-03-27 14:24:58 +01:00
Mateusz Jablonski 3914f60c96 refactor: remove not needed hardcoded values in xe path
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-27 10:58:31 +01:00
Compute-Runtime-Validation 8e44a46983 Revert "feature: bind resources as read only"
This reverts commit f3d36d3350.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-27 08:51:47 +01:00
Dominik Dabek 295b58a11f fix: move ulls controller thread start
Move starting ulls controller thread outside constructor to avoid vptr
race.

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-27 08:06:50 +01:00
Mateusz Jablonski 5b6dd87fc3 refactor: remove not needed hardcoded values in xe path
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-27 07:21:50 +01:00
Mateusz Jablonski 0b57b0fb7a fix: remove invalid hardcoded flags for vm create in xe path
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-26 21:36:19 +01:00
Mateusz Hoppe 420a6092ae test: add test for config query in XE
Related-To: NEO-10168

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-26 19:43:11 +01:00
Mateusz Jablonski 1f96e6e251 fix: update uapi headers version to 6.8.1
source of files: https://git.kernel.org/stable/h/v6.8.1

Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-26 16:08:32 +01:00
Bartosz Dunajski fce016ee7a refactor: add new semaphore param
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-26 15:46:47 +01:00
Maciej Plewka f3d36d3350 feature: bind resources as read only
Related-to: NEO-10398
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-03-26 14:11:57 +01:00
Dominik Dabek 562d6f114b performance: helper for adjust ULLS on battery
Related-To: NEO-10800

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-26 12:57:30 +01:00
Bartosz Dunajski 74dcb5a9e3 feature: initial support for sync dispatch token allocation
Related-To: NEO-8171

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-26 12:07:58 +01:00
Brandon Yates 87022c31d3 feature: Register critical section metadata for xe debugger
Related-to: NEO-9161

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-03-25 20:56:08 +01:00
Mateusz Hoppe 72725e6ba9 feature: set priority for lowPriority contexts
Related-To: NEO-10168

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-25 20:19:43 +01:00
Young Jin Yoon 068f6a25c6 Revert "feature: support explicit memory locking"
This reverts commit 27a3307bb0.

Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-25 20:10:20 +01:00
Mateusz Jablonski 78a4a92b44 refactor: reorder members to reduce internal padding in structs
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-25 15:50:00 +01:00
Mateusz Jablonski 6af9856e33 fix: reorder members in BufferObject class
Related-To: HSD-13011781488
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-25 11:44:38 +01:00
Young Jin Yoon 27a3307bb0 feature: support explicit memory locking
Added lockMemory in context to explicitly locking memory,
Added a boolean flag in graphics_allocation to indicate the allocation
is locked, and modified memory_operations_handler to add lock().
Change the logic to work correctly with makeResident() when lock() is
called previously for the same memory region

Related-To: NEO-8277
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-25 09:49:18 +01:00
Compute-Runtime-Validation df164174b4 Revert "fix: pass FtrTile64Optimization as-is"
This reverts commit 22d08dabc4.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-22 23:56:28 +01:00
Mateusz Jablonski 8d38a0c726 fix: remove not needed hardcoded caps reported by ioctl helper xe
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-22 19:00:28 +01:00
Krzysztof Gibala 4d797890af fix: Use FtrHwScheduling for setting ftrWddmHwQueues
Related-To: NEO-10212

Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com>
2024-03-22 18:36:24 +01:00
Mateusz Jablonski d94be09020 refactor: remove not needed check for exec softpin
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-22 17:30:49 +01:00
Bartosz Dunajski 9aa81bae75 feature: initial support to enable synchronized dispatch
Related-To: NEO-8171

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-22 17:23:58 +01:00
Mateusz Jablonski 8e6240ac70 fix: don't use fake userptr flag in ioctl helper xe
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-22 15:59:56 +01:00
Compute-Runtime-Validation a75db375a1 Revert "fix: remove hardcoded caps reported by ioctl helper xe"
This reverts commit 0b2e8e2848.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-22 15:15:19 +01:00
Dominik Dabek 2b964254d6 performance: debug key for adjust ULLS on battery
ULLS controller timeout settings will be adjusted based on ac line
status and lowest queue throttle from submissions.

Lowest queue throttle is reset when controller stops ULLS.

Related-To: NEO-10800

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-22 14:24:00 +01:00
Mateusz Jablonski ec19ce536a refactor: store userptr value in buffer object
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-22 12:49:13 +01:00
Katarzyna Cencelewska da7b03dd15 fix: to always use grfs count in calculateNumThreadsPerThreadGroup
grf size != grf count

Related-To: GSD-8437
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-03-22 11:03:18 +01:00
Mateusz Jablonski 0b2e8e2848 fix: remove hardcoded caps reported by ioctl helper xe
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-22 10:41:54 +01:00
Naklicki, Mateusz 021f4c6299 refactor: use more precise function name
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2024-03-22 10:09:41 +01:00
Katarzyna Cencelewska dd1d52259e refactor: add param rootDeviceEnvironment to calculateNumThreadsPerThreadGroup
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-03-21 22:25:14 +01:00
Young Jin Yoon ec009cf9e3 fix: abort only when disabling scratch page
Modifed getResetStatus to abort only when scratch page is disabled
Removed an incorrect UNRECOVERABLE_IF statement based on the status:
validPageFault can be true when banned flag is not set, if CAT error
does not occur as a result of page fault.

Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-21 21:55:25 +01:00
Mateusz Jablonski 4df0dd7894 fix: remove hardcoded caps reported by ioctl helper xe
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-21 21:09:20 +01:00
Mateusz Jablonski 92d37b20a6 fix: setup gpu address space based on config info from xe kmd
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-21 18:49:19 +01:00
Mateusz Jablonski 1e343053ba refactor: remove redundant recreating vector of engines in xe kmd path
make ContextParamEngine structure more generic and populate engines
by drm specific methods

Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-21 17:55:39 +01:00
Zbigniew Zdanowicz 12affba420 feature: add override key to change command list update capability
Related-To: NEO-10062

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-21 17:11:27 +01:00
Mateusz Jablonski 22d08dabc4 fix: pass FtrTile64Optimization as-is
Related-To: NEO-10623
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-21 16:52:27 +01:00
Aravind Gopalakrishnan 04b99de4d6 refactor: Force tlb flush during TC after copy
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-03-21 07:25:46 +01:00
Mateusz Hoppe 0cec5ccc32 fix: check if any of devices has BindlessHeapHelper enabled
- report Bindless_images extension in driver only when at least one
device has global bindless heaps enabled

Related-To: NEO-10352

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-20 20:50:58 +01:00
Mateusz Jablonski 614ed7ff5f fix: remove not needed appendBaseAddressOffset method
Related-To: NEO-10623
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-20 17:46:14 +01:00
Wenju He 03078541d7 feature: make global bindless heaps resident when created
Make bindless heaps resident right after heap allocation.
Motivation is that SYCL bindless image can be passed as a value argument
or through memory. Therefore, we're not able to make its bindless heap
resident during kernel initialization or setting kernel arguments.

This fixes SYCL bindless image read_write_*D.cpp tests on DG2.

Related-To: NEO-7063
Signed-off-by: Wenju He <wenju.he@intel.com>
2024-03-20 14:40:11 +01:00
Mateusz Hoppe 27b930cabc refactor: allow default setting for UseExternalAllocatorForSshAndDsh
- value of -1 selects driver default setting for external allocator

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-20 12:29:56 +01:00
Raiyan Latif 40471da591 fix: properly retrieve sub-device count in allocateRTDispatchGlobals
Previous method used for retrieiving sub-device count in this path
did not take into account device affinity mask nor device hiearchy
mode, resulting in segmentation faults when attempting to allocate
rtDispatchGlobals structure using improper deviceBitFields.

Related-To: NEO-8422

Signed-off-by: Raiyan Latif <raiyan.latif@intel.com>
2024-03-20 09:23:30 +01:00
Joshua Santosh Ranjan 06fcdd28f3 feature: add debug flag for metrics logs
Related-To: NEO-10125

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2024-03-19 12:33:26 +01:00
Mateusz Jablonski e9e41714b4 test: restore maxOsContextCount after test cleanup
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-19 12:00:37 +01:00
Jaroslaw Warchulski 7c5640669c fix: incorrect check in parseAffinityMask function
Resolves: GSD-8333
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-03-19 09:20:00 +01:00
Mateusz Jablonski a2742492ab feature: enable xe drm detection by default
driver is built with xe drm support by default

added cmake flag to control xe eu debug API support
NEO_ENABLE_XE_EU_DEBUG_SUPPORT

This flag is disabled by default and uapi-eu-debug headers are not
needed for driver compilation as these headers are not a part of
upstream kernel yet.

Related-To: NEO-10780

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-19 08:47:13 +01:00
Naklicki, Mateusz e9ee54da24 fix: prepare L3 cache banks getter
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2024-03-18 15:12:16 +01:00
Zbigniew Zdanowicz d1041e2335 feature: find cpu base address from all command buffers of container
Related-To: NEO-10381

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-18 13:43:08 +01:00
Mateusz Jablonski 19dcc80e44 Revert "build: enable xe drm detection by default"
This reverts commit 973757a58d.

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-18 09:04:59 +01:00
Compute-Runtime-Validation 016c234893 Revert "feature: disable scratch page by default"
This reverts commit dab5469f81.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-16 01:52:00 +01:00
Mateusz Jablonski 1e1d675606 fix: disable passing FtrTile64Optimization to gmmlib
add debug key to control if the value should be passed

Related-To: NEO-10785
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-15 17:42:53 +01:00
Naklicki, Mateusz adae21b103 fix: prepare L3 cache size getter
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2024-03-15 16:55:45 +01:00
Mateusz Jablonski 0270cd6a5b fix: respect gt id when getting engines for drm context under xe kmd
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-15 16:02:47 +01:00
Bellekallu Rajkiran 9736313d10 feature: Support for ccs mode configuration via SysFs
Add support for configuring ccs mode for all applicable devices
before KMD is loaded.

Use ZEX_NUMBER_OF_CCS to configure ccs mode.

Format is as follows:

ZEX_NUMBER_OF_CCS=NumberOfCcs i,e Setting ZEX_NUMBER_OF_CCS
to 4 sets ccs mode to 4 for all devices for which configuration
is supported.

Related-To: NEO-10378

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2024-03-15 15:51:45 +01:00
Zbigniew Zdanowicz f496233462 refactor: use single vector to store used heap allocations
Related-To: NEO-10483

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-15 15:01:36 +01:00
Young Jin Yoon dab5469f81 feature: disable scratch page by default
Modified default values for disableScratch and gpuPageFault
to true and 10 respectively in drm_nep.cpp, in order to
disable scratch pages by default.

Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-15 11:44:10 +01:00
Mateusz Jablonski 470def9be6 test: fix variable naming
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-15 10:06:46 +01:00
Young Jin Yoon 82728ff394 feature: add logic to iterate for all contexts to check GPU pagefault
Implemented to go through entire contexts in the process and then query
reset status to check the unexpected GPU segfault.

Added a new debug variable GpuFaultCheckThreshold to change the checking
frequency for each hang check for performance analysis.

Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-15 07:48:39 +01:00
Compute-Runtime-Validation 94cc48f81b Revert "fix: don't use fake userptr flag in ioctl helper xe"
This reverts commit d3ab256f55.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-15 03:08:01 +01:00
Mateusz Jablonski d3ab256f55 fix: don't use fake userptr flag in ioctl helper xe
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-14 18:41:17 +01:00
Neil R. Spruit b5f8a38f19 feature: Enable Per IP euStall Functionality
Related-To: NEO-10220

Signed-off-by: Neil R. Spruit <neil.r.spruit@intel.com>
2024-03-14 16:49:52 +01:00
Compute-Runtime-Validation ef7dbc99f1 Revert "fix: don't use fake userptr flag in ioctl helper xe"
This reverts commit 98824fdaf6.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-14 14:35:14 +01:00
Mateusz Jablonski 833fa6bce1 fix: correct querying engines from xe kmd
we get drm_xe_query_engines, not array of drm_xe_engine_class_instance

Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-14 12:06:25 +01:00
Mateusz Jablonski 98824fdaf6 fix: don't use fake userptr flag in ioctl helper xe
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-14 10:07:38 +01:00
Young Jin Yoon 7b81c4e08f feature: abort when unexpected GPU page fault detected
If ResetStats from i915 is from the GPU page fault, abort
the entire process instead of disabling engines.
Added a fallback mechanism when prelim_drm_i915_reset_stats
fails.

Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-14 08:14:59 +01:00
Mateusz Jablonski 0210e37f03 fix: respect gt id when finding xe engine info
Related-To: NEO-10496
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-13 20:52:36 +01:00
Bartosz Dunajski e400a5c624 feature: in-order host counter allocation pooling 2
Related-To: NEO-10507

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-13 18:59:02 +01:00
Compute-Runtime-Validation 9cce1183cd Revert "feature: use prelim reset_stats for detailed statisics"
This reverts commit 835dc8b594.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-13 14:31:57 +01:00
Aravind Gopalakrishnan 3f20dd3b49 refactor: Add optional user fence during unbind
Add optional fence and wait operations after unbind operation.

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-03-13 12:47:44 +01:00
Young Jin Yoon 835dc8b594 feature: use prelim reset_stats for detailed statisics
Added getResetStats() in ioctl_helper.h to support extended header for
prelim_drm_i915_reset_stats.
Added new data structure to capture the fault data structure for prelim.

Related-To: GSD-5673
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-03-13 11:37:04 +01:00
Francois Dugast 5483e466e8 fix: Align on strings returned for unknown values
Related-to: NEO-10321

Signed-off-by: Francois Dugast <francois.dugast@intel.com>
2024-03-13 11:21:51 +01:00
Dominik Dabek d664995221 performance: enable localPreferred on dg2
Use localPreferred for buffer, svmGpu and image allocations on dg2.

Related-To: NEO-9695

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-12 22:47:47 +01:00
Ben Ashbaugh e00230a6c2 fix: correctly report support for SPIR-V 1.0 through 1.3
Related-To: NEO-10336

Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2024-03-12 17:54:09 +01:00
Kamil Kopryk 626b3846b0 refactor: change compute walker to defaultWalkerType
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-03-12 17:52:43 +01:00
Lukasz Jobczyk c3f1eba24a refactor: Add flag to control DC flush
Related-To: NEO-10556

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-12 14:54:16 +01:00
Mrozek, Michal ee1a225a41 refactor: remove not used debug variables
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-12 10:12:23 +01:00
Dominik Dabek 8bc17778c4 test: refactor prealloc cmd buffer tests
Related-To: NEO-10526

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-12 09:28:53 +01:00
Zbigniew Zdanowicz 3f740f3460 feature: add new functionality to in order patch helpers
Related-To: NEO-10385

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-12 09:17:53 +01:00
Mateusz Jablonski 973757a58d build: enable xe drm detection by default
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-11 14:29:20 +01:00
Dominik Dabek 5ba9308804 performance: debug flag for localPreferred
Add flag for setting localPreferred (implicit when gmm localOnly=0 and
NonLocalOnly=0) when allocating buffer, svmGpu and image.

Related-To: NEO-9695

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-11 10:51:49 +01:00
Compute-Runtime-Validation 4082e9f028 Revert "feature: in-order host counter allocation pooling"
This reverts commit 2f03c48c7a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-09 19:47:10 +01:00
Mateusz Hoppe cb7ac1ada0 feature: add debug key to generate sip header file
- header file can be used with LoadBinarySipFromFile

Related-To: GSD-8253

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-08 19:03:43 +01:00
Mrozek, Michal f71f6d2b72 refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 18:18:55 +01:00
Mrozek, Michal 13a347cbde refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 15:10:06 +01:00
Mateusz Hoppe 5e15543b14 test: improve unit tests - use virtual file system
- use virtualFileSystem in ULTs, writeDataToFile() does not create file
but instead saves filename. Call to fileExists() checks if file was
previously written

Related-To: NEO-7006

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-08 13:34:19 +01:00
Dunajski, Bartosz 2f03c48c7a feature: in-order host counter allocation pooling
Related-To: NEO-10507

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-08 13:13:04 +01:00
Kamil Kopryk 168445784e feature: introduce states programming at driver init heapless ocl
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-03-08 12:29:44 +01:00
Mrozek, Michal c72b9ec448 refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 08:53:13 +01:00
Mrozek, Michal 10313b7b84 refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-07 18:50:16 +01:00
Lukasz Jobczyk 6d1a3d404e refactor: Add helper to control flat ring buffer
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-07 17:52:23 +01:00
Mateusz Jablonski 8e7e5ca58f fix: return same UUID format independently on pci bus info presence
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-07 16:56:30 +01:00
Michal Mrozek e08668b982 refactor: remove not needed code.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-03-07 13:02:29 +01:00
Zbigniew Zdanowicz fe377b6597 feature: add out cmd pointer to store data imm encoder
Related-To: NEO-10385

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-07 12:52:08 +01:00
Bartosz Dunajski 79d80047ef refactor: improve mmap logging logic
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-07 12:15:39 +01:00
Michal Mrozek 660539fe4e refactor: remove not needed code.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-03-06 20:13:57 +01:00
Mateusz Jablonski debbf87671 fix: move ARL-U device id to MTL-M release
Related-To: NEO-10404
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-06 17:33:00 +01:00
Dominik Dabek 033ff28609 performance: enable cmd buffer preallocate xehp+
For L0 immediate cmdlists try to use reusable allocations if available.

Related-To: NEO-10526

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-06 16:53:01 +01:00
Mateusz Jablonski 8ae4a3bc7a fix: pass Sku/Wa tables for gmm without additional translations on Windows
Related-To: NEO-10623
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-06 14:58:58 +01:00
Bartosz Dunajski fcd57f94cf refactor: capability to print mmap and munmap calls
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-06 14:29:01 +01:00
Brandon Yates 7a0d2df2fe fix: Handle Pat Index Ext not supported on Xe
Xe does not support VmBindPatIndexExtension. This patch
fixes the handling of this case and prevents corrupting
other extensions

Related-to: NEO-9674

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-03-06 11:18:31 +01:00
Filip Hazubski d25026b263 refactor: Add getTotalMemBankSize function to ReleaseHelper
Minor refactor of ULTs to not use hard coded banks size.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-03-06 09:53:56 +01:00
Dominik Dabek a04c67ec52 performance(ocl): refactor pool allocators tests
add explicit tests for xe hpc
Related-To: NEO-9700

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-05 18:48:55 +01:00
Mateusz Jablonski 8b4bca89bc refactor: move ray tracing support capability to release helper
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-05 17:30:57 +01:00
Filip Hazubski 5f7e56e78b refactor: Unify aub config generation
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-03-05 11:09:34 +01:00
Compute-Runtime-Validation 293858320e Revert "performance: Fill reusable allocation list on xe and later"
This reverts commit 15934ceb43.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-02 07:09:45 +01:00
Fabian Zwolinski 52430762ac fix: cl_cache L0 env vars + refactor code structure
Added support for new Compiler Cache
environment variables in Level Zero.

Moved
`opencl/source/compiler_interface/default_cache_config.cpp`
`level_zero/core/source/compiler_interface/default_cache_config.cpp`
to shared directory
`source/compiler_interface/default_cache_config.cpp`

Switched enabling cache by default from per OS to per API.
Changed default state of cl_cache in Level Zero to disabled.

Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-03-01 17:35:08 +01:00
Dunajski, Bartosz 51ae76a25f refactor: improve handling of in-order atomic signaling
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-01 15:59:25 +01:00
Lukasz Jobczyk bd6925d51a refactor: Exclude thread dispatch algorithm for overdispatch to function
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-01 15:04:04 +01:00
Mateusz Jablonski a8fbed6120 feature: enable dummy blit WA for PVC
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-01 14:04:46 +01:00
Dunajski, Bartosz ea2ad550a1 refactor: improve handling duplicated in-order host storage
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-01 13:54:28 +01:00
Mateusz Jablonski 80aa55a3cb fix: don't program dummy blit prior to MI_FLUSH_DW without postsync
add missing dummy blits before MI_FLUSH_DW with postsync

Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-01 12:36:13 +01:00
Kozlowski, Marek 6751d19c19 fix: decanonize pointer to match GPU heap address space
* `zeVirtualMemReserve` `pStart` address may be passed in a canonizated form.

Resolves: NEO-10086

Signed-off-by: Kozlowski, Marek <marek.kozlowski@intel.com>
2024-03-01 12:18:11 +01:00
Dunajski, Bartosz bbe1043f08 feature: initial support for pooling in-order counter allocations
Related-To: NEO-10507

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-01 12:08:02 +01:00
Zbigniew Zdanowicz 4baee71974 refactor: add out cmd argument to semaphore wait encoder
Related-To: NEO-10065

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-01 11:43:36 +01:00
Lukasz Jobczyk 15934ceb43 performance: Fill reusable allocation list on xe and later
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-01 09:41:19 +01:00
Compute-Runtime-Validation 5fce046877 Revert "fix: don't program dummy blit prior to MI_FLUSH_DW without postsync"
This reverts commit d796fb559d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-01 01:43:35 +01:00
Lukasz Jobczyk e5db84f370 performance: Use GEMCreateExt when allocate by KMD
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-29 18:01:55 +01:00
Lukasz Jobczyk 409e19a832 performance: Enable cmd buffer preallocation per CmdQ on xe and later
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-29 17:41:58 +01:00
Dunajski, Bartosz 00964275fb fix: create preemption allocation for Root Csr
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-29 16:46:24 +01:00
Mateusz Jablonski d796fb559d fix: don't program dummy blit prior to MI_FLUSH_DW without postsync
add missing dummy blits before MI_FLUSH_DW with postsync

Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-29 13:30:09 +01:00
Fabian Zwolinski a27dafd2e1 fix: cl_cache store binary and debug data in one file
Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-02-29 10:13:15 +01:00
Lukasz Jobczyk 676644bc50 performance: Enable internal heap preallocation on xe and later
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-28 17:58:52 +01:00
Lukasz Jobczyk 0fcc67fba9 refactor: Change scope of product helper test
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-28 15:52:25 +01:00
Lukasz Jobczyk 39a44628a3 performance: Enable timestamp wait for queues on xe and later
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-28 14:26:59 +01:00
Lukasz Jobczyk d1dd34d0c7 performance: Enable timestamp wait for events on xe and later
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-28 14:18:14 +01:00
Michal Mrozek 64232ec370 fix: choose proper csr for low priority immediate command lists
Resolves: NEO-10168

Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-02-28 12:45:02 +01:00
Artur Harasimiuk c3d884e548 test: improve test timeout configuration
Allow custom values for different test types, i.e. ULT, AUB, etc. can
have custom timeout by using e.g.: NEO_{ULT,AUB,...}_ITERATION_MAX_TIME.
Old behavior with reading NEO_ULT_ITERATION_MAX_TIME for each test type
is preserved but new envirnoment variable has precedence.

Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-02-28 09:05:29 +01:00
Brandon Yates fa4b737326 feature: Implement metadata attaching for vm_bind in xe
Related-to: NEO-9674

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-02-28 01:36:20 +01:00
Dunajski, Bartosz 8483a922a7 feature: simplify in-order allocation overflow
Related-To: NEO-10507

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-27 18:25:26 +01:00
Mateusz Jablonski b6b53daa3f fix: don't program dummy blit prior to MI ARB CHECK
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-27 15:46:46 +01:00
Mateusz Jablonski 9f01a831a2 fix: correct command for dummy blit on PVC
use MEM_SET command

Related-To: NEO-9996

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-27 14:27:47 +01:00
Zbigniew Zdanowicz 0bf8e8727e refactor: add output buffer argument to store register to memory encoder
Related-To: NEO-10064

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-02-27 13:31:55 +01:00
Mateusz Jablonski 9cbc33e23d fix: setup correct render/display core family for gmmlib initialization
pass values got from adapter info

Related-To: NEO-10482
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-26 14:49:04 +01:00
Lukasz Jobczyk cfd3edfb2c fix: Align IOH entry
Related-To: NEO-10036

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-26 14:36:31 +01:00
Mateusz Jablonski 459da6a482 refactor: cleanup xe hpg logic related to dummy blit wa
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-26 13:32:18 +01:00
Mateusz Jablonski 3e65e7bdba fix: correct number of max work group count for concurrent kernel on PVC
for single-CCS mode use all EUs

Related-To: NEO-8377
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-26 10:42:50 +01:00
Dunajski, Bartosz 6cdd2d5dca fix: add missing gt_id when creating XE context
Related-To: GSD-8046

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-23 16:50:32 +01:00
Zbigniew Zdanowicz 1995e6aa40 feature: store cpu pointer of command buffer pointing to post sync command
Related-To: NEO-10064

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-02-23 13:12:15 +01:00
Michal Mrozek ef2d3b538b performance: move timestampPacketTagBuffer to local memory
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
Resolves: NEO-7765
2024-02-23 09:55:48 +01:00
Brandon Yates 0fa730e524 build: Update debugger uapi headers to latest
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-02-22 16:07:06 +01:00
Lukasz Jobczyk b44729771c fix: Extend MTL's overridePatIndex to XE LPG
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-21 13:21:30 +01:00
Maciej Plewka 964f0166d7 fix: Reuse freed chunk from left even if not aligned
Related-To: NEO-10416, NEO-10418, NEO-10437

Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-02-21 12:31:54 +01:00
Zbigniew Zdanowicz db9d0c9a58 fix: remove heap resource leak after command list reset
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-02-20 14:38:19 +01:00
Dunajski, Bartosz f31c88d05f fix: size estimation for tlb flush before copy
Related-To: HSD-18036669673

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-19 13:35:35 +01:00
Dominik Dabek 07639401c5 performance: enable pat index, mtl linux
Enable programming pat indexes on mtl linux by default.

Related-To: NEO-7896

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-16 18:31:21 +01:00
Dominik Dabek 0120d8a58d performance: program pat index on mtl linux
Enable programming pat indexes on mtl linux for device buffers.

Change DrmMemoryManager::allocateMemoryByKMD to use gemCreateExt.

Set mmap flags based on coherency.
Map as write back on legacy and coherent.
On non-coherent map as write combined.

Changes currently disabled, to enable use debug keys:
DisableGemCreateExtSetPat=0
UseGemCreateExtInAllocateMemoryByKMD=1

Reorder BufferObject to decrease padding.

Related-To: NEO-7896

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-16 17:33:07 +01:00
Lukasz Jobczyk cc1732c930 performance: make resident before lock
Resolves: NEO-10369

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-16 12:00:38 +01:00
Compute-Runtime-Validation 4d159bd06c Revert "performance: make resident before lock"
This reverts commit 7e7fac01e3.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-16 09:23:49 +01:00
Dominik Dabek ed011de03e performance: program pat index on mtl linux
Enable programming pat indexes on mtl linux for device buffers.

Change DrmMemoryManager::allocateMemoryByKMD to use gemCreateExt.

Changes currently disabled, can be enabled with flag
DisableGemCreateExtSetPat=0

Related-To: NEO-7896

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-15 17:15:28 +01:00
Lukasz Jobczyk 7e7fac01e3 performance: make resident before lock
Resolves: NEO-10369

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-15 11:56:36 +01:00
Michal Mrozek 25b3a3dbfa fix: user proper settings for caches.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-02-15 11:38:34 +01:00
Compute-Runtime-Validation 9570d02134 Revert "fix: correctly report support for SPIR-V 1.0 through 1.3"
This reverts commit 5120ec2f93.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-15 04:00:31 +01:00
Compute-Runtime-Validation 7b340775c6 Revert "performance: program pat index on mtl linux"
This reverts commit 8e0b23db84.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-15 02:06:03 +01:00
Katarzyna Cencelewska 3d51832498 test: add preemption field to mock objects
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-02-14 21:49:18 +01:00
Dunajski, Bartosz 88c5872682 feature: debug flag to flush tlb before copy
Related-To: HSD-18036669673

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-14 20:05:57 +01:00
Dominik Dabek 8e0b23db84 performance: program pat index on mtl linux
Enable programming pat indexes on mtl linux for device buffers.

Change DrmMemoryManager::allocateMemoryByKMD to use gemCreateExt.

Related-To: NEO-7896

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-14 18:42:04 +01:00
Kacper Nowak cccd9a3703 refactor: Add method for retrieving extra device info
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-02-14 16:38:15 +01:00
Ben Ashbaugh 5120ec2f93 fix: correctly report support for SPIR-V 1.0 through 1.3
Related-To: NEO-10336

Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2024-02-14 09:21:06 +01:00
Maciej Plewka 233f898b59 fix: split freed chunk from right
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>

Related-To: NEO-10280
2024-02-13 16:14:39 +01:00
Jitendra Sharma ff604bda1b feature: Implement thread control for debugger in XE
Related-To: NEO-9668
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-02-12 19:46:28 +01:00
Mateusz Jablonski bb5f6d9660 fix: don't query vm bind support on i915 prelim for pre-Xe platforms
Related-To: HSD-18036843571
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-09 12:50:48 +01:00
Brandon Yates ea7ae7564f feature: Implement read/writeGpuMemory for Xe debugger
- makes prelim read/writeGpuMemory generic
- Implements Xe specific ioctls and fsyncs
- Refactors dbg IoctlHelper to use shared base class
for Xe and i915

Related-to: NEO-9668

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-02-08 22:09:59 +01:00
Kamil Kopryk 59f661356c feature: use heapless builtins in OCL if supported
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-08 15:49:58 +01:00
Lu, Wenbin 5e562ae7b0 fix: store the correct pagesize in SvmAllocationData
Also use the same alignment for both CPU & GPU in shared USM

Related-To: GSD-7103, NEO-9812

Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-02-08 10:10:22 +01:00
Yoon, Young Jin 97ef964bc4 feature: Add keys to override sync mode for immediate command list
Added OverrideImmediateCmdListSynchronousMode to override synchronous
mode for immediate command list

Related-To: NEO-10316
Signed-off-by: Yoon, Young Jin <young.jin.yoon@intel.com>
2024-02-08 08:35:32 +01:00
Compute-Runtime-Validation 0b64240deb Revert "feature: enable dummy blit WA for PVC"
This reverts commit fb9d225495.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-08 04:27:00 +01:00
Lukasz Jobczyk 486cc71b76 refactor: Add GDI profiling
Resolves: NEO-9236
Related-To: NEO-10036

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-07 18:44:11 +01:00
Maciej Plewka ce17580b28 fix: Use Rcs engine in blender on DG2
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-02-07 18:21:54 +01:00
Dominik Dabek 371788210d performance: limit usm host allocation recycle
Query system total memory size and limit usm host allocation recycle to
use at most x%.
x is read from ExperimentalEnableDeviceAllocationCache for device and
ExperimentalEnableHostAllocationCache for host.

Related-To: GSD-7497

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-07 17:45:41 +01:00
Lukasz Jobczyk 5717a726ff fix: Allow free of zero sized allocation
Related-To: NEO-9236

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-07 17:04:11 +01:00
Dunajski, Bartosz 923707a509 feature: initial support for local dispatch size query
Related-To: NEO-8070

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-07 14:47:05 +01:00
Kacper Nowak 999ec9b2ca refactor: Unify logic for getting atomic FP caps 1/n
- Separate logic for fp16/32/46 caps.
- Add aggregated constexprs for local & global caps of given type
- Pass arguments by reference
- Add hwInfo as argument for future refactors
- Add static_asserts in L0 to ensure there is no mismatch between
internal/external caps
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-02-07 11:39:36 +01:00
Compute-Runtime-Validation f6ca565323 Revert "feature: enable use of waitpkg intrinsic"
This reverts commit 2d15517fbe.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-06 16:30:26 +01:00
Dominik Dabek b4a2e243dc performance: accept AOT kernels for indirects
Use indirects detection mechanism for AOT kernels if the detection
version is at least 2.

Related-To: NEO-7712

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-06 15:59:42 +01:00
Mateusz Jablonski fb9d225495 feature: enable dummy blit WA for PVC
cleanup redundant implementation for DG2

Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-06 10:17:12 +01:00
Compute-Runtime-Validation 9b649c1582 Revert "feature: enable bindless mode in L0 on dg2"
This reverts commit fbb8c5a98d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-06 01:41:03 +01:00
Dunajski, Bartosz f31fafb1e2 refactor: improve debug flag to override bcs mocs
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-05 20:23:54 +01:00
Mateusz Jablonski 8708963c4f test: don't call real cpuid function in ULT
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-05 14:56:26 +01:00
Fabian Zwolinski fbb8c5a98d feature: enable bindless mode in L0 on dg2
Related-To: NEO-7063
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-02-05 09:48:16 +01:00
Kamil Kopryk a4f7dda98f refactor: Add xe print debug key
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-02 16:39:51 +01:00
Katarzyna Cencelewska e6ba9766bd feature: add debug flags to force pat index
for cached recouces: OverridePatIndexForCachedTypes
for uncached resouces: OverridePatIndexForUncachedTypes

Related-To: NEO-10157

Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-02-02 16:11:34 +01:00
Brandon Yates 27c089d60d feature: Register ELF for xe debugger
Related-to:  NEO-9674

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-02-02 09:19:19 +01:00
Jitendra Sharma 00b1f1c5b5 fix: set runalone mode in xe only for render and compute
Runalone mode in XE is supported only for RENDER and COMPUTE.

Related-To: NEO-9139

Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-02-02 09:13:17 +01:00
Dominik Dabek 2cad595a0d performance: debug flag for usm host alloc recycle
set ExperimentalEnableHostAllocationCache=1 to recycle host usm
allocations

Related-To: GSD-7497

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-01 16:47:59 +01:00
Francois Dugast e311ba5597 refactor: Move ownership of engine type to caller of setDefaultEngine
Related-To: GSD-7097

Signed-off-by: Francois Dugast <francois.dugast@intel.com>
2024-02-01 14:20:20 +01:00
Compute-Runtime-Validation e7b7eb06e4 Revert "fix: store the correct pagesize in SvmAllocationData"
This reverts commit a104d9199d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-01 05:00:45 +01:00
Katarzyna Cencelewska 8ca9141019 fix: remove wa with override pat index on xe_lpg
Related-To: NEO-9657
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-01-31 16:30:12 +01:00
Katarzyna Cencelewska d0b009901c fix: use proper gmm resource type for uncache resources
when new coherency model

Resolves: NEO-9657
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-01-31 15:26:11 +01:00
Mateusz Jablonski fa38a74913 test: increase aub tests iteration time on Windows to 1080s
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-31 10:57:35 +01:00
Kamil Kopryk 6d3a53fe7f refactor: Improve scratch programming in heapless mode
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-31 10:39:39 +01:00
Lu, Wenbin a104d9199d fix: store the correct pagesize in SvmAllocationData
Related-To: GSD-7103, NEO-9812

Signed-off-by: Lu, Wenbin <wenbin.lu@intel.com>
2024-01-31 09:12:20 +01:00
Compute-Runtime-Validation fb46066abc Revert "fix: enable cache env variables for level-zero"
This reverts commit 743904d2df.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-31 08:33:05 +01:00
Yoon, Young Jin cbe35d70a5 fix: initialize libnuma only when flag is set
Modified in memory_info.cpp to initialize libnuma only when
EnableHostAllocationMemPolicy is set.

Related-To: NEO-8276
Signed-off-by: Yoon, Young Jin <young.jin.yoon@intel.com>
2024-01-30 18:27:43 +01:00
Maciej Plewka 564e0f0319 performance: Align host mem to 2MB when range is not limited
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>

Related-To: NEO-10217
2024-01-30 14:43:13 +01:00
Francois Dugast 278ced35dc fix: Use capability table to determine engine type for defaultEngine
Related-To: GSD-7097

Signed-off-by: Francois Dugast <francois.dugast@intel.com>
2024-01-30 14:28:09 +01:00
Fabian Zwolinski 743904d2df fix: enable cache env variables for level-zero
Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-01-30 14:01:42 +01:00
Maciej Plewka 620ad5fa89 fix: change freed chunk size when ptr is aligned
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>

Related-To: NEO-9945
2024-01-30 12:22:28 +01:00
Compute-Runtime-Validation fa9c79fb63 Revert "refactor: Add GDI profiling"
This reverts commit 524ae7713a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-30 10:47:34 +01:00
Mateusz Jablonski da16dad344 fix: don't limit vm bind support based on platform
Related-To: GSD-7097
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-29 19:43:15 +01:00
Lukasz Jobczyk 69c9cacd3e feature: Enable ULLS on wddm 2.3
Related-To: NEO-4843

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-29 17:33:28 +01:00
Lukasz Jobczyk 58eaece6c9 fix: Make DirectSubmissionInsertExtraMiMemFenceCommands able to force fence
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-29 16:43:24 +01:00
Fabian Zwolinski c51b656d2c fix: request instruction cache invalidation on module destroy
Invalidation is requested on both linux and windows,
on Csr's that used Isa allocation.

Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-01-29 16:00:34 +01:00
Lukasz Jobczyk 64175d8bb6 feature: Add ULLS support for WDDM 2.3
Resolves: NEO-4843

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-29 15:45:43 +01:00
Lukasz Jobczyk 524ae7713a refactor: Add GDI profiling
Resolves: NEO-9236
Related-To: NEO-10036

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-29 11:36:04 +01:00
Maciej Plewka 7728123907 fix: Do not use 2mb alignment for host ptr allocs
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>

Related-To: NEO-9945
2024-01-29 11:01:00 +01:00
Lukasz Jobczyk 4704cd49a3 fix: Remove fence handling when reuse cmd buffer
Resolves: NEO-10163
Related-To: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-29 09:08:19 +01:00
Compute-Runtime-Validation 63a5b64c7e Revert "fix: Remove fence handling when reuse cmd buffer"
This reverts commit f3bbd70a58.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-27 08:28:57 +01:00
Katarzyna Cencelewska 46fb73026e fix: enable direct submission on release 1274
Related-To: NEO-8187
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-01-26 16:00:03 +01:00
Katarzyna Cencelewska eec01e500a fix: non-coherency issue on arl
Resolves: HSD-15015200338
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-01-26 10:26:33 +01:00
Lukasz Jobczyk f3bbd70a58 fix: Remove fence handling when reuse cmd buffer
Resolves: NEO-10163
Related-To: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-26 10:06:24 +01:00
Compute-Runtime-Validation 7d6845f2aa Revert "performance: Remove tlb flush on DG2 Linux"
This reverts commit 9215f3acb5.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-26 07:13:22 +01:00
Katarzyna Cencelewska 67b0b18be3 fix: osAgnostic path for allocate with alignment
Resolves: NEO-9334
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-01-25 11:03:34 +01:00
Mateusz Jablonski cf4619843d test: correct size for allocating fread buffer
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-24 13:39:23 +01:00
Zbigniew Zdanowicz a25eedb5ac feature: add print of cpu flags and address size upon detection
Related-To: NEO-9737

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-01-24 11:03:30 +01:00
Zbigniew Zdanowicz 2d15517fbe feature: enable use of waitpkg intrinsic
Related-To: NEO-9737

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-01-24 10:46:28 +01:00
Jitendra Sharma 548ecec7f8 feature: Implement debugger open IOCTL
Related-To: NEO-8405

Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-01-24 09:50:39 +01:00
Brandon Yates 76de854a69 feature: Set Debug Attach Available for Xe
Related-to: NEO-8402

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-01-24 09:04:11 +01:00
Mateusz Jablonski dd7083d710 feature(zebin): add support for spill/private size in execution env
add fallback to previous logic based on zeinfo version

Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-23 17:26:43 +01:00
Katarzyna Cencelewska 7bbe57c671 feature: add debug info for logging pat indexes
Related-To: HSD-18031172224
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-01-23 13:34:35 +01:00
Mateusz Jablonski a697a3f718 refactor: create new members for storing spill and private memory in scratch
rename private scratch space into scratch space slot 1 as it can be generic

Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-23 12:42:25 +01:00
Kacper Nowak 6cf71cce81 refactor: Allow for grouping BCS1+ copy engines in more engine group types
Do not group copy engines from BCS1+ in linked copy group by default.

Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-01-23 12:13:09 +01:00
Compute-Runtime-Validation f9f9035b95 Revert "refactor: create new members for storing spill and private memory in ...
This reverts commit 87eb5f554a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-23 09:13:00 +01:00
Compute-Runtime-Validation e949ba7144 Revert "refactor: Add GDI profiling"
This reverts commit 8d56f8fb6b.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-23 06:13:02 +01:00
Mateusz Jablonski 87eb5f554a refactor: create new members for storing spill and private memory in scratch
rename private scratch space into scratch space slot 1 as it can be generic

Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-22 19:48:48 +01:00
Maciej Plewka 09dcc618c1 fix: prefer compression when allocationg by kmd
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-01-22 16:38:17 +01:00
Dunajski, Bartosz b77e1a6a71 feature: improve reporting max cooperative group count
Related-To: NEO-8210

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-22 14:56:15 +01:00
Lukasz Jobczyk 8d56f8fb6b refactor: Add GDI profiling
Resolves: NEO-9236
Related-To: NEO-10036

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-22 14:24:08 +01:00
Dunajski, Bartosz 6f4ed10919 refactor: unify getMaxWorkGroupCount logic
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-22 11:16:45 +01:00
Maciej Bielski 134c718a25 fix: support zeEventPool IPC handles from single process
For all execution paths where needed, use `BufferObjectHandleWrapper`
instances for registration of BO-handles and try to obtain shared
ownership. This allows both sides of IPC communication to be implemented
in the same process and avoid the double-free problem on a BufferObject
of the same handle.

Currently there are two pairs of such calls:
* `zeEventPoolGetIpcHandle()` + `zeEventPoolOpenIpcHandle()`
* `zeMemGetIpcHandle()` + `zeMemOpenIpcHandle()`

The capability of executing both sides from the same process is useful
for testing but not only.

Related-To: NEO-9837
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-01-19 14:49:38 +01:00
Zbigniew Zdanowicz b5f698e0c5 feature: add umonitor and umwait synchronization function
Related-To: NEO-9737

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-01-19 14:20:19 +01:00
Dunajski, Bartosz b266f1f3cc refactor: improve implicit scaling interface
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-19 14:09:04 +01:00
Mateusz Jablonski 028a5ee2fc fix: correct calculating highest enabled dual subslice
when no DSS is exposed then calculate highest enabled subslice instead

Related-To: NEO-9614
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-19 10:48:15 +01:00
Yoon, Young Jin 9c7578f5f4 fix: change numa_library to read default policy
Changed NumaLibrary to read default policy instead of using
the address-based policy, and remove unnecessary argument.

Removed numaif.h from the repo as it is no longer required.

Related-To: NEO-8276
Signed-off-by: Yoon, Young Jin <young.jin.yoon@intel.com>
2024-01-19 08:31:41 +01:00
Mateusz Jablonski a673347378 fix: don't allocate preemption surface for BCS engines
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-19 07:59:24 +01:00
Dunajski, Bartosz 3a5e6c0117 fix: cl_intel_subgroup_split_matrix_multiply_accumulate support on PVC
Related-To: GSD-7696

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-18 19:40:59 +01:00
Brandon Yates f56eeb0eb7 fix: Use debug SIP when debugging is enabled
Disable wMTP when debug is enabled

Related-to: NEO-10085

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-01-18 18:58:16 +01:00
Dunajski, Bartosz 7fbf724c72 refactor: update Implicit Scaling interface
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-18 16:52:48 +01:00
Dominik Dabek e0ccf22557 fix: indirect access in external functions
Read indirect_stateless_count in module external functions.

If greater than 0, mark all kernels that have the has_stack_calls flag
set from this module as having indirect accesses.

Related-To: NEO-7712

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-01-18 15:29:41 +01:00
Kacper Nowak c238f731af fix: Skip mapping copy engines for unsupported ones
Based on the info retrieved from product helper, skip mapping returned
copy engines from KMD to disabled ones.
- Unify logic for EngineInfo ctors (remove redundant code).
Related-To: NEO-8356
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-01-18 09:49:14 +01:00
Lukasz Jobczyk d2eb29660d performance: Refactor cmd buffer reuse for cmd lists
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-17 17:28:17 +01:00
Jitendra Sharma aa191b6f88 feature: Set runalone mode for contexts with online debugging
Related-To: NEO-9139

Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-01-17 09:01:30 +01:00
Warchulski, Jaroslaw f7dc17e6a2 fix: set mmapPtr in every allocation in multiGraphicAllocation
Related-To: GSD-7360
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2024-01-16 15:18:44 +01:00
Naklicki, Mateusz a28a2d305e fix: align NEO code to new uAPI header
PMU is removed from XeKMD as it is likely to change in the future.

Related-To: NEO-8324

Co-authored-by: Francois Dugast <francois.dugast@intel.com>
Co-authored-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
Signed-off-by: Francois Dugast <francois.dugast@intel.com>
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2024-01-16 13:36:32 +01:00
Kamil Kopryk 36dcfc0b42 feature: add stateless heapless builtins
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-16 11:11:31 +01:00
Mateusz Jablonski 556645e0c5 fix: add unrecoverable to avoid OOB access
Related-To: NEO-9860
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-16 00:12:11 +01:00
Lukasz Jobczyk 9215f3acb5 performance: Remove tlb flush on DG2 Linux
Resolves: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-15 23:20:16 +01:00
Mateusz Jablonski 7b40b01f54 feature: add debug key for toggling bit in 57bit GPU VA for specific allocations
Related-To: NEO-9419
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-15 19:37:00 +01:00
Mateusz Jablonski 2b25cbaa41 test: remove not needed printf
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-15 19:20:45 +01:00
Kamil Kopryk ef8921247c refactor: improve constexpr array usage
Gcc with o2 flag will allocate constexpr array on stack.
Static constexpr std::array will be initialized at compile time.

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-15 17:00:50 +01:00
Dunajski, Bartosz dfc69c746a feature: create InOrderExecInfo for standalone CB Event
Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-15 16:56:52 +01:00
Maciej Plewka b348eaf29f fix: use required walk order
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>

Related-To: NEO-10014
2024-01-15 13:44:56 +01:00
Dominik Dabek 997bdfa010 performance: add windows thread priority debug key
Set windows thread priority to "above normal" on wddm init if flag is
set.

Related-To: NEO-8215

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-01-15 08:14:46 +01:00
Compute-Runtime-Validation 9b0c64abef Revert "performance: Enable ULLS on WSL"
This reverts commit 42c2b6b0e2.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-15 06:08:44 +01:00
Dominik Dabek b3b402cad2 feature(zebin): read indirect detection version
Read indirect detection version intel compat notes from zebin.

This is to prepare for enabling indirect access optimization in AOT
built kernels.

Related-To: NEO-7712

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-01-12 14:20:52 +01:00
Mateusz Hoppe a972cdaa6b fix: initialize device caps before creating engines
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-01-12 14:09:30 +01:00
Fabian Zwolinski a3bd629aff fix: store image implicit args in bindless slot 1
bindless slot 0: image
bindless slot 1: image implicit args
bindless slot 2: redescribed image

Redescribed image moved one slot higher.
Separate allocation is created for new bindless slot.

Related-To: NEO-9740
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-01-12 13:09:06 +01:00
Lukasz Jobczyk 42c2b6b0e2 performance: Enable ULLS on WSL
Related-To: NEO-10037

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-12 12:03:53 +01:00
Fabian Zwolinski 903e581b5f fix: add support for bindless implicit args
Support for:
global_base and const_base in bindless addressing mode.

Related-To: NEO-9855
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-01-12 01:27:17 +01:00
Lukasz Jobczyk 2bda9f0b58 fix: Release usage when put cmd buffer to reusable list
Resolves: NEO-10004
Related-To: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-11 18:40:38 +01:00
Brandon Yates ba0db2488a refactor: Implement Xe Resoure Registration (2/x)
Refactor drm_debug.cpp into IoctlHelper

Related-to: NEO-9161

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-01-11 08:26:29 +01:00
Brandon Yates d3baab00db feature: Register resources for Xe at vm creation (1/x)
Add interfaces to IoctlHelperXe for registering resources.
Introduce new debug key EnableXeResourceRegistration to attach
resources to vm_create ioctl.

Related-to: NEO-9161
Signed-off-by: Brandon Yatse <brandon.yates@intel.com>
2024-01-11 08:15:50 +01:00
Krzysztof Gibala d18aa30352 test: use same pattern for zebin attribute decoding
Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com>
2024-01-10 13:17:22 +01:00
Zbigniew Zdanowicz eec553ea6a fix: set ulls ring start state according to returned value from kmd call
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-01-10 12:57:34 +01:00
Mateusz Hoppe 31edeb0765 feature: set HP flag when creating HardwareContextController
- for highPriority context

Related-To: NEO-7824

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-01-10 12:46:29 +01:00
Katarzyna Cencelewska 5d2d3ed899 refactor: modify sip kernel helpers
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-01-09 17:09:10 +01:00
Compute-Runtime-Validation 8df4dab1a3 Revert "performance: Remove tlb flush on DG2 Linux"
This reverts commit 44a162053b.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-09 16:53:00 +01:00
Compute-Runtime-Validation 46d860b2bf Revert "feature: Register resources for Xe at vm creation (1/x)"
This reverts commit e1fefd1974.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-09 11:36:00 +01:00
Mateusz Jablonski a73fb4d2fe fix: correct reporing kernel private size on L0
unify the logic across APIs

Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-09 09:54:05 +01:00
Young Jin Yoon 237a7f5494 fix: fix clang-tidy errors on some numa tests
Modified variables to address clang-tidy issues and MSVC
compilation issues.

Related-To: NEO-8276
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-01-08 21:36:39 +01:00
Brandon Yates e1fefd1974 feature: Register resources for Xe at vm creation (1/x)
Add interfaces to IoctlHelperXe for registering resources.
Introduce new debug key EnableXeResourceRegistration to attach
resources to vm_create ioctl.

Related-to: NEO-9161

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-01-08 13:36:24 +01:00
Lukasz Jobczyk 44a162053b performance: Remove tlb flush on DG2 Linux
Resolves: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-08 12:39:19 +01:00
Lukasz Jobczyk 7b8abd8fff fix: pass GMM to Drm::getPatIndex
Related-To: NEO-9543

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-08 09:14:14 +01:00
Dominik Dabek 9b52d52062 performance(ocl): enable usm pool allocator
Enable on xe hpg and lpg platforms

Related-To: NEO-9700

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-01-05 16:17:48 +01:00
Dominik Dabek af1620a308 fix(ocl): allocation info from pool svm ptr
Fix querying allocation info from pooled svm ptr.
Handle requested allocation alignment.
Refactor sorted vector usage.
Do not associate device with host pool allocation.

Related-To: NEO-9700

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-01-05 15:20:01 +01:00
Kamil Kopryk 5432b57fd0 refactor: improve readability
Added a time unit to the variable name to make it clear.
Removed incorrect and not needed check.

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-05 14:53:52 +01:00
Lukasz Jobczyk add394722c fix: Create GMM in DrmMemoryManager::createAllocWithAlignment
Resolves: NEO-9739
Related-To: NEO-9543

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-05 13:17:11 +01:00
Young Jin Yoon 4ccae1dbb4 feature: support memory policy for GEM_CREATE
Modified ioctl_helper_prelim to support the extension of gem_create_ext,
i.e. prelim_drm_i915_gem_create_ext_mempolicy.

Added two debug variables to be used for the mempolicy extension.

Modified functions in memory_info and drm_memory_manager to support extension

Added numaif.h from https://github.com/numactl/numactl/tree/master,
v2.0.14

Related-To: NEO-8276
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-01-04 23:49:10 +01:00
Maciej Plewka 3970f1bc4c fix: create hwQueue when reinitialize osContext
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>

Related-To: NEO-9877
2024-01-04 12:46:36 +01:00
Lukasz Jobczyk d472cf0a5d fix: Restore handle fence completion when reusing cmd buffer
Related-To: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-04 12:11:39 +01:00
Dunajski, Bartosz 342bfcd17c feature: disable dpas on release 12.61
Related-To: NEO-9736

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-03 13:27:41 +01:00
Compute-Runtime-Validation c3b6ca8011 Revert "fix: Restore handle fence completion when reusing cmd buffer"
This reverts commit 5d90818cce.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-12-30 10:39:16 +01:00
Dunajski, Bartosz 87857a6558 feature: add new PVC device id
Related-To: NEO-9736

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-29 19:06:33 +01:00
Lukasz Jobczyk edd6fd5e6c performance: Optimize ULLS start on submit path
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-12-29 15:21:49 +01:00
Lukasz Jobczyk 5d90818cce fix: Restore handle fence completion when reusing cmd buffer
Related-To: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-12-29 11:38:36 +01:00