Commit Graph

4884 Commits

Author SHA1 Message Date
Mateusz Jablonski 552c129c0b refactor: correct variable naming
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-04 15:14:02 +01:00
Dominik Dabek 6cf6a8def8 performance: add power throttling debug key
Set windows process power throttling hint to HIGH on wddm init

Related-To: NEO-8215

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-12-04 15:03:33 +01:00
Mateusz Jablonski c3d3a4db1f refactor: correct variable naming
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-04 13:45:53 +01:00
John Falkowski 911acd81a2 feature: add SetBOChunkingSize debug variable
Resolves: NEO-9562

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2023-12-04 06:21:01 +01:00
Compute-Runtime-Validation 9add9f12dc Revert "feature: add SetBOChunkingSize debug variable"
This reverts commit e1df8f9112.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-12-03 03:28:16 +01:00
Mateusz Hoppe 8056476d8d fix: program binding table for bindless kernels
- if binding table entries are used in bindless kernel, program Binding
table

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-12-01 19:54:10 +01:00
Michal Mrozek a129c29b40 performance: do not enable v2 algorithm in certain scenarios
- do not enable when SubSliceCount != MaxSubSlicesSupported

Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
Related-To: NEO-6989
2023-12-01 19:40:11 +01:00
Kamil Kopryk aa950a4a96 feature: Add heapless mode programming in l0 1/n
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-12-01 17:01:01 +01:00
Dunajski, Bartosz 2c921ec940 feature: support to use mi_atomic for signalling in-order counter
Related-To: NEO-7966

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-01 15:35:12 +01:00
Zbigniew Zdanowicz 4f9fe6bb53 fix: detect gpu reset on windows direct submission
Related-To: NEO-8395

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-12-01 14:13:14 +01:00
Mateusz Jablonski cdcd834712 refactor: correct naming of builtin types
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 14:08:21 +01:00
Mateusz Jablonski e57d372608 refactor: correct naming of enum class constants
EngineGroupType
BlitDirection
PostBlitMode
WaitStatus

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 11:53:51 +01:00
John Falkowski e1df8f9112 feature: add SetBOChunkingSize debug variable
Resolves: NEO-9562

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2023-12-01 09:39:47 +01:00
Mateusz Jablonski b92b5892b8 refactor: correct variable naming
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 08:34:19 +01:00
Kacper Nowak 4a0064033a fix: Correct AIL initialization in runtime
- Initialize AIL helper in runtime (linux/windows/AUB path).
- Return false if AIL configuration initialization is called with empty
AIL helper (is nullptr).
- Skip mentioned condition if AIL is disabled via EnableAIL debug key.
Related-To: NEO-9240
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-12-01 02:42:10 +01:00
Mateusz Jablonski c3ac7b78bd refactor: correct variable naming
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 02:18:46 +01:00
Mateusz Jablonski c9664e6bad refactor: rename global debug manager to debugManager
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-30 13:00:59 +01:00
Mateusz Jablonski 36194c4e7d refactor: correct variable namings
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-29 23:49:03 +01:00
Dunajski, Bartosz 5772b17924 refactor: Add debug flag to check Device State on failed Wddm submission
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-11-29 18:44:25 +01:00
Fabian Zwolinski 06ec4c3a82 fix: aub residency in HW+AUB mode
Related-To: NEO-9230
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-11-29 11:54:59 +01:00
Zbigniew Zdanowicz fdcf7f6793 performance: enable new dispatch monitor fence policy in direct submission
Related-To: NEO-8395

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-11-28 17:21:46 +01:00
Compute-Runtime-Validation 1395a2cb38 Revert "refactor: linux cl_cache eviction mechanism"
This reverts commit a02ac1c140.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-11-28 08:51:36 +01:00
Naklicki, Mateusz dc29c08abd fix: align NEO code to new uAPI header
Related-To: NEO-9566
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2023-11-27 19:50:23 +01:00
Fabian Zwolinski a02ac1c140 refactor: linux cl_cache eviction mechanism
Refactored eviction mechanism works as follows:
- eviction is needed only if
total size of cache binaries + size of the new binary exceed cache limit
- single evition call removes files with a summed size of 1/3 of the cache limit
- if new binary can not fit in the cache size limit
even after eviction, it will not be saved
- cache limit applies only to
files in cache directory with .cl_cache/.l0_cache extension.
Only these files are counted and only these files are removed

Minor:
- rename variables for better readability
- add `const` where possible

Related-To: NEO-4262
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-11-27 18:20:41 +01:00
Maciej Plewka 8c8bb23356 fix: update completion fence for previous ring during switch
Related-To: NEO-9330

Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-11-27 12:30:01 +01:00
Mateusz Hoppe 83ac95d293 fix: L0 - remove synchronization with events on appends in tbx mode
Related-To: NEO-9400

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-11-27 10:39:55 +01:00
Compute-Runtime-Validation c4328d223a Revert "feature: enable bindless mode in L0 on dg2"
This reverts commit e23aebf3d7.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-11-25 01:40:26 +01:00
Kamil Kopryk 982c2d665b feature: add programming heapless mode in SBA
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-11-24 15:29:19 +01:00
Mateusz Jablonski 220b1e10cf refactor: remove copy/move ctors when class has user-defined dtor
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-24 13:13:07 +01:00
Kamil Kopryk ce7298d512 feature: Add heapless mode programming in ocl
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-11-24 12:53:39 +01:00
Maciej Plewka 1a364f5c98 fix: add synchronization for updating tag and updating residency in GA
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-11-24 12:12:13 +01:00
Francois Dugast f465cf5f27 feature: update Xe KMD header
xe_drm.h header is generated from the series
"uAPI Alignment - Remove unused features"
from https://patchwork.freedesktop.org/series/126278/

This aligns with Xe KMD commit ("drm/xe/pmu: Drop interrupt pmu event").

Related-To: NEO-8324

Signed-off-by: Francois Dugast <francois.dugast@intel.com>
2023-11-24 11:40:42 +01:00
Mateusz Hoppe e23aebf3d7 feature: enable bindless mode in L0 on dg2
Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-11-24 09:35:25 +01:00
Dunajski, Bartosz aba1cd8f9c feature: improve waiting and signaling Events via KMD calls
Related-To: NEO-8179

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-11-23 14:53:27 +01:00
Mateusz Hoppe 7ffd151ac3 fix: adjust numArgsStateful based on binding table entries
- global and const buffer may have BTI index allocated, ssh template
must be allocated with size for all stateful args

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-11-23 12:15:39 +01:00
Lukasz Jobczyk ac8c00048e performance: optimize svm allocation tracking
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-11-23 10:54:01 +01:00
Baj, Tomasz c49a9b9787 refactor: remove ReturnSubDevicesAsApiDevices from shared code
Related-To: NEO-9437

Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-11-22 15:13:29 +01:00
Dunajski, Bartosz 12b4a9f71d fix: make Tag allocation type OneTimeWritable
Related-To: HSD-18035178611

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-11-22 11:24:44 +01:00
Mateusz Jablonski f63dd1f4f2 fix: don't fail entire initialization when at least one device is compatible
Related-To: NEO-6683

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-21 17:00:42 +01:00
Kacper Nowak 1b932bf119 fix: allow legacy device binary validation logic for Blender on DG2 and MTL
Temporarily opt-out from additional compatibility checks
on DG2 and MTL for Blender and its derivatives AOT-compiled kernels.
This prevents a long kernel recompilation.

Additionally, same behavior can be enforced for other applications
manually via NEO debug key named DoNotUseProductConfigForValidationWa.

Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
Related-To: NEO-9240
2023-11-21 16:05:17 +01:00
Mateusz Jablonski 8dd80efbb1 refactor: move getting thread per eu configs to release helper
Related-To: HSD-18034098647
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-21 09:44:32 +01:00
Baj, Tomasz 97842d8eab fix: program PIPE_CONTROl with RT Flush and CS Stall prior to PIPE_SELECT
Related-To: NEO-9194

Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-11-20 16:43:59 +01:00
Lukasz Jobczyk 128dc4a3ad Revert "performance: disable gem close worker on non integrated devices"
This reverts commit 3cf7cbc9e9.


Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-11-20 14:49:04 +01:00
Mateusz Hoppe 3034d5834d fix: do not count samplers in numArgsStateful
- stateful samplers do not require SSH slots in bindless mode
- fixes abort when sampler used in bindless mode

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-11-20 13:10:35 +01:00
Mateusz Jablonski 35c1f34672 refactor: move number of threads per eu to release helper
Related-To: HSD-18034098647
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-20 12:16:33 +01:00
Compute-Runtime-Validation a8d404f859 Revert "refactor: linux cl_cache eviction mechanism"
This reverts commit 2aa0ae0eca.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-11-18 01:13:12 +01:00
Maciej Plewka 560b588cbe fix: Update residency fence value after ring buffer switch
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-11-17 20:46:14 +01:00
Lukasz Jobczyk 3cf7cbc9e9 performance: disable gem close worker on non integrated devices
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-11-17 15:24:03 +01:00
Kacper Nowak c504b497d7 refactor: Store AIL in root device environment
Instead of storing AIL configurations in global table, store it
in root device environment.
This also prevents potential scenario with accessing deleted memory due
to symbol collision when application uses both OCL/L0 libraries.
- AIL is now stored in root device environment, and gets initialized
with other helpers
- Minor: corrected naming in ULTs

Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
Related-To: NEO-9240
2023-11-17 15:18:27 +01:00
Dominik Dabek 6562828095 performance: prealloc internal heap on mtl
Preallocate 1 internal heap allocation per csr on mtl

Related-To: NEO-8152

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-11-17 13:36:21 +01:00
Fabian Zwolinski 2aa0ae0eca refactor: linux cl_cache eviction mechanism
Refactored eviction mechanism works as follows:
- eviction is needed only if
total size of cache binaries + size of the new binary exceed cache limit
- single evition call removes files with a summed size of 1/3 of the cache limit
- if new binary can not fit in the cache size limit
even after eviction, it will not be saved
- cache limit applies only to
files in cache directory with .cl_cache/.l0_cache extension.
Only these files are counted and only these files are removed

Minor:
- rename variables for better readability
- add `const` where possible

Related-To: NEO-4262
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-11-17 09:48:42 +01:00
Compute-Runtime-Validation 27000c06cb Revert "refactor: Store AIL in root device environment"
This reverts commit 02ef252b7d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-11-16 19:37:42 +01:00
Compute-Runtime-Validation c68f55e4f6 Revert "feature: enable bindless mode in L0 on dg2"
This reverts commit d1d4f9c9f3.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-11-16 19:13:05 +01:00
Mateusz Hoppe d1d4f9c9f3 feature: enable bindless mode in L0 on dg2
Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-11-16 17:19:46 +01:00
Fabian Zwolinski f91757c093 fix: enable 4KB allocation alignment through KMD on Windows
Related-To: NEO-9446
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-11-16 13:04:23 +01:00
Dunajski, Bartosz 2981c78d34 fix: check RCS support in OS agnostic path
Related-To: HSD-18035019656

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-11-16 12:59:42 +01:00
Mateusz Jablonski 140532f8b6 fix: fix issues with clang-tidy on Windows
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-16 12:30:11 +01:00
Compute-Runtime-Validation 7f61217a44 Revert "performance: optimize svm allocation tracking"
This reverts commit e91ce78ec8.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-11-16 11:03:19 +01:00
Dominik Dabek 8f06f3f50a performance: add override cacheable to gmm
add attribute to override cacheable attribute to gmm constructor

enable this override for command buffers on mtl

change command buffers back to allocation by kmd

this keeps the quicker allocation which is needed to keep enqueue times
low

Related-To: NEO-8152

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-11-16 10:00:45 +01:00
Lukasz Jobczyk e91ce78ec8 performance: optimize svm allocation tracking
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-11-15 13:58:05 +01:00
Mateusz Jablonski a0f8e8accc fix: initialize max capability values for ARL
setup max values when overriding hw info with debug key

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-15 13:44:11 +01:00
Kacper Nowak 02ef252b7d refactor: Store AIL in root device environment
Instead of storing AIL configurations in global table, store it
in root device environment.
This also prevents potential scenario with accessing deleted memory due
to symbol collision when application uses both OCL/L0 libraries.
- AIL is now stored in root device environment, and gets initialized
with other helpers
- Minor: corrected naming in ULTs

Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
Related-To: NEO-9240
2023-11-15 13:17:56 +01:00
Dominik Dabek 961a8d91d0 refactor: move gmm constructor flags to struct
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-11-15 09:26:13 +01:00
Mateusz Jablonski c454e0d33e fix: handle not exisiting wsl compute helper library
https://github.com/intel/compute-runtime/issues/688
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-14 16:06:14 +01:00
Lukasz Jobczyk 9a8138725a fix: Deferred SVM allocations look up by gpu address
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-11-14 15:27:01 +01:00
Filip Hazubski fee423fa15 fix: Add PVC device id 0x0B6E for Intel(R) Data Center GPU Max 1100C
Resolves: NEO-9440

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2023-11-14 08:03:51 +01:00
Mateusz Jablonski cec6492cd9 fix: fix issues with ubsan in shared and ocl
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-13 12:40:57 +01:00
Compute-Runtime-Validation b3a7f5dccb Revert "fix: support alignments in WSL2 device and shared memory allocation"
This reverts commit ae86635307.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-11-11 16:25:52 +01:00
Lu, Wenbin ae86635307 fix: support alignments in WSL2 device and shared memory allocation
Related-To: NEO-8431

Signed-off-by: Lu, Wenbin <wenbin.lu@intel.com>
2023-11-09 19:47:11 +01:00
Katarzyna Cencelewska fbe61cf547 fix: add kmd wait on cpu when wait on paging fence when needed
add storing of throttle hint value for direct submission

Related-To: NEO-9123
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2023-11-09 19:22:15 +01:00
Zbigniew Zdanowicz 506e03e00c refactor: reposition ail application map
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-11-09 16:59:01 +01:00
Dunajski, Bartosz 30777d4d4c feature: use indirect semaphore for 64b values
Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-11-09 16:58:45 +01:00
Konstanty Misiak c160e6ff93 fix: sporadic in software tags test
Related-To: NEO-9272

Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>
2023-11-09 15:50:34 +01:00
Mateusz Jablonski 781533565e fix: fix issues with ubsan in shared and OCL tests
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-09 12:39:29 +01:00
Katarzyna Cencelewska 3ad68cb7d6 fix: setting flush stamp to not have zero value
when direct submission and monitor fence is not required then use
current fence value to update flush stamp

Related-To: NEO-9123
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2023-11-09 08:34:21 +01:00
Dominik Dabek 7a6fc209dd performance: prealloc cmdbuffer on mtl
Preallocate 2 command buffers allocations per command queue initialized
on MTL.

Related-To: NEO-8152

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-11-08 17:21:13 +01:00
Michal Mrozek c558fc1b06 performance: enable v2 tg dispatch size algorithm
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
Related-To: NEO-6989
2023-11-08 16:39:49 +01:00
Dunajski, Bartosz a0beb96db8 feature: initial support for implicit convertion to CounterBased Events
Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-11-08 14:59:12 +01:00
Katarzyna Cencelewska 06e2fdb2e9 fix: remove copy paste from WddmDirectSubmission methods
instead of use updateTagValueImpl to avoid missing parts

Related-To: NEO-9123
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2023-11-08 08:45:10 +01:00
Compute-Runtime-Validation cdf2cd5699 Revert "feature: enable illegal opcode exception"
This reverts commit 02b6b3bbaa.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-11-08 01:40:23 +01:00
Mateusz Hoppe f36e86f121 fix: program bindless surface state size from sba properties
- surfaceStateSize is in pages, bindless size needs to be programmed in
surface state units

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-11-07 17:45:54 +01:00
Mateusz Hoppe 1c37da280c fix: fix bindless offset patching for images
- usingSurfaceStateHeap indicates if any of the args is using local ssh
in bindless kernels:

without global allocator - ssh is used for all args
with global bindless allocator - ssh used only for buffer with offset
set in surface state, otherwise not used

When any of the args is using ssh - getSurfaceStateHeapDataSize() returns
non-zero size.

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-11-07 11:39:49 +01:00
John Falkowski f0175b3916 feature: set device allocation chunking as default
Device allocation chunking only applies for multi-tile mode for implicit scaling

Related-To: NEO-9051

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2023-11-07 10:58:17 +01:00
Young Jin Yoon 3782b4bd0e feature: add functions to get/set VA space
Added an extension version of zeContextGetVirtualAddressSpaceIpcHandle
and zeContextPutVirtualAddressSpaceIpcHandle, which return
ZE_RESULT_ERROR_UNSUPPORTED_FEATURE for now.

Related-To: NEO-8259

Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2023-11-06 20:08:45 +01:00
Mateusz Hoppe 02b6b3bbaa feature: enable illegal opcode exception
Related-To: NEO-9088

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-11-06 16:09:29 +01:00
Zbigniew Zdanowicz 20c3f45998 refactor: add ulls diagnostic flag to select monitor fence input
Related-To: NEO-8395

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-11-06 15:09:36 +01:00
Zbigniew Zdanowicz e0ce08bb77 fix: detect gpu hang or page fault at direct submission flush to gpu
Related-To: NEO-8395

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-11-06 14:22:02 +01:00
Michal Mrozek ed897c302d performance: Implement V2 version of tg dispatch size algorithm.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
Related-To: NEO-6989

-Prevent imbalance in multi dimensional dispatches
-Make sure to utilize as much Eus as possible
-Prefer highest possible tg dspatch count possible
-Make sure that xe_core doesn't have uneven workgroups
2023-11-03 15:54:04 +01:00
Mateusz Jablonski 098ea3a997 fix: add log about printing assertion failure from kernel
Related-To: NEO-9237
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-03 15:32:07 +01:00
Artur Harasimiuk 6510c725e3 build: build aub_stream_stubs even if AubStream is available
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2023-11-03 14:50:35 +01:00
Zbigniew Zdanowicz 19586277ca refactor: add debug flag to control delay after waiting for paging fence on cpu
Related-To: NEO-8395

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-11-03 12:49:39 +01:00
Kamil Kopryk ae607502a0 feature: Add indirect data and scratch pointer to zeinfo
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-11-03 12:01:58 +01:00
Kamil Kopryk 73bb139e17 refactor: add applyExtraInternalOptions function
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-11-03 09:43:37 +01:00
Compute-Runtime-Validation bcc5a8502b Revert "feature: add functions to get/set VA space"
This reverts commit 0550c80d4f.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-11-03 02:06:36 +01:00
Young Jin Yoon 0550c80d4f feature: add functions to get/set VA space
Added an extension version of zeContextGetVirtualAddressSpaceIpcHandle
and zeContextPutVirtualAddressSpaceIpcHandle, which return
ZE_RESULT_ERROR_UNSUPPORTED_FEATURE for now.

Related-To: NEO-8259

Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2023-11-02 18:42:27 +01:00
Dominik Dabek 43841fd2ef fix: clear AllocationsList tail on free all
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-11-02 18:06:13 +01:00
Zbigniew Zdanowicz 93cf33fda0 performance: remove redundant tlb flush on monitor fence
Related-To: NEO-8395

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-11-02 12:39:05 +01:00
Mateusz Hoppe 4a5d9eeae7 refactor: do not trigger debug break when mkdir fails with EEXIST
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-11-02 12:26:32 +01:00
Aravind Gopalakrishnan 99abb40a45 feature: Add extensions to expose dot product capabilities
- Add new extension property for user to query platform DP support.
- Add new blackbox and ULTs for extension.

Related-To: NEO-8786

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2023-10-31 21:54:39 +01:00
Zbigniew Zdanowicz db10e85526 refactor: add new diagnostic mode for device state
Related-To: NEO-8395

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-10-31 16:02:53 +01:00
Compute-Runtime-Validation c9d6615231 Revert "feature: Add extensions to expose dot product capabilities"
This reverts commit e988fe66e3.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-10-31 05:06:06 +01:00
Michal Mrozek 060fc40938 performance: move constants out of the loop.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2023-10-30 18:16:35 +01:00
Mateusz Jablonski d56abe6f60 refactor: correct variable naming
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-30 17:26:20 +01:00
Aravind Gopalakrishnan e988fe66e3 feature: Add extensions to expose dot product capabilities
- Add new extension property for user to query platform DP support.
- Add new blackbox and ULTs for extension.

Related-To: NEO-8786

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2023-10-30 15:15:51 +01:00
Mateusz Jablonski 036d1e4814 refactor: correct variable naming
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-30 12:48:51 +01:00
tkordek f8ad191b16 feature: Caller ID introduced to aubstream
... to distinguish environment of capturing.

Related-To: NEO-9235

Signed-off-by: Tomasz Kordek <tomasz.kordek@intel.com>
2023-10-30 12:37:49 +01:00
Mateusz Jablonski 80e59ff344 fix: don't call virtual method in ctor
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-30 12:26:57 +01:00
Michal Mrozek 80e3632c41 performance: do not free deferred allocations when not needed
Level Zero doesn't use deferred allocations so no point in paying the
price to check for them in cleanAllocationList.

Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2023-10-27 18:02:43 +02:00
Dominik Dabek 39cf653959 performance(ocl): cmd buffer prealloc per cmdqueue
Add mechanism to preallocate cmd buffer allocations in command stream
receiver reusable allocations list per command queue initialized.

This should limit additional allocations during hot loop.

Needs to be enabled in subsequent commits by setting product helper
method.

Related-To: NEO-8152

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-10-27 16:56:29 +02:00
Mateusz Jablonski 1e18bf0392 fix: define timestamp mask based on timer resolution
Related-To: HSD-18034577551
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-27 11:55:20 +02:00
Mateusz Jablonski 2eebbcd41e fix: don't transfer FtrTile64Optimization to gmmlib
Related-To: HSD-18034338825
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-27 08:07:58 +02:00
Mateusz Hoppe 5d572b9c8f feature: allow freeing memory in aubstream
Related-To: NEO-2707

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-10-26 17:16:23 +02:00
Compute-Runtime-Validation fca2159430 Revert "fix: if device hierarchy is flat then getSubDevicesCount return 1u"
This reverts commit cb0bb57f49.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-10-26 15:40:29 +02:00
Baj, Tomasz cb0bb57f49 fix: if device hierarchy is flat then getSubDevicesCount return 1u
Related-To: NEO-9167

Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-10-25 15:51:52 +02:00
Andrzej Koska c4a24d8f31 fix: Avoiding races when modifying queue and Tag data
Related-To: NEO-8196
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2023-10-25 14:20:57 +02:00
Naklicki, Mateusz 1352bde5c8 fix: use pointer instead of std::function
MSVC Compiler found an issue that there could be an attempt to
reference a deleted function when using std::function.
As the functionality of ioctlHelper::getGpuTime is very straightforward,
simple function pointer should be sufficient to make the job and
it mitigates overhead of std::function.

Related-To: NEO-8324
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2023-10-25 06:23:43 +02:00
Compute-Runtime-Validation 69f614a8c2 Revert "fix: allow legacy device binary validation logic for Blender on DG2 p...
This reverts commit d3d15542fb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-10-24 21:00:19 +02:00
Jablonski, Mateusz ac5f64f5c6 fix: fix compilation error in clang on Windows (2/n)
Signed-off-by: Jablonski, Mateusz <mateusz.jablonski@intel.com>
2023-10-24 15:59:06 +02:00
Mateusz Jablonski 6d2d16d68e fix: avoid overflow of gpu time stamp in ns
Related-To: NEO-8394
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-24 14:56:15 +02:00
Francois Dugast 1fd9d29f54 feature: update Xe KMD header
xe_drm.h header is generated from the series "uAPI Alignment - take 1 v4"
from https://patchwork.freedesktop.org/series/124387/

Related-To: NEO-8324
Signed-off-by: Francois Dugast <francois.dugast@intel.com>
Signed-off-by: Kamil, Kopryk <kamil.kopryk@intel.com>
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2023-10-24 12:03:45 +02:00
Kacper Nowak d3d15542fb fix: allow legacy device binary validation logic for Blender on DG2 platforms
Temporarily opt-out from additional compatibility checks
on DG2 for Blender AOT-compiled kernels.
This prevents a long kernel recompilation.

Additionally, same behavior can be enforced for other applications
manually via NEO debug key named DoNotUseProductConfigForValidationWa.

Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
Related-To: NEO-9240
2023-10-23 18:20:37 +02:00
Zbigniew Zdanowicz 0b42510f41 fix: add l1 cache flush to barrier operations when required
Related-To: NEO-8395

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-10-23 18:18:22 +02:00
Mateusz Hoppe 52b0f32688 fix: offset cpu address when writing chunk in simulated csr
- not only gpuAddress is offset but also cpu address with data needs
to be offset while writing memory.

Related-To: GSD-6604

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-10-23 17:01:25 +02:00
Mateusz Jablonski 8da4a9cbc7 fix: add debug flag to control non walker signalling in in-order cmdlist
set to false by default

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-23 14:43:40 +02:00
Kulkarni, Ashwin Kumar 875f653952 Revert "feature: support UUID feature for windows"
This reverts commit cb2b4214b4.

Related-To: NEO-8799

Signed-off-by: Kulkarni, Ashwin Kumar <ashwin.kumar.kulkarni@intel.com>
2023-10-23 14:08:03 +02:00
Zbigniew Zdanowicz 2e09b5ff66 refactor: reposition preamble helper implementation methods
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-10-20 13:18:29 +02:00
Fabian Zwolinski b8831e87b0 fix: iGPUs - increase size of reported global memory available on Linux to 94%
Related-To: NEO-8989, NEO-7229
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-10-20 10:51:47 +02:00
Fabian Zwolinski ede8663e41 fix: iGPUs - increase size of reported global memory available on Windows to 94%
Related-To: NEO-8989, NEO-7229
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-10-20 10:37:11 +02:00
Fabian Zwolinski 55abd8dc9a fix: reduce globalMemory by internalResourcesSize for iGPUs
internalResourcesSize is a constant 450 MB value.

Related-To: NEO-8989
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-10-20 10:10:41 +02:00
Kozlowski, Marek 90987bd878 feature: add minimal interface for software tag injection
* Add new Software Tag `ArbitraryString`
* Add minimal interface for injecting software tags

Related-To: NEO-5550

Signed-off-by: Marek Kozlowski <marek.kozlowski@intel.com>
2023-10-20 01:49:33 +02:00
Mateusz Hoppe 97faeae16f feature: heapAssigner per root device
- create heapAssigner per root device in memory manager to allow per
device config

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-10-19 19:52:24 +02:00
Dunajski, Bartosz 25195ebc96 fix: capability to write memory chunk in aub/tbx mode
Related-To: GSD-6604

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-10-19 19:13:11 +02:00
Mateusz Jablonski 4dfa12c8eb fix: add mechanism to detect gpu timestamp overflows
unify naming CpuGpu to GpuCpu

Related-To: NEO-8394
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-19 16:31:06 +02:00
Lukasz Jobczyk 750b5ba89a fix: flush necessary caches when dispatch pipe control
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-10-19 15:09:31 +02:00
Spruit, Neil R 58fa968273 fix: Calculate size of buffer surface state given mapped allocations
Related-To: NEO-8350

- given a virtual address part of a mapping to multiple physical
allocations, then the buffer surface state size is increased to
include the allocations which follow the current allocation, which
allows users access to the remainder of the mapped buffer.

Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
2023-10-19 13:38:51 +02:00
Katarzyna Cencelewska ff1a8770fe fix: when gt type media on xe kmd then do not inizialize
also use proper size when query config

Related-To: HSD-18034189281
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2023-10-19 12:37:36 +02:00
Spruit, Neil R 456f07212d fix: Return pageSize/Heap for reservation based on the size and devices
Related-To: NEO-9127,NEO-9142

- Change the heap/pageSize used based on the size of the reservation
required and the device heaps available.
- Return the page size required based on the size requested by the user
for virtual reservation and the devices on the system.
- Check the size passed in by the user in memory map and physical memory
allocation is valid for the heap and page size that is required.

Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
2023-10-19 10:44:13 +02:00
Mateusz Hoppe 5a4fa180d6 feature: control bindless compilation mode based on release
- check releaseHelper support when selecting bindless mode, if not
disabled, prefer bindless mode in L0 API
- bindless mode can be forced with DebugVariable: UseBindlessMode

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-10-19 10:11:11 +02:00
Compute-Runtime-Validation 096e424428 Revert "fix: initialize tile only when gt type main on xe kmd"
This reverts commit cbde47d638.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-10-19 09:47:58 +02:00
John Falkowski f156a74f54 fix: split chunking prefetch flags
Related-To: NEO-9120

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2023-10-18 19:20:42 +02:00
Katarzyna Cencelewska cbde47d638 fix: initialize tile only when gt type main on xe kmd
also use proper size when query config

Related-To: HSD-18034189281
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2023-10-18 16:01:16 +02:00
Compute-Runtime-Validation 713aa05f27 Revert "performance: limit monitor fence dispatch only to task count dispatches"
This reverts commit f26f027ee7.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-10-18 11:15:40 +02:00
Mateusz Jablonski 15484c6778 fix: initialize member before using it
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-18 08:38:22 +02:00
Dunajski, Bartosz 95dc224021 refactor: change default PVC unit tests runner
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-10-18 08:22:27 +02:00
Dominik Dabek 75c4844987 feature(internal): logging kernel dispatch params
Use debug flag PrintKernelDispatchParameters to print params used in
thread group dispatch size heuristic when encoding kernel dispatch.

Related-To: NEO-6989

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-10-17 17:31:54 +02:00
Dunajski, Bartosz af7bcbf99c Revert "refactor: split CpuInaccessible MemoryPool types to Device and System"
This reverts commit 2e8cf5fdf5.

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-10-17 10:03:14 +02:00
Dunajski, Bartosz 0a787a0880 refactor: remove not used code
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-10-16 15:16:07 +02:00
Mateusz Hoppe 8c6b936dac fix: decanonize address passed to AubManager's freeMemory()
- the same address should be used for writeMemory() and freeMemory()

Related-To: NEO-2707

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-10-16 12:58:21 +02:00
Dunajski, Bartosz 2e8cf5fdf5 refactor: split CpuInaccessible MemoryPool types to Device and System
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-10-16 12:47:13 +02:00
Compute-Runtime-Validation 30b066c40e Revert "fix: synchronize host and device timers to avoid device timer overflow"
This reverts commit dae8c34f81.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-10-16 11:16:23 +02:00