Chodor, Jaroslaw
7744251995
refactor: explicit copy/move operations in OpaqueElementAllocator
...
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-16 10:10:05 +02:00
Oskar Hubert Weber
38a3667304
fix: avoid joining thread in deferred deleter
...
Join thread in DllMain (which we are not aware of)
can result in hang occurring in DeferredDeleter, if the library is
freed before FreeLibraryAndExitThread call from within the worker thread,
the thread gets stuck, thus the main thread is stuck on worker->join().
Related-To: NEO-14121
Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2025-04-16 10:02:10 +02:00
Mateusz Jablonski
f6b92aa5d5
build: update default PTL revision to B0
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-16 09:44:58 +02:00
Tomasz Biernacik
eff2d1cde8
performance: change usage for semaphoreBuffer on integrated devices
...
Related-To: NEO-9421
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-04-16 09:07:05 +02:00
Chandio, Bibrak Qamar
bed2ad1ac0
feature: memAdise for System Allocator
...
Related-To: NEO-13715
Implements basic functionality for zeCommandListAppendMemoryAdvise for
system allocator
Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-04-16 07:11:14 +02:00
Chodor, Jaroslaw
2ce694c52b
feature: Enabling support for redirecting FCL to IGC for OCLOC
...
This allows for invoking IGC as OCL C translator in OCLOC paths
Accepted debug env :
NEO_OCLOC_UseIgcAsFcl=1 ; forces IGC
NEO_OCLOC_UseIgcAsFcl=2 ; forces FCL
NEO_OCLOC_UseIgcAsFcl=0 ; use defaults
Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-15 16:44:35 +02:00
Tomasz Biernacik
5bf8b66498
feature: disable getting fd from vmExport
...
Related-To: NEO-12680
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-04-15 16:08:40 +02:00
Maciej Plewka
b8be602bfb
fix: Return max wg count equal one on platforms without dispatch all support
...
Related-To: NEO-14125, GSD-10725
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-04-15 15:41:23 +02:00
Lukasz Jobczyk
06e375fdc6
refactor: Add debug break for partial make resident
...
Resolves: NEO-13711
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-15 14:59:28 +02:00
Lukasz Jobczyk
f7939735da
refactor: Adjust USM cleaner to ULLS light
...
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-15 14:05:02 +02:00
Lukasz Jobczyk
455209aadc
refactor: Simplify fence selection in ULLS
...
Related-To: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-15 14:04:47 +02:00
Marcel Skierkowski
31f0fd4672
refactor: rename variable slmSize
...
Max programmable slm size is stored in RuntimeCapabilityTable as slmSize.
That is misleading name
Rename the variable slmSize to better reflect the actual meaning of the member.
Related-To: NEO-12949
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-14 20:00:29 +02:00
Chodor, Jaroslaw
7bded13638
feature: Enabling support for redirecting FCL to IGC for OCL
...
This allows for invoking IGC as OCL C translator in OpenCL paths
Accepted debug env :
UseIgcAsFcl=1 ; forces IGC
UseIgcAsFcl=2 ; forces FCL
UseIgcAsFcl=0 ; uses defaults
Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-14 18:41:48 +02:00
Jaroslaw Warchulski
3e1aa33924
refactor: cleanup headers
...
Related-To: NEO-5548
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-14 14:59:40 +02:00
Maciej Bielski
6ecf5b373e
feature: exit with warning if small bar detected on Linux
...
Return early and provide justification, otherwise SIGBUS error is
returned.
Related-To: NEO-14506
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-04-14 14:43:25 +02:00
Marcel Skierkowski
e82be94368
refactor: Modernize writeDataToFile function
...
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-14 14:28:58 +02:00
Kamil Kopryk
dd3d294f87
performance: cache MOCS values
...
This change caches the most used MOCS values:
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CONST);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CACHELINE_MISALIGNED);
inside gmmHelper class during initialization to avoid repeated
calls of virtual functions, branches and/or gmm lib access.
and adds more readably corresponding getters:
* getL1EnabledMOCS
* getL3EnabledMOCS
* getUncachedMOCS
If force all resources uncached is called,
these 3 cached mocs values are reinitialized
It also changes the order of gmmHelper members, to avoid
not needed padding after addressWidth
and simplifies logic in getMocsIndex function
for xehp and later products.
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-04-14 14:12:48 +02:00
Tomasz Biernacik
194c3058b8
performance: change usage for ringBuffer on integrated devices
...
Related-To: NEO-9421
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-04-14 12:25:38 +02:00
Chodor, Jaroslaw
a079a2654d
feature: Enabling support for DisableDeepBind debug env in ocloc
...
Accepted debug env :
NEO_OCLOC_DisableDeepBind=1
Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-12 23:21:19 +02:00
Tomasz Biernacik
18435df04c
fix: unify deferring MOCS to PAT
...
Related-To: NEO-14643
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-04-11 20:10:25 +02:00
Chodor, Jaroslaw
72ae27d51e
feature: Enabling debug variables reader in ocloc
...
This allows for using debug variables in ocloc with
a few by-design limitations :
* debug var needs to be prefixed with NEO_OCLOC_
(when setting-up at runtime as environment variable
or using .config file)
* debug var needs to be tagged with S_OCLOC scope
(at compile time, in debug_variable .inl files)
Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-11 19:58:53 +02:00
Chodor, Jaroslaw
9e14eaaed1
feature: Adding support for scoped debug variables
...
This allows for limitting debug variables to specific
components.
Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-11 17:52:17 +02:00
Mateusz Hoppe
3204411aca
refactor: use deviceBitfield from CSR when creating engine
...
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-04-11 17:30:29 +02:00
Szymon Morek
5b20450162
performance: enable compression on PTL
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-11 17:26:29 +02:00
Igor Venevtsev
ef52479ce4
feature: add FillImage1dBuffer built-in kernel
...
Resolves: NEO-13984, HSD-18041422852
Signed-off-by: Igor Venevtsev <igor.venevtsev@intel.com>
2025-04-10 14:04:13 +02:00
Marcel Skierkowski
5d01677454
refactor: Mock file system in ULTs
...
Functions: fileExists and loadDataToFile
use IO functions from namespace IoFunctions
Now tests that use these functions
are mocked by default,
but some still require access to real files
and have been restored the ability to read files.
They will be mocked in next PRs.
Related-To: NEO-7006
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-09 19:51:46 +02:00
Maciej Plewka
286acf30a3
fix: Dispatch residency fence when stopping ulls on windows
...
Related-To: NEO-14396, HSD-18041496023, HSD-13012953666, NEO-14611
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-04-09 19:51:16 +02:00
Szymon Morek
f4815a9b69
performance: enable ULLS on LNL Linux
...
Related-To: NEO-9004
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-09 14:45:46 +02:00
Lukasz Jobczyk
1b5519a880
fix: Adjust standard heaps when on 57 bit address space
...
Resolves: GSD-10871
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-09 12:51:03 +02:00
Fabian Zwoliński
363aa8cdbd
performance: align local memory Tag allocations to 2MB
...
Align local memory allocations of tag types to 2MB when
2MB alignment is enabled via the product helper
(is2MBLocalMemAlignmentEnabled flag).
Refactored the allocateGraphicsMemoryInDevicePool function to improve
readability and maintainability. Simplified the logic for
determining base size and final alignment by reducing redundant code.
Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-09 09:59:44 +02:00
Aravind Gopalakrishnan
c7fe7dde90
fix: Correct Xe sysfs paths for ccs mode setting
...
Fix the sysfs paths for Xe KMD
Related-To: NEO-10378
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-04-09 06:51:18 +02:00
Maciej Plewka
bb36614567
fix: add separate counter for tracking ring buffer completion
...
Related-To: NEO-14396, NEO-9548
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-04-08 16:30:44 +02:00
Jaroslaw Warchulski
fb5da0c3e5
fix: do not forbid compression for xe_lpg Windows and DG2
...
Related-To: NEO-14583, NEO-14593
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-08 16:08:59 +02:00
Marcel Skierkowski
b75fbe8e2c
refactor: mock filesystem in ocloc ult pt.1
...
Mocked IO operations in ./ocloc_tests application
Mocked gtest stdout capture in ocloc tests
Related-To: NEO-14084
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-08 16:08:42 +02:00
Mateusz Jablonski
1c377dc930
fix: ensure payload arguments are patched before walker command is fetched
...
In case of indirect kernel launch some payload arguments are patched
just before walker command, this change disables prefetch, performs
batch buffer start to next bytes and then re-enable prefetch. All these
operations are performed between MI_STORE_REGISTER_MEM and COMPUTE_WALKER
Related-To: NEO-14584
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-08 11:57:45 +02:00
Lukasz Jobczyk
ca45573dd3
fix: Handle new ring buffer residency when switch ULLS light ring
...
Related-To: NEO-14406, NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-08 10:12:57 +02:00
Szymon Morek
ff1dcc1a8a
performance: adust copy from image to host logic
...
Related-To: NEO-14569
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-08 10:02:00 +02:00
Brandon Yates
c4937991cd
feature: Add l0DebuggerEnabled to WalkerArgs
...
Related-to: NEO-12993
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-04-08 03:53:38 +02:00
Vysochyn, Illia
4c580c0a83
feature: Update COMPUTE_WALKER instruction
...
Fixes incorrect RawData array length.
Related-To: NEO-14588
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-04-08 00:01:43 +02:00
Young Jin Yoon
0de024dbf9
refactor: separate post-sync into EncodePostSync
...
Created EncodePostSync template struct to organize various post-sync
variables/functions from EncodeDispatchKernel
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-04-07 20:06:14 +02:00
Lukasz Jobczyk
5bcf500c13
fix: Extend ULLS light mutex range
...
If some operatioins requires ULLS light stop, execute such operations
under mutex in pair with ULLS stop to ensure no other thread will start
ULLS.
Related-To: NEO-14406, NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-07 17:35:04 +02:00
Vysochyn, Illia
146a7f7b9b
feature: Add support for BUFFER_SIZE explicit argument
...
Related-To: NEO-13972
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-04-07 14:15:26 +02:00
Lukasz Jobczyk
8978ea5e5a
performance: Do not create global fence allocation on integrated
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-07 11:22:04 +02:00
Jaroslaw Warchulski
df07897144
fix: forbid compression for pre-xe2 platforms
...
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-05 00:15:16 +02:00
Compute-Runtime-Validation
f332571d96
Revert "performance: Do not create global fence allocation on integrated"
...
This reverts commit ecf8a07d26
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-04 16:26:19 +02:00
Dominik Dabek
bd516b3552
fix: usm reuse, clean from largest
...
When trimming old allocations in usm reuse start from largest
allocations.
This will reduce memory usage more quickly once max hold time is hit.
Related-To: NEO-6893, NEO-14429
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-04-04 14:57:15 +02:00
Lukasz Jobczyk
ecf8a07d26
performance: Do not create global fence allocation on integrated
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-04 11:45:22 +02:00
Dominik Dabek
3703ff550c
fix: use real size when putting into usm reuse
...
Real allocation size should be used to properly apply limits and allow
more usm reuse hits.
Related-To: NEO-6893
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-04-04 09:44:32 +02:00
Bartosz Dunajski
f99870e716
fix: improve media handling 2
...
Related-To: NEO-14462
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-04-03 19:11:53 +02:00
Mateusz Jablonski
bb518adf34
fix: patching payload arguments in inline data in case of indirect kernel
...
Related-To: NEO-14532
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-03 17:21:28 +02:00
Szymon Morek
95e0244f70
fix: properly pass info about 3D image
...
Related-To: NEO-14538
It's valid for 3D image to copy 2D region.
Current checks for mip map do not consider that.
This change correctly checks for mip mapped 3D image.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 16:33:57 +02:00
Slawomir Milczarek
7e7e0a000f
refactor: Add ioctl helper for context destruction
...
Related-To: NEO-11817
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-04-03 16:08:53 +02:00
Dominik Dabek
be27367020
performance: usm reuse, avoid looking up svmData
...
Save svmData on putting into reuse, instead of searching each time.
Change UNRECOVERABLE_IF to DEBUG_BREAK_IF.
Related-To: NEO-6893
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-04-03 15:50:49 +02:00
Jaroslaw Warchulski
c010d17842
fix: respect compression flag in capability table
...
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-03 15:36:55 +02:00
Szymon Morek
6ea83f322d
fix: do not override user data beyond slice region
...
Related-To: NEO-14538
If user passes slice pitch which is larger than region
to copy, do not override memory beyond region but within
that slice pitch.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 12:37:35 +02:00
Jaroslaw Warchulski
62baf28316
fix: remove unnecesarry WA for DG2 compression
...
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-03 08:04:19 +02:00
Brandon Yates
4651e72b0b
fix: Fail device init if kernel debugging is misconfigured
...
Also print error to stderr
Related-to: GSD-10780
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-04-02 21:06:30 +02:00
Filip Hazubski
504440fc4d
feature: Add ftrHeaplessMode flag
...
Pass hwInfo to isHeaplessModeEnabled and isForceBindlessRequired functions.
Related-To: NEO-14526
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-04-02 21:06:05 +02:00
Lukasz Jobczyk
deca36fd32
fix: Stop ULLS light when evict resource
...
Related-To: NEO-14406
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-02 16:37:43 +02:00
Bartosz Dunajski
bb3927531e
refactor: reduce HWTEST2_F usage in test_in_order_cmdlist_1.cpp
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-04-02 14:29:12 +02:00
Fabian Zwoliński
7ef3880793
feature: implement pool allocator for gpuTimestampDeviceBuffer
...
The patch applies to Level Zero.
Only allocations < 2MB will be fetched from the pool.
Allocations are shared and reused within a given device.
Additionally, I added a new debug flag to control the allocator:
EnableTimestampPoolAllocator
Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-02 14:28:56 +02:00
Szymon Morek
8836f6df0b
fix: forward mip map level for 3D images
...
Related-To: NEO-14539
1D and 2D images have already mip map level set correctly.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-02 11:46:41 +02:00
Aravind Gopalakrishnan
3a7d7e022c
fix: Add platform support for reservation on svm heap
...
Related-To: GSD-10816
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-04-02 02:46:30 +02:00
Brandon Yates
a48d66ad75
feature: Add programExceptions stub to CSR
...
Related-to: NEO-12967
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-04-01 18:33:40 +02:00
Szymon Morek
3010af596e
performance: add infrastructure for staging with 3D images
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-01 15:30:30 +02:00
Mateusz Jablonski
ed37a1e7ef
build: remove not needed flag for builtins compilation
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-01 14:18:49 +02:00
Vysochyn, Illia
70af2bc20b
refactor: Adjust size to preferred SLM values array
...
Related-To: NEO-14479
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-04-01 11:56:50 +02:00
Chandio, Bibrak Qamar
2ba2970492
performance: Waiting on make resident Windows
...
Related-To: NEO-14056
No need to explicitly wait on Windows KMD during make resident as it has
a while loop that does it nevertheless. The KMD wait affects the API
overhead of zeCommandQueueExecuteCommandLists some platforms (MTL, and ARL).
Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-04-01 00:12:45 +02:00
Lukasz Jobczyk
0a11a96a53
refactor: Add dedicated method to check if any ULLS light enabled
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-31 16:36:20 +02:00
Andrzej Koska
e3e01e94a0
Revert "performance: enable Direct Submission on LNL Linux"
...
This reverts commit cb3b4d326d
.
Related-To: NEO-14517, NEO-9004
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2025-03-31 15:22:29 +02:00
Szymon Morek
62964a0b08
fix: invalidate caches when heap is placed into reuse list
...
Related-To: NEO-9004
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-31 12:30:29 +02:00
Bartosz Dunajski
831b488685
fix: improve media engine handling
...
Related-To: NEO-14462
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-31 10:40:21 +02:00
Maciej Plewka
a5e19330e9
fix: lock csr before locking residency controller in trim to budget path
...
Lock on csr is needed before lock on residency controller to prevent
incorrect lock order. Csr may be locked in waitOnCpu called from trimToBudget,
which may lead to deadlocks
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-28 16:18:20 +01:00
Szymon Morek
3fff3dd77b
fix: set misaligned source memory 1-way coherent
...
Related-To: NEO-14443
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-28 14:16:45 +01:00
Filip Hazubski
3d9fc8968e
fix: Add BMG device id
...
Add device ID: 0xE211
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-03-28 13:40:30 +01:00
Katarzyna Cencelewska
92e40afc49
feature: update debug flag DirectSubmissionPrintSemaphoreUsage
...
instead of printf use makro that make flush after printf
Related-To: HSD-14024170600
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2025-03-28 13:36:15 +01:00
Compute-Runtime-Validation
88a48f1c5b
Revert "performance: Improve ULLS light residency management"
...
This reverts commit 35eae3f977
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-28 11:21:27 +01:00
Mateusz Hoppe
c105c77930
fix: calculation of os context count
...
Related-To: NEO-12952
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-03-28 09:55:17 +01:00
Andrzej Koska
cb3b4d326d
performance: enable Direct Submission on LNL Linux
...
Related-To: NEO-9004
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2025-03-27 17:09:54 +01:00
Lukasz Jobczyk
b43b23b6ed
fix: Init wait utils after hwInfo init for both OS
...
Resolves: HSD-18041922513
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-27 16:45:22 +01:00
Szymon Morek
ead0842763
feature: add L0 API to query kernel argument info
...
Related-To: NEO-14358
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-27 16:43:33 +01:00
Bartosz Dunajski
85f2734ca4
fix: correct gt_id to tile_id engine mapping
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-27 15:57:57 +01:00
Dominik Dabek
c76edaba4e
fix: enable usm reuse limit based on memory usage
...
Related-To: NEO-14160, NEO-6893
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-27 15:14:08 +01:00
Dominik Dabek
915d657420
fix: flag to limit usm reuse based on memory usage
...
Host usm and device usm for igfx checks system memory usage.
Device usm for dgfx checks local memory usage.
If used memory is above limit threshold:
- no new allocations will be saved for reuse
- cleaner will use shorter hold time of 2 seconds
- cleaner will free all eligible allocations, regardless of async
deleter thread having work
Motivation: in case of gfx memory being full, making resident new
allocations will require evictions which leads to massive slowdown on
enqueue calls.
This change aims to minimize cases where extra memory usage from usm
reuse mechanism leads to above situation.
Related-To: NEO-6893, NEO-14160
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-27 10:25:19 +01:00
Jack Myers
0aa2c4f0cb
feature: allow removal of heapful code paths
...
Related-To: NEO-13007
Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-03-27 01:34:35 +01:00
Damian Tomczak
0243004907
feature: additional checkers to enable feature
...
Resolves: NEO-13973
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-26 18:06:20 +01:00
Mateusz Jablonski
4bc13fa0dc
fix: correct MetricsLibraryGenId for Xe3
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-03-26 16:35:10 +01:00
Lukasz Jobczyk
60b551758c
performance: Adjust waitpkg threshold for discrete devices
...
Related-To: NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-26 14:59:19 +01:00
Compute-Runtime-Validation
a89113fa1a
Revert "fix: Fail device init if kernel debugging is misconfigured"
...
This reverts commit c122bc51f9
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-26 13:27:12 +01:00
Compute-Runtime-Validation
214277125d
Revert "performance: enable ULLS on LNL Linux"
...
This reverts commit a15de0c4f6
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-26 13:26:57 +01:00
Jaroslaw Warchulski
eeeed9edd8
refactor: cleanup in product helpers
...
Related-To: NEO-12681
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-03-25 21:07:55 +01:00
Brandon Yates
c122bc51f9
fix: Fail device init if kernel debugging is misconfigured
...
Also print error to stderr
Related-to: GSD-10780
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-03-25 20:40:25 +01:00
Wojciech Konior
817c6df46d
fix: unify sources from shared added to dll&tests
...
Related-To: NEO-4372
Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2025-03-25 16:39:18 +01:00
Jaroslaw Warchulski
1230ab99aa
refactor: adjust product helper file name after pre-gen12 removal
...
Related-To: NEO-12681
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-03-25 15:57:46 +01:00
Jaroslaw Warchulski
cb2add406e
fix: do not use ftrE2ECompression flag for xe2+
...
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-03-25 09:29:50 +01:00
Lukasz Jobczyk
35eae3f977
performance: Improve ULLS light residency management
...
-skip bo::wait if completion fence not needed
-do not restart exec on evict
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-24 17:48:55 +01:00
Pratik Bari
f67663d863
fix(sysman): Create Engine Handles for multiple gt w.r.t Xe driver
...
For the Xe driver, a single tile can have multiple gts. Now the Engine
handles are created for multiple gts available on the single tile as in
the case for the Xe driver.
Related-To: NEO-14401
Signed-off-by: Pratik Bari <pratik.bari@intel.com>
2025-03-24 17:14:02 +01:00
Bartosz Dunajski
42ced7f5eb
refactor: add NotLockable param print
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-24 16:22:21 +01:00
Bartosz Dunajski
8c21a9634f
refactor: add missing file to cmake list
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-24 15:19:03 +01:00
Kamil Kopryk
73795ced64
refactor: add setupTimestampPacketFlushL3 function
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-24 13:10:03 +01:00
Damian Tomczak
26c23460cb
fix: set heapless mode for builtins compilation
...
Related-to: NEO-13973
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-24 12:01:58 +01:00
Lukasz Jobczyk
8fbde30432
fix: Adjust waitpkg threshold to 12 us
...
Resolves: NEO-14434, NEO-14435
Related-To: NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-24 11:43:54 +01:00
Szymon Morek
a15de0c4f6
performance: enable ULLS on LNL Linux
...
Related-To: NEO-9004
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-24 10:44:23 +01:00
Tomasz Biernacik
f4f13dfeec
Revert "performance: disable cpu caching for buffers on PTL"
...
This reverts commit c3f0c4e4ac
.
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-24 10:23:07 +01:00
Dominik Dabek
6e998fc3c1
fix: move host usm reuse max size to mem manager
...
Intialize value on memory manager creation.
Related-To: NEO-6893
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-24 08:53:30 +01:00
Lukasz Jobczyk
54cb0e24f8
performance: Switch waitpkg use to tpause for ULLS light
...
Related-To: NEO-13922, NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-21 18:09:37 +01:00
Szymon Morek
bb10290828
fix: make misaligned user memory 2-Way coherent
...
Related-To: NEO-9004
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-21 17:56:37 +01:00
Lukasz Jobczyk
7c75807987
performance: Enable waitpkg with tpause
...
Resolves: NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-21 17:30:55 +01:00
Compute-Runtime-Validation
28dd4cf4d8
Revert "feature: additional checkers to enable feature"
...
This reverts commit c95a67b420
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-21 16:48:05 +01:00
Maciej Plewka
1b7bf690d2
fix: update fence values when residency list passed to switch ring buffer
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-21 12:26:08 +01:00
Lukasz Jobczyk
8a85a96ed2
feature: Add 3-level wait scheme with tpause intrinsic
...
Related-To: NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-21 12:12:57 +01:00
Aravind Gopalakrishnan
4e9afb32e6
refactor: Fix coverage issue with parse ccs mode
...
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-03-21 11:12:24 +01:00
Damian Tomczak
c95a67b420
feature: additional checkers to enable feature
...
Resolves: NEO-13973
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-21 10:38:53 +01:00
Chodor, Jaroslaw
0bff9def3b
feature: New allocators for opaque arrays
...
Adds fast allocators for opaque objects of uniform size.
Related-To: NEO-13406
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-03-20 21:59:44 +01:00
Lukasz Jobczyk
75461d190d
performance: Adjust umwait counter value
...
Related-To: NEO-13922, NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-20 13:57:25 +01:00
Damian Tomczak
e6ff523a8e
fix: heapless has the highest priority as builtin
...
Related-to: NEO-13973
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-20 08:10:44 +01:00
Bartosz Dunajski
6e17df4fb3
refactor: improve EncodeAluHelper
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-19 15:44:34 +01:00
Michal Mrozek
6a9716b81c
refactor: remove not needed code
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-03-19 14:53:57 +01:00
Aravind Gopalakrishnan
724ba20e41
fix: Parse CCS mode setting for non PVC platforms
...
Related-To: GSD-8785
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-03-19 12:47:46 +01:00
Lukasz Jobczyk
6cb52f71b4
fix: Avoid mutex deadlock when switch ulls light ring buffer
...
Related-To: NEO-14406
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-19 11:47:40 +01:00
Jitendra Sharma
dda7876d3a
feature: Introduce SipClassType::externalLib
...
Related-To: NEO-13737
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2025-03-19 11:06:03 +01:00
Jack Myers
0e25970853
fix: re-add switch case for once writable query
...
A change related to the tbx fault manager
incorrectly removed a switch case from
`AubHelper::isOneTimeAubWritableAllocationType`.
This fixes that and refactors some APIs to prevent
similar mistakes from happening again by cleaning
up logic.
Addresses show stopper for pre-si pytorch workflows.
Resolves: NEO-14399
Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-03-19 09:54:54 +01:00
Kamil Kopryk
2e729bcb4c
refactor: move isTimestampWaitSupportedForQueues to productHelper
...
Related-to: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-19 09:31:33 +01:00
Compute-Runtime-Validation
5f7f0dd785
Revert "performance: Enable waitpkg"
...
This reverts commit 8ec5434460
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-18 13:28:29 +01:00
Michał Pryba
b9df406a46
refactor: remove minGen field and pregen12 platform cases
...
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-03-18 12:49:34 +01:00
Maciej Plewka
36fa6d66ae
fix: lock csr in stopDirectSubmission if needed
...
Related-To: NEO-13875, NEO-14143, HSD-16026538384, HSD-16026780358
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-18 09:30:38 +01:00
Maciej Bielski
a7c46b8213
feature: support L2 cache reservation
...
Related-To: NEO-12837
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-03-17 19:41:55 +01:00
Mateusz Hoppe
9d052cd8ee
fix: pass rootDevice when creating secondary contexts
...
- fix initialization of gfxCoreHelper in drm and wddm
Related-To: NEO-7824, NEO-8171
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-03-17 16:03:27 +01:00
Wojciech Konior
9c8ced8c78
fix: device ids missmatch error print
...
Related-To: NEO-12845
Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2025-03-17 12:58:12 +01:00
Compute-Runtime-Validation
d54b74e8bd
Revert "fix: Parse CCS mode setting for non PVC platforms"
...
This reverts commit 5ca78dfdd1
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-17 00:30:44 +01:00
Aravind Gopalakrishnan
5ca78dfdd1
fix: Parse CCS mode setting for non PVC platforms
...
Related-To: GSD-8785
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-03-15 01:39:51 +01:00
Kamil Kopryk
7f607427fa
fix: correct dsh generation
...
Don't use hardcoded sampler state and sampler border color state sizes.
Related-To: NEO-14216
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-14 21:18:38 +01:00
Kamil Kopryk
2c77d2c698
refactor: add SetMaxBVHLevels debug flag
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-14 16:59:48 +01:00
Kamil Kopryk
01edd58be4
refactor: add additional check in addFlushRequiredCommand
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-14 13:54:20 +01:00
Tomasz Biernacik
afcbebef0b
Revert "performance: configure Xe2 compression"
...
This reverts commit ec7821ed84
.
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-14 11:25:20 +01:00
Jack Myers
5f78147e16
fix: hotfix for svmcpu tbx uploads
...
Test program in the linked, related issue
is crashing in tbx mode. Tbx server indicated
upload of invalid memory was made before exit.
Running with debug messages showed that the
problematic upload was an svmcpu buffer when
running neo with separate cpu and gpu
buffers for shared memory management.
Using this info, the problem was narrowed down
to a missing unprotect call in page fault manager
related code, resulting in a protected(invalid)
memory region getting uploaded to tbx.
It is unclear yet why this unprotect call was not made,
since other svmcpu buffers were uploaded without issue.
This hotfix forces the unprotect call in the fault handler,
which allows the test program to run to completion. However,
there is now a failing test case.
Considering the critical nature of the associated
NEO issue and that this patch should unblock
the work depending on the fix, this hotfix should
get merged regardless of the failing test case.
In the meantime, I will continue triaging the
failing test and will implement a proper fix
once the root cause is isolated.
Related-To: NEO-13404
Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-03-14 04:47:21 +01:00
Compute-Runtime-Validation
13db35c35f
Revert "fix: set correct default value of cacheable flag"
...
This reverts commit a8d9e7ed35
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-13 20:38:57 +01:00
Damian Tomczak
3bc6ed8424
fix: not every heapless kernel is stateless
...
Related-to: NEO-13973
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-13 19:09:01 +01:00
Tomasz Biernacik
c3f0c4e4ac
performance: disable cpu caching for buffers on PTL
...
Related-To: NEO-9421
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-13 18:34:42 +01:00
Tomasz Biernacik
a8d9e7ed35
fix: set correct default value of cacheable flag
...
Related-To: NEO-9421
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-13 18:26:02 +01:00
John Falkowski
4d281cf51d
feature: Implement appendMemoryPrefetch for Shared System USM allocations
...
Related-To: NEO-12989
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2025-03-13 06:26:38 +01:00
Zbigniew Zdanowicz
ddc0b0d03b
feature: disable flat ring buffer for command list append operation
...
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-13 06:06:55 +01:00
shubham kumar
e287174211
feature: Add EU stall support for xe2/xe3 core
...
Related-To: NEO-13777
Signed-off-by: shubham kumar <shubham.kumar@intel.com>
2025-03-12 23:10:24 +01:00
Compute-Runtime-Validation
fa2e3adad3
Revert "feature: Implement appendMemoryPrefetch for Shared System USM Allocat...
...
This reverts commit 97799b3faf
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-12 05:55:32 +01:00
Compute-Runtime-Validation
0d5baa2c30
Revert "performance: Cache timestamps on CPU"
...
This reverts commit 83637404bf
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-12 04:41:46 +01:00
Katarzyna Cencelewska
4890150e12
feature: add method adjustMaxThreadsPerThreadGroup to product helper
...
Related-To: HSD-18028334016, HSD-14022274275
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2025-03-11 15:47:57 +01:00
Mateusz Hoppe
a86105814d
fix: default initialize allocated memory when memory is overwritten
...
Related-To: NEO-5093
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-03-11 15:32:33 +01:00
Lukasz Jobczyk
8ec5434460
performance: Enable waitpkg
...
Resolves: NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-11 14:43:01 +01:00
Lukasz Jobczyk
83637404bf
performance: Cache timestamps on CPU
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-11 13:40:18 +01:00
Mateusz Hoppe
2f1128fdcd
refactor: add consteval to compile time functions in debug manager
...
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-03-11 09:50:32 +01:00
John Falkowski
97799b3faf
feature: Implement appendMemoryPrefetch for Shared System USM Allocations
...
Related-To: NEO-12989
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2025-03-11 09:12:48 +01:00
Lukasz Jobczyk
be8545f7df
refactor: reset all waitpkg variables on test start
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-11 08:44:36 +01:00
Szymon Morek
dacbce7f01
performance: introduce staging read for cl_buffer
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-11 08:19:28 +01:00
Fabian Zwoliński
f5e37e725c
Revert "fix: configure ISA Pool params based on productHelper"
...
This reverts commit bf20ae7ae8
.
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-03-10 22:41:13 +01:00
Zbigniew Zdanowicz
cd904269ed
fix: request for task count should enable monitor fence dispatch
...
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-10 18:25:35 +01:00
Lukasz Jobczyk
8ede026c5d
refactor: Use gmm resource info to print unknown usage
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-10 16:28:31 +01:00
Tomasz Biernacik
ec7821ed84
performance: configure Xe2 compression
...
Related-To: NEO-9421
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-10 15:40:00 +01:00
Compute-Runtime-Validation
6ee39ed94c
Revert "fix: Fail device init if kernel debugging is misconfigured"
...
This reverts commit b0c92ea425
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-10 12:23:07 +01:00
Brandon Yates
b0c92ea425
fix: Fail device init if kernel debugging is misconfigured
...
Also print error to stderr
Related-to: GSD-10780
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-03-08 05:02:42 +01:00
Zbigniew Zdanowicz
f8be8414ac
feature: add support for wait event preamble in append command list
...
- add mechanism in queue to trigger start command from queue to regular
- add detection in immediate command list need to dispatch extra start in queue
- fix secondary linear stream in immediate case as it should not use container
- modify tests for primary batch buffer dispatch as default mode
- remove invalid or obsolete tests
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-07 20:31:09 +01:00
Maciej Plewka
c5e1fcf313
fix: Copy tiled 1D array per array element with BLT
...
Related-To: NEO-14147, HSD-14024424096, HSD-14024424178
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-07 17:29:04 +01:00
Tomasz Biernacik
e644b09433
performance: override allocation caching on integrated platforms
...
Related-To: NEO-9421
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-07 15:44:00 +01:00
Slawomir Milczarek
8e251ed274
refactor: Extended drm wrapper with primary context export / import handle
...
Related-To: NEO-12952
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-03-07 15:17:34 +01:00
Dominik Dabek
2170f5ca88
refactor: usm reuse to unique ptr
...
Change usm allocation cache in usm manager to unique ptr
Related-To: NEO-6893
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-07 15:14:58 +01:00
Lukasz Jobczyk
53062056ec
performance: Enable wait pkg for ULLS light
...
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-07 12:18:38 +01:00
Jaroslaw Warchulski
413194bd2a
Revert "fix: do not prefer image compression on xe_lpg for linux and WSL"
...
This reverts commit 8814b6ac4f
.
Resolves: NEO-14286
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-03-07 11:38:46 +01:00
Maciej Bielski
8ad6a7a86b
feature: unify local memory size getter for i915 and xe
...
Previously i915 was using a sysfs node to query the amount of local
memory of a tile. For xe such sysfs node does not exist so the function
was useless.
Instead of a sysfs node, leverage the local memory size present in
MemoryInfo (queried at init-stage) so that the output is now the same
for both i915 and xe.
Related-To: NEO-11127
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-03-07 09:50:19 +01:00
Szymon Morek
f08b32761e
fix: avoid race when evicting resources
...
Related-To: NEO-13843
DrmMemoryOperationsHandler::mutex is used for residency
handling.
However, this mutex is not being locked during eviction when vm bind
failed.
This is causing races for example when ULLS controller wants to make
resources resident, and user thread wants to evict
them.
Fix is to explicitly obtain lock in problematic path.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-07 07:01:41 +01:00
Tomasz Biernacik
1aa418ec15
performance: override cacheable value passed to gmm on linux
...
Related-To: NEO-9421
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-06 21:45:01 +01:00
Damian Tomczak
81b0cac65f
fix: raytracing heapless missing allocation
...
Related-to: NEO-12737
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-06 17:26:09 +01:00
Lukasz Jobczyk
61fe9ebb8f
performance: Allow power efficient waits with ULLS light
...
Resolves: HSD-18041633458
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-06 14:20:58 +01:00
Maciej Plewka
a93cecac36
fix: treat tiled 1D images as 2D with height 1 for BLT copies
...
Related-To: NEO-14147, HSD-14024424096, HSD-14024424178
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-06 11:22:30 +01:00
Dominik Dabek
9eb8e1812c
feature: flag to log usm reuse operations
...
If flag "LogUsmReuse" is set, usm reuse will log operations to csv file.
Each line will contain: timestamp, host/device, operation type,
allocation size, true/false whether operation succeeded.
This data can then be used to produce graphs and help in analyzing
usm reuse in a particular workload.
Related-To: NEO-6893
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-06 11:06:27 +01:00
Szymon Morek
82fba79d9d
performance: set 1ms timeout for ulls controller on LNL and PTL
...
Related-To: NEO-13843
Limit scope to Windows only.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-06 09:13:58 +01:00
Brandon Yates
64b027f71c
feature: Add gfxCoreHelper for StateSip required
...
Related-to: NEO-12967
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-03-05 20:24:17 +01:00
Kamil Kopryk
4c795027e3
refactor: add check if event L3 flush is needed
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-05 18:25:29 +01:00
Zbigniew Zdanowicz
ae1eb076b7
feature: add optional epilogue to flush task method
...
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-05 15:15:59 +01:00
Tomasz Biernacik
92512fdc64
fix: preserve allocation type for memory object
...
Related-To: NEO-9421
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-05 14:48:57 +01:00
Lukasz Jobczyk
5a04d1e10a
refactor: Print unknown gmm usage type value
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-05 10:34:40 +01:00
Compute-Runtime-Validation
98d1d629a8
Revert "feature: Enable defer backing by default"
...
This reverts commit fffe8f8027
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-05 03:35:03 +01:00
Compute-Runtime-Validation
3414abe74a
Revert "fix: make global fence always resident on linux"
...
This reverts commit efb814d979
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-05 02:07:38 +01:00
Zbigniew Zdanowicz
27d7d72033
feature: add pipeline state management for append command list operation
...
- command list append state is managed from internal queue and can be skipped
- initial state configuration should be processed by both kernel and non-kernel
- only kernel operation can process required state, as non-kernel cannot change
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-04 20:43:22 +01:00
Szymon Morek
ff4da67979
fix: signal notify field before KMD wait
...
Related-To: NEO-13870
Currently all monitor fences are triggering
interrupt due to Notify Enable field.
With this change, such field is programmed
right before KMD wait.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-04 15:19:16 +01:00
Bellekallu Rajkiran
fffe8f8027
feature: Enable defer backing by default
...
Related-To: NEO-13403
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2025-03-04 14:29:53 +01:00
Szymon Morek
efb814d979
fix: make global fence always resident on linux
...
Related-To: NEO-13843
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-04 12:32:53 +01:00
Chuansheng Liu
1876a43024
performance: add low latency hint for i915
...
i915 has provided the low latency hint interface, it helps
improve the performance for light workload a lot.
Related-To: NEO-14250
Signed-off-by: Chuansheng Liu <chuansheng.liu@intel.com>
2025-03-04 11:14:04 +01:00
Zbigniew Zdanowicz
db99c25c79
feature: add support to dispatch epilogue commands into dedicated stream
...
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-03 14:39:34 +01:00
Jaroslaw Warchulski
8814b6ac4f
fix: do not prefer image compression on xe_lpg for linux and WSL
...
Related-To: HSD-18034872015
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-02-28 14:20:57 +01:00
Zbigniew Zdanowicz
08b13750a1
fix: set stall cmd flag for bcs flush task count flag
...
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-02-28 12:24:51 +01:00
Zbigniew Zdanowicz
cae3bb1d0a
feature: add internal interfaces to manage all dispatch models of command lists
...
- add new enum type for command list flush from immediate
- add new argument for flushing immediate command list - regular command list
- add capability to provide additional stream for epilogue commands
- add pointer to provide external csr mutex to lock both execution and flush
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-02-28 12:02:22 +01:00
Lukasz Jobczyk
b7cba510a3
fix: Do not increase host USM alignment when CAL enabled
...
Resolves: GSD-10808
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-28 10:10:42 +01:00
Andrzej Koska
021723e154
fix: Removal of unnecessary offsets during blitter copying
...
When copying with the blitter and setting ArrarIndex to zero, additional
offsets from the origin are no longer necessary
Related-To: NEO-14171
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2025-02-28 09:32:19 +01:00
Katarzyna Cencelewska
0f48dd465f
feature: add debug print with global memory size
...
during check if there is space for scratch available
Related-To: HSD-18041185560
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2025-02-27 23:11:19 +01:00
Kamil Kopryk
4503972faf
refactor: add adjustSamplerStateBorderColor function
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-02-27 14:05:09 +01:00
Jaroslaw Warchulski
837d6f5ad1
fix: do not enable compression on xe_lpg for linux and WSL
...
Resolves: HSD-18034872015
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-02-27 11:12:23 +01:00
Compute-Runtime-Validation
c02dc8208c
Revert "fix: do not enable compression on xe_lpg"
...
This reverts commit a6abda83a4
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-27 03:15:32 +01:00
Lukasz Jobczyk
20d29207cd
refactor: Allow debug key to force USM cleaner with ULLS light
...
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-26 17:52:18 +01:00
Vysochyn, Illia
1eafbc30f8
feature: Enable eu thread scheduling mode override via IDD
...
Enables eu thread scheduling mode override via the
INTERFACE_DESCRIPTOR_DATA and COMPUTE_WALKER.
Defines encodeEuSchedulingPolicy operating on INTERFACE_DESCRIPTOR_DATA
or Xe3 platform.
Adds tests verifying that thread scheduling mode can be overridden via
COMPUTE_WALKER.
Related-To: NEO-13771, HSD-18041256338, HSD-14013056398
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-02-26 17:04:43 +01:00
Wojciech Konior
7f81d0e3e8
fix: vmbind of i915 canonization check
...
Related-To: NEO-7313
Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2025-02-25 19:00:31 +01:00
Jaroslaw Warchulski
a6abda83a4
fix: do not enable compression on xe_lpg
...
Resolves: HSD-18034872015
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-02-25 18:05:56 +01:00
Mateusz Jablonski
374863ba08
refactor: remove cl pipe related logic
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-25 17:56:53 +01:00
Jitendra Sharma
d480cc093c
feature: Add method to determine if Sip should be get from external library
...
Related-To: NEO-13735
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2025-02-25 15:14:54 +01:00
Mateusz Hoppe
291745cdf7
feature: add support for ImplicitArgs versioning
...
- define ImplicitArgs version 1
Related-To: NEO-14115
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-02-25 10:27:11 +01:00
Slawomir Milczarek
0a7ed9f9f4
feature: Add ioctl helpers for primary contexts
...
Related-To: NEO-12952
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-02-24 17:37:05 +01:00
Kamil Kopryk
7c3468794e
fix: add option to enable/disable heapless in ocloc
...
Related-To: GSD-10681
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-02-24 17:06:26 +01:00
Lukasz Jobczyk
6dd948a4b3
performance: Pass residency container as const ptr
...
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-24 15:18:55 +01:00
Compute-Runtime-Validation
f907c7dcb2
Revert "performance: set 1ms timeout for ulls controller since xe2"
...
This reverts commit 28c008e438
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-24 15:04:54 +01:00
Lukasz Jobczyk
be946ae56c
performance: Optimize make resident for ULLS light
...
Do not check if ULLS light is active during every Csr::makeResident
call. Store that information once during ULLS init.
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-24 12:59:56 +01:00
Mateusz Jablonski
50e22ef5e7
refactor: remove device enqueue specific code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-24 12:37:51 +01:00
ocldev
fd2a6b55c2
feature: brand string update
...
Related-To: NEO-7929
Updating device brand strings
Based on public support in Windows driver 32.0.101.6559
https://www.intel.com/content/www/us/en/download/785597/
Signed-off-by: ocldev <ocldev@intel.com>
2025-02-24 12:21:52 +01:00
Compute-Runtime-Validation
f64fd7fc42
Revert "performance: disable cpu caching for buffers on PTL"
...
This reverts commit 681ef3693a
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-24 10:07:15 +01:00
Lukasz Jobczyk
1f7c67ea8d
performance: Enable ULLS light for ARLH
...
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-21 18:00:46 +01:00
Compute-Runtime-Validation
66c4288494
Revert "fix: using canonized addr in xe kmd"
...
This reverts commit 683d7c7acf
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-21 17:49:05 +01:00
Lukasz Jobczyk
91c07e24f6
performance: Enable ULLS light for MTLH
...
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-21 16:52:02 +01:00
Lukasz Jobczyk
cad776412b
performance: Enable ULLS light for ARLS, ARLU, MTLU
...
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-21 16:10:18 +01:00
Wojciech Konior
683d7c7acf
fix: using canonized addr in xe kmd
...
Related-To: NEO-7313
Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2025-02-21 16:06:06 +01:00
Lukasz Jobczyk
5961850bbc
fix: blink ULLS light
...
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-21 13:31:35 +01:00
Szymon Morek
681ef3693a
performance: disable cpu caching for buffers on PTL
...
Related-To: NEO-14168
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-21 13:07:04 +01:00
Compute-Runtime-Validation
a929551cb2
Revert "performance: Enable ULLS light for ARLS, ARLU, MTLU"
...
This reverts commit 0b6b51fa19
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-21 09:35:00 +01:00
Szymon Morek
01a66a9347
fix: download staging buffer before reading from it
...
Related-To: NEO-14154
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-21 09:30:23 +01:00
Compute-Runtime-Validation
dda4beb719
Revert "performance: enable Direct Submission on LNL Linux"
...
This reverts commit b7d21b135c
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-21 04:13:28 +01:00
Bartosz Dunajski
85b0d1f84d
fix: clear all TS packets for combined CB event
...
Related-To: NEO-13971
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-02-20 19:24:15 +01:00
Mateusz Jablonski
3e45cf2a6c
feature: add debug key to expose L0 DDI Handles extension
...
Related-To: NEO-13121, NEO-13917
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-20 17:24:33 +01:00
Mateusz Jablonski
70f3342c2d
refactor: remove vme specific code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-20 17:08:52 +01:00
Filip Hazubski
794b21a3fa
refactor: Add extra parameters to computeSlmValues function
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-20 15:35:52 +01:00
Radoslaw Jablonski
b7d21b135c
performance: enable Direct Submission on LNL Linux
...
Related-To: NEO-9004
Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com>
2025-02-20 15:22:44 +01:00
Mateusz Jablonski
7865509aa0
feature: enable WMTP on BMG
...
Related-To: NEO-12814
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-20 12:52:40 +01:00
Compute-Runtime-Validation
0996247dfd
Revert "performance: Enable ULLS light for MTLH"
...
This reverts commit 7a62f27d8b
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-20 11:11:51 +01:00
Bartosz Dunajski
a0a735804f
feature: disable counter caching for external storage Events
...
Related-To: NEO-13971
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-02-20 11:11:29 +01:00
Fabian Zwoliński
bf20ae7ae8
fix: configure ISA Pool params based on productHelper
...
When is2MBLocalMemAlignmentEnabled returns true,
increase pool size for builtins from 64k to 2MB.
Additionally, set appropriate alignment for kernel ISA heap allocations.
Additionally, configure isaAllocationPageSize based on productHelper
Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-02-20 08:42:35 +01:00
Compute-Runtime-Validation
29b3cb3c85
Revert "performance: Enable ULLS light for ARLH"
...
This reverts commit 20d00f53eb
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-20 03:54:49 +01:00
Maciej Plewka
8ba43065c5
fix: handle literal raw strings correctly in printf formater
...
Related-To: NEO-14138
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-02-19 17:14:54 +01:00
Lukasz Jobczyk
7a62f27d8b
performance: Enable ULLS light for MTLH
...
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-19 16:25:08 +01:00
Lukasz Jobczyk
20d00f53eb
performance: Enable ULLS light for ARLH
...
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-19 15:55:07 +01:00
Lukasz Jobczyk
0b6b51fa19
performance: Enable ULLS light for ARLS, ARLU, MTLU
...
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-19 15:47:50 +01:00
Lukasz Jobczyk
42261526e4
refactor: Handle dlopen error
...
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-19 15:28:48 +01:00
Szymon Morek
28c008e438
performance: set 1ms timeout for ulls controller since xe2
...
Related-To: NEO-13843
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-19 13:48:26 +01:00
Lukasz Jobczyk
b0e3d8656e
refactor: Add OV detection mechanism for ULLS light
...
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-19 13:28:29 +01:00
Filip Hazubski
b60c02d597
fix: Add asserts to ensure NonCopyable and NonMovable n/n
...
Related-To: NEO-14068
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-19 11:36:24 +01:00
Jitendra Sharma
2daca51bd9
build: update eudebug headers
...
Related-To: NEO-14069
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2025-02-19 09:22:15 +01:00
Filip Hazubski
cff9ea3966
performance: Use std::move when initializing RelocationInfo
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 20:47:52 +01:00
Filip Hazubski
4be1153253
fix: Remove pragma once from inl files
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 20:19:15 +01:00
Vysochyn, Illia
e59f61b02e
feature: Update INTERFACE_DESCRIPTOR_DATA for Xe3
...
Related-To: NEO-13771, HSD-18041256338, HSD-14013056398
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-02-18 18:03:13 +01:00
Filip Hazubski
6b6202446b
fix: Add asserts to ensure NonCopyable and NonMovable 3/n
...
Related-To: NEO-14068
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 17:16:03 +01:00
Bellekallu Rajkiran
ff9d0d7ea0
fix: fix setting up ccs mode
...
Related-To: NEO-14122
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2025-02-18 17:10:51 +01:00
Chodor, Jaroslaw
8f43be8800
refactor: Enforce constexpr evaluation on getNonReleaseKeyName
...
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-02-18 16:03:41 +01:00
Vysochyn, Illia
42e06790da
feature: Update COMPUTE_WALKER for Xe3
...
Related-To: NEO-13771, HSD-18041256338, HSD-14013056398
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-02-18 15:53:31 +01:00
Bartosz Dunajski
c1f2ff1ad6
fix: disable batched dispatch mode in aub csr
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-02-18 13:45:15 +01:00
Filip Hazubski
c651209617
fix: Add asserts to ensure NonCopyable and NonMovable 2/n
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 13:08:00 +01:00
Filip Hazubski
6b2b42972a
fix: Add asserts to ensure NonCopyable and NonMovable 1/n
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 09:41:20 +01:00