Commit Graph

7044 Commits

Author SHA1 Message Date
Chodor, Jaroslaw 7744251995 refactor: explicit copy/move operations in OpaqueElementAllocator
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-16 10:10:05 +02:00
Oskar Hubert Weber 38a3667304 fix: avoid joining thread in deferred deleter
Joining a thread from DllMain (which we are not aware of)
can cause a hang in DeferredDeleter: if the library is
freed before the FreeLibraryAndExitThread call from within the worker thread,
the worker gets stuck, and the main thread then blocks on worker->join().

Related-To: NEO-14121

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2025-04-16 10:02:10 +02:00
Mateusz Jablonski f6b92aa5d5 build: update default PTL revision to B0
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-16 09:44:58 +02:00
Tomasz Biernacik eff2d1cde8 performance: change usage for semaphoreBuffer on integrated devices
Related-To: NEO-9421

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-04-16 09:07:05 +02:00
Chandio, Bibrak Qamar bed2ad1ac0 feature: memAdvise for System Allocator
Related-To: NEO-13715

Implements basic functionality for zeCommandListAppendMemoryAdvise for
system allocator

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-04-16 07:11:14 +02:00
Chodor, Jaroslaw 2ce694c52b feature: Enabling support for redirecting FCL to IGC for OCLOC
This allows for invoking IGC as OCL C translator in OCLOC paths

Accepted debug env :
NEO_OCLOC_UseIgcAsFcl=1 ; forces IGC
NEO_OCLOC_UseIgcAsFcl=2 ; forces FCL
NEO_OCLOC_UseIgcAsFcl=0 ; use defaults

Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-15 16:44:35 +02:00
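The accepted values listed in this commit message can be illustrated with a small sketch. It only shows how the three documented values (0, 1, 2) might map to a translator choice. The enum, both function names, and the fallback for missing or unrecognized values are assumptions made for illustration, not code from the repository.

```cpp
#include <cstdlib>
#include <string>

// Hypothetical enum; the names are illustrative, not taken from the NEO sources.
enum class OclCTranslator { UseDefault, ForceIgc, ForceFcl };

// Maps the documented values ("0", "1", "2") to a translator choice.
// Assumption: a missing or unrecognized value falls back to the default.
inline OclCTranslator parseUseIgcAsFcl(const char *value) {
    if (value == nullptr) {
        return OclCTranslator::UseDefault;
    }
    const std::string text(value);
    if (text == "1") {
        return OclCTranslator::ForceIgc;
    }
    if (text == "2") {
        return OclCTranslator::ForceFcl;
    }
    return OclCTranslator::UseDefault;
}

// Reads the variable named in the commit message from the process environment.
inline OclCTranslator translatorFromEnvironment() {
    return parseUseIgcAsFcl(std::getenv("NEO_OCLOC_UseIgcAsFcl"));
}
```

With this sketch, setting NEO_OCLOC_UseIgcAsFcl=1 before the call would make translatorFromEnvironment() return OclCTranslator::ForceIgc.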
Tomasz Biernacik 5bf8b66498 feature: disable getting fd from vmExport
Related-To: NEO-12680


Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-04-15 16:08:40 +02:00
Maciej Plewka b8be602bfb fix: Return max wg count equal one on platforms without dispatch all support
Related-To: NEO-14125, GSD-10725
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-04-15 15:41:23 +02:00
Lukasz Jobczyk 06e375fdc6 refactor: Add debug break for partial make resident
Resolves: NEO-13711

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-15 14:59:28 +02:00
Lukasz Jobczyk f7939735da refactor: Adjust USM cleaner to ULLS light
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-15 14:05:02 +02:00
Lukasz Jobczyk 455209aadc refactor: Simplify fence selection in ULLS
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-15 14:04:47 +02:00
Marcel Skierkowski 31f0fd4672 refactor: rename variable slmSize
Max programmable slm size is stored in RuntimeCapabilityTable as slmSize.
That is a misleading name.
Rename the variable slmSize to better reflect the actual meaning of the member.

Related-To: NEO-12949
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-14 20:00:29 +02:00
Chodor, Jaroslaw 7bded13638 feature: Enabling support for redirecting FCL to IGC for OCL
This allows for invoking IGC as OCL C translator in OpenCL paths

Accepted debug env :
UseIgcAsFcl=1 ; forces IGC
UseIgcAsFcl=2 ; forces FCL
UseIgcAsFcl=0 ; uses defaults

Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-14 18:41:48 +02:00
Jaroslaw Warchulski 3e1aa33924 refactor: cleanup headers
Related-To: NEO-5548
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-14 14:59:40 +02:00
Maciej Bielski 6ecf5b373e feature: exit with warning if small bar detected on Linux
Return early and provide justification; otherwise a SIGBUS error is
raised.

Related-To: NEO-14506
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-04-14 14:43:25 +02:00
Marcel Skierkowski e82be94368 refactor: Modernize writeDataToFile function
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-14 14:28:58 +02:00
Kamil Kopryk dd3d294f87 performance: cache MOCS values
This change caches the most used MOCS values:

* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CONST);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CACHELINE_MISALIGNED);
inside gmmHelper class during initialization to avoid repeated
calls of virtual functions, branches and/or gmm lib access.

and adds more readable corresponding getters:
* getL1EnabledMOCS
* getL3EnabledMOCS
* getUncachedMOCS

If force all resources uncached is called,
these 3 cached mocs values are reinitialized.

It also changes the order of gmmHelper members to avoid
unneeded padding after addressWidth
and simplifies logic in getMocsIndex function
for xehp and later products.

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-04-14 14:12:48 +02:00
Tomasz Biernacik 194c3058b8 performance: change usage for ringBuffer on integrated devices
Related-To: NEO-9421

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-04-14 12:25:38 +02:00
Chodor, Jaroslaw a079a2654d feature: Enabling support for DisableDeepBind debug env in ocloc
Accepted debug env :
NEO_OCLOC_DisableDeepBind=1

Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-12 23:21:19 +02:00
Tomasz Biernacik 18435df04c fix: unify deferring MOCS to PAT
Related-To: NEO-14643

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-04-11 20:10:25 +02:00
Chodor, Jaroslaw 72ae27d51e feature: Enabling debug variables reader in ocloc
This allows for using debug variables in ocloc with
a few by-design limitations :
* debug var needs to be prefixed with NEO_OCLOC_
(when setting-up at runtime as environment variable
or using .config file)
* debug var needs to be tagged with S_OCLOC scope
(at compile time, in debug_variable .inl files)

Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-11 19:58:53 +02:00
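The two limitations listed in this commit message (the NEO_OCLOC_ prefix and the S_OCLOC scope tag) can be sketched as a single lookup rule. Only the NEO_OCLOC_ prefix and the S_OCLOC name come from the message. The scope bitmask layout, the S_RT name, the descriptor struct, and the function are hypothetical.

```cpp
#include <cstdint>
#include <string>

// Hypothetical scope bitmask; only the S_OCLOC name appears in the commit message.
enum DebugVarScope : uint32_t {
    S_NONE = 0u,
    S_RT = 1u << 0,   // assumed name for a runtime-only scope
    S_OCLOC = 1u << 1 // scope tag required for ocloc
};

// Hypothetical compile-time description of one debug variable.
struct DebugVarDescriptor {
    std::string name;
    uint32_t scopeMask;
};

// Returns the environment variable name ocloc would read for this variable,
// or an empty string when the variable is not tagged with S_OCLOC.
inline std::string oclocEnvName(const DebugVarDescriptor &var) {
    if ((var.scopeMask & S_OCLOC) == 0u) {
        return {};
    }
    return "NEO_OCLOC_" + var.name;
}
```

A variable tagged only with the assumed runtime scope yields an empty name here, which models ocloc ignoring it.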
Chodor, Jaroslaw 9e14eaaed1 feature: Adding support for scoped debug variables
This allows for limiting debug variables to specific
components.

Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-11 17:52:17 +02:00
Mateusz Hoppe 3204411aca refactor: use deviceBitfield from CSR when creating engine
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-04-11 17:30:29 +02:00
Szymon Morek 5b20450162 performance: enable compression on PTL
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-11 17:26:29 +02:00
Igor Venevtsev ef52479ce4 feature: add FillImage1dBuffer built-in kernel
Resolves: NEO-13984, HSD-18041422852

Signed-off-by: Igor Venevtsev <igor.venevtsev@intel.com>
2025-04-10 14:04:13 +02:00
Marcel Skierkowski 5d01677454 refactor: Mock file system in ULTs
The functions fileExists and loadDataToFile
use IO functions from namespace IoFunctions.

Tests that use these functions
are now mocked by default,
but some still require access to real files
and have had the ability to read files restored.
They will be mocked in subsequent PRs.

Related-To: NEO-7006
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-09 19:51:46 +02:00
Maciej Plewka 286acf30a3 fix: Dispatch residency fence when stopping ulls on windows
Related-To: NEO-14396, HSD-18041496023, HSD-13012953666, NEO-14611
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-04-09 19:51:16 +02:00
Szymon Morek f4815a9b69 performance: enable ULLS on LNL Linux
Related-To: NEO-9004

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-09 14:45:46 +02:00
Lukasz Jobczyk 1b5519a880 fix: Adjust standard heaps when on 57 bit address space
Resolves: GSD-10871

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-09 12:51:03 +02:00
Fabian Zwoliński 363aa8cdbd performance: align local memory Tag allocations to 2MB
Align local memory allocations of tag types to 2MB when
2MB alignment is enabled via the product helper
(is2MBLocalMemAlignmentEnabled flag).

Refactored the allocateGraphicsMemoryInDevicePool function to improve
readability and maintainability. Simplified the logic for
determining base size and final alignment by reducing redundant code.

Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-09 09:59:44 +02:00
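The alignment step described in this commit message can be sketched as a round-up to a power-of-two boundary. The helper names and the 64KB fallback alignment are assumptions for illustration. Only the 2MB value and the existence of an enabling flag come from the message.

```cpp
#include <cstddef>

constexpr std::size_t twoMegabytes = 2u * 1024u * 1024u;

// Rounds size up to the next multiple of alignment.
// The alignment must be a power of two for the mask trick to be valid.
constexpr std::size_t alignUp(std::size_t size, std::size_t alignment) {
    return (size + alignment - 1) & ~(alignment - 1);
}

// Hypothetical helper: picks the final size depending on whether the product
// enables 2MB alignment. The 64KB fallback is an assumption.
constexpr std::size_t alignedTagAllocationSize(std::size_t size, bool is2MBAlignmentEnabled) {
    constexpr std::size_t fallbackAlignment = 64u * 1024u;
    return alignUp(size, is2MBAlignmentEnabled ? twoMegabytes : fallbackAlignment);
}
```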
Aravind Gopalakrishnan c7fe7dde90 fix: Correct Xe sysfs paths for ccs mode setting
Fix the sysfs paths for Xe KMD

Related-To: NEO-10378

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-04-09 06:51:18 +02:00
Maciej Plewka bb36614567 fix: add separate counter for tracking ring buffer completion
Related-To: NEO-14396, NEO-9548
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-04-08 16:30:44 +02:00
Jaroslaw Warchulski fb5da0c3e5 fix: do not forbid compression for xe_lpg Windows and DG2
Related-To: NEO-14583, NEO-14593
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-08 16:08:59 +02:00
Marcel Skierkowski b75fbe8e2c refactor: mock filesystem in ocloc ult pt.1
Mocked IO operations in ./ocloc_tests application

Mocked gtest stdout capture in ocloc tests

Related-To: NEO-14084
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-08 16:08:42 +02:00
Mateusz Jablonski 1c377dc930 fix: ensure payload arguments are patched before walker command is fetched
In case of indirect kernel launch, some payload arguments are patched
just before the walker command. This change disables prefetch, performs
a batch buffer start to the next bytes, and then re-enables prefetch. All these
operations are performed between MI_STORE_REGISTER_MEM and COMPUTE_WALKER.

Related-To: NEO-14584
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-08 11:57:45 +02:00
Lukasz Jobczyk ca45573dd3 fix: Handle new ring buffer residency when switch ULLS light ring
Related-To: NEO-14406, NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-08 10:12:57 +02:00
Szymon Morek ff1dcc1a8a performance: adjust copy from image to host logic
Related-To: NEO-14569

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-08 10:02:00 +02:00
Brandon Yates c4937991cd feature: Add l0DebuggerEnabled to WalkerArgs
Related-to: NEO-12993

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-04-08 03:53:38 +02:00
Vysochyn, Illia 4c580c0a83 feature: Update COMPUTE_WALKER instruction
Fixes incorrect RawData array length.

Related-To: NEO-14588

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-04-08 00:01:43 +02:00
Young Jin Yoon 0de024dbf9 refactor: separate post-sync into EncodePostSync
Created EncodePostSync template struct to organize various post-sync
variables/functions from EncodeDispatchKernel

Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-04-07 20:06:14 +02:00
Lukasz Jobczyk 5bcf500c13 fix: Extend ULLS light mutex range
If some operations require ULLS light stop, execute such operations
under mutex in pair with ULLS stop to ensure no other thread will start
ULLS.

Related-To: NEO-14406, NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-07 17:35:04 +02:00
Vysochyn, Illia 146a7f7b9b feature: Add support for BUFFER_SIZE explicit argument
Related-To: NEO-13972

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-04-07 14:15:26 +02:00
Lukasz Jobczyk 8978ea5e5a performance: Do not create global fence allocation on integrated
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-07 11:22:04 +02:00
Jaroslaw Warchulski df07897144 fix: forbid compression for pre-xe2 platforms
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-05 00:15:16 +02:00
Compute-Runtime-Validation f332571d96 Revert "performance: Do not create global fence allocation on integrated"
This reverts commit ecf8a07d26.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-04 16:26:19 +02:00
Dominik Dabek bd516b3552 fix: usm reuse, clean from largest
When trimming old allocations in usm reuse start from largest
allocations.
This will reduce memory usage more quickly once max hold time is hit.

Related-To: NEO-6893, NEO-14429

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-04-04 14:57:15 +02:00
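The trimming order described in this commit message can be sketched as follows: sort cached allocations by size, largest first, then release those held longer than the maximum hold time. The struct, the function, and the bytesToFree stop condition are hypothetical. The real cleaner may use different bookkeeping.

```cpp
#include <algorithm>
#include <chrono>
#include <cstddef>
#include <vector>

// Hypothetical record for one allocation kept for reuse.
struct CachedAllocation {
    std::size_t size;
    std::chrono::steady_clock::time_point savedAt;
};

// Removes allocations held for at least maxHoldTime, visiting the largest first,
// and stops once bytesToFree has been reclaimed. Returns the bytes released.
inline std::size_t trimFromLargest(std::vector<CachedAllocation> &cache,
                                   std::chrono::steady_clock::time_point now,
                                   std::chrono::milliseconds maxHoldTime,
                                   std::size_t bytesToFree) {
    std::sort(cache.begin(), cache.end(),
              [](const CachedAllocation &a, const CachedAllocation &b) { return a.size > b.size; });
    std::size_t freed = 0;
    for (auto it = cache.begin(); it != cache.end() && freed < bytesToFree;) {
        if (now - it->savedAt >= maxHoldTime) {
            freed += it->size; // a real implementation would release the memory here
            it = cache.erase(it);
        } else {
            ++it;
        }
    }
    return freed;
}
```

Because large entries are visited first, a memory target is typically reached after fewer releases than with an oldest-first or arbitrary order.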
Lukasz Jobczyk ecf8a07d26 performance: Do not create global fence allocation on integrated
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-04 11:45:22 +02:00
Dominik Dabek 3703ff550c fix: use real size when putting into usm reuse
Real allocation size should be used to properly apply limits and allow
more usm reuse hits.

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-04-04 09:44:32 +02:00
Bartosz Dunajski f99870e716 fix: improve media handling 2
Related-To: NEO-14462

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-04-03 19:11:53 +02:00
Mateusz Jablonski bb518adf34 fix: patching payload arguments in inline data in case of indirect kernel
Related-To: NEO-14532
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-03 17:21:28 +02:00
Szymon Morek 95e0244f70 fix: properly pass info about 3D image
Related-To: NEO-14538

It's valid for 3D image to copy 2D region.
Current checks for mip map do not consider that.
This change correctly checks for mip mapped 3D image.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 16:33:57 +02:00
Slawomir Milczarek 7e7e0a000f refactor: Add ioctl helper for context destruction
Related-To: NEO-11817

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-04-03 16:08:53 +02:00
Dominik Dabek be27367020 performance: usm reuse, avoid looking up svmData
Save svmData on putting into reuse, instead of searching each time.
Change UNRECOVERABLE_IF to DEBUG_BREAK_IF.

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-04-03 15:50:49 +02:00
Jaroslaw Warchulski c010d17842 fix: respect compression flag in capability table
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-03 15:36:55 +02:00
Szymon Morek 6ea83f322d fix: do not override user data beyond slice region
Related-To: NEO-14538

If the user passes a slice pitch that is larger than the region
to copy, do not overwrite memory that lies beyond the region but within
that slice pitch.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 12:37:35 +02:00
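The rule in this commit message can be illustrated with a row-by-row copy that writes only the bytes belonging to the region and leaves the padding implied by the row and slice pitches untouched. This is a host-side sketch, not the driver's actual copy path. The function name and the tightly packed source layout are assumptions.

```cpp
#include <cstddef>
#include <cstring>

// Copies `slices` slices of `rows` rows, each `rowBytes` long, into dst.
// Assumption: src is tightly packed, while dst uses the user-provided pitches.
// Bytes between the end of a row or slice and the next pitch boundary are never written.
inline void copyRegion(unsigned char *dst, const unsigned char *src,
                       std::size_t rowBytes, std::size_t rows, std::size_t slices,
                       std::size_t dstRowPitch, std::size_t dstSlicePitch) {
    for (std::size_t z = 0; z < slices; ++z) {
        for (std::size_t y = 0; y < rows; ++y) {
            std::memcpy(dst + z * dstSlicePitch + y * dstRowPitch,
                        src + (z * rows + y) * rowBytes,
                        rowBytes);
        }
    }
}
```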
Jaroslaw Warchulski 62baf28316 fix: remove unnecessary WA for DG2 compression
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-03 08:04:19 +02:00
Brandon Yates 4651e72b0b fix: Fail device init if kernel debugging is misconfigured
Also print error to stderr

Related-to: GSD-10780

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-04-02 21:06:30 +02:00
Filip Hazubski 504440fc4d feature: Add ftrHeaplessMode flag
Pass hwInfo to isHeaplessModeEnabled and isForceBindlessRequired functions.

Related-To: NEO-14526

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-04-02 21:06:05 +02:00
Lukasz Jobczyk deca36fd32 fix: Stop ULLS light when evict resource
Related-To: NEO-14406

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-02 16:37:43 +02:00
Bartosz Dunajski bb3927531e refactor: reduce HWTEST2_F usage in test_in_order_cmdlist_1.cpp
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-04-02 14:29:12 +02:00
Fabian Zwoliński 7ef3880793 feature: implement pool allocator for gpuTimestampDeviceBuffer
The patch applies to Level Zero.
Only allocations < 2MB will be fetched from the pool.
Allocations are shared and reused within a given device.

Additionally, I added a new debug flag to control the allocator:
EnableTimestampPoolAllocator

Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-02 14:28:56 +02:00
Szymon Morek 8836f6df0b fix: forward mip map level for 3D images
Related-To: NEO-14539

1D and 2D images already have the mip map level set correctly.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-02 11:46:41 +02:00
Aravind Gopalakrishnan 3a7d7e022c fix: Add platform support for reservation on svm heap
Related-To: GSD-10816

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-04-02 02:46:30 +02:00
Brandon Yates a48d66ad75 feature: Add programExceptions stub to CSR
Related-to: NEO-12967

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-04-01 18:33:40 +02:00
Szymon Morek 3010af596e performance: add infrastructure for staging with 3D images
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-01 15:30:30 +02:00
Mateusz Jablonski ed37a1e7ef build: remove not needed flag for builtins compilation
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-01 14:18:49 +02:00
Vysochyn, Illia 70af2bc20b refactor: Adjust size to preferred SLM values array
Related-To: NEO-14479

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-04-01 11:56:50 +02:00
Chandio, Bibrak Qamar 2ba2970492 performance: Waiting on make resident Windows
Related-To: NEO-14056

No need to explicitly wait on Windows KMD during make resident as it has
a while loop that does it nevertheless. The KMD wait affects the API
overhead of zeCommandQueueExecuteCommandLists on some platforms (MTL and ARL).

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-04-01 00:12:45 +02:00
Lukasz Jobczyk 0a11a96a53 refactor: Add dedicated method to check if any ULLS light enabled
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-31 16:36:20 +02:00
Andrzej Koska e3e01e94a0 Revert "performance: enable Direct Submission on LNL Linux"
This reverts commit cb3b4d326d.

Related-To: NEO-14517, NEO-9004

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2025-03-31 15:22:29 +02:00
Szymon Morek 62964a0b08 fix: invalidate caches when heap is placed into reuse list
Related-To: NEO-9004

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-31 12:30:29 +02:00
Bartosz Dunajski 831b488685 fix: improve media engine handling
Related-To: NEO-14462

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-31 10:40:21 +02:00
Maciej Plewka a5e19330e9 fix: lock csr before locking residency controller in trim to budget path
Lock on csr is needed before lock on residency controller to prevent
incorrect lock order. Csr may be locked in waitOnCpu called from trimToBudget,
which may lead to deadlocks

Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-28 16:18:20 +01:00
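The lock ordering described in this commit message can be sketched with two stub types. Every path takes the CSR lock before the residency controller lock, so a nested wait on the CSR cannot invert the order. All type and member names are hypothetical, and the use of a recursive mutex for the CSR is an assumption made so that the nested call can re-acquire it.

```cpp
#include <mutex>

// Hypothetical stand-in for a command stream receiver.
struct CommandStreamReceiverStub {
    std::recursive_mutex mutex; // assumed recursive so a nested wait can re-acquire it
    int waitCalls = 0;

    void waitOnCpu() {
        std::lock_guard<std::recursive_mutex> lock(mutex);
        ++waitCalls;
    }
};

// Hypothetical stand-in for a residency controller.
struct ResidencyControllerStub {
    std::mutex mutex;
    int trimmed = 0;
};

// Acquires the CSR lock first, then the residency controller lock.
// The nested waitOnCpu() call therefore never waits on a lock held in the opposite order.
inline void trimToBudget(CommandStreamReceiverStub &csr, ResidencyControllerStub &controller) {
    std::lock_guard<std::recursive_mutex> csrLock(csr.mutex);
    std::lock_guard<std::mutex> residencyLock(controller.mutex);
    csr.waitOnCpu();
    ++controller.trimmed;
}
```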
Szymon Morek 3fff3dd77b fix: set misaligned source memory 1-way coherent
Related-To: NEO-14443

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-28 14:16:45 +01:00
Filip Hazubski 3d9fc8968e fix: Add BMG device id
Add device ID: 0xE211

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-03-28 13:40:30 +01:00
Katarzyna Cencelewska 92e40afc49 feature: update debug flag DirectSubmissionPrintSemaphoreUsage
Instead of printf, use a macro that flushes after printing.

Related-To: HSD-14024170600
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2025-03-28 13:36:15 +01:00
Compute-Runtime-Validation 88a48f1c5b Revert "performance: Improve ULLS light residency management"
This reverts commit 35eae3f977.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-28 11:21:27 +01:00
Mateusz Hoppe c105c77930 fix: calculation of os context count
Related-To: NEO-12952

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-03-28 09:55:17 +01:00
Andrzej Koska cb3b4d326d performance: enable Direct Submission on LNL Linux
Related-To: NEO-9004

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2025-03-27 17:09:54 +01:00
Lukasz Jobczyk b43b23b6ed fix: Init wait utils after hwInfo init for both OS
Resolves: HSD-18041922513

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-27 16:45:22 +01:00
Szymon Morek ead0842763 feature: add L0 API to query kernel argument info
Related-To: NEO-14358

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-27 16:43:33 +01:00
Bartosz Dunajski 85f2734ca4 fix: correct gt_id to tile_id engine mapping
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-27 15:57:57 +01:00
Dominik Dabek c76edaba4e fix: enable usm reuse limit based on memory usage
Related-To: NEO-14160, NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-27 15:14:08 +01:00
Dominik Dabek 915d657420 fix: flag to limit usm reuse based on memory usage
Host usm and device usm for igfx check system memory usage.
Device usm for dgfx checks local memory usage.

If used memory is above limit threshold:
- no new allocations will be saved for reuse
- cleaner will use shorter hold time of 2 seconds
- cleaner will free all eligible allocations, regardless of async
deleter thread having work

Motivation: when gfx memory is full, making new allocations resident
requires evictions, which lead to a massive slowdown on
enqueue calls.
This change aims to minimize cases where extra memory usage from the usm
reuse mechanism leads to the above situation.

Related-To: NEO-6893, NEO-14160

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-27 10:25:19 +01:00
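The threshold behavior listed in this commit message can be sketched as a small policy function. The 2-second hold time and the three effects come from the message. The struct, the function name, and the idea of passing the default hold time as a parameter are assumptions.

```cpp
#include <chrono>
#include <cstdint>

// Hypothetical summary of how the reuse mechanism should behave.
struct ReusePolicy {
    bool saveNewAllocations;       // whether freed allocations may be kept for reuse
    std::chrono::seconds holdTime; // how long the cleaner keeps an allocation
    bool freeAllEligible;          // free everything eligible, even if the async deleter has work
};

// Above the limit threshold: stop saving, shorten the hold time to 2 seconds,
// and free all eligible allocations. Otherwise use the caller-supplied default.
inline ReusePolicy selectReusePolicy(uint64_t usedMemory, uint64_t limitThreshold,
                                     std::chrono::seconds defaultHoldTime) {
    if (usedMemory > limitThreshold) {
        return {false, std::chrono::seconds(2), true};
    }
    return {true, defaultHoldTime, false};
}
```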
Jack Myers 0aa2c4f0cb feature: allow removal of heapful code paths
Related-To: NEO-13007

Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-03-27 01:34:35 +01:00
Damian Tomczak 0243004907 feature: additional checkers to enable feature
Resolves: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-26 18:06:20 +01:00
Mateusz Jablonski 4bc13fa0dc fix: correct MetricsLibraryGenId for Xe3
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-03-26 16:35:10 +01:00
Lukasz Jobczyk 60b551758c performance: Adjust waitpkg threshold for discrete devices
Related-To: NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-26 14:59:19 +01:00
Compute-Runtime-Validation a89113fa1a Revert "fix: Fail device init if kernel debugging is misconfigured"
This reverts commit c122bc51f9.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-26 13:27:12 +01:00
Compute-Runtime-Validation 214277125d Revert "performance: enable ULLS on LNL Linux"
This reverts commit a15de0c4f6.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-26 13:26:57 +01:00
Jaroslaw Warchulski eeeed9edd8 refactor: cleanup in product helpers
Related-To: NEO-12681
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-03-25 21:07:55 +01:00
Brandon Yates c122bc51f9 fix: Fail device init if kernel debugging is misconfigured
Also print error to stderr

Related-to: GSD-10780

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-03-25 20:40:25 +01:00
Wojciech Konior 817c6df46d fix: unify sources from shared added to dll&tests
Related-To: NEO-4372

Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2025-03-25 16:39:18 +01:00
Jaroslaw Warchulski 1230ab99aa refactor: adjust product helper file name after pre-gen12 removal
Related-To: NEO-12681
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-03-25 15:57:46 +01:00
Jaroslaw Warchulski cb2add406e fix: do not use ftrE2ECompression flag for xe2+
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-03-25 09:29:50 +01:00
Lukasz Jobczyk 35eae3f977 performance: Improve ULLS light residency management
-skip bo::wait if completion fence not needed
-do not restart exec on evict

Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-24 17:48:55 +01:00
Pratik Bari f67663d863 fix(sysman): Create Engine Handles for multiple gt w.r.t Xe driver
For the Xe driver, a single tile can have multiple gts. Engine
handles are now created for all gts available on a single tile, as is
the case for the Xe driver.

Related-To: NEO-14401

Signed-off-by: Pratik Bari <pratik.bari@intel.com>
2025-03-24 17:14:02 +01:00
Bartosz Dunajski 42ced7f5eb refactor: add NotLockable param print
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-24 16:22:21 +01:00
Bartosz Dunajski 8c21a9634f refactor: add missing file to cmake list
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-24 15:19:03 +01:00
Kamil Kopryk 73795ced64 refactor: add setupTimestampPacketFlushL3 function
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-24 13:10:03 +01:00
Damian Tomczak 26c23460cb fix: set heapless mode for builtins compilation
Related-to: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-24 12:01:58 +01:00
Lukasz Jobczyk 8fbde30432 fix: Adjust waitpkg threshold to 12 us
Resolves: NEO-14434, NEO-14435
Related-To: NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-24 11:43:54 +01:00
Szymon Morek a15de0c4f6 performance: enable ULLS on LNL Linux
Related-To: NEO-9004

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-24 10:44:23 +01:00
Tomasz Biernacik f4f13dfeec Revert "performance: disable cpu caching for buffers on PTL"
This reverts commit c3f0c4e4ac.

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-24 10:23:07 +01:00
Dominik Dabek 6e998fc3c1 fix: move host usm reuse max size to mem manager
Initialize value on memory manager creation.

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-24 08:53:30 +01:00
Lukasz Jobczyk 54cb0e24f8 performance: Switch waitpkg use to tpause for ULLS light
Related-To: NEO-13922, NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-21 18:09:37 +01:00
Szymon Morek bb10290828 fix: make misaligned user memory 2-Way coherent
Related-To: NEO-9004

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-21 17:56:37 +01:00
Lukasz Jobczyk 7c75807987 performance: Enable waitpkg with tpause
Resolves: NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-21 17:30:55 +01:00
Compute-Runtime-Validation 28dd4cf4d8 Revert "feature: additional checkers to enable feature"
This reverts commit c95a67b420.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-21 16:48:05 +01:00
Maciej Plewka 1b7bf690d2 fix: update fence values when residency list passed to switch ring buffer
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-21 12:26:08 +01:00
Lukasz Jobczyk 8a85a96ed2 feature: Add 3-level wait scheme with tpause intrinsic
Related-To: NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-21 12:12:57 +01:00
Aravind Gopalakrishnan 4e9afb32e6 refactor: Fix coverage issue with parse ccs mode
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-03-21 11:12:24 +01:00
Damian Tomczak c95a67b420 feature: additional checkers to enable feature
Resolves: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-21 10:38:53 +01:00
Chodor, Jaroslaw 0bff9def3b feature: New allocators for opaque arrays
Adds fast allocators for opaque objects of uniform size.

Related-To: NEO-13406

Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-03-20 21:59:44 +01:00
Lukasz Jobczyk 75461d190d performance: Adjust umwait counter value
Related-To: NEO-13922, NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-20 13:57:25 +01:00
Damian Tomczak e6ff523a8e fix: heapless has the highest priority as builtin
Related-to: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-20 08:10:44 +01:00
Bartosz Dunajski 6e17df4fb3 refactor: improve EncodeAluHelper
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-19 15:44:34 +01:00
Michal Mrozek 6a9716b81c refactor: remove not needed code
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-03-19 14:53:57 +01:00
Aravind Gopalakrishnan 724ba20e41 fix: Parse CCS mode setting for non PVC platforms
Related-To: GSD-8785

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-03-19 12:47:46 +01:00
Lukasz Jobczyk 6cb52f71b4 fix: Avoid mutex deadlock when switch ulls light ring buffer
Related-To: NEO-14406

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-19 11:47:40 +01:00
Jitendra Sharma dda7876d3a feature: Introduce SipClassType::externalLib
Related-To: NEO-13737
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2025-03-19 11:06:03 +01:00
Jack Myers 0e25970853 fix: re-add switch case for once writable query
A change related to the tbx fault manager
incorrectly removed a switch case from
`AubHelper::isOneTimeAubWritableAllocationType`.

This fixes that and refactors some APIs to prevent
similar mistakes from happening again by cleaning
up logic.

Addresses show stopper for pre-si pytorch workflows.

Resolves: NEO-14399
Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-03-19 09:54:54 +01:00
Kamil Kopryk 2e729bcb4c refactor: move isTimestampWaitSupportedForQueues to productHelper
Related-to: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-19 09:31:33 +01:00
Compute-Runtime-Validation 5f7f0dd785 Revert "performance: Enable waitpkg"
This reverts commit 8ec5434460.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-18 13:28:29 +01:00
Michał Pryba b9df406a46 refactor: remove minGen field and pregen12 platform cases
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-03-18 12:49:34 +01:00
Maciej Plewka 36fa6d66ae fix: lock csr in stopDirectSubmission if needed
Related-To: NEO-13875, NEO-14143, HSD-16026538384, HSD-16026780358
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-18 09:30:38 +01:00
Maciej Bielski a7c46b8213 feature: support L2 cache reservation
Related-To: NEO-12837
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-03-17 19:41:55 +01:00
Mateusz Hoppe 9d052cd8ee fix: pass rootDevice when creating secondary contexts
- fix initialization of gfxCoreHelper in drm and wddm

Related-To: NEO-7824, NEO-8171

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-03-17 16:03:27 +01:00
Wojciech Konior 9c8ced8c78 fix: device ids mismatch error print
Related-To: NEO-12845

Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2025-03-17 12:58:12 +01:00
Compute-Runtime-Validation d54b74e8bd Revert "fix: Parse CCS mode setting for non PVC platforms"
This reverts commit 5ca78dfdd1.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-17 00:30:44 +01:00
Aravind Gopalakrishnan 5ca78dfdd1 fix: Parse CCS mode setting for non PVC platforms
Related-To: GSD-8785

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-03-15 01:39:51 +01:00
Kamil Kopryk 7f607427fa fix: correct dsh generation
Don't use hardcoded sampler state and sampler border color state sizes.

Related-To: NEO-14216

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-14 21:18:38 +01:00
Kamil Kopryk 2c77d2c698 refactor: add SetMaxBVHLevels debug flag
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-14 16:59:48 +01:00
Kamil Kopryk 01edd58be4 refactor: add additional check in addFlushRequiredCommand
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-14 13:54:20 +01:00
Tomasz Biernacik afcbebef0b Revert "performance: configure Xe2 compression"
This reverts commit ec7821ed84.

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-14 11:25:20 +01:00
Jack Myers 5f78147e16 fix: hotfix for svmcpu tbx uploads
Test program in the linked, related issue
is crashing in tbx mode. Tbx server indicated
upload of invalid memory was made before exit.

Running with debug messages showed that the
problematic upload was an svmcpu buffer when
running neo with separate cpu and gpu
buffers for shared memory management.

Using this info, the problem was narrowed down
to a missing unprotect call in page fault manager
related code, resulting in a protected (invalid)
memory region getting uploaded to tbx.

It is not yet clear why this unprotect call was not made,
since other svmcpu buffers were uploaded without issue.

This hotfix forces the unprotect call in the fault handler,
which allows the test program to run to completion. However,
there is now a failing test case.

Considering the critical nature of the associated
NEO issue and that this patch should unblock
the work depending on the fix, this hotfix should
get merged regardless of the failing test case.

In the meantime, I will continue triaging the
failing test and will implement a proper fix
once the root cause is isolated.

Related-To: NEO-13404
Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-03-14 04:47:21 +01:00
Compute-Runtime-Validation 13db35c35f Revert "fix: set correct default value of cacheable flag"
This reverts commit a8d9e7ed35.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-13 20:38:57 +01:00
Damian Tomczak 3bc6ed8424 fix: not every heapless kernel is stateless
Related-to: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-13 19:09:01 +01:00
Tomasz Biernacik c3f0c4e4ac performance: disable cpu caching for buffers on PTL
Related-To: NEO-9421

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-13 18:34:42 +01:00
Tomasz Biernacik a8d9e7ed35 fix: set correct default value of cacheable flag
Related-To: NEO-9421

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-13 18:26:02 +01:00
John Falkowski 4d281cf51d feature: Implement appendMemoryPrefetch for Shared System USM allocations
Related-To: NEO-12989

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2025-03-13 06:26:38 +01:00
Zbigniew Zdanowicz ddc0b0d03b feature: disable flat ring buffer for command list append operation
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-13 06:06:55 +01:00
shubham kumar e287174211 feature: Add EU stall support for xe2/xe3 core
Related-To: NEO-13777

Signed-off-by: shubham kumar <shubham.kumar@intel.com>
2025-03-12 23:10:24 +01:00
Compute-Runtime-Validation fa2e3adad3 Revert "feature: Implement appendMemoryPrefetch for Shared System USM Allocat...
This reverts commit 97799b3faf.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-12 05:55:32 +01:00
Compute-Runtime-Validation 0d5baa2c30 Revert "performance: Cache timestamps on CPU"
This reverts commit 83637404bf.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-12 04:41:46 +01:00
Katarzyna Cencelewska 4890150e12 feature: add method adjustMaxThreadsPerThreadGroup to product helper
Related-To: HSD-18028334016, HSD-14022274275

Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2025-03-11 15:47:57 +01:00
Mateusz Hoppe a86105814d fix: default initialize allocated memory when memory is overwritten
Related-To: NEO-5093

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-03-11 15:32:33 +01:00
Lukasz Jobczyk 8ec5434460 performance: Enable waitpkg
Resolves: NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-11 14:43:01 +01:00
Lukasz Jobczyk 83637404bf performance: Cache timestamps on CPU
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-11 13:40:18 +01:00
Mateusz Hoppe 2f1128fdcd refactor: add consteval to compile time functions in debug manager
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-03-11 09:50:32 +01:00
John Falkowski 97799b3faf feature: Implement appendMemoryPrefetch for Shared System USM Allocations
Related-To: NEO-12989

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2025-03-11 09:12:48 +01:00
Lukasz Jobczyk be8545f7df refactor: reset all waitpkg variables on test start
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-11 08:44:36 +01:00
Szymon Morek dacbce7f01 performance: introduce staging read for cl_buffer
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-11 08:19:28 +01:00
Fabian Zwoliński f5e37e725c Revert "fix: configure ISA Pool params based on productHelper"
This reverts commit bf20ae7ae8.

Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-03-10 22:41:13 +01:00
Zbigniew Zdanowicz cd904269ed fix: request for task count should enable monitor fence dispatch
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-10 18:25:35 +01:00
Lukasz Jobczyk 8ede026c5d refactor: Use gmm resource info to print unknown usage
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-10 16:28:31 +01:00
Tomasz Biernacik ec7821ed84 performance: configure Xe2 compression
Related-To: NEO-9421

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-10 15:40:00 +01:00
Compute-Runtime-Validation 6ee39ed94c Revert "fix: Fail device init if kernel debugging is misconfigured"
This reverts commit b0c92ea425.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-10 12:23:07 +01:00
Brandon Yates b0c92ea425 fix: Fail device init if kernel debugging is misconfigured
Also print error to stderr

Related-to: GSD-10780

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-03-08 05:02:42 +01:00
Zbigniew Zdanowicz f8be8414ac feature: add support for wait event preamble in append command list
- add mechanism in queue to trigger start command from queue to regular
- add detection in immediate command list need to dispatch extra start in queue
- fix secondary linear stream in immediate case as it should not use container
- modify tests for primary batch buffer dispatch as default mode
- remove invalid or obsolete tests

Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-07 20:31:09 +01:00
Maciej Plewka c5e1fcf313 fix: Copy tiled 1D array per array element with BLT
Related-To: NEO-14147, HSD-14024424096, HSD-14024424178
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-07 17:29:04 +01:00
Tomasz Biernacik e644b09433 performance: override allocation caching on integrated platforms
Related-To: NEO-9421

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-07 15:44:00 +01:00
Slawomir Milczarek 8e251ed274 refactor: Extended drm wrapper with primary context export / import handle
Related-To: NEO-12952

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-03-07 15:17:34 +01:00
Dominik Dabek 2170f5ca88 refactor: usm reuse to unique ptr
Change usm allocation cache in usm manager to unique ptr

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-07 15:14:58 +01:00
Lukasz Jobczyk 53062056ec performance: Enable wait pkg for ULLS light
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-07 12:18:38 +01:00
Jaroslaw Warchulski 413194bd2a Revert "fix: do not prefer image compression on xe_lpg for linux and WSL"
This reverts commit 8814b6ac4f.

Resolves: NEO-14286
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-03-07 11:38:46 +01:00
Maciej Bielski 8ad6a7a86b feature: unify local memory size getter for i915 and xe
Previously, i915 used a sysfs node to query the amount of local
memory of a tile. For xe no such sysfs node exists, so the function
was useless there.

Instead of a sysfs node, leverage the local memory size present in
MemoryInfo (queried at init-stage) so that the output is now the same
for both i915 and xe.

Related-To: NEO-11127
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-03-07 09:50:19 +01:00
Szymon Morek f08b32761e fix: avoid race when evicting resources
Related-To: NEO-13843

DrmMemoryOperationsHandler::mutex is used for residency handling.
However, this mutex is not locked during eviction when vm bind fails.
This causes races, for example when the ULLS controller wants to make
resources resident while a user thread wants to evict them.
The fix is to explicitly take the lock in the problematic path.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-07 07:01:41 +01:00
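Sketch for f08b32761e above: a minimal illustration of the locking pattern the message describes, in which the eviction fallback takes the same mutex as the residency path. ResidencyTracker and its methods are hypothetical names chosen for this example; this is not the actual DrmMemoryOperationsHandler code.

```cpp
#include <cassert>
#include <cstdint>
#include <mutex>
#include <set>

// Hypothetical stand-in for a residency handler; not NEO's actual class.
class ResidencyTracker {
  public:
    // Residency path: already guarded by the mutex.
    void makeResident(std::uint64_t handle) {
        std::lock_guard<std::mutex> lock(mutex);
        resident.insert(handle);
    }

    // Fallback path taken when a (simulated) vm bind fails.
    // The point of the fix: this path takes the mutex as well.
    void evictAfterBindFailure(std::uint64_t handle) {
        std::lock_guard<std::mutex> lock(mutex);
        resident.erase(handle);
    }

    bool isResident(std::uint64_t handle) {
        std::lock_guard<std::mutex> lock(mutex);
        return resident.count(handle) != 0;
    }

  private:
    std::mutex mutex;
    std::set<std::uint64_t> resident;
};
```

With both paths serialized on one mutex, a thread making resources resident and a thread evicting them can no longer modify the shared container concurrently.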
Tomasz Biernacik 1aa418ec15 performance: override cacheable value passed to gmm on linux
Related-To: NEO-9421

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-06 21:45:01 +01:00
Damian Tomczak 81b0cac65f fix: raytracing heapless missing allocation
Related-to: NEO-12737

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-06 17:26:09 +01:00
Lukasz Jobczyk 61fe9ebb8f performance: Allow power efficient waits with ULLS light
Resolves: HSD-18041633458
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-06 14:20:58 +01:00
Maciej Plewka a93cecac36 fix: treat tiled 1D images as 2D with height 1 for BLT copies
Related-To: NEO-14147, HSD-14024424096, HSD-14024424178
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-06 11:22:30 +01:00
Dominik Dabek 9eb8e1812c feature: flag to log usm reuse operations
If the flag "LogUsmReuse" is set, usm reuse will log operations to a csv file.
Each line will contain: timestamp, host/device, operation type,
allocation size, true/false whether operation succeeded.

This data can then be used to produce graphs and help in analyzing
usm reuse in a particular workload.

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-06 11:06:27 +01:00
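Sketch for 9eb8e1812c above: one possible way to build a log line with the five fields the message lists (timestamp, host/device, operation type, allocation size, success). The function name, the field spellings "host"/"device", and the operation string used in the test are assumptions made for illustration; the real output format of LogUsmReuse may differ.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>

// Hypothetical helper; the field order follows the commit message,
// the concrete spellings of each field are assumptions.
std::string formatUsmReuseLogLine(std::uint64_t timestamp, bool isHost,
                                  const std::string &operation,
                                  std::size_t allocationSize, bool succeeded) {
    std::string line = std::to_string(timestamp);
    line += ',';
    line += isHost ? "host" : "device";
    line += ',';
    line += operation;
    line += ',';
    line += std::to_string(allocationSize);
    line += ',';
    line += succeeded ? "true" : "false";
    return line;
}
```

A file with one such line per operation can be loaded into any spreadsheet or plotting tool to graph reuse behaviour over time.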
Szymon Morek 82fba79d9d performance: set 1ms timeout for ulls controller on LNL and PTL
Related-To: NEO-13843

Limit scope to Windows only.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-06 09:13:58 +01:00
Brandon Yates 64b027f71c feature: Add gfxCoreHelper for StateSip required
Related-to: NEO-12967

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-03-05 20:24:17 +01:00
Kamil Kopryk 4c795027e3 refactor: add check if event L3 flush is needed
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-05 18:25:29 +01:00
Zbigniew Zdanowicz ae1eb076b7 feature: add optional epilogue to flush task method
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-05 15:15:59 +01:00
Tomasz Biernacik 92512fdc64 fix: preserve allocation type for memory object
Related-To: NEO-9421

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-05 14:48:57 +01:00
Lukasz Jobczyk 5a04d1e10a refactor: Print unknown gmm usage type value
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-05 10:34:40 +01:00
Compute-Runtime-Validation 98d1d629a8 Revert "feature: Enable defer backing by default"
This reverts commit fffe8f8027.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-05 03:35:03 +01:00
Compute-Runtime-Validation 3414abe74a Revert "fix: make global fence always resident on linux"
This reverts commit efb814d979.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-05 02:07:38 +01:00
Zbigniew Zdanowicz 27d7d72033 feature: add pipeline state management for append command list operation
- command list append state is managed from internal queue and can be skipped
- initial state configuration should be processed by both kernel and non-kernel
- only kernel operation can process required state, as non-kernel cannot change

Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-04 20:43:22 +01:00
Szymon Morek ff4da67979 fix: signal notify field before KMD wait
Related-To: NEO-13870

Currently all monitor fences are triggering
interrupt due to Notify Enable field.
With this change, such field is programmed
right before KMD wait.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-04 15:19:16 +01:00
Bellekallu Rajkiran fffe8f8027 feature: Enable defer backing by default
Related-To: NEO-13403

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2025-03-04 14:29:53 +01:00
Szymon Morek efb814d979 fix: make global fence always resident on linux
Related-To: NEO-13843

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-04 12:32:53 +01:00
Chuansheng Liu 1876a43024 performance: add low latency hint for i915
i915 provides a low latency hint interface, which considerably
improves performance for light workloads.

Related-To: NEO-14250

Signed-off-by: Chuansheng Liu <chuansheng.liu@intel.com>
2025-03-04 11:14:04 +01:00
Zbigniew Zdanowicz db99c25c79 feature: add support to dispatch epilogue commands into dedicated stream
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-03 14:39:34 +01:00
Jaroslaw Warchulski 8814b6ac4f fix: do not prefer image compression on xe_lpg for linux and WSL
Related-To: HSD-18034872015
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-02-28 14:20:57 +01:00
Zbigniew Zdanowicz 08b13750a1 fix: set stall cmd flag for bcs flush task count flag
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-02-28 12:24:51 +01:00
Zbigniew Zdanowicz cae3bb1d0a feature: add internal interfaces to manage all dispatch models of command lists
- add new enum type for command list flush from immediate
- add new argument for flushing immediate command list - regular command list
- add capability to provide additional stream for epilogue commands
- add pointer to provide external csr mutex to lock both execution and flush

Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-02-28 12:02:22 +01:00
Lukasz Jobczyk b7cba510a3 fix: Do not increase host USM alignment when CAL enabled
Resolves: GSD-10808

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-28 10:10:42 +01:00
Andrzej Koska 021723e154 fix: Removal of unnecessary offsets during blitter copying
When copying with the blitter and setting ArrayIndex to zero, additional
offsets from the origin are no longer necessary.

Related-To: NEO-14171

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2025-02-28 09:32:19 +01:00
Katarzyna Cencelewska 0f48dd465f feature: add debug print with global memory size
during check if there is space for scratch available

Related-To: HSD-18041185560
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2025-02-27 23:11:19 +01:00
Kamil Kopryk 4503972faf refactor: add adjustSamplerStateBorderColor function
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-02-27 14:05:09 +01:00
Jaroslaw Warchulski 837d6f5ad1 fix: do not enable compression on xe_lpg for linux and WSL
Resolves: HSD-18034872015
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-02-27 11:12:23 +01:00
Compute-Runtime-Validation c02dc8208c Revert "fix: do not enable compression on xe_lpg"
This reverts commit a6abda83a4.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-27 03:15:32 +01:00
Lukasz Jobczyk 20d29207cd refactor: Allow debug key to force USM cleaner with ULLS light
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-26 17:52:18 +01:00
Vysochyn, Illia 1eafbc30f8 feature: Enable eu thread scheduling mode override via IDD
Enables eu thread scheduling mode override via the
INTERFACE_DESCRIPTOR_DATA and COMPUTE_WALKER.

Defines encodeEuSchedulingPolicy operating on INTERFACE_DESCRIPTOR_DATA
for Xe3 platform.

Adds tests verifying that thread scheduling mode can be overridden via
COMPUTE_WALKER.

Related-To: NEO-13771, HSD-18041256338, HSD-14013056398

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-02-26 17:04:43 +01:00
Wojciech Konior 7f81d0e3e8 fix: vmbind of i915 canonization check
Related-To: NEO-7313

Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2025-02-25 19:00:31 +01:00
Jaroslaw Warchulski a6abda83a4 fix: do not enable compression on xe_lpg
Resolves: HSD-18034872015
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-02-25 18:05:56 +01:00
Mateusz Jablonski 374863ba08 refactor: remove cl pipe related logic
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-25 17:56:53 +01:00
Jitendra Sharma d480cc093c feature: Add method to determine if Sip should be obtained from external library
Related-To: NEO-13735
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2025-02-25 15:14:54 +01:00
Mateusz Hoppe 291745cdf7 feature: add support for ImplicitArgs versioning
- define ImplicitArgs version 1

Related-To: NEO-14115

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-02-25 10:27:11 +01:00
Slawomir Milczarek 0a7ed9f9f4 feature: Add ioctl helpers for primary contexts
Related-To: NEO-12952

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-02-24 17:37:05 +01:00
Kamil Kopryk 7c3468794e fix: add option to enable/disable heapless in ocloc
Related-To: GSD-10681
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-02-24 17:06:26 +01:00
Lukasz Jobczyk 6dd948a4b3 performance: Pass residency container as const ptr
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-24 15:18:55 +01:00
Compute-Runtime-Validation f907c7dcb2 Revert "performance: set 1ms timeout for ulls controller since xe2"
This reverts commit 28c008e438.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-24 15:04:54 +01:00
Lukasz Jobczyk be946ae56c performance: Optimize make resident for ULLS light
Do not check if ULLS light is active during every Csr::makeResident
call. Store that information once during ULLS init.

Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-24 12:59:56 +01:00
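Sketch for be946ae56c above: a minimal illustration of the optimization the message describes, where the "is ULLS light active" answer is computed once at init and the hot makeResident path only reads a cached flag. The class, its members, and the call counter are hypothetical and exist only to make the effect observable; this is not the actual Csr implementation.

```cpp
#include <cassert>

// Hypothetical command stream receiver; names are illustrative only.
class CommandStreamReceiverSketch {
  public:
    // Stands in for the comparatively expensive check; counts its calls.
    bool queryUllsLightActive() {
        ++queryCount;
        return ullsLightConfigured;
    }

    // Called once during ULLS init: store the answer for later use.
    void initDirectSubmission(bool ullsLight) {
        ullsLightConfigured = ullsLight;
        ullsLightCached = queryUllsLightActive();
    }

    // Hot path: reads the cached flag instead of querying again.
    void makeResident() {
        if (ullsLightCached) {
            ++ullsLightResidencyUpdates;
        }
    }

    int queryCount = 0;
    int ullsLightResidencyUpdates = 0;

  private:
    bool ullsLightConfigured = false;
    bool ullsLightCached = false;
};
```

However many times makeResident runs, the query executes only once, at init.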
Mateusz Jablonski 50e22ef5e7 refactor: remove device enqueue specific code
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-24 12:37:51 +01:00
ocldev fd2a6b55c2 feature: brand string update
Related-To: NEO-7929

Updating device brand strings
Based on public support in Windows driver 32.0.101.6559
https://www.intel.com/content/www/us/en/download/785597/

Signed-off-by: ocldev <ocldev@intel.com>
2025-02-24 12:21:52 +01:00
Compute-Runtime-Validation f64fd7fc42 Revert "performance: disable cpu caching for buffers on PTL"
This reverts commit 681ef3693a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-24 10:07:15 +01:00
Lukasz Jobczyk 1f7c67ea8d performance: Enable ULLS light for ARLH
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-21 18:00:46 +01:00
Compute-Runtime-Validation 66c4288494 Revert "fix: using canonized addr in xe kmd"
This reverts commit 683d7c7acf.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-21 17:49:05 +01:00
Lukasz Jobczyk 91c07e24f6 performance: Enable ULLS light for MTLH
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-21 16:52:02 +01:00
Lukasz Jobczyk cad776412b performance: Enable ULLS light for ARLS, ARLU, MTLU
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-21 16:10:18 +01:00
Wojciech Konior 683d7c7acf fix: using canonized addr in xe kmd
Related-To: NEO-7313

Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2025-02-21 16:06:06 +01:00
Lukasz Jobczyk 5961850bbc fix: blink ULLS light
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-21 13:31:35 +01:00
Szymon Morek 681ef3693a performance: disable cpu caching for buffers on PTL
Related-To: NEO-14168

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-21 13:07:04 +01:00
Compute-Runtime-Validation a929551cb2 Revert "performance: Enable ULLS light for ARLS, ARLU, MTLU"
This reverts commit 0b6b51fa19.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-21 09:35:00 +01:00
Szymon Morek 01a66a9347 fix: download staging buffer before reading from it
Related-To: NEO-14154

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-21 09:30:23 +01:00
Compute-Runtime-Validation dda4beb719 Revert "performance: enable Direct Submission on LNL Linux"
This reverts commit b7d21b135c.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-21 04:13:28 +01:00
Bartosz Dunajski 85b0d1f84d fix: clear all TS packets for combined CB event
Related-To: NEO-13971

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-02-20 19:24:15 +01:00
Mateusz Jablonski 3e45cf2a6c feature: add debug key to expose L0 DDI Handles extension
Related-To: NEO-13121, NEO-13917
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-20 17:24:33 +01:00
Mateusz Jablonski 70f3342c2d refactor: remove vme specific code
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-20 17:08:52 +01:00
Filip Hazubski 794b21a3fa refactor: Add extra parameters to computeSlmValues function
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-20 15:35:52 +01:00
Radoslaw Jablonski b7d21b135c performance: enable Direct Submission on LNL Linux
Related-To: NEO-9004

Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com>
2025-02-20 15:22:44 +01:00
Mateusz Jablonski 7865509aa0 feature: enable WMTP on BMG
Related-To: NEO-12814
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-20 12:52:40 +01:00
Compute-Runtime-Validation 0996247dfd Revert "performance: Enable ULLS light for MTLH"
This reverts commit 7a62f27d8b.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-20 11:11:51 +01:00
Bartosz Dunajski a0a735804f feature: disable counter caching for external storage Events
Related-To: NEO-13971

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-02-20 11:11:29 +01:00
Fabian Zwoliński bf20ae7ae8 fix: configure ISA Pool params based on productHelper
When is2MBLocalMemAlignmentEnabled returns true,
increase pool size for builtins from 64k to 2MB.

Additionally, set appropriate alignment for kernel ISA heap allocations
and configure isaAllocationPageSize based on productHelper.

Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-02-20 08:42:35 +01:00
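Sketch for bf20ae7ae8 above (later reverted by f5e37e725c): the pool size selection the message describes, written as a free function. The function name and the KB/MB constants are assumptions for this example; in the commit the decision is driven by productHelper's is2MBLocalMemAlignmentEnabled.

```cpp
#include <cassert>
#include <cstddef>

constexpr std::size_t KB = 1024;
constexpr std::size_t MB = 1024 * KB;

// Hypothetical helper: picks the builtins ISA pool size from the
// 2MB local memory alignment setting, 64k otherwise.
constexpr std::size_t builtinsIsaPoolSize(bool is2MBLocalMemAlignmentEnabled) {
    return is2MBLocalMemAlignmentEnabled ? 2 * MB : 64 * KB;
}
```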
Compute-Runtime-Validation 29b3cb3c85 Revert "performance: Enable ULLS light for ARLH"
This reverts commit 20d00f53eb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-20 03:54:49 +01:00
Maciej Plewka 8ba43065c5 fix: handle literal raw strings correctly in printf formatter
Related-To: NEO-14138
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-02-19 17:14:54 +01:00
Lukasz Jobczyk 7a62f27d8b performance: Enable ULLS light for MTLH
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-19 16:25:08 +01:00
Lukasz Jobczyk 20d00f53eb performance: Enable ULLS light for ARLH
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-19 15:55:07 +01:00
Lukasz Jobczyk 0b6b51fa19 performance: Enable ULLS light for ARLS, ARLU, MTLU
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-19 15:47:50 +01:00
Lukasz Jobczyk 42261526e4 refactor: Handle dlopen error
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-19 15:28:48 +01:00
Szymon Morek 28c008e438 performance: set 1ms timeout for ulls controller since xe2
Related-To: NEO-13843

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-19 13:48:26 +01:00
Lukasz Jobczyk b0e3d8656e refactor: Add OV detection mechanism for ULLS light
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-19 13:28:29 +01:00
Filip Hazubski b60c02d597 fix: Add asserts to ensure NonCopyable and NonMovable n/n
Related-To: NEO-14068

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-19 11:36:24 +01:00
Jitendra Sharma 2daca51bd9 build: update eudebug headers
Related-To: NEO-14069
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2025-02-19 09:22:15 +01:00
Filip Hazubski cff9ea3966 performance: Use std::move when initializing RelocationInfo
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 20:47:52 +01:00
Filip Hazubski 4be1153253 fix: Remove pragma once from inl files
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 20:19:15 +01:00
Vysochyn, Illia e59f61b02e feature: Update INTERFACE_DESCRIPTOR_DATA for Xe3
Related-To: NEO-13771, HSD-18041256338, HSD-14013056398

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-02-18 18:03:13 +01:00
Filip Hazubski 6b6202446b fix: Add asserts to ensure NonCopyable and NonMovable 3/n
Related-To: NEO-14068

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 17:16:03 +01:00
Bellekallu Rajkiran ff9d0d7ea0 fix: fix setting up ccs mode
Related-To: NEO-14122

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2025-02-18 17:10:51 +01:00
Chodor, Jaroslaw 8f43be8800 refactor: Enforce constexpr evaluation on getNonReleaseKeyName
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-02-18 16:03:41 +01:00
Vysochyn, Illia 42e06790da feature: Update COMPUTE_WALKER for Xe3
Related-To: NEO-13771, HSD-18041256338, HSD-14013056398

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-02-18 15:53:31 +01:00
Bartosz Dunajski c1f2ff1ad6 fix: disable batched dispatch mode in aub csr
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-02-18 13:45:15 +01:00
Filip Hazubski c651209617 fix: Add asserts to ensure NonCopyable and NonMovable 2/n
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 13:08:00 +01:00
Filip Hazubski 6b2b42972a fix: Add asserts to ensure NonCopyable and NonMovable 1/n
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 09:41:20 +01:00