5470 Commits

Author SHA1 Message Date
Michal Mrozek c41ac0ad4c refactor: remove not needed code
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-04-25 16:44:01 +02:00
Lukasz Jobczyk 97358acabe fix: use mfence instead of sfence on discrete devices
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-25 13:32:25 +02:00
Mateusz Hoppe 41efee1e7c feature: add mode to expose single root device
- simplify API to expose and use only single root device. Disallow
returning subdevices. This is experimental mode.

Related-To: NEO-14559

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-04-25 13:20:56 +02:00
Lukasz Jobczyk dfcdbb20cc fix: Move ULLS semaphore to smem on xe2
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-25 12:54:27 +02:00
Lukasz Jobczyk a75f1434b0 performance: Store vm bind available in drmCsr
Resolves: GSD-10986

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-24 15:45:05 +02:00
Szymon Morek 8ec5688ca1 Revert "performance: extend usage of staging buffers"
This reverts commit 6a4ddf5dfb.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-24 11:02:27 +02:00
Lukasz Jobczyk f36e3c7d23 refactor: Disable gem close worker in ULTs
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-23 17:58:18 +02:00
Michał Pryba edfc512d3b test: adjust test matchers after pre-gen12 removal
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-04-23 14:30:21 +02:00
Maciej Plewka 6d085ed0d4 fix: Return max wg count equal one on platforms without dispatch all support
Second push of a change that was reverted due to errors in tests
Related-To: NEO-14125, GSD-10725

Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-04-23 13:48:11 +02:00
Lukasz Jobczyk 53f0832614 fix: Register shared handle size on windows
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-23 13:00:27 +02:00
Lukasz Jobczyk 5d2b673ef6 fix: Adjust waitpkg threshold to 20 us
Resolves: HSD-13012958900

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-23 10:46:22 +02:00
Compute-Runtime-Validation 30f6ddb6c7 Revert "refactor: deallocate via gmm"
This reverts commit bf861d8d49.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-23 03:50:43 +02:00
Szymon Morek 6a4ddf5dfb performance: extend usage of staging buffers
Related-To: NEO-14026

Move enabling from xe2 to xe2_and_later

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-22 16:46:52 +02:00
Lukasz Jobczyk bf861d8d49 refactor: deallocate via gmm
Related-To: NEO-13051

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-22 16:46:36 +02:00
Lukasz Jobczyk d510f742da fix: Pass Deallocate2 callback to gmm
Resolves: HSD-18041541320

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-22 12:08:56 +02:00
Aravind Gopalakrishnan b4f95a1a26 feature: Add pidfd option for IPC handle import
Related-To: NEO-10380

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-04-18 13:01:27 +02:00
Fabian Zwoliński 204577a91f fix: disable TimestampPoolAllocator
Related-To: NEO-12287, HSD-13012945876
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-18 11:30:35 +02:00
Compute-Runtime-Validation fe1db94454 Revert "performance: Remove global fence from command stream on BMG"
This reverts commit 0c778ef6f4.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-18 09:23:16 +02:00
Young Jin Yoon 05c6612386 refactor: add EncodePostSyncArgs to EncodeDispatchKernelArgs
Refactored various member variables in EncodeDispatchKernel to directly
include EncodePostSync
Changed command encoder and command list to use the modified
EncodeDispatchKernel.

Related-To: NEO-13003
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-04-17 20:55:40 +02:00
Slawomir Milczarek a2d25ca31f feature: Add syscall wrappers for pidfdopen and pidfdgetfd
Related-To: NEO-12952

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-04-17 19:50:39 +02:00
Szymon Morek 3596522637 refactor: remove unused logic in ULLS controller
Related-To: NEO-13843

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-17 18:35:20 +02:00
Oskar Hubert Weber 1816c8eac4 fix: avoid joining thread in deferred deleter
Joining the thread in DllMain (which we are not aware of)
can result in a hang in DeferredDeleter: if the library is
freed before the worker thread calls FreeLibraryAndExitThread,
the worker thread gets stuck, and the main thread is then stuck on worker->join().

Related-To: NEO-14121

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2025-04-17 16:17:42 +02:00
Fabian Zwoliński ee769f5983 fix: Add texture cache flush after kernels writing to images on imm cmdlists
- add `hasImageWriteArg` flag to KernelDescriptor,
based on image access qualifier
- add `isPostImageWriteFlushRequired` method to productHelper,
to require cache flush only on selected platforms
- if possible, add the `textureCacheInvalidationEnable` arg to the
existing PC, otherwise add a new PC

Related-To: NEO-13427, HSD-18041321008
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-17 16:04:31 +02:00
Young Jin Yoon 5a2a792c34 Revert "refactor: add EncodePostSyncArgs to EncodeDispatchKernelArgs"
This reverts commit 40aef1555e.

Related-To: NEO-13003
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-04-17 15:37:05 +02:00
Compute-Runtime-Validation b96cf55985 Revert "fix: Return max wg count equal one on platforms without dispatch all ...
This reverts commit b8be602bfb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-17 12:10:11 +02:00
Lukasz Jobczyk d9a40734ae performance: Remove global fence from CW post sync on BMG
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-17 10:23:40 +02:00
Young Jin Yoon 40aef1555e refactor: add EncodePostSyncArgs to EncodeDispatchKernelArgs
Refactored various member variables in EncodeDispatchKernel to directly
include EncodePostSync
Changed command encoder and command list to use the modified
EncodeDispatchKernel.

Related-To: NEO-13003
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-04-16 17:46:07 +02:00
Lukasz Jobczyk f335295432 test: correct uint32_t
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-16 17:22:44 +02:00
Lukasz Jobczyk 0c778ef6f4 performance: Remove global fence from command stream on BMG
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-16 16:01:05 +02:00
Szymon Morek 4d60465a9e fix: don't flush texture cache if not required
Related-To: NEO-14645

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-16 10:38:24 +02:00
Chodor, Jaroslaw b423084861 refactor: Change FinalizerInputType debug var to string
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-16 10:33:19 +02:00
Mateusz Jablonski f6b92aa5d5 build: update default PTL revision to B0
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-16 09:44:58 +02:00
Tomasz Biernacik eff2d1cde8 performance: change usage for semaphoreBuffer on integrated devices
Related-To: NEO-9421

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-04-16 09:07:05 +02:00
Chandio, Bibrak Qamar bed2ad1ac0 feature: memAdvise for System Allocator
Related-To: NEO-13715

Implements basic functionality for zeCommandListAppendMemoryAdvise for
system allocator

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-04-16 07:11:14 +02:00
shubham kumar 48bbf12eb5 fix: correct gfx_core_helper definitions for EUSS
Related-To: NEO-14337

Signed-off-by: shubham kumar <shubham.kumar@intel.com>
2025-04-15 20:02:36 +02:00
Maciej Plewka b8be602bfb fix: Return max wg count equal one on platforms without dispatch all support
Related-To: NEO-14125, GSD-10725
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-04-15 15:41:23 +02:00
Mateusz Jablonski c16d02736d test: remove not needed test trait
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-15 15:20:25 +02:00
Maciej Bielski bf68fed199 test: mark a function as override
Clang-tidy rule: `[modernize-use-override]`

Related-To: NEO-14506
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-04-15 14:32:27 +02:00
Lukasz Jobczyk f7939735da refactor: Adjust USM cleaner to ULLS light
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-15 14:05:02 +02:00
Lukasz Jobczyk 455209aadc refactor: Simplify fence selection in ULLS
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-15 14:04:47 +02:00
Szymon Morek 1432f6a7b2 test: don't call expectMemory on compressed data
Compressed memory can't be compared using expectMemory api.
Instead, compressed memory is transferred to usm host.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-15 09:22:48 +02:00
Marcel Skierkowski 31f0fd4672 refactor: rename variable slmSize
The max programmable SLM size is stored in RuntimeCapabilityTable as slmSize.
That name is misleading.
Rename the variable slmSize to better reflect the actual meaning of the member.

Related-To: NEO-12949
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-14 20:00:29 +02:00
Chodor, Jaroslaw 7bded13638 feature: Enabling support for redirecting FCL to IGC for OCL
This allows for invoking IGC as OCL C translator in OpenCL paths

Accepted debug env :
UseIgcAsFcl=1 ; forces IGC
UseIgcAsFcl=2 ; forces FCL
UseIgcAsFcl=0 ; uses defaults

Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-14 18:41:48 +02:00
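The three `UseIgcAsFcl` values listed in commit 7bded13638 can be read as a small mapping from flag value to translator choice. The following is a minimal sketch of that mapping only. The enum, both function names, the direct use of `std::getenv`, and the handling of unlisted values are assumptions made for illustration; the runtime reads debug variables through its own settings layer.

```cpp
#include <cstdlib>

// Hypothetical enum; not taken from the runtime sources.
enum class OclCTranslator { platformDefault, igc, fcl };

// Maps the values listed in the commit message onto a translator choice.
// Treating unlisted values like 0 is an assumption of this sketch.
OclCTranslator translatorFromFlag(long useIgcAsFcl) {
    switch (useIgcAsFcl) {
    case 1:
        return OclCTranslator::igc; // UseIgcAsFcl=1 forces IGC
    case 2:
        return OclCTranslator::fcl; // UseIgcAsFcl=2 forces FCL
    default:
        return OclCTranslator::platformDefault; // UseIgcAsFcl=0 uses defaults
    }
}

// Reads the flag straight from the process environment (assumption: the
// real runtime goes through its debug-settings layer instead).
OclCTranslator translatorFromEnvironment() {
    const char *raw = std::getenv("UseIgcAsFcl");
    if (raw == nullptr) {
        return OclCTranslator::platformDefault;
    }
    char *end = nullptr;
    const long value = std::strtol(raw, &end, 10);
    if (end == raw) { // value is not a number
        return OclCTranslator::platformDefault;
    }
    return translatorFromFlag(value);
}
```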
Jaroslaw Warchulski 3e1aa33924 refactor: cleanup headers
Related-To: NEO-5548
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-14 14:59:40 +02:00
Maciej Bielski 6ecf5b373e feature: exit with warning if small bar detected on Linux
Return early and provide justification, otherwise SIGBUS error is
returned.

Related-To: NEO-14506
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-04-14 14:43:25 +02:00
Marcel Skierkowski e82be94368 refactor: Modernize writeDataToFile function
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-14 14:28:58 +02:00
Kamil Kopryk dd3d294f87 performance: cache MOCS values
This change caches the most used MOCS values:

* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CONST);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CACHELINE_MISALIGNED);
inside gmmHelper class during initialization to avoid repeated
calls of virtual functions, branches and/or gmm lib access.

and adds more readable corresponding getters:
* getL1EnabledMOCS
* getL3EnabledMOCS
* getUncachedMOCS

If force all resources uncached is called,
these 3 cached mocs values are reinitialized

It also changes the order of gmmHelper members, to avoid
not needed padding after addressWidth
and simplifies logic in getMocsIndex function
for xehp and later products.

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-04-14 14:12:48 +02:00
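The caching idea in commit dd3d294f87 can be illustrated with a small stand-alone class: look the three values up once, serve them from plain members, and refresh them when all resources are forced uncached. This is only a sketch. `MocsCache`, `Usage`, and the `Lookup` callback are hypothetical stand-ins for gmmHelper and the gmm library; the pairing of each usage with a getter follows the order of the two lists in the commit message, and the behavior after forcing uncached is an assumption.

```cpp
#include <cstdint>
#include <functional>
#include <utility>

// Hypothetical stand-ins for the three GMM_RESOURCE_USAGE_OCL_* values.
enum class Usage { bufferConst, buffer, bufferCachelineMisaligned };

class MocsCache {
  public:
    // Stand-in for the comparatively expensive getMOCS(usage) call.
    using Lookup = std::function<std::uint32_t(Usage)>;

    explicit MocsCache(Lookup fn) : lookup(std::move(fn)) { refresh(); }

    // Getters return cached members; no lookup happens here.
    std::uint32_t getL1EnabledMOCS() const { return l1Enabled; }
    std::uint32_t getL3EnabledMOCS() const { return l3Enabled; }
    std::uint32_t getUncachedMOCS() const { return uncached; }

    // Reinitializes the three cached values (assumed: all become uncached).
    void forceAllResourcesUncached() {
        allUncached = true;
        refresh();
    }

  private:
    void refresh() {
        uncached = lookup(Usage::bufferCachelineMisaligned);
        l1Enabled = allUncached ? uncached : lookup(Usage::bufferConst);
        l3Enabled = allUncached ? uncached : lookup(Usage::buffer);
    }

    Lookup lookup;
    bool allUncached = false;
    std::uint32_t l1Enabled = 0;
    std::uint32_t l3Enabled = 0;
    std::uint32_t uncached = 0;
};
```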
Tomasz Biernacik 194c3058b8 performance: change usage for ringBuffer on integrated devices
Related-To: NEO-9421

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-04-14 12:25:38 +02:00
Tomasz Biernacik 18435df04c fix: unify deferring MOCS to PAT
Related-To: NEO-14643

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-04-11 20:10:25 +02:00
Chodor, Jaroslaw 9e14eaaed1 feature: Adding support for scoped debug variables
This allows for limiting debug variables to specific
components.

Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-11 17:52:17 +02:00
Szymon Morek 5b20450162 performance: enable compression on PTL
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-11 17:26:29 +02:00
Igor Venevtsev ef52479ce4 feature: add FillImage1dBuffer built-in kernel
Resolves: NEO-13984, HSD-18041422852

Signed-off-by: Igor Venevtsev <igor.venevtsev@intel.com>
2025-04-10 14:04:13 +02:00
Damian Tomczak f9516b8723 test: remove DEFAULT_TEST_FAMILY_NAME from fixture
Related-to: NEO-14395

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-04-09 23:29:25 +02:00
Marcel Skierkowski 5d01677454 refactor: Mock file system in ULTs
Functions: fileExists and loadDataToFile
use IO functions from namespace IoFunctions

Now tests that use these functions
are mocked by default,
but some still require access to real files,
so their ability to read files has been restored.
They will be mocked in next PRs.

Related-To: NEO-7006
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-09 19:51:46 +02:00
Young Jin Yoon 077fe1ab48 test: simplify postsync mocs tests for gen12lp
Fixed a clang-format error on unassigned value comparison with
one of command_encoder tests for gen12lp

Related-To: NEO-13003
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-04-09 19:51:31 +02:00
Maciej Plewka 286acf30a3 fix: Dispatch residency fence when stopping ulls on windows
Related-To: NEO-14396, HSD-18041496023, HSD-13012953666, NEO-14611
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-04-09 19:51:16 +02:00
Szymon Morek f4815a9b69 performance: enable ULLS on LNL Linux
Related-To: NEO-9004

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-09 14:45:46 +02:00
Lukasz Jobczyk 1b5519a880 fix: Adjust standard heaps when on 57 bit address space
Resolves: GSD-10871

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-09 12:51:03 +02:00
Zbigniew Zdanowicz b31c3bb3ca fix: remove doubled operations when executing regular on immediate
- immediate flush does not need to perform the operations below for command list execution:
  - handling indirect allocations
  - performing page fault migration
  - making unified shared memory resident
  - performing prefetching when required

Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-04-09 11:58:03 +02:00
Fabian Zwoliński 363aa8cdbd performance: align local memory Tag allocations to 2MB
Align local memory allocations of tag types to 2MB when
2MB alignment is enabled via the product helper
(is2MBLocalMemAlignmentEnabled flag).

Refactored the allocateGraphicsMemoryInDevicePool function to improve
readability and maintainability. Simplified the logic for
determining base size and final alignment by reducing redundant code.

Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-09 09:59:44 +02:00
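The size rounding described in commit 363aa8cdbd amounts to choosing an alignment and rounding the request up to it. A minimal sketch follows, assuming a 64 KB default alignment when the flag is off (the commit does not state the default) and using hypothetical helper names. Only the `is2MBLocalMemAlignmentEnabled` flag name comes from the commit message.

```cpp
#include <cstddef>

constexpr std::size_t kib64 = 64u * 1024u;       // assumed default alignment
constexpr std::size_t mib2 = 2u * 1024u * 1024u; // 2MB alignment from the commit

// Rounds size up to the next multiple of alignment (must be a power of two).
constexpr std::size_t alignUp(std::size_t size, std::size_t alignment) {
    return (size + alignment - 1) & ~(alignment - 1);
}

// Hypothetical helper: final size of a tag allocation in local memory.
constexpr std::size_t tagAllocationSize(std::size_t requested,
                                        bool is2MBLocalMemAlignmentEnabled) {
    return alignUp(requested, is2MBLocalMemAlignmentEnabled ? mib2 : kib64);
}
```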
Aravind Gopalakrishnan c7fe7dde90 fix: Correct Xe sysfs paths for ccs mode setting
Fix the sysfs paths for Xe KMD

Related-To: NEO-10378

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-04-09 06:51:18 +02:00
Marcel Skierkowski 4b75d9b218 test: mock filesystem in setting manager
Related-To: NEO-14084
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-08 18:41:15 +02:00
Bartosz Dunajski ab7364ae19 test: improve HWTEST2_F macro
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-04-08 17:18:14 +02:00
Maciej Plewka bb36614567 fix: add separate counter for tracking ring buffer completion
Related-To: NEO-14396, NEO-9548
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-04-08 16:30:44 +02:00
Jaroslaw Warchulski fb5da0c3e5 fix: do not forbid compression for xe_lpg Windows and DG2
Related-To: NEO-14583, NEO-14593
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-08 16:08:59 +02:00
Marcel Skierkowski b75fbe8e2c refactor: mock filesystem in ocloc ult pt.1
Mocked IO operations in ./ocloc_tests application

Mocked gtest stdout capture in ocloc tests

Related-To: NEO-14084
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-08 16:08:42 +02:00
Mateusz Jablonski 1c377dc930 fix: ensure payload arguments are patched before walker command is fetched
In case of indirect kernel launch, some payload arguments are patched
just before the walker command. This change disables prefetch, performs a
batch buffer start to the next bytes, and then re-enables prefetch. All these
operations are performed between MI_STORE_REGISTER_MEM and COMPUTE_WALKER.

Related-To: NEO-14584
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-08 11:57:45 +02:00
Lukasz Jobczyk ca45573dd3 fix: Handle new ring buffer residency when switch ULLS light ring
Related-To: NEO-14406, NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-08 10:12:57 +02:00
Young Jin Yoon 0de024dbf9 refactor: separate post-sync into EncodePostSync
Created EncodePostSync template struct to organize various post-sync
variables/functions from EncodeDispatchKernel

Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-04-07 20:06:14 +02:00
Vysochyn, Illia 950e4a22d8 test: Remove trailing whitespaces
Related-To: NEO-13972

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-04-07 16:34:38 +02:00
Vysochyn, Illia 146a7f7b9b feature: Add support for BUFFER_SIZE explicit argument
Related-To: NEO-13972

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-04-07 14:15:26 +02:00
Lukasz Jobczyk 8978ea5e5a performance: Do not create global fence allocation on integrated
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-07 11:22:04 +02:00
Jaroslaw Warchulski df07897144 fix: forbid compression for pre-xe2 platforms
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-05 00:15:16 +02:00
Compute-Runtime-Validation f332571d96 Revert "performance: Do not create global fence allocation on integrated"
This reverts commit ecf8a07d26.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-04 16:26:19 +02:00
Dominik Dabek bd516b3552 fix: usm reuse, clean from largest
When trimming old allocations in usm reuse start from largest
allocations.
This will reduce memory usage more quickly once max hold time is hit.

Related-To: NEO-6893, NEO-14429

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-04-04 14:57:15 +02:00
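The trimming order described in commit bd516b3552 can be sketched as a sort by size followed by a bounded sweep over expired entries. `CachedAllocation`, `trimExpiredLargestFirst`, and the `maxToFree` cap are hypothetical names introduced for this illustration; the actual reuse container and its limits are not shown in the commit message.

```cpp
#include <algorithm>
#include <chrono>
#include <cstddef>
#include <vector>

using Clock = std::chrono::steady_clock;

// Hypothetical record for an allocation held for reuse.
struct CachedAllocation {
    std::size_t size;
    Clock::time_point savedAt;
};

// Removes up to maxToFree entries that exceeded maxHoldTime, visiting the
// largest allocations first. Returns the sizes freed, in the order freed.
std::vector<std::size_t> trimExpiredLargestFirst(std::vector<CachedAllocation> &cache,
                                                 Clock::time_point now,
                                                 Clock::duration maxHoldTime,
                                                 std::size_t maxToFree) {
    std::sort(cache.begin(), cache.end(),
              [](const CachedAllocation &a, const CachedAllocation &b) {
                  return a.size > b.size;
              });
    std::vector<std::size_t> freed;
    auto it = cache.begin();
    while (it != cache.end() && freed.size() < maxToFree) {
        if (now - it->savedAt >= maxHoldTime) {
            freed.push_back(it->size);
            it = cache.erase(it); // entry is expired: release it
        } else {
            ++it; // still within hold time: keep it
        }
    }
    return freed;
}
```

Starting from the largest expired entries means that a capped sweep releases as many bytes as possible per pass, which matches the stated goal of reducing memory usage more quickly.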
Zbigniew Zdanowicz 58fe89e116 fix: remove doubled memory prefetch operation when executing command list
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-04-04 13:55:16 +02:00
Lukasz Jobczyk ecf8a07d26 performance: Do not create global fence allocation on integrated
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-04 11:45:22 +02:00
Dominik Dabek 3703ff550c fix: use real size when putting into usm reuse
Real allocation size should be used to properly apply limits and allow
more usm reuse hits.

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-04-04 09:44:32 +02:00
Chandio, Bibrak Qamar f344eb9bca test: ULT for makeResidentResources
Related-To: NEO-14056

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-04-04 05:55:08 +02:00
Mateusz Jablonski bb518adf34 fix: patching payload arguments in inline data in case of indirect kernel
Related-To: NEO-14532
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-03 17:21:28 +02:00
Szymon Morek 95e0244f70 fix: properly pass info about 3D image
Related-To: NEO-14538

It's valid for 3D image to copy 2D region.
Current checks for mip map do not consider that.
This change correctly checks for mip mapped 3D image.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 16:33:57 +02:00
Dominik Dabek be27367020 performance: usm reuse, avoid looking up svmData
Save svmData on putting into reuse, instead of searching each time.
Change UNRECOVERABLE_IF to DEBUG_BREAK_IF.

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-04-03 15:50:49 +02:00
Jaroslaw Warchulski c010d17842 fix: respect compression flag in capability table
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-03 15:36:55 +02:00
Szymon Morek 6ea83f322d fix: do not override user data beyond slice region
Related-To: NEO-14538

If user passes slice pitch which is larger than region
to copy, do not override memory beyond region but within
that slice pitch.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 12:37:35 +02:00
Jaroslaw Warchulski 62baf28316 fix: remove unnecessary WA for DG2 compression
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-03 08:04:19 +02:00
Filip Hazubski bc87b1cff0 test: Minor test improvements
Related-To: NEO-14526

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-04-03 07:53:18 +02:00
Brandon Yates 4651e72b0b fix: Fail device init if kernel debugging is misconfigured
Also print error to stderr

Related-to: GSD-10780

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-04-02 21:06:30 +02:00
Filip Hazubski 504440fc4d feature: Add ftrHeaplessMode flag
Pass hwInfo to isHeaplessModeEnabled and isForceBindlessRequired functions.

Related-To: NEO-14526

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-04-02 21:06:05 +02:00
Lukasz Jobczyk deca36fd32 fix: Stop ULLS light when evict resource
Related-To: NEO-14406

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-02 16:37:43 +02:00
Fabian Zwoliński 7ef3880793 feature: implement pool allocator for gpuTimestampDeviceBuffer
The patch applies to Level Zero.
Only allocations < 2MB will be fetched from the pool.
Allocations are shared and reused within a given device.

Additionally, I added a new debug flag to control the allocator:
EnableTimestampPoolAllocator

Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-02 14:28:56 +02:00
Szymon Morek 8836f6df0b fix: forward mip map level for 3D images
Related-To: NEO-14539

1D and 2D images already have the mip map level set correctly.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-02 11:46:41 +02:00
Aravind Gopalakrishnan 3a7d7e022c fix: Add platform support for reservation on svm heap
Related-To: GSD-10816

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-04-02 02:46:30 +02:00
Brandon Yates a48d66ad75 feature: Add programExceptions stub to CSR
Related-to: NEO-12967

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-04-01 18:33:40 +02:00
Szymon Morek 3010af596e performance: add infrastructure for staging with 3D images
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-01 15:30:30 +02:00
Mateusz Jablonski 744ff08454 test: correct verifying programmed GPU addresses
Use memcmp instead of comparing a dereferenced pointer:
when the address is programmed within the Walker's inline data, the memory
location is 4B aligned and not 8B aligned.

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-01 13:18:14 +02:00
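The check described in commit 744ff08454 can be sketched as a byte-wise comparison that never casts the location to a 64-bit pointer. `holdsGpuAddress` is a hypothetical helper name and not the test code from the repository.

```cpp
#include <cstdint>
#include <cstring>

// Returns true when the 8 bytes at location encode expectedAddress.
// location may be only 4-byte aligned, so it is compared byte-wise with
// memcmp rather than read through a uint64_t pointer.
bool holdsGpuAddress(const void *location, std::uint64_t expectedAddress) {
    return std::memcmp(location, &expectedAddress, sizeof(expectedAddress)) == 0;
}
```

Reading a `uint64_t` through a pointer that is not 8-byte aligned is undefined behavior in C++, whereas `std::memcmp` has no alignment requirement.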
Lukasz Jobczyk 0a11a96a53 refactor: Add dedicated method to check if any ULLS light enabled
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-31 16:36:20 +02:00
Andrzej Koska e3e01e94a0 Revert "performance: enable Direct Submission on LNL Linux"
This reverts commit cb3b4d326d.

Related-To: NEO-14517, NEO-9004

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2025-03-31 15:22:29 +02:00
Bartosz Dunajski 831b488685 fix: improve media engine handling
Related-To: NEO-14462

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-31 10:40:21 +02:00
Slawomir Milczarek 3560b016bd test: Add errno check to SysCalls wrapper for mkfifo
Related-To: NEO-11817

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-03-28 17:07:10 +01:00
Maciej Plewka a5e19330e9 fix: lock csr before locking residency controller in trim to budget path
Lock on csr is needed before lock on residency controller to prevent
incorrect lock order. Csr may be locked in waitOnCpu called from trimToBudget,
which may lead to deadlocks

Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-28 16:18:20 +01:00
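The lock ordering rule from commit a5e19330e9 can be sketched with two mutexes that every path takes in the same order. All types and functions below are hypothetical stand-ins: the real classes, the real locks, and the real waitOnCpu are not shown in the commit message.

```cpp
#include <mutex>

// Hypothetical stand-ins for the two objects named in the commit message.
struct CommandStreamReceiver {
    std::mutex ownership;
    int waits = 0;
};
struct ResidencyController {
    std::mutex lock;
    int residentCount = 0;
};

// Submission path: csr first, then the residency controller.
void submit(CommandStreamReceiver &csr, ResidencyController &controller) {
    std::lock_guard<std::mutex> csrLock(csr.ownership);
    std::lock_guard<std::mutex> residencyLock(controller.lock);
    ++controller.residentCount;
}

// Trim path: takes the csr lock up front, in the same order as submit(),
// so the wait that needs the csr never runs with only the residency lock held.
void trimToBudget(CommandStreamReceiver &csr, ResidencyController &controller) {
    std::lock_guard<std::mutex> csrLock(csr.ownership);
    std::lock_guard<std::mutex> residencyLock(controller.lock);
    ++csr.waits; // stand-in for the CPU wait, done with the csr already owned
    controller.residentCount = 0;
}
```

If the trim path took the residency lock first and only later needed the csr, a concurrent submission holding the csr and waiting for the residency lock would leave each thread waiting on the other.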
Szymon Morek 3fff3dd77b fix: set misaligned source memory 1-way coherent
Related-To: NEO-14443

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-28 14:16:45 +01:00
Compute-Runtime-Validation 88a48f1c5b Revert "performance: Improve ULLS light residency management"
This reverts commit 35eae3f977.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-28 11:21:27 +01:00
Mateusz Hoppe c105c77930 fix: calculation of os context count
Related-To: NEO-12952

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-03-28 09:55:17 +01:00
Andrzej Koska cb3b4d326d performance: enable Direct Submission on LNL Linux
Related-To: NEO-9004

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2025-03-27 17:09:54 +01:00
Lukasz Jobczyk b43b23b6ed fix: Init wait utils after hwInfo init for both OS
Resolves: HSD-18041922513

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-27 16:45:22 +01:00
Szymon Morek ead0842763 feature: add L0 API to query kernel argument info
Related-To: NEO-14358

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-27 16:43:33 +01:00
Bartosz Dunajski 85f2734ca4 fix: correct gt_id to tile_id engine mapping
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-27 15:57:57 +01:00
Dominik Dabek c76edaba4e fix: enable usm reuse limit based on memory usage
Related-To: NEO-14160, NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-27 15:14:08 +01:00
Dominik Dabek 915d657420 fix: flag to limit usm reuse based on memory usage
Host usm and device usm for igfx checks system memory usage.
Device usm for dgfx checks local memory usage.

If used memory is above limit threshold:
- no new allocations will be saved for reuse
- cleaner will use shorter hold time of 2 seconds
- cleaner will free all eligible allocations, regardless of async
deleter thread having work

Motivation: in case of gfx memory being full, making resident new
allocations will require evictions which leads to massive slowdown on
enqueue calls.
This change aims to minimize cases where extra memory usage from usm
reuse mechanism leads to above situation.

Related-To: NEO-6893, NEO-14160

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-27 10:25:19 +01:00
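The three behavior changes listed in commit 915d657420 can be summarized as a policy selected from current memory usage. The sketch below is an assumption-laden illustration: `ReusePolicy`, `selectReusePolicy`, and the parameters are hypothetical, and the normal hold time is passed in because the commit message only states the shortened value of 2 seconds.

```cpp
#include <chrono>
#include <cstdint>

// Hypothetical summary of how the usm reuse mechanism should behave.
struct ReusePolicy {
    bool saveNewAllocations;      // may freed allocations be kept for reuse?
    std::chrono::seconds holdTime; // how long the cleaner keeps an allocation
    bool ignoreDeleterBacklog;    // free eligible entries even if the async
                                  // deleter thread has work
};

// usedBytes is system memory usage (host usm, device usm on igfx) or local
// memory usage (device usm on dgfx), as described in the commit message.
ReusePolicy selectReusePolicy(std::uint64_t usedBytes, std::uint64_t limitBytes,
                              std::chrono::seconds normalHoldTime) {
    if (usedBytes > limitBytes) {
        return {false, std::chrono::seconds(2), true};
    }
    return {true, normalHoldTime, false};
}
```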
Jack Myers 0aa2c4f0cb feature: allow removal of heapful code paths
Related-To: NEO-13007

Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-03-27 01:34:35 +01:00
Damian Tomczak 0243004907 feature: additional checkers to enable feature
Resolves: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-26 18:06:20 +01:00
Mateusz Jablonski 4bc13fa0dc fix: correct MetricsLibraryGenId for Xe3
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-03-26 16:35:10 +01:00
Lukasz Jobczyk 60b551758c performance: Adjust waitpkg threshold for discrete devices
Related-To: NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-26 14:59:19 +01:00
Compute-Runtime-Validation a89113fa1a Revert "fix: Fail device init if kernel debugging is misconfigured"
This reverts commit c122bc51f9.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-26 13:27:12 +01:00
Compute-Runtime-Validation 214277125d Revert "performance: enable ULLS on LNL Linux"
This reverts commit a15de0c4f6.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-26 13:26:57 +01:00
Jaroslaw Warchulski eeeed9edd8 refactor: cleanup in product helpers
Related-To: NEO-12681
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-03-25 21:07:55 +01:00
Brandon Yates c122bc51f9 fix: Fail device init if kernel debugging is misconfigured
Also print error to stderr

Related-to: GSD-10780

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-03-25 20:40:25 +01:00
Jaroslaw Warchulski 75119f1887 test: add compression_tests_xe2_and_later
Related-To: NEO-9465

Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-03-25 18:15:20 +01:00
Lukasz Jobczyk 35eae3f977 performance: Improve ULLS light residency management
-skip bo::wait if completion fence not needed
-do not restart exec on evict

Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-24 17:48:55 +01:00
Pratik Bari f67663d863 fix(sysman): Create Engine Handles for multiple gt w.r.t Xe driver
For the Xe driver, a single tile can have multiple gts. Now the Engine
handles are created for multiple gts available on the single tile as in
the case for the Xe driver.

Related-To: NEO-14401

Signed-off-by: Pratik Bari <pratik.bari@intel.com>
2025-03-24 17:14:02 +01:00
Szymon Morek ad49418575 test: change incorrect name of constant
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-24 17:13:45 +01:00
Kamil Kopryk 73795ced64 refactor: add setupTimestampPacketFlushL3 function
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-24 13:10:03 +01:00
Damian Tomczak 26c23460cb fix: set heapless mode for builtins compilation
Related-to: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-24 12:01:58 +01:00
Lukasz Jobczyk 8fbde30432 fix: Adjust waitpkg threshold to 12 us
Resolves: NEO-14434, NEO-14435
Related-To: NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-24 11:43:54 +01:00
Szymon Morek a15de0c4f6 performance: enable ULLS on LNL Linux
Related-To: NEO-9004

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-24 10:44:23 +01:00
Tomasz Biernacik f4f13dfeec Revert "performance: disable cpu caching for buffers on PTL"
This reverts commit c3f0c4e4ac.

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-03-24 10:23:07 +01:00
Dominik Dabek 6e998fc3c1 fix: move host usm reuse max size to mem manager
Initialize value on memory manager creation.

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-24 08:53:30 +01:00
Lukasz Jobczyk 54cb0e24f8 performance: Switch waitpkg use to tpause for ULLS light
Related-To: NEO-13922, NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-21 18:09:37 +01:00
Szymon Morek bb10290828 fix: make misaligned user memory 2-Way coherent
Related-To: NEO-9004

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-21 17:56:37 +01:00
Lukasz Jobczyk 7c75807987 performance: Enable waitpkg with tpause
Resolves: NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-21 17:30:55 +01:00
Compute-Runtime-Validation 28dd4cf4d8 Revert "feature: additional checkers to enable feature"
This reverts commit c95a67b420.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-21 16:48:05 +01:00
Maciej Plewka 1b7bf690d2 fix: update fence values when residency list passed to switch ring buffer
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-21 12:26:08 +01:00
Lukasz Jobczyk 8a85a96ed2 feature: Add 3-level wait scheme with tpause intrinsic
Related-To: NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-21 12:12:57 +01:00
Aravind Gopalakrishnan 4e9afb32e6 refactor: Fix coverage issue with parse ccs mode
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-03-21 11:12:24 +01:00
Damian Tomczak c95a67b420 feature: additional checkers to enable feature
Resolves: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-21 10:38:53 +01:00
Slawomir Milczarek 9e0e485166 test: Extended SysCalls wrapper with flag to fail access function
Related-To: NEO-12952

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-03-21 02:37:17 +01:00
Chodor, Jaroslaw 0bff9def3b feature: New allocators for opaque arrays
Adds fast allocators for opaque objects of uniform size.

Related-To: NEO-13406

Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-03-20 21:59:44 +01:00
Damian Tomczak 1995a7fbcf feature: additional fixes to enable feature and adjusting builtins types
Resolves: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-20 14:23:08 +01:00
Lukasz Jobczyk 75461d190d performance: Adjust umwait counter value
Related-To: NEO-13922, NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-20 13:57:25 +01:00
Damian Tomczak e6ff523a8e fix: heapless has the highest priority as builtin
Related-to: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-20 08:10:44 +01:00
Michal Mrozek 6a9716b81c refactor: remove not needed code
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-03-19 14:53:57 +01:00
Aravind Gopalakrishnan 724ba20e41 fix: Parse CCS mode setting for non PVC platforms
Related-To: GSD-8785

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-03-19 12:47:46 +01:00
Jakub Nowacki 356557f202 build: enable aub test runners for xe_hpg_core
Related-To: NEO-7579

Signed-off-by: Jakub Nowacki <jakub.nowacki@intel.com>
2025-03-19 12:05:48 +01:00
Lukasz Jobczyk 6cb52f71b4 fix: Avoid mutex deadlock when switch ulls light ring buffer
Related-To: NEO-14406

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-19 11:47:40 +01:00
Jitendra Sharma dda7876d3a feature: Introduce SipClassType::externalLib
Related-To: NEO-13737
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2025-03-19 11:06:03 +01:00
Jack Myers 0e25970853 fix: re-add switch case for once writable query
A change related to the tbx fault manager
incorrectly removed a switch case from
`AubHelper::isOneTimeAubWritableAllocationType`.

This fixes that and refactors some APIs to prevent
similar mistakes from happening again by cleaning
up logic.

Addresses show stopper for pre-si pytorch workflows.

Resolves: NEO-14399
Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-03-19 09:54:54 +01:00
Kamil Kopryk 2e729bcb4c refactor: move isTimestampWaitSupportedForQueues to productHelper
Related-to: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-19 09:31:33 +01:00
Compute-Runtime-Validation 5f7f0dd785 Revert "performance: Enable waitpkg"
This reverts commit 8ec5434460.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-18 13:28:29 +01:00
Maciej Plewka 36fa6d66ae fix: lock csr in stopDirectSubmission if needed
Related-To: NEO-13875, NEO-14143, HSD-16026538384, HSD-16026780358
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-18 09:30:38 +01:00
Compute-Runtime-Validation ce2fbb87ba Revert "feature: additional checkers to enable feature"
This reverts commit dbd7d00b27.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-18 03:12:47 +01:00