Zbigniew Zdanowicz
58fe89e116
fix: remove doubled memory prefetch operation when executing command list
...
Related-To: NEO-10356
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-04-04 13:55:16 +02:00
Lukasz Jobczyk
ecf8a07d26
performance: Do not create global fence allocation on integrated
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-04-04 11:45:22 +02:00
Dominik Dabek
3703ff550c
fix: use real size when putting into usm reuse
...
Real allocation size should be used to properly apply limits and allow
more usm reuse hits.
Related-To: NEO-6893
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2025-04-04 09:44:32 +02:00
Chandio, Bibrak Qamar
f344eb9bca
test: ULT for makeResidentResources
...
Related-To: NEO-14056
Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com >
2025-04-04 05:55:08 +02:00
Bartosz Dunajski
f99870e716
fix: improve media handling 2
...
Related-To: NEO-14462
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-04-03 19:11:53 +02:00
Mateusz Jablonski
bb518adf34
fix: patching payload arguments in inline data in case of indirect kernel
...
Related-To: NEO-14532
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-04-03 17:21:28 +02:00
Szymon Morek
95e0244f70
fix: properly pass info about 3D image
...
Related-To: NEO-14538
It's valid for 3D image to copy 2D region.
Current checks for mip map do not consider that.
This change correctly checks for mip mapped 3D image.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-04-03 16:33:57 +02:00
Slawomir Milczarek
7e7e0a000f
refactor: Add ioctl helper for context destruction
...
Related-To: NEO-11817
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com >
2025-04-03 16:08:53 +02:00
Dominik Dabek
be27367020
performance: usm reuse, avoid looking up svmData
...
Save svmData on putting into reuse, instead of searching each time.
Change UNRECOVERABLE_IF to DEBUG_BREAK_IF.
Related-To: NEO-6893
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2025-04-03 15:50:49 +02:00
Jaroslaw Warchulski
c010d17842
fix: respect compression flag in capability table
...
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2025-04-03 15:36:55 +02:00
Szymon Morek
6ea83f322d
fix: do not override user data beyond slice region
...
Related-To: NEO-14538
If user passes slice pitch which is larger than region
to copy, do not override memory beyond region but within
that slice pitch.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-04-03 12:37:35 +02:00
Jaroslaw Warchulski
62baf28316
fix: remove unnecesarry WA for DG2 compression
...
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2025-04-03 08:04:19 +02:00
Filip Hazubski
bc87b1cff0
test: Minor test improvements
...
Related-To: NEO-14526
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-04-03 07:53:18 +02:00
Brandon Yates
4651e72b0b
fix: Fail device init if kernel debugging is misconfigured
...
Also print error to stderr
Related-to: GSD-10780
Signed-off-by: Brandon Yates <brandon.yates@intel.com >
2025-04-02 21:06:30 +02:00
Filip Hazubski
504440fc4d
feature: Add ftrHeaplessMode flag
...
Pass hwInfo to isHeaplessModeEnabled and isForceBindlessRequired functions.
Related-To: NEO-14526
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-04-02 21:06:05 +02:00
Lukasz Jobczyk
deca36fd32
fix: Stop ULLS light when evict resource
...
Related-To: NEO-14406
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-04-02 16:37:43 +02:00
Bartosz Dunajski
bb3927531e
refactor: reduce HWTEST2_F usage in test_in_order_cmdlist_1.cpp
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-04-02 14:29:12 +02:00
Fabian Zwoliński
7ef3880793
feature: implement pool allocator for gpuTimestampDeviceBuffer
...
The patch applies to Level Zero.
Only allocations < 2MB will be fetched from the pool.
Allocations are shared and reused within a given device.
Additionally, I added a new debug flag to control the allocator:
EnableTimestampPoolAllocator
Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com >
2025-04-02 14:28:56 +02:00
Szymon Morek
8836f6df0b
fix: forward mip map level for 3D images
...
Related-To: NEO-14539
1D and 2D images have already mip map level set correctly.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-04-02 11:46:41 +02:00
Aravind Gopalakrishnan
3a7d7e022c
fix: Add platform support for reservation on svm heap
...
Related-To: GSD-10816
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com >
2025-04-02 02:46:30 +02:00
Brandon Yates
a48d66ad75
feature: Add programExceptions stub to CSR
...
Related-to: NEO-12967
Signed-off-by: Brandon Yates <brandon.yates@intel.com >
2025-04-01 18:33:40 +02:00
Szymon Morek
3010af596e
performance: add infrastructure for staging with 3D images
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-04-01 15:30:30 +02:00
Mateusz Jablonski
ed37a1e7ef
build: remove not needed flag for builtins compilation
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-04-01 14:18:49 +02:00
Mateusz Jablonski
744ff08454
test: correct verifying programmed GPU addresses
...
use memcmp instead of comparing dereferenced pointer
when address is programmed within Walker's inline data the memory location
address is 4B aligned and is not 8B aligned
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-04-01 13:18:14 +02:00
Vysochyn, Illia
70af2bc20b
refactor: Adjust size to preferred SLM values array
...
Related-To: NEO-14479
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com >
2025-04-01 11:56:50 +02:00
Chandio, Bibrak Qamar
2ba2970492
performance: Waiting on make resident Windows
...
Related-To: NEO-14056
No need to explicitly wait on Windows KMD during make resident as it has
a while loop that does it nevertheless. The KMD wait affects the API
overhead of zeCommandQueueExecuteCommandLists some platforms (MTL, and ARL).
Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com >
2025-04-01 00:12:45 +02:00
Lukasz Jobczyk
0a11a96a53
refactor: Add dedicated method to check if any ULLS light enabled
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-03-31 16:36:20 +02:00
Andrzej Koska
e3e01e94a0
Revert "performance: enable Direct Submission on LNL Linux"
...
This reverts commit cb3b4d326d .
Related-To: NEO-14517, NEO-9004
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com >
2025-03-31 15:22:29 +02:00
Szymon Morek
62964a0b08
fix: invalidate caches when heap is placed into reuse list
...
Related-To: NEO-9004
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-03-31 12:30:29 +02:00
Bartosz Dunajski
831b488685
fix: improve media engine handling
...
Related-To: NEO-14462
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-03-31 10:40:21 +02:00
Slawomir Milczarek
3560b016bd
test: Add errno check to SysCalls wrapper for mkfifo
...
Related-To: NEO-11817
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com >
2025-03-28 17:07:10 +01:00
Maciej Plewka
a5e19330e9
fix: lock csr before locking residency controller in trim to budget path
...
Lock on csr is needed before lock on residency controller to prevent
incorrect lock order. Csr may be locked in waitOnCpu called from trimToBudget,
which may lead to deadlocks
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2025-03-28 16:18:20 +01:00
Szymon Morek
3fff3dd77b
fix: set misaligned source memory 1-way coherent
...
Related-To: NEO-14443
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-03-28 14:16:45 +01:00
Filip Hazubski
3d9fc8968e
fix: Add BMG device id
...
Add device ID: 0xE211
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-03-28 13:40:30 +01:00
Katarzyna Cencelewska
92e40afc49
feature: update debug flag DirectSubmissionPrintSemaphoreUsage
...
instead of printf use makro that make flush after printf
Related-To: HSD-14024170600
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2025-03-28 13:36:15 +01:00
Compute-Runtime-Validation
88a48f1c5b
Revert "performance: Improve ULLS light residency management"
...
This reverts commit 35eae3f977 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-03-28 11:21:27 +01:00
Mateusz Hoppe
c105c77930
fix: calculation of os context count
...
Related-To: NEO-12952
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-03-28 09:55:17 +01:00
Andrzej Koska
cb3b4d326d
performance: enable Direct Submission on LNL Linux
...
Related-To: NEO-9004
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com >
2025-03-27 17:09:54 +01:00
Lukasz Jobczyk
b43b23b6ed
fix: Init wait utils after hwInfo init for both OS
...
Resolves: HSD-18041922513
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-03-27 16:45:22 +01:00
Szymon Morek
ead0842763
feature: add L0 API to query kernel argument info
...
Related-To: NEO-14358
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-03-27 16:43:33 +01:00
Bartosz Dunajski
85f2734ca4
fix: correct gt_id to tile_id engine mapping
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-03-27 15:57:57 +01:00
Dominik Dabek
c76edaba4e
fix: enable usm reuse limit based on memory usage
...
Related-To: NEO-14160, NEO-6893
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2025-03-27 15:14:08 +01:00
Dominik Dabek
915d657420
fix: flag to limit usm reuse based on memory usage
...
Host usm and device usm for igfx checks system memory usage.
Device usm for dgfx checks local memory usage.
If used memory is above limit threshold:
- no new allocations will be saved for reuse
- cleaner will use shorter hold time of 2 seconds
- cleaner will free all eligible allocations, regardless of async
deleter thread having work
Motivation: in case of gfx memory being full, making resident new
allocations will require evictions which leads to massive slowdown on
enqueue calls.
This change aims to minimize cases where extra memory usage from usm
reuse mechanism leads to above situation.
Related-To: NEO-6893, NEO-14160
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2025-03-27 10:25:19 +01:00
Jack Myers
0aa2c4f0cb
feature: allow removal of heapful code paths
...
Related-To: NEO-13007
Signed-off-by: Jack Myers <jack.myers@intel.com >
2025-03-27 01:34:35 +01:00
Damian Tomczak
0243004907
feature: additional checkers to enable feature
...
Resolves: NEO-13973
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2025-03-26 18:06:20 +01:00
Mateusz Jablonski
4bc13fa0dc
fix: correct MetricsLibraryGenId for Xe3
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-03-26 16:35:10 +01:00
Lukasz Jobczyk
60b551758c
performance: Adjust waitpkg threshold for discrete devices
...
Related-To: NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-03-26 14:59:19 +01:00
Compute-Runtime-Validation
a89113fa1a
Revert "fix: Fail device init if kernel debugging is misconfigured"
...
This reverts commit c122bc51f9 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-03-26 13:27:12 +01:00
Compute-Runtime-Validation
214277125d
Revert "performance: enable ULLS on LNL Linux"
...
This reverts commit a15de0c4f6 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-03-26 13:26:57 +01:00
Jaroslaw Warchulski
eeeed9edd8
refactor: cleanup in product helpers
...
Related-To: NEO-12681
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2025-03-25 21:07:55 +01:00