Commit Graph

7865 Commits

Author SHA1 Message Date
Zbigniew Zdanowicz
58fe89e116 fix: remove doubled memory prefetch operation when executing command list
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-04-04 13:55:16 +02:00
Lukasz Jobczyk
ecf8a07d26 performance: Do not create global fence allocation on integrated
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-04 11:45:22 +02:00
Dominik Dabek
3703ff550c fix: use real size when putting into usm reuse
Real allocation size should be used to properly apply limits and allow
more usm reuse hits.

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-04-04 09:44:32 +02:00
Chandio, Bibrak Qamar
f344eb9bca test: ULT for makeResidentResources
Related-To: NEO-14056

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-04-04 05:55:08 +02:00
Bartosz Dunajski
f99870e716 fix: improve media handling 2
Related-To: NEO-14462

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-04-03 19:11:53 +02:00
Mateusz Jablonski
bb518adf34 fix: patching payload arguments in inline data in case of indirect kernel
Related-To: NEO-14532
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-03 17:21:28 +02:00
Szymon Morek
95e0244f70 fix: properly pass info about 3D image
Related-To: NEO-14538

It's valid for 3D image to copy 2D region.
Current checks for mip map do not consider that.
This change correctly checks for mip mapped 3D image.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 16:33:57 +02:00
Slawomir Milczarek
7e7e0a000f refactor: Add ioctl helper for context destruction
Related-To: NEO-11817

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-04-03 16:08:53 +02:00
Dominik Dabek
be27367020 performance: usm reuse, avoid looking up svmData
Save svmData on putting into reuse, instead of searching each time.
Change UNRECOVERABLE_IF to DEBUG_BREAK_IF.

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-04-03 15:50:49 +02:00
Jaroslaw Warchulski
c010d17842 fix: respect compression flag in capability table
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-03 15:36:55 +02:00
Szymon Morek
6ea83f322d fix: do not override user data beyond slice region
Related-To: NEO-14538

If user passes slice pitch which is larger than region
to copy, do not override memory beyond region but within
that slice pitch.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 12:37:35 +02:00
Jaroslaw Warchulski
62baf28316 fix: remove unnecesarry WA for DG2 compression
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-03 08:04:19 +02:00
Filip Hazubski
bc87b1cff0 test: Minor test improvements
Related-To: NEO-14526

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-04-03 07:53:18 +02:00
Brandon Yates
4651e72b0b fix: Fail device init if kernel debugging is misconfigured
Also print error to stderr

Related-to: GSD-10780

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-04-02 21:06:30 +02:00
Filip Hazubski
504440fc4d feature: Add ftrHeaplessMode flag
Pass hwInfo to isHeaplessModeEnabled and isForceBindlessRequired functions.

Related-To: NEO-14526

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-04-02 21:06:05 +02:00
Lukasz Jobczyk
deca36fd32 fix: Stop ULLS light when evict resource
Related-To: NEO-14406

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-02 16:37:43 +02:00
Bartosz Dunajski
bb3927531e refactor: reduce HWTEST2_F usage in test_in_order_cmdlist_1.cpp
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-04-02 14:29:12 +02:00
Fabian Zwoliński
7ef3880793 feature: implement pool allocator for gpuTimestampDeviceBuffer
The patch applies to Level Zero.
Only allocations < 2MB will be fetched from the pool.
Allocations are shared and reused within a given device.

Additionally, I added a new debug flag to control the allocator:
EnableTimestampPoolAllocator

Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-02 14:28:56 +02:00
Szymon Morek
8836f6df0b fix: forward mip map level for 3D images
Related-To: NEO-14539

1D and 2D images have already mip map level set correctly.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-02 11:46:41 +02:00
Aravind Gopalakrishnan
3a7d7e022c fix: Add platform support for reservation on svm heap
Related-To: GSD-10816

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-04-02 02:46:30 +02:00
Brandon Yates
a48d66ad75 feature: Add programExceptions stub to CSR
Related-to: NEO-12967

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-04-01 18:33:40 +02:00
Szymon Morek
3010af596e performance: add infrastructure for staging with 3D images
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-01 15:30:30 +02:00
Mateusz Jablonski
ed37a1e7ef build: remove not needed flag for builtins compilation
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-01 14:18:49 +02:00
Mateusz Jablonski
744ff08454 test: correct verifying programmed GPU addresses
use memcmp instead of comparing dereferenced pointer
when address is programmed within Walker's inline data the memory location
address is 4B aligned and is not 8B aligned

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-01 13:18:14 +02:00
Vysochyn, Illia
70af2bc20b refactor: Adjust size to preferred SLM values array
Related-To: NEO-14479

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-04-01 11:56:50 +02:00
Chandio, Bibrak Qamar
2ba2970492 performance: Waiting on make resident Windows
Related-To: NEO-14056

No need to explicitly wait on Windows KMD during make resident as it has
a while loop that does it nevertheless. The KMD wait affects the API
overhead of zeCommandQueueExecuteCommandLists some platforms (MTL, and ARL).

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-04-01 00:12:45 +02:00
Lukasz Jobczyk
0a11a96a53 refactor: Add dedicated method to check if any ULLS light enabled
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-31 16:36:20 +02:00
Andrzej Koska
e3e01e94a0 Revert "performance: enable Direct Submission on LNL Linux"
This reverts commit cb3b4d326d.

Related-To: NEO-14517, NEO-9004

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2025-03-31 15:22:29 +02:00
Szymon Morek
62964a0b08 fix: invalidate caches when heap is placed into reuse list
Related-To: NEO-9004

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-31 12:30:29 +02:00
Bartosz Dunajski
831b488685 fix: improve media engine handling
Related-To: NEO-14462

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-31 10:40:21 +02:00
Slawomir Milczarek
3560b016bd test: Add errno check to SysCalls wrapper for mkfifo
Related-To: NEO-11817

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-03-28 17:07:10 +01:00
Maciej Plewka
a5e19330e9 fix: lock csr before locking residency controller in trim to budget path
Lock on csr is needed before lock on residency controller to prevent
incorrect lock order. Csr may be locked in waitOnCpu called from trimToBudget,
which may lead to deadlocks

Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-28 16:18:20 +01:00
Szymon Morek
3fff3dd77b fix: set misaligned source memory 1-way coherent
Related-To: NEO-14443

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-28 14:16:45 +01:00
Filip Hazubski
3d9fc8968e fix: Add BMG device id
Add device ID: 0xE211

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-03-28 13:40:30 +01:00
Katarzyna Cencelewska
92e40afc49 feature: update debug flag DirectSubmissionPrintSemaphoreUsage
instead of printf use makro that make flush after printf

Related-To: HSD-14024170600
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2025-03-28 13:36:15 +01:00
Compute-Runtime-Validation
88a48f1c5b Revert "performance: Improve ULLS light residency management"
This reverts commit 35eae3f977.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-28 11:21:27 +01:00
Mateusz Hoppe
c105c77930 fix: calculation of os context count
Related-To: NEO-12952

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-03-28 09:55:17 +01:00
Andrzej Koska
cb3b4d326d performance: enable Direct Submission on LNL Linux
Related-To: NEO-9004

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2025-03-27 17:09:54 +01:00
Lukasz Jobczyk
b43b23b6ed fix: Init wait utils after hwInfo init for both OS
Resolves: HSD-18041922513

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-27 16:45:22 +01:00
Szymon Morek
ead0842763 feature: add L0 API to query kernel argument info
Related-To: NEO-14358

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-27 16:43:33 +01:00
Bartosz Dunajski
85f2734ca4 fix: correct gt_id to tile_id engine mapping
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-27 15:57:57 +01:00
Dominik Dabek
c76edaba4e fix: enable usm reuse limit based on memory usage
Related-To: NEO-14160, NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-27 15:14:08 +01:00
Dominik Dabek
915d657420 fix: flag to limit usm reuse based on memory usage
Host usm and device usm for igfx checks system memory usage.
Device usm for dgfx checks local memory usage.

If used memory is above limit threshold:
- no new allocations will be saved for reuse
- cleaner will use shorter hold time of 2 seconds
- cleaner will free all eligible allocations, regardless of async
deleter thread having work

Motivation: in case of gfx memory being full, making resident new
allocations will require evictions which leads to massive slowdown on
enqueue calls.
This change aims to minimize cases where extra memory usage from usm
reuse mechanism leads to above situation.

Related-To: NEO-6893, NEO-14160

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-27 10:25:19 +01:00
Jack Myers
0aa2c4f0cb feature: allow removal of heapful code paths
Related-To: NEO-13007

Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-03-27 01:34:35 +01:00
Damian Tomczak
0243004907 feature: additional checkers to enable feature
Resolves: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-26 18:06:20 +01:00
Mateusz Jablonski
4bc13fa0dc fix: correct MetricsLibraryGenId for Xe3
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-03-26 16:35:10 +01:00
Lukasz Jobczyk
60b551758c performance: Adjust waitpkg threshold for discrete devices
Related-To: NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-26 14:59:19 +01:00
Compute-Runtime-Validation
a89113fa1a Revert "fix: Fail device init if kernel debugging is misconfigured"
This reverts commit c122bc51f9.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-26 13:27:12 +01:00
Compute-Runtime-Validation
214277125d Revert "performance: enable ULLS on LNL Linux"
This reverts commit a15de0c4f6.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-26 13:26:57 +01:00
Jaroslaw Warchulski
eeeed9edd8 refactor: cleanup in product helpers
Related-To: NEO-12681
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-03-25 21:07:55 +01:00