Commit Graph

7069 Commits

Author SHA1 Message Date
Dominik Dabek
2174389c4e fix: disable L0 USM pool on dg2, ats
Related-To: NEO-15167, HSD-18042525630

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-06-09 12:05:48 +02:00
Szymon Morek
6a7f6f1ad6 fix: add noexcept to move operators
Without it, copy operator is selected, even if move is explicitly defined.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-06-09 10:32:23 +02:00
Szymon Morek
e030086cf2 performance: reorder checks for staging buffer
defer registerHostPtr call to the last check.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-06-09 10:18:35 +02:00
Brandon Yates
540fa19894 fix: mark resources for capture even when debugger is not init
Some allocations are created before debugger is initialized. This change
will ensure they get marked for capture.

Debugger Enabled check added to bind path where capture flag is set

Related-to: GSD-10856

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-06-04 16:21:22 +02:00
Bartosz Dunajski
2399492b74 refactor: debug flag to override max mem alloc size
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-06-04 15:16:34 +02:00
Mateusz Jablonski
1b2db22238 refactor: simplify compare operator for RuntimeCapabilityTable
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-06-03 20:20:22 +02:00
Maciej Bielski
d667614a4f fix: set gmm LocalOnly flag regardless of extraMemoryFlags
Related-To: NEO-13428
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-06-03 14:00:34 +02:00
Bartosz Dunajski
a015188166 refactor: unify prefetch encode methods
Related-To: NEO-14703

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-06-03 10:07:26 +02:00
Bartosz Dunajski
9647612f98 fix: update prefetch mocs settings
Related-To: NEO-14703

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-06-02 13:22:42 +02:00
Szymon Morek
48b4a33645 Revert "performance: enable low latency hint on Xe"
This reverts commit b74dc9b7e1.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-06-02 12:18:43 +02:00
Dominik Dabek
6d8188bc56 fix: enable usm device pool api wise
For OCL usm device pool enabled on MTL, LNL, PTL
For L0 usm device pool enabled on PTL, disabled on MTL, LNL

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-06-02 10:28:15 +02:00
Andrzej Koska
801a378f42 fix: AIL table extension
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2025-05-30 22:07:04 +02:00
Chodor, Jaroslaw
46c60290b9 refactor: introducing igfxfmid_wrapper
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-05-30 19:26:04 +02:00
Lukasz Jobczyk
48206fe848 performance: Remove release fence from command stream on xe2
Resolves: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-30 18:50:31 +02:00
Compute-Runtime-Validation
1a50e8e7c0 Revert "performance: use RESOURCE_BARRIER as stalling barrier"
This reverts commit 556c0b64c6.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-05-30 18:00:34 +02:00
Katarzyna Cencelewska
828d6bafa7 fix: return proper value for zeKernelSuggestGroupSize
Resolves: HSD-18042274687
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2025-05-30 16:35:21 +02:00
Radoslaw Jablonski
2762166a7b feature: add device caps query ioctl
Related-To: NEO-13039

Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com>
2025-05-30 16:28:28 +02:00
Fabian Zwoliński
e2228201ce fix: Avoid redundant padding in ISA allocations
- Add isaPaddingIncluded flag to AllocationProperties to control ISA
padding
- Modify MemoryManager to skip automatic padding when flag is set
- Update ISAPoolAllocator to manage padding explicitly and set the
flag
- Use actual buffer size for HeapAllocator instead of requested size
- Add test covering different padding scenarios

This prevents double padding addition which was wasting memory when
ISAPoolAllocator managed its own padding on top of MemoryManager's
automatic padding.

Related-To: NEO-12287, HSD-18042276431
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-05-30 15:40:38 +02:00
Lukasz Jobczyk
332340b02b refactor: Prework for release fence removal leaving acquire fence
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-30 12:01:28 +02:00
Grochowski, Stanislaw
2d9108f8b6 refactor: change DefaultComputeWalker
Related-To: NEO-14537

Signed-off-by: Grochowski, Stanislaw <stanislaw.grochowski@intel.com>
2025-05-29 18:05:29 +02:00
Filip Hazubski
04965470fb fix: Remove redundant debug key
OverrideNumThreadsPerEu debug variable can be used
instead of removed Enable10ThreadsPerEu.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-05-29 16:58:04 +02:00
Katarzyna Cencelewska
e357e7e404 fix: correct limitation for num threads per thread group
taking into account the max work group limit
when simd1 don't use the same limitation as for simd32

Resolves: NEO-14922
Related-To: NEO-11881
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2025-05-29 16:34:40 +02:00
Dominik Dabek
ad4d7a2ce8 fix: l0 usm pooling, ipc handling
Move getting usm pool for ptr to its own method.
Move trying to free via pooling to its own method.

Use base ptr of pool for tracking in IPCHandleMap.
Track ipc handle refcount in freeMem.
Return error when trying to use a not allocated ptr within pool.

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-05-29 16:07:19 +02:00
Lukasz Jobczyk
19361031fe refactor: Make SemaphoreFenceHelper compliant with rule of three
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-29 15:56:43 +02:00
Maciej Plewka
577c99fe9c fix: clear residency container when failed to make resident
Related-To: NEO-14143
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-05-29 14:56:35 +02:00
Mateusz Jablonski
60dfdda1a3 fix: don't fail initialization for small bar config in i915 path
https://github.com/intel/compute-runtime/issues/831
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-05-29 12:38:09 +02:00
Young Jin Yoon
2773bd8ec0 fix: estimate blitter commands for memory fill
Added getNumberOfBlitsForColorFill, getNumberofBlitsForByteFill,
and getNumberOfBlitsForFill in BlitCommandsHelper to estimate
total number of blitter command generated for
dispatchBlitMemoryColorFill, dispatchBlitMemoryByteFill, and
dispatchBlitMemoryFill, respectively.

Use getNumberOfBlitsForColorFill to determine whether
additionalTimestampNode is required for appendMemoryBlitFill.

Related-To: NEO-13003
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-05-29 00:21:35 +02:00
Vysochyn, Illia
3989471ffc fix: Use round robin after stall as a default arbitration mode
Related-To: NEO-14251,HSD-18042035684,HSD-18041638491,HSD-18042040816

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-05-28 17:47:48 +02:00
Szymon Morek
556c0b64c6 performance: use RESOURCE_BARRIER as stalling barrier
Related-To: NEO-14943

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-28 12:30:01 +02:00
Maciej Plewka
e6f3ebce5d fix: add pc with stall before barrier with post sync on bmg
Related-To: NEO-14491
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-05-28 09:24:21 +02:00
Kamil Kopryk
b7681a2e7b refactor: redirect flush L3 host usm to external by default
Related-To: NEO-13163

Rename ForceL3FlushAfterPostSync to EnableL3FlushAfterPostSync
Rename DisableFlushL3ForHostUsm to RedirectFlushL3HostUsmToExternal

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-05-28 08:50:01 +02:00
Aleksandra Nizio
1dfc9227c4 fix: Removind address_mapper.h
Related-To: NEO-14718
Signed-off-by: Aleksandra Nizio <aleksandra.nizio@intel.com>
2025-05-27 20:37:56 +02:00
Chen, Kai
1484e43bb7 feature: Add hasLscStoresWithNonDefaultL1CacheControls flag
Related-to: NEO-14563

Signed-off-by: Chen, Kai <kai.chen@intel.com>
2025-05-27 20:11:01 +02:00
Aleksandra Nizio
e43ec2bbfd fix: Removing stream
Related-To: NEO-14718
Signed-off-by: Aleksandra Nizio <aleksandra.nizio@intel.com>
2025-05-27 18:53:02 +02:00
Bartosz Dunajski
9c19e9eb96 fix: enable single temporary allocations list mode 2
Related-To: NEO-14641

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-05-27 17:35:17 +02:00
Mateusz Jablonski
72e1ec3b42 fix: pass additional adapter info caps to gmmlib for wddm windows
Related-To: HSD-18041684792
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-05-27 12:43:31 +02:00
Lukasz Jobczyk
a0f683e42b fix: Move evict after unlock to wddm layer
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-27 10:52:23 +02:00
Vysochyn, Illia
2168c43a67 feature(zebin): Update version
Related-To: IGC-11358
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-05-27 10:24:57 +02:00
Lukasz Jobczyk
2e9643f46c performance: Enable small buffer pool allocator on PTL
Resolves: NEO-14817

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-26 22:07:50 +02:00
Mateusz Hoppe
87e2114e42 feature(zebin): rename attribute to "require_iab"
- implicit_arg_buffer_used_by_code is replaced with require_iab

Related-To: NEO-14667

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-05-26 16:27:47 +02:00
Aleksandra Nizio
31fe1978d4 fix: Removing streamProvider and addressMapper
Related-To: NEO-14718
Signed-off-by: Aleksandra Nizio <aleksandra.nizio@intel.com>
2025-05-26 16:01:18 +02:00
Compute-Runtime-Validation
42a8b63725 Revert "fix: enable single temporary allocations list mode"
This reverts commit b496274d1a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-05-26 13:22:15 +02:00
Compute-Runtime-Validation
65562f83f8 Revert "fix: program 8 maxBVHLevels as 0 - special case"
This reverts commit d247358fdf.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-05-24 02:40:27 +02:00
Jaroslaw Warchulski
48fbf4c1d3 feature: enable bindless mode and global bindless allocator in L0 on ARL
Related-To: NEO-14995
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-05-23 17:34:29 +02:00
Mateusz Hoppe
c4bfe4e487 feature(zebin): add implicit_arg_buffer_used_by_code exec env support
Resolves: NEO-14667

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-05-23 17:11:14 +02:00
Kamil Kopryk
d247358fdf fix: program 8 maxBVHLevels as 0 - special case
Related-To: NEO-14423

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-05-23 16:37:37 +02:00
Filip Hazubski
3dc32cec88 feature: Add debug key to override number of threads per EU
Related-To: HSD-22021012755, NEO-14749

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-05-23 16:20:40 +02:00
Lukasz Jobczyk
0e3397547c fix: Adjust waitpkg counter for non ulls light
Resolves: NEO-14866, HSD-18042153820

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-23 15:58:21 +02:00
Mateusz Jablonski
98bf872fdd fix: return error when cannot obtain debug surface size
remove default max debug surface size
check state save area size only for debug scenarios
reduce state save area size in unit tests - rely on values from mock

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-05-23 14:48:08 +02:00
Lukasz Jobczyk
86e7d5b276 performance: Skip fence after PC when no post sync write
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-23 10:33:28 +02:00