Commit Graph

7052 Commits

Author SHA1 Message Date
Fabian Zwoliński
e2228201ce fix: Avoid redundant padding in ISA allocations
- Add isaPaddingIncluded flag to AllocationProperties to control ISA
padding
- Modify MemoryManager to skip automatic padding when flag is set
- Update ISAPoolAllocator to manage padding explicitly and set the
flag
- Use actual buffer size for HeapAllocator instead of requested size
- Add test covering different padding scenarios

This prevents double padding addition which was wasting memory when
ISAPoolAllocator managed its own padding on top of MemoryManager's
automatic padding.

Related-To: NEO-12287, HSD-18042276431
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-05-30 15:40:38 +02:00
Lukasz Jobczyk
332340b02b refactor: Prework for release fence removal leaving acquire fence
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-30 12:01:28 +02:00
Grochowski, Stanislaw
2d9108f8b6 refactor: change DefaultComputeWalker
Related-To: NEO-14537

Signed-off-by: Grochowski, Stanislaw <stanislaw.grochowski@intel.com>
2025-05-29 18:05:29 +02:00
Filip Hazubski
04965470fb fix: Remove redundant debug key
OverrideNumThreadsPerEu debug variable can be used
instead of removed Enable10ThreadsPerEu.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-05-29 16:58:04 +02:00
Katarzyna Cencelewska
e357e7e404 fix: correct limitation for num threads per thread group
taking into account the max work group limit
when simd1 don't use the same limitation as for simd32

Resolves: NEO-14922
Related-To: NEO-11881
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2025-05-29 16:34:40 +02:00
Dominik Dabek
ad4d7a2ce8 fix: l0 usm pooling, ipc handling
Move getting usm pool for ptr to its own method.
Move trying to free via pooling to its own method.

Use base ptr of pool for tracking in IPCHandleMap.
Track ipc handle refcount in freeMem.
Return error when trying to use a not allocated ptr within pool.

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-05-29 16:07:19 +02:00
Lukasz Jobczyk
19361031fe refactor: Make SemaphoreFenceHelper compliant with rule of three
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-29 15:56:43 +02:00
Maciej Plewka
577c99fe9c fix: clear residency container when failed to make resident
Related-To: NEO-14143
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-05-29 14:56:35 +02:00
Mateusz Jablonski
60dfdda1a3 fix: don't fail initialization for small bar config in i915 path
https://github.com/intel/compute-runtime/issues/831
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-05-29 12:38:09 +02:00
Young Jin Yoon
2773bd8ec0 fix: estimate blitter commands for memory fill
Added getNumberOfBlitsForColorFill, getNumberofBlitsForByteFill,
and getNumberOfBlitsForFill in BlitCommandsHelper to estimate
total number of blitter command generated for
dispatchBlitMemoryColorFill, dispatchBlitMemoryByteFill, and
dispatchBlitMemoryFill, respectively.

Use getNumberOfBlitsForColorFill to determine whether
additionalTimestampNode is required for appendMemoryBlitFill.

Related-To: NEO-13003
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-05-29 00:21:35 +02:00
Vysochyn, Illia
3989471ffc fix: Use round robin after stall as a default arbitration mode
Related-To: NEO-14251,HSD-18042035684,HSD-18041638491,HSD-18042040816

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-05-28 17:47:48 +02:00
Szymon Morek
556c0b64c6 performance: use RESOURCE_BARRIER as stalling barrier
Related-To: NEO-14943

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-28 12:30:01 +02:00
Maciej Plewka
e6f3ebce5d fix: add pc with stall before barrier with post sync on bmg
Related-To: NEO-14491
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-05-28 09:24:21 +02:00
Kamil Kopryk
b7681a2e7b refactor: redirect flush L3 host usm to external by default
Related-To: NEO-13163

Rename ForceL3FlushAfterPostSync to EnableL3FlushAfterPostSync
Rename DisableFlushL3ForHostUsm to RedirectFlushL3HostUsmToExternal

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-05-28 08:50:01 +02:00
Aleksandra Nizio
1dfc9227c4 fix: Removind address_mapper.h
Related-To: NEO-14718
Signed-off-by: Aleksandra Nizio <aleksandra.nizio@intel.com>
2025-05-27 20:37:56 +02:00
Chen, Kai
1484e43bb7 feature: Add hasLscStoresWithNonDefaultL1CacheControls flag
Related-to: NEO-14563

Signed-off-by: Chen, Kai <kai.chen@intel.com>
2025-05-27 20:11:01 +02:00
Aleksandra Nizio
e43ec2bbfd fix: Removing stream
Related-To: NEO-14718
Signed-off-by: Aleksandra Nizio <aleksandra.nizio@intel.com>
2025-05-27 18:53:02 +02:00
Bartosz Dunajski
9c19e9eb96 fix: enable single temporary allocations list mode 2
Related-To: NEO-14641

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-05-27 17:35:17 +02:00
Mateusz Jablonski
72e1ec3b42 fix: pass additional adapter info caps to gmmlib for wddm windows
Related-To: HSD-18041684792
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-05-27 12:43:31 +02:00
Lukasz Jobczyk
a0f683e42b fix: Move evict after unlock to wddm layer
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-27 10:52:23 +02:00
Vysochyn, Illia
2168c43a67 feature(zebin): Update version
Related-To: IGC-11358
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-05-27 10:24:57 +02:00
Lukasz Jobczyk
2e9643f46c performance: Enable small buffer pool allocator on PTL
Resolves: NEO-14817

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-26 22:07:50 +02:00
Mateusz Hoppe
87e2114e42 feature(zebin): rename attribute to "require_iab"
- implicit_arg_buffer_used_by_code is replaced with require_iab

Related-To: NEO-14667

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-05-26 16:27:47 +02:00
Aleksandra Nizio
31fe1978d4 fix: Removing streamProvider and addressMapper
Related-To: NEO-14718
Signed-off-by: Aleksandra Nizio <aleksandra.nizio@intel.com>
2025-05-26 16:01:18 +02:00
Compute-Runtime-Validation
42a8b63725 Revert "fix: enable single temporary allocations list mode"
This reverts commit b496274d1a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-05-26 13:22:15 +02:00
Compute-Runtime-Validation
65562f83f8 Revert "fix: program 8 maxBVHLevels as 0 - special case"
This reverts commit d247358fdf.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-05-24 02:40:27 +02:00
Jaroslaw Warchulski
48fbf4c1d3 feature: enable bindless mode and global bindless allocator in L0 on ARL
Related-To: NEO-14995
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-05-23 17:34:29 +02:00
Mateusz Hoppe
c4bfe4e487 feature(zebin): add implicit_arg_buffer_used_by_code exec env support
Resolves: NEO-14667

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-05-23 17:11:14 +02:00
Kamil Kopryk
d247358fdf fix: program 8 maxBVHLevels as 0 - special case
Related-To: NEO-14423

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-05-23 16:37:37 +02:00
Filip Hazubski
3dc32cec88 feature: Add debug key to override number of threads per EU
Related-To: HSD-22021012755, NEO-14749

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-05-23 16:20:40 +02:00
Lukasz Jobczyk
0e3397547c fix: Adjust waitpkg counter for non ulls light
Resolves: NEO-14866, HSD-18042153820

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-23 15:58:21 +02:00
Mateusz Jablonski
98bf872fdd fix: return error when cannot obtain debug surface size
remove default max debug surface size
check state save area size only for debug scenarios
reduce state save area size in unit tests - rely on values from mock

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-05-23 14:48:08 +02:00
Lukasz Jobczyk
86e7d5b276 performance: Skip fence after PC when no post sync write
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-23 10:33:28 +02:00
Mateusz Hoppe
8e5b29f55e feature: optimize local ids generation
- only emit local ids for required dimensions

Related-To: NEO-15007

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-05-23 09:59:31 +02:00
Mateusz Hoppe
0d57dcfdf0 feature: add L0 API querying priority levels - zeDeviceGetPriorityLevels()
Related-To: NEO-14572

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-05-23 09:24:16 +02:00
Compute-Runtime-Validation
593c9e76f2 Revert "fix: correct limitation for num threads per thread group"
This reverts commit 6ad4ad41b1.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-05-23 02:51:17 +02:00
Raiyan Latif
d6849a5605 feature: Add support for External Semaphore TimelineSemaphoreFd
Related-To: NEO-15032

Signed-off-by: Raiyan Latif <raiyan.latif@intel.com>
2025-05-23 00:40:05 +02:00
Filip Hazubski
a5b82acf02 fix: Update getStackSizePerRay for 30.01 release helper
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-05-22 19:58:18 +02:00
Arek G
db84fc713c performance: Disable ULLS if Power Hints are set to 100 (max)
Related-To: NEO-14763, HSD-16026432518

Signed-off-by: Arek G <arkadiusz.grubba@intel.com>
2025-05-22 16:57:17 +02:00
Compute-Runtime-Validation
78839685fc Revert "feature: enable bindless mode and global bindless allocator in L0 on ...
This reverts commit 7ef85b5853.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-05-22 12:52:10 +02:00
Chodor, Jaroslaw
a97563bf48 feature: Adding mechanism for overriding exposed device IP version
Related-To: GSD-10248

Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-05-22 11:59:26 +02:00
Lukasz Jobczyk
680fa1ec1d refactor: Split direct_submission_hw.inl
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-22 11:27:52 +02:00
Szymon Morek
a614c0bb95 performance: set tag buffer non-coherent on iGPU
There's no need to snoop cpu cache's for tag buffer.
Tag buffer is mostly read-only on CPU.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-22 10:46:42 +02:00
Bartosz Dunajski
b496274d1a fix: enable single temporary allocations list mode
Related-To: NEO-14641

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-05-22 08:45:46 +02:00
Szymon Morek
a2c41e4749 fix: add mock for staging buffer size
No need to allocate 2MB in unit tests.
This commit changes this size to 4KB.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-22 08:36:01 +02:00
Katarzyna Cencelewska
6ad4ad41b1 fix: correct limitation for num threads per thread group
taking into account the max work group limit

Resolves: NEO-14922
Related-To: NEO-11881
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2025-05-22 00:06:21 +02:00
Filip Hazubski
3e59a2f108 fix: Correct IoctlHelperXe logic to query PF support
Move logic in order to respect relevant checks before trying to query support.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-05-21 18:55:17 +02:00
Slawomir Milczarek
0b46fb9f0d refactor: Add allocation type parameter to ioctl helper functions
Related-To: NEO-12952

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-05-21 17:06:33 +02:00
Mateusz Jablonski
8353f68307 feature(internal): add debug key to override max debug surface size
reduce the value in unit tests

Related-To: NEO-14097
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-05-21 14:57:41 +02:00
Bartosz Dunajski
91940d2a79 fix: initial support for single temporary allocations list
Related-To: NEO-14641

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-05-21 13:16:15 +02:00