Fabian Zwoliński
e2228201ce
fix: Avoid redundant padding in ISA allocations
...
- Add isaPaddingIncluded flag to AllocationProperties to control ISA
padding
- Modify MemoryManager to skip automatic padding when flag is set
- Update ISAPoolAllocator to manage padding explicitly and set the
flag
- Use actual buffer size for HeapAllocator instead of requested size
- Add test covering different padding scenarios
This prevents double padding addition which was wasting memory when
ISAPoolAllocator managed its own padding on top of MemoryManager's
automatic padding.
Related-To: NEO-12287, HSD-18042276431
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com >
2025-05-30 15:40:38 +02:00
Lukasz Jobczyk
332340b02b
refactor: Prework for release fence removal leaving acquire fence
...
Related-To: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-30 12:01:28 +02:00
Grochowski, Stanislaw
2d9108f8b6
refactor: change DefaultComputeWalker
...
Related-To: NEO-14537
Signed-off-by: Grochowski, Stanislaw <stanislaw.grochowski@intel.com >
2025-05-29 18:05:29 +02:00
Filip Hazubski
04965470fb
fix: Remove redundant debug key
...
OverrideNumThreadsPerEu debug variable can be used
instead of removed Enable10ThreadsPerEu.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-05-29 16:58:04 +02:00
Katarzyna Cencelewska
e357e7e404
fix: correct limitation for num threads per thread group
...
taking into account the max work group limit
when simd1 don't use the same limitation as for simd32
Resolves: NEO-14922
Related-To: NEO-11881
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2025-05-29 16:34:40 +02:00
Dominik Dabek
ad4d7a2ce8
fix: l0 usm pooling, ipc handling
...
Move getting usm pool for ptr to its own method.
Move trying to free via pooling to its own method.
Use base ptr of pool for tracking in IPCHandleMap.
Track ipc handle refcount in freeMem.
Return error when trying to use a not allocated ptr within pool.
Related-To: NEO-6893
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2025-05-29 16:07:19 +02:00
Lukasz Jobczyk
19361031fe
refactor: Make SemaphoreFenceHelper compliant with rule of three
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-29 15:56:43 +02:00
Maciej Plewka
577c99fe9c
fix: clear residency container when failed to make resident
...
Related-To: NEO-14143
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2025-05-29 14:56:35 +02:00
Mateusz Jablonski
60dfdda1a3
fix: don't fail initialization for small bar config in i915 path
...
https://github.com/intel/compute-runtime/issues/831
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-05-29 12:38:09 +02:00
Young Jin Yoon
2773bd8ec0
fix: estimate blitter commands for memory fill
...
Added getNumberOfBlitsForColorFill, getNumberofBlitsForByteFill,
and getNumberOfBlitsForFill in BlitCommandsHelper to estimate
total number of blitter command generated for
dispatchBlitMemoryColorFill, dispatchBlitMemoryByteFill, and
dispatchBlitMemoryFill, respectively.
Use getNumberOfBlitsForColorFill to determine whether
additionalTimestampNode is required for appendMemoryBlitFill.
Related-To: NEO-13003
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2025-05-29 00:21:35 +02:00
Vysochyn, Illia
3989471ffc
fix: Use round robin after stall as a default arbitration mode
...
Related-To: NEO-14251,HSD-18042035684,HSD-18041638491,HSD-18042040816
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com >
2025-05-28 17:47:48 +02:00
Szymon Morek
556c0b64c6
performance: use RESOURCE_BARRIER as stalling barrier
...
Related-To: NEO-14943
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-05-28 12:30:01 +02:00
Maciej Plewka
e6f3ebce5d
fix: add pc with stall before barrier with post sync on bmg
...
Related-To: NEO-14491
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2025-05-28 09:24:21 +02:00
Kamil Kopryk
b7681a2e7b
refactor: redirect flush L3 host usm to external by default
...
Related-To: NEO-13163
Rename ForceL3FlushAfterPostSync to EnableL3FlushAfterPostSync
Rename DisableFlushL3ForHostUsm to RedirectFlushL3HostUsmToExternal
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-05-28 08:50:01 +02:00
Aleksandra Nizio
1dfc9227c4
fix: Removind address_mapper.h
...
Related-To: NEO-14718
Signed-off-by: Aleksandra Nizio <aleksandra.nizio@intel.com >
2025-05-27 20:37:56 +02:00
Chen, Kai
1484e43bb7
feature: Add hasLscStoresWithNonDefaultL1CacheControls flag
...
Related-to: NEO-14563
Signed-off-by: Chen, Kai <kai.chen@intel.com >
2025-05-27 20:11:01 +02:00
Aleksandra Nizio
e43ec2bbfd
fix: Removing stream
...
Related-To: NEO-14718
Signed-off-by: Aleksandra Nizio <aleksandra.nizio@intel.com >
2025-05-27 18:53:02 +02:00
Bartosz Dunajski
9c19e9eb96
fix: enable single temporary allocations list mode 2
...
Related-To: NEO-14641
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-05-27 17:35:17 +02:00
Mateusz Jablonski
72e1ec3b42
fix: pass additional adapter info caps to gmmlib for wddm windows
...
Related-To: HSD-18041684792
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-05-27 12:43:31 +02:00
Lukasz Jobczyk
a0f683e42b
fix: Move evict after unlock to wddm layer
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-27 10:52:23 +02:00
Vysochyn, Illia
2168c43a67
feature(zebin): Update version
...
Related-To: IGC-11358
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com >
2025-05-27 10:24:57 +02:00
Lukasz Jobczyk
2e9643f46c
performance: Enable small buffer pool allocator on PTL
...
Resolves: NEO-14817
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-26 22:07:50 +02:00
Mateusz Hoppe
87e2114e42
feature(zebin): rename attribute to "require_iab"
...
- implicit_arg_buffer_used_by_code is replaced with require_iab
Related-To: NEO-14667
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-05-26 16:27:47 +02:00
Aleksandra Nizio
31fe1978d4
fix: Removing streamProvider and addressMapper
...
Related-To: NEO-14718
Signed-off-by: Aleksandra Nizio <aleksandra.nizio@intel.com >
2025-05-26 16:01:18 +02:00
Compute-Runtime-Validation
42a8b63725
Revert "fix: enable single temporary allocations list mode"
...
This reverts commit b496274d1a .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-26 13:22:15 +02:00
Compute-Runtime-Validation
65562f83f8
Revert "fix: program 8 maxBVHLevels as 0 - special case"
...
This reverts commit d247358fdf .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-24 02:40:27 +02:00
Jaroslaw Warchulski
48fbf4c1d3
feature: enable bindless mode and global bindless allocator in L0 on ARL
...
Related-To: NEO-14995
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2025-05-23 17:34:29 +02:00
Mateusz Hoppe
c4bfe4e487
feature(zebin): add implicit_arg_buffer_used_by_code exec env support
...
Resolves: NEO-14667
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-05-23 17:11:14 +02:00
Kamil Kopryk
d247358fdf
fix: program 8 maxBVHLevels as 0 - special case
...
Related-To: NEO-14423
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-05-23 16:37:37 +02:00
Filip Hazubski
3dc32cec88
feature: Add debug key to override number of threads per EU
...
Related-To: HSD-22021012755, NEO-14749
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-05-23 16:20:40 +02:00
Lukasz Jobczyk
0e3397547c
fix: Adjust waitpkg counter for non ulls light
...
Resolves: NEO-14866, HSD-18042153820
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-23 15:58:21 +02:00
Mateusz Jablonski
98bf872fdd
fix: return error when cannot obtain debug surface size
...
remove default max debug surface size
check state save area size only for debug scenarios
reduce state save area size in unit tests - rely on values from mock
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-05-23 14:48:08 +02:00
Lukasz Jobczyk
86e7d5b276
performance: Skip fence after PC when no post sync write
...
Related-To: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-23 10:33:28 +02:00
Mateusz Hoppe
8e5b29f55e
feature: optimize local ids generation
...
- only emit local ids for required dimensions
Related-To: NEO-15007
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-05-23 09:59:31 +02:00
Mateusz Hoppe
0d57dcfdf0
feature: add L0 API querying priority levels - zeDeviceGetPriorityLevels()
...
Related-To: NEO-14572
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-05-23 09:24:16 +02:00
Compute-Runtime-Validation
593c9e76f2
Revert "fix: correct limitation for num threads per thread group"
...
This reverts commit 6ad4ad41b1 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-23 02:51:17 +02:00
Raiyan Latif
d6849a5605
feature: Add support for External Semaphore TimelineSemaphoreFd
...
Related-To: NEO-15032
Signed-off-by: Raiyan Latif <raiyan.latif@intel.com >
2025-05-23 00:40:05 +02:00
Filip Hazubski
a5b82acf02
fix: Update getStackSizePerRay for 30.01 release helper
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-05-22 19:58:18 +02:00
Arek G
db84fc713c
performance: Disable ULLS if Power Hints are set to 100 (max)
...
Related-To: NEO-14763, HSD-16026432518
Signed-off-by: Arek G <arkadiusz.grubba@intel.com >
2025-05-22 16:57:17 +02:00
Compute-Runtime-Validation
78839685fc
Revert "feature: enable bindless mode and global bindless allocator in L0 on ...
...
This reverts commit 7ef85b5853 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-22 12:52:10 +02:00
Chodor, Jaroslaw
a97563bf48
feature: Adding mechanism for overriding exposed device IP version
...
Related-To: GSD-10248
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com >
2025-05-22 11:59:26 +02:00
Lukasz Jobczyk
680fa1ec1d
refactor: Split direct_submission_hw.inl
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-22 11:27:52 +02:00
Szymon Morek
a614c0bb95
performance: set tag buffer non-coherent on iGPU
...
There's no need to snoop cpu cache's for tag buffer.
Tag buffer is mostly read-only on CPU.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-05-22 10:46:42 +02:00
Bartosz Dunajski
b496274d1a
fix: enable single temporary allocations list mode
...
Related-To: NEO-14641
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-05-22 08:45:46 +02:00
Szymon Morek
a2c41e4749
fix: add mock for staging buffer size
...
No need to allocate 2MB in unit tests.
This commit changes this size to 4KB.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-05-22 08:36:01 +02:00
Katarzyna Cencelewska
6ad4ad41b1
fix: correct limitation for num threads per thread group
...
taking into account the max work group limit
Resolves: NEO-14922
Related-To: NEO-11881
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2025-05-22 00:06:21 +02:00
Filip Hazubski
3e59a2f108
fix: Correct IoctlHelperXe logic to query PF support
...
Move logic in order to respect relevant checks before trying to query support.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-05-21 18:55:17 +02:00
Slawomir Milczarek
0b46fb9f0d
refactor: Add allocation type parameter to ioctl helper functions
...
Related-To: NEO-12952
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com >
2025-05-21 17:06:33 +02:00
Mateusz Jablonski
8353f68307
feature(internal): add debug key to override max debug surface size
...
reduce the value in unit tests
Related-To: NEO-14097
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-05-21 14:57:41 +02:00
Bartosz Dunajski
91940d2a79
fix: initial support for single temporary allocations list
...
Related-To: NEO-14641
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-05-21 13:16:15 +02:00