Lukasz Jobczyk
48206fe848
performance: Remove release fence from command stream on xe2
...
Resolves: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-30 18:50:31 +02:00
Compute-Runtime-Validation
1a50e8e7c0
Revert "performance: use RESOURCE_BARRIER as stalling barrier"
...
This reverts commit 556c0b64c6 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-30 18:00:34 +02:00
Katarzyna Cencelewska
828d6bafa7
fix: return proper value for zeKernelSuggestGroupSize
...
Resolves: HSD-18042274687
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2025-05-30 16:35:21 +02:00
Radoslaw Jablonski
2762166a7b
feature: add device caps query ioctl
...
Related-To: NEO-13039
Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com >
2025-05-30 16:28:28 +02:00
Fabian Zwoliński
e2228201ce
fix: Avoid redundant padding in ISA allocations
...
- Add isaPaddingIncluded flag to AllocationProperties to control ISA
padding
- Modify MemoryManager to skip automatic padding when flag is set
- Update ISAPoolAllocator to manage padding explicitly and set the
flag
- Use actual buffer size for HeapAllocator instead of requested size
- Add test covering different padding scenarios
This prevents double padding addition which was wasting memory when
ISAPoolAllocator managed its own padding on top of MemoryManager's
automatic padding.
Related-To: NEO-12287, HSD-18042276431
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com >
2025-05-30 15:40:38 +02:00
Lukasz Jobczyk
332340b02b
refactor: Prework for release fence removal leaving acquire fence
...
Related-To: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-30 12:01:28 +02:00
Filip Hazubski
047be0a313
test: Correct compiler cache fixture
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-05-29 18:58:24 +02:00
Filip Hazubski
04965470fb
fix: Remove redundant debug key
...
OverrideNumThreadsPerEu debug variable can be used
instead of removed Enable10ThreadsPerEu.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-05-29 16:58:04 +02:00
Katarzyna Cencelewska
e357e7e404
fix: correct limitation for num threads per thread group
...
taking into account the max work group limit
when simd1 don't use the same limitation as for simd32
Resolves: NEO-14922
Related-To: NEO-11881
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2025-05-29 16:34:40 +02:00
Dominik Dabek
ad4d7a2ce8
fix: l0 usm pooling, ipc handling
...
Move getting usm pool for ptr to its own method.
Move trying to free via pooling to its own method.
Use base ptr of pool for tracking in IPCHandleMap.
Track ipc handle refcount in freeMem.
Return error when trying to use a not allocated ptr within pool.
Related-To: NEO-6893
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2025-05-29 16:07:19 +02:00
Maciej Plewka
577c99fe9c
fix: clear residency container when failed to make resident
...
Related-To: NEO-14143
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2025-05-29 14:56:35 +02:00
Jitendra Sharma
c741628b00
test: Dont create new mock class for SipKernel
...
Instead of creating new mock class for SipKernel, use MockSipKernel
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com >
2025-05-29 12:49:46 +02:00
Mateusz Jablonski
60dfdda1a3
fix: don't fail initialization for small bar config in i915 path
...
https://github.com/intel/compute-runtime/issues/831
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-05-29 12:38:09 +02:00
Young Jin Yoon
2773bd8ec0
fix: estimate blitter commands for memory fill
...
Added getNumberOfBlitsForColorFill, getNumberofBlitsForByteFill,
and getNumberOfBlitsForFill in BlitCommandsHelper to estimate
total number of blitter command generated for
dispatchBlitMemoryColorFill, dispatchBlitMemoryByteFill, and
dispatchBlitMemoryFill, respectively.
Use getNumberOfBlitsForColorFill to determine whether
additionalTimestampNode is required for appendMemoryBlitFill.
Related-To: NEO-13003
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2025-05-29 00:21:35 +02:00
Vysochyn, Illia
3989471ffc
fix: Use round robin after stall as a default arbitration mode
...
Related-To: NEO-14251,HSD-18042035684,HSD-18041638491,HSD-18042040816
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com >
2025-05-28 17:47:48 +02:00
Szymon Morek
556c0b64c6
performance: use RESOURCE_BARRIER as stalling barrier
...
Related-To: NEO-14943
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-05-28 12:30:01 +02:00
Maciej Plewka
e6f3ebce5d
fix: add pc with stall before barrier with post sync on bmg
...
Related-To: NEO-14491
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2025-05-28 09:24:21 +02:00
Kamil Kopryk
b7681a2e7b
refactor: redirect flush L3 host usm to external by default
...
Related-To: NEO-13163
Rename ForceL3FlushAfterPostSync to EnableL3FlushAfterPostSync
Rename DisableFlushL3ForHostUsm to RedirectFlushL3HostUsmToExternal
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-05-28 08:50:01 +02:00
Kindracki, Jakub Tomasz
d0234c0bef
test: Remove resetCommandStreamReceiver from tests
...
Related-To: NEO-3927
Signed-off-by: Kindracki, Jakub Tomasz <jakub.tomasz.kindracki@intel.com >
2025-05-27 23:58:44 +02:00
Aleksandra Nizio
1dfc9227c4
fix: Removind address_mapper.h
...
Related-To: NEO-14718
Signed-off-by: Aleksandra Nizio <aleksandra.nizio@intel.com >
2025-05-27 20:37:56 +02:00
Chen, Kai
1484e43bb7
feature: Add hasLscStoresWithNonDefaultL1CacheControls flag
...
Related-to: NEO-14563
Signed-off-by: Chen, Kai <kai.chen@intel.com >
2025-05-27 20:11:01 +02:00
Aleksandra Nizio
e43ec2bbfd
fix: Removing stream
...
Related-To: NEO-14718
Signed-off-by: Aleksandra Nizio <aleksandra.nizio@intel.com >
2025-05-27 18:53:02 +02:00
Bartosz Dunajski
9c19e9eb96
fix: enable single temporary allocations list mode 2
...
Related-To: NEO-14641
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-05-27 17:35:17 +02:00
Mateusz Jablonski
72e1ec3b42
fix: pass additional adapter info caps to gmmlib for wddm windows
...
Related-To: HSD-18041684792
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-05-27 12:43:31 +02:00
Lukasz Jobczyk
a0f683e42b
fix: Move evict after unlock to wddm layer
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-27 10:52:23 +02:00
Lukasz Jobczyk
2e9643f46c
performance: Enable small buffer pool allocator on PTL
...
Resolves: NEO-14817
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-26 22:07:50 +02:00
Mateusz Hoppe
87e2114e42
feature(zebin): rename attribute to "require_iab"
...
- implicit_arg_buffer_used_by_code is replaced with require_iab
Related-To: NEO-14667
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-05-26 16:27:47 +02:00
Aleksandra Nizio
31fe1978d4
fix: Removing streamProvider and addressMapper
...
Related-To: NEO-14718
Signed-off-by: Aleksandra Nizio <aleksandra.nizio@intel.com >
2025-05-26 16:01:18 +02:00
Compute-Runtime-Validation
42a8b63725
Revert "fix: enable single temporary allocations list mode"
...
This reverts commit b496274d1a .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-26 13:22:15 +02:00
Compute-Runtime-Validation
65562f83f8
Revert "fix: program 8 maxBVHLevels as 0 - special case"
...
This reverts commit d247358fdf .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-24 02:40:27 +02:00
Filip Hazubski
ff7ae28b1b
test: Add test check for dc flush
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-05-23 17:45:20 +02:00
Jaroslaw Warchulski
48fbf4c1d3
feature: enable bindless mode and global bindless allocator in L0 on ARL
...
Related-To: NEO-14995
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2025-05-23 17:34:29 +02:00
Mateusz Hoppe
c4bfe4e487
feature(zebin): add implicit_arg_buffer_used_by_code exec env support
...
Resolves: NEO-14667
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-05-23 17:11:14 +02:00
Kamil Kopryk
d247358fdf
fix: program 8 maxBVHLevels as 0 - special case
...
Related-To: NEO-14423
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-05-23 16:37:37 +02:00
Filip Hazubski
3dc32cec88
feature: Add debug key to override number of threads per EU
...
Related-To: HSD-22021012755, NEO-14749
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-05-23 16:20:40 +02:00
Lukasz Jobczyk
0e3397547c
fix: Adjust waitpkg counter for non ulls light
...
Resolves: NEO-14866, HSD-18042153820
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-23 15:58:21 +02:00
Mateusz Jablonski
98bf872fdd
fix: return error when cannot obtain debug surface size
...
remove default max debug surface size
check state save area size only for debug scenarios
reduce state save area size in unit tests - rely on values from mock
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-05-23 14:48:08 +02:00
Lukasz Jobczyk
86e7d5b276
performance: Skip fence after PC when no post sync write
...
Related-To: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-23 10:33:28 +02:00
Mateusz Hoppe
8e5b29f55e
feature: optimize local ids generation
...
- only emit local ids for required dimensions
Related-To: NEO-15007
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-05-23 09:59:31 +02:00
Mateusz Hoppe
0d57dcfdf0
feature: add L0 API querying priority levels - zeDeviceGetPriorityLevels()
...
Related-To: NEO-14572
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-05-23 09:24:16 +02:00
Compute-Runtime-Validation
593c9e76f2
Revert "fix: correct limitation for num threads per thread group"
...
This reverts commit 6ad4ad41b1 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-23 02:51:17 +02:00
Raiyan Latif
d6849a5605
feature: Add support for External Semaphore TimelineSemaphoreFd
...
Related-To: NEO-15032
Signed-off-by: Raiyan Latif <raiyan.latif@intel.com >
2025-05-23 00:40:05 +02:00
Filip Hazubski
a5b82acf02
fix: Update getStackSizePerRay for 30.01 release helper
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-05-22 19:58:18 +02:00
Arek G
db84fc713c
performance: Disable ULLS if Power Hints are set to 100 (max)
...
Related-To: NEO-14763, HSD-16026432518
Signed-off-by: Arek G <arkadiusz.grubba@intel.com >
2025-05-22 16:57:17 +02:00
Compute-Runtime-Validation
78839685fc
Revert "feature: enable bindless mode and global bindless allocator in L0 on ...
...
This reverts commit 7ef85b5853 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-22 12:52:10 +02:00
Szymon Morek
a614c0bb95
performance: set tag buffer non-coherent on iGPU
...
There's no need to snoop cpu cache's for tag buffer.
Tag buffer is mostly read-only on CPU.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-05-22 10:46:42 +02:00
Bartosz Dunajski
b496274d1a
fix: enable single temporary allocations list mode
...
Related-To: NEO-14641
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-05-22 08:45:46 +02:00
Szymon Morek
a2c41e4749
fix: add mock for staging buffer size
...
No need to allocate 2MB in unit tests.
This commit changes this size to 4KB.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-05-22 08:36:01 +02:00
Katarzyna Cencelewska
6ad4ad41b1
fix: correct limitation for num threads per thread group
...
taking into account the max work group limit
Resolves: NEO-14922
Related-To: NEO-11881
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2025-05-22 00:06:21 +02:00
John Falkowski
5f80490385
test: Correct variable name
...
Related-To: NEO-12988
Signed-off-by: John Falkowski <john.falkowski@intel.com >
2025-05-21 20:04:59 +02:00