068fed7512
feature: add support for the cl_khr_expect_assume extensions
...
Related-To: NEO-12696
Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com >
2025-01-23 06:28:28 +01:00
c2dbdb6797
refactor: move blit post sync data to BlitProperties
...
Related-To: NEO-13003
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-01-22 18:35:21 +01:00
9cfc6e6bbe
fix: usm reuse cleaner mt tests
...
Related-To: NEO-13425
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2025-01-22 18:21:18 +01:00
19a0a27862
refactor: adjust unit tests to work with secondary engines
...
Related-To: NEO-12952, NEO-13789
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-01-22 13:31:43 +01:00
c0838e1f76
fix: Apply dispatch all for small TG only on BMG
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-01-22 13:04:44 +01:00
6926582915
feature: use bindless samplers in heapless ocl
...
Related-To: NEO-12741
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-01-22 11:49:48 +01:00
47fc1ce34b
refactor: use designated initializer for EncodeWalkerArgs
...
also make EncodeWalkerArgs an aggregate
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-01-22 11:44:57 +01:00
1e89058232
refactor: change data fields for encoder of additional walker fields
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-01-21 16:49:38 +01:00
850e6c02eb
test: allocate sip allocation for all contexts in initSipKernel
...
- some tests initialize sip kernel but do not free it
- this is causing memory leaks
- as a solution mock sip listener is added that will release allocated
sip kernel
Related-To: NEO-13789
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-01-20 15:46:05 +01:00
b7d7424aab
build: enable c++20
...
Related-To: NEO-10767
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-01-20 12:35:12 +01:00
1ce795c265
refactor: fixes in ults
...
Related-To: NEO-13789
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-01-20 09:57:05 +01:00
6b43e7a4ec
test: heapless fix for CopyImageToBuffer ult
...
Related-to: NEO-12737
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2025-01-20 09:13:54 +01:00
af0bb7964a
performance: Set dispatch all for small TG
...
Resolves: NEO-11814
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-01-20 08:49:59 +01:00
ac78c07899
refactor: unify duplicated code for staging images
...
Pass command type instead of maintaining
two methods for reads and writes
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-17 09:52:35 +01:00
c8cbed8e9b
Revert "feature: enable C++20"
...
This reverts commit 137e004767
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-01-17 02:48:11 +01:00
137e004767
feature: enable C++20
...
Related-To: NEO-10767
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-01-16 20:10:15 +01:00
4dd1c80362
refactor: use enum class instead of enum for DeviceHierarchyMode
...
Related-To: GSD-9560
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2025-01-16 15:49:44 +01:00
7d8e08f00b
test: adjust code to compile with c++20 2/n
...
Related-To: NEO-10767
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-01-16 15:24:25 +01:00
d263fda1ad
feature: patch 64bit addresses for bindless images in heapless OCL
...
Related-To: NEO-12741
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-01-16 15:24:02 +01:00
0278d2e652
test: adjust code to compile with c++20
...
Related-To: NEO-10767
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-01-16 12:04:50 +01:00
a9a2f7577f
performance: enable staging write for cl buffers
...
Related-To: NEO-13529
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-16 12:04:28 +01:00
99a7b5a4fb
refactor: remove not needed volatile
...
Related-To: NEO-10767
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-01-16 00:43:10 +01:00
a17745532c
performance: Move preemption allocation init to the first submission
...
Related-To: NEO-12323
Signed-off-by: Krzysztof Sprzaczkowski <krzysztof.sprzaczkowski@intel.com >
2025-01-15 20:22:50 +01:00
0b4fe9a0df
performance: add staging transfers for cl buffers
...
Related-To: NEO-13529
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-15 12:24:43 +01:00
af031ee0e3
Revert "performance: align structures for 64-bit platforms"
...
This reverts commit 9f07f56f7f
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-01-15 09:02:01 +01:00
6a7bee67e9
refactor: unify device hierarchy mode
...
Related-To: GSD-9560
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2025-01-14 21:08:44 +01:00
b859a3b35b
refactor: remove usesBindfulAddressingForBuffers function
...
Related-to: NEO-12737
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2025-01-14 10:45:04 +01:00
a180afa2e6
fix: respect affinity mask with COMBINED hierarchy in OCL
...
Related-To: GSD-9560
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2025-01-13 15:38:39 +01:00
112abeeeef
fix: don't adjust programmed per thread scratch size
...
when adjusting scratch space size then adjust only allocation size
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-01-10 11:35:50 +01:00
a3b6c1fa6d
fix: correct thread/eu ratio for scratch to Xe2
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-01-09 22:42:36 +01:00
7f9fadc314
fix: regression caused by tbx fault mngr
...
Addresses regressions from the reverted merge
of the tbx fault manager for host memory.
Recursive locking of mutex caused deadlock.
To fix, separate tbx fault data from base
cpu fault data, allowing separate mutexes
for each, eliminating recursive locks on
the same mutex.
By separating, we also help ensure that tbx-related
changes don't affect the original cpu fault manager code
paths.
As an added safe guard preventing critical regressions
and avoiding another auto-revert, the tbx fault manager
is hidden behind a new debug flag which is disabled by default.
Related-To: NEO-12268
Signed-off-by: Jack Myers <jack.myers@intel.com >
2025-01-09 07:48:53 +01:00
9f07f56f7f
performance: align structures for 64-bit platforms
...
Signed-off-by: Semenov Herman (Семенов Герман) <GermanAizek@yandex.ru >
2025-01-09 06:03:39 +01:00
a2aa645404
refactor: Update MI_MEM_FENCE
...
Refactors the MI_MEM_FENCE to align with the latest specification.
Related-To: NEO-13147
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com >
2025-01-08 16:46:58 +01:00
bd5ca4fed5
fix: flush ocl queue after staging transfer
...
This is required for scenarios without direct submission
Otherwise, staging transfers are enqueued but not actually
flushed. If that happens, subsequent staging transfers can't
reuse previous staging buffers. It results in increased memory usage.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-08 14:02:33 +01:00
983b46fbbb
performance: Align host USM to 2MB
...
Only on discrete devices and if size is greater than 2MB
Resolves: NEO-12652
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-01-07 14:32:26 +01:00
2815d4167d
fix: don't use staging for map allocations
...
Related-To: NEO-13572
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-01-07 13:30:11 +01:00
bb1a125f0c
feature: add support for Panther Lake platform
...
Related-To: NEO-12803
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-01-07 11:39:56 +01:00
c867a5ed5d
refactor: remove redundant flag levelZeroSupported
...
all platforms are supported by L0
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-01-03 11:28:11 +01:00
badbecb126
feature: adds support for 3 channel (2/N)
...
Allows user to create L0 images with 3 channel
Related-To: NEO-12887
Signed-off-by: Kulkarni, Ashwin Kumar <ashwin.kumar.kulkarni@intel.com >
2024-12-31 20:02:13 +01:00
f198507875
refactor: Remove 3DSTATE_BTD_BODY structure
...
Removes 3DSTATE_BTD_BODY as redundant structure.
Related-To: NEO-13147
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com >
2024-12-31 16:27:29 +01:00
c08c9b7f86
refactor: change test to HWTEST
...
- move Heapful matcher to test macros
- check alignment in patchWithRequiredSize
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2024-12-31 10:37:37 +01:00
3179b30bb8
test(ocl): refactor ults
...
Related-To: NEO-12800
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2024-12-30 09:34:12 +01:00
548f93c55a
Revert "performance: enable staging read for images"
...
This reverts commit 6aeb8dffa0
.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-12-23 17:42:12 +01:00
2951f8a411
refactor: Update CFE_STATE
...
Refactors the CFE_STATE to align with the latest specification.
Related-To: NEO-13147
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com >
2024-12-23 14:31:20 +01:00
1135c1033e
fix: set staging buffer writeable for aub and tbx
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-12-23 13:35:01 +01:00
1c1b2db9a9
fix: allowing neo ULT build with optimization enabled 5/n
...
- fix mismatched-new-delete warnings
- fix initialization warnings
- fix attempt to free a non-heap object warnings
Related-To: NEO-8116
Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com >
2024-12-23 13:07:55 +01:00
dd3d5c6460
feature: Add logic for reading loaded library's version
...
getLoadedLibVersion is intended for reading version of a library that was
already loaded into the process prior to this call (i.e. new instance of
the library willnot be loaded).
Note: Currently, only windows version of this functionality is implemented.
Related-To: GSD-10248
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com >
2024-12-20 17:29:19 +01:00
6aeb8dffa0
performance: enable staging read for images
...
Related-To: NEO-12968
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-12-20 15:02:28 +01:00
c858234a3c
test: allow to utilize enqueue token for local size
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2024-12-20 13:58:10 +01:00
cce17c41e8
test: local work size tokens may be optional for reqdWorkgroupSize cases
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2024-12-20 12:22:53 +01:00