Mateusz Hoppe
b17ed79618
fix: update implicitArgs versions
...
- fix layout of implicit args
- add enqueued local size and sync buffer ptr
Related-To: NEO-15160
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-07-22 15:12:55 +02:00
Oskar Hubert Weber
1c722ddb7c
test: mock filesystem in ULTs 2/n
...
Remove USE_REAL_FILE_SYSTEM() macro uses from API, program, kernel and
gtpin tests.
Related-To: NEO-15069
Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com >
2025-07-03 09:44:06 +02:00
Marcel Skierkowski
f0d5cf7530
test: StreamCaptureStdErr on pipes
...
Store data in a pipe instead of a temporary file like gtest
resulting in faster test execution
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com >
2025-07-01 11:52:43 +02:00
Jaroslaw Warchulski
49152d957f
test: cleanup in common matchers
...
IsAtLeastGen12LP / IsNotWithinXeGfxFamily / IsNotXe2HpgCore /
IsWithinXeHpgCoreAndXe3Core / IsWithinXeHpcCoreAndXe3Core -> deleted
IsAtLeastXeHpCore -> IsAtLeastXeCore
IsBeforeXeHpCore / IsBeforeXeHpgCore / IsAtMostXeHpCore -> IsGen12LP
IsXeHpOrXeHpgCore -> IsXeHpgCore
IsXeHpOrXeHpcCore -> IsXeHpcCore
IsNotXeHpOrXeHpgCore -> IsNotXeHpgCore
IsNotXeHpOrXeHpcCore -> IsNotXeHpcCore
IsWithinXeGfxFamily / IsXeHpcOrXeHpgCore / IsXeHpOrXeHpcOrXeHpgCore ->
IsXeCore
IsNotXeHpgOrXeHpcCore -> IsNotXeCore
IsWithinXeHpCoreAndXe2HpgCore -> IsWithinXeCoreAndXe2HpgCore
IsXeHpcCoreOrXe2HpgCore -> IsWithinXeHpcCoreAndXe2HpgCore
IsWithinXeHpCoreAndXe3Core -> IsWithinXeCoreAndXe3Core
IsXe2HpgCoreOrXe3Core -> IsWithinXe2HpgCoreAndXe3Core
IsAtLeastBmg -> IsAtLeastXe2HpgCore
IsBeforeXeHpcCore -> IsAtMostXeHpgCore
IsBeforeXe2HpgCore / IsAtMostArl / IsAtMostXeHpcCore -> IsAtMostXeCore
Related-To: NEO-12681
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2025-06-18 10:48:17 +02:00
Mateusz Jablonski
0a325e09be
fix: correct initialization of Kernel::isBuiltIn
...
value is used in Kernel::initialize but was set after initialize call
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-06-11 18:29:06 +02:00
Kamil Kopryk
bb52c358b9
refactor: flush Task method
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
Related-To: NEO-15011
Also correct ults
2025-06-11 16:36:56 +02:00
Lukasz Jobczyk
62619f9525
refactor: remove unused kernel tunning
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-08 14:18:21 +02:00
Kamil Kopryk
dd3d294f87
performance: cache MOCS values
...
This change caches the most used MOCS values:
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CONST);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CACHELINE_MISALIGNED);
inside gmmHelper class during initialization to avoid repeated
calls of virtual functions, branches and/or gmm lib access.
and adds more readably corresponding getters:
* getL1EnabledMOCS
* getL3EnabledMOCS
* getUncachedMOCS
If force all resources uncached is called,
these 3 cached mocs values are reinitialized
It also changes the order of gmmHelper members, to avoid
not needed padding after addressWidth
and simplifies logic in getMocsIndex function
for xehp and later products.
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-04-14 14:12:48 +02:00
Mateusz Hoppe
34dad4d3de
refactor: store implicit args version in kernel
...
- adjust tests to work with implicitargs v0 and v1
Related-To: NEO-14115
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-03-10 22:58:57 +01:00
Mateusz Hoppe
291745cdf7
feature: add support for ImplicitArgs versioning
...
- define ImplicitArgs version 1
Related-To: NEO-14115
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-02-25 10:27:11 +01:00
Kamil Kopryk
b8723458e4
test: update ults to respect minimal simd
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-02-11 07:57:16 +01:00
Compute-Runtime-Validation
9b968066f2
Revert "test: update ults to respect minimal simd"
...
This reverts commit 4cb9baa2fc .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-02-08 00:06:29 +01:00
Kamil Kopryk
4cb9baa2fc
test: update ults to respect minimal simd
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-02-07 09:32:42 +01:00
Bartosz Dunajski
c2dbdb6797
refactor: move blit post sync data to BlitProperties
...
Related-To: NEO-13003
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-01-22 18:35:21 +01:00
Kamil Kopryk
6926582915
feature: use bindless samplers in heapless ocl
...
Related-To: NEO-12741
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-01-22 11:49:48 +01:00
Kamil Kopryk
d263fda1ad
feature: patch 64bit addresses for bindless images in heapless OCL
...
Related-To: NEO-12741
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-01-16 15:24:02 +01:00
Mateusz Hoppe
c08c9b7f86
refactor: change test to HWTEST
...
- move Heapful matcher to test macros
- check alignment in patchWithRequiredSize
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2024-12-31 10:37:37 +01:00
Filip Hazubski
a0cc124b2e
performance: Pass RootDeviceIndicesContainer by reference
...
Additionally pass std::map by reference in UsmMemAllocPoolsManager c-tor.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2024-12-17 14:18:30 +01:00
Oskar Hubert Weber
7b4504549e
test: allowing neo ULT build with optimization enabled 4/n
...
Related-To: NEO-8116
Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com >
2024-12-17 13:07:35 +01:00
Mateusz Jablonski
33ece63b91
fix: remove incorrect test kernel
...
test uses image3d_t which is not supported on all platforms
since the functionality can be tested with other kernel
this one can be removed
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-09-20 15:08:27 +02:00
Mateusz Jablonski
7adfa344c0
performance: get reference from getBindlessInfo
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-09-03 12:50:52 +02:00
Kamil Kopryk
d79889c3b2
test: correct expectations in opencl tests if heapless enabled 2/n
...
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-09-03 11:14:17 +02:00
Dominik Dabek
ad229377b9
fix: disable indirect detection if any stack calls
...
Don't know if kernels will be initialized in the order needed to check
for indirect accesses in stack calls.
Remove now unused functionPointerWithIndirectAccessExists and reading
this value from zebin.
Related-To: NEO-12235
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-08-07 14:48:58 +02:00
Fabian Zwoliński
b1a50104a8
fix: include dynamic SLM in clGetKernelWorkGroupInfo and zeKernelGetProperties
...
Current implementation only takes static slmInlineSize into account.
With this change we also include dynamic SLM passed as a kernel arguments.
Related-To: NEO-5761
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com >
2024-07-30 17:59:45 +02:00
Grzegorz Choinski
46f2568902
build: fixes for windows clang with -m32
...
Related-To: NEO-10748
Signed-off-by: Grzegorz Choinski <grzegorz.choinski@intel.com >
2024-07-18 14:49:56 +02:00
Jaroslaw Warchulski
63da79798d
refactor: reduce the number of GraphicsAllocation constructors and derivatives
...
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2024-06-18 21:31:37 +02:00
Mateusz Jablonski
55fad237b8
test: update gtest API usage
...
use *_TEST_SUITE* instead of *_TEST_CASE*
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-05-28 16:20:06 +02:00
Dominik Dabek
ae8c7589dc
refactor: move implicit arg has indirect access
...
Move implicit arg has indirect access boolean to kernelAttributes
Related-To: NEO-11396
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-05-15 13:11:04 +02:00
Dominik Dabek
fd47030ad6
fix: use igc indirect detection v3
...
Update to use igc indirect detection v3. Fix for not detecting indirects
passed as implicit arguments.
Related-To: NEO-11396
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-05-15 06:35:42 +02:00
Szymon Morek
e35b951a00
performance: Allow indirect allocs as pack on OpenCL
...
Related-To: NEO-11228
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-05-10 17:57:42 +02:00
Maciej Plewka
e39893485c
fix: add cache flush as dependency for bcs ccs synchronization
...
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2024-05-09 13:43:39 +02:00
Katarzyna Cencelewska
e9f7df6ae6
refactor: create helper for maxPtssIndex
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2024-05-07 12:47:52 +02:00
Fabian Zwoliński
ee71157f7f
fix: opencl support for bindless kernels
...
Related-To: NEO-11156
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com >
2024-04-30 12:02:17 +02:00
Jaroslaw Warchulski
ea057abd36
fix: allow empty set to be passed to clSetKernelExecInfo
...
Related-To: GSD-8538
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2024-04-16 14:11:04 +02:00
Mateusz Jablonski
cb2b572e94
feature: add support for null aub mode
...
In this mode AUB csr will be created, however, no aub file will be created
Related-To: NEO-11097
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-04-09 16:59:42 +02:00
Kamil Kopryk
4eae28bd64
feature: introduce heapless state init in L0
...
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-04-02 12:34:53 +02:00
Dominik Dabek
2b964254d6
performance: debug key for adjust ULLS on battery
...
ULLS controller timeout settings will be adjusted based on ac line
status and lowest queue throttle from submissions.
Lowest queue throttle is reset when controller stops ULLS.
Related-To: NEO-10800
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-03-22 14:24:00 +01:00
Mrozek, Michal
f71f6d2b72
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com >
2024-03-08 18:18:55 +01:00
Kamil Kopryk
168445784e
feature: introduce states programming at driver init heapless ocl
...
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-03-08 12:29:44 +01:00
Mateusz Jablonski
373ef608bc
refactor: unify calculating max group size between OCL and L0
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-02-27 13:32:10 +01:00
Mateusz Jablonski
a697a3f718
refactor: create new members for storing spill and private memory in scratch
...
rename private scratch space into scratch space slot 1 as it can be generic
Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-01-23 12:42:25 +01:00
Compute-Runtime-Validation
f9f9035b95
Revert "refactor: create new members for storing spill and private memory in ...
...
This reverts commit 87eb5f554a .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-01-23 09:13:00 +01:00
Mateusz Jablonski
87eb5f554a
refactor: create new members for storing spill and private memory in scratch
...
rename private scratch space into scratch space slot 1 as it can be generic
Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-01-22 19:48:48 +01:00
Dunajski, Bartosz
b266f1f3cc
refactor: improve implicit scaling interface
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2024-01-19 14:09:04 +01:00
Dominik Dabek
e0ccf22557
fix: indirect access in external functions
...
Read indirect_stateless_count in module external functions.
If greater than 0, mark all kernels that have the has_stack_calls flag
set from this module as having indirect accesses.
Related-To: NEO-7712
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-01-18 15:29:41 +01:00
Mateusz Jablonski
a4888b39c6
build: add clang-tidy restriction for Enum case
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-21 08:58:51 +01:00
Mateusz Jablonski
a6458433dc
refactor: correct naming of enum class constants 9/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-19 10:36:25 +01:00
Mateusz Jablonski
dd1b9d6abc
refactor: correct naming of enum class constants 8/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-19 08:18:18 +01:00
Dunajski, Bartosz
d99104d5bf
refactor: improve ImplicitArg struct handling
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-12-18 15:19:00 +01:00
Ben Ashbaugh
976c6d3acc
feature: initial register count query implementation
...
Adds a per-kernel and per-device query to determine the
number of GRF registers that a kernel was compiled for.
This is an informal query for now, but may be added to
a formally supported extension in the future.
Related-To: NEO-9807
Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com >
2023-12-18 11:45:01 +01:00