
328 Commits

Author SHA1 Message Date
Mateusz Hoppe
b17ed79618 fix: update implicitArgs versions
- fix layout of implicit args
- add enqueued local size and sync buffer ptr

Related-To: NEO-15160

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-07-22 15:12:55 +02:00
Oskar Hubert Weber
1c722ddb7c test: mock filesystem in ULTs 2/n
Remove USE_REAL_FILE_SYSTEM() macro uses from API, program, kernel and
gtpin tests.

Related-To: NEO-15069

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2025-07-03 09:44:06 +02:00
Marcel Skierkowski
f0d5cf7530 test: StreamCaptureStdErr on pipes
Store data in a pipe instead of a temporary file (as gtest does),
resulting in faster test execution.

Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-07-01 11:52:43 +02:00
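Note on f0d5cf7530: a minimal sketch of capturing stderr through a pipe rather than a temporary file, assuming a POSIX platform. `StderrPipeCapture` is a hypothetical name used only for illustration; it is not the StreamCaptureStdErr class touched by the commit, whose details are not shown in this log.

```cpp
#include <cstddef>
#include <cstdio>
#include <string>
#include <unistd.h>

// Hypothetical helper, not the StreamCaptureStdErr class from the commit.
class StderrPipeCapture {
  public:
    // Redirect file descriptor 2 into the write end of a fresh pipe.
    bool start() {
        if (pipe(pipeFds) != 0) {
            return false;
        }
        fflush(stderr);
        savedStderr = dup(STDERR_FILENO);
        if (savedStderr < 0 || dup2(pipeFds[1], STDERR_FILENO) < 0) {
            return false;
        }
        close(pipeFds[1]); // fd 2 now holds the only write end
        return true;
    }

    // Restore stderr and return everything written since start().
    std::string stop() {
        fflush(stderr);
        dup2(savedStderr, STDERR_FILENO); // closes the last write end
        close(savedStderr);
        std::string captured;
        char buffer[256];
        ssize_t bytesRead = 0;
        // With no write end left open, read() reports EOF once drained.
        while ((bytesRead = read(pipeFds[0], buffer, sizeof(buffer))) > 0) {
            captured.append(buffer, static_cast<std::size_t>(bytesRead));
        }
        close(pipeFds[0]);
        return captured;
    }

  private:
    int pipeFds[2] = {-1, -1};
    int savedStderr = -1;
};
```

The sketch drains the pipe only in `stop()`, so output larger than the pipe capacity would block the writer. It is suitable only for short messages.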
Jaroslaw Warchulski
49152d957f test: cleanup in common matchers
IsAtLeastGen12LP / IsNotWithinXeGfxFamily / IsNotXe2HpgCore /
IsWithinXeHpgCoreAndXe3Core / IsWithinXeHpcCoreAndXe3Core -> deleted
IsAtLeastXeHpCore -> IsAtLeastXeCore
IsBeforeXeHpCore / IsBeforeXeHpgCore / IsAtMostXeHpCore -> IsGen12LP
IsXeHpOrXeHpgCore -> IsXeHpgCore
IsXeHpOrXeHpcCore -> IsXeHpcCore
IsNotXeHpOrXeHpgCore -> IsNotXeHpgCore
IsNotXeHpOrXeHpcCore -> IsNotXeHpcCore
IsWithinXeGfxFamily / IsXeHpcOrXeHpgCore / IsXeHpOrXeHpcOrXeHpgCore ->
IsXeCore
IsNotXeHpgOrXeHpcCore -> IsNotXeCore
IsWithinXeHpCoreAndXe2HpgCore -> IsWithinXeCoreAndXe2HpgCore
IsXeHpcCoreOrXe2HpgCore -> IsWithinXeHpcCoreAndXe2HpgCore
IsWithinXeHpCoreAndXe3Core -> IsWithinXeCoreAndXe3Core
IsXe2HpgCoreOrXe3Core -> IsWithinXe2HpgCoreAndXe3Core
IsAtLeastBmg -> IsAtLeastXe2HpgCore
IsBeforeXeHpcCore -> IsAtMostXeHpgCore
IsBeforeXe2HpgCore / IsAtMostArl / IsAtMostXeHpcCore -> IsAtMostXeCore

Related-To: NEO-12681
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-06-18 10:48:17 +02:00
Mateusz Jablonski
0a325e09be fix: correct initialization of Kernel::isBuiltIn
value is used in Kernel::initialize but was set after initialize call

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-06-11 18:29:06 +02:00
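Note on 0a325e09be: a minimal sketch of the ordering problem the message describes, assuming a flag that `initialize()` reads. `SketchKernel` and both factory functions are hypothetical stand-ins, not the real Kernel class.

```cpp
// Hypothetical stand-in, not the real NEO Kernel class.
struct SketchKernel {
    bool isBuiltIn = false;
    bool usedBuiltInPath = false;

    // initialize() reads isBuiltIn, so the flag must be final before the call.
    void initialize() { usedBuiltInPath = isBuiltIn; }
};

// Order described as incorrect: the flag is assigned after initialize() ran.
inline SketchKernel createSetAfterInitialize(bool builtIn) {
    SketchKernel kernel;
    kernel.initialize();
    kernel.isBuiltIn = builtIn;
    return kernel;
}

// Corrected order: assign the flag first, then initialize.
inline SketchKernel createSetBeforeInitialize(bool builtIn) {
    SketchKernel kernel;
    kernel.isBuiltIn = builtIn;
    kernel.initialize();
    return kernel;
}
```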
Kamil Kopryk
bb52c358b9 refactor: flush Task method
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-15011

Also correct ults
2025-06-11 16:36:56 +02:00
Lukasz Jobczyk
62619f9525 refactor: remove unused kernel tuning
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-08 14:18:21 +02:00
Kamil Kopryk
dd3d294f87 performance: cache MOCS values
This change caches the most used MOCS values:

* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CONST);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CACHELINE_MISALIGNED);
inside the gmmHelper class during initialization, to avoid repeated
virtual function calls, branches and/or gmm lib accesses.

It also adds more readable corresponding getters:
* getL1EnabledMOCS
* getL3EnabledMOCS
* getUncachedMOCS

If force all resources uncached is called,
these 3 cached MOCS values are reinitialized.

It also changes the order of gmmHelper members to avoid
unneeded padding after addressWidth
and simplifies the logic in the getMocsIndex function
for xehp and later products.

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-04-14 14:12:48 +02:00
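Note on dd3d294f87: a minimal sketch of the caching pattern the message describes, where the values are looked up once at construction and again only when everything is forced uncached. `MocsCacheSketch`, `UsageSketch`, the numeric values, and the mapping of usages to getters are assumptions made for illustration. Only the three getter names come from the commit message.

```cpp
#include <cstdint>

// Hypothetical stand-in for the helper described in the commit; the usage
// kinds, placeholder values, and usage-to-getter mapping are assumptions.
enum class UsageSketch { bufferConst, buffer, bufferCachelineMisaligned };

class MocsCacheSketch {
  public:
    MocsCacheSketch() { initCachedValues(); }

    // Getters return cached values; no lookup happens on the hot path.
    uint32_t getL1EnabledMOCS() const { return l1Enabled; }
    uint32_t getL3EnabledMOCS() const { return l3Enabled; }
    uint32_t getUncachedMOCS() const { return uncached; }

    // Forcing everything uncached invalidates the cache, so rebuild it.
    void forceAllResourcesUncached() {
        allUncached = true;
        initCachedValues();
    }

    uint32_t lookupCount = 0; // counts slow-path lookups, for illustration

  private:
    // Stands in for the expensive query (virtual calls, gmm lib access).
    uint32_t lookupMocs(UsageSketch usage) {
        ++lookupCount;
        if (allUncached || usage == UsageSketch::bufferCachelineMisaligned) {
            return 1u; // placeholder "uncached" value
        }
        return usage == UsageSketch::bufferConst ? 3u : 2u; // placeholders
    }

    void initCachedValues() {
        l1Enabled = lookupMocs(UsageSketch::bufferConst);
        l3Enabled = lookupMocs(UsageSketch::buffer);
        uncached = lookupMocs(UsageSketch::bufferCachelineMisaligned);
    }

    bool allUncached = false;
    uint32_t l1Enabled = 0;
    uint32_t l3Enabled = 0;
    uint32_t uncached = 0;
};
```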
Mateusz Hoppe
34dad4d3de refactor: store implicit args version in kernel
- adjust tests to work with implicitargs v0 and v1

Related-To: NEO-14115

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-03-10 22:58:57 +01:00
Mateusz Hoppe
291745cdf7 feature: add support for ImplicitArgs versioning
- define ImplicitArgs version 1

Related-To: NEO-14115

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-02-25 10:27:11 +01:00
Kamil Kopryk
b8723458e4 test: update ults to respect minimal simd
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-02-11 07:57:16 +01:00
Compute-Runtime-Validation
9b968066f2 Revert "test: update ults to respect minimal simd"
This reverts commit 4cb9baa2fc.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-08 00:06:29 +01:00
Kamil Kopryk
4cb9baa2fc test: update ults to respect minimal simd
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-02-07 09:32:42 +01:00
Bartosz Dunajski
c2dbdb6797 refactor: move blit post sync data to BlitProperties
Related-To: NEO-13003

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-01-22 18:35:21 +01:00
Kamil Kopryk
6926582915 feature: use bindless samplers in heapless ocl
Related-To: NEO-12741
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-01-22 11:49:48 +01:00
Kamil Kopryk
d263fda1ad feature: patch 64bit addresses for bindless images in heapless OCL
Related-To: NEO-12741
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-01-16 15:24:02 +01:00
Mateusz Hoppe
c08c9b7f86 refactor: change test to HWTEST
- move Heapful matcher to test macros
- check alignment in patchWithRequiredSize


Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-12-31 10:37:37 +01:00
Filip Hazubski
a0cc124b2e performance: Pass RootDeviceIndicesContainer by reference
Additionally pass std::map by reference in UsmMemAllocPoolsManager c-tor.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-12-17 14:18:30 +01:00
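Note on a0cc124b2e: a minimal sketch of why passing a container by const reference avoids work. `CountingContainer` is a hypothetical type that counts its own copies; the real RootDeviceIndicesContainer is not shown in this log and may differ.

```cpp
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Illustration-only copy counter.
static int copyCount = 0;

// Hypothetical container that records every copy made of it.
struct CountingContainer {
    std::vector<uint32_t> indices;

    explicit CountingContainer(std::vector<uint32_t> values)
        : indices(std::move(values)) {}
    CountingContainer(const CountingContainer &other)
        : indices(other.indices) {
        ++copyCount;
    }
};

// By value: every call copies the whole container.
inline std::size_t sizeByValue(CountingContainer container) {
    return container.indices.size();
}

// By const reference: the caller's object is read in place.
inline std::size_t sizeByReference(const CountingContainer &container) {
    return container.indices.size();
}
```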
Oskar Hubert Weber
7b4504549e test: allowing neo ULT build with optimization enabled 4/n
Related-To: NEO-8116

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2024-12-17 13:07:35 +01:00
Mateusz Jablonski
33ece63b91 fix: remove incorrect test kernel
The test uses image3d_t, which is not supported on all platforms.
Since the functionality can be tested with another kernel,
this one can be removed.

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-20 15:08:27 +02:00
Mateusz Jablonski
7adfa344c0 performance: get reference from getBindlessInfo
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-03 12:50:52 +02:00
Kamil Kopryk
d79889c3b2 test: correct expectations in opencl tests if heapless enabled 2/n
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-09-03 11:14:17 +02:00
Dominik Dabek
ad229377b9 fix: disable indirect detection if any stack calls
Don't know if kernels will be initialized in the order needed to check
for indirect accesses in stack calls.

Remove now unused functionPointerWithIndirectAccessExists and reading
this value from zebin.

Related-To: NEO-12235

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-08-07 14:48:58 +02:00
Fabian Zwoliński
b1a50104a8 fix: include dynamic SLM in clGetKernelWorkGroupInfo and zeKernelGetProperties
Current implementation only takes static slmInlineSize into account.
With this change we also include dynamic SLM passed as kernel arguments.

Related-To: NEO-5761
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-07-30 17:59:45 +02:00
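Note on b1a50104a8: a minimal sketch of the reporting change, assuming the reported size is the static inline size plus the sizes of all dynamically sized local-memory arguments. `totalSlmSize` and its parameters are hypothetical. Any alignment or rounding the driver applies is not shown in this log and is ignored here.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical helper: static SLM known at compile time plus the sizes of
// SLM buffers passed as kernel arguments. Alignment is deliberately ignored.
inline uint64_t totalSlmSize(uint32_t staticSlmInlineSize,
                             const std::vector<uint32_t> &dynamicSlmArgSizes) {
    uint64_t total = staticSlmInlineSize;
    for (uint32_t argSize : dynamicSlmArgSizes) {
        total += argSize;
    }
    return total;
}
```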
Grzegorz Choinski
46f2568902 build: fixes for windows clang with -m32
Related-To: NEO-10748
Signed-off-by: Grzegorz Choinski <grzegorz.choinski@intel.com>
2024-07-18 14:49:56 +02:00
Jaroslaw Warchulski
63da79798d refactor: reduce the number of GraphicsAllocation constructors and derivatives
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-06-18 21:31:37 +02:00
Mateusz Jablonski
55fad237b8 test: update gtest API usage
use *_TEST_SUITE* instead of *_TEST_CASE*

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-05-28 16:20:06 +02:00
Dominik Dabek
ae8c7589dc refactor: move implicit arg has indirect access
Move implicit arg has indirect access boolean to kernelAttributes

Related-To: NEO-11396

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 13:11:04 +02:00
Dominik Dabek
fd47030ad6 fix: use igc indirect detection v3
Update to use igc indirect detection v3. Fix for not detecting indirects
passed as implicit arguments.

Related-To: NEO-11396

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 06:35:42 +02:00
Szymon Morek
e35b951a00 performance: Allow indirect allocs as pack on OpenCL
Related-To: NEO-11228

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-10 17:57:42 +02:00
Maciej Plewka
e39893485c fix: add cache flush as dependency for bcs ccs synchronization
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-09 13:43:39 +02:00
Katarzyna Cencelewska
e9f7df6ae6 refactor: create helper for maxPtssIndex
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-05-07 12:47:52 +02:00
Fabian Zwoliński
ee71157f7f fix: opencl support for bindless kernels
Related-To: NEO-11156
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-04-30 12:02:17 +02:00
Jaroslaw Warchulski
ea057abd36 fix: allow empty set to be passed to clSetKernelExecInfo
Related-To: GSD-8538
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-04-16 14:11:04 +02:00
Mateusz Jablonski
cb2b572e94 feature: add support for null aub mode
In this mode the AUB csr will be created, but no aub file will be created.

Related-To: NEO-11097
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-09 16:59:42 +02:00
Kamil Kopryk
4eae28bd64 feature: introduce heapless state init in L0
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-04-02 12:34:53 +02:00
Dominik Dabek
2b964254d6 performance: debug key for adjust ULLS on battery
ULLS controller timeout settings will be adjusted based on ac line
status and lowest queue throttle from submissions.

Lowest queue throttle is reset when controller stops ULLS.

Related-To: NEO-10800

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-22 14:24:00 +01:00
Mrozek, Michal
f71f6d2b72 refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 18:18:55 +01:00
Kamil Kopryk
168445784e feature: introduce states programming at driver init heapless ocl
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-03-08 12:29:44 +01:00
Mateusz Jablonski
373ef608bc refactor: unify calculating max group size between OCL and L0
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-27 13:32:10 +01:00
Mateusz Jablonski
a697a3f718 refactor: create new members for storing spill and private memory in scratch
rename private scratch space into scratch space slot 1 as it can be generic

Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-23 12:42:25 +01:00
Compute-Runtime-Validation
f9f9035b95 Revert "refactor: create new members for storing spill and private memory in ...
This reverts commit 87eb5f554a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-23 09:13:00 +01:00
Mateusz Jablonski
87eb5f554a refactor: create new members for storing spill and private memory in scratch
rename private scratch space into scratch space slot 1 as it can be generic

Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-22 19:48:48 +01:00
Dunajski, Bartosz
b266f1f3cc refactor: improve implicit scaling interface
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-19 14:09:04 +01:00
Dominik Dabek
e0ccf22557 fix: indirect access in external functions
Read indirect_stateless_count in module external functions.

If greater than 0, mark all kernels that have the has_stack_calls flag
set from this module as having indirect accesses.

Related-To: NEO-7712

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-01-18 15:29:41 +01:00
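Note on e0ccf22557: a minimal sketch of the propagation rule stated in the message. If any external function in a module reports an indirect stateless count greater than 0, every kernel in that module with the stack-calls flag is marked as having indirect accesses. The struct and function names are hypothetical, and the real module data layout is not shown in this log.

```cpp
#include <algorithm>
#include <cstdint>
#include <vector>

// Hypothetical per-kernel flags.
struct KernelFlagsSketch {
    bool hasStackCalls = false;
    bool hasIndirectAccess = false;
};

// Hypothetical per-external-function metadata.
struct ExternalFunctionSketch {
    uint32_t indirectStatelessCount = 0;
};

inline void propagateIndirectAccess(
    const std::vector<ExternalFunctionSketch> &externalFunctions,
    std::vector<KernelFlagsSketch> &kernels) {
    const bool anyIndirect = std::any_of(
        externalFunctions.begin(), externalFunctions.end(),
        [](const ExternalFunctionSketch &function) {
            return function.indirectStatelessCount > 0;
        });
    if (!anyIndirect) {
        return; // nothing to propagate
    }
    // Only kernels that make stack calls can reach external functions.
    for (auto &kernel : kernels) {
        if (kernel.hasStackCalls) {
            kernel.hasIndirectAccess = true;
        }
    }
}
```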
Mateusz Jablonski
a4888b39c6 build: add clang-tidy restriction for Enum case
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-21 08:58:51 +01:00
Mateusz Jablonski
a6458433dc refactor: correct naming of enum class constants 9/n
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 10:36:25 +01:00
Mateusz Jablonski
dd1b9d6abc refactor: correct naming of enum class constants 8/n
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 08:18:18 +01:00
Dunajski, Bartosz
d99104d5bf refactor: improve ImplicitArg struct handling
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-18 15:19:00 +01:00
Ben Ashbaugh
976c6d3acc feature: initial register count query implementation
Adds a per-kernel and per-device query to determine the
number of GRF registers that a kernel was compiled for.
This is an informal query for now, but may be added to
a formally supported extension in the future.

Related-To: NEO-9807

Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2023-12-18 11:45:01 +01:00