370 Commits

Author SHA1 Message Date
Wenbin Lu a82b5182f6 fix: improve the handling of private surface allocation failures
Related-To: NEO-12664

Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-10-11 01:18:25 +02:00
Artur Harasimiuk 00aedcaed4 refactor: add NOLINT for pending problems
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-10-10 11:46:35 +02:00
Wojciech Konior 6b40f9bc5a refactor: engineInstancedType removed
Related-To: NEO-12594

Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2024-10-09 16:30:48 +02:00
Maciej Plewka 73e4b6ae7c fix: remove w/a which disables wmtp in kernels with ray tracing
Related-To: NEO-12872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-10-07 14:28:08 +02:00
Andrzej Koska 6abc5eb1a1 fix: using releaseHelper to determine MTP enablement
Related-To: NEO-12466

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-10-01 15:06:07 +02:00
John Falkowski 1d51f4b91c feature: Add driver-experimental API for retrieval of kernel binary program data
Related-To: NEO-11651

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2024-09-25 20:38:17 +02:00
Compute-Runtime-Validation 5dddd4a67f Revert "feature: Add experimental API for retrieval of kernel binary program ...
This reverts commit 24682e702b.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-09-25 10:01:36 +02:00
John Falkowski 24682e702b feature: Add experimental API for retrieval of kernel binary program data
Related-To: NEO-11651

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2024-09-24 02:48:29 +02:00
Zbigniew Zdanowicz 672d8414f5 fix: remove not needed macro
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-09-19 17:01:17 +02:00
Fabian Zwoliński a6ddb61a5f fix: consider only misaligned buffers in getSurfaceStateHeapDataSize
Adding checking for the presence of implicit args
was only a workaround for the time of implementing
support for bindless `const_base`, `global_base` in IGC.

Related-To: NEO-9855, NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-09-18 13:13:02 +02:00
Bartosz Dunajski b8fb16c603 fix: initialize kernel members at the beginning
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-18 11:13:35 +02:00
Zbigniew Zdanowicz 7e00590994 performance: get work group count per tile value when setting new group size
- change interface to function to accept external group size

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-09-16 10:45:01 +02:00
Mateusz Hoppe 7cd9a2aa60 refactor: change name of variable
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-13 03:06:44 +02:00
Bartosz Dunajski 4f1262645b refactor: pass extra walker params
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-10 15:30:03 +02:00
Filip Hazubski beaddb55e0 fix: Minor code changes
Remove needless checks for module and cmdList.
Add explicit check for surfaceState in setArgImageWithMipLevel.
Explicitly initialize WddmResidencyController::csr.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-09-10 13:31:15 +02:00
Zbigniew Zdanowicz 0628d97ee1 feature: update processing kernel residency
- save position of kernel internal container when allocation can change
- reuse the same position when new allocation arrives
- add index container for additional allocation of image argument
- save position of additional allocation of image argument
- reuse position for new image argument

Related-To: NEO-11719

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-09-09 14:10:02 +02:00
Fabian Zwoliński 38e1614f4a fix: create and use new allocation type for syncBuffer
Related-To: NEO-11533
Related-To: HSD-18039788811
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-09-06 14:11:33 +02:00
Mateusz Jablonski 14fd9f3f24 fix: correct using L0 loader functions
use zelLoaderTranslateHandle for translating handle to internal handle
get pointer to zelSetDriverTeardown during global ctor
don't load loader library by name
get loader function pointers directly from current process

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-04 17:30:25 +02:00
Mateusz Jablonski 7adfa344c0 performance: get reference from getBindlessInfo
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-03 12:50:52 +02:00
Mateusz Hoppe d5e275cb49 performance: optimize surface state programming
- eliminate read-modify-write on gfx memory when global bindless mode
enabled

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-02 15:31:30 +02:00
Fabian Zwoliński 6a50320f3c fix: use gpuAddress when patching sync buffer
Related-To: NEO-11533
Related-To: HSD-18039788811
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-28 17:47:35 +02:00
Mateusz Jablonski d45c16dfc2 fix: add fallback for invalid handles in extension functions
handle context, commandlist, driver, device, event, image and kernel handles

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-28 17:22:35 +02:00
Mateusz Hoppe c9e8931a18 fix: global bindless offsets for implicit args
- do not patch bindless offsets for implicit args with global bindless
allocator, the offsets are patched at kernel initialization time and
do not change

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-08-23 13:21:27 +02:00
Winston Zhang 0590b34cfa feature: refactor and rewrite setErrorDescription
Related-To: NEO-8379

Signed-off-by: Winston Zhang <winston.zhang@intel.com>
2024-08-21 17:26:25 +02:00
Dominik Dabek ad229377b9 fix: disable indirect detection if any stack calls
Don't know if kernels will be initialized in the order needed to check
for indirect accesses in stack calls.

Remove now unused functionPointerWithIndirectAccessExists and reading
this value from zebin.

Related-To: NEO-12235

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-08-07 14:48:58 +02:00
Fabian Zwoliński b1a50104a8 fix: include dynamic SLM in clGetKernelWorkGroupInfo and zeKernelGetProperties
Current implementation only takes static slmInlineSize into account.
With this change we also include dynamic SLM passed as kernel arguments.

Related-To: NEO-5761
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-07-30 17:59:45 +02:00
Zbigniew Zdanowicz 42bf58a800 refactor: do not add all mapped physical allocations to kernel residency
Related-To: NEO-11719

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-07-24 15:43:48 +02:00
Zbigniew Zdanowicz 1c1e437d4b refactor: split kernel residency into internal and argument containers
Related-To: NEO-11719

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-07-23 17:22:16 +02:00
Zbigniew Zdanowicz f61ab615e7 refactor: do not add bindless heaps to kernel residency
Related-To: NEO-11719

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-07-22 22:44:30 +02:00
Mateusz Hoppe 4c49a08017 feature: add inline samplers bindless addressing support
- inline samplers in bindless addressing mode require bindless offset
passed in cross thread data

Related-To: NEO-11748

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-06-24 13:02:08 +02:00
Bartosz Dunajski 692def2c79 feature: region group barrier allocation support
Related-To: NEO-11031

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-03 18:34:54 +02:00
Zbigniew Zdanowicz 3e1d276233 fix: add missing new line character in print log
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-05-23 17:51:15 +02:00
Zbigniew Zdanowicz 294c3b77ba refactor: add level zero console logging for kernel buffer arguments
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-05-23 11:01:38 +02:00
Dominik Dabek ae8c7589dc refactor: move implicit arg has indirect access
Move implicit arg has indirect access boolean to kernelAttributes

Related-To: NEO-11396

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 13:11:04 +02:00
Dominik Dabek fd47030ad6 fix: use igc indirect detection v3
Update to use igc indirect detection v3. Fix for not detecting indirects
passed as implicit arguments.

Related-To: NEO-11396

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 06:35:42 +02:00
Mateusz Hoppe 76af952fcb feature: Bindless sampled image support
Related-To: NEO-10352

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-05-08 17:50:37 +02:00
John Falkowski f1c2ea0b69 fix: kernel access to multiple stateful virtual regions
Related-to: NEO-8350

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2024-04-26 11:18:24 +02:00
Compute-Runtime-Validation 04c32d9e4e Revert "fix: allow kernel access across multiple virtual regions"
This reverts commit b9c1ef65dd.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-04-24 04:00:15 +02:00
John Falkowski b9c1ef65dd fix: allow kernel access across multiple virtual regions
Related to: NEO-8350

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2024-04-23 18:20:52 +02:00
Zbigniew Zdanowicz c60b19a8ba fix: use required walk order when local ids are software generated
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-04-19 08:48:30 +02:00
Zbigniew Zdanowicz 96347c7847 fix: reset per thread size after local id hw generation enabled
Related-To: NEO-10066

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-04-17 14:16:06 +02:00
Mateusz Hoppe f86d4220a5 feature: add bindless samplers support to level zero
- samplers using bindless addressing require patching bindless offsets to
sampler states on kernel's cross thread data

Related-To: NEO-10505

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-29 08:07:28 +01:00
Katarzyna Cencelewska da7b03dd15 fix: to always use grfs count in calculateNumThreadsPerThreadGroup
grf size != grf count

Related-To: GSD-8437
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-03-22 11:03:18 +01:00
Katarzyna Cencelewska dd1d52259e refactor: add param rootDeviceEnvironment to calculateNumThreadsPerThreadGroup
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-03-21 22:25:14 +01:00
Zbigniew Zdanowicz b3be51fc35 refactor: add checker for stateless only kernels
Related-To: NEO-10381

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-21 21:54:55 +01:00
Mrozek, Michal f71f6d2b72 refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 18:18:55 +01:00
John Falkowski cebedb50c7 feature: get kernel max group size
Implementing support for ze_kernel_max_group_size_ext_properties_t

Resolves: NEO-9617

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2024-02-27 08:40:55 +01:00
Dunajski, Bartosz 5085300449 feature: query to get kernel/module register sizes
Related-To: NEO-7357

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-15 13:00:00 +01:00
Fabian Zwolinski f55cb596c7 fix: remove unnecessary allocation add to residencyContainer
Image implicit args allocation is not related to redescribed image
and is already added to the residencyContainer in setArgImage.

Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-02-02 16:26:33 +01:00
Mateusz Jablonski a697a3f718 refactor: create new members for storing spill and private memory in scratch
rename private scratch space into scratch space slot 1 as it can be generic

Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-23 12:42:25 +01:00