Commit Graph

417 Commits

Author SHA1 Message Date
Maciej Bielski
fe174328eb refactor: crossThreadData as std::vector
Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-08-08 18:44:07 +02:00
Michal Mrozek
52e413f69e performance: reuse allocation data if available
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-08-08 09:36:35 +02:00
Michal Mrozek
f85e4935e2 performance: reuse allocData if available
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-08-07 18:26:36 +02:00
Mateusz Hoppe
b7a73653a2 fix: set sync buffer ptr in implicit args
Related-To: NEO-15737

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-08-04 14:22:46 +02:00
Maciej Bielski
a0231c2d22 feature: allow for cloning KernelImp instances
From the `cloneWithStateOverride()` perspective, `KernelImp` has
two logical parts handled differently:
- `KernelMutableState`: separate for both regular and cloned instances,
may be optionally overridden when cloning
- rest: separate for regular instances but shared with all clones
(shallow copy), the original instance owns these resources

Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-08-01 11:51:45 +02:00
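
The split described in a0231c2d22 can be illustrated with a minimal sketch. Only `cloneWithStateOverride()` and the idea of a separate mutable state come from the commit message; every type, member and signature below (`SketchKernel`, `SketchMutableState`, `SketchSharedData`, the optional-pointer override) is a hypothetical stand-in and not the actual `KernelImp` code.

```cpp
#include <cassert>
#include <cstdint>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical per-instance state: every kernel, original or clone, has its own copy.
struct SketchMutableState {
    std::vector<std::uint8_t> crossThreadData;
    std::uint32_t groupSize[3] = {1, 1, 1};
};

// Hypothetical shared part: owned by the original, only referenced by clones.
struct SketchSharedData {
    std::vector<std::uint8_t> isa;
};

class SketchKernel {
  public:
    // Regular instance: creates and owns its shared data.
    explicit SketchKernel(std::vector<std::uint8_t> isa)
        : owned(std::make_unique<SketchSharedData>(SketchSharedData{std::move(isa)})),
          shared(owned.get()) {}

    // Clone: shallow-copies the shared part and copies the mutable state,
    // unless an override for the mutable state is supplied (assumed signature).
    std::unique_ptr<SketchKernel> cloneWithStateOverride(const SketchMutableState *stateOverride = nullptr) const {
        return std::unique_ptr<SketchKernel>(
            new SketchKernel(shared, stateOverride ? *stateOverride : state));
    }

    const SketchSharedData *sharedData() const { return shared; }
    bool ownsSharedData() const { return owned != nullptr; }

    SketchMutableState state;

  private:
    // Used only for clones: no ownership of the shared data is taken.
    SketchKernel(const SketchSharedData *sharedFromOriginal, SketchMutableState clonedState)
        : state(std::move(clonedState)), shared(sharedFromOriginal) {
        assert(shared != nullptr);
    }

    std::unique_ptr<SketchSharedData> owned; // null in clones
    const SketchSharedData *shared;          // clones must not outlive the original
};
```

In this sketch a clone created without an override starts from a copy of the original's mutable state, and later changes to either instance do not affect the other. The shared data stays valid only while the original instance is alive, which mirrors the ownership rule stated in the commit message.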
Maciej Bielski
c396367411 feature: Record&Replay support for appendLaunchKernel
Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-07-28 12:19:30 +02:00
Mateusz Hoppe
be483491bf feature: extract indirect access buffer major version from ELF's notes
- use version from ELF to create correct layout of implicit args
Related-To: NEO-15574

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-07-25 14:50:49 +02:00
Maciej Bielski
a755ad131d feature: implement KernelMutableState operator=
Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-07-25 14:37:04 +02:00
Mateusz Hoppe
575fe7fe33 fix: remove local dispatch support
Related-To: NEO-14561

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-07-24 10:31:18 +02:00
Compute-Runtime-Validation
357fd3108e Revert "refactor: move surface state programming method to base class"
This reverts commit 5893fb08fb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-07-23 04:37:46 +02:00
Zbigniew Zdanowicz
5893fb08fb refactor: move surface state programming method to base class
Related-To: NEO-15374

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-22 19:09:03 +02:00
Mateusz Hoppe
b17ed79618 fix: update implicitArgs versions
- fix layout of implicit args
- add enqueued local size and sync buffer ptr

Related-To: NEO-15160

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-07-22 15:12:55 +02:00
Maciej Bielski
35ecde70df refactor: extract parts of KernelImp data for easier capturing
`KernelImp` is a virtual class, so capturing the current kernel state
cannot be done by simply copying an instance of the class. However, this can
be done by extracting the relevant data to a separate class and
capturing a copy of its instance.

Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-07-17 23:00:22 +02:00
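
The approach described in 35ecde70df, moving the capturable data of a polymorphic class into a separate plain class and copying that, might look roughly like the sketch below. All names here (`CapturableState`, `KernelInterface`, `KernelWithExtractedState`, `captureState()`, `restoreState()`) are hypothetical and chosen for illustration; the actual classes and members in the repository may differ.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical plain-data class holding everything worth capturing.
struct CapturableState {
    std::vector<std::uint8_t> crossThreadData;
    std::uint32_t groupSize[3] = {1, 1, 1};
};

// Polymorphic interface: an abstract class cannot be copied by value.
class KernelInterface {
  public:
    virtual ~KernelInterface() = default;
    virtual void setArgByte(std::size_t offset, std::uint8_t value) = 0;
};

class KernelWithExtractedState : public KernelInterface {
  public:
    explicit KernelWithExtractedState(std::size_t crossThreadDataSize) {
        state.crossThreadData.resize(crossThreadDataSize);
    }

    // Bounds-checked write into the per-kernel data.
    void setArgByte(std::size_t offset, std::uint8_t value) override {
        state.crossThreadData.at(offset) = value;
    }

    // Capturing is an ordinary value copy of the extracted data.
    CapturableState captureState() const { return state; }

    // Restoring assigns a previously captured copy back.
    void restoreState(const CapturableState &captured) { state = captured; }

    const CapturableState &currentState() const { return state; }

  private:
    CapturableState state;
};
```

Because the snapshot is a plain value, it stays unchanged when the kernel is modified afterwards and can be assigned back later.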
Zbigniew Zdanowicz
ef5efeac0c refactor: add kernel properties and move implementations to imp class
Related-To: NEO-15374

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-07-15 15:08:44 +02:00
Zbigniew Zdanowicz
442c11b5ba feature: add mutable command list implementation
Related-To: NEO-10492

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-06-25 16:32:08 +02:00
Mateusz Hoppe
b7580a3998 fix: store image arg size in kernel descriptor
- use arg size when patching bindless offset

Related-To: HSD-18042502539

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-06-23 13:37:44 +02:00
Kamil Kopryk
cdf087f3da fix: correct bindless offsets for L0 bindless images extension with heapless
Related-To: NEO-14710
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-06-16 11:50:18 +02:00
Damian Tomczak
5cd1423bab feature: packed surface format
Resolves: NEO-13669

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-06-12 17:32:16 +02:00
Mateusz Hoppe
534f7837fe refactor: add validateWorkgroupSize() method
Related-To: NEO-14209

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-06-11 18:17:57 +02:00
Compute-Runtime-Validation
9fd521f471 Revert "refactor: add validateWorkgroupSize() method"
This reverts commit 708b81bab5.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-06-07 17:25:49 +02:00
Mateusz Hoppe
708b81bab5 refactor: add validateWorkgroupSize() method
Related-To: NEO-14209

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-06-06 09:45:45 +02:00
Bartosz Dunajski
d9ec4b1a18 refactor: improve thread data size getter
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-06-04 11:27:56 +02:00
Chodor, Jaroslaw
46c60290b9 refactor: introducing igfxfmid_wrapper
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-05-30 19:26:04 +02:00
Katarzyna Cencelewska
e357e7e404 fix: correct limitation for num threads per thread group
taking into account the max work group limit
for simd1, don't use the same limitation as for simd32

Resolves: NEO-14922
Related-To: NEO-11881
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2025-05-29 16:34:40 +02:00
Compute-Runtime-Validation
593c9e76f2 Revert "fix: correct limitation for num threads per thread group"
This reverts commit 6ad4ad41b1.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-05-23 02:51:17 +02:00
Katarzyna Cencelewska
6ad4ad41b1 fix: correct limitation for num threads per thread group
taking into account the max work group limit

Resolves: NEO-14922
Related-To: NEO-11881
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2025-05-22 00:06:21 +02:00
Mateusz Jablonski
0dd2535674 refactor: change type of stype member in L0 Core driver extensions
use uint32_t-alias type to avoid casting out of ze_structure_type_t enum range
Related-To: NEO-12901

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-18 15:02:31 +02:00
Vysochyn, Illia
146a7f7b9b feature: Add support for BUFFER_SIZE explicit argument
Related-To: NEO-13972

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-04-07 14:15:26 +02:00
Filip Hazubski
504440fc4d feature: Add ftrHeaplessMode flag
Pass hwInfo to isHeaplessModeEnabled and isForceBindlessRequired functions.

Related-To: NEO-14526

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-04-02 21:06:05 +02:00
Mateusz Jablonski
2394f9fd91 feature: update base layout of L0 handles to match ze_handle_t layout
this layout is required by L0 DDI Handle extension

Related-To: NEO-13121, NEO-13917
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-03-28 14:40:20 +01:00
Szymon Morek
ead0842763 feature: add L0 API to query kernel argument info
Related-To: NEO-14358

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-27 16:43:33 +01:00
Szymon Morek
f8ae044db9 fix: correctly set workgroup size for kernels
Related-To: NEO-14367

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-17 14:53:17 +01:00
Kamil Kopryk
2c77d2c698 refactor: add SetMaxBVHLevels debug flag
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-14 16:59:48 +01:00
Kamil Kopryk
c23f3c0666 fix: correct programming inline samplers with bindless addressing L0
Related-To: NEO-14216
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-12 09:01:29 +01:00
Mateusz Hoppe
7aee5e1b7e refactor: extend patchRegionParams with group count
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-02-27 11:54:18 +01:00
Compute-Runtime-Validation
9785e5c11e Revert "feature: update base layout of L0 handles to match ze_handle_t layout"
This reverts commit 276c606329.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-26 02:37:14 +01:00
Mateusz Hoppe
291745cdf7 feature: add support for ImplicitArgs versioning
- define ImplicitArgs version 1

Related-To: NEO-14115

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-02-25 10:27:11 +01:00
Mateusz Jablonski
276c606329 feature: update base layout of L0 handles to match ze_handle_t layout
this layout is required by L0 DDI Handle extension

Related-To: NEO-13121, NEO-13917
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-25 01:23:43 +01:00
Filip Hazubski
6b2b42972a fix: Add asserts to ensure NonCopyable and NonMovable 1/n
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 09:41:20 +01:00
Zbigniew Zdanowicz
9419e702cc fix: zero kernel residency container for shared system usm pointers
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-02-05 19:07:07 +01:00
Zbigniew Zdanowicz
4c47364dfd refactor: add getter to read kernel argument values
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-01-31 13:30:48 +01:00
Zbigniew Zdanowicz
621ceaf9ec feature: add new properties and getters for kernel slm arguments
- add new vector to read slm offset
- add getter to read new vector
- add getter to read slm argument alignment
- add getter to read slm sizes vector

Related-To: NEO-12933

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-01-15 14:23:04 +01:00
Bartosz Dunajski
db3f268a97 refactor: reorganize l0 api include directories [2/n]
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-20 17:16:53 +01:00
Kamil Kopryk
b724c964fc performance: reserve vector before push_back's
Related-To: NEO-12742
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-12-18 10:07:56 +01:00
Kamil Kopryk
3be52fa1d0 refactor: Add ult with check if bindless ss is patched correctly
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-12-18 09:30:13 +01:00
Zbigniew Zdanowicz
56b15f17f7 refactor: unify further calculation to get max work group count
- move available device calculation into common helper
- change interface to have code available where no descriptor is available
- expand unit test for implementation of new interface

Related-To: NEO-13350

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-29 17:51:05 +01:00
Zbigniew Zdanowicz
92ebbe75c0 feature: add kernel getters for sync buffer and region barrier allocations
Related-To: NEO-13350

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-28 10:29:48 +01:00
Wenbin Lu
a82b5182f6 fix: improve the handling of private surface allocation failures
Related-To: NEO-12664

Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-10-11 01:18:25 +02:00
Artur Harasimiuk
00aedcaed4 refactor: add NOLINT for pending problems
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-10-10 11:46:35 +02:00
Wojciech Konior
6b40f9bc5a refactor: engineInstancedType removed
Related-To: NEO-12594

Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2024-10-09 16:30:48 +02:00