Mateusz Jablonski
08bc465307
refactor: return scratch ptr offset as optional
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-12-10 15:10:06 +01:00
Mateusz Jablonski
405b39f722
fix: move getting scratchPtr offset method to ImplicitArgs class
...
Related-To: NEO-16649
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-11-27 16:06:44 +01:00
Mateusz Hoppe
3b93e23156
test: improve test coverage
...
- in decodeSingleDeviceBinary()
- in ImplicitArgs
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-09-24 15:16:27 +02:00
Mateusz Hoppe
12263b2e7c
fix: implicit arg buffer
...
- refactor validating target device and dcoding Intel GT Notes
- decoded versions are stored in singleDeviceBinary when decoding zebin
- adds parsing Intel GT notes from elf when unpacking binary
- sets indirectAccessBufferMajorVersion with correct value
- fix ImplicitArgsV1 - add simdWidth
- use correct simd size in patchImplicitArgs()
Related-To: NEO-16167, NEO-15211, IGC-12358
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-09-24 09:23:23 +02:00
Mateusz Hoppe
b17ed79618
fix: update implicitArgs versions
...
- fix layout of implicit args
- add enqueued local size and sync buffer ptr
Related-To: NEO-15160
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-07-22 15:12:55 +02:00
Katarzyna Cencelewska
e357e7e404
fix: correct limitation for num threads per thread group
...
taking into account the max work group limit
when simd1 don't use the same limitation as for simd32
Resolves: NEO-14922
Related-To: NEO-11881
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2025-05-29 16:34:40 +02:00
Compute-Runtime-Validation
593c9e76f2
Revert "fix: correct limitation for num threads per thread group"
...
This reverts commit 6ad4ad41b1 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-23 02:51:17 +02:00
Katarzyna Cencelewska
6ad4ad41b1
fix: correct limitation for num threads per thread group
...
taking into account the max work group limit
Resolves: NEO-14922
Related-To: NEO-11881
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2025-05-22 00:06:21 +02:00
Mateusz Hoppe
d3b11d1527
fix: correctly patch implicit arg buffer in indirect data
...
- use correct size alignment of implicit arg buffer, crossthread data
should start after the buffer without extra padding
Related-To: NEO-14449
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-05-06 13:25:11 +02:00
Compute-Runtime-Validation
26d3c7527c
Revert "fix: correctly patch implicit arg buffer in indirect data"
...
This reverts commit e345d55fe5 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-06 04:38:50 +02:00
Mateusz Hoppe
e345d55fe5
fix: correctly patch implicit arg buffer in indirect data
...
- use correct size alignment of implicit arg buffer, crosshtread data
should start after the buffer without extra padding
Related-To: NEO-14449
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-05-05 16:03:57 +02:00
Mateusz Jablonski
523e53c862
fix: passing hw generation of local ids for implicit args patching
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-04-29 16:28:23 +02:00
Mateusz Hoppe
291745cdf7
feature: add support for ImplicitArgs versioning
...
- define ImplicitArgs version 1
Related-To: NEO-14115
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-02-25 10:27:11 +01:00
Kamil Kopryk
2a9bcdeb83
refactor: pass outImplicitArgs to patchImplicitArgs function
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-08-05 17:31:47 +02:00
Katarzyna Cencelewska
da7b03dd15
fix: to always use grfs count in calculateNumThreadsPerThreadGroup
...
grf size != grf count
Related-To: GSD-8437
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2024-03-22 11:03:18 +01:00
Katarzyna Cencelewska
dd1d52259e
refactor: add param rootDeviceEnvironment to calculateNumThreadsPerThreadGroup
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2024-03-21 22:25:14 +01:00
Dunajski, Bartosz
d99104d5bf
refactor: improve ImplicitArg struct handling
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-12-18 15:19:00 +01:00
Mateusz Hoppe
9fd7f9cf05
fix: set ImplicitArgs size to size of defined fields
...
Resolves: NEO-8169
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-07-12 21:30:32 +02:00
Cencelewska, Katarzyna
61f701aba5
fix: Unify logic calculating threads per work group part 3
...
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-07-04 15:27:44 +02:00
Cencelewska, Katarzyna
2e17c21728
fix: Unify logic calculating threads per work group part 2
...
- use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to
have same flow and proper values of threads per work groups
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-07-04 10:34:02 +02:00
Compute-Runtime-Validation
39740da9d1
Revert "fix: Unify logic calculating threads per work group part 2"
...
This reverts commit 1e8a53bd53 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-07-02 07:09:14 +02:00
Cencelewska, Katarzyna
1e8a53bd53
fix: Unify logic calculating threads per work group part 2
...
- use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to
have same flow and proper values of threads per work groups
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-06-30 14:16:08 +02:00
Mateusz Jablonski
31f32cc16e
fix implicit args: generate local ids as for grf size 32
...
Related-To: IGC-6936
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-04-07 11:37:07 +02:00
Warchulski, Jaroslaw
8de3898abc
Cleanup includes 31
...
Cleaned up files:
shared/source/command_stream/experimental_command_buffer.h
shared/source/helpers/get_info.h
shared/source/helpers/local_id_gen.h
shared/source/memory_manager/gfx_partition.h
shared/source/memory_manager/host_ptr_manager.h
shared/source/memory_manager/prefetch_manager.h
shared/test/common/mocks/mock_memory_manager.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com >
2023-01-16 11:45:55 +01:00
Warchulski, Jaroslaw
3d59dce80c
Cleanup includes 27
...
Cleaned up files:
opencl/source/command_queue/command_queue.h
shared/source/built_ins/registry/built_ins_registry.h
shared/source/kernel/kernel_descriptor.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com >
2023-01-11 16:10:28 +01:00
Artur Harasimiuk
4bdd8860a1
test.h refactor
...
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com >
2022-07-04 18:20:07 +02:00
Mateusz Jablonski
82e3b10c5a
Fix typo
...
Related-To: NEO-5081
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-02-25 18:10:41 +01:00
Mateusz Jablonski
a2386ad216
Correct programming of implicit args on pre-XeHp platforms
...
On pre-XeHp platforms implicit args aren't at the beginning of indirect data,
GPU address of implicit args buffer is programmed within cross thread data
Related-To: NEO-5081, IGC-4710
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-02-24 20:52:04 +01:00
Mateusz Jablonski
ea6f089e17
Unify implicit args programming across APIs
...
Related-To: NEO-5081
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-02-23 11:52:47 +01:00
Mateusz Jablonski
4f71aaf595
Handle SIMD-1 scenario when programming local ids for implicit args
...
according to implicit args design for SIMD-1 local ids are one-by-one
Resolves: NEO-6692
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-02-17 17:48:54 +01:00
Mateusz Jablonski
b697d75695
Correct dimension order in local ids generated for implicit args
...
when local ids are generated by HW, use same dim order for runtime generation
move common logic to separated file
Related-To: NEO-5081
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-02-04 12:46:59 +01:00