Compute-Runtime-Validation
593c9e76f2
Revert "fix: correct limitation for num threads per thread group"
...
This reverts commit 6ad4ad41b1 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-23 02:51:17 +02:00
Katarzyna Cencelewska
6ad4ad41b1
fix: correct limitation for num threads per thread group
...
taking into account the max work group limit
Resolves: NEO-14922
Related-To: NEO-11881
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2025-05-22 00:06:21 +02:00
Vysochyn, Illia
f99a4c2193
feature: Define thread group dispatch size according to kernel metadata
...
Related-To: NEO-10945
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com >
2025-05-19 16:02:21 +02:00
Mateusz Hoppe
d3b11d1527
fix: correctly patch implicit arg buffer in indirect data
...
- use correct size alignment of implicit arg buffer, crossthread data
should start after the buffer without extra padding
Related-To: NEO-14449
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-05-06 13:25:11 +02:00
Compute-Runtime-Validation
26d3c7527c
Revert "fix: correctly patch implicit arg buffer in indirect data"
...
This reverts commit e345d55fe5 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-06 04:38:50 +02:00
Mateusz Hoppe
e345d55fe5
fix: correctly patch implicit arg buffer in indirect data
...
- use correct size alignment of implicit arg buffer, crosshtread data
should start after the buffer without extra padding
Related-To: NEO-14449
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-05-05 16:03:57 +02:00
Mateusz Jablonski
523e53c862
fix: passing hw generation of local ids for implicit args patching
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-04-29 16:28:23 +02:00
Vysochyn, Illia
146a7f7b9b
feature: Add support for BUFFER_SIZE explicit argument
...
Related-To: NEO-13972
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com >
2025-04-07 14:15:26 +02:00
Mateusz Hoppe
291745cdf7
feature: add support for ImplicitArgs versioning
...
- define ImplicitArgs version 1
Related-To: NEO-14115
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-02-25 10:27:11 +01:00
Michal Mrozek
363aca782f
refactor: remove not needed code.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2024-09-19 18:50:16 +02:00
Kamil Kopryk
2a9bcdeb83
refactor: pass outImplicitArgs to patchImplicitArgs function
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-08-05 17:31:47 +02:00
Katarzyna Cencelewska
da7b03dd15
fix: to always use grfs count in calculateNumThreadsPerThreadGroup
...
grf size != grf count
Related-To: GSD-8437
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2024-03-22 11:03:18 +01:00
Katarzyna Cencelewska
dd1d52259e
refactor: add param rootDeviceEnvironment to calculateNumThreadsPerThreadGroup
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2024-03-21 22:25:14 +01:00
Mrozek, Michal
f71f6d2b72
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com >
2024-03-08 18:18:55 +01:00
Mateusz Jablonski
a697a3f718
refactor: create new members for storing spill and private memory in scratch
...
rename private scratch space into scratch space slot 1 as it can be generic
Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-01-23 12:42:25 +01:00
Fabian Zwolinski
903e581b5f
fix: add support for bindless implicit args
...
Support for:
global_base and const_base in bindless addressing mode.
Related-To: NEO-9855
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com >
2024-01-12 01:27:17 +01:00
Mateusz Jablonski
dd1b9d6abc
refactor: correct naming of enum class constants 8/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-19 08:18:18 +01:00
Dunajski, Bartosz
d99104d5bf
refactor: improve ImplicitArg struct handling
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-12-18 15:19:00 +01:00
Mateusz Jablonski
c9664e6bad
refactor: rename global debug manager to debugManager
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-11-30 13:00:59 +01:00
Mateusz Hoppe
9fd7f9cf05
fix: set ImplicitArgs size to size of defined fields
...
Resolves: NEO-8169
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-07-12 21:30:32 +02:00
Cencelewska, Katarzyna
aa0beb8191
fix: Unify logic calculating threads per work group part 4
...
- also use helper when checking that is simd1 to have same flow
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-07-07 15:34:59 +02:00
Cencelewska, Katarzyna
61f701aba5
fix: Unify logic calculating threads per work group part 3
...
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-07-04 15:27:44 +02:00
Cencelewska, Katarzyna
2e17c21728
fix: Unify logic calculating threads per work group part 2
...
- use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to
have same flow and proper values of threads per work groups
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-07-04 10:34:02 +02:00
Compute-Runtime-Validation
39740da9d1
Revert "fix: Unify logic calculating threads per work group part 2"
...
This reverts commit 1e8a53bd53 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-07-02 07:09:14 +02:00
Cencelewska, Katarzyna
1e8a53bd53
fix: Unify logic calculating threads per work group part 2
...
- use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to
have same flow and proper values of threads per work groups
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-06-30 14:16:08 +02:00
Mateusz Hoppe
313fb84fda
feature: bindless addressing mode support
...
- allow bindless kernels to execute
- bindless addressing kernels are using private heaps mode
- do not differentiate bindful and bindless surface state base addresses
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-06-19 12:41:03 +02:00
Mateusz Hoppe
8bc1fb1251
refactor: add function checking bindless addressing
...
- simplify logic to check addressing mode of a kernel
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-06-12 14:42:18 +02:00
Mateusz Hoppe
646c8985e8
refactor: store number of stateful args in KernelDescriptor
...
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-06-12 14:15:43 +02:00
Mateusz Jablonski
31f32cc16e
fix implicit args: generate local ids as for grf size 32
...
Related-To: IGC-6936
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-04-07 11:37:07 +02:00
Michal Mrozek
c77d954900
[perf] simplify setting constant buffer and improve performance
...
- no need to count parameters
- remove unrecoverable which requires fetching additional fields.
Related-To: NEO-5170
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2023-03-02 13:09:52 +01:00
Warchulski, Jaroslaw
8de3898abc
Cleanup includes 31
...
Cleaned up files:
shared/source/command_stream/experimental_command_buffer.h
shared/source/helpers/get_info.h
shared/source/helpers/local_id_gen.h
shared/source/memory_manager/gfx_partition.h
shared/source/memory_manager/host_ptr_manager.h
shared/source/memory_manager/prefetch_manager.h
shared/test/common/mocks/mock_memory_manager.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com >
2023-01-16 11:45:55 +01:00
Warchulski, Jaroslaw
3d59dce80c
Cleanup includes 27
...
Cleaned up files:
opencl/source/command_queue/command_queue.h
shared/source/built_ins/registry/built_ins_registry.h
shared/source/kernel/kernel_descriptor.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com >
2023-01-11 16:10:28 +01:00
Warchulski, Jaroslaw
803bbb89d2
Cleanup includes 13
...
Cleaned up files:
shared/source/helpers/blit_commands_helper.h
shared/source/helpers/heap_assigner.h
shared/source/memory_manager/alignment_selector.h
shared/source/memory_manager/gfx_partition.h
shared/source/memory_manager/memory_manager.h
shared/source/os_interface/os_memory.h
shared/source/utilities/heap_allocator.h
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com >
2022-12-16 12:36:27 +01:00
Kacper Nowak
4e3e55d111
Patchtokens: remove unused code
...
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com >
2022-11-10 18:56:40 +01:00
Krystian Chmielewski
69bef975f0
OCL cache per thread data
...
Add caching mechanism for local ids in OCL.
Signed-off-by: Krystian Chmielewski <krystian.chmielewski@intel.com >
2022-11-02 17:07:29 +01:00
Krystian Chmielewski
b7a780868a
Prepare OCL tests for switch to zebin
...
Signed-off-by: Krystian Chmielewski <krystian.chmielewski@intel.com >
2022-09-16 15:33:26 +02:00
Jim Snow
40fb0c2542
Clean up RTDispatchGlobals patch token parsing
...
Related-To: LOCI-3349
Signed-off-by: Jim Snow <jim.m.snow@intel.com >
2022-09-14 07:38:37 +02:00
Krystian Chmielewski
bb2967d2c6
Add handling for invalid kernel attribute
...
This commit adds handling for "invalid_kernel" kernel's attribute.
This attribute is present when kernel is invalid e.g. could not be
correctly compiled due to missing feature.
Signed-off-by: Krystian Chmielewski <krystian.chmielewski@intel.com >
2022-09-02 17:51:22 +02:00
Warchulski, Jaroslaw
aed890a219
Move files between shared/test/unit_test and /common (fixtures, helpers, mocks)
...
unit_test/fixtures/mock_aub_center_fixture.h -> common/fixtures
unit_test/helpers/raii_hw_helper.h -> common/helpers
unit_test/helpers/static_size3.h -> common/helpers
unit_test/helpers/ult_limits.h -> common/helpers
unit_test/memory_manager/mock_prefetch_manager.h -> common/memory_manager
common/mocks/mock_aub_stream.h -> unit_test/mocks
common/mocks/mock_csr_simulated_common_hw.h -> unit_test/mocks
common/mocks/mock_direct_submission_diagnostic_collector.h -> unit_test/mocks
common/mocks/mock_lrca_helper.h -> unit_test/mocks
common/mocks/mock_tbx_stream.h -> unit_test/mocks
common/mocks/linux/mock_os_context_linux.h -> unit_test/mocks/linux
common/mocks/windows/mock_wddm_direct_submission.h -> unit_test/mocks/windows
Related-To: NEO-6524
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com >
2022-08-11 12:53:19 +02:00
Warchulski, Jaroslaw
e7cca25894
CMake: don't include shared/test/unit_test when shared tests are skipped 1/n
...
Related-To: NEO-6524
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com >
2022-08-09 11:41:09 +02:00
Rafal Maziejuk
ed0c36117e
Apply heuristics when setting TG dispatch size on XE_HPC_CORE
...
The default TG dispatch size can be changed
to a better value based on number of threads in TG or
currently available amount of threads on GPU.
Decision on what TG dispatch size should be are based on
implemented heuristics.
Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com >
Related-To: NEO-6989
2022-08-08 16:43:10 +02:00
Katarzyna Cencelewska
25ce4d0c04
Clean unused code
...
remove parts with device queue
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2022-07-07 14:52:26 +02:00
Artur Harasimiuk
4bdd8860a1
test.h refactor
...
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com >
2022-07-04 18:20:07 +02:00
Kacper Nowak
de75e51228
Zebin: Add support for new buffer_address arg type
...
This change adds support for new "buffer_address" arg type, which adds
buffer address information for stateful-only argument. This will prevent
from generating stateless version of a kernel argument if all its
accesses are to be promoted to the stateful ones.
- Change default value of accessedUsingStatelessAddressingMode flag.
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com >
Related-To: NEO-7048
2022-06-08 17:33:31 +02:00
Bartosz Dunajski
9959896957
Add RT calls support to Kernel descriptor
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2022-04-22 16:33:59 +02:00
Mateusz Jablonski
82e3b10c5a
Fix typo
...
Related-To: NEO-5081
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-02-25 18:10:41 +01:00
Mateusz Jablonski
a2386ad216
Correct programming of implicit args on pre-XeHp platforms
...
On pre-XeHp platforms implicit args aren't at the beginning of indirect data,
GPU address of implicit args buffer is programmed within cross thread data
Related-To: NEO-5081, IGC-4710
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-02-24 20:52:04 +01:00
Mateusz Jablonski
8860fcc868
Handle patch token for implicit args buffer
...
Related-To: NEO-5081, IGC-4710
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-02-23 12:04:54 +01:00
Mateusz Jablonski
ea6f089e17
Unify implicit args programming across APIs
...
Related-To: NEO-5081
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2022-02-23 11:52:47 +01:00
Katarzyna Cencelewska
4994eb18c2
Remove device side enqueue info from kernel descriptor
...
Related-To: NEO-6559
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2022-02-22 14:18:44 +01:00