Commit Graph

670 Commits

Author SHA1 Message Date
Mateusz Jablonski
1c377dc930 fix: ensure payload arguments are patched before walker command is fetched
In case of indirect kernel launch some payload arguments are patched
just before walker command, this change disables prefetch, performs
batch buffer start to next bytes and then re-enable prefetch. All these
operations are performed between MI_STORE_REGISTER_MEM and COMPUTE_WALKER

Related-To: NEO-14584
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-08 11:57:45 +02:00
Brandon Yates
c4937991cd feature: Add l0DebuggerEnabled to WalkerArgs
Related-to: NEO-12993

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-04-08 03:53:38 +02:00
Young Jin Yoon
0de024dbf9 refactor: separate post-sync into EncodePostSync
Created EncodePostSync template struct to organize various post-sync
variables/functions from EncodeDispatchKernel

Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-04-07 20:06:14 +02:00
Lukasz Jobczyk
8978ea5e5a performance: Do not create global fence allocation on integrated
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-07 11:22:04 +02:00
Compute-Runtime-Validation
f332571d96 Revert "performance: Do not create global fence allocation on integrated"
This reverts commit ecf8a07d26.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-04 16:26:19 +02:00
Lukasz Jobczyk
ecf8a07d26 performance: Do not create global fence allocation on integrated
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-04 11:45:22 +02:00
Mateusz Jablonski
bb518adf34 fix: patching payload arguments in inline data in case of indirect kernel
Related-To: NEO-14532
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-03 17:21:28 +02:00
Filip Hazubski
504440fc4d feature: Add ftrHeaplessMode flag
Pass hwInfo to isHeaplessModeEnabled and isForceBindlessRequired functions.

Related-To: NEO-14526

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-04-02 21:06:05 +02:00
Jack Myers
0aa2c4f0cb feature: allow removal of heapful code paths
Related-To: NEO-13007

Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-03-27 01:34:35 +01:00
Bartosz Dunajski
6e17df4fb3 refactor: improve EncodeAluHelper
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-19 15:44:34 +01:00
Kamil Kopryk
01edd58be4 refactor: add additional check in addFlushRequiredCommand
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-14 13:54:20 +01:00
Zbigniew Zdanowicz
f8be8414ac feature: add support for wait event preamble in append command list
- add mechanism in queue to trigger start command from queue to regular
- add detection in immediate command list need to dispatch extra start in queue
- fix secondary linear stream in immediate case as it should not use container
- modify tests for primary batch buffer dispatch as default mode
- remove invalid or obsolete tests

Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-07 20:31:09 +01:00
Kamil Kopryk
4c795027e3 refactor: add check if event L3 flush is needed
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-05 18:25:29 +01:00
Kamil Kopryk
4503972faf refactor: add adjustSamplerStateBorderColor function
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-02-27 14:05:09 +01:00
Vysochyn, Illia
1eafbc30f8 feature: Enable eu thread scheduling mode override via IDD
Enables eu thread scheduling mode override via the
INTERFACE_DESCRIPTOR_DATA and COMPUTE_WALKER.

Defines encodeEuSchedulingPolicy operating on INTERFACE_DESCRIPTOR_DATA
or Xe3 platform.

Adds tests verifying that thread scheduling mode can be overridden via
COMPUTE_WALKER.

Related-To: NEO-13771, HSD-18041256338, HSD-14013056398

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-02-26 17:04:43 +01:00
Mateusz Hoppe
291745cdf7 feature: add support for ImplicitArgs versioning
- define ImplicitArgs version 1

Related-To: NEO-14115

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-02-25 10:27:11 +01:00
Filip Hazubski
794b21a3fa refactor: Add extra parameters to computeSlmValues function
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-20 15:35:52 +01:00
Filip Hazubski
4be1153253 fix: Remove pragma once from inl files
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 20:19:15 +01:00
Filip Hazubski
4c7900008f refactor: Change wording from NonCopyableOrMovable to NonCopyableAndNonMovable
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-17 14:19:10 +01:00
Kamil Kopryk
899abf3aac feature: add NonCopyableOrMovable and NonCopyable concepts
Inheriting from NonCopyableOrMovableClass or NonCopyableClass
does not prevent derived classes from accidentally defining
their own copy or move constructors, violating the intended design.

This commit introduces two concepts:
- NonCopyableOrMovable
- NonCopyable

Using static asserts with these concepts will guarantee the intended
copy and move limitations.

Related-To: NEO-14068
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-02-14 03:55:50 +01:00
Compute-Runtime-Validation
b3807e7a36 Revert "feature: Append recorded commandlist into immediate (8/N)"
This reverts commit 9db77efd03.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-13 16:35:40 +01:00
Aravind Gopalakrishnan
9db77efd03 feature: Append recorded commandlist into immediate (8/N)
- Add primary dispatch capability for immediate command list
- turn off usage of parent linear stream for copy engine

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-02-12 12:39:38 +01:00
Bartosz Dunajski
18d7a22861 refactor: pass increment address and value to dispatch params
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-02-10 14:04:34 +01:00
Maciej Plewka
5ef1ff574b fix: Don't check aux capable in case of MSAA surface on xe2 and later
Related-To: NEO-13290
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-02-07 16:47:33 +01:00
Mateusz Hoppe
1877e66bc8 feature: add mechanism to extend cmdlist launch params
Related-To: NEO-13954

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-02-06 22:05:15 +01:00
Michał Pryba
3c027cfedc refactor: adjust file names after pre-gen12 removal 4/n
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-02-05 15:42:32 +01:00
Kamil Kopryk
ef896cc799 refactor: introduce ImageSurfaceState helper class
Moved global functions to the ImageSurfaceStateHelper class,
with declarations in the header file and definitions in the base .inl
file.
This change reduces compilation time by:
- removing unnecessary includes from the header file
- adding explicit template instantiations, which are faster than
implicit template instantiations.

Additionally, the image_skl_and_later.inl file has been removed as it
is no longer needed, and its implementation has been moved to the base .inl

Related-To: NEO-12149

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-01-30 19:20:31 +01:00
Chodor, Jaroslaw
574fe9fb29 refactor: Pass whole kernel descriptor to barrier programming
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-01-27 01:26:49 +01:00
Lukasz Jobczyk
c0838e1f76 fix: Apply dispatch all for small TG only on BMG
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-01-22 13:04:44 +01:00
Kamil Kopryk
47fc1ce34b refactor: use designated initializer for EncodeWalkerArgs
also make EncodeWalkerArgs an aggregate

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-01-22 11:44:57 +01:00
Zbigniew Zdanowicz
1e89058232 refactor: change data fields for encoder of additional walker fields
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-01-21 16:49:38 +01:00
Zbigniew Zdanowicz
5d25ae4539 refactor: rename walker partition files
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-01-20 13:33:05 +01:00
Zbigniew Zdanowicz
897c890d03 refactor: add thread group count parameter to implicit scaling functions
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-01-20 13:12:45 +01:00
Lukasz Jobczyk
af0bb7964a performance: Set dispatch all for small TG
Resolves: NEO-11814

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-01-20 08:49:59 +01:00
Zbigniew Zdanowicz
446f9c1964 refactor: change interfaces of walker region encoder
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-01-17 19:18:24 +01:00
Vysochyn, Illia
4270e3c356 feature: Add dynamic stack management on Xe3
Related-To: NEO-13147

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-01-15 10:12:48 +01:00
Compute-Runtime-Validation
58707e0cf7 Revert "refactor: migration to std::make_unique C++17"
This reverts commit 4cf685cb7d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-09 01:41:36 +01:00
Semenov Herman (Семенов Герман)
4cf685cb7d refactor: migration to std::make_unique C++17
Signed-off-by: Semenov Herman (Семенов Герман) <GermanAizek@yandex.ru>
2025-01-08 17:20:00 +01:00
Mateusz Jablonski
bb1a125f0c feature: add support for Panther Lake platform
Related-To: NEO-12803

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-01-07 11:39:56 +01:00
Filip Hazubski
3997d2a6b3 Revert "feature: Append recorded commandlist into immediate (8/N)"
This reverts commit acfeffe274.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-01-03 15:50:52 +01:00
Vysochyn, Illia
f198507875 refactor: Remove 3DSTATE_BTD_BODY structure
Removes 3DSTATE_BTD_BODY as redundant structure.

Related-To: NEO-13147

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2024-12-31 16:27:29 +01:00
Vysochyn, Illia
c05ac6ff70 refactor: Rename enumerators to common naming
Renames SHARED_LOCAL_MEMORY_SIZE_ENCODES and PREFERRED_SLM_ALLOCATION_SIZE
enumerators' members to follow the specification.

Related-To: NEO-13147

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2024-12-24 12:10:26 +01:00
Aravind Gopalakrishnan
acfeffe274 feature: Append recorded commandlist into immediate (8/N)
Add primary dispatch capability for immediate command list

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-12-20 18:52:33 +01:00
Compute-Runtime-Validation
f35d620e06 Revert "feature: Append recorded commandlist into immediate (8/N)"
This reverts commit d8d7419aee.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-20 03:35:27 +01:00
Aravind Gopalakrishnan
d8d7419aee feature: Append recorded commandlist into immediate (8/N)
Add primary dispatch capability for immediate command list

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-12-19 19:02:14 +01:00
Kamil Kopryk
3be52fa1d0 refactor: Add ult with check if bindless ss is patched correctly
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-12-18 09:30:13 +01:00
Damian Tomczak
98331e7d63 feature: is48bResourceNeededForRayTracing specialization for rt encoder
Related-to: NEO-10074

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-12-12 13:29:07 +01:00
Vysochyn, Illia
c40f015249 refactor: Modify DISPATCH_WALKER naming
Modifies dispatch walker enumerator naming to be unified among compute
walkers.

Related-To: NEO-13147

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2024-11-29 14:00:47 +01:00
Bartosz Dunajski
5e1fa75676 refactor: adjust code to compile with c++20
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-11-29 10:27:29 +01:00
Vysochyn, Illia
afd22999cc refactor: Adjust RENDER_SURFACE_STATE structures naming
Performs minor renaming (mostly capitalization) in order to align with
specification.

Renames L1_CACHE_POLICY to L1_CACHE_CONTROL.

Related-To: NEO-13147

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2024-11-29 09:43:11 +01:00