Mateusz Jablonski
a59ca3d5e8
build: add missing includes
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-10-24 08:41:42 +02:00
Mateusz Jablonski
3cd135e165
refactor: reduce including gmmlib headers
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-10-21 19:06:38 +02:00
Mateusz Jablonski
bc71b2f685
refactor: reduce usage of gmmlib headers
...
use own ImagePlane wrapper
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-10-21 17:03:25 +02:00
Compute-Runtime-Validation
2eb8928ec5
Revert "performance: increase heap size to 4MB"
...
This reverts commit f41bb3517a .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-10-10 22:23:23 +02:00
Jack Myers
f06bb256c7
refactor: sba type helper
...
Signed-off-by: Jack Myers <jack.myers@intel.com >
2025-10-10 11:36:36 +02:00
Szymon Morek
f41bb3517a
performance: increase heap size to 4MB
...
Related-To: NEO-16348
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-10-09 13:03:53 +02:00
Radoslaw Jablonski
54de14a9dc
refactor: apply clang-format's InsertBraces rule
...
Formats code before InsertBraces rule is enabled.
Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com >
2025-10-06 15:32:46 +02:00
Bartosz Dunajski
42371ee7bd
fix: decanonize prefetch gpu va
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-10-06 12:54:42 +02:00
Marcel Skierkowski
047f6851f7
refactor: correct programming of SLM size
...
Unify reading hw avaialble slm size:
Pre-Xe2 SLMSizeInKb stores total SLM size across all DSS
Xe2+ SLMSizeInKb stores SLM size per SS
apply restrictions for preferred/programmable SLM size
Related-To: NEO-12949
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com >
2025-09-30 17:57:18 +02:00
Kamil Kopryk
f3ba701a9b
refactor: host function data programming
...
Related-To: NEO-14577
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-09-23 11:27:35 +02:00
Jaroslaw Warchulski
195bf66a49
refactor: fix typos
...
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2025-09-22 12:46:41 +02:00
Lukasz Jobczyk
aa74555682
refactor: remove unused variable
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-09-11 12:01:32 +02:00
Zbigniew Zdanowicz
d8557ff2d3
feature: add scratch address programing to patch preamble
...
Related-To: NEO-15376
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-08-19 00:41:01 +02:00
Damian Tomczak
4b663e7b60
fix: use heapless helpers when heapless enabled
...
Related-to: NEO-14489
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2025-08-18 14:36:58 +02:00
Tomasz Biernacik
28ca00fa1b
feature: adjust max threads per EU count
...
Related-To: NEO-14998
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com >
2025-08-06 09:02:54 +02:00
Zbigniew Zdanowicz
049bb10ce0
fix: correct encoding when destination gpu address is qword misaligned
...
Related-To: NEO-15376
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-08-01 12:08:55 +02:00
Zbigniew Zdanowicz
b57d0b2544
refactor: add method to arb check encoder to program buffer pointer directly
...
Related-To: NEO-15376
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-31 16:01:00 +02:00
Zbigniew Zdanowicz
08c34968f8
feature: add method to get command buffer gpu address of ending command
...
Related-To: NEO-15376
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-30 15:17:08 +02:00
Alicja Lukaszewicz
3f7887dc6a
refactor: add function for setting additional cache
...
Related-To: NEO-15072, HSD-14024701488
Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com >
2025-07-30 14:45:08 +02:00
Zbigniew Zdanowicz
b5646b45e9
refactor: change interface to pass command buffer pointer as reference
...
Related-To: NEO-15376
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-30 11:25:53 +02:00
Zbigniew Zdanowicz
1fc0826394
feature: add encoder to program hw commands in data buffers on gpu
...
Related-To: NEO-15376
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-29 20:27:10 +02:00
Zbigniew Zdanowicz
f51ef2beae
feature: add encoder to dispatch commands programming data buffers on gpu
...
Related-To: NEO-15376
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-29 12:18:31 +02:00
Mateusz Hoppe
be483491bf
feature: extract indirect access buffer major version from ELF's notes
...
- use version from ELF to create correct layout of implcit args
Related-To: NEO-15574
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-07-25 14:50:49 +02:00
Mateusz Hoppe
b17ed79618
fix: update implicitArgs versions
...
- fix layout of implicit args
- add enqueued local size and sync buffer ptr
Related-To: NEO-15160
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-07-22 15:12:55 +02:00
Szymon Morek
86b5660c23
performance: introduce staging copy in L0
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-07-21 08:50:43 +02:00
Zbigniew Zdanowicz
71659807c4
refactor: add new parameter to use dedicated memory to initialize surface state
...
Related-To: NEO-15374
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2025-07-19 03:10:51 +02:00
Maciej Bielski
35ecde70df
refactor: extract parts of KernelImp data for easier capturing
...
`KernelImp` is virtual class so capturing the current kernel state
cannot be done by a simple copy-instance of the class. However, this can
be done by extracting the relevant data to a separate class and
capturing a copy of its instance.
Related-To: NEO-15374
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2025-07-17 23:00:22 +02:00
Szymon Morek
0db5ce22a1
performance: use resource_barrier on Xe2 and PTL
...
Related-To: NEO-14943
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-06-24 14:24:47 +02:00
Damian Tomczak
5cd1423bab
feature: packed surface format
...
Resolves: NEO-13669
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2025-06-12 17:32:16 +02:00
Bartosz Dunajski
a015188166
refactor: unify prefetch encode methods
...
Related-To: NEO-14703
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-06-03 10:07:26 +02:00
Compute-Runtime-Validation
1a50e8e7c0
Revert "performance: use RESOURCE_BARRIER as stalling barrier"
...
This reverts commit 556c0b64c6 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-30 18:00:34 +02:00
Grochowski, Stanislaw
2d9108f8b6
refactor: change DefaultComputeWalker
...
Related-To: NEO-14537
Signed-off-by: Grochowski, Stanislaw <stanislaw.grochowski@intel.com >
2025-05-29 18:05:29 +02:00
Szymon Morek
556c0b64c6
performance: use RESOURCE_BARRIER as stalling barrier
...
Related-To: NEO-14943
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-05-28 12:30:01 +02:00
Maciej Plewka
e6f3ebce5d
fix: add pc with stall before barrier with post sync on bmg
...
Related-To: NEO-14491
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2025-05-28 09:24:21 +02:00
Lukasz Jobczyk
86e7d5b276
performance: Skip fence after PC when no post sync write
...
Related-To: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-23 10:33:28 +02:00
Mateusz Hoppe
8e5b29f55e
feature: optimize local ids generation
...
- only emit local ids for required dimensions
Related-To: NEO-15007
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-05-23 09:59:31 +02:00
Vysochyn, Illia
f99a4c2193
feature: Define thread group dispatch size according to kernel metadata
...
Related-To: NEO-10945
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com >
2025-05-19 16:02:21 +02:00
Compute-Runtime-Validation
8839d62c79
Revert "performance: use RESOURCE_BARRIER as stalling barrier"
...
This reverts commit 2a63853349 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-16 20:19:06 +02:00
Szymon Morek
2a63853349
performance: use RESOURCE_BARRIER as stalling barrier
...
Related-To: NEO-14943
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-05-16 14:12:18 +02:00
Szymon Morek
bff8f74de7
refactor: remove unused parameter
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-05-15 12:23:27 +02:00
Young Jin Yoon
5221b5b00e
fix: patch counter values with additional blit properties
...
Modified BlitCommandsHelper and CommandListHw to patch counter values
when using additional blit properties and in order command list is
enabled.
Related-To: NEO-13003
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2025-05-13 04:32:31 +02:00
Lukasz Jobczyk
1d1414febc
refactor: remove unused dc flush mitigation
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-05-09 08:26:26 +02:00
Young Jin Yoon
415fe245d7
feature: apply or set additional blit properties
...
Modified appendMemoryCopyBlitRegion, appendCopyImageBlit,
appendMemoryCopyRegion, and appendBlitFill in
CommmandListCoreFamily to change behavior when
useAdditionaBlitProperties is used.
Modified dispatchBlitMemoryFill and
dispatchBlitCommandForImageRegion in BlitCommandHelper to
call applyAdditionalProperties when useAdditionalProperties
is enabled.
Related-To: NEO-13003
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com >
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2025-05-09 00:04:57 +02:00
Naklicki, Mateusz
a0e7ef1ce1
fix: explicitly skip programming of unused scratch data ptr
...
If scratch data is unused, explicitly skip programming.
Related-To: NEO-14130
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com >
2025-05-08 16:56:42 +02:00
Young Jin Yoon
f844dbc88f
feature: Add timestamp node to encode post sync args
...
Added a timestamp node to EncodePostSyncArgs to use
additional timestamp information if required,
Modified event object to use the timestamp added in
EncodePostSyncArgs.
Related-To: NEO-13003
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com >
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2025-05-02 20:55:15 +02:00
Young Jin Yoon
55e7b6a209
feature: support copy function post-sync [3/n]
...
Refactored EncodePostSync to use more generic terms by renaming
variables.
Added some helper functions in EncodePostSync.
Related-To: NEO-13003
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2025-05-01 19:20:01 +02:00
Compute-Runtime-Validation
0c3b765942
Revert "refactor: add BlitSyncPropertiesExt to BlitSyncProperties"
...
This reverts commit b5a259aded .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-05-01 03:23:53 +02:00
Young Jin Yoon
b5a259aded
refactor: add BlitSyncPropertiesExt to BlitSyncProperties
...
Added BlitSyncPropertiesExt to provide additional information for
different platform and/or blitter commands.
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com >
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2025-04-30 19:11:34 +02:00
Mateusz Jablonski
523e53c862
fix: passing hw generation of local ids for implicit args patching
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-04-29 16:28:23 +02:00
Young Jin Yoon
05c6612386
refactor: add EncodePostSyncArgs to EncodeDispatchKernelArgs
...
Refactored various member variables in EncodeDispatchKernel to directly
include EncodePostSync
Changed command encoder and command list to use the modified
EncodeDispatchKernel.
Related-To: NEO-13003
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2025-04-17 20:55:40 +02:00