Kamil Kopryk
f0f89836e0
fix: l3 flush after post sync logic in OCL
...
Related-To: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-06-23 18:03:21 +02:00
Mateusz Jablonski
0c066b4266
refactor: remove C-style casts 1/n
...
Related-To: NEO-15174
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-06-17 17:50:56 +02:00
Szymon Morek
ac7d936942
fix: invalidate texture cache for BCS image write
...
Related-To: NEO-15029
If BCS is writing new data to an image, texture cache
should be invalidated as it might contain stale data for
that image.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-06-13 13:28:08 +02:00
Mateusz Jablonski
0a325e09be
fix: correct initialization of Kernel::isBuiltIn
...
value is used in Kernel::initialize but was set after initialize call
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-06-11 18:29:06 +02:00
Kamil Kopryk
bb52c358b9
refactor: flush Task method
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-15011
Also correct ults
2025-06-11 16:36:56 +02:00
Szymon Morek
26a83c9f2c
refactor: remove redundant specializations
...
Related-To: NEO-15029
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-06-09 19:30:30 +02:00
Mateusz Hoppe
f1f3ee13db
refactor: set argsExtended in EncodeWalkerArgs
...
Related-To: NEO-14209
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-06-03 11:45:39 +02:00
Compute-Runtime-Validation
1a50e8e7c0
Revert "performance: use RESOURCE_BARRIER as stalling barrier"
...
This reverts commit 556c0b64c6.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-05-30 18:00:34 +02:00
Szymon Morek
556c0b64c6
performance: use RESOURCE_BARRIER as stalling barrier
...
Related-To: NEO-14943
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-28 12:30:01 +02:00
Maciej Plewka
e6f3ebce5d
fix: add pc with stall before barrier with post sync on bmg
...
Related-To: NEO-14491
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-05-28 09:24:21 +02:00
Kamil Kopryk
b7681a2e7b
refactor: redirect flush L3 host usm to external by default
...
Related-To: NEO-13163
Rename ForceL3FlushAfterPostSync to EnableL3FlushAfterPostSync
Rename DisableFlushL3ForHostUsm to RedirectFlushL3HostUsmToExternal
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-05-28 08:50:01 +02:00
Lukasz Jobczyk
86e7d5b276
performance: Skip fence after PC when no post sync write
...
Related-To: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-23 10:33:28 +02:00
Mateusz Hoppe
0d57dcfdf0
feature: add L0 API querying priority levels - zeDeviceGetPriorityLevels()
...
Related-To: NEO-14572
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-05-23 09:24:16 +02:00
Kamil Kopryk
caf1781e73
fix: add L3 flush after post sync for zero copy mem objects
...
Related-To: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-05-22 10:23:26 +02:00
Mateusz Hoppe
b03f625f03
feature: limit max LWS based on preferred number of workgroups per ss
...
- limit max LWS size when SLM and barriers are not used
Related-To: GSD-11112
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-05-16 20:28:00 +02:00
Compute-Runtime-Validation
8839d62c79
Revert "performance: use RESOURCE_BARRIER as stalling barrier"
...
This reverts commit 2a63853349.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-05-16 20:19:06 +02:00
Szymon Morek
2a63853349
performance: use RESOURCE_BARRIER as stalling barrier
...
Related-To: NEO-14943
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-16 14:12:18 +02:00
Szymon Morek
bff8f74de7
refactor: remove unused parameter
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-15 12:23:27 +02:00
Maciej Plewka
85ed1a15e4
fix: do not allow blit for depth image on arl
...
Related-To: NEO-14344
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-05-13 15:56:47 +02:00
Lukasz Jobczyk
df2c776aab
refactor: Remove unused cmdq round robin engine assign
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-09 11:03:57 +02:00
Lukasz Jobczyk
1d1414febc
refactor: remove unused dc flush mitigation
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-09 08:26:26 +02:00
Lukasz Jobczyk
62619f9525
refactor: remove unused kernel tunning
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-08 14:18:21 +02:00
Filip Hazubski
ba85f7417d
fix: Add debug toggle to disable flush L3 for host usm
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-05-07 16:57:31 +02:00
Szymon Morek
4d60465a9e
fix: don't flush texture cache if not required
...
Related-To: NEO-14645
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-16 10:38:24 +02:00
Jaroslaw Warchulski
3e1aa33924
refactor: cleanup headers
...
Related-To: NEO-5548
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-14 14:59:40 +02:00
Igor Venevtsev
ef52479ce4
feature: add FillImage1dBuffer built-in kernel
...
Resolves: NEO-13984, HSD-18041422852
Signed-off-by: Igor Venevtsev <igor.venevtsev@intel.com>
2025-04-10 14:04:13 +02:00
Szymon Morek
73ad369b10
fix: invalidate texture cache before reading from image
...
Related-To: NEO-9004
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-09 14:01:22 +02:00
Young Jin Yoon
0de024dbf9
refactor: separate post-sync into EncodePostSync
...
Created EncodePostSync template struct to organize various post-sync
variables/functions from EncodeDispatchKernel
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-04-07 20:06:14 +02:00
Lukasz Jobczyk
8978ea5e5a
performance: Do not create global fence allocation on integrated
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-07 11:22:04 +02:00
Szymon Morek
91ff22f255
fix: correctly calculate host ptr size for images
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-04 17:54:42 +02:00
Szymon Morek
95e0244f70
fix: properly pass info about 3D image
...
Related-To: NEO-14538
It's valid for 3D image to copy 2D region.
Current checks for mip map do not consider that.
This change correctly checks for mip mapped 3D image.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 16:33:57 +02:00
Filip Hazubski
504440fc4d
feature: Add ftrHeaplessMode flag
...
Pass hwInfo to isHeaplessModeEnabled and isForceBindlessRequired functions.
Related-To: NEO-14526
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-04-02 21:06:05 +02:00
Szymon Morek
8836f6df0b
fix: forward mip map level for 3D images
...
Related-To: NEO-14539
1D and 2D images have already mip map level set correctly.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-02 11:46:41 +02:00
Szymon Morek
f1b789cbb1
performance: enable staging for 3d images
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-01 16:32:41 +02:00
Szymon Morek
3010af596e
performance: add infrastructure for staging with 3D images
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-01 15:30:30 +02:00
Lukasz Jobczyk
0a11a96a53
refactor: Add dedicated method to check if any ULLS light enabled
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-31 16:36:20 +02:00
Jack Myers
0aa2c4f0cb
feature: allow removal of heapful code paths
...
Related-To: NEO-13007
Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-03-27 01:34:35 +01:00
Szymon Morek
d04ac3d60f
fix: flush caches after filling image
...
Related-To: NEO-14454
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-25 15:00:54 +01:00
Damian Tomczak
6e05c35878
feature: force stateless for copy buffer
...
Related-to: NEO-6075
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-24 14:29:25 +01:00
Damian Tomczak
b5a5f35339
feature: force stateless for copy buffer rect
...
Related-to: NEO-6075
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-24 14:14:44 +01:00
Damian Tomczak
d04abc086b
fix: adjust svm builtins types
...
Related-to: NEO-13973
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-24 13:55:48 +01:00
Kamil Kopryk
73795ced64
refactor: add setupTimestampPacketFlushL3 function
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-24 13:10:03 +01:00
Lukasz Jobczyk
8a85a96ed2
feature: Add 3-level wait scheme with tpause intrinsic
...
Related-To: NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-21 12:12:57 +01:00
Damian Tomczak
1995a7fbcf
feature: additional fixes to enable feature and adjusting builtins types
...
Resolves: NEO-13973
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-20 14:23:08 +01:00
Kamil Kopryk
2e729bcb4c
refactor: move isTimestampWaitSupportedForQueues to productHelper
...
Related-to: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-19 09:31:33 +01:00
Compute-Runtime-Validation
ce2fbb87ba
Revert "feature: additional checkers to enable feature"
...
This reverts commit dbd7d00b27.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-18 03:12:47 +01:00
Damian Tomczak
dbd7d00b27
feature: additional checkers to enable feature
...
Resolves: NEO-13973
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-17 23:32:21 +01:00
Szymon Morek
9432c7f005
fix: program texture cache flush before BCS copy
...
Related-To: NEO-13456
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-12 14:41:15 +01:00
Szymon Morek
dacbce7f01
performance: introduce staging read for cl_buffer
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-11 08:19:28 +01:00
Compute-Runtime-Validation
571e0f2ba3
Revert "performance: enable staging read for cl_buffer"
...
This reverts commit 04b6e87869.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-07 20:30:54 +01:00