Commit Graph

826 Commits

Author SHA1 Message Date
Szymon Morek
2a63853349 performance: use RESOURCE_BARRIER as stalling barrier
Related-To: NEO-14943

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-16 14:12:18 +02:00
Szymon Morek
bff8f74de7 refactor: remove unused parameter
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-15 12:23:27 +02:00
Maciej Plewka
85ed1a15e4 fix: do not allow blit for depth image on arl
Related-To: NEO-14344
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-05-13 15:56:47 +02:00
Lukasz Jobczyk
df2c776aab refactor: Remove unused cmdq round robin engine assign
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-09 11:03:57 +02:00
Lukasz Jobczyk
1d1414febc refactor: remove unused dc flush mitigation
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-09 08:26:26 +02:00
Lukasz Jobczyk
62619f9525 refactor: remove unused kernel tunning
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-08 14:18:21 +02:00
Filip Hazubski
ba85f7417d fix: Add debug toggle to disable flush L3 for host usm
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-05-07 16:57:31 +02:00
Szymon Morek
4d60465a9e fix: don't flush texture cache if not required
Related-To: NEO-14645

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-16 10:38:24 +02:00
Jaroslaw Warchulski
3e1aa33924 refactor: cleanup headers
Related-To: NEO-5548
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-14 14:59:40 +02:00
Igor Venevtsev
ef52479ce4 feature: add FillImage1dBuffer built-in kernel
Resolves: NEO-13984, HSD-18041422852

Signed-off-by: Igor Venevtsev <igor.venevtsev@intel.com>
2025-04-10 14:04:13 +02:00
Szymon Morek
73ad369b10 fix: invalidate texture cache before reading from image
Related-To: NEO-9004

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-09 14:01:22 +02:00
Young Jin Yoon
0de024dbf9 refactor: separate post-sync into EncodePostSync
Created EncodePostSync template struct to organize various post-sync
variables/functions from EncodeDispatchKernel

Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-04-07 20:06:14 +02:00
Lukasz Jobczyk
8978ea5e5a performance: Do not create global fence allocation on integrated
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-07 11:22:04 +02:00
Szymon Morek
91ff22f255 fix: correctly calculate host ptr size for images
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-04 17:54:42 +02:00
Szymon Morek
95e0244f70 fix: properly pass info about 3D image
Related-To: NEO-14538

It's valid for 3D image to copy 2D region.
Current checks for mip map do not consider that.
This change correctly checks for mip mapped 3D image.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 16:33:57 +02:00
Filip Hazubski
504440fc4d feature: Add ftrHeaplessMode flag
Pass hwInfo to isHeaplessModeEnabled and isForceBindlessRequired functions.

Related-To: NEO-14526

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-04-02 21:06:05 +02:00
Szymon Morek
8836f6df0b fix: forward mip map level for 3D images
Related-To: NEO-14539

1D and 2D images have already mip map level set correctly.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-02 11:46:41 +02:00
Szymon Morek
f1b789cbb1 performance: enable staging for 3d images
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-01 16:32:41 +02:00
Szymon Morek
3010af596e performance: add infrastructure for staging with 3D images
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-01 15:30:30 +02:00
Lukasz Jobczyk
0a11a96a53 refactor: Add dedicated method to check if any ULLS light enabled
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-31 16:36:20 +02:00
Jack Myers
0aa2c4f0cb feature: allow removal of heapful code paths
Related-To: NEO-13007

Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-03-27 01:34:35 +01:00
Szymon Morek
d04ac3d60f fix: flush caches after filling image
Related-To: NEO-14454

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-25 15:00:54 +01:00
Damian Tomczak
6e05c35878 feature: force stateless for copy buffer
Related-to: NEO-6075

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-24 14:29:25 +01:00
Damian Tomczak
b5a5f35339 feature: force stateless for copy buffer rect
Related-to: NEO-6075

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-24 14:14:44 +01:00
Damian Tomczak
d04abc086b fix: adjust svm builtins types
Related-to: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-24 13:55:48 +01:00
Kamil Kopryk
73795ced64 refactor: add setupTimestampPacketFlushL3 function
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-24 13:10:03 +01:00
Lukasz Jobczyk
8a85a96ed2 feature: Add 3-level wait scheme with tpause intrinsic
Related-To: NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-21 12:12:57 +01:00
Damian Tomczak
1995a7fbcf feature: additional fixes to enable feature and adjusting builtins types
Resolves: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-20 14:23:08 +01:00
Kamil Kopryk
2e729bcb4c refactor: move isTimestampWaitSupportedForQueues to productHelper
Related-to: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-19 09:31:33 +01:00
Compute-Runtime-Validation
ce2fbb87ba Revert "feature: additional checkers to enable feature"
This reverts commit dbd7d00b27.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-18 03:12:47 +01:00
Damian Tomczak
dbd7d00b27 feature: additional checkers to enable feature
Resolves: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-17 23:32:21 +01:00
Szymon Morek
9432c7f005 fix: program texture cache flush before BCS copy
Related-To: NEO-13456

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-12 14:41:15 +01:00
Szymon Morek
dacbce7f01 performance: introduce staging read for cl_buffer
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-11 08:19:28 +01:00
Compute-Runtime-Validation
571e0f2ba3 Revert "performance: enable staging read for cl_buffer"
This reverts commit 04b6e87869.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-07 20:30:54 +01:00
Szymon Morek
04b6e87869 performance: enable staging read for cl_buffer
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-07 18:56:06 +01:00
Maciej Plewka
6263425cc9 fix: pass args to obtainCommandStream in correct order
Related-To: NEO-13987
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-02-18 20:22:01 +01:00
Filip Hazubski
4be1153253 fix: Remove pragma once from inl files
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 20:19:15 +01:00
Filip Hazubski
6b2b42972a fix: Add asserts to ensure NonCopyable and NonMovable 1/n
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-18 09:41:20 +01:00
Lukasz Jobczyk
8be2199465 performance: Disable BCS for ULLS light
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-02-17 10:12:05 +01:00
Szymon Morek
d477efeb40 fix: disable staging when cpu copy is allowed
Related-To: NEO-14027

If buffer cpu copy is allowed, it means that whole
transfer can be done on cpu memcpy.
In that case, disable staging as it's less performant.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-14 14:05:06 +01:00
Szymon Morek
be2ac667fc performance: enable staging for reads from image
Related-To: NEO-14026

Also, provide performance hint if ptr from
user does not meet restrictions.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-13 18:15:59 +01:00
Szymon Morek
8152241693 fix: don't override user data during staging read
Related-To: NEO-13574

User can read image with row pitch larger than region width.
In such cases, don't override memory which is out of region
but within row pitch.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-12 16:22:48 +01:00
Szymon Morek
b11322332c performance: enable staging write for cl buffers
Related-To: NEO-13529

Also, add size threshold on iGPU on Linux,
and disable staging if imported host ptr could
be reused

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-05 09:41:17 +01:00
Mateusz Jablonski
0a068ce96a fix: add pipe control before scratch register write
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-03 18:13:18 +01:00
Michał Pryba
75bc74089b refactor: adjust file names after pre-gen12 removal 2/3
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-02-03 15:31:51 +01:00
Szymon Morek
d7c8b42d01 Revert "performance: enable staging write for cl buffers"
This reverts commit a9a2f7577f.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-24 23:41:12 +01:00
Bartosz Dunajski
c75bcba1e6 fix: ensure profiling enabled for OCL Blit Split path
Related-To: NEO-13842

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-01-23 13:59:15 +01:00
Michał Pryba
53831f61df refactor: adjust file names after pre-gen12 removal 1/3
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-01-22 22:47:24 +01:00
Bartosz Dunajski
c2dbdb6797 refactor: move blit post sync data to BlitProperties
Related-To: NEO-13003

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-01-22 18:35:21 +01:00
Damian Tomczak
4af92c20e4 feature: enable heapless for enqueueCopyBufferToImage
Related-to: NEO-12737

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-01-22 12:06:16 +01:00