Commit Graph

6538 Commits

Author SHA1 Message Date
Kamil Kopryk
b8504913e3 fix: allow for image array type if array size is 1 for Xe2 and later
This commit enabled the use of the array image type with array size 1
from Xe2.
Additinally, it removes two incorrect unit tests, as array size of 0 is not
correct with the OpenCL specification.

Related-To: NEO-13976
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-02-05 13:50:05 +01:00
Szymon Morek
b11322332c performance: enable staging write for cl buffers
Related-To: NEO-13529

Also, add size threshold on iGPU on Linux,
and disable staging if imported host ptr could
be reused

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-05 09:41:17 +01:00
Andrzej Koska
efcbd3875b fix: Fix copying with blitter
Related-To: NEO-12134, NEO-13874

Thanks to this change we avoid programming y1 offset for 2D surface
above the maximum allowable value, i.e. 16 kb for blitter for BMG

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2025-02-04 19:07:50 +01:00
Michał Pryba
83daa48bb3 refactor: adjust file names after pre-gen12 removal 3/n
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-02-04 16:59:31 +01:00
Maciej Bielski
971b7c27a2 fix: enable usm compression on linux
Related-To: NEO-12056
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-02-04 13:09:04 +01:00
Slawomir Milczarek
733855e9d9 refactor: Add const suffix
Related-To: NEO-12952

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2025-02-04 12:35:21 +01:00
Chodor, Jaroslaw
a65d4da88b refactor: prevent copy/move on KernelDesc and ZebinExecEnv
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-02-04 11:30:13 +01:00
Mateusz Hoppe
6e35d055f2 feature: make contextGroupSize dependent on number of processes
Related-To: NEO-12952

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-02-03 18:13:33 +01:00
Michał Pryba
75bc74089b refactor: adjust file names after pre-gen12 removal 2/3
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-02-03 15:31:51 +01:00
Bartosz Dunajski
dd1e85a3d4 fix: bcs mem fill pattern alignment
Related-To: NEO-13928

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-02-03 14:43:04 +01:00
Bellekallu Rajkiran
3bef9f886a fix: Crash on over memory allocation
- Add defer backing flag to gem create ioctl
- Make memory resident before lock

Related-To: NEO-13403

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2025-02-03 14:29:42 +01:00
Mateusz Jablonski
a924761ed1 fix: correct flags for creating allocation with existing system memory
on Windows setup ReadOnly flag, it makes there is no ProbeForWrite call within
CreateAllocation2 function call

it doesn't force read-only access from GPU perspective, that access it
defined in MapGpuVa function

Related-To: NEO-12986, HSD-15016952776
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-03 11:24:55 +01:00
Maciej Plewka
aace15d6bc fix: remove msaa w/a for xe3
Related-To: NEO-13290, NEO-12587
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-02-03 10:38:01 +01:00
shubham kumar
f2b48b97d2 refactor: Prework to get sampling unit through ioctl query on xe
Related-To: NEO-13777

Signed-off-by: shubham kumar <shubham.kumar@intel.com>
2025-02-03 10:37:46 +01:00
Compute-Runtime-Validation
99d4e6d054 Revert "performance: don't wait immediately after VmBind"
This reverts commit 09118d2011.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-02 18:00:28 +01:00
Fabian Zwoliński
7e9c358a73 fix: apply 2MB alignment to large local memory image allocations
In this patch, we align up the allocation size to 2MB for all
image allocations >= 2MB located in local memory.
2MB alignment support is defined by function:
`is2MBLocalMemAlignmentEnabled`

Related-To: NEO-12287

Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-01-31 14:58:24 +01:00
Francois Dugast
f91cde2a5b fix: invalid includes and missing rdtsc() on ARM
Signed-off-by: Francois Dugast <francois.dugast@intel.com>
2025-01-31 13:05:06 +01:00
Compute-Runtime-Validation
3556d9ded6 Revert "fix: set ReadOnly flag when creating allocation with existing system ...
This reverts commit bddb24124d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-31 12:15:12 +01:00
Compute-Runtime-Validation
d23249b061 Revert "fix: Set vmbind user fence when makeMemoryResident"
This reverts commit 80dc4fb43a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-31 11:36:29 +01:00
Brandon Yates
635f69e54a fix: Configure scratch pages for debugger
DG2 requires scratch pages on for debugger. Other platforms do not.

Related-to: NEO-13883

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-01-31 06:49:49 +01:00
Brandon Yates
106e8be9a9 fix: Don't abort application due to gpu fault when debugging is enabled
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-01-30 23:37:50 +01:00
Filip Hazubski
c306c457db fix: Add BMG device ids
Add following device ids: 0xE210, 0xE215, 0xE216

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-01-30 22:56:38 +01:00
Fabian Zwoliński
7918b44a94 fix: apply 2MB alignment to large local memory allocations
In this patch, we align up the allocation size to 2MB for all
allocations >= 2MB located in local memory.
2MB alignment support is defined by function:
`is2MBLocalMemAlignmentEnabled`

Related-To: NEO-12287

Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-01-30 22:09:39 +01:00
Bartosz Dunajski
4846fbe410 refactor: unify blit memory fill methods and pass BlitProperties 2
Related-To: NEO-13003

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-01-30 20:16:10 +01:00
Mateusz Jablonski
9ebb2f1227 refactor: remove not needed code
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-01-30 20:07:05 +01:00
Mateusz Jablonski
bddb24124d fix: set ReadOnly flag when creating allocation with existing system memory
setting ReadOnly flag makes there is no ProbeForWrite call within
CreateAllocation2 function call

it doesn't make read-only access from GPU perspective, that access it
defined in MapGpuVa function

Related-To: NEO-12986, HSD-15016952776
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-01-30 19:43:44 +01:00
Kamil Kopryk
ef896cc799 refactor: introduce ImageSurfaceState helper class
Moved global functions to the ImageSurfaceStateHelper class,
with declarations in the header file and definitions in the base .inl
file.
This change reduces compilation time by:
- removing unnecessary includes from the header file
- adding explicit template instantiations, which are faster than
implicit template instantiations.

Additionally, the image_skl_and_later.inl file has been removed as it
is no longer needed, and its implementation has been moved to the base .inl

Related-To: NEO-12149

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-01-30 19:20:31 +01:00
Szymon Morek
09118d2011 performance: don't wait immediately after VmBind
Related-To: NEO-13456

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-30 18:52:22 +01:00
Maciej Plewka
d37943da29 fix: disable xe2 compression only on needed platforms
Related-To: NEO-13381, NEO-13382
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-01-30 16:06:09 +01:00
Radoslaw Jablonski
5f346b57b0 performance: enable Direct Submission on PTL Linux
Related-To: NEO-12982

Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com>
2025-01-30 15:35:19 +01:00
Lukasz Jobczyk
09e23804dc refactor: Use timestamp types for events and in order nodes
Switching tagAllocation into UC causes regressions in non event
scenarios. It is no longer used as dependant type for semaphores.
Restoring previous GMM_USAGE settings for tag.
Marking events and in order nodes using only timestamp types as they
have proper GMM_USAGE settings already and can be both in smem and lmem.

Resolves: NEO-13847

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-01-30 12:20:38 +01:00
Compute-Runtime-Validation
768e15bab2 Revert "performance: reduce ULLS controller timeout to 1ms"
This reverts commit 0db7128503.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-30 03:41:58 +01:00
Grzegorz Choinski
c63ac8a049 fix: remove potential string's internal buffer overlaping if resized
based on: 92ea7ee7a3
fixes Release compilation for gcc12

Related-To: NEO-10767
Signed-off-by: Grzegorz Choinski <grzegorz.choinski@intel.com>
Co-authored-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-01-29 19:32:58 +01:00
Radoslaw Jablonski
6ead86f0e1 performance: enable PreferredAllocationMethod by KMD on PTL
Related-To: NEO-10526

Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com>
2025-01-29 14:11:21 +01:00
Compute-Runtime-Validation
21021f0340 Revert "refactor: unify blit memory fill methods and pass BlitProperties"
This reverts commit d7ca6568fc.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-29 02:09:59 +01:00
Compute-Runtime-Validation
7cbe19c5a9 Revert "fix: Dont enable scratch pages when debugger is on"
This reverts commit 18d04b239e.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-29 01:30:59 +01:00
Maciej Bielski
a8779c2387 fix: report ZE_MEMORY_ACCESS_CAP_FLAG_CONCURRENT correctly
At the moment the capability is returned only based on the value
returned by the `productHelper`, which is too liberal. The capability
must also consider the support reported by `memoryManager`. Only then
the support reported is aligned with actual logic of handling
USM-allocations.

Related-To: NEO-10040
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-01-29 00:17:38 +01:00
Andrzej Koska
1abb48c3e0 Revert "feature: Enable Tile64 Optimization Flag"
This reverts commit 105a586615.

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2025-01-28 22:32:58 +01:00
Szymon Morek
0db7128503 performance: reduce ULLS controller timeout to 1ms
Currently scope limited to Windows only

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-01-28 22:27:09 +01:00
Chandio, Bibrak Qamar
80dc4fb43a fix: Set vmbind user fence when makeMemoryResident
Related-To: NEO-11977, GSD-10293

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-01-28 22:04:37 +01:00
Bartosz Dunajski
d7ca6568fc refactor: unify blit memory fill methods and pass BlitProperties
Related-To: NEO-13003

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-01-28 17:47:31 +01:00
Lukasz Jobczyk
7eac78d253 refactor: Make semaphore dependant resources UC
Change for tagBuffer type on Windows. Timestamp types are already UC on all DC
flush platforms regardless of OS.

Resolves: NEO-13847

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-01-28 14:51:28 +01:00
Compute-Runtime-Validation
ad99c8d818 Revert "feature: Submission for GlobalTimestamp as default"
This reverts commit 3d3c5ddf9f.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-28 11:22:20 +01:00
Lukasz Jobczyk
82569f6a0f refactor: Switch host events and in order nodes to tagAllocation type
Motivation is to distinguish between host USM resources and internal
driver resources allocated in host memory.

Related-To: NEO-13847

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-01-28 09:42:19 +01:00
Brandon Yates
18d04b239e fix: Dont enable scratch pages when debugger is on
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-01-27 22:05:34 +01:00
Bartosz Dunajski
ce58cb0784 refactor: unify blit mem set functions
Related-To: NEO-13003

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-01-27 18:34:48 +01:00
Chodor, Jaroslaw
beedf709bc feature: LookupArray findGreaterEqual
Adding helpers for find first greater equal
value in lookup array. Useful when finding
minimum SLM encoding that satisfies requirements.

Related-To: NEO-12747

Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-01-27 13:02:02 +01:00
Compute-Runtime-Validation
676334807f Revert "refactor: Switch host events and in order nodes to tagAllocation type"
This reverts commit 8bbba3fe5c.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-01-27 06:44:12 +01:00
Chodor, Jaroslaw
574fe9fb29 refactor: Pass whole kernel descriptor to barrier programming
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-01-27 01:26:49 +01:00
Chodor, Jaroslaw
0938a0eaf9 feature: Adding support for extending KernerlDescriptor
Related-To: NEO-12747

Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-01-25 12:08:27 +01:00