Commit Graph

5624 Commits

Author SHA1 Message Date
Mateusz Jablonski
27d7b633e4 build: use drm-next headers instead of upstream kernel headers
pick drm-next uapi headers as of b60301774a8fe6c30b14a95104ec099290a2e904

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-17 17:13:18 +02:00
Compute-Runtime-Validation
b96cf55985 Revert "fix: Return max wg count equal one on platforms without dispatch all ...
This reverts commit b8be602bfb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-17 12:10:11 +02:00
Chodor, Jaroslaw
da23da1b0f test: Additional checks in GivenFclRedirection ocloc tests
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-17 10:30:40 +02:00
Lukasz Jobczyk
d9a40734ae performance: Remove global fence from CW post sync on BMG
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-17 10:23:40 +02:00
Grzegorz Wawiorko
213e34135c fix: Update CL_DEVICE_LATEST_CONFORMANCE_VERSION_PASSED value
Signed-off-by: Grzegorz Wawiorko <grzegorz.wawiorko@intel.com>
2025-04-16 12:59:01 +02:00
Szymon Morek
4d60465a9e fix: don't flush texture cache if not required
Related-To: NEO-14645

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-16 10:38:24 +02:00
Chodor, Jaroslaw
2ce694c52b feature: Enabling support for redirecting FCL to IGC for OCLOC
This allows for invoking IGC as OCL C translator in OCLOC paths

Accepted debug env :
NEO_OCLOC_UseIgcAsFcl=1 ; forces IGC
NEO_OCLOC_UseIgcAsFcl=2 ; forces FCL
NEO_OCLOC_UseIgcAsFcl=0 ; use defaults

Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-15 16:44:35 +02:00
Maciej Plewka
b8be602bfb fix: Return max wg count equal one on platforms without dispatch all support
Related-To: NEO-14125, GSD-10725
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-04-15 15:41:23 +02:00
Lukasz Jobczyk
f7939735da refactor: Adjust USM cleaner to ULLS light
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-15 14:05:02 +02:00
Mateusz Jablonski
21f6bb8889 build: update opencl headers to v2024.10.24
https://github.com/intel/compute-runtime/issues/822

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-15 10:06:20 +02:00
Szymon Morek
1432f6a7b2 test: don't call expectMemory on compressed data
Compressed memory can't be compared using expectMemory api.
Instead, compressed memory is transferred to usm host.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-15 09:22:48 +02:00
Jaroslaw Warchulski
3e1aa33924 refactor: cleanup headers
Related-To: NEO-5548
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-14 14:59:40 +02:00
Marcel Skierkowski
e82be94368 refactor: Modernize writeDataToFile function
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-14 14:28:58 +02:00
Kamil Kopryk
dd3d294f87 performance: cache MOCS values
This change caches the most used MOCS values:

* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CONST);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CACHELINE_MISALIGNED);
inside gmmHelper class during initialization to avoid repeated
calls of virtual functions, branches and/or gmm lib access.

and adds more readably corresponding getters:
* getL1EnabledMOCS
* getL3EnabledMOCS
* getUncachedMOCS

If force all resources uncached is called,
these 3 cached mocs values are reinitialized

It also changes the order of gmmHelper members, to avoid
not needed padding after addressWidth
and simplifies logic in getMocsIndex function
for xehp and later products.

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-04-14 14:12:48 +02:00
Chodor, Jaroslaw
a079a2654d feature: Enabling support for DisableDeepBind debug env in ocloc
Accepted debug env :
NEO_OCLOC_DisableDeepBind=1

Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-12 23:21:19 +02:00
Tomasz Biernacik
18435df04c fix: unify deferring MOCS to PAT
Related-To: NEO-14643

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-04-11 20:10:25 +02:00
Chodor, Jaroslaw
72ae27d51e feature: Enabling debug variables reader in ocloc
This allows for using debug variables in ocloc with
a few by-design limitations :
* debug var needs to be prefixed with NEO_OCLOC_
(when setting-up at runtime as environment variable
or using .config file)
* debug var needs to be tagged with S_OCLOC scope
(at compile time, in debug_variable .inl files)

Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-11 19:58:53 +02:00
Szymon Morek
5b20450162 performance: enable compression on PTL
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-11 17:26:29 +02:00
Igor Venevtsev
ef52479ce4 feature: add FillImage1dBuffer built-in kernel
Resolves: NEO-13984, HSD-18041422852

Signed-off-by: Igor Venevtsev <igor.venevtsev@intel.com>
2025-04-10 14:04:13 +02:00
Damian Tomczak
f9516b8723 test: remove DEFAULT_TEST_FAMILY_NAME from fixture
Related-to: NEO-14395

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-04-09 23:29:25 +02:00
Marcel Skierkowski
5d01677454 refactor: Mock file system in ULTs
Functions: fileExists and loadDataToFile
use IO functions from namespace IoFunctions

Now tests that use these functions
are mocked by default,
but some still require access to real files
and have been restored the ability to read files.
They will be mocked in next PRs.

Related-To: NEO-7006
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-09 19:51:46 +02:00
Marcel Skierkowski
092291ce08 test: mock filesystem in ocloc ult pt 2
Related-To: NEO-14084
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-09 15:31:53 +02:00
Szymon Morek
73ad369b10 fix: invalidate texture cache before reading from image
Related-To: NEO-9004

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-09 14:01:22 +02:00
Bartosz Dunajski
ab7364ae19 test: improve HWTEST2_F macro
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-04-08 17:18:14 +02:00
Marcel Skierkowski
b75fbe8e2c refactor: mock filesystem in ocloc ult pt.1
Mocked IO operations in ./ocloc_tests application

Mocked gtest stdout capture in ocloc tests

Related-To: NEO-14084
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-08 16:08:42 +02:00
Young Jin Yoon
0de024dbf9 refactor: separate post-sync into EncodePostSync
Created EncodePostSync template struct to organize various post-sync
variables/functions from EncodeDispatchKernel

Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-04-07 20:06:14 +02:00
Vysochyn, Illia
146a7f7b9b feature: Add support for BUFFER_SIZE explicit argument
Related-To: NEO-13972

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2025-04-07 14:15:26 +02:00
Lukasz Jobczyk
8978ea5e5a performance: Do not create global fence allocation on integrated
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-07 11:22:04 +02:00
Szymon Morek
91ff22f255 fix: correctly calculate host ptr size for images
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-04 17:54:42 +02:00
Compute-Runtime-Validation
f332571d96 Revert "performance: Do not create global fence allocation on integrated"
This reverts commit ecf8a07d26.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-04 16:26:19 +02:00
Mateusz Hoppe
cc308719d8 fix: dump zebin when building program
- when unpackSingleDeviceBinary is not called zebin should be dumped
if debug key is set

Related-To: NEO-14153

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-04-04 14:06:33 +02:00
Lukasz Jobczyk
ecf8a07d26 performance: Do not create global fence allocation on integrated
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-04 11:45:22 +02:00
Szymon Morek
95e0244f70 fix: properly pass info about 3D image
Related-To: NEO-14538

It's valid for 3D image to copy 2D region.
Current checks for mip map do not consider that.
This change correctly checks for mip mapped 3D image.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 16:33:57 +02:00
Jaroslaw Warchulski
c010d17842 fix: respect compression flag in capability table
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-03 15:36:55 +02:00
Filip Hazubski
504440fc4d feature: Add ftrHeaplessMode flag
Pass hwInfo to isHeaplessModeEnabled and isForceBindlessRequired functions.

Related-To: NEO-14526

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-04-02 21:06:05 +02:00
Szymon Morek
8836f6df0b fix: forward mip map level for 3D images
Related-To: NEO-14539

1D and 2D images have already mip map level set correctly.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-02 11:46:41 +02:00
Mateusz Hoppe
94c6bd9382 test: fix unified memory tests
- do not expected memory with unified ptr as source - pagefault migration
is not available in aub mode
- limit buffer size to minimize aub file size and processing time

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-04-01 18:23:15 +02:00
Szymon Morek
f1b789cbb1 performance: enable staging for 3d images
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-01 16:32:41 +02:00
Szymon Morek
3010af596e performance: add infrastructure for staging with 3D images
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-01 15:30:30 +02:00
Mateusz Jablonski
744ff08454 test: correct verifying programmed GPU addresses
use memcmp instead of comparing dereferenced pointer
when address is programmed within Walker's inline data the memory location
address is 4B aligned and is not 8B aligned

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-01 13:18:14 +02:00
Lukasz Jobczyk
0a11a96a53 refactor: Add dedicated method to check if any ULLS light enabled
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-31 16:36:20 +02:00
Szymon Morek
62964a0b08 fix: invalidate caches when heap is placed into reuse list
Related-To: NEO-9004

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-31 12:30:29 +02:00
Filip Hazubski
3d9fc8968e fix: Add BMG device id
Add device ID: 0xE211

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-03-28 13:40:30 +01:00
Szymon Morek
ead0842763 feature: add L0 API to query kernel argument info
Related-To: NEO-14358

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-27 16:43:33 +01:00
Jack Myers
0aa2c4f0cb feature: allow removal of heapful code paths
Related-To: NEO-13007

Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-03-27 01:34:35 +01:00
Damian Tomczak
0243004907 feature: additional checkers to enable feature
Resolves: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-26 18:06:20 +01:00
Mateusz Hoppe
1d07198a4c test: refactor non uniform aub fixture
- set allowNonUniform flag in program

Related-To: NEO-14449

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-03-26 11:41:39 +01:00
Kamil Kopryk
402fc037c3 test: correct expectations 2/2
Related-To: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-26 09:03:13 +01:00
Arek G
4751461a30 fix: ocloc: allow passing -device and -device_options in different formats
Related-To: NEO-9563
Signed-off-by: Arek G <arkadiusz.grubba@intel.com>
2025-03-25 18:07:21 +01:00
Kamil Kopryk
b462f95884 test: correct expectations
Related-To: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-25 16:59:05 +01:00