5150 Commits

Author SHA1 Message Date
Young Jin Yoon
b5a259aded refactor: add BlitSyncPropertiesExt to BlitSyncProperties
Added BlitSyncPropertiesExt to provide additional information for
different platform and/or blitter commands.

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-04-30 19:11:34 +02:00
Lukasz Jobczyk
c9a21c158e performance: Remove global fence allocation from BMG
Resolves: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-30 17:08:44 +02:00
Marcel Skierkowski
a0831d9d54 test: add ULTs to createDir method
createDir should return success
when the directory exists
or a new directory has been created

Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-30 09:45:59 +02:00
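The contract described in a0831d9d54 (success both when the directory already exists and when it is newly created) can be sketched with the standard library. This is only an illustration under stated assumptions, not the code from the commit: `createDirSketch` is a hypothetical name, and the use of `std::filesystem` is an assumption.

```cpp
#include <cassert>
#include <filesystem>
#include <system_error>

// Hypothetical stand-in for createDir; not the function from the commit.
// Returns true if `dir` exists as a directory when the call finishes.
bool createDirSketch(const std::filesystem::path &dir) {
    assert(!dir.empty()); // an empty path is treated as a caller bug in this sketch
    std::error_code ec;
    if (std::filesystem::create_directories(dir, ec)) {
        return true; // a new directory was created
    }
    if (ec) {
        return false; // creation failed, e.g. a regular file is in the way
    }
    // No error and nothing created: the directory was already there.
    return std::filesystem::is_directory(dir, ec);
}
```

A caller following 85f71062ae would then check the result rather than ignore it, for example `if (!createDirSketch(path)) { /* report the error */ }`.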
Mateusz Jablonski
949a2e5cb9 test: update expectation in test related to ptl acronyms
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-29 11:25:20 +02:00
Marcel Skierkowski
85f71062ae fix: check createDir return value
Check if the createDir function returns an error code

Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-29 11:10:19 +02:00
Lukasz Jobczyk
6f4a56d440 refactor: pass product helper to isFenceAllocationRequired
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-28 14:09:02 +02:00
Mateusz Hoppe
41efee1e7c feature: add mode to expose single root device
- simplify API to expose and use only a single root device. Disallow
returning subdevices. This is an experimental mode.

Related-To: NEO-14559

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-04-25 13:20:56 +02:00
Kindracki, Jakub Tomasz
77b4359983 test: Fix typo in a test
Related-To: NEO-3927
Signed-off-by: Kindracki, Jakub Tomasz <jakub.tomasz.kindracki@intel.com>
2025-04-25 10:37:27 +02:00
Michał Pryba
edfc512d3b test: adjust test matchers after pre-gen12 removal
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-04-23 14:30:21 +02:00
Maciej Plewka
6d085ed0d4 fix: Return max wg count equal one on platforms without dispatch all support
Second push of a change that was reverted due to errors in tests
Related-To: NEO-14125, GSD-10725

Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-04-23 13:48:11 +02:00
Compute-Runtime-Validation
b96cf55985 Revert "fix: Return max wg count equal one on platforms without dispatch all ...
This reverts commit b8be602bfb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-17 12:10:11 +02:00
Chodor, Jaroslaw
da23da1b0f test: Additional checks in GivenFclRedirection ocloc tests
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-17 10:30:40 +02:00
Lukasz Jobczyk
d9a40734ae performance: Remove global fence from CW post sync on BMG
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-17 10:23:40 +02:00
Grzegorz Wawiorko
213e34135c fix: Update CL_DEVICE_LATEST_CONFORMANCE_VERSION_PASSED value
Signed-off-by: Grzegorz Wawiorko <grzegorz.wawiorko@intel.com>
2025-04-16 12:59:01 +02:00
Szymon Morek
4d60465a9e fix: don't flush texture cache if not required
Related-To: NEO-14645

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-16 10:38:24 +02:00
Chodor, Jaroslaw
2ce694c52b feature: Enabling support for redirecting FCL to IGC for OCLOC
This allows for invoking IGC as OCL C translator in OCLOC paths

Accepted debug env :
NEO_OCLOC_UseIgcAsFcl=1 ; forces IGC
NEO_OCLOC_UseIgcAsFcl=2 ; forces FCL
NEO_OCLOC_UseIgcAsFcl=0 ; use defaults

Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-15 16:44:35 +02:00
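The three accepted values of NEO_OCLOC_UseIgcAsFcl listed in 2ce694c52b could be mapped to a translator choice roughly as follows. This is a sketch, not ocloc's actual parsing: the enum, the function names, and the fallback to defaults for unset or unrecognized values are all assumptions.

```cpp
#include <cassert>
#include <cstdlib>
#include <string>

// Hypothetical names; only the variable name and its values come from the commit.
enum class OclcTranslator { useDefaults, forceIgc, forceFcl };

// Maps the raw text of NEO_OCLOC_UseIgcAsFcl to a choice.
OclcTranslator parseUseIgcAsFcl(const char *value) {
    if (value == nullptr) {
        return OclcTranslator::useDefaults; // variable not set (assumption)
    }
    const std::string text(value);
    if (text == "1") {
        return OclcTranslator::forceIgc;
    }
    if (text == "2") {
        return OclcTranslator::forceFcl;
    }
    return OclcTranslator::useDefaults; // "0", or anything unrecognized (assumption)
}

// Reads the process environment and applies the mapping above.
OclcTranslator translatorFromEnvironment() {
    return parseUseIgcAsFcl(std::getenv("NEO_OCLOC_UseIgcAsFcl"));
}
```

Keeping the parsing separate from the `std::getenv` call makes the mapping checkable without touching the process environment.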
Maciej Plewka
b8be602bfb fix: Return max wg count equal one on platforms without dispatch all support
Related-To: NEO-14125, GSD-10725
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-04-15 15:41:23 +02:00
Lukasz Jobczyk
f7939735da refactor: Adjust USM cleaner to ULLS light
Related-To: NEO-13922

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-15 14:05:02 +02:00
Mateusz Jablonski
21f6bb8889 build: update opencl headers to v2024.10.24
https://github.com/intel/compute-runtime/issues/822

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-15 10:06:20 +02:00
Szymon Morek
1432f6a7b2 test: don't call expectMemory on compressed data
Compressed memory can't be compared using expectMemory api.
Instead, compressed memory is transferred to usm host.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-15 09:22:48 +02:00
Jaroslaw Warchulski
3e1aa33924 refactor: cleanup headers
Related-To: NEO-5548
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-14 14:59:40 +02:00
Marcel Skierkowski
e82be94368 refactor: Modernize writeDataToFile function
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-14 14:28:58 +02:00
Kamil Kopryk
dd3d294f87 performance: cache MOCS values
This change caches the most used MOCS values:

* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CONST);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CACHELINE_MISALIGNED);
inside gmmHelper class during initialization to avoid repeated
calls of virtual functions, branches and/or gmm lib access.

and adds more readable corresponding getters:
* getL1EnabledMOCS
* getL3EnabledMOCS
* getUncachedMOCS

If "force all resources uncached" is called,
these 3 cached MOCS values are reinitialized.

It also changes the order of gmmHelper members to avoid
unneeded padding after addressWidth,
and simplifies the logic in the getMocsIndex function
for xehp and later products.

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-04-14 14:12:48 +02:00
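The caching idea in dd3d294f87 (query the three most used values once at initialization, serve them from plain members, and re-query when all resources are forced uncached) might look like the sketch below. Everything here is illustrative: the `Usage` enum, `MocsSource`, `CountingSource`, and `MocsCacheSketch` are hypothetical, and the pairing of each usage with a getter is an assumption based on the order in the commit message.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical usage kinds standing in for the GMM_RESOURCE_USAGE_* values.
enum class Usage { bufferConst, buffer, bufferCachelineMisaligned };

// Stand-in for the expensive lookup (a virtual call into the gmm library).
struct MocsSource {
    virtual ~MocsSource() = default;
    virtual uint32_t getMocs(Usage usage) const = 0;
};

// Counts lookups so the effect of caching is observable.
struct CountingSource : MocsSource {
    mutable int lookups = 0;
    uint32_t getMocs(Usage usage) const override {
        ++lookups;
        return 10u + static_cast<uint32_t>(usage); // arbitrary placeholder values
    }
};

class MocsCacheSketch {
  public:
    explicit MocsCacheSketch(const MocsSource &source) : source(source) { refresh(); }

    // Re-query all three values, e.g. after a "force all uncached" switch.
    void refresh() {
        l1Enabled = source.getMocs(Usage::bufferConst);
        l3Enabled = source.getMocs(Usage::buffer);
        uncached = source.getMocs(Usage::bufferCachelineMisaligned);
    }

    // Getters are plain member reads: no virtual call, no branch.
    uint32_t getL1EnabledMocs() const { return l1Enabled; }
    uint32_t getL3EnabledMocs() const { return l3Enabled; }
    uint32_t getUncachedMocs() const { return uncached; }

  private:
    const MocsSource &source;
    uint32_t l1Enabled = 0;
    uint32_t l3Enabled = 0;
    uint32_t uncached = 0;
};
```

With a `CountingSource`, constructing the cache performs three lookups, and any number of getter calls afterwards performs none until `refresh()` is called again.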
Chodor, Jaroslaw
a079a2654d feature: Enabling support for DisableDeepBind debug env in ocloc
Accepted debug env :
NEO_OCLOC_DisableDeepBind=1

Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-12 23:21:19 +02:00
Tomasz Biernacik
18435df04c fix: unify deferring MOCS to PAT
Related-To: NEO-14643

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-04-11 20:10:25 +02:00
Chodor, Jaroslaw
72ae27d51e feature: Enabling debug variables reader in ocloc
This allows for using debug variables in ocloc with
a few by-design limitations :
* debug var needs to be prefixed with NEO_OCLOC_
(when setting-up at runtime as environment variable
or using .config file)
* debug var needs to be tagged with S_OCLOC scope
(at compile time, in debug_variable .inl files)

Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-11 19:58:53 +02:00
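The two limitations named in 72ae27d51e (the NEO_OCLOC_ prefix at runtime and the S_OCLOC scope tag at compile time) can be illustrated with a small filter. This is a sketch under assumptions, not the reader added by the commit: `DebugVarSketch`, `oclocKey`, `applyOclocSetting`, and the `S_RT` scope bit are hypothetical; only the prefix and the `S_OCLOC` name come from the commit message.

```cpp
#include <cassert>
#include <string>

// Scope bits; S_OCLOC is named in the commit, S_RT is a hypothetical sibling.
enum DebugScope : unsigned { S_NONE = 0u, S_RT = 1u << 0, S_OCLOC = 1u << 1 };

struct DebugVarSketch {
    std::string name; // variable name without any prefix
    unsigned scopes;  // bitwise OR of DebugScope values
    int value;
};

// Key under which ocloc would look the variable up.
std::string oclocKey(const DebugVarSketch &var) {
    return "NEO_OCLOC_" + var.name;
}

// Applies `newValue` only if the variable is tagged with S_OCLOC and the
// key carries the NEO_OCLOC_ prefix. Returns true when the value was applied.
bool applyOclocSetting(DebugVarSketch &var, const std::string &key, int newValue) {
    if ((var.scopes & S_OCLOC) == 0u) {
        return false; // not visible to ocloc
    }
    if (key != oclocKey(var)) {
        return false; // missing prefix or a different variable
    }
    var.value = newValue;
    return true;
}
```

For example, a variable named `DisableDeepBind` tagged with `S_OCLOC` accepts the key `NEO_OCLOC_DisableDeepBind` but ignores the unprefixed `DisableDeepBind`.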
Szymon Morek
5b20450162 performance: enable compression on PTL
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-11 17:26:29 +02:00
Igor Venevtsev
ef52479ce4 feature: add FillImage1dBuffer built-in kernel
Resolves: NEO-13984, HSD-18041422852

Signed-off-by: Igor Venevtsev <igor.venevtsev@intel.com>
2025-04-10 14:04:13 +02:00
Damian Tomczak
f9516b8723 test: remove DEFAULT_TEST_FAMILY_NAME from fixture
Related-to: NEO-14395

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-04-09 23:29:25 +02:00
Marcel Skierkowski
5d01677454 refactor: Mock file system in ULTs
Functions: fileExists and loadDataToFile
use IO functions from namespace IoFunctions

Tests that use these functions
are now mocked by default.
Some still require access to real files,
so their ability to read files has been restored.
They will be mocked in subsequent PRs.

Related-To: NEO-7006
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-09 19:51:46 +02:00
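The approach in 5d01677454, routing file helpers through replaceable IO functions so tests can run without a real file system, can be sketched with function pointers. The names below (`IoSketch`, `fileExistsSketch`, the mock functions) are hypothetical and are not taken from the repository's IoFunctions namespace.

```cpp
#include <cassert>
#include <cstdio>
#include <string>

namespace IoSketch {
using FopenFn = FILE *(*)(const char *, const char *);
using FcloseFn = int (*)(FILE *);

// Default to the real C library; tests may overwrite these pointers.
FopenFn fopenPtr = [](const char *path, const char *mode) { return std::fopen(path, mode); };
FcloseFn fclosePtr = [](FILE *file) { return std::fclose(file); };
} // namespace IoSketch

// Production-style helper: it never calls fopen/fclose directly.
bool fileExistsSketch(const std::string &path) {
    FILE *file = IoSketch::fopenPtr(path.c_str(), "rb");
    if (file == nullptr) {
        return false;
    }
    IoSketch::fclosePtr(file);
    return true;
}

// Mock pair: pretends every file exists without touching the disk.
// stdin is returned only as a valid non-null handle; it is never closed.
FILE *mockFopenAlwaysFound(const char *, const char *) { return stdin; }
int mockFcloseNoop(FILE *) { return 0; }
```

A test would assign `IoSketch::fopenPtr = &mockFopenAlwaysFound` and `IoSketch::fclosePtr = &mockFcloseNoop` before calling `fileExistsSketch`, and restore the originals afterwards if other tests need real files.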
Marcel Skierkowski
092291ce08 test: mock filesystem in ocloc ult pt 2
Related-To: NEO-14084
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-09 15:31:53 +02:00
Szymon Morek
73ad369b10 fix: invalidate texture cache before reading from image
Related-To: NEO-9004

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-09 14:01:22 +02:00
Bartosz Dunajski
ab7364ae19 test: improve HWTEST2_F macro
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-04-08 17:18:14 +02:00
Marcel Skierkowski
b75fbe8e2c refactor: mock filesystem in ocloc ult pt.1
Mocked IO operations in ./ocloc_tests application

Mocked gtest stdout capture in ocloc tests

Related-To: NEO-14084
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-08 16:08:42 +02:00
Lukasz Jobczyk
8978ea5e5a performance: Do not create global fence allocation on integrated
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-07 11:22:04 +02:00
Szymon Morek
91ff22f255 fix: correctly calculate host ptr size for images
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-04 17:54:42 +02:00
Compute-Runtime-Validation
f332571d96 Revert "performance: Do not create global fence allocation on integrated"
This reverts commit ecf8a07d26.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-04 16:26:19 +02:00
Mateusz Hoppe
cc308719d8 fix: dump zebin when building program
- when unpackSingleDeviceBinary is not called, zebin should be dumped
if the debug key is set

Related-To: NEO-14153

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-04-04 14:06:33 +02:00
Lukasz Jobczyk
ecf8a07d26 performance: Do not create global fence allocation on integrated
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-04 11:45:22 +02:00
Szymon Morek
95e0244f70 fix: properly pass info about 3D image
Related-To: NEO-14538

It's valid for a 3D image to copy a 2D region.
The current mip map checks do not consider that.
This change correctly checks for a mip-mapped 3D image.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 16:33:57 +02:00
Jaroslaw Warchulski
c010d17842 fix: respect compression flag in capability table
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-03 15:36:55 +02:00
Filip Hazubski
504440fc4d feature: Add ftrHeaplessMode flag
Pass hwInfo to isHeaplessModeEnabled and isForceBindlessRequired functions.

Related-To: NEO-14526

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-04-02 21:06:05 +02:00
Mateusz Hoppe
94c6bd9382 test: fix unified memory tests
- do not expect memory with unified ptr as source - pagefault migration
is not available in aub mode
- limit buffer size to minimize aub file size and processing time

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-04-01 18:23:15 +02:00
Szymon Morek
f1b789cbb1 performance: enable staging for 3d images
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-01 16:32:41 +02:00
Szymon Morek
3010af596e performance: add infrastructure for staging with 3D images
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-01 15:30:30 +02:00
Mateusz Jablonski
744ff08454 test: correct verifying programmed GPU addresses
Use memcmp instead of comparing a dereferenced pointer.
When the address is programmed within Walker's inline data, the memory
location is 4B aligned and is not 8B aligned.

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-04-01 13:18:14 +02:00
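The reasoning in 744ff08454 is that a 64-bit address stored at a 4-byte-aligned location should not be read through a `uint64_t` pointer, while a byte-wise comparison works at any alignment. A minimal sketch of such a check follows; `holdsGpuAddress` is a hypothetical helper, not a function from the test code.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// True if the 8 bytes at `location` encode `expected`. Works for any
// alignment, unlike *reinterpret_cast<const uint64_t *>(location).
bool holdsGpuAddress(const void *location, uint64_t expected) {
    assert(location != nullptr);
    return std::memcmp(location, &expected, sizeof(expected)) == 0;
}
```

The check can be exercised by copying an address with `std::memcpy` to offset 4 of an 8-byte-aligned buffer and passing that offset pointer to `holdsGpuAddress`.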
Szymon Morek
62964a0b08 fix: invalidate caches when heap is placed into reuse list
Related-To: NEO-9004

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-31 12:30:29 +02:00
Filip Hazubski
3d9fc8968e fix: Add BMG device id
Add device ID: 0xE211

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-03-28 13:40:30 +01:00
Jack Myers
0aa2c4f0cb feature: allow removal of heapful code paths
Related-To: NEO-13007

Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-03-27 01:34:35 +01:00
Damian Tomczak
0243004907 feature: additional checkers to enable feature
Resolves: NEO-13973

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2025-03-26 18:06:20 +01:00