Mateusz Jablonski
27d7b633e4
build: use drm-next headers instead of upstream kernel headers
...
pick drm-next uapi headers as of b60301774a8fe6c30b14a95104ec099290a2e904
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-04-17 17:13:18 +02:00
Compute-Runtime-Validation
b96cf55985
Revert "fix: Return max wg count equal one on platforms without dispatch all ...
...
This reverts commit b8be602bfb .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-04-17 12:10:11 +02:00
Chodor, Jaroslaw
da23da1b0f
test: Additional checks in GivenFclRedirection ocloc tests
...
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com >
2025-04-17 10:30:40 +02:00
Lukasz Jobczyk
d9a40734ae
performance: Remove global fence from CW post sync on BMG
...
Related-To: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-04-17 10:23:40 +02:00
Grzegorz Wawiorko
213e34135c
fix: Update CL_DEVICE_LATEST_CONFORMANCE_VERSION_PASSED value
...
Signed-off-by: Grzegorz Wawiorko <grzegorz.wawiorko@intel.com >
2025-04-16 12:59:01 +02:00
Szymon Morek
4d60465a9e
fix: don't flush texture cache if not required
...
Related-To: NEO-14645
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-04-16 10:38:24 +02:00
Chodor, Jaroslaw
2ce694c52b
feature: Enabling support for redirecting FCL to IGC for OCLOC
...
This allows for invoking IGC as OCL C translator in OCLOC paths
Accepted debug env :
NEO_OCLOC_UseIgcAsFcl=1 ; forces IGC
NEO_OCLOC_UseIgcAsFcl=2 ; forces FCL
NEO_OCLOC_UseIgcAsFcl=0 ; use defaults
Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com >
2025-04-15 16:44:35 +02:00
Maciej Plewka
b8be602bfb
fix: Return max wg count equal one on platforms without dispatch all support
...
Related-To: NEO-14125, GSD-10725
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2025-04-15 15:41:23 +02:00
Lukasz Jobczyk
f7939735da
refactor: Adjust USM cleaner to ULLS light
...
Related-To: NEO-13922
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-04-15 14:05:02 +02:00
Mateusz Jablonski
21f6bb8889
build: update opencl headers to v2024.10.24
...
https://github.com/intel/compute-runtime/issues/822
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-04-15 10:06:20 +02:00
Szymon Morek
1432f6a7b2
test: don't call expectMemory on compressed data
...
Compressed memory can't be compared using expectMemory api.
Instead, compressed memory is transferred to usm host.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-04-15 09:22:48 +02:00
Jaroslaw Warchulski
3e1aa33924
refactor: cleanup headers
...
Related-To: NEO-5548
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2025-04-14 14:59:40 +02:00
Marcel Skierkowski
e82be94368
refactor: Modernize writeDataToFile function
...
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com >
2025-04-14 14:28:58 +02:00
Kamil Kopryk
dd3d294f87
performance: cache MOCS values
...
This change caches the most used MOCS values:
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CONST);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER);
* getMOCS(GMM_RESOURCE_USAGE_OCL_BUFFER_CACHELINE_MISALIGNED);
inside gmmHelper class during initialization to avoid repeated
calls of virtual functions, branches and/or gmm lib access.
and adds more readably corresponding getters:
* getL1EnabledMOCS
* getL3EnabledMOCS
* getUncachedMOCS
If force all resources uncached is called,
these 3 cached mocs values are reinitialized
It also changes the order of gmmHelper members, to avoid
not needed padding after addressWidth
and simplifies logic in getMocsIndex function
for xehp and later products.
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-04-14 14:12:48 +02:00
Chodor, Jaroslaw
a079a2654d
feature: Enabling support for DisableDeepBind debug env in ocloc
...
Accepted debug env :
NEO_OCLOC_DisableDeepBind=1
Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com >
2025-04-12 23:21:19 +02:00
Tomasz Biernacik
18435df04c
fix: unify deferring MOCS to PAT
...
Related-To: NEO-14643
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com >
2025-04-11 20:10:25 +02:00
Chodor, Jaroslaw
72ae27d51e
feature: Enabling debug variables reader in ocloc
...
This allows for using debug variables in ocloc with
a few by-design limitations :
* debug var needs to be prefixed with NEO_OCLOC_
(when setting-up at runtime as environment variable
or using .config file)
* debug var needs to be tagged with S_OCLOC scope
(at compile time, in debug_variable .inl files)
Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com >
2025-04-11 19:58:53 +02:00
Szymon Morek
5b20450162
performance: enable compression on PTL
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-04-11 17:26:29 +02:00
Igor Venevtsev
ef52479ce4
feature: add FillImage1dBuffer built-in kernel
...
Resolves: NEO-13984, HSD-18041422852
Signed-off-by: Igor Venevtsev <igor.venevtsev@intel.com >
2025-04-10 14:04:13 +02:00
Damian Tomczak
f9516b8723
test: remove DEFAULT_TEST_FAMILY_NAME from fixture
...
Related-to: NEO-14395
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2025-04-09 23:29:25 +02:00
Marcel Skierkowski
5d01677454
refactor: Mock file system in ULTs
...
Functions: fileExists and loadDataToFile
use IO functions from namespace IoFunctions
Now tests that use these functions
are mocked by default,
but some still require access to real files
and have been restored the ability to read files.
They will be mocked in next PRs.
Related-To: NEO-7006
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com >
2025-04-09 19:51:46 +02:00
Marcel Skierkowski
092291ce08
test: mock filesystem in ocloc ult pt 2
...
Related-To: NEO-14084
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com >
2025-04-09 15:31:53 +02:00
Szymon Morek
73ad369b10
fix: invalidate texture cache before reading from image
...
Related-To: NEO-9004
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-04-09 14:01:22 +02:00
Bartosz Dunajski
ab7364ae19
test: improve HWTEST2_F macro
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2025-04-08 17:18:14 +02:00
Marcel Skierkowski
b75fbe8e2c
refactor: mock filesystem in ocloc ult pt.1
...
Mocked IO operations in ./ocloc_tests application
Mocked gtest stdout capture in ocloc tests
Related-To: NEO-14084
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com >
2025-04-08 16:08:42 +02:00
Young Jin Yoon
0de024dbf9
refactor: separate post-sync into EncodePostSync
...
Created EncodePostSync template struct to organize various post-sync
variables/functions from EncodeDispatchKernel
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2025-04-07 20:06:14 +02:00
Vysochyn, Illia
146a7f7b9b
feature: Add support for BUFFER_SIZE explicit argument
...
Related-To: NEO-13972
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com >
2025-04-07 14:15:26 +02:00
Lukasz Jobczyk
8978ea5e5a
performance: Do not create global fence allocation on integrated
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-04-07 11:22:04 +02:00
Szymon Morek
91ff22f255
fix: correctly calculate host ptr size for images
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-04-04 17:54:42 +02:00
Compute-Runtime-Validation
f332571d96
Revert "performance: Do not create global fence allocation on integrated"
...
This reverts commit ecf8a07d26 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2025-04-04 16:26:19 +02:00
Mateusz Hoppe
cc308719d8
fix: dump zebin when building program
...
- when unpackSingleDeviceBinary is not called zebin should be dumped
if debug key is set
Related-To: NEO-14153
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-04-04 14:06:33 +02:00
Lukasz Jobczyk
ecf8a07d26
performance: Do not create global fence allocation on integrated
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-04-04 11:45:22 +02:00
Szymon Morek
95e0244f70
fix: properly pass info about 3D image
...
Related-To: NEO-14538
It's valid for 3D image to copy 2D region.
Current checks for mip map do not consider that.
This change correctly checks for mip mapped 3D image.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-04-03 16:33:57 +02:00
Jaroslaw Warchulski
c010d17842
fix: respect compression flag in capability table
...
Related-To: NEO-9465
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2025-04-03 15:36:55 +02:00
Filip Hazubski
504440fc4d
feature: Add ftrHeaplessMode flag
...
Pass hwInfo to isHeaplessModeEnabled and isForceBindlessRequired functions.
Related-To: NEO-14526
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-04-02 21:06:05 +02:00
Szymon Morek
8836f6df0b
fix: forward mip map level for 3D images
...
Related-To: NEO-14539
1D and 2D images have already mip map level set correctly.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-04-02 11:46:41 +02:00
Mateusz Hoppe
94c6bd9382
test: fix unified memory tests
...
- do not expected memory with unified ptr as source - pagefault migration
is not available in aub mode
- limit buffer size to minimize aub file size and processing time
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-04-01 18:23:15 +02:00
Szymon Morek
f1b789cbb1
performance: enable staging for 3d images
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-04-01 16:32:41 +02:00
Szymon Morek
3010af596e
performance: add infrastructure for staging with 3D images
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-04-01 15:30:30 +02:00
Mateusz Jablonski
744ff08454
test: correct verifying programmed GPU addresses
...
use memcmp instead of comparing dereferenced pointer
when address is programmed within Walker's inline data the memory location
address is 4B aligned and is not 8B aligned
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2025-04-01 13:18:14 +02:00
Lukasz Jobczyk
0a11a96a53
refactor: Add dedicated method to check if any ULLS light enabled
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2025-03-31 16:36:20 +02:00
Szymon Morek
62964a0b08
fix: invalidate caches when heap is placed into reuse list
...
Related-To: NEO-9004
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-03-31 12:30:29 +02:00
Filip Hazubski
3d9fc8968e
fix: Add BMG device id
...
Add device ID: 0xE211
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2025-03-28 13:40:30 +01:00
Szymon Morek
ead0842763
feature: add L0 API to query kernel argument info
...
Related-To: NEO-14358
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2025-03-27 16:43:33 +01:00
Jack Myers
0aa2c4f0cb
feature: allow removal of heapful code paths
...
Related-To: NEO-13007
Signed-off-by: Jack Myers <jack.myers@intel.com >
2025-03-27 01:34:35 +01:00
Damian Tomczak
0243004907
feature: additional checkers to enable feature
...
Resolves: NEO-13973
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com >
2025-03-26 18:06:20 +01:00
Mateusz Hoppe
1d07198a4c
test: refactor non uniform aub fixture
...
- set allowNonUniform flag in program
Related-To: NEO-14449
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2025-03-26 11:41:39 +01:00
Kamil Kopryk
402fc037c3
test: correct expectations 2/2
...
Related-To: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-03-26 09:03:13 +01:00
Arek G
4751461a30
fix: ocloc: allow passing -device and -device_options in different formats
...
Related-To: NEO-9563
Signed-off-by: Arek G <arkadiusz.grubba@intel.com >
2025-03-25 18:07:21 +01:00
Kamil Kopryk
b462f95884
test: correct expectations
...
Related-To: NEO-13163
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2025-03-25 16:59:05 +01:00