Kamil Kopryk
101c914b1f
test: add missing ult for checking patch address in heapless mode OCL
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-05-20 17:10:16 +02:00
Fabian Zwoliński
59829aeb11
fix(ocl): Force rebuild if binary contains bindless kernel
...
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-05-20 14:15:17 +02:00
Chodor, Jaroslaw
90927135f9
feature: Adding elf rewriter utility
...
It will aid in various elf patching tasks both in ocloc
and UMD
Related-To: NEO-10190
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2024-05-20 13:44:11 +02:00
Maciej Plewka
90df4b298b
fix: cache flush dependency for queue blocked
...
Related-to: NEO-9872, HSD-18038461954
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-20 11:55:24 +02:00
Szymon Morek
8989bf4078
fix: set timestamp diff correctly if lower than resolution
...
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-17 20:15:58 +02:00
Dominik Dabek
a236171f0d
performance(ocl): enable device usm alloc reuse
...
Enabling on MTL+
Limited to use max 2% of global device memory.
Related-To: NEO-6893, NEO-11463
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-17 13:32:45 +02:00
Compute-Runtime-Validation
4c5d567b40
Revert "fix: remove not needed check in OCL CPU copy path 2"
...
This reverts commit 0a27aa9f7b.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-17 10:52:17 +02:00
Compute-Runtime-Validation
34f53d5d94
Revert "performance: Reuse GPU timestamps by default"
...
This reverts commit 7aceed58ca.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-16 21:52:10 +02:00
Bartosz Dunajski
0a27aa9f7b
fix: remove not needed check in OCL CPU copy path 2
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-16 12:38:39 +02:00
Szymon Morek
7aceed58ca
performance: Reuse GPU timestamps by default
...
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-15 17:51:42 +02:00
Dominik Dabek
b4d839fe29
performance(usm): l0, add usm host memory pooling
...
Disabled by default.
Related-To: NEO-11356
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 15:20:51 +02:00
Dominik Dabek
ae8c7589dc
refactor: move implicit arg has indirect access
...
Move implicit arg has indirect access boolean to kernelAttributes
Related-To: NEO-11396
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 13:11:04 +02:00
Dominik Dabek
fd47030ad6
fix: use igc indirect detection v3
...
Update to use igc indirect detection v3. Fix for not detecting indirects
passed as implicit arguments.
Related-To: NEO-11396
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 06:35:42 +02:00
Compute-Runtime-Validation
4c43ca518b
Revert "fix: remove not needed check in OCL CPU copy path"
...
This reverts commit 92f9c8be96.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-14 20:12:28 +02:00
Bartosz Dunajski
92f9c8be96
fix: remove not needed check in OCL CPU copy path
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-14 17:05:55 +02:00
Ratajewski, Andrzej
0b0f7bd4cd
feature: Enable support for cl_intel_subgroup_2d_block_io
...
This change enables support for cl_intel_subgroup_2d_block_io extension for
PVC and later.
Related-To: GSD-7825
Signed-off-by: Ratajewski, Andrzej <andrzej.ratajewski@intel.com>
2024-05-14 15:16:49 +02:00
Szymon Morek
e35b951a00
performance: Allow indirect allocs as pack on OpenCL
...
Related-To: NEO-11228
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-10 17:57:42 +02:00
Maciej Plewka
e39893485c
fix: add cache flush as dependency for bcs ccs synchronization
...
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-09 13:43:39 +02:00
Szymon Morek
10ed479b16
performance: share inter-module ISA allocations
...
Related-To: NEO-10258
Currently each module has its own GA
for kernel ISAs. This change allows new modules to
reuse an existing allocation.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-09 08:43:55 +02:00
Compute-Runtime-Validation
902b611179
Revert "fix: Fix front end programming for cooperative dispatch"
...
This reverts commit 36ddfaaf4d.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-08 11:14:37 +02:00
Aleksander Czerwionka
4667f55ea2
fix: add missing checks to return proper OpenCL error from API
...
changes affect clEnqueueNDCountKernelINTEL and
clGetKernelMaxConcurrentWorkGroupCountINTEL
Related-To: NEO-9688
Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-05-07 20:32:36 +02:00
Szymon Morek
34cfba4a2a
test: remove unused method
...
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-07 13:11:56 +02:00
Katarzyna Cencelewska
e9f7df6ae6
refactor: create helper for maxPtssIndex
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-05-07 12:47:52 +02:00
Weronika Kapusta
d6c16c1640
fix: remove compiler cache legacy implementation
...
Related-To: NEO-10679
Signed-off-by: Weronika Kapusta <weronika.kapusta@intel.com>
2024-05-06 18:28:35 +02:00
Szymon Morek
6df46aa062
performance: Iterate over indirect allocations once
...
Related-To: NEO-11228
Iterate only on new allocations when making indirect
allocations resident.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-06 15:51:37 +02:00
Szymon Morek
83e8ae4a20
performance: Reuse GPU timestamp instead of KMD escape
...
This can be enabled only by setting
the related debug flag.
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-06 14:46:30 +02:00
Compute-Runtime-Validation
8342c0ae2f
Revert "fix: add cache flush as dependency for bcs ccs synchronization"
...
This reverts commit 5e57bb2a32.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-01 03:05:47 +02:00
Mrozek, Michal
3ded04341c
refactor: remove not needed code.
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-04-30 16:24:21 +02:00
Mrozek, Michal
03dcff1398
refactor: remove not needed code.
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-04-30 15:10:34 +02:00
Fabian Zwoliński
ee71157f7f
fix: opencl support for bindless kernels
...
Related-To: NEO-11156
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-04-30 12:02:17 +02:00
Maciej Plewka
5e57bb2a32
fix: add cache flush as dependency for bcs ccs synchronization
...
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-04-30 11:57:49 +02:00
Lukasz Jobczyk
318616b3b1
refactor: Skip test without cooperative engine
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-04-26 10:25:45 +02:00
Artur Harasimiuk
cbc868c4ed
refactor: add mock_aubstream
...
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-04-25 20:08:30 +02:00
Lukasz Jobczyk
36ddfaaf4d
fix: Fix front end programming for cooperative dispatch
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-04-25 12:55:46 +02:00
Katarzyna Cencelewska
eb256f1ffd
refactor: use encode surface state to set pitch for scratch
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-04-25 11:57:14 +02:00
Lukasz Jobczyk
d122e1c906
fix: Flush render state cache on release shared object
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-04-22 17:37:42 +02:00
Fabian Zwolinski
9553dd1098
documentation: compiler cache programmers guide
...
Related-To: NEO-9416
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-04-22 15:44:49 +02:00
Andrzej Koska
ae139aeffd
refactor: Passing information about the engine
...
Extension of the interface with information
about the engine type passed to the function
Related-To: NEO-10678
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-04-22 13:33:56 +02:00
Katarzyna Cencelewska
87776f5f25
fix: clean use of coherency type
...
Related-To: NEO-7083
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-04-18 15:58:11 +02:00
Milczarek, Slawomir
a2e0f9ac09
fix: Add AUB/TBX writable for clEnqueueWriteBuffer on SVM hostptr
...
Related-To: NEO-11082
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2024-04-18 12:17:10 +02:00
Compute-Runtime-Validation
da9df9f0e7
Revert "performance: Reuse GPU timestamp instead of KMD escape"
...
This reverts commit 9ca2091725.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-04-18 10:25:15 +02:00
Timo Aaltonen
cb984b3e4b
fix: Fix various typos.
...
Signed-off-by: Timo Aaltonen <tjaalton@debian.org>
2024-04-17 20:03:06 +02:00
Mateusz Jablonski
8676b97412
build: add cmake flag to select platforms for aub generation
...
Related-To: NEO-11097
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-17 16:43:21 +02:00
Mateusz Jablonski
4067b899cc
test: don't create aub files when --null_aubstream flag is passed
...
Related-To: NEO-11097
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-17 14:47:06 +02:00
Morek, Szymon
9ca2091725
performance: Reuse GPU timestamp instead of KMD escape
...
Resolves: NEO-10615
Signed-off-by: Morek, Szymon <szymon.morek@intel.com>
2024-04-17 09:39:29 +02:00
Jaroslaw Warchulski
ea057abd36
fix: allow empty set to be passed to clSetKernelExecInfo
...
Related-To: GSD-8538
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-04-16 14:11:04 +02:00
Katarzyna Cencelewska
ce3bb1327e
feature: add method to switch enable/disable mid thread preemption
...
Resolves: NEO-8089
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-04-15 16:26:52 +02:00
Fabian Zwolinski
e7420de011
fix: ocloc fatbinary handle -out_dir without -output
...
Currently if we pass `-out_dir` in ocloc fatbinary cmdline, e.g.:
./ocloc compile -file vector.cl ... -out_dir ../exampler_dir
ocloc sets the name for the output file from the `-output` parameter.
But as the `-output` parameter is not provided,
the file name will be empty and won't be written to disk.
This patch adds support for a scenario
where you pass `-out_dir` without `-output`.
In this case, the file gets a default name: the input file's base name.
Related-To: NEO-10603
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-04-15 15:47:50 +02:00
Aleksander Czerwionka
b630f58861
fix: remove incorrectly reported extensions
...
Related-To: NEO-8349
Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-04-15 10:04:31 +02:00
Lukasz Jobczyk
8a0c425495
feature: Mark selected resources as UC when mitigating dc flush
...
Related-To: NEO-10556
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-04-12 12:52:13 +02:00