Szymon Morek
83e8ae4a20
performance: Reuse GPU timestamp instead of KMD escape
...
This can be enabled only if related
debug flag will be set.
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-06 14:46:30 +02:00
Compute-Runtime-Validation
8342c0ae2f
Revert "fix: add cache flush as dependency for bcs ccs synchronization"
...
This reverts commit 5e57bb2a32
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-01 03:05:47 +02:00
Mrozek, Michal
3ded04341c
refactor: remove not needed code.
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-04-30 16:24:21 +02:00
Mrozek, Michal
03dcff1398
refactor: remove not needed code.
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-04-30 15:10:34 +02:00
Fabian Zwoliński
ee71157f7f
fix: opencl support for bindless kernels
...
Related-To: NEO-11156
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-04-30 12:02:17 +02:00
Maciej Plewka
5e57bb2a32
fix: add cache flush as dependency for bcs ccs synchronization
...
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-04-30 11:57:49 +02:00
Artur Harasimiuk
cbc868c4ed
refactor: add mock_aubstream
...
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-04-25 20:08:30 +02:00
Lukasz Jobczyk
d122e1c906
fix: Flush render state cache on release shared object
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-04-22 17:37:42 +02:00
Andrzej Koska
ae139aeffd
refactor: Passing information about the engine
...
Extension of the interface with information
about the engine type passed to the function
Related-To: NEO-10678
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-04-22 13:33:56 +02:00
Milczarek, Slawomir
a2e0f9ac09
fix: Add AUB/TBX writable for clEnqueueWriteBuffer on SVM hostptr
...
Related-To: NEO-11082
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2024-04-18 12:17:10 +02:00
Jaroslaw Warchulski
ea057abd36
fix: allow empty set to be passed to clSetKernelExecInfo
...
Related-To: GSD-8538
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-04-16 14:11:04 +02:00
Katarzyna Cencelewska
ce3bb1327e
feature: add method to switch enable/disable mid thread preemption
...
Resolves: NEO-8089
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-04-15 16:26:52 +02:00
Kamil Kopryk
4bb30e51a2
fix: use default csr for scratch in OCL
...
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-04-11 16:58:35 +02:00
Kamil Kopryk
cfc3b6a8c2
refactor: add heaplessModeEnabled argument to isHeaplessStateInitEnabled
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-04-11 10:08:10 +02:00
Mateusz Jablonski
cb2b572e94
feature: add support for null aub mode
...
In this mode AUB csr will be created, however, no aub file will be created
Related-To: NEO-11097
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-09 16:59:42 +02:00
Compute-Runtime-Validation
9568ee47e7
Revert "fix: remove compiler cache legacy implementation"
...
This reverts commit 864f42116c
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-04-09 02:06:51 +02:00
Weronika Kapusta
864f42116c
fix: remove compiler cache legacy implementation
...
Related-To: NEO-10679
Signed-off-by: Kapusta, Weronika <weronika.kapusta@intel.com>
2024-04-08 16:40:57 +02:00
Dominik Dabek
0d69dfe165
refactor: remove comment
...
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-04-08 11:43:19 +02:00
Kacper Nowak
35ca2bbea9
refactor: refactor logic for querying atomic FP capabilities
...
- Move logic from product helper to compiler product helper
- Add method for adjusting fp16 and extra capabilities using release
helper (if present).
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-03-28 15:27:45 +01:00
Mateusz Jablonski
bcaa62bed3
build: refactor linux kernel headers
...
Add cmake flags to provide path to headers to drm / i915 /i915 prelim / xe
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-28 14:48:09 +01:00
Lukasz Jobczyk
436579ca8c
fix: Disable async release of buffers with external host ptr
...
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-27 15:07:14 +01:00
Compute-Runtime-Validation
707796a50d
Revert "fix: Disable async release of buffers with external host ptr"
...
This reverts commit 9b715c3ffc
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-27 07:49:54 +01:00
Bartosz Dunajski
fce016ee7a
refactor: add new semaphore param
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-26 15:46:47 +01:00
Mateusz Jablonski
78a4a92b44
refactor: reorder members to reduce internal padding in structs
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-25 15:50:00 +01:00
Lukasz Jobczyk
9b715c3ffc
fix: Disable async release of buffers with external host ptr
...
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-25 12:55:57 +01:00
Filip Hazubski
07b9117b44
refactor: Explicitly delete copy c-tor and copy assignment of GlBuffer
...
Comply with the rule of three.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-03-25 10:55:02 +01:00
Compute-Runtime-Validation
e9b866ab11
Revert "fix: Disable async release of buffers with external host ptr"
...
This reverts commit 70a62d89e3
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-23 11:49:42 +01:00
Lukasz Jobczyk
70a62d89e3
fix: Disable async release of buffers with external host ptr
...
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-22 18:11:27 +01:00
Katarzyna Cencelewska
da7b03dd15
fix: to always use grfs count in calculateNumThreadsPerThreadGroup
...
grf size != grf count
Related-To: GSD-8437
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-03-22 11:03:18 +01:00
Katarzyna Cencelewska
dd1d52259e
refactor: add param rootDeviceEnvironment to calculateNumThreadsPerThreadGroup
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-03-21 22:25:14 +01:00
Mateusz Hoppe
0cec5ccc32
fix: check if any of devices has BindlessHeapHelper enabled
...
- report Bindless_images extension in driver only when at least one
device has global bindless heaps enabled
Related-To: NEO-10352
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-20 20:50:58 +01:00
Bartlomiej Wolny
7db5f6d5bf
fix: add texture cache flush on image dtor
...
Related-To: NEO-9794
Signed-off-by: Bartlomiej Wolny <bartlomiej.wolny@intel.com>
2024-03-18 17:03:17 +01:00
Fabian Zwolinski
eb259c1f52
fix: set kernel LWS prior to obtaining indirect heaps
...
When calculating the size of the indirect object heap,
the local work group size from kernel implicit args is taken into account.
If the LWS is not set before this calculation,
it can lead to insufficient ioh allocation size.
Such a problem is seen when local ids are generated by the runtime
and then written to ioh. The write fails due to lack of space in the allocation.
Related-To: IGC-7708
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-03-14 10:38:46 +01:00
Ben Ashbaugh
e00230a6c2
fix: correctly report support for SPIR-V 1.0 through 1.3
...
Related-To: NEO-10336
Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2024-03-12 17:54:09 +01:00
Grzegorz Wawiorko
87f2357868
fix: Update CL_DEVICE_LATEST_CONFORMANCE_VERSION_PASSED value
...
Signed-off-by: Grzegorz Wawiorko <grzegorz.wawiorko@intel.com>
2024-03-12 08:42:21 +01:00
Mrozek, Michal
f71f6d2b72
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 18:18:55 +01:00
Mateusz Hoppe
5e15543b14
test: improve unit tests - use virtual file system
...
- use virtualFileSystem in ULTs, writeDataToFile() does not create file
but instead saves filename. Call to fileExists() checks if file was
previously written
Related-To: NEO-7006
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-08 13:34:19 +01:00
Kamil Kopryk
168445784e
feature: introduce states programming at driver init heapless ocl
...
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-03-08 12:29:44 +01:00
Lukasz Jobczyk
88a8bc8ac6
fix: Download BCS allocations when timestamps are completed
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-08 10:47:53 +01:00
Maciej Plewka
f0281202bf
feature: Signal to OGL creating/destroying shared buffer
...
Related-To: NEO-9151
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-03-06 16:28:28 +01:00
Lukasz Jobczyk
895c951a7f
fix: Unlock CmdQ before download allocations
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-05 14:53:11 +01:00
Mateusz Jablonski
bfbe5a048c
fix: verify retVal from poll function in gl sharing sync on Linux
...
unify logic of synchronization objects
add default iniialization of gl context guard members
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-05 11:26:27 +01:00
Lukasz Jobczyk
246c24f78f
fix: Download allocations when timestamps are complete
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-04 16:05:31 +01:00
Mateusz Jablonski
8c55dd91a6
fix: don't use poll/close syscalls in gl sharing tests
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-04 14:09:46 +01:00
Fabian Zwolinski
52430762ac
fix: cl_cache L0 env vars + refactor code structure
...
Added support for new Compiler Cache
environment variables in Level Zero.
Moved
`opencl/source/compiler_interface/default_cache_config.cpp`
`level_zero/core/source/compiler_interface/default_cache_config.cpp`
to shared directory
`source/compiler_interface/default_cache_config.cpp`
Switched enabling cache by default from per OS to per API.
Changed default state of cl_cache in Level Zero to disabled.
Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-03-01 17:35:08 +01:00
Sylvain Munaut
e2c511bc00
feature(ocl): Improve Linux CL/GL sharing support
...
This commit is aimed at drastically improving the support for the CL/GL
sharing
extension on linux. The current support is not really usable as it only
supports a few texture format, and only on EGL contexts. It is also
pretty
buggy since it requires the texture to be bound when placing the CL call
to
share it which is just plain wrong and will not work in many
applications.
This new version makes used of the "official" interop extension from
MESA
which is available for GLX and EGL contexts, allows sharing of buffers
and
not just texture and supports many more formats.
This is still far from being a fully compliant / full featured version
of
the extension, but it's a big step forward in my opinion and allows to
run
some real applications.
I've tested gr-fosphor (SDR spectrum display) and Davinci Resolve as
examples.
Both of theses don't work without theses improvements.
Fixes: https://github.com/intel/compute-runtime/issues/659
Fixes: https://github.com/intel/compute-runtime/issues/667
https://github.com/intel/compute-runtime/pull/673
Related-To: NEO-3599
Signed-off-by: Sylvain Munaut <tnt@246tNt.com>
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-01 13:38:56 +01:00
Zbigniew Zdanowicz
4baee71974
refactor: add out cmd argument to semaphore wait encoder
...
Related-To: NEO-10065
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-01 11:43:36 +01:00
Mateusz Jablonski
373ef608bc
refactor: unify calculating max group size between OCL and L0
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-27 13:32:10 +01:00
Zbigniew Zdanowicz
0bf8e8727e
refactor: add output buffer argument to store register to memory encoder
...
Related-To: NEO-10064
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-02-27 13:31:55 +01:00
Lukasz Jobczyk
cfd3edfb2c
fix: Align IOH entry
...
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-26 14:36:31 +01:00
Kamil Kopryk
0c5cba8ebd
refactor: use common indirectDataAlignment static constexpr value
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-16 13:55:00 +01:00
Compute-Runtime-Validation
57c946b61c
Revert "fix: align indirect data pointer to cacheline size in heapless mode"
...
This reverts commit 004e6e647f
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-15 05:49:19 +01:00
Compute-Runtime-Validation
9570d02134
Revert "fix: correctly report support for SPIR-V 1.0 through 1.3"
...
This reverts commit 5120ec2f93
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-15 04:00:31 +01:00
Kacper Nowak
cccd9a3703
refactor: Add method for retrieving extra device info
...
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-02-14 16:38:15 +01:00
Kamil Kopryk
004e6e647f
fix: align indirect data pointer to cacheline size in heapless mode
...
Align indirect data pointer to cacheline size in heapless mode,
restore debug_break_if if avx2 load/store operation
gets unaligned pointer,
remove fallback to mm256 loadu/storeu unaligned operation
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-14 10:00:15 +01:00
Ben Ashbaugh
5120ec2f93
fix: correctly report support for SPIR-V 1.0 through 1.3
...
Related-To: NEO-10336
Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2024-02-14 09:21:06 +01:00
Compute-Runtime-Validation
4656e2ec12
Revert "refactor: remove not needed check for cpu transfer in ocl"
...
This reverts commit 098e2e76fb
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-13 21:16:43 +01:00
Dunajski, Bartosz
098e2e76fb
refactor: remove not needed check for cpu transfer in ocl
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-13 18:47:14 +01:00
Kacper Nowak
2e997a2542
refactor: Use shared method for setting FP16/32 caps in OCL
...
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-02-09 08:47:20 +01:00
Kamil Kopryk
59f661356c
feature: use heapless builtins in OCL if supported
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-08 15:49:58 +01:00
Lukasz Jobczyk
486cc71b76
refactor: Add GDI profiling
...
Resolves: NEO-9236
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-07 18:44:11 +01:00
Dominik Dabek
371788210d
performance: limit usm host allocation recycle
...
Query system total memory size and limit usm host allocation recycle to
use at most x%.
x is read from ExperimentalEnableDeviceAllocationCache for device and
ExperimentalEnableHostAllocationCache for host.
Related-To: GSD-7497
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-07 17:45:41 +01:00
Kacper Nowak
ab1ffea592
refactor: Create function to check if OCL2.1 caps are enabled or enforced
...
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-02-06 08:27:09 +01:00
Dominik Dabek
2cad595a0d
performance: debug flag for usm host alloc recycle
...
set ExperimentalEnableHostAllocationCache=1 to recycle host usm
allocations
Related-To: GSD-7497
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-01 16:47:59 +01:00
Kamil Kopryk
6d3a53fe7f
refactor: Improve scratch programming in heapless mode
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-31 10:39:39 +01:00
Compute-Runtime-Validation
fb46066abc
Revert "fix: enable cache env variables for level-zero"
...
This reverts commit 743904d2df
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-31 08:33:05 +01:00
Fabian Zwolinski
743904d2df
fix: enable cache env variables for level-zero
...
Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-01-30 14:01:42 +01:00
Compute-Runtime-Validation
fa9c79fb63
Revert "refactor: Add GDI profiling"
...
This reverts commit 524ae7713a
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-30 10:47:34 +01:00
Dominik Dabek
dcab4863d5
performance(ocl): calculate max buffer pool count
...
Set max buffer pool count to use at most 2 percent of device total memory.
Related-To: NEO-9690
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-01-30 07:04:36 +01:00
Mateusz Jablonski
d2c6283879
fix(ocl): add support for deprecated value of CL_MEM_DEVICE_ID_INTEL
...
Resolves: NEO-10184, NEO-10195, NEO-10196
Resolves: NEO-10200, NEO-10205, NEO-10207
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-29 17:07:26 +01:00
Lukasz Jobczyk
524ae7713a
refactor: Add GDI profiling
...
Resolves: NEO-9236
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-29 11:36:04 +01:00
Mateusz Jablonski
327d4c4d48
fix: update OpenCL headers to v2023.12.14
...
https://github.com/KhronosGroup/OpenCL-Headers/releases/tag/v2023.12.14
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-25 16:42:14 +01:00
Katarzyna Cencelewska
67b0b18be3
fix: osAgnostic path for allocate with alignment
...
Resolves: NEO-9334
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-01-25 11:03:34 +01:00
Kamil Kopryk
6d1df09e58
fix: fix exported function address when heapless mode is enabled in OCL
...
Related-To: IGC-8472
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-24 16:21:42 +01:00
Dominik Dabek
6e434e0424
performance(ocl): increase buffer pool size
...
increase pool size to 2MB and threshold to 1MB
add limit to the number of pools, set to 2
Related-To: NEO-9690
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-01-24 10:55:29 +01:00
Kamil Kopryk
87d13fcb6e
fix: fix scratch programming in heapless mode
...
Related-To: NEO-10107
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-23 15:26:56 +01:00
Mateusz Jablonski
a697a3f718
refactor: create new members for storing spill and private memory in scratch
...
rename private scratch space into scratch space slot 1 as it can be generic
Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-23 12:42:25 +01:00
Compute-Runtime-Validation
f9f9035b95
Revert "refactor: create new members for storing spill and private memory in ...
...
This reverts commit 87eb5f554a
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-23 09:13:00 +01:00
Compute-Runtime-Validation
e949ba7144
Revert "refactor: Add GDI profiling"
...
This reverts commit 8d56f8fb6b
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-23 06:13:02 +01:00
Mateusz Jablonski
87eb5f554a
refactor: create new members for storing spill and private memory in scratch
...
rename private scratch space into scratch space slot 1 as it can be generic
Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-22 19:48:48 +01:00
Dunajski, Bartosz
b77e1a6a71
feature: improve reporting max cooperative group count
...
Related-To: NEO-8210
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-22 14:56:15 +01:00
Lukasz Jobczyk
8d56f8fb6b
refactor: Add GDI profiling
...
Resolves: NEO-9236
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-22 14:24:08 +01:00
Dunajski, Bartosz
6f4ed10919
refactor: unify getMaxWorkGroupCount logic
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-22 11:16:45 +01:00
Dominik Dabek
7a09c51f3d
fix(ocl): querying memobj offset
...
Querying buffer offset via clGetMemObjectInfo should not modify its
value.
Related-To: NEO-9690
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-01-19 15:48:46 +01:00
Dunajski, Bartosz
b266f1f3cc
refactor: improve implicit scaling interface
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-19 14:09:04 +01:00
Dominik Dabek
e0ccf22557
fix: indirect access in external functions
...
Read indirect_stateless_count in module external functions.
If greater than 0, mark all kernels that have the has_stack_calls flag
set from this module as having indirect accesses.
Related-To: NEO-7712
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-01-18 15:29:41 +01:00
Kamil Kopryk
36dcfc0b42
feature: add stateless heapless builtins
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-16 11:11:31 +01:00
Zbigniew Zdanowicz
c49695fa6e
fix: flush L3 cache and wait for flush to complete when event is completed
...
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-01-16 07:05:35 +01:00
Dominik Dabek
b3b402cad2
feature(zebin): read indirect detection version
...
Read indirect detection version intel compat notes from zebin.
This is to prepare for enabling indirect access optimization in AOT
built kernels.
Related-To: NEO-7712
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-01-12 14:20:52 +01:00
Kamil Kopryk
f99edf72e1
refactor: Add common HardwareCommandsHelper enablers
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-12 09:39:12 +01:00
Mateusz Jablonski
a73fb4d2fe
fix: correct reporing kernel private size on L0
...
unify the logic across APIs
Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-09 09:54:05 +01:00
Dominik Dabek
9b52d52062
performance(ocl): enable usm pool allocator
...
Enable on xe hpg and lpg platforms
Related-To: NEO-9700
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-01-05 16:17:48 +01:00
Dominik Dabek
af1620a308
fix(ocl): allocation info from pool svm ptr
...
Fix querying allocation info from pooled svm ptr.
Handle requested allocation alignment.
Refactor sorted vector usage.
Do not associate device with host pool allocation.
Related-To: NEO-9700
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-01-05 15:20:01 +01:00
Kamil Kopryk
2a46350ba3
fix: fix programming indirect data pointer in heapless mode
...
Indirect data pointer should be offsetted by used offset.
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-04 07:51:31 +01:00
Compute-Runtime-Validation
5535ef3049
Revert "performance(ocl): enable usm pool allocator"
...
This reverts commit 7bc8424a69
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-12-29 05:54:07 +01:00
Mateusz Hoppe
31e9b5e9fa
feature: add support for secondary contexts in group
...
Related-To: NEO-7824
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-12-28 13:31:08 +01:00
Dominik Dabek
7bc8424a69
performance(ocl): enable usm pool allocator
...
Enable opencl usm pool allocator by default
Related-To: NEO-9700
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-12-28 13:14:41 +01:00
Dominik Dabek
d238a68bae
fix(ocl): usm pool allocator correct size
...
Wrong debug flag was used for setting host allocation pool size
Related-To: NEO-9700
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-12-27 23:14:28 +01:00
Dunajski, Bartosz
df66a0276f
refactor: remove not used logic to check dynamic postsync layout
...
Related-To: NEO-8210
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-27 13:12:11 +01:00
Dominik Dabek
2fe3804cc2
performance(ocl): add usm allocation pooling flag
...
EnableDeviceUsmAllocationPool and EnableHostUsmAllocationPool for device
and host allocations respectively.
Pool size will be set to flag value * MB.
Allocation size threshold to be pooled is 1MB.
Pools are created per context.
Related-To: NEO-9700
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-12-27 11:42:01 +01:00
Dominik Dabek
239831f097
fix: return error for 0 size usm allocations
...
according to both level zero and opencl specs, usm allocations with
size=0 should return invalid/unsupported buffer size errors
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-12-22 17:26:39 +01:00
Kacper Nowak
18fb432552
refactor: Stop using hard-coded BCS engine as default one
...
Do not rely on presence of BCS0 engine.
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-12-21 17:34:57 +01:00
Mateusz Jablonski
a4888b39c6
build: add clang-tidy restriction for Enum case
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-21 08:58:51 +01:00
Mateusz Jablonski
bf60d77e10
refactor: correct naming of enum class constants n/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-20 10:56:37 +01:00
Mateusz Jablonski
138fb65401
refactor: correct naming of enum class constants 11/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 14:52:57 +01:00
Mateusz Jablonski
de93bc6928
refactor: correct naming of enum class constants 10/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 11:30:39 +01:00
Mateusz Jablonski
a6458433dc
refactor: correct naming of enum class constants 9/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 10:36:25 +01:00
Mateusz Jablonski
dd1b9d6abc
refactor: correct naming of enum class constants 8/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 08:18:18 +01:00
Chodor, Jaroslaw
7e795cd3c1
feature: Adding support for OCL C support queries to ocloc
...
Feature needed for automated OCL C compilation with ocloc as backend.
Added queries :
* CL_DEVICE_EXTENSIONS
* CL_DEVICE_EXTENSIONS_WITH_VERSION
* CL_DEVICE_PROFILE
* CL_DEVICE_OPENCL_C_ALL_VERSIONS
* CL_DEVICE_OPENCL_C_FEATURES
Sample command line:
ocloc query -device skl CL_DEVICE_OPENCL_C_FEATURES
Related-To: GSD-7420
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2023-12-18 15:19:16 +01:00
Dunajski, Bartosz
d99104d5bf
refactor: improve ImplicitArg struct handling
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-18 15:19:00 +01:00
Mateusz Jablonski
fca28a930f
refactor: correct naming of enum class constants 7/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-18 13:05:56 +01:00
Ben Ashbaugh
976c6d3acc
feature: initial register count query implementation
...
Adds a per-kernel and per-device query to determine the
number of GRF registers that a kernel was compiled for.
This is an informal query for now, but may be added to
a formally supported extension in the future.
Related-To: NEO-9807
Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2023-12-18 11:45:01 +01:00
Mateusz Jablonski
27fbdde4c5
refactor: correct naming of unified memory enums
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 15:58:21 +01:00
Mateusz Jablonski
739d181026
refactor: correct naming of enum class constants 6/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 14:48:52 +01:00
Mateusz Jablonski
8c90350f3c
refactor: correct naming of enum class constants 5/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 10:06:02 +01:00
Mateusz Jablonski
432142c574
refactor: correct naming of enum class constants 4/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 08:08:51 +01:00
Mateusz Jablonski
895519db38
refactor: correct naming of NEOImageType enum values
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-12 11:15:28 +01:00
Mateusz Jablonski
b182917d9d
refactor: correct naming of allocation types
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-11 16:23:37 +01:00
Mateusz Jablonski
beafea9b39
refactor: correct naming of enum class constants 2/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-11 13:13:35 +01:00
Dunajski, Bartosz
58323bfb12
refactor: improve PostSync helper methods
...
Related-To: NEO-8210
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-11 10:41:30 +01:00
Compute-Runtime-Validation
a2994e9b29
Revert "performance(ocl): set pool allocator threshold 1MB"
...
This reverts commit fc1d93af8e
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-12-09 07:02:42 +01:00
Milczarek, Slawomir
bec9874487
fix: Improved event profiling setup for bcs split enqueue
...
Fixes OpenCL Khronos failures in test profiling with bcs split
Related-To: NEO-8927
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2023-12-07 21:29:55 +01:00
Dunajski, Bartosz
aab3e7f7c5
refactor: improve getting postsync_data struct type
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-07 15:41:44 +01:00
Kamil Kopryk
7a14db7f9b
refactor: add max front end threads in EncodeWalkerArgs
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-12-07 11:38:46 +01:00
Lukasz Jobczyk
c8c3f862f4
refactor: Add key to force zero copy without coherency
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-12-07 07:56:54 +01:00
Katarzyna Cencelewska
39ccf0297f
fix: Skip rebuild binary forced by flag when lack of ir
...
when flag RebuildPrecompiledKernels set and call clCreateProgramWithBinary
skip rebuild if ir binary not available
Resolves: HSD-18035219734
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2023-12-06 20:00:05 +01:00
Dominik Dabek
fc1d93af8e
performance(ocl): set pool allocator threshold 1MB
...
Increase pool allocator threshold to 1MB
Remove stack allocations based on threshold in tests.
Related-To: NEO-9690
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-12-06 19:55:48 +01:00
Dunajski, Bartosz
9ce3713ace
refactor: improve Walker args handling
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-06 16:10:26 +01:00
Mateusz Jablonski
da957d1a37
refactor: correct naming of enum class constants 1/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-05 14:26:42 +01:00
Maciej Plewka
f8eefbd020
fix: return error code for usm param passed to clSetKernelExecInfo
...
Related-To: NEO-8202
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-12-05 09:14:00 +01:00
Mateusz Jablonski
6849d33326
refactor: remove redundant definitions KB/MB/GB
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-05 08:55:18 +01:00
Mateusz Jablonski
92838c05fd
refactor: correct variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-04 19:02:36 +01:00
Mateusz Jablonski
c48e57e2de
refactor: rename register offsets and constants
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-04 18:31:39 +01:00
Milczarek, Slawomir
87247e6949
fix: Add path for clCreateBuffer with host unified memory ptr
...
Related-To: NEO-9612
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2023-12-04 18:16:56 +01:00
Mateusz Jablonski
9486cd2a26
refactor: rename WALKER_TYPE to DefaultWalkerType
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-04 16:29:32 +01:00
Mateusz Jablonski
552c129c0b
refactor: correct variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-04 15:14:02 +01:00
Kamil Kopryk
933e5ab818
refactor: remove dead code
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-12-04 14:31:00 +01:00
Mateusz Jablonski
c3d3a4db1f
refactor: correct variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-04 13:45:53 +01:00
Baj, Tomasz
fe0c7f1004
fix: when size is 0 then don't copy
...
Related-To: NEO-8381
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-12-04 11:18:00 +01:00
Kamil Kopryk
aa950a4a96
feature: Add heapless mode programming in l0 1/n
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-12-01 17:01:01 +01:00
Mateusz Jablonski
cdcd834712
refactor: correct naming of builtin types
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 14:08:21 +01:00
Mateusz Jablonski
e57d372608
refactor: correct naming of enum class constants
...
EngineGroupType
BlitDirection
PostBlitMode
WaitStatus
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 11:53:51 +01:00
Mateusz Jablonski
b92b5892b8
refactor: correct variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 08:34:19 +01:00
Mateusz Jablonski
c3ac7b78bd
refactor: correct variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 02:18:46 +01:00
Mateusz Jablonski
c9664e6bad
refactor: rename global debug manager to debugManager
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-30 13:00:59 +01:00
Mateusz Jablonski
36194c4e7d
refactor: correct variable namings
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-29 23:49:03 +01:00
Mateusz Jablonski
aac8686095
fix: correct calculating submit cpu timestamp
...
Related-To: NEO-9590, HSD-18035219634
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-29 17:42:41 +01:00
Kamil Kopryk
ce7298d512
feature: Add heapless mode programming in ocl
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-11-24 12:53:39 +01:00
Lukasz Jobczyk
ac8c00048e
performance: optimize svm allocation tracking
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-11-23 10:54:01 +01:00
Fabian Zwolinski
40fd1b0336
fix: report correct version of cl_khr_external_memory extension
...
Related-To: NEO-9530
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-11-17 15:49:56 +01:00
Kacper Nowak
c504b497d7
refactor: Store AIL in root device environment
...
Instead of storing AIL configurations in global table, store it
in root device environment.
This also prevents potential scenario with accessing deleted memory due
to symbol collision when application uses both OCL/L0 libraries.
- AIL is now stored in root device environment, and gets initialized
with other helpers
- Minor: corrected naming in ULTs
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
Related-To: NEO-9240
2023-11-17 15:18:27 +01:00
Compute-Runtime-Validation
27000c06cb
Revert "refactor: Store AIL in root device environment"
...
This reverts commit 02ef252b7d
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-11-16 19:37:42 +01:00
Andrzej Koska
63aaa3b63f
fix: Race removal between enqueueKernel and clFinish
...
Related-To: NEO-8196
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2023-11-16 17:33:44 +01:00
Mateusz Jablonski
140532f8b6
fix: fix issues with clang-tidy on Windows
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-16 12:30:11 +01:00
Compute-Runtime-Validation
7f61217a44
Revert "performance: optimize svm allocation tracking"
...
This reverts commit e91ce78ec8
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-11-16 11:03:19 +01:00
Warchulski, Jaroslaw
d358ed051f
fix: remove unnecessary check in patchPrivateSurface function
...
Related-To: GSD-3840
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-11-16 10:04:41 +01:00
Lukasz Jobczyk
e91ce78ec8
performance: optimize svm allocation tracking
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-11-15 13:58:05 +01:00
Kacper Nowak
02ef252b7d
refactor: Store AIL in root device environment
...
Instead of storing AIL configurations in global table, store it
in root device environment.
This also prevents potential scenario with accessing deleted memory due
to symbol collision when application uses both OCL/L0 libraries.
- AIL is now stored in root device environment, and gets initialized
with other helpers
- Minor: corrected naming in ULTs
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
Related-To: NEO-9240
2023-11-15 13:17:56 +01:00
Compute-Runtime-Validation
20cef540e0
Revert "performance: re-enable map/UnmapMemObject on CPU"
...
This reverts commit ef7c1f3940
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-11-10 16:33:12 +01:00
Baj, Tomasz
10a97548c2
refactor: remove ReturnSubDevicesAsApiDevices from OCL and L0
...
Related-To: NEO-9437
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-11-09 15:43:06 +01:00
Mateusz Jablonski
75ed29ceef
build: correct flags for building clang on Windows with UBSAN
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-08 15:42:14 +01:00
Kamil Kopryk
ab67687e20
feature: apply appendExtraInternalOptions in ocl and ocloc
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-11-08 09:01:13 +01:00
Igor Venevtsev
ef7c1f3940
performance: re-enable map/UnmapMemObject on CPU
...
Resolves: NEO-9398
Signed-off-by: Igor Venevtsev <igor.venevtsev@intel.com>
2023-11-08 02:27:22 +01:00
Mateusz Jablonski
534234bd79
build: fix linking ocl library
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-03 13:01:00 +01:00
Mateusz Jablonski
e301418423
fix: correct issues with clang tidy
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-31 12:23:42 +01:00
Baj, Tomasz
28c6dbf8c1
fix: set ParentDevice for subDevice only
...
Related-To: GSD-6609
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-10-31 12:11:39 +01:00
Mateusz Jablonski
9c7f31620a
refactor: correct variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-31 10:29:15 +01:00
Mateusz Jablonski
d56abe6f60
refactor: correct variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-30 17:26:20 +01:00
Mateusz Jablonski
036d1e4814
refactor: correct variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-30 12:48:51 +01:00
Michal Mrozek
80e3632c41
performance: do not free deferred allocations when not needed
...
Level Zero doesn't use deferred allocations so no point in paying the
price to check for them in cleanAllocationList.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2023-10-27 18:02:43 +02:00
Dominik Dabek
39cf653959
performance(ocl): cmd buffer prealloc per cmdqueue
...
Add mechanism to preallocate cmd buffer allocations in command stream
receiver reusable allocations list per command queue initialized.
This should limit additional allocations during hot loop.
Needs to be enabled in subsequent commits by setting product helper
method.
Related-To: NEO-8152
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-10-27 16:56:29 +02:00
Mateusz Hoppe
40392b33f4
refactor: cleanup - remove not used code from Program
...
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-10-26 15:21:17 +02:00
Zbigniew Zdanowicz
8a07380548
refactor: remove redundant method
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-10-25 18:02:16 +02:00
Jablonski, Mateusz
8b682a6bcb
fix: fix compilation error in clang on Windows (3/n)
...
Related-To: NEO-9248
Signed-off-by: Jablonski, Mateusz <mateusz.jablonski@intel.com>
2023-10-25 17:38:56 +02:00
Andrzej Koska
c4a24d8f31
fix: Avoiding races when modifying queue and Tag data
...
Related-To: NEO-8196
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2023-10-25 14:20:57 +02:00
Mateusz Jablonski
7e7eea9995
fix: calculate event timestamps based on submit time and host time diff
...
Related-To: HSD-18034426016, HSD-18034426678
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-25 12:24:40 +02:00
Lukasz Jobczyk
2fdfc64053
refactor: add const to one time init variables
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-10-25 06:49:56 +02:00
Jablonski, Mateusz
ac5f64f5c6
fix: fix compilation error in clang on Windows (2/n)
...
Signed-off-by: Jablonski, Mateusz <mateusz.jablonski@intel.com>
2023-10-24 15:59:06 +02:00
Mateusz Jablonski
6d2d16d68e
fix: avoid overflow of gpu time stamp in ns
...
Related-To: NEO-8394
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-24 14:56:15 +02:00
Mateusz Jablonski
ad3aeb6eea
fix: use submit timestamp in kernel start time calculation
...
Related-To: NEO-8394
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-20 12:07:44 +02:00
Mateusz Jablonski
4dfa12c8eb
fix: add mechanism to detect gpu timestamp overflows
...
unify naming CpuGpu to GpuCpu
Related-To: NEO-8394
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-19 16:31:06 +02:00
Kacper Nowak
26666d8c6f
fix: Use getGpuAddressToPatch() of pattern allocation
...
When using FillBuffer builtin, use GPU address to patch (i.e. gpuAddress
plus allocationOffset minus gpuBaseAddress) instead of gpuAddress.
This change fixes a bug - possible abort due to huge difference between
gpuAddress() and gpuAddressToPatch() values.
Related-To: NEO-7973
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-10-19 16:23:55 +02:00
Lukasz Jobczyk
750b5ba89a
fix: flush necessary caches when dispatch pipe control
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-10-19 15:09:31 +02:00
Mateusz Hoppe
5a4fa180d6
feature: control bindless compilation mode based on release
...
- check releaseHelper support when selecting bindless mode, if not
disabled, prefer bindless mode in L0 API
- bindless mode can be forced with DebugVariable: UseBindlessMode
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-10-19 10:11:11 +02:00
Mateusz Jablonski
c68f7c8fe2
fix: setup both cpu & gpu timestamp when setting timestamp in event
...
Related-To: NEO-8394
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-17 08:51:05 +02:00
Mateusz Jablonski
a3faf5dacc
feature: Add support for ARL-S
...
Related-To: NEO-8187
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-13 10:28:21 +02:00
Mateusz Jablonski
c5c2e6808e
fix: propagate error from getting CL_MEM_ALLOCATION_HANDLE_INTEL info
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-12 10:04:01 +02:00
Milczarek, Slawomir
e3260de8ca
fix: clCloneKernel to set kernel arg value with buffer mem object
...
Related-To: NEO-8927
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2023-10-11 12:55:26 +02:00
Mateusz Jablonski
61f0c79c24
fix: handle failure on getting CL_MEM_ALLOCATION_HANDLE_INTEL info
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-11 10:33:37 +02:00
Venevtsev, Igor
50b2262339
fix: stop direct submission blocking on platform destruction
...
Resolves: GSD-6341
Signed-off-by: Venevtsev, Igor <igor.venevtsev@intel.com>
2023-10-06 17:03:38 +02:00
Milczarek, Slawomir
e4c162c572
fix: Re-generate metada on every program build
...
Related-To: NEO-8927
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2023-10-05 18:47:03 +02:00
Mateusz Jablonski
72182c28e9
refactor: add NonCopyableOrMovableClass to classes that don't need copy ctors
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-05 13:11:01 +02:00
Mrozek, Michal
6f58ec9d2c
fix: set walker params prior to adjusting interface descriptor
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2023-10-04 16:37:54 +02:00
Mateusz Jablonski
fc508212de
refactor: pass big parameters as reference instead of by value
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-04 14:53:13 +02:00
Mateusz Jablonski
6b3ad5b3e3
fix: add unrecoverable in case of nullptr
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-04 14:38:05 +02:00
Mateusz Jablonski
895a586083
refactor: use reference in for-range loops
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-03 18:56:53 +02:00
Mateusz Jablonski
110164a52a
fix: remove invalid std::forward
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-03 16:41:41 +02:00
Mateusz Jablonski
7d811384d7
fix: avoid double free of mcsAllocation in case multi root device mem obj
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-03 13:46:28 +02:00
Naklicki, Mateusz
0461af492d
fix: unify path for getting number of grfs per thread
...
Related-To: NEO-8043
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2023-10-03 08:17:46 +02:00
Dunajski, Bartosz
e2387a450b
fix: check for new bcs submission before releasing queue resources
...
Related-To: NEO-9005
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-29 12:05:05 +02:00
Mateusz Jablonski
a033df33ff
fix: remove preferSmallWorkgroupSizeForKernel method
...
Related-To: HSD-18033866078
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-29 11:55:09 +02:00
Mateusz Jablonski
3a21b3b228
refactor: remove not needed code
...
Related-To: NEO-7527
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-28 07:52:39 +02:00
Mateusz Jablonski
03874b8815
refactor: remove not needed code
...
Related-To: NEO-7527
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-27 17:45:54 +02:00
Dunajski, Bartosz
42481de5d0
refactor: limit lock scope to avoid potential deadlock
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-25 10:21:20 +02:00
Kacper Nowak
0f161c1f19
fix(ocl): Add missing tracing mechanism for clCreateSubDevices API call
...
Related-To: NEO-8898
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-09-25 10:09:47 +02:00
Dunajski, Bartosz
80d0c74605
fix: track registered CSR clients 2
...
Related-To: NEO-8884
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-25 09:35:55 +02:00
Compute-Runtime-Validation
ade538ce54
Revert "fix: track registered CSR clients"
...
This reverts commit 53f635e392
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-24 10:07:26 +02:00
Dunajski, Bartosz
480c058cb2
feature: in-order patching for ComputeWalker
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-22 15:00:44 +02:00
Konstanty Misiak
a1a8d8fded
fix(wa): Override AuxilarySurfaceMode when required
...
Related-To: NEO-8012
Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>
2023-09-22 13:41:34 +02:00
Dunajski, Bartosz
53f635e392
fix: track registered CSR clients
...
In L0 its not possible to track objects relations. For example CmdList
may be removed before Event.
In such case, Event needs to safely skip unregister call, without
accessing CmdList/CmdQueue object.
Related-To: NEO-8884
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-22 09:42:55 +02:00
Maciej Plewka
8658fdb04e
fix: Use stack vec for api specific prefix
...
Related-To: NEO-8388, GSD-6296
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-09-21 16:10:54 +02:00
Fabian Zwolinski
e96dd344c3
feature: add new environment variables for compiler cache on Windows
...
Changes:
- replaced registry keys with environment variables
for cl_cache in OCL
- added compiler cache helpers
- implemented support for new env vars on Windows
- added tests
New env vars mechanism works as follows:
If `PERSISTENT_CACHE` is set,
driver checks if `NEO_CACHE_DIR` is set.
If `NEO_CACHE_DIR` is not set,
driver uses `%LocalAppData%\NEO\neo_compiler_cache`
as `cl_cache` destination folder.
If `NEO_CACHE_DIR` is not set and `%LocalAppData%`
path could not be obtained,
compiler cache is disabled.
In the current Windows implementation,
special characters in the folder path are not supported.
Related-To: NEO-8092
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-09-21 12:40:48 +02:00
Dominik Dabek
0936fca8cd
fix(ocl): set split barrier required correctly
...
Fix bug introduced in neo 27314 - splitBarrierRequired was set for all
commands, should be only for cl_command_barrier.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-09-21 11:32:50 +02:00
Maciej Plewka
49cc570e59
fix: move adjust depth to image hw
...
Related-To: NEO-8390, HSD-16021488507
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-09-20 15:54:45 +02:00
Mateusz Jablonski
8da3879944
refactor: remove not needed code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-19 14:40:24 +02:00
Dominik Dabek
1b7e178b25
performance(ocl): program barrier pc in taskStream
...
Program barrier to task stream, before next enqueue kernel.
This will reduce the number of batch buffer starts for sequences of
enqueue, barrier, enqueue, ... .
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-09-19 11:48:02 +02:00
Dominik Dabek
ee767822b8
performance(ocl): add bcsEngineCount to cmdq
...
Related-To: NEO-8155
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-09-15 15:49:56 +02:00
Maciej Plewka
44b3f18567
refactor: Use release helper for adjusting depth
...
Related-To: NEO-8295, HSD-14019991753
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-09-15 13:24:37 +02:00
Baj, Tomasz
e10f39017d
fix: Add ImageInfo to createGraphicsAllocation on Linux
...
Related-To: NEO-6757
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-09-14 12:58:59 +02:00
Mrozek, Michal
9bb64d0b15
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2023-09-13 11:43:09 +02:00
Mrozek, Michal
d9f938f3db
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2023-09-12 14:25:04 +02:00
Mrozek, Michal
15a0bf9d7b
refactor: remove not needed code.
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2023-09-12 13:52:47 +02:00
Dunajski, Bartosz
6648065703
feature: add indirect semaphore mode
...
Related-To: NEO-8242
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-12 13:15:51 +02:00
Dunajski, Bartosz
def3f2e9ad
refactor: improve semaphore programming
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-12 11:24:11 +02:00
Filip Hazubski
d7db6ac467
feature: Add preferredPlatformName field to RuntimeCapabilityTable
...
For all of the devices, preferredPlatformName is initialized with
nullptr by default and platform name will be initialized to driver's default
platform name, at the moment this is "Intel(R) OpenCL Graphics".
When Platform is initialized and preferredPlatformName is not nullptr then
Platform name will be set to the value stored in preferredPlatformName.
Add ENABLE_LEGACY_PLATFORM_NAME AIL enum related to added functionality.
Move PlatformInfo to NEO namespace.
Related-To: HSD-22018809561
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2023-09-12 11:07:14 +02:00
Mrozek, Michal
ae4757711e
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2023-09-12 09:39:01 +02:00
Mrozek, Michal
cac547946a
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2023-09-12 08:51:58 +02:00
Compute-Runtime-Validation
b5e9c10f64
Revert "performance(ocl): program barrier pc in taskStream"
...
This reverts commit 839c2d6737
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-12 01:32:28 +02:00
Mrozek, Michal
0a28d622fa
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2023-09-11 17:42:16 +02:00
Dominik Dabek
839c2d6737
performance(ocl): program barrier pc in taskStream
...
Program barrier immediately to task stream.
This will reduce the number of batch buffer starts.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-09-11 13:23:26 +02:00
Lukasz Jobczyk
a38ac3557b
performance: Disable blit enqueue on xe_lpg
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-09-11 13:10:23 +02:00
Compute-Runtime-Validation
7b5ef1b2f4
Revert "performance: Disable blit enqueue on xe_lpg"
...
This reverts commit ac97b1c493
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-11 10:10:21 +02:00
Michal Mrozek
65188fc90b
refactor: remove not needed code
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2023-09-08 17:22:18 +02:00
Kacper Nowak
66c030b5a0
fix: Remove unneeded cast from clCloneKernel API call
...
Related-To: NEO-8330
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-09-08 14:21:12 +02:00
Lukasz Jobczyk
ac97b1c493
performance: Disable blit enqueue on xe_lpg
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-09-08 14:04:24 +02:00
Maciej Plewka
09c1d474c9
fix: adjust depth limitations for images
...
Related-To: NEO-8239, HSD-14019991752
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-09-08 13:05:49 +02:00
Dunajski, Bartosz
269adbe43b
refactor: add L0 dispatch params debug logs
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-08 05:35:32 +02:00
Dunajski, Bartosz
8f6183e758
fix: check queue post post completion resources when releasing Event
...
Related-To: NEO-8316
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-04 08:23:56 +02:00
Kacper Nowak
43321dd463
fix: Remove unneeded cast from clLinkProgram API call
...
Related-To: NEO-8330
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-09-01 15:36:51 +02:00
Kacper Kasper
269cba1659
feature: enable cl_cache by default on Linux
...
Resolves: NEO-4262
Signed-off-by: Kacper Kasper <kacper.k.kasper@intel.com>
2023-09-01 15:13:13 +02:00
Compute-Runtime-Validation
377d886c77
Revert "feature: enable cl_cache by default on Linux"
...
This reverts commit eeb8a5e1fb
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-01 04:58:52 +02:00
Kacper Kasper
eeb8a5e1fb
feature: enable cl_cache by default on Linux
...
Resolves: NEO-4262
Signed-off-by: Kacper Kasper <kacper.k.kasper@intel.com>
2023-08-31 15:52:11 +02:00
Kacper Kasper
ba4867c3d0
fix: compiler cache file extension and path creation
...
* When env variables did not have trailing slash, directory creation
could fail.
* If .cache in $HOME did not exist cache would have been disabled.
Signed-off-by: Kacper Kasper <kacper.k.kasper@intel.com>
2023-08-31 13:44:19 +02:00
Dunajski, Bartosz
f1fc273cfc
fix: check for new submission before releasing TSP nodes in wait call.
...
Related-To: NEO-8318
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-08-31 13:19:19 +02:00
Artur Harasimiuk
b578e4ad57
refactor: cmake cleanup
...
Related-To: NEO-8256, NEO-8257
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2023-08-28 11:40:48 +02:00
Compute-Runtime-Validation
5114988f34
Revert "refactor: cmake cleanup"
...
This reverts commit 9da3960123
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-08-28 03:05:49 +02:00
Artur Harasimiuk
9da3960123
refactor: cmake cleanup
...
Related-To: NEO-8256, NEO-8257
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2023-08-25 18:07:48 +02:00
Mateusz Jablonski
cac91dab20
fix: Move PVC device ids to common place
...
Related-To: NEO-8187
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-08-25 13:09:19 +02:00
Wang, Elaine
c4728b312c
feature: Add ARGB support for openCL VA surface
...
Support VA surface format VA_FOURCC_ARGB
Related-To: HSD-22018705184
Signed-off-by: Wang, Elaine <elaine.wang@intel.com>
2023-08-24 08:37:28 +02:00
Dominik Dabek
5c5c718af3
performance: detect indirect access in kernel, PVC
...
Enabling on pvc after patch in igc.
Enabling only for JIT kernels because AOT could have been compiled with
IGC older than required.
Related-To: NEO-7712
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-08-24 02:15:11 +02:00