Michal Mrozek
d52ca080bd
Revert "performance: improve pool handling"
...
This reverts commit a3c3b6533a.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-08-06 13:04:02 +02:00
Kamil Kopryk
2a9bcdeb83
refactor: pass outImplicitArgs to patchImplicitArgs function
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-08-05 17:31:47 +02:00
Fabian Zwoliński
674c4a15ad
fix: use correct gpu address when bindless heaps helper is enabled
...
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-05 15:09:57 +02:00
Bartosz Dunajski
ec34656e0e
fix: debug flag to defer first device submission
...
Related-To: HSD-18039343751
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-02 09:31:25 +02:00
Mateusz Hoppe
0196a0f72f
refactor(ocl): internal linker version script with OpenCL versions
...
- use the same map as in:
https://github.com/KhronosGroup/OpenCL-ICD-Loader/blob/main/loader/linux/icd_exports.map
this allows skipping the loader and linking directly with libigdrcl.so
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-31 20:18:05 +02:00
Michal Mrozek
47009cec90
refactor: remove not needed code
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-31 17:07:56 +02:00
Zbigniew Zdanowicz
b33fe6ccf1
feature: adding flag to block dispatch implicit scaling commands
...
- this feature is part of making compute walker command view
- compute walker is programmed for implicit scaling but not dispatched
- together with new flag, comes the refactor to reduce number of arguments
Related-To: NEO-11972
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-07-31 14:24:27 +02:00
Fabian Zwoliński
b1a50104a8
fix: include dynamic SLM in clGetKernelWorkGroupInfo and zeKernelGetProperties
...
The current implementation only takes static slmInlineSize into account.
With this change we also include dynamic SLM passed as kernel arguments.
Related-To: NEO-5761
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-07-30 17:59:45 +02:00
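A minimal sketch of the accounting described in the commit above, assuming the reported local memory size is simply the static inline SLM plus the sum of the dynamic SLM kernel arguments. The struct and function names are hypothetical and are not taken from the NEO sources; only `slmInlineSize` appears in the commit message.

```cpp
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical view of a kernel's SLM usage; not the NEO data structure.
struct KernelSlmInfo {
    std::size_t slmInlineSize;                   // static SLM declared in the kernel
    std::vector<std::size_t> dynamicSlmArgSizes; // sizes set for local-memory kernel arguments
};

// Before the fix only slmInlineSize would be reported; here dynamic SLM is added on top.
std::size_t reportedLocalMemSize(const KernelSlmInfo &info) {
    return std::accumulate(info.dynamicSlmArgSizes.begin(),
                           info.dynamicSlmArgSizes.end(),
                           info.slmInlineSize);
}
```

With a static size of 1024 bytes and two dynamic arguments of 256 and 512 bytes, this sketch reports 1792 bytes rather than 1024.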
Michal Mrozek
e668b4965c
performance: demote unrecoverable to debug_break
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-30 16:51:00 +02:00
Kamil Kopryk
65fcbff55c
refactor: Simplify code
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-07-29 14:26:51 +02:00
Compute-Runtime-Validation
b1bc4f4cad
Revert "fix: Add missing fp64 extensions in caps initialization"
...
This reverts commit 9a486dd5a1.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-26 14:47:02 +02:00
Szymon Morek
ace883ca55
performance: don't flush gpgpu if not required
...
Related-To: NEO-12124
If queue is OOQ and there are no cross-engine dependencies,
don't flush CCS before submitting copy on BCS.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-26 06:49:45 +02:00
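The condition in the commit above can be sketched as a small predicate. This is an illustration under stated assumptions, not the driver's code: the struct, its fields, and the function name are hypothetical, and the real decision may depend on more state than the two conditions named in the commit message.

```cpp
// Hypothetical description of a copy about to be submitted on BCS.
struct BcsCopySubmission {
    bool outOfOrderQueue;            // queue was created as OOQ
    bool hasCrossEngineDependencies; // copy depends on work submitted to CCS
};

// The CCS flush is skipped only when the queue is OOQ and nothing
// on CCS has to complete before the copy runs on BCS.
bool needsCcsFlushBeforeBcsCopy(const BcsCopySubmission &submission) {
    const bool canSkip = submission.outOfOrderQueue && !submission.hasCrossEngineDependencies;
    return !canSkip;
}
```

In this sketch an in-order queue always keeps the flush, which is an assumption: the commit message only describes the OOQ case.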
Maciej Plewka
1cd00b5b89
fix: use per product cache line size to align heaps
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-07-24 17:29:20 +02:00
Maciej Plewka
afee8814ef
refactor: get ioh alignment from static function
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-07-24 14:43:31 +02:00
Szymon Morek
a7fbc90ebd
fix: re-enable staging buffer copy when ccs is busy
...
Related-To: NEO-11501
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-22 18:14:46 +02:00
Szymon Morek
39ec7facee
performance: use BCS for transfers if CCS is busy
...
Related-To: NEO-11501
Also, if device is iGPU, don't use staging buffers
in that case.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-22 15:36:26 +02:00
Szymon Morek
6a11e8a077
fix: revert changes around zero-copy
...
Related-To: NEO-12018
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-19 12:29:18 +02:00
Mateusz Hoppe
b6299b8a21
feature: add support for HP copy engine context
...
- add support for context group with HP copy engine
- choose HP copy engine when available
Related-To: NEO-11983
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-19 12:23:03 +02:00
Filip Hazubski
f8867ac3ac
fix: Minor code improvements
...
Add explicit pointer checks to CommandQueue::blitEnqueueAllowed.
Explicitly check result of getDeviceArgValueIdx during ocloc compile.
Explicitly remove unused StagingBufferManager functions.
Move chunkCopyFunc by reference.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-07-19 09:24:37 +02:00
Compute-Runtime-Validation
7ad15639fc
Revert "feature: add support for HP copy engine context"
...
This reverts commit 3fbcbcaef2.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-18 21:02:14 +02:00
Szymon Morek
33ab962121
fix: adjust compression hint usage for ocl buffers
...
Related-To: NEO-11989
Also, use zero-copy on lnl
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-18 18:24:48 +02:00
Michal Mrozek
20d6910b66
performance: move usm pool init to first alloc call
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-18 16:07:22 +02:00
Mateusz Hoppe
3fbcbcaef2
feature: add support for HP copy engine context
...
- add support for context group with HP copy engine
- choose HP copy engine when available
Related-To: NEO-11983
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-18 16:07:07 +02:00
Dominik Dabek
c1c9ac634b
performance(ocl): enable host usm alloc recycle
...
Enable at threshold of 2% system memory.
Related-To: NEO-6893
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-07-17 19:33:56 +02:00
Mateusz Jablonski
8a60742a8d
fix: correct reported num subslices per slice in fused config
...
Related-To: NEO-12073
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-17 17:17:28 +02:00
Dominik Dabek
fc9de71feb
fix(ocl): finish in release ogl object if needed
...
Finish cache flushes before exiting api call if releasing displayable
ogl object or dcflush is mitigated.
Related-To: NEO-11694
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-07-16 11:21:32 +02:00
Compute-Runtime-Validation
9a6403f3bc
Revert "refactor: Add dc flush mitigation infrastructure"
...
This reverts commit d6076941a8.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-15 11:47:30 +02:00
Lukasz Jobczyk
d6076941a8
refactor: Add dc flush mitigation infrastructure
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-12 14:45:51 +02:00
Bartosz Dunajski
e188de2489
fix: initialize page tables before access for TSP allocation in TBX mode
...
Related-To: NEO-8340
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-11 16:03:19 +02:00
Lukasz Jobczyk
b0a5f2cced
fix: Stop direct submission before signal GL event
...
Related-To: NEO-10556
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-11 13:45:42 +02:00
Dominik Dabek
bdeccab7aa
fix: bcs enqueue after marker properly waits
...
For an example sequence of:
IOQ_1 -> enqueue copy, enqueue marker with waitlist (out event)
IOQ_2 -> enqueue marker with waitlist (event), enqueue copy
Add missing synchronization between the enqueue copies
Related-To: NEO-11694
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-07-11 10:36:18 +02:00
Michal Mrozek
05eb4e7a0d
performance: add debug flag to disable l1 flush
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-11 10:09:46 +02:00
Szymon Morek
dbd96372be
performance: adjust staging buffer usage
...
Related-To: NEO-11928
Don't copy through a staging buffer if the dst usm allocation
was not used before and the transfer would be split.
Also, don't use staging buffers for mapped ocl buffers.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-10 10:19:18 +02:00
Michal Mrozek
4cabc9e4d2
performance: remove not needed code.
...
events are already created with queued state.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-10 07:35:05 +02:00
Bartosz Dunajski
90d1a210d8
refactor: remove not needed check
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-09 16:11:33 +02:00
Mateusz Jablonski
778645c11e
feature: add initial support for Xe2 platforms
...
Related-To: NEO-8188, NEO-10774
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-09 10:30:27 +02:00
Szymon Morek
7d25965a78
performance: change buffer type for new coherency model
...
Related-To: NEO-11882
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-09 09:33:53 +02:00
Lukasz Jobczyk
bbeb5224d8
fix: Add IOH alignment to estimation
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-08 12:29:54 +02:00
Mateusz Jablonski
4f4b8fed15
refactor: remove not needed code related to deprecated device ip version
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-05 09:54:48 +02:00
Compute-Runtime-Validation
38872b7e1b
Revert "refactor: Add dc flush mitigation infrastructure"
...
This reverts commit 1cba900ad9.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-04 08:20:18 +02:00
Lukasz Jobczyk
a96f2ea13a
performance: disable blit enqueue on LNL
...
Resolves: NEO-11471
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-04 05:41:31 +02:00
Szymon Morek
daeb6e897a
performance: limit barrier usage in non-usm copies
...
Related-To: NEO-11501
Don't emit barrier if non-usm copy through staging
buffers with OOQ and single transfer was issued.
No need to send barrier if there was single transfer -
there's nothing to aggregate
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-03 13:45:52 +02:00
Compute-Runtime-Validation
b8246d806f
Revert "performance: change buffer type for new coherency model"
...
This reverts commit c979495265.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-03 02:11:19 +02:00
Morek, Szymon
aed96cede4
fix: flush barrier when profiling enabled
...
Related-To: NEO-10615
Signed-off-by: Morek, Szymon <szymon.morek@intel.com>
2024-07-02 13:25:07 +02:00
Szymon Morek
c979495265
performance: change buffer type for new coherency model
...
Related-To: NEO-11882
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-02 12:49:41 +02:00
Szymon Morek
53f9a48fce
fix: set start TS on cpu if TS packet write disabled
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-01 17:14:10 +02:00
Oskar Hubert Weber
9a486dd5a1
fix: Add missing fp64 extensions in caps initialization
...
In a scenario where fp64 emulation is both supported and enabled,
we want to include 2 additional extensions:
- cl_khr_fp64 to deviceExtensions
- __opencl_c_fp64 to openclCFeatures (OpenCL ver. 3.0)
Related-To: NEO-7611
Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2024-07-01 13:09:13 +02:00
Dominik Dabek
76e8be5c39
fix(ocl): handle gl sharing displayable textures
...
Displayable textures always need dc flush.
Related-To: NEO-11694
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-06-27 13:43:20 +02:00
Kamil Kopryk
70e52ce4d8
refactor: add separate enablers for compute walker and gpgpu walker
...
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-27 10:27:48 +02:00
Szymon Morek
8ee92b840f
fix: don't set start timestamp on cpu for bcs
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-06-26 13:30:58 +02:00
Kamil Kopryk
fbc29bb43d
refactor: use indirectDataAlignment from gen cmds
...
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-25 21:02:36 +02:00
Michal Mrozek
a3c3b6533a
performance: improve pool handling
...
Related-To: NEO-11731
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-06-25 17:04:17 +02:00
Kamil Kopryk
2ec2f8c085
refactor: add heaplessStateInitEnabled bool to dispatch walker args
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-25 15:53:19 +02:00
Lukasz Jobczyk
1cba900ad9
refactor: Add dc flush mitigation infrastructure
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-25 14:53:24 +02:00
Artur Harasimiuk
bbe11205e3
build: add cmake flags to control platform support
...
Related-To: NEO-6683
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-06-25 09:08:03 +02:00
Szymon Morek
29e3eb512c
performance: non-usm copy through staging buffers
...
Related-To: NEO-11501
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-06-25 07:18:53 +02:00
Lukasz Jobczyk
fed90f5c8e
fix: Add infrastructure to force dc flush when mitigate dc
...
-force dc on next tag update after RT kernel
-force dc when release shared object
Related-To: NEO-10556
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-24 17:15:03 +02:00
Lukasz Jobczyk
773da10099
fix: Copy hostptr using BCS when mitigate dc flush
...
Related-To: NEO-10556
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-24 16:26:44 +02:00
Compute-Runtime-Validation
7136dfbd38
Revert "performance: improve pool handling"
...
This reverts commit 5f0b9efd2b.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-06-24 12:47:19 +02:00
Filip Hazubski
9513df52a6
refactor: Explicitly delete unused functions of BarrierCommand
...
Comply with the rule of five.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-06-24 11:18:17 +02:00
Kamil Kopryk
0c019c9f2b
performance: don't check multiple times if gpgpu is initialized
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-24 10:12:46 +02:00
Filip Hazubski
2f6eaf149a
fix: Update SIP kernel initialization logic
...
Initialize SIP kernel when shared device is being initialized
instead of api-specific device.
Initialize debugger when shared device is being initialized
instead of during platform or driver initialization.
Add missing makeResident calls for SIP kernel in heapless paths.
Related-To: HSD-18038645398, HSD-18038819112
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-06-20 18:30:46 +02:00
Kamil Kopryk
78c7d8878d
refactor: change method to get indirect data address
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-20 14:24:51 +02:00
Lukasz Jobczyk
ce0047de39
fix: Use proper GA object to check shared handle in debug break
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-19 11:08:37 +02:00
Jaroslaw Warchulski
daec249430
refactor: remove not needed createGraphicsAllocationFromNTHandle method
...
Related-To: NEO-11498
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-06-18 14:25:00 +02:00
Dominik Dabek
b6d86d2648
refactor: tests for buffer pool
...
add support for future AIL
Related-To: NEO-11694
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-06-18 13:55:21 +02:00
Mrozek, Michal
5f0b9efd2b
performance: improve pool handling
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
Resolves: NEO-11731
2024-06-14 12:02:34 +02:00
Alicja Lukaszewicz
730fed3206
fix: add missing tracing to all functions in api.cpp
...
Related-To: NEO-11576
Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com>
2024-06-13 14:55:13 +02:00
Jaroslaw Warchulski
76a05c1cab
refactor: pass arrayIndex to Wddm::openNTHandle function
...
Related-To: NEO-11498
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-06-12 12:11:35 +02:00
Filip Hazubski
5ae2709e6e
fix: Remove allow-zebin and enable-zebin compile options
...
Zebin is enabled by default.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-06-10 14:08:40 +02:00
Bartosz Dunajski
da9c009b88
feature: assign unique interrupt to queue
...
Related-To: NEO-8179
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-07 10:06:31 +02:00
Jaroslaw Warchulski
0ac1be7669
refactor: pass arrayIndex to Wddm::openSharedHandle function
...
Related-To: NEO-11498
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-06-07 09:51:26 +02:00
Alicja Lukaszewicz
b4c31492d3
fix: add missing tracing to all functions in api.cpp
...
Related-To: NEO-11576
Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com>
2024-06-06 12:14:21 +02:00
Bartosz Dunajski
e88127dcc3
refactor: remove not used multi context per engine code
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-06 11:41:15 +02:00
Bartosz Dunajski
bc37e14e87
refactor: fallback paths for secondary context debug flags
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-05 12:02:16 +02:00
Bartosz Dunajski
88ed909e57
feature: assign secondary BCS engines at queue creation
...
Related-To: NEO-7824
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-03 09:41:43 +02:00
Mateusz Hoppe
8e79e7a174
refactor: fix gpu address formatting in logs
...
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-05-31 13:55:26 +02:00
Szymon Morek
5e92d530de
performance: Reuse GPU timestamps by default on Windows
...
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-28 12:25:30 +02:00
Szymon Morek
387ec34207
fix: correct timestamps if cpu diff below resolution
...
Related-To: NEO-10615
Currently, if the cpu diff equals zero, the gpu timestamp
is decreased relative to the submit timestamp.
This commit fixes that so the timestamps are aligned.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-28 11:12:37 +02:00
Compute-Runtime-Validation
0b2c9e92e7
Revert "performance: Reuse GPU timestamps by default on Windows"
...
This reverts commit bca3fecaa0.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-25 07:59:00 +02:00
Szymon Morek
bca3fecaa0
performance: Reuse GPU timestamps by default on Windows
...
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-24 20:11:45 +02:00
Bartosz Dunajski
0a74d718f2
refactor: allow using different engine types for secondary contexts
...
Related-To: NEO-7824
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-24 09:39:46 +02:00
Fabian Zwoliński
59829aeb11
fix(ocl): Force rebuild if binary contains bindless kernel
...
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-05-20 14:15:17 +02:00
Maciej Plewka
90df4b298b
fix: cache flush dependency for queue blocked
...
Related-to: NEO-9872, HSD-18038461954
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-20 11:55:24 +02:00
Szymon Morek
8989bf4078
fix: set timestamp diff correctly if lower than resolution
...
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-17 20:15:58 +02:00
Dominik Dabek
a236171f0d
performance(ocl): enable device usm alloc reuse
...
Enabling on MTL+
Limited to use max 2% of global device memory.
Related-To: NEO-6893, NEO-11463
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-17 13:32:45 +02:00
Compute-Runtime-Validation
4c5d567b40
Revert "fix: remove not needed check in OCL CPU copy path 2"
...
This reverts commit 0a27aa9f7b.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-17 10:52:17 +02:00
Compute-Runtime-Validation
34f53d5d94
Revert "performance: Reuse GPU timestamps by default"
...
This reverts commit 7aceed58ca.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-16 21:52:10 +02:00
Bartosz Dunajski
0a27aa9f7b
fix: remove not needed check in OCL CPU copy path 2
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-16 12:38:39 +02:00
Szymon Morek
7aceed58ca
performance: Reuse GPU timestamps by default
...
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-15 17:51:42 +02:00
Dominik Dabek
b4d839fe29
performance(usm): l0, add usm host memory pooling
...
Disabled by default.
Related-To: NEO-11356
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 15:20:51 +02:00
Dominik Dabek
ae8c7589dc
refactor: move implicit arg has indirect access
...
Move implicit arg has indirect access boolean to kernelAttributes
Related-To: NEO-11396
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 13:11:04 +02:00
Dominik Dabek
fd47030ad6
fix: use igc indirect detection v3
...
Update to use igc indirect detection v3. Fix for not detecting indirects
passed as implicit arguments.
Related-To: NEO-11396
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 06:35:42 +02:00
Compute-Runtime-Validation
4c43ca518b
Revert "fix: remove not needed check in OCL CPU copy path"
...
This reverts commit 92f9c8be96.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-14 20:12:28 +02:00
Bartosz Dunajski
92f9c8be96
fix: remove not needed check in OCL CPU copy path
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-14 17:05:55 +02:00
Szymon Morek
e35b951a00
performance: Allow indirect allocs as pack on OpenCL
...
Related-To: NEO-11228
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-10 17:57:42 +02:00
Maciej Plewka
e39893485c
fix: add cache flush as dependency for bcs ccs synchronization
...
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-09 13:43:39 +02:00
Szymon Morek
10ed479b16
performance: share inter-module ISA allocations
...
Related-To: NEO-10258
Currently each module has its own GA
for kernel ISAs. This change allows new modules to
reuse an existing allocation.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-09 08:43:55 +02:00
Aleksander Czerwionka
4667f55ea2
fix: add missing checks to return proper OpenCL error from API
...
changes affect clEnqueueNDCountKernelINTEL and
clGetKernelMaxConcurrentWorkGroupCountINTEL
Related-To: NEO-9688
Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-05-07 20:32:36 +02:00
Weronika Kapusta
d6c16c1640
fix: remove compiler cache legacy implementation
...
Related-To: NEO-10679
Signed-off-by: Weronika Kapusta <weronika.kapusta@intel.com>
2024-05-06 18:28:35 +02:00
Szymon Morek
83e8ae4a20
performance: Reuse GPU timestamp instead of KMD escape
...
This is enabled only when the related
debug flag is set.
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-06 14:46:30 +02:00
Compute-Runtime-Validation
8342c0ae2f
Revert "fix: add cache flush as dependency for bcs ccs synchronization"
...
This reverts commit 5e57bb2a32.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-01 03:05:47 +02:00
Mrozek, Michal
3ded04341c
refactor: remove not needed code.
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-04-30 16:24:21 +02:00
Mrozek, Michal
03dcff1398
refactor: remove not needed code.
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-04-30 15:10:34 +02:00
Fabian Zwoliński
ee71157f7f
fix: opencl support for bindless kernels
...
Related-To: NEO-11156
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-04-30 12:02:17 +02:00
Maciej Plewka
5e57bb2a32
fix: add cache flush as dependency for bcs ccs synchronization
...
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-04-30 11:57:49 +02:00
Artur Harasimiuk
cbc868c4ed
refactor: add mock_aubstream
...
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-04-25 20:08:30 +02:00
Lukasz Jobczyk
d122e1c906
fix: Flush render state cache on release shared object
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-04-22 17:37:42 +02:00
Andrzej Koska
ae139aeffd
refactor: Passing information about the engine
...
Extension of the interface with information
about the engine type passed to the function
Related-To: NEO-10678
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-04-22 13:33:56 +02:00
Milczarek, Slawomir
a2e0f9ac09
fix: Add AUB/TBX writable for clEnqueueWriteBuffer on SVM hostptr
...
Related-To: NEO-11082
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2024-04-18 12:17:10 +02:00
Jaroslaw Warchulski
ea057abd36
fix: allow empty set to be passed to clSetKernelExecInfo
...
Related-To: GSD-8538
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-04-16 14:11:04 +02:00
Katarzyna Cencelewska
ce3bb1327e
feature: add method to switch enable/disable mid thread preemption
...
Resolves: NEO-8089
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-04-15 16:26:52 +02:00
Kamil Kopryk
4bb30e51a2
fix: use default csr for scratch in OCL
...
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-04-11 16:58:35 +02:00
Kamil Kopryk
cfc3b6a8c2
refactor: add heaplessModeEnabled argument to isHeaplessStateInitEnabled
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-04-11 10:08:10 +02:00
Mateusz Jablonski
cb2b572e94
feature: add support for null aub mode
...
In this mode AUB csr will be created, however, no aub file will be created
Related-To: NEO-11097
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-09 16:59:42 +02:00
Compute-Runtime-Validation
9568ee47e7
Revert "fix: remove compiler cache legacy implementation"
...
This reverts commit 864f42116c.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-04-09 02:06:51 +02:00
Weronika Kapusta
864f42116c
fix: remove compiler cache legacy implementation
...
Related-To: NEO-10679
Signed-off-by: Kapusta, Weronika <weronika.kapusta@intel.com>
2024-04-08 16:40:57 +02:00
Dominik Dabek
0d69dfe165
refactor: remove comment
...
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-04-08 11:43:19 +02:00
Kacper Nowak
35ca2bbea9
refactor: refactor logic for querying atomic FP capabilities
...
- Move logic from product helper to compiler product helper
- Add method for adjusting fp16 and extra capabilities using release
helper (if present).
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-03-28 15:27:45 +01:00
Mateusz Jablonski
bcaa62bed3
build: refactor linux kernel headers
...
Add cmake flags to provide paths to the headers for drm / i915 / i915 prelim / xe
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-28 14:48:09 +01:00
Lukasz Jobczyk
436579ca8c
fix: Disable async release of buffers with external host ptr
...
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-27 15:07:14 +01:00
Compute-Runtime-Validation
707796a50d
Revert "fix: Disable async release of buffers with external host ptr"
...
This reverts commit 9b715c3ffc.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-27 07:49:54 +01:00
Bartosz Dunajski
fce016ee7a
refactor: add new semaphore param
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-26 15:46:47 +01:00
Mateusz Jablonski
78a4a92b44
refactor: reorder members to reduce internal padding in structs
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-25 15:50:00 +01:00
Lukasz Jobczyk
9b715c3ffc
fix: Disable async release of buffers with external host ptr
...
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-25 12:55:57 +01:00
Filip Hazubski
07b9117b44
refactor: Explicitly delete copy c-tor and copy assignment of GlBuffer
...
Comply with the rule of three.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-03-25 10:55:02 +01:00
Compute-Runtime-Validation
e9b866ab11
Revert "fix: Disable async release of buffers with external host ptr"
...
This reverts commit 70a62d89e3.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-23 11:49:42 +01:00
Lukasz Jobczyk
70a62d89e3
fix: Disable async release of buffers with external host ptr
...
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-22 18:11:27 +01:00
Katarzyna Cencelewska
da7b03dd15
fix: to always use grfs count in calculateNumThreadsPerThreadGroup
...
grf size != grf count
Related-To: GSD-8437
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-03-22 11:03:18 +01:00
Katarzyna Cencelewska
dd1d52259e
refactor: add param rootDeviceEnvironment to calculateNumThreadsPerThreadGroup
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-03-21 22:25:14 +01:00
Mateusz Hoppe
0cec5ccc32
fix: check if any of devices has BindlessHeapHelper enabled
...
- report Bindless_images extension in driver only when at least one
device has global bindless heaps enabled
Related-To: NEO-10352
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-20 20:50:58 +01:00
Bartlomiej Wolny
7db5f6d5bf
fix: add texture cache flush on image dtor
...
Related-To: NEO-9794
Signed-off-by: Bartlomiej Wolny <bartlomiej.wolny@intel.com>
2024-03-18 17:03:17 +01:00
Fabian Zwolinski
eb259c1f52
fix: set kernel LWS prior to obtaining indirect heaps
...
When calculating the size of the indirect object heap,
the local work group size from kernel implicit args is taken into account.
If the LWS is not set before this calculation,
it can lead to insufficient ioh allocation size.
Such a problem is seen when local ids are generated by the runtime
and then written to ioh. The write fails due to lack of space in the allocation.
Related-To: IGC-7708
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-03-14 10:38:46 +01:00
Ben Ashbaugh
e00230a6c2
fix: correctly report support for SPIR-V 1.0 through 1.3
...
Related-To: NEO-10336
Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2024-03-12 17:54:09 +01:00
Grzegorz Wawiorko
87f2357868
fix: Update CL_DEVICE_LATEST_CONFORMANCE_VERSION_PASSED value
...
Signed-off-by: Grzegorz Wawiorko <grzegorz.wawiorko@intel.com>
2024-03-12 08:42:21 +01:00
Mrozek, Michal
f71f6d2b72
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 18:18:55 +01:00
Mateusz Hoppe
5e15543b14
test: improve unit tests - use virtual file system
...
- use virtualFileSystem in ULTs: writeDataToFile() does not create a file
but instead saves the filename. A call to fileExists() checks whether the
file was previously written
Related-To: NEO-7006
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-08 13:34:19 +01:00
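The behaviour described in the commit above can be sketched as follows. This is a simplified stand-in, not the NEO test helper: the class name and signatures are hypothetical, and only the names `writeDataToFile` and `fileExists` come from the commit message.

```cpp
#include <cstddef>
#include <set>
#include <string>

// Hypothetical test-only file layer that never touches the disk.
class VirtualFileSystem {
  public:
    // Records the filename instead of creating a file; the data is ignored.
    // Returns the number of bytes that would have been written.
    std::size_t writeDataToFile(const std::string &filename, const void *data, std::size_t size) {
        (void)data;
        writtenFiles.insert(filename);
        return size;
    }

    // True only for names previously passed to writeDataToFile().
    bool fileExists(const std::string &filename) const {
        return writtenFiles.count(filename) != 0;
    }

  private:
    std::set<std::string> writtenFiles;
};
```

A unit test can then assert that some code path "wrote" a file by checking `fileExists()` afterwards, without leaving anything on disk for later tests to see.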
Kamil Kopryk
168445784e
feature: introduce states programming at driver init heapless ocl
...
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-03-08 12:29:44 +01:00
Lukasz Jobczyk
88a8bc8ac6
fix: Download BCS allocations when timestamps are completed
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-08 10:47:53 +01:00
Maciej Plewka
f0281202bf
feature: Signal to OGL creating/destroying shared buffer
...
Related-To: NEO-9151
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-03-06 16:28:28 +01:00
Lukasz Jobczyk
895c951a7f
fix: Unlock CmdQ before download allocations
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-05 14:53:11 +01:00
Mateusz Jablonski
bfbe5a048c
fix: verify retVal from poll function in gl sharing sync on Linux
...
unify logic of synchronization objects
add default initialization of gl context guard members
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-05 11:26:27 +01:00
Lukasz Jobczyk
246c24f78f
fix: Download allocations when timestamps are complete
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-04 16:05:31 +01:00
Mateusz Jablonski
8c55dd91a6
fix: don't use poll/close syscalls in gl sharing tests
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-04 14:09:46 +01:00
Fabian Zwolinski
52430762ac
fix: cl_cache L0 env vars + refactor code structure
...
Added support for new Compiler Cache
environment variables in Level Zero.
Moved
`opencl/source/compiler_interface/default_cache_config.cpp`
`level_zero/core/source/compiler_interface/default_cache_config.cpp`
to shared directory
`source/compiler_interface/default_cache_config.cpp`
Switched enabling cache by default from per OS to per API.
Changed default state of cl_cache in Level Zero to disabled.
Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-03-01 17:35:08 +01:00
Sylvain Munaut
e2c511bc00
feature(ocl): Improve Linux CL/GL sharing support
...
This commit is aimed at drastically improving the support for the CL/GL
sharing extension on Linux. The current support is not really usable as it
only supports a few texture formats, and only on EGL contexts. It is also
pretty buggy since it requires the texture to be bound when placing the CL
call to share it, which is just plain wrong and will not work in many
applications.
This new version makes use of the "official" interop extension from MESA,
which is available for GLX and EGL contexts, allows sharing of buffers and
not just textures, and supports many more formats.
This is still far from being a fully compliant / full featured version of
the extension, but it's a big step forward in my opinion and allows running
some real applications.
I've tested gr-fosphor (SDR spectrum display) and Davinci Resolve as
examples.
Neither of these works without these improvements.
Fixes: https://github.com/intel/compute-runtime/issues/659
Fixes: https://github.com/intel/compute-runtime/issues/667
https://github.com/intel/compute-runtime/pull/673
Related-To: NEO-3599
Signed-off-by: Sylvain Munaut <tnt@246tNt.com>
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-01 13:38:56 +01:00
Zbigniew Zdanowicz
4baee71974
refactor: add out cmd argument to semaphore wait encoder
...
Related-To: NEO-10065
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-01 11:43:36 +01:00
Mateusz Jablonski
373ef608bc
refactor: unify calculating max group size between OCL and L0
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-27 13:32:10 +01:00
Zbigniew Zdanowicz
0bf8e8727e
refactor: add output buffer argument to store register to memory encoder
...
Related-To: NEO-10064
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-02-27 13:31:55 +01:00
Lukasz Jobczyk
cfd3edfb2c
fix: Align IOH entry
...
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-26 14:36:31 +01:00