Commit Graph

2502 Commits

Author SHA1 Message Date
Michal Mrozek d52ca080bd Revert "performance: improve pool handling"
This reverts commit a3c3b6533a.

Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-08-06 13:04:02 +02:00
Kamil Kopryk 2a9bcdeb83 refactor: pass outImplicitArgs to patchImplicitArgs function
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-08-05 17:31:47 +02:00
Fabian Zwoliński 674c4a15ad fix: use correct gpu address when bindless heaps helper is enabled
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-05 15:09:57 +02:00
Bartosz Dunajski ec34656e0e fix: debug flag to defer first device submission
Related-To: HSD-18039343751

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-02 09:31:25 +02:00
Mateusz Hoppe 0196a0f72f refactor(ocl): internal linker version script with OpenCL versions
- use the same map as in:
https://github.com/KhronosGroup/OpenCL-ICD-Loader/blob/main/
loader/linux/icd_exports.map

this allows to skip loader and link directly with libigdrcl.so

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-31 20:18:05 +02:00
Michal Mrozek 47009cec90 refactor: remove not needed code
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-31 17:07:56 +02:00
Zbigniew Zdanowicz b33fe6ccf1 feature: adding flag to block dispatch implicit scaling commands
- this feature is part of making compute walker command view
- compute walker is programed for implicit scaling but not dispatched
- together with new flag, comes the refactor to reduce number of arguments

Related-To: NEO-11972

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-07-31 14:24:27 +02:00
Fabian Zwoliński b1a50104a8 fix: include dynamic SLM in clGetKernelWorkGroupInfo and zeKernelGetProperties
Current implementation only takes static slmInlineSize into account.
With this change we also include dynamic SLM passed as a kernel arguments.

Related-To: NEO-5761
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-07-30 17:59:45 +02:00
Michal Mrozek e668b4965c performance: demote unrecoverable to debug_break
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-30 16:51:00 +02:00
Kamil Kopryk 65fcbff55c refactor: Simplify code
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-07-29 14:26:51 +02:00
Compute-Runtime-Validation b1bc4f4cad Revert "fix: Add missing fp64 extensions in caps initialization"
This reverts commit 9a486dd5a1.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-26 14:47:02 +02:00
Szymon Morek ace883ca55 performance: don't flush gpgpu if not required
Related-To: NEO-12124

If queue is OOQ and there are no cross-engine dependencies,
don't flush CCS before submitting copy on BCS.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-26 06:49:45 +02:00
Maciej Plewka 1cd00b5b89 fix: use per product cache line size to align heaps
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-07-24 17:29:20 +02:00
Maciej Plewka afee8814ef refactor: get ioh alignment from static function
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-07-24 14:43:31 +02:00
Szymon Morek a7fbc90ebd fix: re-enable staging buffer copy when ccs is busy
Related-To: NEO-11501

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-22 18:14:46 +02:00
Szymon Morek 39ec7facee performance: use BCS for transfers if CCS is busy
Related-To: NEO-11501

Also, if device is iGPU, don't use staging buffers
in that case.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-22 15:36:26 +02:00
Szymon Morek 6a11e8a077 fix: revert changes around zero-copy
Related-To: NEO-12018

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-19 12:29:18 +02:00
Mateusz Hoppe b6299b8a21 feature: add support for HP copy engine context
- add support for contect group with HP copy engine
- choose HP copy engine when available

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-19 12:23:03 +02:00
Filip Hazubski f8867ac3ac fix: Minor code improvements
Add explicit pointer checks to CommandQueue::blitEnqueueAllowed.

Explicitly check result of getDeviceArgValueIdx during ocloc compile.

Explicitly remove unused StagingBufferManager functions.
Move chunkCopyFunc by reference.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-07-19 09:24:37 +02:00
Compute-Runtime-Validation 7ad15639fc Revert "feature: add support for HP copy engine context"
This reverts commit 3fbcbcaef2.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-18 21:02:14 +02:00
Szymon Morek 33ab962121 fix: adjust compression hint usage for ocl buffers
Related-To: NEO-11989

Also, use zero-copy on lnl

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-18 18:24:48 +02:00
Michal Mrozek 20d6910b66 performance: move usm pool init to first alloc call
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-18 16:07:22 +02:00
Mateusz Hoppe 3fbcbcaef2 feature: add support for HP copy engine context
- add support for contect group with HP copy engine
- choose HP copy engine when available

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-18 16:07:07 +02:00
Dominik Dabek c1c9ac634b performance(ocl): enable host usm alloc recycle
Enable at threshold of 2% system memory.

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-07-17 19:33:56 +02:00
Mateusz Jablonski 8a60742a8d fix: correct reported num subslices per slice in fused config
Related-To: NEO-12073
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-17 17:17:28 +02:00
Dominik Dabek fc9de71feb fix(ocl): finish in release ogl object if needed
Finish cache flushes before exiting api call if releasing displayable
ogl object or dcflush is mitigated.

Related-To: NEO-11694

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-07-16 11:21:32 +02:00
Compute-Runtime-Validation 9a6403f3bc Revert "refactor: Add dc flush mitigation infrastructure"
This reverts commit d6076941a8.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-15 11:47:30 +02:00
Lukasz Jobczyk d6076941a8 refactor: Add dc flush mitigation infrastructure
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-12 14:45:51 +02:00
Bartosz Dunajski e188de2489 fix: initialize page tables before access for TSP allocation in TBX mode
Related-To: NEO-8340

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-11 16:03:19 +02:00
Lukasz Jobczyk b0a5f2cced fix: Stop direct submission before signal GL event
Related-To: NEO-10556

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-11 13:45:42 +02:00
Dominik Dabek bdeccab7aa fix: bcs enqueue after marker properly waits
For an example sequence of:
IOQ_1 -> enqueue copy, enqueue marker with waitlist (out event)
IOQ_2 -> enqueue marker with waitlist (event), enqueue copy

Add missing synchronization between the enqueue copies

Related-To: NEO-11694

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-07-11 10:36:18 +02:00
Michal Mrozek 05eb4e7a0d performance: add debug flag to disable l1 flush
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-11 10:09:46 +02:00
Szymon Morek dbd96372be performance: adjust staging buffer usage
Related-To: NEO-11928

Don't copy through staging buffer if dst usm allocation
was not used before and transfer would be splitted.
Also, don't use staging buffers for mapped ocl buffers.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-10 10:19:18 +02:00
Michal Mrozek 4cabc9e4d2 performance: remove not needed code.
events are already created with queued state.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-10 07:35:05 +02:00
Bartosz Dunajski 90d1a210d8 refactor: remove not needed check
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-09 16:11:33 +02:00
Mateusz Jablonski 778645c11e feature: add initial support for Xe2 platforms
Related-To: NEO-8188, NEO-10774
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-09 10:30:27 +02:00
Szymon Morek 7d25965a78 performance: change buffer type for new coherency model
Related-To: NEO-11882

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-09 09:33:53 +02:00
Lukasz Jobczyk bbeb5224d8 fix: Add IOH alignment to estimation
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-08 12:29:54 +02:00
Mateusz Jablonski 4f4b8fed15 refactor: remove not needed code related to deprecated device ip version
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-05 09:54:48 +02:00
Compute-Runtime-Validation 38872b7e1b Revert "refactor: Add dc flush mitigation infrastructure"
This reverts commit 1cba900ad9.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-04 08:20:18 +02:00
Lukasz Jobczyk a96f2ea13a performance: disable blit enqueue on LNL
Resolves: NEO-11471

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-04 05:41:31 +02:00
Szymon Morek daeb6e897a performance: limit barrier usage in non-usm copies
Related-To: NEO-11501

Don't emit barrier if non-usm copy through staging
buffers with OOQ and single transfer was issued.
No need to send barrier if there was single transfer -
there's nothing to aggregate

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-03 13:45:52 +02:00
Compute-Runtime-Validation b8246d806f Revert "performance: change buffer type for new coherency model"
This reverts commit c979495265.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-03 02:11:19 +02:00
Morek, Szymon aed96cede4 fix: flush barrier when profiling enabled
Related-To: NEO-10615

Signed-off-by: Morek, Szymon <szymon.morek@intel.com>
2024-07-02 13:25:07 +02:00
Szymon Morek c979495265 performance: change buffer type for new coherency model
Related-To: NEO-11882

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-02 12:49:41 +02:00
Szymon Morek 53f9a48fce fix: set start TS on cpu if TS packet write disabled
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-01 17:14:10 +02:00
Oskar Hubert Weber 9a486dd5a1 fix: Add missing fp64 extensions in caps initialization
In a scenario where fp64 emulation is both supported and enabled,
we want to include 2 additional extensions:
- cl_khr_fp64 to deviceExtensions
- __opencl_c_fp64 to openclCFeatures (OpenCL ver. 3.0)

Related-To: NEO-7611

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2024-07-01 13:09:13 +02:00
Dominik Dabek 76e8be5c39 fix(ocl): handle gl sharing displayable textures
Displayable textures always need dc flush.

Related-To: NEO-11694

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-06-27 13:43:20 +02:00
Kamil Kopryk 70e52ce4d8 refactor: add seperate enablers for compute walker and gpgpu walker
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-27 10:27:48 +02:00
Szymon Morek 8ee92b840f fix: don't set start timestamp on cpu for bcs
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-06-26 13:30:58 +02:00
Kamil Kopryk fbc29bb43d refactor: use indirectDataAlignment from gen cmds
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-25 21:02:36 +02:00
Michal Mrozek a3c3b6533a performance: improve pool handling
Related-To: NEO-11731
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-06-25 17:04:17 +02:00
Kamil Kopryk 2ec2f8c085 refactor: add heaplessStateInitEnabled bool to dispatch walker args
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-25 15:53:19 +02:00
Lukasz Jobczyk 1cba900ad9 refactor: Add dc flush mitigation infrastructure
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-25 14:53:24 +02:00
Artur Harasimiuk bbe11205e3 build: add cmake flags to control platform support
Related-To: NEO-6683
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-06-25 09:08:03 +02:00
Szymon Morek 29e3eb512c performance: non-usm copy through staging buffers
Related-To: NEO-11501

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-06-25 07:18:53 +02:00
Lukasz Jobczyk fed90f5c8e fix: Add infrastructure to force dc flush when mitigate dc
-force dc on next tag update after RT kernel
-force dc when release shared object

Related-To: NEO-10556

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-24 17:15:03 +02:00
Lukasz Jobczyk 773da10099 fix: Copy hostptr using BCS when mitigate dc flush
Related-To: NEO-10556

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-24 16:26:44 +02:00
Compute-Runtime-Validation 7136dfbd38 Revert "performance: improve pool handling"
This reverts commit 5f0b9efd2b.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-06-24 12:47:19 +02:00
Filip Hazubski 9513df52a6 refactor: Explicitly delete unused functions of BarrierCommand
Comply with the rule of five.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-06-24 11:18:17 +02:00
Kamil Kopryk 0c019c9f2b performance: don't check multiple times if gpgpu is initialized
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-24 10:12:46 +02:00
Filip Hazubski 2f6eaf149a fix: Update SIP kernel initialization logic
Initialize SIP kernel when shared device is being initialized
instead of api-specific device.

Initialize debugger when shared device is being initialized
instead of during platform or driver initialization.

Add missing makeResident calls for SIP kernel in heapless paths.

Related-To: HSD-18038645398, HSD-18038819112

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-06-20 18:30:46 +02:00
Kamil Kopryk 78c7d8878d refactor: change method to get indirect data address
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-20 14:24:51 +02:00
Lukasz Jobczyk ce0047de39 fix: Use proper GA object to check shared handle in debug break
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-19 11:08:37 +02:00
Jaroslaw Warchulski daec249430 refactor: remove not needed createGraphicsAllocationFromNTHandle method
Related-To: NEO-11498
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-06-18 14:25:00 +02:00
Dominik Dabek b6d86d2648 refactor: tests for buffer pool
add support for future AIL

Related-To: NEO-11694

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-06-18 13:55:21 +02:00
Mrozek, Michal 5f0b9efd2b performance: improve pool handling
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
Resolves: NEO-11731
2024-06-14 12:02:34 +02:00
Alicja Lukaszewicz 730fed3206 fix: add missing tracing to all functions in api.cpp
Related-To: NEO-11576

Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com>
2024-06-13 14:55:13 +02:00
Jaroslaw Warchulski 76a05c1cab refactor: pass arrayIndex to Wddm::openNTHandle function
Related-To: NEO-11498
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-06-12 12:11:35 +02:00
Filip Hazubski 5ae2709e6e fix: Remove allow-zebin and enable-zebin compile options
Zebin is enabled by default.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-06-10 14:08:40 +02:00
Bartosz Dunajski da9c009b88 feature: assign unique interrupt to queue
Related-To: NEO-8179

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-07 10:06:31 +02:00
Jaroslaw Warchulski 0ac1be7669 refactor: pass arrayIndex to Wddm::openSharedHandle function
Related-To: NEO-11498
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-06-07 09:51:26 +02:00
Alicja Lukaszewicz b4c31492d3 fix: add missing tracing to all functions in api.cpp
Related-To: NEO-11576

Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com>
2024-06-06 12:14:21 +02:00
Bartosz Dunajski e88127dcc3 refactor: remove not used multi context per engine code
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-06 11:41:15 +02:00
Bartosz Dunajski bc37e14e87 refactor: fallback paths for secondary context debug flags
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-05 12:02:16 +02:00
Bartosz Dunajski 88ed909e57 feature: assign secondary BCS engines at queue creation
Related-To: NEO-7824

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-03 09:41:43 +02:00
Mateusz Hoppe 8e79e7a174 refactor: fix gpu address formatting in logs
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-05-31 13:55:26 +02:00
Szymon Morek 5e92d530de performance: Reuse GPU timestamps by default on Windows
Related-To: NEO-10615

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-28 12:25:30 +02:00
Szymon Morek 387ec34207 fix: correct timestamps if cpu diff below resolution
Related-To: NEO-10615

Currently if cpu diff is equals to zero, gpu timestamp
will be decreased relatively to submit timestamp.
This commit fixes that so timestamps will be aligned.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-28 11:12:37 +02:00
Compute-Runtime-Validation 0b2c9e92e7 Revert "performance: Reuse GPU timestamps by default on Windows"
This reverts commit bca3fecaa0.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-25 07:59:00 +02:00
Szymon Morek bca3fecaa0 performance: Reuse GPU timestamps by default on Windows
Related-To: NEO-10615

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-24 20:11:45 +02:00
Bartosz Dunajski 0a74d718f2 refactor: allow using different engine types for secondary contexts
Related-To: NEO-7824

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-24 09:39:46 +02:00
Fabian Zwoliński 59829aeb11 fix(ocl): Force rebuild if binary contains bindless kernel
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-05-20 14:15:17 +02:00
Maciej Plewka 90df4b298b fix: cache flush dependency for queue blocked
Related-to: NEO-9872, HSD-18038461954
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-20 11:55:24 +02:00
Szymon Morek 8989bf4078 fix: set timestamp diff correctly if lower than resolution
Related-To: NEO-10615

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-17 20:15:58 +02:00
Dominik Dabek a236171f0d performance(ocl): enable device usm alloc reuse
Enabling on MTL+
Limited to use max 2% of global device memory.

Related-To: NEO-6893, NEO-11463

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-17 13:32:45 +02:00
Compute-Runtime-Validation 4c5d567b40 Revert "fix: remove not needed check in OCL CPU copy path 2"
This reverts commit 0a27aa9f7b.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-17 10:52:17 +02:00
Compute-Runtime-Validation 34f53d5d94 Revert "performance: Reuse GPU timestamps by default"
This reverts commit 7aceed58ca.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-16 21:52:10 +02:00
Bartosz Dunajski 0a27aa9f7b fix: remove not needed check in OCL CPU copy path 2
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-16 12:38:39 +02:00
Szymon Morek 7aceed58ca performance: Reuse GPU timestamps by default
Related-To: NEO-10615

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-15 17:51:42 +02:00
Dominik Dabek b4d839fe29 performance(usm): l0, add usm host memory pooling
Disabled by default.

Related-To: NEO-11356

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 15:20:51 +02:00
Dominik Dabek ae8c7589dc refactor: move implicit arg has indirect access
Move implicit arg has indirect access boolean to kernelAttributes

Related-To: NEO-11396

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 13:11:04 +02:00
Dominik Dabek fd47030ad6 fix: use igc indirect detection v3
Update to use igc indirect detection v3. Fix for not detecting indirects
passed as implicit arguments.

Related-To: NEO-11396

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 06:35:42 +02:00
Compute-Runtime-Validation 4c43ca518b Revert "fix: remove not needed check in OCL CPU copy path"
This reverts commit 92f9c8be96.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-14 20:12:28 +02:00
Bartosz Dunajski 92f9c8be96 fix: remove not needed check in OCL CPU copy path
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-14 17:05:55 +02:00
Szymon Morek e35b951a00 performance: Allow indirect allocs as pack on OpenCL
Related-To: NEO-11228

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-10 17:57:42 +02:00
Maciej Plewka e39893485c fix: add cache flush as dependency for bcs ccs synchronization
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-09 13:43:39 +02:00
Szymon Morek 10ed479b16 performance: share inter-module ISA allocations
Related-To: NEO-10258

Currently each module has it's own GA
for kernel ISA's. This change allows new modules to
reuse existing allocation.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-09 08:43:55 +02:00
Aleksander Czerwionka 4667f55ea2 fix: add missing checks to return proper OpenCL error from API
changes affect clEnqueueNDCountKernelINTEL and
clGetKernelMaxConcurrentWorkGroupCountINTEL

Related-To: NEO-9688

Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-05-07 20:32:36 +02:00
Weronika Kapusta d6c16c1640 fix: remove compiler cache legacy implementation
Related-To: NEO-10679

Signed-off-by: Weronika Kapusta <weronika.kapusta@intel.com>
2024-05-06 18:28:35 +02:00
Szymon Morek 83e8ae4a20 performance: Reuse GPU timestamp instead of KMD escape
This can be enabled only if related
debug flag will be set.

Related-To: NEO-10615

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-06 14:46:30 +02:00
Compute-Runtime-Validation 8342c0ae2f Revert "fix: add cache flush as dependency for bcs ccs synchronization"
This reverts commit 5e57bb2a32.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-01 03:05:47 +02:00
Mrozek, Michal 3ded04341c refactor: remove not needed code.
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-04-30 16:24:21 +02:00
Mrozek, Michal 03dcff1398 refactor: remove not needed code.
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-04-30 15:10:34 +02:00
Fabian Zwoliński ee71157f7f fix: opencl support for bindless kernels
Related-To: NEO-11156
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-04-30 12:02:17 +02:00
Maciej Plewka 5e57bb2a32 fix: add cache flush as dependency for bcs ccs synchronization
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-04-30 11:57:49 +02:00
Artur Harasimiuk cbc868c4ed refactor: add mock_aubstream
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-04-25 20:08:30 +02:00
Lukasz Jobczyk d122e1c906 fix: Flush render state cache on release shared object
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-04-22 17:37:42 +02:00
Andrzej Koska ae139aeffd refactor: Passing information about the engine
Extension of the interface with information
about the engine type passed to the function

Related-To: NEO-10678

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-04-22 13:33:56 +02:00
Milczarek, Slawomir a2e0f9ac09 fix: Add AUB/TBX writable for clEnqueueWriteBuffer on SVM hostptr
Related-To: NEO-11082

Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2024-04-18 12:17:10 +02:00
Jaroslaw Warchulski ea057abd36 fix: allow empty set to be passed to clSetKernelExecInfo
Related-To: GSD-8538
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-04-16 14:11:04 +02:00
Katarzyna Cencelewska ce3bb1327e feature: add method to switch enable/disable mid thread preemption
Resolves: NEO-8089

Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-04-15 16:26:52 +02:00
Kamil Kopryk 4bb30e51a2 fix: use default csr for scratch in OCL
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-04-11 16:58:35 +02:00
Kamil Kopryk cfc3b6a8c2 refactor: add heaplessModeEnabled argument to isHeaplessStateInitEnabled
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-04-11 10:08:10 +02:00
Mateusz Jablonski cb2b572e94 feature: add support for null aub mode
In this mode AUB csr will be created, however, no aub file will be created

Related-To: NEO-11097
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-09 16:59:42 +02:00
Compute-Runtime-Validation 9568ee47e7 Revert "fix: remove compiler cache legacy implementation"
This reverts commit 864f42116c.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-04-09 02:06:51 +02:00
Weronika Kapusta 864f42116c fix: remove compiler cache legacy implementation
Related-To: NEO-10679
Signed-off-by: Kapusta, Weronika <weronika.kapusta@intel.com>
2024-04-08 16:40:57 +02:00
Dominik Dabek 0d69dfe165 refactor: remove comment
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-04-08 11:43:19 +02:00
Kacper Nowak 35ca2bbea9 refactor: refactor logic for querying atomic FP capabilities
- Move logic from product helper to compiler product helper
- Add method for adjusting fp16 and extra capabilities using release
helper (if present).


Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-03-28 15:27:45 +01:00
Mateusz Jablonski bcaa62bed3 build: refactor linux kernel headers
Add cmake flags to provide path to headers to drm / i915 /i915 prelim / xe

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-28 14:48:09 +01:00
Lukasz Jobczyk 436579ca8c fix: Disable async release of buffers with external host ptr
Related-To: NEO-10036

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-27 15:07:14 +01:00
Compute-Runtime-Validation 707796a50d Revert "fix: Disable async release of buffers with external host ptr"
This reverts commit 9b715c3ffc.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-27 07:49:54 +01:00
Bartosz Dunajski fce016ee7a refactor: add new semaphore param
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-26 15:46:47 +01:00
Mateusz Jablonski 78a4a92b44 refactor: reorder members to reduce internal padding in structs
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-25 15:50:00 +01:00
Lukasz Jobczyk 9b715c3ffc fix: Disable async release of buffers with external host ptr
Related-To: NEO-10036

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-25 12:55:57 +01:00
Filip Hazubski 07b9117b44 refactor: Explicitly delete copy c-tor and copy assignment of GlBuffer
Comply with the rule of three.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-03-25 10:55:02 +01:00
Compute-Runtime-Validation e9b866ab11 Revert "fix: Disable async release of buffers with external host ptr"
This reverts commit 70a62d89e3.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-23 11:49:42 +01:00
Lukasz Jobczyk 70a62d89e3 fix: Disable async release of buffers with external host ptr
Related-To: NEO-10036

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-22 18:11:27 +01:00
Katarzyna Cencelewska da7b03dd15 fix: to always use grfs count in calculateNumThreadsPerThreadGroup
grf size != grf count

Related-To: GSD-8437
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-03-22 11:03:18 +01:00
Katarzyna Cencelewska dd1d52259e refactor: add param rootDeviceEnvironment to calculateNumThreadsPerThreadGroup
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-03-21 22:25:14 +01:00
Mateusz Hoppe 0cec5ccc32 fix: check if any of devices has BindlessHeapHelper enabled
- report Bindless_images extension in driver only when at least one
device has global bindless heaps enabled

Related-To: NEO-10352

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-20 20:50:58 +01:00
Bartlomiej Wolny 7db5f6d5bf fix: add texture cache flush on image dtor
Related-To: NEO-9794
Signed-off-by: Bartlomiej Wolny <bartlomiej.wolny@intel.com>
2024-03-18 17:03:17 +01:00
Fabian Zwolinski eb259c1f52 fix: set kernel LWS prior to obtaining indirect heaps
When calculating the size of the indirect object heap,
the local work group size from kernel implicit args is taken into account.
If the LWS is not set before this calculation,
it can lead to insufficient ioh allocation size.
Such a problem is seen when local ids are generated by the runtime
and then written to ioh. The write fails due to lack of space in the allocation.

Related-To: IGC-7708
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-03-14 10:38:46 +01:00
Ben Ashbaugh e00230a6c2 fix: correctly report support for SPIR-V 1.0 through 1.3
Related-To: NEO-10336

Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2024-03-12 17:54:09 +01:00
Grzegorz Wawiorko 87f2357868 fix: Update CL_DEVICE_LATEST_CONFORMANCE_VERSION_PASSED value
Signed-off-by: Grzegorz Wawiorko <grzegorz.wawiorko@intel.com>
2024-03-12 08:42:21 +01:00
Mrozek, Michal f71f6d2b72 refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 18:18:55 +01:00
Mateusz Hoppe 5e15543b14 test: improve unit tests - use virtual file system
- use virtualFileSystem in ULTs, writeDataToFile() does not create file
but instead saves filename. Call to fileExists() checks if file was
previously written

Related-To: NEO-7006

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-08 13:34:19 +01:00
Kamil Kopryk 168445784e feature: introduce states programming at driver init heapless ocl
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-03-08 12:29:44 +01:00
Lukasz Jobczyk 88a8bc8ac6 fix: Download BCS allocations when timestamps are completed
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-08 10:47:53 +01:00
Maciej Plewka f0281202bf feature: Signal to OGL creating/destroying shared buffer
Related-To: NEO-9151
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-03-06 16:28:28 +01:00
Lukasz Jobczyk 895c951a7f fix: Unlock CmdQ before download allocations
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-05 14:53:11 +01:00
Mateusz Jablonski bfbe5a048c fix: verify retVal from poll function in gl sharing sync on Linux
unify logic of synchronization objects
add default iniialization of gl context guard members

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-05 11:26:27 +01:00
Lukasz Jobczyk 246c24f78f fix: Download allocations when timestamps are complete
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-04 16:05:31 +01:00
Mateusz Jablonski 8c55dd91a6 fix: don't use poll/close syscalls in gl sharing tests
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-04 14:09:46 +01:00
Fabian Zwolinski 52430762ac fix: cl_cache L0 env vars + refactor code structure
Added support for new Compiler Cache
environment variables in Level Zero.

Moved
`opencl/source/compiler_interface/default_cache_config.cpp`
`level_zero/core/source/compiler_interface/default_cache_config.cpp`
to shared directory
`source/compiler_interface/default_cache_config.cpp`

Switched enabling cache by default from per OS to per API.
Changed default state of cl_cache in Level Zero to disabled.

Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-03-01 17:35:08 +01:00
Sylvain Munaut e2c511bc00 feature(ocl): Improve Linux CL/GL sharing support
This commit is aimed at drastically improving the support for the CL/GL
sharing
extension on linux. The current support is not really usable as it only
supports a few texture format, and only on EGL contexts. It is also
pretty
buggy since it requires the texture to be bound when placing the CL call
to
share it which is just plain wrong and will not work in many
applications.
This new version makes used of the "official" interop extension from
MESA
which is available for GLX and EGL contexts, allows sharing of buffers
and
not just texture and supports many more formats.
This is still far from being a fully compliant / full featured version
of
the extension, but it's a big step forward in my opinion and allows to
run
some real applications.
I've tested gr-fosphor (SDR spectrum display) and Davinci Resolve as
examples.
Both of theses don't work without theses improvements.

Fixes: https://github.com/intel/compute-runtime/issues/659
Fixes: https://github.com/intel/compute-runtime/issues/667

https://github.com/intel/compute-runtime/pull/673

Related-To: NEO-3599

Signed-off-by: Sylvain Munaut <tnt@246tNt.com>
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-01 13:38:56 +01:00
Zbigniew Zdanowicz 4baee71974 refactor: add out cmd argument to semaphore wait encoder
Related-To: NEO-10065

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-01 11:43:36 +01:00
Mateusz Jablonski 373ef608bc refactor: unify calculating max group size between OCL and L0
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-27 13:32:10 +01:00
Zbigniew Zdanowicz 0bf8e8727e refactor: add output buffer argument to store register to memory encoder
Related-To: NEO-10064

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-02-27 13:31:55 +01:00
Lukasz Jobczyk cfd3edfb2c fix: Align IOH entry
Related-To: NEO-10036

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-26 14:36:31 +01:00