Commit Graph

2502 Commits

Author SHA1 Message Date
Jaroslaw Warchulski 05250e7460 fix: Remove patchtokens fallback AIL
Related-To: NEO-12358
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-09-12 18:26:19 +02:00
Zbigniew Zdanowicz 8bb92ff445 performance: avoid reading from gfx memory when modifying walker command
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-09-11 18:58:33 +02:00
Lukasz Jobczyk 48dc0a823d fix: Check dst pitch in copy rect misaligned case
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-09-11 11:45:10 +02:00
Zbigniew Zdanowicz 7ce4a8adc2 performance: replace virtual calls with native class methods
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-09-11 11:10:40 +02:00
Dominik Dabek b2fc7345cf performance: redesign usm alloc reuse mechanism
Dedicated pools for different allocation size ranges.
Additional reused allocations will create their own pools.
Do not reuse allocations >256MB.

Related-To: NEO-6893, NEO-12299, NEO-12349

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-09-10 18:12:14 +02:00
Bartosz Dunajski 4f1262645b refactor: pass extra walker params
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-10 15:30:03 +02:00
Filip Hazubski beaddb55e0 fix: Minor code changes
Remove needless checks for module and cmdList.
Add explicit check for surfaceState in setArgImageWithMipLevel.
Explicitly initialize WddmResidencyController::csr.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-09-10 13:31:15 +02:00
Mateusz Jablonski 54bda0e986 fix: In Linux CL/GL sharing
- always issue flush request before export

Apparently it's expected to flush the object (which might convert them
from one format to another for export, or remove aux buffer uses or
anything not supported by export).

- use modifier to select tiling mode

Previously we just assumed that whatever tiling mode was picked by mesa
will match the one picked by GMMLIB but that's not always the case
and in particular on Arc and Xe it doesn't work ... Mesa picks Tile4
and GMMLIB picks Tile64 ...

Fixes: #761
Fixes: #736

Signed-off-by: Sylvain Munaut <tnt@246tNt.com>
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-06 21:40:18 +02:00
Mateusz Hoppe 9d5f8eb587 refactor: save hpCopyEngine in Device
- remove loop iteration to get hpCopyEngine

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-06 17:56:53 +02:00
Lukasz Jobczyk 03690e9b83 fix: Set special queue after its setup
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-09-06 12:05:24 +02:00
Kamil Kopryk 95b035a071 fix: global stateless heap creation in heapless path
Related-To: NEO-10681
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-09-05 16:56:00 +02:00
Andrzej Koska b0e7a11e9a refactor: Improving information transfer about the copy engine
Related-To: NEO-11934

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-09-05 16:11:52 +02:00
Compute-Runtime-Validation d842f65cf1 Revert "fix: submit dummy exec to pin memory during zeContextMakeMemoryReside...
This reverts commit f9b87d53e6.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-09-05 03:28:03 +02:00
Maciej Plewka f9b87d53e6 fix: submit dummy exec to pin memory during zeContextMakeMemoryResident call
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>

Related-To: NEO-11879
2024-09-04 14:07:29 +02:00
Mateusz Hoppe 37b7caa137 fix: correct program header generation for shared isa allocation
- when kernels share single allocation, LOAD address in program headers
should point to correct virtual address including kernel offset

Related-To: NEO-7788, GSD-9836

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-03 17:50:29 +02:00
Mateusz Jablonski 7adfa344c0 performance: get reference from getBindlessInfo
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-03 12:50:52 +02:00
Lukasz Jobczyk 8b760f8528 performance: Copy hostptr on cpu for host buffer when mitigate dc flush
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-30 19:55:45 +02:00
Bartosz Dunajski 847180dc9a refactor: add debug prints for timestamp waits
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-30 19:55:15 +02:00
Mateusz Jablonski c934877790 refactor: remove not needed function
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-30 12:18:14 +02:00
Dominik Dabek 4552960e1f performance(ocl): limit flushes on resolve via pc
When resolving dependencies via pipecontrol skip hdc and untyped data
port cache flushes. Leave only command streamer stall.

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-08-29 17:19:19 +02:00
Lukasz Jobczyk 7550e55d64 fix: Always copy hostptr on gpu when mitigate dc flush
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-29 14:53:57 +02:00
Lukasz Jobczyk 496012d82f performance: Use copy buffer rect middle builtin
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-28 15:32:00 +02:00
Lukasz Jobczyk 5aa5d40937 performance: Mitigate dc flush on LNL Windows
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-28 13:35:17 +02:00
Lukasz Jobczyk 0b848a5fdb fix: Don't use copy buffer rect middle builtin
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-27 12:01:58 +02:00
Fabian Zwoliński 90bc6c10ea fix(ocl): use correct surface state index for bindless offset in setArgImage
Related-To: NEO-11156
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-27 09:42:46 +02:00
Compute-Runtime-Validation ad0d6f5435 Revert "refactor: Add dc flush mitigation infrastructure"
This reverts commit e4412e385a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-27 02:35:06 +02:00
Dominik Dabek a47ca96a42 fix(ocl): allocate small buffer pool uncompressed
Related-To: HSD-15016054429

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-08-26 13:00:47 +02:00
Lukasz Jobczyk e4412e385a refactor: Add dc flush mitigation infrastructure
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-26 10:38:56 +02:00
Compute-Runtime-Validation 5dbbaa39b9 Revert "fix: ulls controller sleep, windows"
This reverts commit 6455d4648c.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-24 06:37:58 +02:00
Compute-Runtime-Validation 956dd8e17d Revert "fix: set properly resource params when setAllocationType"
This reverts commit 2e0884a301.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-24 05:01:38 +02:00
Katarzyna Cencelewska 2e0884a301 fix: set properly resource params when setAllocationType
gmm params: usage, cachable and resource info
should be set properly when overriding allocation type

Resolves: HSD-22020344331
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-08-23 16:57:23 +02:00
Lukasz Jobczyk 9152b6ac04 performance: Defer special queue init to first use
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-23 14:22:51 +02:00
Mateusz Hoppe c9e8931a18 fix: global bindless offsets for implicit args
- do not patch bindless offsets for implicit args with global bindless
allocator, the offsets are patched at kernel initialization time and
do not change

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-08-23 13:21:27 +02:00
Dominik Dabek 6455d4648c fix: ulls controller sleep, windows
Request higher resolution for windows periodic timers for ulls
controller sleep.

Allows for controller thread to sleep with granularity of 1ms.

Related-To: NEO-10800

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-08-23 12:05:26 +02:00
Bartosz Dunajski 696b02bfd3 fix: improve TBX downloading after L0 Event sync
Related-To: HSD-18038498579

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-23 10:42:17 +02:00
Compute-Runtime-Validation 4b01058706 Revert "performance: Defer special queue init to first use"
This reverts commit 25bb3c87ad.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-23 09:09:17 +02:00
Lukasz Jobczyk c1a5fb089b performance: Add copy buffer rect middle builtin
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-22 10:30:17 +02:00
Lukasz Jobczyk 25bb3c87ad performance: Defer special queue init to first use
Resolves: NEO-12332

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-22 10:07:44 +02:00
Compute-Runtime-Validation d8ea5516b2 Revert "performance: Add copy buffer rect middle builtin"
This reverts commit bbb44c7a4d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-22 04:41:42 +02:00
Mateusz Jablonski 7ac41615cd fix: create thread with function pointer
don't create async thread in neo shared tests

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-21 18:02:37 +02:00
Lukasz Jobczyk bbb44c7a4d performance: Add copy buffer rect middle builtin
Resolves: NEO-12132

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-08-21 14:41:44 +02:00
Mateusz Jablonski f617093a6a fix: add missing nullptr check before accessing ail helper
Fixes: #755
Fixes: #754
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-20 16:58:53 +02:00
Mateusz Jablonski 579af57161 refactor: don't call OsLibrary::load directly, use function pointer
this allows mocking this call in ULT

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-20 08:59:26 +02:00
Compute-Runtime-Validation 9b652f4a34 Revert "feature: Improving information transfer about the copy engine"
This reverts commit 17ffdff4f1.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-15 22:06:31 +02:00
Andrzej Koska 17ffdff4f1 feature: Improving information transfer about the copy engine
Related-To: NEO-11934

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-08-14 11:28:29 +02:00
Maciej Plewka 27488a8315 fix: Add option to use divergent paths for barriers
Related-To: NEO-12159, IGC-9846, HSD-16024235164, HSD-15014782386

Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-08-12 12:15:54 +02:00
Bartosz Dunajski d76ac1d1de fix: scratch controller residency
Related-To: HSD-18039519400

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-09 14:47:56 +02:00
Kamil Kopryk 775b14a7f6 fix: add ioh alignment in heapless
Related-To: NEO-11871

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-08-09 12:20:00 +02:00
Dominik Dabek ad229377b9 fix: disable indirect detection if any stack calls
Don't know if kernels will be initialized in the order needed to check
for indirect accesses in stack calls.

Remove now unused functionPointerWithIndirectAccessExists and reading
this value from zebin.

Related-To: NEO-12235

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-08-07 14:48:58 +02:00
Kamil Kopryk 38a194eee6 fix: scratch address from implicit args in ocl
Related-To: NEO-12237
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-08-07 09:40:27 +02:00
Michal Mrozek d52ca080bd Revert "performance: improve pool handling"
This reverts commit a3c3b6533a.

Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-08-06 13:04:02 +02:00
Kamil Kopryk 2a9bcdeb83 refactor: pass outImplicitArgs to patchImplicitArgs function
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-08-05 17:31:47 +02:00
Fabian Zwoliński 674c4a15ad fix: use correct gpu address when bindless heaps helper is enabled
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-05 15:09:57 +02:00
Bartosz Dunajski ec34656e0e fix: debug flag to defer first device submission
Related-To: HSD-18039343751

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-02 09:31:25 +02:00
Mateusz Hoppe 0196a0f72f refactor(ocl): internal linker version script with OpenCL versions
- use the same map as in:
https://github.com/KhronosGroup/OpenCL-ICD-Loader/blob/main/loader/linux/icd_exports.map

this allows skipping the loader and linking directly with libigdrcl.so

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-31 20:18:05 +02:00
Michal Mrozek 47009cec90 refactor: remove not needed code
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-31 17:07:56 +02:00
Zbigniew Zdanowicz b33fe6ccf1 feature: adding flag to block dispatch implicit scaling commands
- this feature is part of making compute walker command view
- compute walker is programmed for implicit scaling but not dispatched
- together with new flag, comes the refactor to reduce number of arguments

Related-To: NEO-11972

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-07-31 14:24:27 +02:00
Fabian Zwoliński b1a50104a8 fix: include dynamic SLM in clGetKernelWorkGroupInfo and zeKernelGetProperties
Current implementation only takes static slmInlineSize into account.
With this change we also include dynamic SLM passed as kernel arguments.

Related-To: NEO-5761
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-07-30 17:59:45 +02:00
Michal Mrozek e668b4965c performance: demote unrecoverable to debug_break
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-30 16:51:00 +02:00
Kamil Kopryk 65fcbff55c refactor: Simplify code
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-07-29 14:26:51 +02:00
Compute-Runtime-Validation b1bc4f4cad Revert "fix: Add missing fp64 extensions in caps initialization"
This reverts commit 9a486dd5a1.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-26 14:47:02 +02:00
Szymon Morek ace883ca55 performance: don't flush gpgpu if not required
Related-To: NEO-12124

If queue is OOQ and there are no cross-engine dependencies,
don't flush CCS before submitting copy on BCS.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-26 06:49:45 +02:00
Maciej Plewka 1cd00b5b89 fix: use per product cache line size to align heaps
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-07-24 17:29:20 +02:00
Maciej Plewka afee8814ef refactor: get ioh alignment from static function
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-07-24 14:43:31 +02:00
Szymon Morek a7fbc90ebd fix: re-enable staging buffer copy when ccs is busy
Related-To: NEO-11501

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-22 18:14:46 +02:00
Szymon Morek 39ec7facee performance: use BCS for transfers if CCS is busy
Related-To: NEO-11501

Also, if device is iGPU, don't use staging buffers
in that case.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-22 15:36:26 +02:00
Szymon Morek 6a11e8a077 fix: revert changes around zero-copy
Related-To: NEO-12018

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-19 12:29:18 +02:00
Mateusz Hoppe b6299b8a21 feature: add support for HP copy engine context
- add support for context group with HP copy engine
- choose HP copy engine when available

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-19 12:23:03 +02:00
Filip Hazubski f8867ac3ac fix: Minor code improvements
Add explicit pointer checks to CommandQueue::blitEnqueueAllowed.

Explicitly check result of getDeviceArgValueIdx during ocloc compile.

Explicitly remove unused StagingBufferManager functions.
Move chunkCopyFunc by reference.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-07-19 09:24:37 +02:00
Compute-Runtime-Validation 7ad15639fc Revert "feature: add support for HP copy engine context"
This reverts commit 3fbcbcaef2.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-18 21:02:14 +02:00
Szymon Morek 33ab962121 fix: adjust compression hint usage for ocl buffers
Related-To: NEO-11989

Also, use zero-copy on lnl

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-18 18:24:48 +02:00
Michal Mrozek 20d6910b66 performance: move usm pool init to first alloc call
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-18 16:07:22 +02:00
Mateusz Hoppe 3fbcbcaef2 feature: add support for HP copy engine context
- add support for context group with HP copy engine
- choose HP copy engine when available

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-18 16:07:07 +02:00
Dominik Dabek c1c9ac634b performance(ocl): enable host usm alloc recycle
Enable at threshold of 2% system memory.

Related-To: NEO-6893

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-07-17 19:33:56 +02:00
Mateusz Jablonski 8a60742a8d fix: correct reported num subslices per slice in fused config
Related-To: NEO-12073
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-17 17:17:28 +02:00
Dominik Dabek fc9de71feb fix(ocl): finish in release ogl object if needed
Finish cache flushes before exiting api call if releasing displayable
ogl object or dcflush is mitigated.

Related-To: NEO-11694

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-07-16 11:21:32 +02:00
Compute-Runtime-Validation 9a6403f3bc Revert "refactor: Add dc flush mitigation infrastructure"
This reverts commit d6076941a8.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-15 11:47:30 +02:00
Lukasz Jobczyk d6076941a8 refactor: Add dc flush mitigation infrastructure
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-12 14:45:51 +02:00
Bartosz Dunajski e188de2489 fix: initialize page tables before access for TSP allocation in TBX mode
Related-To: NEO-8340

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-11 16:03:19 +02:00
Lukasz Jobczyk b0a5f2cced fix: Stop direct submission before signal GL event
Related-To: NEO-10556

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-11 13:45:42 +02:00
Dominik Dabek bdeccab7aa fix: bcs enqueue after marker properly waits
For an example sequence of:
IOQ_1 -> enqueue copy, enqueue marker with waitlist (out event)
IOQ_2 -> enqueue marker with waitlist (event), enqueue copy

Add missing synchronization between the enqueue copies

Related-To: NEO-11694

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-07-11 10:36:18 +02:00
Michal Mrozek 05eb4e7a0d performance: add debug flag to disable l1 flush
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-11 10:09:46 +02:00
Szymon Morek dbd96372be performance: adjust staging buffer usage
Related-To: NEO-11928

Don't copy through staging buffer if dst usm allocation
was not used before and transfer would be split.
Also, don't use staging buffers for mapped ocl buffers.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-10 10:19:18 +02:00
Michal Mrozek 4cabc9e4d2 performance: remove not needed code.
events are already created with queued state.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-10 07:35:05 +02:00
Bartosz Dunajski 90d1a210d8 refactor: remove not needed check
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-09 16:11:33 +02:00
Mateusz Jablonski 778645c11e feature: add initial support for Xe2 platforms
Related-To: NEO-8188, NEO-10774
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-09 10:30:27 +02:00
Szymon Morek 7d25965a78 performance: change buffer type for new coherency model
Related-To: NEO-11882

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-09 09:33:53 +02:00
Lukasz Jobczyk bbeb5224d8 fix: Add IOH alignment to estimation
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-08 12:29:54 +02:00
Mateusz Jablonski 4f4b8fed15 refactor: remove not needed code related to deprecated device ip version
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-05 09:54:48 +02:00
Compute-Runtime-Validation 38872b7e1b Revert "refactor: Add dc flush mitigation infrastructure"
This reverts commit 1cba900ad9.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-04 08:20:18 +02:00
Lukasz Jobczyk a96f2ea13a performance: disable blit enqueue on LNL
Resolves: NEO-11471

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-04 05:41:31 +02:00
Szymon Morek daeb6e897a performance: limit barrier usage in non-usm copies
Related-To: NEO-11501

Don't emit barrier if non-usm copy through staging
buffers with OOQ and single transfer was issued.
No need to send barrier if there was single transfer -
there's nothing to aggregate

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-03 13:45:52 +02:00
Compute-Runtime-Validation b8246d806f Revert "performance: change buffer type for new coherency model"
This reverts commit c979495265.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-03 02:11:19 +02:00
Morek, Szymon aed96cede4 fix: flush barrier when profiling enabled
Related-To: NEO-10615

Signed-off-by: Morek, Szymon <szymon.morek@intel.com>
2024-07-02 13:25:07 +02:00
Szymon Morek c979495265 performance: change buffer type for new coherency model
Related-To: NEO-11882

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-02 12:49:41 +02:00
Szymon Morek 53f9a48fce fix: set start TS on cpu if TS packet write disabled
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-01 17:14:10 +02:00
Oskar Hubert Weber 9a486dd5a1 fix: Add missing fp64 extensions in caps initialization
In a scenario where fp64 emulation is both supported and enabled,
we want to include 2 additional extensions:
- cl_khr_fp64 to deviceExtensions
- __opencl_c_fp64 to openclCFeatures (OpenCL ver. 3.0)

Related-To: NEO-7611

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2024-07-01 13:09:13 +02:00
Dominik Dabek 76e8be5c39 fix(ocl): handle gl sharing displayable textures
Displayable textures always need dc flush.

Related-To: NEO-11694

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-06-27 13:43:20 +02:00
Kamil Kopryk 70e52ce4d8 refactor: add separate enablers for compute walker and gpgpu walker
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-27 10:27:48 +02:00
Szymon Morek 8ee92b840f fix: don't set start timestamp on cpu for bcs
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-06-26 13:30:58 +02:00