Bartosz Dunajski
9aa81bae75
feature: initial support to enable synchronized dispatch
...
Related-To: NEO-8171
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-03-22 17:23:58 +01:00
Dominik Dabek
2b964254d6
performance: debug key for adjust ULLS on battery
...
ULLS controller timeout settings will be adjusted based on ac line
status and lowest queue throttle from submissions.
Lowest queue throttle is reset when controller stops ULLS.
Related-To: NEO-10800
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-03-22 14:24:00 +01:00
Katarzyna Cencelewska
da7b03dd15
fix: to always use grfs count in calculateNumThreadsPerThreadGroup
...
grf size != grf count
Related-To: GSD-8437
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2024-03-22 11:03:18 +01:00
Katarzyna Cencelewska
dd1d52259e
refactor: add param rootDeviceEnvironment to calculateNumThreadsPerThreadGroup
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2024-03-21 22:25:14 +01:00
Mateusz Hoppe
0cec5ccc32
fix: check if any of devices has BindlessHeapHelper enabled
...
- report Bindless_images extension in driver only when at least one
device has global bindless heaps enabled
Related-To: NEO-10352
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2024-03-20 20:50:58 +01:00
Mateusz Jablonski
614ed7ff5f
fix: remove not needed appendBaseAddressOffset method
...
Related-To: NEO-10623
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-03-20 17:46:14 +01:00
Wenju He
03078541d7
feature: make global bindless heaps resident when created
...
Make bindless heaps resident right after heap allocation.
Motivation is that SYCL bindless image can be passed as a value argument
or through memory. Therefore, we're not able to make its bindless heap
resident during kernel initialization or setting kernel arguments.
This fixes SYCL bindless image read_write_*D.cpp tests on DG2.
Related-To: NEO-7063
Signed-off-by: Wenju He <wenju.he@intel.com >
2024-03-20 14:40:11 +01:00
Bartosz Dunajski
e400a5c624
feature: in-order host counter allocation pooling 2
...
Related-To: NEO-10507
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-03-13 18:59:02 +01:00
Zbigniew Zdanowicz
4fb4e731d7
refactor: add load register immediate encoder on command pointer
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-03-12 09:38:03 +01:00
Zbigniew Zdanowicz
3f740f3460
feature: add new functionality to in order patch helpers
...
Related-To: NEO-10385
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-03-12 09:17:53 +01:00
Compute-Runtime-Validation
4082e9f028
Revert "feature: in-order host counter allocation pooling"
...
This reverts commit 2f03c48c7a .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-03-09 19:47:10 +01:00
Mrozek, Michal
13a347cbde
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com >
2024-03-08 15:10:06 +01:00
Mateusz Hoppe
5e15543b14
test: improve unit tests - use virtual file system
...
- use virtualFileSystem in ULTs, writeDataToFile() does not create file
but instead saves filename. Call to fileExists() checks if file was
previously written
Related-To: NEO-7006
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2024-03-08 13:34:19 +01:00
Dunajski, Bartosz
2f03c48c7a
feature: in-order host counter allocation pooling
...
Related-To: NEO-10507
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2024-03-08 13:13:04 +01:00
Kamil Kopryk
168445784e
feature: introduce states programming at driver init heapless ocl
...
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-03-08 12:29:44 +01:00
Mrozek, Michal
10313b7b84
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com >
2024-03-07 18:50:16 +01:00
Zbigniew Zdanowicz
fe377b6597
feature: add out cmd pointer to store data imm encoder
...
Related-To: NEO-10385
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-03-07 12:52:08 +01:00
Michal Mrozek
660539fe4e
refactor: remove not needed code.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2024-03-06 20:13:57 +01:00
Dominik Dabek
033ff28609
performance: enable cmd buffer preallocate xehp+
...
For L0 immediate cmdlists try to use reusable allocations if available.
Related-To: NEO-10526
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-03-06 16:53:01 +01:00
Compute-Runtime-Validation
293858320e
Revert "performance: Fill reusable allocation list on xe and later"
...
This reverts commit 15934ceb43 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-03-02 07:09:45 +01:00
Fabian Zwolinski
52430762ac
fix: cl_cache L0 env vars + refactor code structure
...
Added support for new Compiler Cache
environment variables in Level Zero.
Moved
`opencl/source/compiler_interface/default_cache_config.cpp`
`level_zero/core/source/compiler_interface/default_cache_config.cpp`
to shared directory
`source/compiler_interface/default_cache_config.cpp`
Switched enabling cache by default from per OS to per API.
Changed default state of cl_cache in Level Zero to disabled.
Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com >
2024-03-01 17:35:08 +01:00
Dunajski, Bartosz
51ae76a25f
refactor: improve handling of in-order atomic signaling
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2024-03-01 15:59:25 +01:00
Dunajski, Bartosz
ea2ad550a1
refactor: improve handling duplicated in-order host storage
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2024-03-01 13:54:28 +01:00
Mateusz Jablonski
80aa55a3cb
fix: don't program dummy blit prior to MI_FLUSH_DW without postsync
...
add missing dummy blits before MI_FLUSH_DW with postsync
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-03-01 12:36:13 +01:00
Dunajski, Bartosz
bbe1043f08
feature: initial support for pooling in-order counter allocations
...
Related-To: NEO-10507
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2024-03-01 12:08:02 +01:00
Zbigniew Zdanowicz
4baee71974
refactor: add out cmd argument to semaphore wait encoder
...
Related-To: NEO-10065
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-03-01 11:43:36 +01:00
Lukasz Jobczyk
15934ceb43
performance: Fill reusable allocation list on xe and later
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-03-01 09:41:19 +01:00
Compute-Runtime-Validation
5fce046877
Revert "fix: don't program dummy blit prior to MI_FLUSH_DW without postsync"
...
This reverts commit d796fb559d .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-03-01 01:43:35 +01:00
Mateusz Jablonski
d796fb559d
fix: don't program dummy blit prior to MI_FLUSH_DW without postsync
...
add missing dummy blits before MI_FLUSH_DW with postsync
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-02-29 13:30:09 +01:00
Lukasz Jobczyk
39a44628a3
performance: Enable timestamp wait for queues on xe and later
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-02-28 14:26:59 +01:00
Dunajski, Bartosz
8483a922a7
feature: simplify in-order allocation overflow
...
Related-To: NEO-10507
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2024-02-27 18:25:26 +01:00
Mateusz Jablonski
b6b53daa3f
fix: don't program dummy blit prior to MI ARB CHECK
...
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-02-27 15:46:46 +01:00
Zbigniew Zdanowicz
0bf8e8727e
refactor: add output buffer argument to store register to memory encoder
...
Related-To: NEO-10064
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-02-27 13:31:55 +01:00
Lukasz Jobczyk
cfd3edfb2c
fix: Align IOH entry
...
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-02-26 14:36:31 +01:00
Zbigniew Zdanowicz
1995e6aa40
feature: store cpu pointer of command buffer pointing to post sync command
...
Related-To: NEO-10064
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-02-23 13:12:15 +01:00
Dunajski, Bartosz
915b19b0be
refactor: remove not needed debug_break_if
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2024-02-19 16:28:58 +01:00
Dunajski, Bartosz
f31c88d05f
fix: size estimation for tlb flush before copy
...
Related-To: HSD-18036669673
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2024-02-19 13:35:35 +01:00
Dunajski, Bartosz
be7b80746b
fix: update incorrect debug_break
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2024-02-16 16:55:59 +01:00
Kamil Kopryk
0c5cba8ebd
refactor: use common indirectDataAlignment static constexpr value
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-02-16 13:55:00 +01:00
Lukasz Jobczyk
cc1732c930
performance: make resident before lock
...
Resolves: NEO-10369
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-02-16 12:00:38 +01:00
Compute-Runtime-Validation
4d159bd06c
Revert "performance: make resident before lock"
...
This reverts commit 7e7fac01e3 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-02-16 09:23:49 +01:00
Lukasz Jobczyk
7e7fac01e3
performance: make resident before lock
...
Resolves: NEO-10369
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-02-15 11:56:36 +01:00
Michal Mrozek
25b3a3dbfa
fix: user proper settings for caches.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2024-02-15 11:38:34 +01:00
Compute-Runtime-Validation
57c946b61c
Revert "fix: align indirect data pointer to cacheline size in heapless mode"
...
This reverts commit 004e6e647f .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-02-15 05:49:19 +01:00
Compute-Runtime-Validation
d22b2a8a38
Revert "feature: add support for the cl_khr_extended_bit_ops extension"
...
This reverts commit 6dec2143df .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-02-15 05:34:10 +01:00
Kamil Kopryk
004e6e647f
fix: align indirect data pointer to cacheline size in heapless mode
...
Align indirect data pointer to cacheline size in heapless mode,
restore debug_break_if if avx2 load/store operation
gets unaligned pointer,
remove fallback to mm256 loadu/storeu unaligned operation
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-02-14 10:00:15 +01:00
Ben Ashbaugh
6dec2143df
feature: add support for the cl_khr_extended_bit_ops extension
...
Related-To: NEO-10337
Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com >
2024-02-14 09:11:44 +01:00
Kamil Kopryk
2b91286754
fix: fix load store uint16x16 avx2 for unaligned ptr
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-02-08 09:35:30 +01:00
Lukasz Jobczyk
486cc71b76
refactor: Add GDI profiling
...
Resolves: NEO-9236
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-02-07 18:44:11 +01:00
Maciej Plewka
ce17580b28
fix: Use Rcs engine in blender on DG2
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2024-02-07 18:21:54 +01:00