Zbigniew Zdanowicz
74fbd32515
feature: add wrapper to access template method getting local id generation
...
Related-To: NEO-10066
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-04-18 11:45:56 +02:00
Jemale Lockett
79f8993220
refactor: move GPR15 to separate debugger context
...
Signed-off-by: Jemale Lockett <jemale.lockett@intel.com>
2024-04-18 10:42:23 +02:00
Bartosz Dunajski
02ebb857cb
fix: create new InOrderExecInfo on cmd list reset
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-17 20:31:48 +02:00
Aleksander Czerwionka
b630f58861
fix: remove incorrectly reported extensions
...
Related-To: NEO-8349
Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-04-15 10:04:31 +02:00
Zbigniew Zdanowicz
3b9e1e4b32
refactor: add getter to in order exec info class
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-04-12 10:17:39 +02:00
Kamil Kopryk
cfc3b6a8c2
refactor: add heaplessModeEnabled argument to isHeaplessStateInitEnabled
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-04-11 10:08:10 +02:00
Mateusz Jablonski
cb2b572e94
feature: add support for null aub mode
...
In this mode AUB csr will be created, however, no aub file will be created
Related-To: NEO-11097
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-09 16:59:42 +02:00
Kamil Kopryk
4eae28bd64
feature: introduce heapless state init in L0
...
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-04-02 12:34:53 +02:00
Mateusz Hoppe
f86d4220a5
feature: add bindless samplers support to level zero
...
- samplers using bindless adressing require patching bindless offsets to
sampler states on kernel's cross thread data
Related-To: NEO-10505
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-29 08:07:28 +01:00
Kacper Nowak
35ca2bbea9
refactor: refactor logic for querying atomic FP capabilities
...
- Move logic from product helper to compiler product helper
- Add method for adjusting fp16 and extra capabilities using release
helper (if present).
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-03-28 15:27:45 +01:00
Bartosz Dunajski
fce016ee7a
refactor: add new semaphore param
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-26 15:46:47 +01:00
Bartosz Dunajski
74dcb5a9e3
feature: initial support for sync dispatch token allocation
...
Related-To: NEO-8171
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-26 12:07:58 +01:00
Bartosz Dunajski
9aa81bae75
feature: initial support to enable synchronized dispatch
...
Related-To: NEO-8171
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-22 17:23:58 +01:00
Dominik Dabek
2b964254d6
performance: debug key for adjust ULLS on battery
...
ULLS controller timeout settings will be adjusted based on ac line
status and lowest queue throttle from submissions.
Lowest queue throttle is reset when controller stops ULLS.
Related-To: NEO-10800
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-22 14:24:00 +01:00
Katarzyna Cencelewska
da7b03dd15
fix: to always use grfs count in calculateNumThreadsPerThreadGroup
...
grf size != grf count
Related-To: GSD-8437
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-03-22 11:03:18 +01:00
Katarzyna Cencelewska
dd1d52259e
refactor: add param rootDeviceEnvironment to calculateNumThreadsPerThreadGroup
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-03-21 22:25:14 +01:00
Mateusz Hoppe
0cec5ccc32
fix: check if any of devices has BindlessHeapHelper enabled
...
- report Bindless_images extension in driver only when at least one
device has global bindless heaps enabled
Related-To: NEO-10352
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-20 20:50:58 +01:00
Mateusz Jablonski
614ed7ff5f
fix: remove not needed appendBaseAddressOffset method
...
Related-To: NEO-10623
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-20 17:46:14 +01:00
Wenju He
03078541d7
feature: make global bindless heaps resident when created
...
Make bindless heaps resident right after heap allocation.
Motivation is that SYCL bindless image can be passed as a value argument
or through memory. Therefore, we're not able to make its bindless heap
resident during kernel initialization or setting kernel arguments.
This fixes SYCL bindless image read_write_*D.cpp tests on DG2.
Related-To: NEO-7063
Signed-off-by: Wenju He <wenju.he@intel.com>
2024-03-20 14:40:11 +01:00
Bartosz Dunajski
e400a5c624
feature: in-order host counter allocation pooling 2
...
Related-To: NEO-10507
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-13 18:59:02 +01:00
Zbigniew Zdanowicz
4fb4e731d7
refactor: add load register immediate encoder on command pointer
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-12 09:38:03 +01:00
Zbigniew Zdanowicz
3f740f3460
feature: add new functionality to in order patch helpers
...
Related-To: NEO-10385
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-12 09:17:53 +01:00
Compute-Runtime-Validation
4082e9f028
Revert "feature: in-order host counter allocation pooling"
...
This reverts commit 2f03c48c7a
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-09 19:47:10 +01:00
Mrozek, Michal
13a347cbde
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 15:10:06 +01:00
Mateusz Hoppe
5e15543b14
test: improve unit tests - use virtual file system
...
- use virtualFileSystem in ULTs, writeDataToFile() does not create file
but instead saves filename. Call to fileExists() checks if file was
previously written
Related-To: NEO-7006
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-08 13:34:19 +01:00
Dunajski, Bartosz
2f03c48c7a
feature: in-order host counter allocation pooling
...
Related-To: NEO-10507
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-08 13:13:04 +01:00
Kamil Kopryk
168445784e
feature: introduce states programming at driver init heapless ocl
...
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-03-08 12:29:44 +01:00
Mrozek, Michal
10313b7b84
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-07 18:50:16 +01:00
Zbigniew Zdanowicz
fe377b6597
feature: add out cmd pointer to store data imm encoder
...
Related-To: NEO-10385
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-07 12:52:08 +01:00
Michal Mrozek
660539fe4e
refactor: remove not needed code.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-03-06 20:13:57 +01:00
Dominik Dabek
033ff28609
performance: enable cmd buffer preallocate xehp+
...
For L0 immediate cmdlists try to use reusable allocations if available.
Related-To: NEO-10526
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-06 16:53:01 +01:00
Compute-Runtime-Validation
293858320e
Revert "performance: Fill reusable allocation list on xe and later"
...
This reverts commit 15934ceb43
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-02 07:09:45 +01:00
Fabian Zwolinski
52430762ac
fix: cl_cache L0 env vars + refactor code structure
...
Added support for new Compiler Cache
environment variables in Level Zero.
Moved
`opencl/source/compiler_interface/default_cache_config.cpp`
`level_zero/core/source/compiler_interface/default_cache_config.cpp`
to shared directory
`source/compiler_interface/default_cache_config.cpp`
Switched enabling cache by default from per OS to per API.
Changed default state of cl_cache in Level Zero to disabled.
Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-03-01 17:35:08 +01:00
Dunajski, Bartosz
51ae76a25f
refactor: improve handling of in-order atomic signaling
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-01 15:59:25 +01:00
Dunajski, Bartosz
ea2ad550a1
refactor: improve handling duplicated in-order host storage
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-01 13:54:28 +01:00
Mateusz Jablonski
80aa55a3cb
fix: don't program dummy blit prior to MI_FLUSH_DW without postsync
...
add missing dummy blits before MI_FLUSH_DW with postsync
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-01 12:36:13 +01:00
Dunajski, Bartosz
bbe1043f08
feature: initial support for pooling in-order counter allocations
...
Related-To: NEO-10507
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-01 12:08:02 +01:00
Zbigniew Zdanowicz
4baee71974
refactor: add out cmd argument to semaphore wait encoder
...
Related-To: NEO-10065
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-01 11:43:36 +01:00
Lukasz Jobczyk
15934ceb43
performance: Fill reusable allocation list on xe and later
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-01 09:41:19 +01:00
Compute-Runtime-Validation
5fce046877
Revert "fix: don't program dummy blit prior to MI_FLUSH_DW without postsync"
...
This reverts commit d796fb559d
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-01 01:43:35 +01:00
Mateusz Jablonski
d796fb559d
fix: don't program dummy blit prior to MI_FLUSH_DW without postsync
...
add missing dummy blits before MI_FLUSH_DW with postsync
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-29 13:30:09 +01:00
Lukasz Jobczyk
39a44628a3
performance: Enable timestamp wait for queues on xe and later
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-28 14:26:59 +01:00
Dunajski, Bartosz
8483a922a7
feature: simplify in-order allocation overflow
...
Related-To: NEO-10507
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-27 18:25:26 +01:00
Mateusz Jablonski
b6b53daa3f
fix: don't program dummy blit prior to MI ARB CHECK
...
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-27 15:46:46 +01:00
Zbigniew Zdanowicz
0bf8e8727e
refactor: add output buffer argument to store register to memory encoder
...
Related-To: NEO-10064
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-02-27 13:31:55 +01:00
Lukasz Jobczyk
cfd3edfb2c
fix: Align IOH entry
...
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-26 14:36:31 +01:00
Zbigniew Zdanowicz
1995e6aa40
feature: store cpu pointer of command buffer pointing to post sync command
...
Related-To: NEO-10064
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-02-23 13:12:15 +01:00
Dunajski, Bartosz
915b19b0be
refactor: remove not needed debug_break_if
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-19 16:28:58 +01:00
Dunajski, Bartosz
f31c88d05f
fix: size estimation for tlb flush before copy
...
Related-To: HSD-18036669673
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-19 13:35:35 +01:00
Dunajski, Bartosz
be7b80746b
fix: update incorrect debug_break
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-16 16:55:59 +01:00
Kamil Kopryk
0c5cba8ebd
refactor: use common indirectDataAlignment static constexpr value
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-16 13:55:00 +01:00
Lukasz Jobczyk
cc1732c930
performance: make resident before lock
...
Resolves: NEO-10369
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-16 12:00:38 +01:00
Compute-Runtime-Validation
4d159bd06c
Revert "performance: make resident before lock"
...
This reverts commit 7e7fac01e3
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-16 09:23:49 +01:00
Lukasz Jobczyk
7e7fac01e3
performance: make resident before lock
...
Resolves: NEO-10369
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-15 11:56:36 +01:00
Michal Mrozek
25b3a3dbfa
fix: user proper settings for caches.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-02-15 11:38:34 +01:00
Compute-Runtime-Validation
57c946b61c
Revert "fix: align indirect data pointer to cacheline size in heapless mode"
...
This reverts commit 004e6e647f
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-15 05:49:19 +01:00
Compute-Runtime-Validation
d22b2a8a38
Revert "feature: add support for the cl_khr_extended_bit_ops extension"
...
This reverts commit 6dec2143df
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-15 05:34:10 +01:00
Kamil Kopryk
004e6e647f
fix: align indirect data pointer to cacheline size in heapless mode
...
Align indirect data pointer to cacheline size in heapless mode,
restore debug_break_if if avx2 load/store operation
gets unaligned pointer,
remove fallback to mm256 loadu/storeu unaligned operation
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-14 10:00:15 +01:00
Ben Ashbaugh
6dec2143df
feature: add support for the cl_khr_extended_bit_ops extension
...
Related-To: NEO-10337
Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2024-02-14 09:11:44 +01:00
Kamil Kopryk
2b91286754
fix: fix load store uint16x16 avx2 for unaligned ptr
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-08 09:35:30 +01:00
Lukasz Jobczyk
486cc71b76
refactor: Add GDI profiling
...
Resolves: NEO-9236
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-07 18:44:11 +01:00
Maciej Plewka
ce17580b28
fix: Use Rcs engine in blender on DG2
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-02-07 18:21:54 +01:00
Kacper Nowak
ab1ffea592
refactor: Create function to check if OCL2.1 caps are enabled or enforced
...
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-02-06 08:27:09 +01:00
Dunajski, Bartosz
f31fafb1e2
refactor: improve debug flag to override bcs mocs
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-05 20:23:54 +01:00
Dominik Dabek
2cad595a0d
performance: debug flag for usm host alloc recycle
...
set ExperimentalEnableHostAllocationCache=1 to recycle host usm
allocations
Related-To: GSD-7497
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-01 16:47:59 +01:00
Mateusz Jablonski
2237961deb
performance: add macro for ULT-only unrecoverable if
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-01 15:09:31 +01:00
Compute-Runtime-Validation
fa9c79fb63
Revert "refactor: Add GDI profiling"
...
This reverts commit 524ae7713a
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-30 10:47:34 +01:00
Fabian Zwolinski
c51b656d2c
fix: request instruction cache invalidation on module destroy
...
Invalidation is requested on both linux and windows,
on Csr's that used Isa allocation.
Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-01-29 16:00:34 +01:00
Lukasz Jobczyk
524ae7713a
refactor: Add GDI profiling
...
Resolves: NEO-9236
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-29 11:36:04 +01:00
Mateusz Jablonski
a697a3f718
refactor: create new members for storing spill and private memory in scratch
...
rename private scratch space into scratch space slot 1 as it can be generic
Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-23 12:42:25 +01:00
Compute-Runtime-Validation
f9f9035b95
Revert "refactor: create new members for storing spill and private memory in ...
...
This reverts commit 87eb5f554a
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-23 09:13:00 +01:00
Compute-Runtime-Validation
e949ba7144
Revert "refactor: Add GDI profiling"
...
This reverts commit 8d56f8fb6b
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-23 06:13:02 +01:00
Mateusz Jablonski
87eb5f554a
refactor: create new members for storing spill and private memory in scratch
...
rename private scratch space into scratch space slot 1 as it can be generic
Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-22 19:48:48 +01:00
Dunajski, Bartosz
b77e1a6a71
feature: improve reporting max cooperative group count
...
Related-To: NEO-8210
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-22 14:56:15 +01:00
Lukasz Jobczyk
8d56f8fb6b
refactor: Add GDI profiling
...
Resolves: NEO-9236
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-22 14:24:08 +01:00
Dunajski, Bartosz
6f4ed10919
refactor: unify getMaxWorkGroupCount logic
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-22 11:16:45 +01:00
Dunajski, Bartosz
b266f1f3cc
refactor: improve implicit scaling interface
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-19 14:09:04 +01:00
Mateusz Jablonski
028a5ee2fc
fix: correct calculating highest enabled dual subslice
...
when no DSS is exposed then calculate highest enabled subslice instead
Related-To: NEO-9614
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-19 10:48:15 +01:00
Jitendra Sharma
aa191b6f88
feature: Set runalone mode for contexts with online debugging
...
Related-To: NEO-9139
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-01-17 09:01:30 +01:00
John Falkowski
10ef2a28ff
fix: correct for default hierachry
...
Resolves: NEO-10006
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2024-01-15 22:15:49 +01:00
Dunajski, Bartosz
dfc69c746a
feature: create InOrderExecInfo for standalone CB Event
...
Related-To: NEO-8145
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-15 16:56:52 +01:00
Fabian Zwolinski
a3bd629aff
fix: store image implicit args in bindless slot 1
...
bindless slot 0: image
bindless slot 1: image implicit args
bindless slot 2: redescribed image
Redescribed image moved one slot higher.
Separate allocation is created for new bindless slot.
Related-To: NEO-9740
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-01-12 13:09:06 +01:00
Katarzyna Cencelewska
5d2d3ed899
refactor: modify sip kernel helpers
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-01-09 17:09:10 +01:00
Mateusz Jablonski
a73fb4d2fe
fix: correct reporing kernel private size on L0
...
unify the logic across APIs
Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-09 09:54:05 +01:00
Dunajski, Bartosz
57900c5455
refactor: improve single slice dispatch mode handling
...
Related-To: NEO-8210
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-04 15:24:09 +01:00
Dunajski, Bartosz
d7b6f11ced
refactor: improve creating 48b resources
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-29 10:05:42 +01:00
Mateusz Hoppe
31e9b5e9fa
feature: add support for secondary contexts in group
...
Related-To: NEO-7824
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-12-28 13:31:08 +01:00
Dunajski, Bartosz
df66a0276f
refactor: remove not used logic to check dynamic postsync layout
...
Related-To: NEO-8210
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-27 13:12:11 +01:00
Compute-Runtime-Validation
f2c9d7172a
Revert "fix: store image implicit args in bindless slot 1"
...
This reverts commit b6d572ddb8
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-12-23 12:01:42 +01:00
Dunajski, Bartosz
d6560c5bba
refactor: pass additional in-order params to PatchCmd helper
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-21 19:21:56 +01:00
Kacper Nowak
18fb432552
refactor: Stop using hard-coded BCS engine as default one
...
Do not rely on presence of BCS0 engine.
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-12-21 17:34:57 +01:00
Dunajski, Bartosz
0f275ab893
refactor: improve InOrder Walker patching
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-21 14:30:46 +01:00
Mateusz Jablonski
a4888b39c6
build: add clang-tidy restriction for Enum case
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-21 08:58:51 +01:00
Mateusz Hoppe
c90dab7a2d
feature: introduce highPriority engine usage
...
Related-To: NEO-7824
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-12-20 20:59:43 +01:00
Fabian Zwolinski
b6d572ddb8
fix: store image implicit args in bindless slot 1
...
bindless slot 0: image
bindless slot 1: image implicit args
bindless slot 2: redescribed image
Redescribed image moved one slot higher.
Separate allocation is created for new bindless slot.
Related-To: NEO-9740
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-12-20 19:09:14 +01:00
Aravind Gopalakrishnan
f5045348ad
fix: Report correct DP support for platforms
...
Related-To: NEO-9702
- Remove older interfaces and add new release helper
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2023-12-20 10:35:49 +01:00
Mateusz Jablonski
138fb65401
refactor: correct naming of enum class constants 11/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 14:52:57 +01:00
Mateusz Jablonski
de93bc6928
refactor: correct naming of enum class constants 10/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 11:30:39 +01:00
Mateusz Jablonski
a6458433dc
refactor: correct naming of enum class constants 9/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 10:36:25 +01:00
Mateusz Jablonski
dd1b9d6abc
refactor: correct naming of enum class constants 8/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 08:18:18 +01:00
Chodor, Jaroslaw
7e795cd3c1
feature: Adding support for OCL C support queries to ocloc
...
Feature needed for automated OCL C compilation with ocloc as backend.
Added queries :
* CL_DEVICE_EXTENSIONS
* CL_DEVICE_EXTENSIONS_WITH_VERSION
* CL_DEVICE_PROFILE
* CL_DEVICE_OPENCL_C_ALL_VERSIONS
* CL_DEVICE_OPENCL_C_FEATURES
Sample command line:
ocloc query -device skl CL_DEVICE_OPENCL_C_FEATURES
Related-To: GSD-7420
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2023-12-18 15:19:16 +01:00
Mateusz Jablonski
fca28a930f
refactor: correct naming of enum class constants 7/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-18 13:05:56 +01:00
Dunajski, Bartosz
1219c48620
refactor: improve post sync handling and in-order unit tests
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-15 11:17:56 +01:00
Lu, Wenbin
67fa39c9a1
fix: get right page size when malloc uses 0 alignment
...
Related-To: GSD-7103
Signed-off-by: Lu, Wenbin <wenbin.lu@intel.com>
2023-12-13 15:00:56 +01:00
Mateusz Jablonski
739d181026
refactor: correct naming of enum class constants 6/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 14:48:52 +01:00
Mateusz Jablonski
261c4d3e72
refactor: add NOLINT for hardware info padding warning
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 12:18:11 +01:00
Mateusz Jablonski
432142c574
refactor: correct naming of enum class constants 4/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 08:08:51 +01:00
Mateusz Jablonski
0428c0acd1
refactor: correct naming of HeapIndex enum values
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-12 11:46:17 +01:00
Dunajski, Bartosz
717bc0c03f
refactor: pass InOrderExecInfo to encode method
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-11 16:43:30 +01:00
Mateusz Jablonski
b182917d9d
refactor: correct naming of allocation types
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-11 16:23:37 +01:00
Mateusz Jablonski
beafea9b39
refactor: correct naming of enum class constants 2/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-11 13:13:35 +01:00
Dunajski, Bartosz
9ce3713ace
refactor: improve Walker args handling
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-06 16:10:26 +01:00
Mateusz Jablonski
1bbc7a57ab
refactor: rename VFE_STATE_TYPE -> FrontEndStateCommand
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-05 14:51:01 +01:00
Mateusz Jablonski
da957d1a37
refactor: correct naming of enum class constants 1/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-05 14:26:42 +01:00
Mateusz Jablonski
6849d33326
refactor: remove redundant definitions KB/MB/GB
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-05 08:55:18 +01:00
Mateusz Jablonski
c48e57e2de
refactor: rename register offsets and constants
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-04 18:31:39 +01:00
Mateusz Jablonski
9486cd2a26
refactor: rename WALKER_TYPE to DefaultWalkerType
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-04 16:29:32 +01:00
Mateusz Jablonski
c3d3a4db1f
refactor: correct variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-04 13:45:53 +01:00
Mateusz Jablonski
e57d372608
refactor: correct naming of enum class constants
...
EngineGroupType
BlitDirection
PostBlitMode
WaitStatus
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 11:53:51 +01:00
Mateusz Jablonski
b92b5892b8
refactor: correct variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 08:34:19 +01:00
Mateusz Jablonski
c3ac7b78bd
refactor: correct variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 02:18:46 +01:00
Mateusz Jablonski
c9664e6bad
refactor: rename global debug manager to debugManager
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-30 13:00:59 +01:00
Mateusz Jablonski
36194c4e7d
refactor: correct variable namings
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-29 23:49:03 +01:00
Kamil Kopryk
982c2d665b
feature: add programming heapless mode in SBA
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-11-24 15:29:19 +01:00
Kamil Kopryk
ce7298d512
feature: Add heapless mode programming in ocl
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-11-24 12:53:39 +01:00
Baj, Tomasz
c49a9b9787
refactor: remove ReturnSubDevicesAsApiDevices from shared code
...
Related-To: NEO-9437
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-11-22 15:13:29 +01:00
Mateusz Jablonski
8dd80efbb1
refactor: move getting thread per eu configs to release helper
...
Related-To: HSD-18034098647
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-21 09:44:32 +01:00
Baj, Tomasz
97842d8eab
fix: program PIPE_CONTROl with RT Flush and CS Stall prior to PIPE_SELECT
...
Related-To: NEO-9194
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-11-20 16:43:59 +01:00
Mateusz Jablonski
35c1f34672
refactor: move number of threads per eu to release helper
...
Related-To: HSD-18034098647
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-20 12:16:33 +01:00
Mateusz Jablonski
a0f8e8accc
fix: initialize max capability values for ARL
...
setup max values when overriding hw info with debug key
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-15 13:44:11 +01:00
Dunajski, Bartosz
30777d4d4c
feature: use indirect semaphore for 64b values
...
Related-To: NEO-8145
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-11-09 16:58:45 +01:00
Mateusz Hoppe
f36e86f121
fix: program bindless surface state size from sba properties
...
- surfaceStateSize is in pages, bindless size needs to be programmed in
surface state units
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-11-07 17:45:54 +01:00
Kamil Kopryk
73bb139e17
refactor: add applyExtraInternalOptions function
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-11-03 09:43:37 +01:00
Michal Mrozek
060fc40938
performance: move constants out of the loop.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2023-10-30 18:16:35 +01:00
Mateusz Jablonski
1e18bf0392
fix: define timestamp mask based on timer resolution
...
Related-To: HSD-18034577551
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-27 11:55:20 +02:00
Compute-Runtime-Validation
fca2159430
Revert "fix: if device hierarchy is flat then getSubDevicesCount return 1u"
...
This reverts commit cb0bb57f49
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-10-26 15:40:29 +02:00
Baj, Tomasz
cb0bb57f49
fix: if device hierarchy is flat then getSubDevicesCount return 1u
...
Related-To: NEO-9167
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-10-25 15:51:52 +02:00
Mateusz Jablonski
6d2d16d68e
fix: avoid overflow of gpu time stamp in ns
...
Related-To: NEO-8394
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-24 14:56:15 +02:00
Zbigniew Zdanowicz
2e09b5ff66
refactor: reposition preamble helper implementation methods
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-10-20 13:18:29 +02:00
Mateusz Hoppe
97faeae16f
feature: heapAssigner per root device
...
- create heapAssigner per root device in memory manager to allow per
device config
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-10-19 19:52:24 +02:00
Mateusz Jablonski
4dfa12c8eb
fix: add mechanism to detect gpu timestamp overflows
...
unify naming CpuGpu to GpuCpu
Related-To: NEO-8394
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-19 16:31:06 +02:00
Lukasz Jobczyk
750b5ba89a
fix: flush necessary caches when dispatch pipe control
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-10-19 15:09:31 +02:00
Mateusz Hoppe
5a4fa180d6
feature: control bindless compilation mode based on release
...
- check releaseHelper support when selecting bindless mode, if not
disabled, prefer bindless mode in L0 API
- bindless mode can be forced with DebugVariable: UseBindlessMode
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-10-19 10:11:11 +02:00
John Falkowski
f156a74f54
fix: split chunking prefetch flags
...
Related-To: NEO-9120
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2023-10-18 19:20:42 +02:00
Mateusz Jablonski
a420e34b10
fix: explicitly remove assign operators when not needed
...
when class defines copy/move ctor then corresponding assign operator(s)
should be defined or deleted
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-12 10:25:15 +02:00
Mateusz Jablonski
3fdcf049bf
fix: set default device hierarchy to composite for all platforms except xe hpc
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-12 10:09:50 +02:00
Latif, Raiyan
dee5ecfdf3
fix: ReturnSubDevicesAsApiDevices flag being ignored
...
Proper subdevice count being returned now in GfxCoreHelper
path, as previous method ignored the usage of the
ReturnSubDevicesAsApiDevices flag.
Related-To: LOCI-4859
Signed-off-by: Latif, Raiyan <raiyan.latif@intel.com>
2023-10-10 17:05:00 +02:00
Mateusz Jablonski
420f273a6c
fix: don't wait on condition in unit tests
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-10 15:14:30 +02:00
Mateusz Hoppe
c2d69e5857
feature: allocate SPECIAL_SSH heap in front window from EXTERNAL heap
...
- SPECIAL_SSH is used for debug surface SurfaceState which must be
located at bindless offset zero
- limit size of external front window
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-10-09 14:54:39 +02:00
Filip Hazubski
0c8a514349
fix: Switch default device hierarchy to FLAT
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2023-10-06 15:24:50 +02:00
Filip Hazubski
08e92d154f
fix: Add getDefaultDeviceHierarchy call to GfxCoreHelper
...
Added getDefaultDeviceHierarchy call that describes default device
hierarchy for a gfx core. Refactored L0 and OCL paths to use this
value by default and override this value when user sets
ZE_FLAT_DEVICE_HIERARCHY environment variable or
ReturnSubDevicesAsApiDevices debug key.
Updated ReturnSubDevicesAsApiDevices to force COMPOSITE device hierarchy
when set to 0.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2023-10-06 12:32:41 +02:00
Mateusz Jablonski
382fc952f2
refactor: add NonAssignableClass to define classes without assign operator
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-05 15:16:58 +02:00
Mateusz Jablonski
712ba60452
fix: add unrecoverable to avoid nullptr access
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-05 13:20:21 +02:00
Mateusz Jablonski
6d259ac4b7
fix: add unrecoverable to avoid out of bound access
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-05 12:03:28 +02:00
Mateusz Jablonski
ad2701ad26
fix: add unrecoverable to avoid out of bound access
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-04 15:59:18 +02:00
John Falkowski
56f05303c9
feature: add support for zeMemGet/SetAtomicAccessAttributeExp
...
Resolves: NEO-8219
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2023-10-02 15:59:17 +02:00
Hoppe, Mateusz
5c565efe28
feature: bindless global heap with debugger
...
- program debugSurface's SurfaceState at the beginning of Bindless Surface
State Heap - SPECIAL_SSH
- ensure SPECIAL_SSH is resident
Related-To: NEO-7063
Signed-off-by: Hoppe, Mateusz <mateusz.hoppe@intel.com>
2023-09-29 13:13:46 +02:00
Mateusz Jablonski
a033df33ff
fix: remove preferSmallWorkgroupSizeForKernel method
...
Related-To: HSD-18033866078
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-29 11:55:09 +02:00
Mateusz Jablonski
3a21b3b228
refactor: remove not needed code
...
Related-To: NEO-7527
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-28 07:52:39 +02:00
Mateusz Jablonski
5f846d8a13
refactor: remove not needed code
...
Related-To: NEO-7527
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-27 18:17:04 +02:00
Mateusz Jablonski
03874b8815
refactor: remove not needed code
...
Related-To: NEO-7527
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-27 17:45:54 +02:00
Mateusz Jablonski
09044dfbaa
refactor: remove not needed code
...
Related-To: NEO-7527
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-27 14:35:49 +02:00
Kacper Nowak
160303924d
refactor: Correct logic for SIMD1
...
- For calculating number of threads per workgroup, for SIMD 1, return
local work size (each software thread should be mapped into a whole hardware
thread).
- Correct logic of calculating space for per thread data for SIMD 1.
- Minor: unit tests refactor.
- Corrected naming.
Related-To: NEO-8261
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-09-26 15:28:37 +02:00
Dominik Dabek
eebf2bbd26
performance(ocl): timestamp packet count per gfx
...
Add support for different timestamp packet counts per gfx family.
Change all packet counts to 1 except for xe-hpc.
Related-To: NEO-8154
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-09-25 20:34:58 +02:00
Maciej Plewka
8658fdb04e
fix: Use stack vec for api specific prefix
...
Related-To: NEO-8388, GSD-6296
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-09-21 16:10:54 +02:00
Maciej Bielski
97e7cda912
feature: Optimize intra-module kernel ISA allocations
...
So far, there is a separate page allocated for each kernel's ISA within
`KernelImmutableData::initialize()`. Apparently the ISA blocks are often
much smaller than a 64k page, which leads to poor memory utilization and
was even observed to cause the device OOM error if a single module has
several keys.
Improve the situation by reusing the parent allocation (owned by the
module instance) for modules, which kernel ISAs can fit together within
a single 64k page. This improves the memory utilization on a single
module level.
Related-To: NEO-7788
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2023-09-21 13:55:45 +02:00
Katarzyna Cencelewska
d7d46a9fc5
refactor: use initialized variable in getHighestEnabledDualSubSlice
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2023-09-20 14:49:56 +02:00
Mateusz Jablonski
b1808f7830
fix: correct suggested number of work groups for concurrent kernels on PVC
...
value depends on CCS count:
- single CCS mode (default) - 50% available
- two CCS mode - 25% available
- four CCS mode - 12.5% available
Related-To: NEO-8377
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-20 13:40:22 +02:00
Mateusz Hoppe
69f5ca6345
feature: bindless addressing - flush state cache after reusing SS slot
...
- when Surface State is reused for new resource, State Cache needs to be
invalidated
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-09-20 12:53:32 +02:00
Dunajski, Bartosz
d3d5da1f72
feature: initial 64b in-order CmdList support
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-19 15:23:37 +02:00
Compute-Runtime-Validation
913a926fd4
Revert "feature: Optimize intra-module kernel ISA allocations"
...
This reverts commit c348831470
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-19 14:16:05 +02:00
Maciej Bielski
c348831470
feature: Optimize intra-module kernel ISA allocations
...
So far, there is a separate page allocated for each kernel's ISA within
`KernelImmutableData::initialize()`. Apparently the ISA blocks are often
much smaller than a 64k page, which leads to poor memory utilization and
was even observed to cause the device OOM error if a single module has
several keys.
Improve the situation by reusing the parent allocation (owned by the
module instance) for modules, which kernel ISAs can fit together within
a single 64k page. This improves the memory utilization on a single
module level.
Related-To: NEO-7788
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2023-09-19 12:05:09 +02:00
Compute-Runtime-Validation
73731d3be5
Revert "fix: correct suggested number of work groups for concurrent kernels o...
...
This reverts commit 6fc673b0fe
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-15 04:21:58 +02:00
Mateusz Jablonski
6fc673b0fe
fix: correct suggested number of work groups for concurrent kernels on PVC
...
value depends on CCS count:
- single CCS mode (default) - no limitations
- two CCS mode - 25% available
- four CCS mode - 12.5% available
Related-To: NEO-8377
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-14 16:13:54 +02:00
Mateusz Jablonski
2f7c33c1fd
refactor: move xe hpg specific appendBlitCommandsBlockCopy to xe hpg file
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-13 10:49:28 +02:00
Dunajski, Bartosz
7562842a58
refactor: remove LogicalStateHelper
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-13 10:29:53 +02:00
Compute-Runtime-Validation
413365a7bf
Revert "fix: Correct logic for SIMD1"
...
This reverts commit fc099ead2e
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-13 08:23:59 +02:00
Kacper Nowak
fc099ead2e
fix: Correct logic for SIMD1
...
- For calculating number of threads per workgroup, treat simd 1 as it
was simd 32
- Correct logic of calculating space for per thread data for simd 1
- Minor: unit tests refactor
- Corrected naming
Related-To: NEO-8261
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-09-13 07:03:12 +02:00
Mrozek, Michal
d9f938f3db
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2023-09-12 14:25:04 +02:00
Dunajski, Bartosz
6648065703
feature: add indirect semaphore mode
...
Related-To: NEO-8242
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-12 13:15:51 +02:00
Dunajski, Bartosz
2a6be2fccd
feature: update conditional bb start to use qword data
...
Related-To: NEO-8242
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-12 11:24:28 +02:00
Dunajski, Bartosz
def3f2e9ad
refactor: improve semaphore programming
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-12 11:24:11 +02:00
Filip Hazubski
d7db6ac467
feature: Add preferredPlatformName field to RuntimeCapabilityTable
...
For all of the devices, preferredPlatformName is initialized with
nullptr by default and platform name will be initialized to driver's default
platform name, at the moment this is "Intel(R) OpenCL Graphics".
When Platform is initialized and preferredPlatformName is not nullptr then
Platform name will be set to the value stored in preferredPlatformName.
Add ENABLE_LEGACY_PLATFORM_NAME AIL enum related to added functionality.
Move PlatformInfo to NEO namespace.
Related-To: HSD-22018809561
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2023-09-12 11:07:14 +02:00
Mateusz Jablonski
c851896482
refactor: move XeHpg specific setExtraAllocationData definition to Xe Hpg file
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-11 13:32:00 +02:00
Compute-Runtime-Validation
1579c69316
Revert "performance: allocate timestamp packet tag buffer in local mem on DG2"
...
This reverts commit 819908ec94
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-11 12:07:26 +02:00
Fabian Zwolinski
b2ba1fbecf
fix: enforce zebin format by default in Ocloc for ICL/TGL/RKL/ADL
...
Products for which zebin has been set as default format in OCLOC:
- ICELAKE_LP
- TIGERLAKE_LP
- ROCKETLAKE
- ALDERLAKE_S
- ALDERLAKE_P
- ALDERLAKE_N
The default format does not override `--format` parameter.
Related-To: NEO-8334
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-09-08 16:16:18 +02:00
Fabian Zwolinski
10675134e1
feature: Add process safety to Windows compiler cache
...
Related-To: NEO-8092
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
Co-authored-by: Diedrich, Kamil <kamil.diedrich@intel.com>
2023-09-06 15:34:15 +02:00
Maciej Plewka
3b3e17e738
performance: Use vector for private allocs to reuse
...
Related-To: HSD-18033105655, HSD-18033153203
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-09-04 13:34:38 +02:00
Mateusz Jablonski
91b26277a4
feature: add method to adjust hw info for igc
...
Related-To: NEO-8203
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-04 11:58:13 +02:00
Jitendra Sharma
9818ef61a5
feature: Report correct GRF register count
...
Based on Large GRF enabled or not, report correct GRF
register.
Related-To: NEO-6788
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2023-09-04 11:42:48 +02:00
Compute-Runtime-Validation
154530ad23
Revert "feature: Report correct GRF register count"
...
This reverts commit 8eb3fe222e
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-01 15:12:57 +02:00
Jitendra Sharma
8eb3fe222e
feature: Report correct GRF register count
...
Based on Large GRF enabled or not, report correct GRF
register.
Related-To: NEO-6788
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2023-08-31 18:48:29 +02:00
Slawomir Milczarek
5833d65bc0
feature: Add support for memadvise with system preferred location
...
Related-To: NEO-7851
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2023-08-31 13:09:55 +02:00
Mateusz Jablonski
819908ec94
performance: allocate timestamp packet tag buffer in local mem on DG2
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-08-30 12:02:04 +02:00
Mateusz Hoppe
00de13939d
feature: bindless addressing allocator - reuse of bindless slots
...
- introduce 2 reuse pools to bindlessHeapHelper
- one pool stores slots for reuse, second pool stores released slots
- stateCacheDirty flags keep track of state cache - when pools are
switched - flags are set indicating flushing caches is needed after
old slots have been reused for new allocations
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-08-29 09:51:36 +02:00
Mateusz Jablonski
dbbc57777a
refactor: move MTL device ids to common place
...
Related-To: NEO-8187
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-08-25 00:12:24 +02:00
Mateusz Jablonski
db52486bbf
refactor: move MTL specific definition of ProductHelper to Xe Lpg
...
Related-To: NEO-8187
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-08-24 10:50:26 +02:00
John Falkowski
d49190f4ae
feature: Add debug/release variables prefixes
...
Add debug/release variables with prefixes for Level Zero, OpenCL and NEO
Resolves: NEO-6357
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2023-08-22 15:15:45 +02:00
Mateusz Hoppe
8435160db4
feature: bindless addressing for images
...
- program surface states for redescribed images correctly. Image copy
to/from memory are using redescribed surface states,
- refactor state base address programming - program address and size
together, set max size at the beginning due to lack of Enable flag
- set GpuBase in WddmAllocation when external heap is used
- return max ssh required size from kernelInfo or based on stateful args
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-08-18 15:59:20 +02:00
Artur Harasimiuk
f6e0c0cf89
Revert "feature: Add debug/release variable prefixes"
...
This reverts commit ec95d9314a
.
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2023-08-18 12:42:39 +02:00
John Falkowski
ec95d9314a
feature: Add debug/release variable prefixes
...
Resolves: NEO-6357
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2023-08-10 14:01:09 +02:00
Fabian Zwolinski
6fca8ee195
refactor: Remove SourceLevelDebugger
...
Removed:
- SourceLevelDebugger (with tests)
- DebuggerLibrary
- DebuggerLibraryRestore
- debuggerSupported field from hwInfo.capabilityTable
- HasSourceLevelDebuggerSupport matcher
- ExperimentalEnableSourceLevelDebugger debug var
- EnableMockSourceLevelDebugger debug var
- DebuggerOptDisable debug var
- lib_names.h.in file
- third_party/source_level_debugger/igfx_debug_interchange_types.h
Related-To: NEO-7213
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-08-10 11:14:02 +02:00
Mateusz Hoppe
bcba74f839
feature: bindless global allocator with scratch
...
- allocate SSH in cmdContainer when scratch allocation used with
private heaps
- scratch SurfaceStates are addressed relative to
SurfaceStateBaseAddress and have to be placed on SSH
- remove not used SCRATCH_SSH heap type from bindelssHeapHelper
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-08-03 13:24:59 +02:00
Mateusz Jablonski
659f7b645f
refactor: move methods from CompilerProductHelper to ReleaseHelper
...
Related-To: NEO-7786
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-08-02 10:40:48 +02:00
Mateusz Jablonski
02aa4b6acc
refactor: unify isMatrixMultiplyAccumulateSupported definition
...
Related-To: NEO-7786
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-08-01 14:39:53 +02:00
Zbigniew Zdanowicz
22ab72ed3d
performance: do not flush caches when writing global timestamp
...
Related-To: HSD-18032167035
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-08-01 10:46:18 +02:00
Mateusz Jablonski
1e95ed33f9
refactor: remove redundant parameter from HardwareInfo ctor
...
hw ip version is queried from KMD or set based on device id and rev id
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-07-31 16:12:57 +02:00
Mateusz Jablonski
27e459dfd0
fix: add missing cache flushes on MTL and later integrated GPUs
...
hdc pipeline / untyped dataport cache flushes were applied only on discrete GPUs
Related-To: GSD-5085
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-07-28 19:30:13 +02:00
Mateusz Jablonski
f84f22d23c
refactor: remove unused isSharedContext variable
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-07-26 12:14:23 +02:00
Mateusz Hoppe
67d39f88e6
feature: bindless addressing - store bindlessInfo in allocation
...
- store surface state info for bindless addressing in graphics
allocation
- remove map in BindlessHeapsHelper - bindlessInfo is constant for
the lifetime of an allocation
- program bindless offsets and surface states for images when used in
bindless kernel
- handle ouf of memory on surface state heap - return error
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-07-24 14:48:35 +02:00
Kacper Nowak
b908203001
fix: Compile built-ins per release
...
- Preserve releases on CMake level.
- Instead of generating builtins per platform, generate them per-release
(+ correct naming accordingly).
- Stop using revisions in builtin compilation logic path, as they are
already embedded in release (device ip).
- Remove platform names & revisions from names for generated files
(related to builtins).
- Remove unnecessary code, refactor ULT logic.
Related-To: NEO-7783
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-07-11 16:02:36 +02:00
Andrzej Ratajewski
dc0796c2a1
feature: Add cl_khr_spirv_linkonce_odr to supported extensions
...
Related-To: NEO-8165
Signed-off-by: Andrzej Ratajewski <andrzej.ratajewski@intel.com>
2023-07-11 13:19:55 +02:00
Young Jin Yoon
81822e3716
refactor: rename pageSize2Mb to pageSize2M
...
The previous name "pageSize2Mb" defined in
shared/source/helpers/constant.h is inconsistent to other variable,
i.e. pageSize64k.
Furthermore, it's a bit misleading because the page size is defined in
Megabytes (MB), not in Megabits (Mb).
Related-to: NEO-7695
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2023-07-10 20:12:09 +02:00
Cencelewska, Katarzyna
aa0beb8191
fix: Unify logic calculating threads per work group part 4
...
- also use helper when checking that is simd1 to have same flow
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-07-07 15:34:59 +02:00
Mateusz Hoppe
4aba0f0340
feature: global bindless surface state base support
...
- program global bindless ssba when external allocator used (
UseExternalAllocatorForSshAndDsh)
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-07-06 18:31:49 +02:00
Dominik Dabek
f1f0f2410c
refactor: remove unused getBaseTimeout
...
Related-To: GSD-3612
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-07-05 16:12:15 +02:00
Mateusz Hoppe
0cc8330a1a
refactor: Unify bindless surface state base programming
...
- program BSSBA in one place when sbaProperties used
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-07-04 16:34:43 +02:00
Cencelewska, Katarzyna
61f701aba5
fix: Unify logic calculating threads per work group part 3
...
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-07-04 15:27:44 +02:00
Cencelewska, Katarzyna
2e17c21728
fix: Unify logic calculating threads per work group part 2
...
- use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to
have same flow and proper values of threads per work groups
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-07-04 10:34:02 +02:00
Compute-Runtime-Validation
39740da9d1
Revert "fix: Unify logic calculating threads per work group part 2"
...
This reverts commit 1e8a53bd53
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-07-02 07:09:14 +02:00
Cencelewska, Katarzyna
1e8a53bd53
fix: Unify logic calculating threads per work group part 2
...
- use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to
have same flow and proper values of threads per work groups
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-06-30 14:16:08 +02:00
Zbigniew Zdanowicz
21823af419
performance: add skeleton method to cmdlist immediate flush task
...
Related-To: NEO-7808
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-06-30 10:46:20 +02:00
Dominik Dabek
10ac167bdc
fix(ocl): do not multiply kmdNotify waitTimeout
...
With direct submission disabled this resulted in waitTimeout long enough
that kmdWait fallback was rarely used.
This caused more CPU spin time.
Related-To: GSD-3612
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-06-29 16:06:28 +02:00
Cencelewska, Katarzyna
0d7aefe66b
fix: Unify logic calculating threads per work group part 1
...
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-06-29 10:43:22 +02:00
Dunajski, Bartosz
ecb415bf62
feature: reenable RelaxedOrdering
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-06-28 12:20:17 +02:00
Cencelewska, Katarzyna
68d81c82a7
fix: Use proper value about hw local id generations
...
- remove useless flag ForceNumberOfThreadsInGpgpuThreadGroup
- add new flag "RemoveRestrictionsOnNumberOfThreadsInGpgpuThreadGroup"
to restore old path without restrictions about number of threads in
thread group
- fix forwarding information about hw local ids generations to
calculate numOfThreadsInThreadGroup correctly
Related-To: NEO-7952, NEO-7982
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-06-26 16:35:42 +02:00
Dunajski, Bartosz
2b5e475db9
refactor: use hex values to print TSP usage
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-06-23 11:22:10 +02:00
Dunajski, Bartosz
b004a27e4e
refactor: Debug flag to print TSP usage
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-06-22 14:47:39 +02:00
Lukasz Jobczyk
0bc5eead84
fix: Remove not needed BCS split helper
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-06-22 14:36:22 +02:00
Fabian Zwolinski
99d0823e8f
fix: Append extra extensions when FP64 emulation is enabled
...
Related-To: NEO-7611
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-06-22 08:38:53 +02:00
Dunajski, Bartosz
46e8c3f5dd
fix: reenable RelaxedOrdering for OCL
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-06-20 13:05:25 +02:00
Cencelewska, Katarzyna
7cb3278eb3
fix: add function to calculate number of threads per tg
...
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-06-13 14:02:24 +02:00
Dunajski, Bartosz
3d49658f50
feature: new multitile post sync layout for immediate write [2/n]
...
No functional changes in this commit. This is prework.
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-06-09 14:20:34 +02:00
Jaime Arteaga
2efd6e547a
feature: Add support for chunking in the UMD (1/N)
...
Read if support for chunking is available in the KMD.
If available, KMD will create a BO with 1 or more chunks,
depending on the chunk size selected.
Related-To: NEO-7695
Sync to
https://github.com/intel-gpu/drm-uapi-helper/releases/tag/v2.0-rc18
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2023-06-02 23:27:40 +02:00
Daria Hinz
68b6cfebd2
feature: Add default ip version values to helper
...
Signed-off-by: Daria Hinz <daria.hinz@intel.com>
Related-To: NEO-8021
2023-06-02 12:10:21 +02:00
Cencelewska, Katarzyna
d2436a8231
fix: add limitations for setting gmm flag Cacheable
...
- move isCachingOnCpuAvailable to product helper
- isCachingOnCpuAvailable should return false on mtl
- if wsl, skip checking method from product helper
Related-To: NEO-7194
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-05-30 17:04:57 +02:00
Mateusz Jablonski
61055478d4
fix: adjust scope of disable L3 for debug WA
...
Related-To: HSD-1609398399
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-05-30 14:23:16 +02:00
Dunajski, Bartosz
5aeffbf673
refactor: define initial value for TimestampPacket
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-05-30 12:09:05 +02:00
Mateusz Hoppe
1c196b9f3d
refactor: change ApiSpecificConfig functions names
...
- better description of the meaning of functions
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-05-30 09:20:01 +02:00
Mateusz Jablonski
c544004b8e
fix: move getProductConfigFromHwInfo to CompilerProductHelper
...
add tests for default PVC configs
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-05-29 14:08:52 +02:00
Dunajski, Bartosz
814de81aca
feature: replace in-order sync allocation with TimestampPacket
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-05-29 09:32:35 +02:00
Daria Hinz
331f167cfe
feature: Add debug flag for setting hw ip version
...
Signed-off-by: Daria Hinz <daria.hinz@intel.com>
Related-To: NEO-7954
2023-05-23 15:32:46 +02:00
Filip Hazubski
d234bc970d
refactor: Move getMaxNumSamplers function to ProductHelper
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2023-05-18 09:25:07 +02:00
Cencelewska, Katarzyna
5f22e9eaca
fix: don't set Cacheable on xe_hp and later
...
Related-To: NEO-7194
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-05-18 09:17:32 +02:00
Lukasz Jobczyk
9f755b54ad
performance: Bidirectional BCS split implementation for OCL
...
Related-To: NEO-7877
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-05-16 13:34:18 +02:00
Milczarek, Slawomir
66eb1c9c0a
refactor: Add helpers to control kmd migration support on PVC platform
...
This commit keeps KMD migration still disabled by default on PVC platform.
Related-To: NEO-6465
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2023-05-15 13:51:19 +02:00
Daria Hinz
1d64b94f30
feature: Match config based on device & rev ID
...
This change matches the appropriate aot config
for the combination of device ID and revision ID.
Signed-off-by: Daria Hinz <daria.hinz@intel.com>
Related-To: NEO-7905
2023-05-10 08:22:09 +02:00
Dunajski, Bartosz
cfacbbd811
refactor: Simplify OverrideBlitterMocs usage
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-05-09 19:22:57 +02:00
Warchulski, Jaroslaw
7fdf4985a3
feature: add support for cl_khr_external_memory extension
...
Related-To: NEO-7069
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-05-05 15:51:39 +02:00
Fabian Zwolinski
cbce863dc2
refactor: Rename member variables to camelCase 3/n
...
Additionally enable clang-tidy check for member variables
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-04-28 16:01:14 +02:00
Kamil Kopryk
171e97e71a
feature: introduce release helper
...
Related-To: NEO-7786
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-04-27 21:38:08 +02:00
Fabian Zwolinski
e351a90f81
refactor: Rename member variables to camelCase 2/n
...
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-04-27 20:39:22 +02:00
Mateusz Jablonski
2f9135a4e6
fix: change type of container with registered engines per root device
...
use StackVec instead of unordered map
resize container at MemoryManager's creation time
Related-To: NEO-7925
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-04-27 17:06:42 +02:00
Mateusz Jablonski
32d8a3bc6d
fix: store registered engines per root device
...
in most cases we need to iterate over engines associated to single root device
Related-To: NEO-7925
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-04-27 10:54:07 +02:00
Fabian Zwolinski
2022592f3d
Apply CamelCase for class and struct names 2/2
...
Additionally change .clang-tidy not to ignore struct names.
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-04-25 13:10:23 +02:00
Milczarek, Slawomir
bf778be99e
[fix] zeCommandListAppendMemAdvise to clear preferred location correctly
...
The memadvise with preferred location for kmd-migrated shared allocation
is set to device associated with cmd list by default to migrate data
to lmem on non-atomic gpu page fault as well (for performance reasons).
Related-To: NEO-7252
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2023-04-24 14:51:49 +02:00
Andrzej Koska
b22f3f0796
Fix for handling incorrect patchSize in patchWithRequiredSize
...
This fix adds additional size control to patched memory
Related-To: NEO-7760
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2023-04-20 16:07:45 +02:00
Daria Hinz
2647efc23c
feat: Add support for ip version value in ocloc
...
Ocloc supports passing hw ip version value to -device arg in
the form of major.minor.revision.
This change adds support for directly passed value as uint32_t as well.
Support added for single and fat binary.
Signed-off-by: Daria Hinz <daria.hinz@intel.com>
Related-To: NEO-7903
2023-04-20 13:20:11 +02:00
Mateusz Jablonski
688d958fe8
fix: handle single storage USM in gtpin helpers
...
use GPU address from gpu allocation instead of CPU allocation
check page fault manager presence before migrating to GPU domain
Related-To: NEO-7690
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-04-19 14:26:25 +02:00
Mateusz Hoppe
97a8f15fbd
fix: cleanup memory leaks in ocloc
...
- enable memory leaks detection in ocloc tests
Related-To: NEO-7467
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-04-19 13:48:49 +02:00
Kacper Nowak
c7adbc2140
Add debug key for dumping ELF to file
...
Add "DumpZEBin" debug flag. When this flag is enabled, Zebin will be
dumped to a .elf file (with appropiate suffix, in case such file has
been dumped before).
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
Related-To: NEO-7895
2023-04-18 20:40:25 +02:00
Maciej Plewka
730578112f
Report cl_khr_integer_dot_product extension
...
Related-To: NEO-6206
With this commit OpenCL will report cl_khr_integer_dot_product extension
in version 2. With all properties enabled.
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-04-14 14:46:18 +02:00
Konstanty Misiak
1f37e69fd2
Refactor of IO functions
...
Related-To: NEO-4562
Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>
2023-04-13 10:46:47 +02:00
Maciej Plewka
095108edd8
Abort when blit resolve operation dispatched for different tilings
...
Related-To: NEO-7872, NEO-7873, NEO-7874, NEO-7875
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-04-12 13:17:00 +02:00
Compute-Runtime-Validation
6357ab73ea
Revert "Report cl_khr_integer_dot_product extension"
...
This reverts commit 31983ec043
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-04-06 14:38:45 +02:00
Maciej Plewka
31983ec043
Report cl_khr_integer_dot_product extension
...
Related-To: NEO-6206
With this commir OpenCL will report cl_khr_integer_dot_product extension
in version 2. With all properties enabled.
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-04-05 20:28:00 +02:00
Zbigniew Zdanowicz
4c7bc2ca98
[feature, perf] add alogrithm to chain command buffers in container
...
This feature is part of performance improvement to dispatch and start
command buffers as primary batch buffers.
When exhausted command buffer is closed, then reserve exact space for chained
batch buffer start and bind it to the next command buffer.
When closing command buffer, then save ending pointer and
reserve aligned space.
Related-To: NEO-7807
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-04-05 15:49:01 +02:00
Compute-Runtime-Validation
c927c46495
Revert "Report cl_khr_integer_dot_product extension"
...
This reverts commit dbd9b19b35
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-04-05 04:16:29 +02:00
Maciej Plewka
dbd9b19b35
Report cl_khr_integer_dot_product extension
...
Releated-to: NEO-6206
With this commir OpenCL will report cl_khr_integer_dot_product extension
in version 2. With all properties enabled.
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-04-04 11:15:38 +02:00
Mateusz Jablonski
ab265bca93
fix blitter programming for media compression on xe hpg platforms
...
- when destination surface is media compressed then disable compression bit
- rename command field CompressionType->ControlSurfaceType
- program this field only on Xe Hpg platforms
Related-To: NEO-7415
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-31 17:15:28 +02:00
Rafal Maziejuk
b9828b543e
feature: adjust maxWorkGroupSize value
...
Related-To: NEO-7357
Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com>
2023-03-28 15:19:52 +02:00
Zbigniew Zdanowicz
6437c1a91e
Flush state caches after command list is destroyed
...
When state base address tracking is enabled and command list use private heaps
then command list at destroy time must calls all compute CSRs that were using
that heap to invalidate state caches.
This allows new command list to reuse the same heap allocation for different
surface states, so before new use cached states are invalidated.
Related-To: NEO-5055
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-03-28 14:52:30 +02:00
Mateusz Jablonski
5d482ffe87
refactor: add constexpr for tera byte
...
Related-To: NEO-7665
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-28 12:55:52 +02:00
Dunajski, Bartosz
b3c2fa41c5
OCL: Optimize IOQ barriers handling
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-03-24 12:52:47 +01:00
Zbigniew Zdanowicz
179429b25c
Add stateless arguments checker for single kernel
...
Related-To: NEO-5055
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-03-23 10:55:15 +01:00
Mateusz Jablonski
5610eae710
refactor: fix typo Barrierl -> Barrier
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-21 15:58:24 +01:00
Cencelewska, Katarzyna
a4a296d59f
wa: enable wa to add additional dummy blits after blit copy
...
- reduce number of dummy blits where are not needed
- track if dummy blit required in cmdlist
Related-To: NEO-7450
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-03-17 10:43:00 +01:00
Filip Hazubski
0bee81c0c0
refactor: Move isLinearStoragePreferred function from gfx to product helper
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2023-03-15 18:51:59 +01:00
Mateusz Jablonski
e3c5cbc22f
fix ocl/ocloc: expose cl_ext_float_atomics for OCL 2.0+
...
Related-To: NEO-7800
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-15 14:48:17 +01:00
Mateusz Jablonski
db719b2acb
refactor: rename function getExtensions->getDeviceExtensions
...
Related-To: NEO-7800
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-15 12:46:46 +01:00
Mateusz Jablonski
028e1bb1b6
fix ocloc: define extensions in one place
...
unify exposed device extensions between OCL and ocloc
Related-To: NEO-7800
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-15 11:14:46 +01:00
Dunajski, Bartosz
60741b05aa
Add missing struct member initialization
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-03-15 10:09:05 +01:00
Zbigniew Zdanowicz
b6d0a64638
Revert: "compute events select device memory for xe hpg family"
...
This reverts commit 072233d170
.
Resolves: HSD-18028222329
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-03-14 17:18:59 +01:00
Mateusz Jablonski
faba16f657
refactor: move extensions string creation to CompilerProductHelper
...
Related-To: NEO-7800
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-14 16:27:21 +01:00
Dunajski, Bartosz
477448c097
RelaxedOrdering dispatch for OCL 2/n
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-03-14 15:48:06 +01:00
Mateusz Jablonski
39eab81c83
refactor: add separate helper for extension support
...
cl_intel_subgroup_split_matrix_multiply_accumulate
Related-To: NEO-7800
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-14 15:08:29 +01:00
Mateusz Jablonski
340f932ca2
refactor: move GfxCoreHelper::getExtensions to CompilerProductHelper
...
Related-To: NEO-7800
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-14 13:56:19 +01:00
Mateusz Jablonski
483f9fe340
refactor: move extensions support logic to compiler product helper
...
- cl_intel_subgroup_local_block_io
- cl_intel_dot_accumulate
- cl_intel_create_buffer_with_properties
- cl_khr_subgroup_named_barrier
- cl_intel_subgroup_extended_block_read
Related-To: NEO-7800
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-14 12:09:26 +01:00
Dunajski, Bartosz
97a35debde
RelaxedOrdering dispatch for OCL 1/n
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-03-13 17:37:12 +01:00
Zhenjie Pan
00b675643e
fix: missed error handler of SubmissionStatus::FAILED
...
Related-To: NEO-7802
Signed-off-by: Pan Zhenjie <zhenjie.pan@intel.com>
2023-03-13 16:08:27 +01:00
Raiyan Latif
d5c909c9f9
Fix calculation of number of Ray-Tracing stacks
...
MaxDualSubSlicesSupported is filled inside GT_SYSTEM_INFO
structure when querying the KMD appropriately with the
number of enabled DualSubSlices. However we need to find
the highest index of the last enabled DualSubSlice.
For proper allocation of thread scratch space, allocation
has to be done based on native die config (including unfused
or non-enabled DualSubSlices). Since HW doesn't provide us a
way to know the exact native die config, in SW we need to
allocate RT stacks with enough size based on the last used
DualSubSlice.
The IsDynamicallyPopulated field in GT_SYSTEM_INFO is used to
indicate if system details are populated either via Fuse reg.
or hard-coded. Based on this field's value, we calcuate the
numRtStacks appropriately.
Related-To: LOCI-3954
Signed-off-by: Raiyan Latif <raiyan.latif@intel.com>
2023-03-13 10:48:10 +01:00
Kamil Kopryk
5a99635232
Don't check addressing mode for gtpin surface
...
If gtpin is used, then don't check addressing mode
of the last explicit arg, which is
gtpin's surface.
Related-To: NEO-6075
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-03-10 16:01:24 +01:00
Cencelewska, Katarzyna
398c7b2d29
refactor, remove typo in struct name
...
change name of EncodeSempahore to EncodeSemaphore
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-03-10 15:44:25 +01:00
Mateusz Jablonski
89cd4bd4a5
refactor: move getters for device extension support to compiler product helper
...
isMatrixMultiplyAccumulateSupported
isBFloat16ConversionSupported
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-10 15:41:26 +01:00
Kamil Kopryk
fa8579602f
refactor: rename product helper files n/n
...
Related-To: NEO-7703
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-03-10 13:24:38 +01:00
Zbigniew Zdanowicz
0950f5a23e
Set global heap size to constant value
...
Related-To: NEO-5055
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-03-09 17:17:32 +01:00
Mateusz Jablonski
5080250a64
refactor: remove include from enable_compiler_product_helper files
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-09 11:21:37 +01:00
Cencelewska, Katarzyna
c274309d7b
wa: add dummy blits before command MI_FLUSH_DW
...
to guarantee that all subblt got complete for previous copy
affect xe hpg
temporary changes under flag ForceDummyBlitWa
Related-To: NEO-7450
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-03-09 10:40:35 +01:00
Mateusz Hoppe
37dbec305d
feature: add AssertHandler
...
- initial implementation to support assert() on GPU
Related-To: NEO-5753
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-03-08 17:55:23 +01:00
Dominik Dabek
69a16fd3ed
feature: check indirect access for kernel
...
Do not make indirect allocations resident if kernel does not use
indirect access.
For both level zero and opencl.
Currently disabled by default, enable with debug flag
DetectIndirectAccessInKernel
Related-To: NEO-7712
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-03-08 16:58:26 +01:00