Bartosz Dunajski
c3312f21f7
feature: don't initialize in-order TS nodes
...
Related-To: NEO-11925
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-23 13:31:23 +02:00
Bartosz Dunajski
3758e99cbf
feature: initial support for standalone CB Events Timestamps allocator
...
Related-To: NEO-11925
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-22 10:13:55 +02:00
Mateusz Hoppe
b6299b8a21
feature: add support for HP copy engine context
...
- add support for context group with HP copy engine
- choose HP copy engine when available
Related-To: NEO-11983
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-19 12:23:03 +02:00
Compute-Runtime-Validation
7ad15639fc
Revert "feature: add support for HP copy engine context"
...
This reverts commit 3fbcbcaef2.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-18 21:02:14 +02:00
Mateusz Hoppe
3fbcbcaef2
feature: add support for HP copy engine context
...
- add support for context group with HP copy engine
- choose HP copy engine when available
Related-To: NEO-11983
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-18 16:07:07 +02:00
Filip Hazubski
46f9133bf2
fix: Correct logic to select internal BCS engine
...
When BCS3 is not available, use last available copy engine as internal.
Related-To: HSD-18039263936
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-07-18 12:02:50 +02:00
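Note on 46f9133bf2: the fallback rule in the commit body ("when BCS3 is not available, use last available copy engine as internal") can be pictured with the minimal sketch below. It is only an illustration under stated assumptions, not the driver's code: the function name `selectInternalCopyEngine`, the nine-entry availability mask, and the `-1` "none" result are all hypothetical, and "last available" is read here as "highest available index".

```cpp
#include <bitset>
#include <cstddef>

// Hypothetical upper bound on copy engines, chosen only for this sketch.
constexpr std::size_t maxCopyEngines = 9;

// Returns the index of the copy engine to use as the internal one:
// BCS3 when present, otherwise the highest-indexed available engine,
// or -1 when no copy engine is available at all.
inline int selectInternalCopyEngine(const std::bitset<maxCopyEngines> &available) {
    constexpr std::size_t preferredIndex = 3; // BCS3
    if (available.test(preferredIndex)) {
        return static_cast<int>(preferredIndex);
    }
    // Walk from the highest index down to find the last available engine.
    for (std::size_t i = available.size(); i-- > 0;) {
        if (available.test(i)) {
            return static_cast<int>(i);
        }
    }
    return -1;
}
```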
Mateusz Jablonski
778645c11e
feature: add initial support for Xe2 platforms
...
Related-To: NEO-8188, NEO-10774
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-09 10:30:27 +02:00
Mateusz Jablonski
f171a22c0a
refactor: include reg_configs_common using full path
...
add missing register offset value
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-03 13:30:46 +02:00
Szymon Morek
3dd051c3ee
performance: adjust compression handling
...
Related-To: NEO-11882
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-03 09:37:11 +02:00
Morek, Szymon
aed96cede4
fix: flush barrier when profiling enabled
...
Related-To: NEO-10615
Signed-off-by: Morek, Szymon <szymon.morek@intel.com>
2024-07-02 13:25:07 +02:00
Dominik Dabek
76e8be5c39
fix(ocl): handle gl sharing displayable textures
...
Displayable textures always need dc flush.
Related-To: NEO-11694
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-06-27 13:43:20 +02:00
Mateusz Jablonski
85289f6658
refactor: extract common code for populating ftr and wa table
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-06-27 13:00:15 +02:00
Mateusz Jablonski
d51997be4f
fix: setup L3 cache config when available from release helper
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-06-26 16:55:51 +02:00
Mateusz Jablonski
66d4d141e1
fix: extract common logic for filling default gt system info
...
gt system info should be queried from KMD
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-06-25 20:21:57 +02:00
Michal Mrozek
a3c3b6533a
performance: improve pool handling
...
Related-To: NEO-11731
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-06-25 17:04:17 +02:00
Compute-Runtime-Validation
7136dfbd38
Revert "performance: improve pool handling"
...
This reverts commit 5f0b9efd2b.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-06-24 12:47:19 +02:00
Kamil Kopryk
d97dccc409
refactor: rename MaximumNumberOfThreads debug key
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-19 13:06:42 +02:00
Ratajewski, Andrzej
f7888fac0d
feature: Enable support for cl_intel_subgroup_matrix_multiply_accumulate_tf32
...
This change enables support for
`cl_intel_subgroup_matrix_multiply_accumulate_tf32` extension for PVC B0
and later.
Related-To: GSD-7825
Signed-off-by: Ratajewski, Andrzej <andrzej.ratajewski@intel.com>
2024-06-19 11:43:03 +02:00
Kamil Kopryk
f05ff9b156
refactor: rename OverDispatchControl debug key
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-18 18:59:43 +02:00
Kamil Kopryk
c40f0f2421
refactor: rename ComputeOverdispatchDisable debug key
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-17 13:18:29 +02:00
Pawel Szymichowski
521add6ca7
feature: Enable support for cl_intel_subgroup_buffer_prefetch
...
This change enables support for cl_intel_subgroup_buffer_prefetch extension for
PVC and later.
Related-To: GSD-7825
Signed-off-by: Ratajewski, Andrzej <andrzej.ratajewski@intel.com>
2024-06-14 20:44:52 +02:00
Mrozek, Michal
5f0b9efd2b
performance: improve pool handling
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
Resolves: NEO-11731
2024-06-14 12:02:34 +02:00
Bartosz Dunajski
da9c009b88
feature: assign unique interrupt to queue
...
Related-To: NEO-8179
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-07 10:06:31 +02:00
Mateusz Jablonski
01c36a38c6
feature: add support for generic id acronyms in ocloc
...
Related-To: NEO-11568
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-06-06 10:06:44 +02:00
Mateusz Jablonski
d7c7cb203c
fix: add missing support for acronyms without dashes
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-05-29 17:07:06 +02:00
Aleksander Czerwionka
683eb97d9e
refactor: [ocloc] remove dead code in gen12lp backward compatibility
...
Related-To: NEO-10148
Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-05-29 16:26:31 +02:00
Chodor, Jaroslaw
ad155da67a
feature: Enabling platforms in ocloc by default
...
This feature enables supported platforms in ocloc even
if they are not enabled for the driver.
This allows sharing a single ocloc instance across multiple driver-platform
configurations.
Related-To: NEO-10531
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2024-05-28 12:09:47 +02:00
Compute-Runtime-Validation
9d312995e2
Revert "feature: split cache_policy_dg2_and_later.inl file"
...
This reverts commit 5df982a85d.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-28 07:03:47 +02:00
Aleksander Czerwionka
251463d77a
feature: [Ocloc] provide backward compatibility for gen12lp family merge
...
Related-To: NEO-10148
Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-05-27 17:35:31 +02:00
Jitendra Sharma
5df982a85d
feature: split cache_policy_dg2_and_later.inl file
...
Move the getCachingPolicyOptions method from the existing
cache_policy_dg2_and_later.inl to a new
get_caching_policy_options.inl file.
This allows getCachingPolicyOptions to be reused
in any newly created cache_policy_* file.
Related-To: NEO-8306
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-05-27 12:34:14 +02:00
Aravind Gopalakrishnan
b39e99a17a
feature: Prepare hw info to report CXL type
...
Related-To: NEO-11362
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-05-24 07:09:42 +02:00
Fabian Zwoliński
78bd3da078
fix: ocloc - add bindless mode options if bindless is enabled
...
If release helper is not present,
do not add bindless mode options.
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-05-23 14:10:28 +02:00
Mateusz Jablonski
2577f1123e
refactor: remove not needed code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-05-22 14:01:21 +02:00
Fabian Zwoliński
59829aeb11
fix(ocl): Force rebuild if binary contains bindless kernel
...
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-05-20 14:15:17 +02:00
Maciej Plewka
90df4b298b
fix: cache flush dependency for queue blocked
...
Related-to: NEO-9872, HSD-18038461954
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-20 11:55:24 +02:00
Young Jin Yoon
e204d27190
fix: print to stdout for disable scratch page
...
Modified to print out error messages to stdout when disable scratch page
is used.
Related-To: GSD-7611
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-05-16 15:05:07 +02:00
Aravind Gopalakrishnan
12ab4e5446
feature: Add 3 channel 8/16/32 bit formats
...
Related-To: NEO-11292
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-05-16 13:28:04 +02:00
Dominik Dabek
b4d839fe29
performance(usm): l0, add usm host memory pooling
...
Disabled by default.
Related-To: NEO-11356
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 15:20:51 +02:00
Ratajewski, Andrzej
0b0f7bd4cd
feature: Enable support for cl_intel_subgroup_2d_block_io
...
This change enables support for cl_intel_subgroup_2d_block_io extension for
PVC and later.
Related-To: GSD-7825
Signed-off-by: Ratajewski, Andrzej <andrzej.ratajewski@intel.com>
2024-05-14 15:16:49 +02:00
Maciej Plewka
e39893485c
fix: add cache flush as dependency for bcs ccs synchronization
...
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-09 13:43:39 +02:00
Mateusz Hoppe
76af952fcb
feature: Bindless sampled image support
...
Related-To: NEO-10352
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-05-08 17:50:37 +02:00
Aleksander Czerwionka
4667f55ea2
fix: add missing checks to return proper OpenCL error from API
...
changes affect clEnqueueNDCountKernelINTEL and
clGetKernelMaxConcurrentWorkGroupCountINTEL
Related-To: NEO-9688
Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-05-07 20:32:36 +02:00
Bartosz Dunajski
e5882e0d31
feature: pass GraphicsAllocation to fence wait
...
Related-To: NEO-8179
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-07 17:59:28 +02:00
Andrzej Koska
e44c7f2627
fix: BCS MMIO programming
...
When remap is enabled, we must set different base offset for copy engines.
Copy engines must use BCS0 base.
Related-To: NEO-10678
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-05-07 14:08:28 +02:00
Katarzyna Cencelewska
e9f7df6ae6
refactor: create helper for maxPtssIndex
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-05-07 12:47:52 +02:00
Bartosz Dunajski
ce36812f8d
feature: obtain GraphicsAllocation from external CB event
...
Related-To: NEO-8179
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-06 22:52:36 +02:00
Compute-Runtime-Validation
8342c0ae2f
Revert "fix: add cache flush as dependency for bcs ccs synchronization"
...
This reverts commit 5e57bb2a32.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-01 03:05:47 +02:00
Maciej Plewka
5e57bb2a32
fix: add cache flush as dependency for bcs ccs synchronization
...
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-04-30 11:57:49 +02:00
Bartosz Dunajski
2a2596c13b
refactor: pass additional data to ioctl helper
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-26 14:53:14 +02:00
John Falkowski
f1c2ea0b69
fix: kernel access to multiple stateful virtual regions
...
Related-to: NEO-8350
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2024-04-26 11:18:24 +02:00
Michal Mrozek
4aa7c6c99e
performance: optimize counter based waiting schemes
...
- store latest waited counter value.
- do not wait on values that are already completed.
- disable mechanism when counter overflows.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-04-25 17:20:29 +02:00
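Note on 4aa7c6c99e: the three bullets in the commit body (store the latest waited counter value, skip waits on values that are already completed, disable the mechanism on counter overflow) can be sketched roughly as follows. This is a simplified illustration, not the driver's implementation; the class `WaitedCounterCache` and all of its members are hypothetical names.

```cpp
#include <cstdint>

// Hypothetical sketch: caches the highest counter value already known to be
// complete, so that later waits on lower or equal values can return early.
class WaitedCounterCache {
  public:
    // True when `value` is covered by an earlier completed wait.
    bool isAlreadyCompleted(uint64_t value) const {
        return enabled && value <= latestWaited;
    }

    // Record that a wait on `value` has completed.
    void onWaitCompleted(uint64_t value) {
        if (enabled && value > latestWaited) {
            latestWaited = value;
        }
    }

    // After the counter wraps, cached values no longer compare meaningfully,
    // so the shortcut is switched off.
    void onCounterOverflow() {
        enabled = false;
        latestWaited = 0;
    }

  private:
    uint64_t latestWaited = 0;
    bool enabled = true;
};
```

A caller would check `isAlreadyCompleted()` before issuing a real wait and call `onWaitCompleted()` once that wait returns.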
Katarzyna Cencelewska
eb256f1ffd
refactor: use encode surface state to set pitch for scratch
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-04-25 11:57:14 +02:00
Mateusz Hoppe
d38ac4806b
feature: global bindless sampler offsets
...
- use global heap base for DSH when bindlessHeapsHelper enabled
Related-To: NEO-10505
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-04-24 21:12:17 +02:00
Andrzej Koska
625d2d7f50
refactor: Passing information about the engine
...
Part 2
Related-To: NEO-10678
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-04-24 09:08:35 +02:00
Compute-Runtime-Validation
04c32d9e4e
Revert "fix: allow kernel access across multiple virtual regions"
...
This reverts commit b9c1ef65dd.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-04-24 04:00:15 +02:00
John Falkowski
b9c1ef65dd
fix: allow kernel access across multiple virtual regions
...
Related-To: NEO-8350
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2024-04-23 18:20:52 +02:00
Andrzej Koska
ae139aeffd
refactor: Passing information about the engine
...
Extension of the interface with information
about the engine type passed to the function
Related-To: NEO-10678
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-04-22 13:33:56 +02:00
Zbigniew Zdanowicz
74fbd32515
feature: add wrapper to access template method getting local id generation
...
Related-To: NEO-10066
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-04-18 11:45:56 +02:00
Jemale Lockett
79f8993220
refactor: move GPR15 to separate debugger context
...
Signed-off-by: Jemale Lockett <jemale.lockett@intel.com>
2024-04-18 10:42:23 +02:00
Bartosz Dunajski
02ebb857cb
fix: create new InOrderExecInfo on cmd list reset
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-17 20:31:48 +02:00
Aleksander Czerwionka
b630f58861
fix: remove incorrectly reported extensions
...
Related-To: NEO-8349
Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-04-15 10:04:31 +02:00
Zbigniew Zdanowicz
3b9e1e4b32
refactor: add getter to in order exec info class
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-04-12 10:17:39 +02:00
Kamil Kopryk
cfc3b6a8c2
refactor: add heaplessModeEnabled argument to isHeaplessStateInitEnabled
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-04-11 10:08:10 +02:00
Mateusz Jablonski
cb2b572e94
feature: add support for null aub mode
...
In this mode the AUB CSR is still created, but no AUB file is written.
Related-To: NEO-11097
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-09 16:59:42 +02:00
Kamil Kopryk
4eae28bd64
feature: introduce heapless state init in L0
...
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-04-02 12:34:53 +02:00
Mateusz Hoppe
f86d4220a5
feature: add bindless samplers support to level zero
...
- samplers using bindless addressing require patching bindless offsets to
sampler states on kernel's cross thread data
Related-To: NEO-10505
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-29 08:07:28 +01:00
Kacper Nowak
35ca2bbea9
refactor: refactor logic for querying atomic FP capabilities
...
- Move logic from product helper to compiler product helper
- Add method for adjusting fp16 and extra capabilities using release
helper (if present).
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-03-28 15:27:45 +01:00
Bartosz Dunajski
fce016ee7a
refactor: add new semaphore param
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-26 15:46:47 +01:00
Bartosz Dunajski
74dcb5a9e3
feature: initial support for sync dispatch token allocation
...
Related-To: NEO-8171
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-26 12:07:58 +01:00
Bartosz Dunajski
9aa81bae75
feature: initial support to enable synchronized dispatch
...
Related-To: NEO-8171
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-22 17:23:58 +01:00
Dominik Dabek
2b964254d6
performance: debug key for adjust ULLS on battery
...
ULLS controller timeout settings will be adjusted based on ac line
status and lowest queue throttle from submissions.
Lowest queue throttle is reset when controller stops ULLS.
Related-To: NEO-10800
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-22 14:24:00 +01:00
Katarzyna Cencelewska
da7b03dd15
fix: to always use grfs count in calculateNumThreadsPerThreadGroup
...
grf size != grf count
Related-To: GSD-8437
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-03-22 11:03:18 +01:00
Katarzyna Cencelewska
dd1d52259e
refactor: add param rootDeviceEnvironment to calculateNumThreadsPerThreadGroup
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-03-21 22:25:14 +01:00
Mateusz Hoppe
0cec5ccc32
fix: check if any of devices has BindlessHeapHelper enabled
...
- report Bindless_images extension in driver only when at least one
device has global bindless heaps enabled
Related-To: NEO-10352
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-20 20:50:58 +01:00
Mateusz Jablonski
614ed7ff5f
fix: remove not needed appendBaseAddressOffset method
...
Related-To: NEO-10623
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-20 17:46:14 +01:00
Wenju He
03078541d7
feature: make global bindless heaps resident when created
...
Make bindless heaps resident right after heap allocation.
Motivation is that SYCL bindless image can be passed as a value argument
or through memory. Therefore, we're not able to make its bindless heap
resident during kernel initialization or setting kernel arguments.
This fixes SYCL bindless image read_write_*D.cpp tests on DG2.
Related-To: NEO-7063
Signed-off-by: Wenju He <wenju.he@intel.com>
2024-03-20 14:40:11 +01:00
Bartosz Dunajski
e400a5c624
feature: in-order host counter allocation pooling 2
...
Related-To: NEO-10507
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-13 18:59:02 +01:00
Zbigniew Zdanowicz
4fb4e731d7
refactor: add load register immediate encoder on command pointer
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-12 09:38:03 +01:00
Zbigniew Zdanowicz
3f740f3460
feature: add new functionality to in order patch helpers
...
Related-To: NEO-10385
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-12 09:17:53 +01:00
Compute-Runtime-Validation
4082e9f028
Revert "feature: in-order host counter allocation pooling"
...
This reverts commit 2f03c48c7a.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-09 19:47:10 +01:00
Mrozek, Michal
13a347cbde
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 15:10:06 +01:00
Mateusz Hoppe
5e15543b14
test: improve unit tests - use virtual file system
...
- use virtualFileSystem in ULTs: writeDataToFile() does not create a file
but instead saves the filename, and fileExists() checks whether the file
was previously written
Related-To: NEO-7006
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-03-08 13:34:19 +01:00
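Note on 5e15543b14: the behaviour described in the commit body (a write records the file name instead of creating a file, and an existence check consults that record) might look roughly like the test double below. This is a sketch under assumptions, not the code used in the ULTs; the class name `VirtualFileSystem` and the exact signatures are hypothetical.

```cpp
#include <cstddef>
#include <set>
#include <string>

// Hypothetical test double: remembers file names instead of touching the disk.
class VirtualFileSystem {
  public:
    // Mirrors the described writeDataToFile(): nothing is written to disk,
    // only the file name is saved. The payload is ignored in this sketch.
    void writeDataToFile(const std::string &fileName, const void *data, std::size_t size) {
        (void)data;
        (void)size;
        writtenFiles.insert(fileName);
    }

    // Mirrors the described fileExists(): true only for names recorded above.
    bool fileExists(const std::string &fileName) const {
        return writtenFiles.count(fileName) != 0;
    }

  private:
    std::set<std::string> writtenFiles;
};
```

Keeping only names makes such tests independent of the host file system, at the cost of not being able to read the data back.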
Dunajski, Bartosz
2f03c48c7a
feature: in-order host counter allocation pooling
...
Related-To: NEO-10507
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-08 13:13:04 +01:00
Kamil Kopryk
168445784e
feature: introduce states programming at driver init heapless ocl
...
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-03-08 12:29:44 +01:00
Mrozek, Michal
10313b7b84
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-07 18:50:16 +01:00
Zbigniew Zdanowicz
fe377b6597
feature: add out cmd pointer to store data imm encoder
...
Related-To: NEO-10385
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-07 12:52:08 +01:00
Michal Mrozek
660539fe4e
refactor: remove not needed code.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-03-06 20:13:57 +01:00
Dominik Dabek
033ff28609
performance: enable cmd buffer preallocate xehp+
...
For L0 immediate cmdlists try to use reusable allocations if available.
Related-To: NEO-10526
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-06 16:53:01 +01:00
Compute-Runtime-Validation
293858320e
Revert "performance: Fill reusable allocation list on xe and later"
...
This reverts commit 15934ceb43.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-02 07:09:45 +01:00
Fabian Zwolinski
52430762ac
fix: cl_cache L0 env vars + refactor code structure
...
Added support for new Compiler Cache
environment variables in Level Zero.
Moved
`opencl/source/compiler_interface/default_cache_config.cpp`
`level_zero/core/source/compiler_interface/default_cache_config.cpp`
to shared directory
`source/compiler_interface/default_cache_config.cpp`
Switched enabling cache by default from per OS to per API.
Changed default state of cl_cache in Level Zero to disabled.
Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-03-01 17:35:08 +01:00
Dunajski, Bartosz
51ae76a25f
refactor: improve handling of in-order atomic signaling
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-01 15:59:25 +01:00
Dunajski, Bartosz
ea2ad550a1
refactor: improve handling duplicated in-order host storage
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-01 13:54:28 +01:00
Mateusz Jablonski
80aa55a3cb
fix: don't program dummy blit prior to MI_FLUSH_DW without postsync
...
add missing dummy blits before MI_FLUSH_DW with postsync
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-01 12:36:13 +01:00
Dunajski, Bartosz
bbe1043f08
feature: initial support for pooling in-order counter allocations
...
Related-To: NEO-10507
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-03-01 12:08:02 +01:00
Zbigniew Zdanowicz
4baee71974
refactor: add out cmd argument to semaphore wait encoder
...
Related-To: NEO-10065
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-03-01 11:43:36 +01:00
Lukasz Jobczyk
15934ceb43
performance: Fill reusable allocation list on xe and later
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-03-01 09:41:19 +01:00
Compute-Runtime-Validation
5fce046877
Revert "fix: don't program dummy blit prior to MI_FLUSH_DW without postsync"
...
This reverts commit d796fb559d.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-01 01:43:35 +01:00
Mateusz Jablonski
d796fb559d
fix: don't program dummy blit prior to MI_FLUSH_DW without postsync
...
add missing dummy blits before MI_FLUSH_DW with postsync
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-29 13:30:09 +01:00
Lukasz Jobczyk
39a44628a3
performance: Enable timestamp wait for queues on xe and later
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-28 14:26:59 +01:00
Dunajski, Bartosz
8483a922a7
feature: simplify in-order allocation overflow
...
Related-To: NEO-10507
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-27 18:25:26 +01:00
Mateusz Jablonski
b6b53daa3f
fix: don't program dummy blit prior to MI ARB CHECK
...
Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-27 15:46:46 +01:00
Zbigniew Zdanowicz
0bf8e8727e
refactor: add output buffer argument to store register to memory encoder
...
Related-To: NEO-10064
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-02-27 13:31:55 +01:00
Lukasz Jobczyk
cfd3edfb2c
fix: Align IOH entry
...
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-26 14:36:31 +01:00
Zbigniew Zdanowicz
1995e6aa40
feature: store cpu pointer of command buffer pointing to post sync command
...
Related-To: NEO-10064
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-02-23 13:12:15 +01:00
Dunajski, Bartosz
915b19b0be
refactor: remove not needed debug_break_if
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-19 16:28:58 +01:00
Dunajski, Bartosz
f31c88d05f
fix: size estimation for tlb flush before copy
...
Related-To: HSD-18036669673
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-19 13:35:35 +01:00
Dunajski, Bartosz
be7b80746b
fix: update incorrect debug_break
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-16 16:55:59 +01:00
Kamil Kopryk
0c5cba8ebd
refactor: use common indirectDataAlignment static constexpr value
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-16 13:55:00 +01:00
Lukasz Jobczyk
cc1732c930
performance: make resident before lock
...
Resolves: NEO-10369
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-16 12:00:38 +01:00
Compute-Runtime-Validation
4d159bd06c
Revert "performance: make resident before lock"
...
This reverts commit 7e7fac01e3.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-16 09:23:49 +01:00
Lukasz Jobczyk
7e7fac01e3
performance: make resident before lock
...
Resolves: NEO-10369
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-15 11:56:36 +01:00
Michal Mrozek
25b3a3dbfa
fix: use proper settings for caches.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-02-15 11:38:34 +01:00
Compute-Runtime-Validation
57c946b61c
Revert "fix: align indirect data pointer to cacheline size in heapless mode"
...
This reverts commit 004e6e647f.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-15 05:49:19 +01:00
Compute-Runtime-Validation
d22b2a8a38
Revert "feature: add support for the cl_khr_extended_bit_ops extension"
...
This reverts commit 6dec2143df.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-02-15 05:34:10 +01:00
Kamil Kopryk
004e6e647f
fix: align indirect data pointer to cacheline size in heapless mode
...
Align indirect data pointer to cacheline size in heapless mode,
restore debug_break_if if avx2 load/store operation
gets unaligned pointer,
remove fallback to mm256 loadu/storeu unaligned operation
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-14 10:00:15 +01:00
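Note on 004e6e647f: aligning a pointer to the cacheline size, as the commit body describes, is commonly done by rounding the address up to the next multiple of the alignment. The sketch below shows that general technique only; the 64-byte cacheline size and the helper names `alignUp` and `isAligned` are assumptions for illustration, not values or functions taken from the commit.

```cpp
#include <cstdint>

// Assumed cacheline size for this sketch; the real value is platform-specific.
constexpr std::uintptr_t cacheLineSize = 64;

// Rounds `value` up to the next multiple of `alignment`.
// `alignment` must be a power of two for the mask trick to be valid.
constexpr std::uintptr_t alignUp(std::uintptr_t value, std::uintptr_t alignment) {
    return (value + alignment - 1) & ~(alignment - 1);
}

// True when `value` is already a multiple of `alignment` (power of two).
constexpr bool isAligned(std::uintptr_t value, std::uintptr_t alignment) {
    return (value & (alignment - 1)) == 0;
}
```

With an address aligned this way, aligned AVX2 load/store operations can be used without an unaligned fallback path.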
Ben Ashbaugh
6dec2143df
feature: add support for the cl_khr_extended_bit_ops extension
...
Related-To: NEO-10337
Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2024-02-14 09:11:44 +01:00
Kamil Kopryk
2b91286754
fix: fix load store uint16x16 avx2 for unaligned ptr
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-08 09:35:30 +01:00
Lukasz Jobczyk
486cc71b76
refactor: Add GDI profiling
...
Resolves: NEO-9236
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-02-07 18:44:11 +01:00
Maciej Plewka
ce17580b28
fix: Use Rcs engine in blender on DG2
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-02-07 18:21:54 +01:00
Kacper Nowak
ab1ffea592
refactor: Create function to check if OCL2.1 caps are enabled or enforced
...
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-02-06 08:27:09 +01:00
Dunajski, Bartosz
f31fafb1e2
refactor: improve debug flag to override bcs mocs
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-05 20:23:54 +01:00
Dominik Dabek
2cad595a0d
performance: debug flag for usm host alloc recycle
...
set ExperimentalEnableHostAllocationCache=1 to recycle host usm
allocations
Related-To: GSD-7497
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-01 16:47:59 +01:00
Mateusz Jablonski
2237961deb
performance: add macro for ULT-only unrecoverable if
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-01 15:09:31 +01:00
Compute-Runtime-Validation
fa9c79fb63
Revert "refactor: Add GDI profiling"
...
This reverts commit 524ae7713a.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-30 10:47:34 +01:00
Fabian Zwolinski
c51b656d2c
fix: request instruction cache invalidation on module destroy
...
Invalidation is requested on both linux and windows,
on Csr's that used Isa allocation.
Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-01-29 16:00:34 +01:00
Lukasz Jobczyk
524ae7713a
refactor: Add GDI profiling
...
Resolves: NEO-9236
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-29 11:36:04 +01:00
Mateusz Jablonski
a697a3f718
refactor: create new members for storing spill and private memory in scratch
...
rename private scratch space into scratch space slot 1 as it can be generic
Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-23 12:42:25 +01:00
Compute-Runtime-Validation
f9f9035b95
Revert "refactor: create new members for storing spill and private memory in ...
...
This reverts commit 87eb5f554a.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-23 09:13:00 +01:00
Compute-Runtime-Validation
e949ba7144
Revert "refactor: Add GDI profiling"
...
This reverts commit 8d56f8fb6b.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-23 06:13:02 +01:00
Mateusz Jablonski
87eb5f554a
refactor: create new members for storing spill and private memory in scratch
...
rename private scratch space into scratch space slot 1 as it can be generic
Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-22 19:48:48 +01:00
Dunajski, Bartosz
b77e1a6a71
feature: improve reporting max cooperative group count
...
Related-To: NEO-8210
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-22 14:56:15 +01:00
Lukasz Jobczyk
8d56f8fb6b
refactor: Add GDI profiling
...
Resolves: NEO-9236
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-22 14:24:08 +01:00
Dunajski, Bartosz
6f4ed10919
refactor: unify getMaxWorkGroupCount logic
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-22 11:16:45 +01:00
Dunajski, Bartosz
b266f1f3cc
refactor: improve implicit scaling interface
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-19 14:09:04 +01:00
Mateusz Jablonski
028a5ee2fc
fix: correct calculating highest enabled dual subslice
...
when no DSS is exposed then calculate highest enabled subslice instead
Related-To: NEO-9614
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-19 10:48:15 +01:00
Jitendra Sharma
aa191b6f88
feature: Set runalone mode for contexts with online debugging
...
Related-To: NEO-9139
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-01-17 09:01:30 +01:00
John Falkowski
10ef2a28ff
fix: correct for default hierarchy
...
Resolves: NEO-10006
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2024-01-15 22:15:49 +01:00
Dunajski, Bartosz
dfc69c746a
feature: create InOrderExecInfo for standalone CB Event
...
Related-To: NEO-8145
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-15 16:56:52 +01:00
Fabian Zwolinski
a3bd629aff
fix: store image implicit args in bindless slot 1
...
bindless slot 0: image
bindless slot 1: image implicit args
bindless slot 2: redescribed image
Redescribed image moved one slot higher.
Separate allocation is created for new bindless slot.
Related-To: NEO-9740
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-01-12 13:09:06 +01:00
Katarzyna Cencelewska
5d2d3ed899
refactor: modify sip kernel helpers
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-01-09 17:09:10 +01:00
Mateusz Jablonski
a73fb4d2fe
fix: correct reporting kernel private size on L0
...
unify the logic across APIs
Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-09 09:54:05 +01:00
Dunajski, Bartosz
57900c5455
refactor: improve single slice dispatch mode handling
...
Related-To: NEO-8210
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-04 15:24:09 +01:00
Dunajski, Bartosz
d7b6f11ced
refactor: improve creating 48b resources
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-29 10:05:42 +01:00
Mateusz Hoppe
31e9b5e9fa
feature: add support for secondary contexts in group
...
Related-To: NEO-7824
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-12-28 13:31:08 +01:00
Dunajski, Bartosz
df66a0276f
refactor: remove not used logic to check dynamic postsync layout
...
Related-To: NEO-8210
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-27 13:12:11 +01:00
Compute-Runtime-Validation
f2c9d7172a
Revert "fix: store image implicit args in bindless slot 1"
...
This reverts commit b6d572ddb8.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-12-23 12:01:42 +01:00
Dunajski, Bartosz
d6560c5bba
refactor: pass additional in-order params to PatchCmd helper
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-21 19:21:56 +01:00
Kacper Nowak
18fb432552
refactor: Stop using hard-coded BCS engine as default one
...
Do not rely on presence of BCS0 engine.
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-12-21 17:34:57 +01:00
Dunajski, Bartosz
0f275ab893
refactor: improve InOrder Walker patching
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-21 14:30:46 +01:00
Mateusz Jablonski
a4888b39c6
build: add clang-tidy restriction for Enum case
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-21 08:58:51 +01:00
Mateusz Hoppe
c90dab7a2d
feature: introduce highPriority engine usage
...
Related-To: NEO-7824
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-12-20 20:59:43 +01:00
Fabian Zwolinski
b6d572ddb8
fix: store image implicit args in bindless slot 1
...
bindless slot 0: image
bindless slot 1: image implicit args
bindless slot 2: redescribed image
Redescribed image moved one slot higher.
Separate allocation is created for new bindless slot.
Related-To: NEO-9740
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-12-20 19:09:14 +01:00
Aravind Gopalakrishnan
f5045348ad
fix: Report correct DP support for platforms
...
Related-To: NEO-9702
- Remove older interfaces and add new release helper
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2023-12-20 10:35:49 +01:00
Mateusz Jablonski
138fb65401
refactor: correct naming of enum class constants 11/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 14:52:57 +01:00
Mateusz Jablonski
de93bc6928
refactor: correct naming of enum class constants 10/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 11:30:39 +01:00
Mateusz Jablonski
a6458433dc
refactor: correct naming of enum class constants 9/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 10:36:25 +01:00
Mateusz Jablonski
dd1b9d6abc
refactor: correct naming of enum class constants 8/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 08:18:18 +01:00
Chodor, Jaroslaw
7e795cd3c1
feature: Adding support for OCL C support queries to ocloc
...
Feature needed for automated OCL C compilation with ocloc as backend.
Added queries :
* CL_DEVICE_EXTENSIONS
* CL_DEVICE_EXTENSIONS_WITH_VERSION
* CL_DEVICE_PROFILE
* CL_DEVICE_OPENCL_C_ALL_VERSIONS
* CL_DEVICE_OPENCL_C_FEATURES
Sample command line:
ocloc query -device skl CL_DEVICE_OPENCL_C_FEATURES
Related-To: GSD-7420
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2023-12-18 15:19:16 +01:00
Mateusz Jablonski
fca28a930f
refactor: correct naming of enum class constants 7/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-18 13:05:56 +01:00
Dunajski, Bartosz
1219c48620
refactor: improve post sync handling and in-order unit tests
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-15 11:17:56 +01:00
Lu, Wenbin
67fa39c9a1
fix: get right page size when malloc uses 0 alignment
...
Related-To: GSD-7103
Signed-off-by: Lu, Wenbin <wenbin.lu@intel.com>
2023-12-13 15:00:56 +01:00
Mateusz Jablonski
739d181026
refactor: correct naming of enum class constants 6/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 14:48:52 +01:00
Mateusz Jablonski
261c4d3e72
refactor: add NOLINT for hardware info padding warning
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 12:18:11 +01:00
Mateusz Jablonski
432142c574
refactor: correct naming of enum class constants 4/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 08:08:51 +01:00
Mateusz Jablonski
0428c0acd1
refactor: correct naming of HeapIndex enum values
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-12 11:46:17 +01:00
Dunajski, Bartosz
717bc0c03f
refactor: pass InOrderExecInfo to encode method
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-11 16:43:30 +01:00
Mateusz Jablonski
b182917d9d
refactor: correct naming of allocation types
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-11 16:23:37 +01:00
Mateusz Jablonski
beafea9b39
refactor: correct naming of enum class constants 2/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-11 13:13:35 +01:00
Dunajski, Bartosz
9ce3713ace
refactor: improve Walker args handling
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-06 16:10:26 +01:00
Mateusz Jablonski
1bbc7a57ab
refactor: rename VFE_STATE_TYPE -> FrontEndStateCommand
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-05 14:51:01 +01:00
Mateusz Jablonski
da957d1a37
refactor: correct naming of enum class constants 1/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-05 14:26:42 +01:00
Mateusz Jablonski
6849d33326
refactor: remove redundant definitions KB/MB/GB
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-05 08:55:18 +01:00
Mateusz Jablonski
c48e57e2de
refactor: rename register offsets and constants
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-04 18:31:39 +01:00
Mateusz Jablonski
9486cd2a26
refactor: rename WALKER_TYPE to DefaultWalkerType
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-04 16:29:32 +01:00
Mateusz Jablonski
c3d3a4db1f
refactor: correct variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-04 13:45:53 +01:00
Mateusz Jablonski
e57d372608
refactor: correct naming of enum class constants
...
EngineGroupType
BlitDirection
PostBlitMode
WaitStatus
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 11:53:51 +01:00
Mateusz Jablonski
b92b5892b8
refactor: correct variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 08:34:19 +01:00
Mateusz Jablonski
c3ac7b78bd
refactor: correct variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 02:18:46 +01:00
Mateusz Jablonski
c9664e6bad
refactor: rename global debug manager to debugManager
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-30 13:00:59 +01:00
Mateusz Jablonski
36194c4e7d
refactor: correct variable namings
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-29 23:49:03 +01:00
Kamil Kopryk
982c2d665b
feature: add programming heapless mode in SBA
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-11-24 15:29:19 +01:00
Kamil Kopryk
ce7298d512
feature: Add heapless mode programming in ocl
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-11-24 12:53:39 +01:00
Baj, Tomasz
c49a9b9787
refactor: remove ReturnSubDevicesAsApiDevices from shared code
...
Related-To: NEO-9437
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-11-22 15:13:29 +01:00
Mateusz Jablonski
8dd80efbb1
refactor: move getting thread per eu configs to release helper
...
Related-To: HSD-18034098647
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-21 09:44:32 +01:00
Baj, Tomasz
97842d8eab
fix: program PIPE_CONTROL with RT Flush and CS Stall prior to PIPE_SELECT
...
Related-To: NEO-9194
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-11-20 16:43:59 +01:00
Mateusz Jablonski
35c1f34672
refactor: move number of threads per eu to release helper
...
Related-To: HSD-18034098647
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-20 12:16:33 +01:00
Mateusz Jablonski
a0f8e8accc
fix: initialize max capability values for ARL
...
setup max values when overriding hw info with debug key
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-15 13:44:11 +01:00
Dunajski, Bartosz
30777d4d4c
feature: use indirect semaphore for 64b values
...
Related-To: NEO-8145
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-11-09 16:58:45 +01:00
Mateusz Hoppe
f36e86f121
fix: program bindless surface state size from sba properties
...
- surfaceStateSize is in pages, bindless size needs to be programmed in
surface state units
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-11-07 17:45:54 +01:00
Kamil Kopryk
73bb139e17
refactor: add applyExtraInternalOptions function
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-11-03 09:43:37 +01:00
Michal Mrozek
060fc40938
performance: move constants out of the loop.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2023-10-30 18:16:35 +01:00
Mateusz Jablonski
1e18bf0392
fix: define timestamp mask based on timer resolution
...
Related-To: HSD-18034577551
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-27 11:55:20 +02:00
Compute-Runtime-Validation
fca2159430
Revert "fix: if device hierarchy is flat then getSubDevicesCount return 1u"
...
This reverts commit cb0bb57f49.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-10-26 15:40:29 +02:00
Baj, Tomasz
cb0bb57f49
fix: if device hierarchy is flat then getSubDevicesCount return 1u
...
Related-To: NEO-9167
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-10-25 15:51:52 +02:00
Mateusz Jablonski
6d2d16d68e
fix: avoid overflow of gpu time stamp in ns
...
Related-To: NEO-8394
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-24 14:56:15 +02:00
Zbigniew Zdanowicz
2e09b5ff66
refactor: reposition preamble helper implementation methods
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-10-20 13:18:29 +02:00
Mateusz Hoppe
97faeae16f
feature: heapAssigner per root device
...
- create heapAssigner per root device in memory manager to allow per
device config
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-10-19 19:52:24 +02:00
Mateusz Jablonski
4dfa12c8eb
fix: add mechanism to detect gpu timestamp overflows
...
unify naming CpuGpu to GpuCpu
Related-To: NEO-8394
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-19 16:31:06 +02:00
Lukasz Jobczyk
750b5ba89a
fix: flush necessary caches when dispatch pipe control
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-10-19 15:09:31 +02:00
Mateusz Hoppe
5a4fa180d6
feature: control bindless compilation mode based on release
...
- check releaseHelper support when selecting bindless mode, if not
disabled, prefer bindless mode in L0 API
- bindless mode can be forced with DebugVariable: UseBindlessMode
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-10-19 10:11:11 +02:00
John Falkowski
f156a74f54
fix: split chunking prefetch flags
...
Related-To: NEO-9120
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2023-10-18 19:20:42 +02:00
Mateusz Jablonski
a420e34b10
fix: explicitly remove assign operators when not needed
...
when class defines copy/move ctor then corresponding assign operator(s)
should be defined or deleted
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-12 10:25:15 +02:00
Mateusz Jablonski
3fdcf049bf
fix: set default device hierarchy to composite for all platforms except xe hpc
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-12 10:09:50 +02:00
Latif, Raiyan
dee5ecfdf3
fix: ReturnSubDevicesAsApiDevices flag being ignored
...
Proper subdevice count being returned now in GfxCoreHelper
path, as previous method ignored the usage of the
ReturnSubDevicesAsApiDevices flag.
Related-To: LOCI-4859
Signed-off-by: Latif, Raiyan <raiyan.latif@intel.com>
2023-10-10 17:05:00 +02:00
Mateusz Jablonski
420f273a6c
fix: don't wait on condition in unit tests
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-10 15:14:30 +02:00
Mateusz Hoppe
c2d69e5857
feature: allocate SPECIAL_SSH heap in front window from EXTERNAL heap
...
- SPECIAL_SSH is used for debug surface SurfaceState which must be
located at bindless offset zero
- limit size of external front window
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-10-09 14:54:39 +02:00
Filip Hazubski
0c8a514349
fix: Switch default device hierarchy to FLAT
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2023-10-06 15:24:50 +02:00
Filip Hazubski
08e92d154f
fix: Add getDefaultDeviceHierarchy call to GfxCoreHelper
...
Added getDefaultDeviceHierarchy call that describes default device
hierarchy for a gfx core. Refactored L0 and OCL paths to use this
value by default and override this value when user sets
ZE_FLAT_DEVICE_HIERARCHY environment variable or
ReturnSubDevicesAsApiDevices debug key.
Updated ReturnSubDevicesAsApiDevices to force COMPOSITE device hierarchy
when set to 0.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2023-10-06 12:32:41 +02:00
Mateusz Jablonski
382fc952f2
refactor: add NonAssignableClass to define classes without assign operator
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-05 15:16:58 +02:00
Mateusz Jablonski
712ba60452
fix: add unrecoverable to avoid nullptr access
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-05 13:20:21 +02:00
Mateusz Jablonski
6d259ac4b7
fix: add unrecoverable to avoid out of bound access
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-05 12:03:28 +02:00
Mateusz Jablonski
ad2701ad26
fix: add unrecoverable to avoid out of bound access
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-04 15:59:18 +02:00
John Falkowski
56f05303c9
feature: add support for zeMemGet/SetAtomicAccessAttributeExp
...
Resolves: NEO-8219
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2023-10-02 15:59:17 +02:00
Hoppe, Mateusz
5c565efe28
feature: bindless global heap with debugger
...
- program debugSurface's SurfaceState at the beginning of Bindless Surface
State Heap - SPECIAL_SSH
- ensure SPECIAL_SSH is resident
Related-To: NEO-7063
Signed-off-by: Hoppe, Mateusz <mateusz.hoppe@intel.com>
2023-09-29 13:13:46 +02:00
Mateusz Jablonski
a033df33ff
fix: remove preferSmallWorkgroupSizeForKernel method
...
Related-To: HSD-18033866078
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-29 11:55:09 +02:00
Mateusz Jablonski
3a21b3b228
refactor: remove not needed code
...
Related-To: NEO-7527
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-28 07:52:39 +02:00
Mateusz Jablonski
5f846d8a13
refactor: remove not needed code
...
Related-To: NEO-7527
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-27 18:17:04 +02:00
Mateusz Jablonski
03874b8815
refactor: remove not needed code
...
Related-To: NEO-7527
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-27 17:45:54 +02:00
Mateusz Jablonski
09044dfbaa
refactor: remove not needed code
...
Related-To: NEO-7527
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-27 14:35:49 +02:00
Kacper Nowak
160303924d
refactor: Correct logic for SIMD1
...
- For calculating number of threads per workgroup, for SIMD 1, return
local work size (each software thread should be mapped into a whole hardware
thread).
- Correct logic of calculating space for per thread data for SIMD 1.
- Minor: unit tests refactor.
- Corrected naming.
Related-To: NEO-8261
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-09-26 15:28:37 +02:00
Dominik Dabek
eebf2bbd26
performance(ocl): timestamp packet count per gfx
...
Add support for different timestamp packet counts per gfx family.
Change all packet counts to 1 except for xe-hpc.
Related-To: NEO-8154
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-09-25 20:34:58 +02:00
Maciej Plewka
8658fdb04e
fix: Use stack vec for api specific prefix
...
Related-To: NEO-8388, GSD-6296
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-09-21 16:10:54 +02:00
Maciej Bielski
97e7cda912
feature: Optimize intra-module kernel ISA allocations
...
So far, there is a separate page allocated for each kernel's ISA within
`KernelImmutableData::initialize()`. Apparently the ISA blocks are often
much smaller than a 64k page, which leads to poor memory utilization and
was even observed to cause the device OOM error if a single module has
several keys.
Improve the situation by reusing the parent allocation (owned by the
module instance) for modules whose kernel ISAs can fit together within
a single 64k page. This improves the memory utilization on a single
module level.
Related-To: NEO-7788
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2023-09-21 13:55:45 +02:00
Katarzyna Cencelewska
d7d46a9fc5
refactor: use initialized variable in getHighestEnabledDualSubSlice
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2023-09-20 14:49:56 +02:00
Mateusz Jablonski
b1808f7830
fix: correct suggested number of work groups for concurrent kernels on PVC
...
value depends on CCS count:
- single CCS mode (default) - 50% available
- two CCS mode - 25% available
- four CCS mode - 12.5% available
Related-To: NEO-8377
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-20 13:40:22 +02:00
Mateusz Hoppe
69f5ca6345
feature: bindless addressing - flush state cache after reusing SS slot
...
- when Surface State is reused for new resource, State Cache needs to be
invalidated
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-09-20 12:53:32 +02:00
Dunajski, Bartosz
d3d5da1f72
feature: initial 64b in-order CmdList support
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-19 15:23:37 +02:00
Compute-Runtime-Validation
913a926fd4
Revert "feature: Optimize intra-module kernel ISA allocations"
...
This reverts commit c348831470.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-19 14:16:05 +02:00
Maciej Bielski
c348831470
feature: Optimize intra-module kernel ISA allocations
...
So far, there is a separate page allocated for each kernel's ISA within
`KernelImmutableData::initialize()`. Apparently the ISA blocks are often
much smaller than a 64k page, which leads to poor memory utilization and
was even observed to cause the device OOM error if a single module has
several keys.
Improve the situation by reusing the parent allocation (owned by the
module instance) for modules whose kernel ISAs can fit together within
a single 64k page. This improves the memory utilization on a single
module level.
Related-To: NEO-7788
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2023-09-19 12:05:09 +02:00
Compute-Runtime-Validation
73731d3be5
Revert "fix: correct suggested number of work groups for concurrent kernels o...
...
This reverts commit 6fc673b0fe.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-15 04:21:58 +02:00
Mateusz Jablonski
6fc673b0fe
fix: correct suggested number of work groups for concurrent kernels on PVC
...
value depends on CCS count:
- single CCS mode (default) - no limitations
- two CCS mode - 25% available
- four CCS mode - 12.5% available
Related-To: NEO-8377
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-14 16:13:54 +02:00
Mateusz Jablonski
2f7c33c1fd
refactor: move xe hpg specific appendBlitCommandsBlockCopy to xe hpg file
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-13 10:49:28 +02:00
Dunajski, Bartosz
7562842a58
refactor: remove LogicalStateHelper
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-13 10:29:53 +02:00
Compute-Runtime-Validation
413365a7bf
Revert "fix: Correct logic for SIMD1"
...
This reverts commit fc099ead2e.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-13 08:23:59 +02:00
Kacper Nowak
fc099ead2e
fix: Correct logic for SIMD1
...
- For calculating number of threads per workgroup, treat simd 1 as if it
were simd 32
- Correct logic of calculating space for per thread data for simd 1
- Minor: unit tests refactor
- Corrected naming
Related-To: NEO-8261
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-09-13 07:03:12 +02:00
Mrozek, Michal
d9f938f3db
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2023-09-12 14:25:04 +02:00
Dunajski, Bartosz
6648065703
feature: add indirect semaphore mode
...
Related-To: NEO-8242
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-12 13:15:51 +02:00
Dunajski, Bartosz
2a6be2fccd
feature: update conditional bb start to use qword data
...
Related-To: NEO-8242
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-12 11:24:28 +02:00
Dunajski, Bartosz
def3f2e9ad
refactor: improve semaphore programming
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-12 11:24:11 +02:00
Filip Hazubski
d7db6ac467
feature: Add preferredPlatformName field to RuntimeCapabilityTable
...
For all of the devices, preferredPlatformName is initialized with
nullptr by default and platform name will be initialized to driver's default
platform name, at the moment this is "Intel(R) OpenCL Graphics".
When Platform is initialized and preferredPlatformName is not nullptr then
Platform name will be set to the value stored in preferredPlatformName.
Add ENABLE_LEGACY_PLATFORM_NAME AIL enum related to added functionality.
Move PlatformInfo to NEO namespace.
Related-To: HSD-22018809561
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2023-09-12 11:07:14 +02:00
Mateusz Jablonski
c851896482
refactor: move XeHpg specific setExtraAllocationData definition to Xe Hpg file
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-11 13:32:00 +02:00
Compute-Runtime-Validation
1579c69316
Revert "performance: allocate timestamp packet tag buffer in local mem on DG2"
...
This reverts commit 819908ec94.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-11 12:07:26 +02:00
Fabian Zwolinski
b2ba1fbecf
fix: enforce zebin format by default in Ocloc for ICL/TGL/RKL/ADL
...
Products for which zebin has been set as default format in OCLOC:
- ICELAKE_LP
- TIGERLAKE_LP
- ROCKETLAKE
- ALDERLAKE_S
- ALDERLAKE_P
- ALDERLAKE_N
The default format does not override `--format` parameter.
Related-To: NEO-8334
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-09-08 16:16:18 +02:00
Fabian Zwolinski
10675134e1
feature: Add process safety to Windows compiler cache
...
Related-To: NEO-8092
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
Co-authored-by: Diedrich, Kamil <kamil.diedrich@intel.com>
2023-09-06 15:34:15 +02:00
Maciej Plewka
3b3e17e738
performance: Use vector for private allocs to reuse
...
Related-To: HSD-18033105655, HSD-18033153203
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-09-04 13:34:38 +02:00
Mateusz Jablonski
91b26277a4
feature: add method to adjust hw info for igc
...
Related-To: NEO-8203
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-04 11:58:13 +02:00
Jitendra Sharma
9818ef61a5
feature: Report correct GRF register count
...
Based on Large GRF enabled or not, report correct GRF
register.
Related-To: NEO-6788
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2023-09-04 11:42:48 +02:00
Compute-Runtime-Validation
154530ad23
Revert "feature: Report correct GRF register count"
...
This reverts commit 8eb3fe222e.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-01 15:12:57 +02:00
Jitendra Sharma
8eb3fe222e
feature: Report correct GRF register count
...
Based on Large GRF enabled or not, report correct GRF
register.
Related-To: NEO-6788
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2023-08-31 18:48:29 +02:00
Slawomir Milczarek
5833d65bc0
feature: Add support for memadvise with system preferred location
...
Related-To: NEO-7851
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2023-08-31 13:09:55 +02:00