Dominik Dabek
819ffea90f
fix: reenable indirect detection for non-VC, PVC
...
Issue is limited to detection in VC, can reenable for other kernels.
Related-To: NEO-13372
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-12-04 14:59:02 +01:00
Dominik Dabek
5167f34a8a
fix: disable indirect detection, PVC
...
Related-To: NEO-13372, GSD-10403
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-12-03 13:53:47 +01:00
Bartosz Dunajski
0ecbc627bd
refactor: remove not used dispatch walk order param
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-03 09:58:03 +01:00
Zbigniew Zdanowicz
56b15f17f7
refactor: unify further calculation to get max work group count
...
- move available device calculcation into common helper
- change interface to have code available where no descriptor is available
- expand unit test for implementation of new inteface
Related-To: NEO-13350
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-29 17:51:05 +01:00
Zbigniew Zdanowicz
c5ed6bf73c
refactor: split sync buffer and region allocation creation code
...
- split the allocation code from command list or kernel
- allow to call allocation code in all parts of the driver
Related-To: NEO-13350
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-29 11:41:39 +01:00
Bartosz Dunajski
5e1fa75676
refactor: adjust code to compile with c++20
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-11-29 10:27:29 +01:00
Vysochyn, Illia
afd22999cc
refactor: Adjust RENDER_SURFACE_STATE structures naming
...
Performs minor renaming (mostly capitalization) in order to align with
specification.
Renames L1_CACHE_POLICY to L1_CACHE_CONTROL.
Related-To: NEO-13147
Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2024-11-29 09:43:11 +01:00
Mateusz Jablonski
fa58073095
refactor: remove not used usings/typedefs/variables
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-28 16:19:39 +01:00
Mateusz Jablonski
2039b1c41b
refactor: remove not needed code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-28 16:16:30 +01:00
Zbigniew Zdanowicz
6b7235cd6c
refactor: change parameter names into more meaningful
...
- change additional size into local region size
- change walk order into dispatch walk order to distinguish for local id walk
Related-To: NEO-13350
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-27 16:15:02 +01:00
Maciej Plewka
fccca2dba7
refactor: unify getGpgpuEngineInstances for xe2+
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-11-22 14:45:58 +01:00
Wenbin Lu
47383d752f
fix: report correct number of XeCore per Cluster
...
Related-To: NEO-10060
Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-11-22 09:40:28 +01:00
Alicja Lukaszewicz
789efc8909
fix: remove number of RT stacks from capability table
...
Related-To: NEO-10830
Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com>
2024-11-21 14:46:50 +01:00
Katarzyna Cencelewska
4ad8c17db9
feature: add debug flags for timestamps
...
PrintCalculatedTimestamps - print ts in level zero paths
PrintTimestampPacketContents - add logging also to level zero paths
ForceUseOnlyGlobalTimestamps - force using a global ts
Related-To: HSD-14023527252
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-11-21 11:28:08 +01:00
Maciej Plewka
62d8e3e4b0
fix: Align thread group count to dss on all platforms
...
Related-To: NEO-13263, GSD-10327
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-11-18 16:55:20 +01:00
Alicja Lukaszewicz
68dc7fb33b
feature: add calculation of stack count for sync RT
...
Related-To: NEO-10830
Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com>
2024-11-18 14:52:00 +01:00
Filip Hazubski
8797c326b6
refactor: Move isDummyBlitWaRequired function to release helper
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-11-15 13:22:00 +01:00
Mateusz Jablonski
f55ad93baf
fix: remove L3 config from release helper
...
L3 bank count should be queried from KMD
L3 bank size should be queried from device blob
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-12 14:28:36 +01:00
Marcel Skierkowski
49d999abe6
fix: remove defaultProfilingTimerResolution from RuntimeCapabilityTable
...
Related-To: NEO-12275
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2024-11-04 12:02:11 +01:00
Chodor, Jaroslaw
5f908ce092
feature: adding support for custom compiler backends
...
This adds abbility to load different versions of the backend
compiler based on underlying device.
Related-To: NEO-12747
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2024-10-23 19:55:36 +02:00
Bartosz Dunajski
ff80a02fcb
refactor: parse extra zebin params
...
Related-To: NEO-12591
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-10-21 09:59:33 +02:00
Maciej Plewka
9d6d6e85f1
fix: align thread group to dss size if kernel uses slm
...
Related-To: NEO-12133
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-10-17 15:30:19 +02:00
Tomasz Biernacik
c982981dde
feature: add number of rt stacks to capability table
...
Related-To: NEO-12138
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2024-10-17 14:46:19 +02:00
Maciej Plewka
deb27d0363
fix: align thread group count to fit within dss
...
Related-To: NEO-12133
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-10-16 18:34:39 +02:00
Bartosz Dunajski
52e9a6e07f
feature: Initial CB events IPC support
...
Related-To: NEO-11925
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-10-16 13:33:59 +02:00
Compute-Runtime-Validation
2098e64dc1
Revert "feature: adding support for custom compiler backends"
...
This reverts commit 8098bcc48d
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-16 02:07:25 +02:00
Jitendra Sharma
9bd4878841
feature: update GRF register implementation
...
Related-To: NEO-8314
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-10-15 13:47:34 +02:00
Bartosz Dunajski
acef3a1e71
feature: pass external device allocation to CB event
...
Related-To: NEO-11925
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-10-15 09:37:59 +02:00
Chodor, Jaroslaw
8098bcc48d
feature: adding support for custom compiler backends
...
This adds abbility to load different versions of the backend
compiler based on underlying device.
Related-To: NEO-12747
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2024-10-14 18:23:11 +02:00
Szymon Morek
7f2b806413
fix: Override timestamp width from KMD
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-10-14 13:38:33 +02:00
Wojciech Konior
8a6626da23
refactor: two engineInstanced-methods removed
...
Related-To: NEO-12594
Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2024-10-11 18:34:06 +02:00
Mateusz Jablonski
3c06b316e6
refactor: remove legacy code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-10 09:54:22 +02:00
Mateusz Hoppe
31265edfee
fix: program RenderTargetCacheFlush in PC prior to PIPELINE_SELECT
...
- fix code by removing csStallOnly that skipped seeting RTCF flag
Related-To: NEO-9194
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-10-09 16:59:43 +02:00
Wojciech Konior
6b40f9bc5a
refactor: engineInstancedType removed
...
Related-To: NEO-12594
Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2024-10-09 16:30:48 +02:00
Mateusz Hoppe
5ae2552b4b
fix: track shifted contextIds in bitset in bindlessHeapsHelper
...
- bitset is 64 bit in size, context ids may go beyond that limit
when multiple devices are available
- this change subtracts contextId of first context for a given root
device - tracked state dirty contexts ids are now zero-based
Resolves: GSD-10025
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-10-09 10:32:29 +02:00
Mateusz Jablonski
552930a75f
fix: don't setup preemption surface when debugger is active
...
Related-To: NEO-12878
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-08 13:58:11 +02:00
Katarzyna Cencelewska
42ca656edb
fix: change logic to calculate available thread count
...
don't use magic number, value depend on grf size
Related-To: HSD-18039369782
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-10-07 15:46:33 +02:00
Dominik Dabek
6d6c4267b3
performance: enable indirect detection
...
Platforms other than PVC require detection version >= 4
Vector compiler kernels require >= 6
Related-To: NEO-12491
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-10-07 14:38:22 +02:00
Bartosz Dunajski
9a280892f8
refactor: add timestamps to xe logs
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-10-07 11:42:55 +02:00
Filip Hazubski
388c7dc591
fix: Update SyncBuffer size calculation
...
Add a constant to describe minimal size of SyncBuffer object.
Related-To: HSD-18039952263, HSD-18039940553, HSD-18039937640
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-10-03 15:57:24 +02:00
Compute-Runtime-Validation
31618d9e18
Revert "fix: add workaround for incorrect DRM system info"
...
This reverts commit 7ac991fa3f
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-03 15:22:21 +02:00
Mateusz Jablonski
14c8f1f15d
refactor: remove not used function
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-02 16:14:40 +02:00
Mateusz Jablonski
9819b1e2e4
refactor: remove not needed flag ftrGpGpuMidThreadLevelPreempt
...
the flag was used for mid thread preemption support on pre-gen12 platforms
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-01 12:52:12 +02:00
Mateusz Jablonski
9db83b8231
refactor: unify isMidThreadPreemptionSupported function
...
mid thread preemption can be enabled only by ftrWalkerMTP flag
pre-Xe2 devices doesn't support MTP
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-01 11:26:04 +02:00
Filip Hazubski
72cf384c7d
refactor: Fix typo
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-10-01 09:31:02 +02:00
Compute-Runtime-Validation
ef1b569a85
Revert "performance: Do not create global fence allocation on integrated"
...
This reverts commit 6bf5183eff
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-01 08:14:39 +02:00
Lukasz Jobczyk
6bf5183eff
performance: Do not create global fence allocation on integrated
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-09-30 13:13:27 +02:00
Compute-Runtime-Validation
6cb0e45330
Revert "performance: Do not create global fence allocation on integrated"
...
This reverts commit 50eb6af9ac
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-09-27 11:48:01 +02:00
Lukasz Jobczyk
50eb6af9ac
performance: Do not create global fence allocation on integrated
...
Resolves: NEO-12324
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-09-27 09:32:42 +02:00
Lukasz Jobczyk
c93998bcb9
performance: Do not program additional synchronization on integrated
...
Related-To: NEO-12324
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-09-26 10:54:31 +02:00
Mateusz Hoppe
8000133b2a
refactor: add method to adjust regular context count
...
Related-To: NEO-12258
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-25 13:46:46 +02:00
Maciej Plewka
80f75ceace
fix: submit dummy exec to pin memory during zeContextMakeMemoryResident call
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-09-23 14:43:59 +02:00
Bartosz Dunajski
d7ce841081
fix: InOrderExecInfo upload to tbx
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-20 16:03:09 +02:00
Mateusz Hoppe
4a068c8eab
fix: correclty program StateBaseAddress in global bindless mode
...
- prepare bindful ssh when kernel requires ssh heap and
SurfaceStateBaseAddress
- remove lastAppendedKernelBindlesMode - local ssh heap may be needed
for bindless kernels with scratch or misaligned buffer args
- use ssh heap gpu address to program SurfaceStateBaseAddress, global base is
used for BindlessSurfaceState and DynamicState
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-20 11:57:05 +02:00
Filip Hazubski
ebc19b4a70
feature: Add logic to disable bindless addressing via AIL
...
Add mockable Device functions to get ReleaseHelper and AILConfiguration.
Resolves: NEO-12699
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-09-18 13:49:51 +02:00
Jitendra Sharma
25f2504bf5
fix: set eudebug extension property when debugging enabled
...
In order to debug on XE, XE_EXEC_QUEUE_SET_PROPERTY_EUDEBUG
needs to be set up.
Related-To: NEO-12691
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-09-18 08:32:03 +02:00
Zbigniew Zdanowicz
6e0aa1781c
refactor: modify kernel helper method to pass kernel data directly
...
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-09-16 16:25:13 +02:00
Mateusz Jablonski
78604bd475
refactor: remove not needed code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-16 12:12:43 +02:00
Dominik Dabek
3bd2befe74
refactor: indirect detection helpers, VC
...
Allow for different required version for VC compiled kernels.
Define constant for detection disabled.
Related-To: NEO-12491
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-09-13 19:42:57 +02:00
Dominik Dabek
571d703135
refactor: indirect detection helpers
...
Check indirect detection version from igc header for JIT.
Move required version to its own method.
This allows for different required versions per platform.
Related-To: NEO-12491
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-09-11 14:49:54 +02:00
Zbigniew Zdanowicz
7ce4a8adc2
performance: replace virtual calls with native class methods
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-09-11 11:10:40 +02:00
Fabian Zwoliński
ea5b586c37
fix: move pathExists out of sys calls and do not mock it
...
Moved pathExists from SysCalls to path.h.
In ULTs, use unchanged pathExists and mock stat, getFileAttributesA instead.
Add Windows and Linux ULTs for pathExists.
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-09-10 19:24:45 +02:00
Mateusz Hoppe
a6bf424417
refactor: determine hp copy engine once after gfx core helper is created
...
- do not determine engine on every call to getDefaultHpCopyEngine()
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-10 10:00:48 +02:00
Wenbin Lu
7ac991fa3f
fix: add workaround for incorrect DRM system info
...
Related-To: NEO-9489
Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-09-10 08:48:01 +02:00
Filip Hazubski
7d16521c7b
performance: Correct alignment checks
...
Only use checks in debug builds.
Resolves: HSD-18039597713
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-09-10 08:43:36 +02:00
Mateusz Jablonski
54bda0e986
fix: In Linux CL/GL sharing
...
- always issue flush request before export
Apparently it's expected to flush the object (which might convert them
from one format to another for export, or remove aux buffer uses or
anything not supported by export).
- use modifier to select tiling mode
Previously we just assumed that whatever tiling mode was picked by mesa
will match the one picked by GMMLIB but that's not always the case
and in particular on Arc and Xe it doesn't work ... Mesa picks Tile4
and GMMLIB picks Tile64 ...
Fixes : #761
Fixes : #736
Signed-off-by: Sylvain Munaut <tnt@246tNt.com>
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-06 21:40:18 +02:00
Fabian Zwoliński
38e1614f4a
fix: create and use new allocation type for syncBuffer
...
Related-To: NEO-11533
Related-To: HSD-18039788811
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-09-06 14:11:33 +02:00
Kamil Kopryk
95b035a071
fix: global stateless heap creation in heapless path
...
Related-To: NEO-10681
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-09-05 16:56:00 +02:00
Andrzej Koska
b0e7a11e9a
refactor: Improving information transfer about the copy engine
...
Related-To: NEO-11934
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-09-05 16:11:52 +02:00
Compute-Runtime-Validation
d842f65cf1
Revert "fix: submit dummy exec to pin memory during zeContextMakeMemoryReside...
...
This reverts commit f9b87d53e6
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-09-05 03:28:03 +02:00
Maciej Plewka
f9b87d53e6
fix: submit dummy exec to pin memory during zeContextMakeMemoryResident call
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
Related-To: NEO-11879
2024-09-04 14:07:29 +02:00
Mateusz Hoppe
482e077252
feature: add regular and hp contexts in group without dedicated hp engine
...
- if no hp copy engine available, create group with regular and hp
contexts
Related-To: NEO-11983
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-08-28 14:01:10 +02:00
Mateusz Hoppe
3f6fc1d4d2
feature: switch to new aubstream createHardwareContext2 interface
...
Related-To: NEO-12257
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-08-26 14:10:53 +02:00
Damian Tomczak
df9fa343d1
feature: limit max workgroup size to 1k items
...
Resolves: NEO-11881
Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-08-26 09:20:51 +02:00
Mateusz Jablonski
f617093a6a
fix: add missing nullptr check before accessing ail helper
...
Fixes : #755
Fixes : #754
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-20 16:58:53 +02:00
Compute-Runtime-Validation
9b652f4a34
Revert "feature: Improving information transfer about the copy engine"
...
This reverts commit 17ffdff4f1
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-15 22:06:31 +02:00
Andrzej Koska
17ffdff4f1
feature: Improving information transfer about the copy engine
...
Related-To: NEO-11934
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-08-14 11:28:29 +02:00
Ben Ashbaugh
32f80e1131
feature: add support for the cl_khr_extended_bit_ops extension
...
Related-To: NEO-10337
Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2024-08-12 17:44:31 +02:00
Filip Hazubski
54b5adb767
fix: Add checks to align functions
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-08-12 10:08:15 +02:00
Maciej Bielski
a4060013de
refactor: move CLOS-related steps from core- to product-helper
...
Future HW will not support cache reservation uniquely for the whole
platform. Implementation of some functions may vary between products.
Related-To: NEO-10158
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-08-12 09:27:04 +02:00
Szymon Morek
d4c1631ac7
performance: don't wait for paging fence on user thread
...
Related-To: NEO-12197
Currently for new resources user thread must wait before submitting
actual workload. With this commit, instead of waiting on user thread,
request is sent to background ULLS controller thread and additional
semaphore is programmed. ULLS controller will perform actual wait
and signal semaphore when paging fence reaches required value.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-08-07 08:30:51 +02:00
Bartosz Dunajski
24cfd203ab
fix: dont download tbx allocations on heapless first device submission
...
Related-To: HSD-18039476929
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-06 14:03:42 +02:00
Michal Mrozek
d52ca080bd
Revert "performance: improve pool handling"
...
This reverts commit a3c3b6533a
.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-08-06 13:04:02 +02:00
Fabian Zwoliński
674c4a15ad
fix: use correct gpu address when bindless heaps helper is enabled
...
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-05 15:09:57 +02:00
Compute-Runtime-Validation
e27efd701f
Revert "fix: correct calculating max subslice space"
...
This reverts commit 67f2500c03
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-02 12:28:13 +02:00
Mateusz Jablonski
67f2500c03
fix: correct calculating max subslice space
...
computeMaxNeededSubSliceSpace is no longer needed as getHighestEnabledSubSlice
already determines maximum index from all enabled subslices
Related-To: NEO-12073
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-01 16:38:24 +02:00
Bartosz Dunajski
5ccfd6f2be
fix: add missing AlignLocalMemoryVaTo2MB flag support
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-01 13:54:51 +02:00
Compute-Runtime-Validation
2d1b263e9a
Revert "refactor: remove redundant function computeMaxNeededSubSliceSpace"
...
This reverts commit c0b96dcd6e
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-01 03:54:56 +02:00
Mateusz Jablonski
c0b96dcd6e
refactor: remove redundant function computeMaxNeededSubSliceSpace
...
use GfxCoreHelper::getHighestEnabledDualSubSlice instead
Related-To: NEO-12073
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-31 14:50:13 +02:00
Mateusz Jablonski
aa32f7420b
fix: correct getting max enabled dual subslice
...
when no dss info provided, assume max dss per slice are enabled
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-31 11:15:23 +02:00
Mateusz Jablonski
838894e074
fix: correct getting max enabled subslice
...
when no subslice info provided, assume max subslices per slice are enabled
Related-To: NEO-12073
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-31 10:42:57 +02:00
Mateusz Jablonski
1fa7e14d0b
fix: query engine info prior to query topology
...
don't setup l3 bank count in default gt sys info
Related-To: NEO-12073
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-25 16:17:44 +02:00
Maciej Plewka
afee8814ef
refactor: get ioh alignment from static function
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-07-24 14:43:31 +02:00
Bartosz Dunajski
c3312f21f7
feature: dont initialize in-order TS nodes
...
Related-To: NEO-11925
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-23 13:31:23 +02:00
Bartosz Dunajski
3758e99cbf
feature: initial support for standalone CB Events Timestamps allocator
...
Related-To: NEO-11925
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-22 10:13:55 +02:00
Mateusz Hoppe
b6299b8a21
feature: add support for HP copy engine context
...
- add support for contect group with HP copy engine
- choose HP copy engine when available
Related-To: NEO-11983
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-19 12:23:03 +02:00
Compute-Runtime-Validation
7ad15639fc
Revert "feature: add support for HP copy engine context"
...
This reverts commit 3fbcbcaef2
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-18 21:02:14 +02:00
Mateusz Hoppe
3fbcbcaef2
feature: add support for HP copy engine context
...
- add support for contect group with HP copy engine
- choose HP copy engine when available
Related-To: NEO-11983
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-18 16:07:07 +02:00
Filip Hazubski
46f9133bf2
fix: Correct logic to select internal BCS engine
...
When BCS3 is not available, use last available copy engine as internal.
Related-To: HSD-18039263936
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-07-18 12:02:50 +02:00
Mateusz Jablonski
778645c11e
feature: add initial support for Xe2 platforms
...
Related-To: NEO-8188, NEO-10774
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-09 10:30:27 +02:00
Mateusz Jablonski
f171a22c0a
refactor: include reg_configs_common using full path
...
add missing register offset value
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-03 13:30:46 +02:00
Szymon Morek
3dd051c3ee
performance: adjust compression handling
...
Related-To: NEO-11882
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-03 09:37:11 +02:00
Morek, Szymon
aed96cede4
fix: flush barrier when profiling enabled
...
Related-To: NEO-10615
Signed-off-by: Morek, Szymon <szymon.morek@intel.com>
2024-07-02 13:25:07 +02:00
Dominik Dabek
76e8be5c39
fix(ocl): handle gl sharing displayable textures
...
Displayable textures always need dc flush.
Related-To: NEO-11694
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-06-27 13:43:20 +02:00
Mateusz Jablonski
85289f6658
refactor: extract common code for populating ftr and wa table
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-06-27 13:00:15 +02:00
Mateusz Jablonski
d51997be4f
fix: setup L3 cache config when available from release helper
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-06-26 16:55:51 +02:00
Mateusz Jablonski
66d4d141e1
fix: extract common logic for filling default gt system info
...
gt system info should be queried from KMD
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-06-25 20:21:57 +02:00
Michal Mrozek
a3c3b6533a
performance: improve pool handling
...
Related-To: NEO-11731
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-06-25 17:04:17 +02:00
Compute-Runtime-Validation
7136dfbd38
Revert "performance: improve pool handling"
...
This reverts commit 5f0b9efd2b
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-06-24 12:47:19 +02:00
Kamil Kopryk
d97dccc409
refactor: rename MaximumNumberOfThreads debug key
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-19 13:06:42 +02:00
Ratajewski, Andrzej
f7888fac0d
feature: Enable support for cl_intel_subgroup_matrix_multiply_accumulate_tf32
...
This change enables support for
`cl_intel_subgroup_matrix_multiply_accumulate_tf32` extension for PVC B0
and later.
Related-To: GSD-7825
Signed-off-by: Ratajewski, Andrzej <andrzej.ratajewski@intel.com>
2024-06-19 11:43:03 +02:00
Kamil Kopryk
f05ff9b156
refactor: rename OverDispatchControl debug key
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-18 18:59:43 +02:00
Kamil Kopryk
c40f0f2421
refactor: rename ComputeOverdispatchDisable debug key
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-17 13:18:29 +02:00
Pawel Szymichowski
521add6ca7
feature: Enable support for cl_intel_subgroup_buffer_prefetch
...
This change enables support for cl_intel_subgroup_buffer_prefetch extension for
PVC and later.
Related-To: GSD-7825
Signed-off-by: Ratajewski, Andrzej <andrzej.ratajewski@intel.com>
2024-06-14 20:44:52 +02:00
Mrozek, Michal
5f0b9efd2b
performance: improve pool handling
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
Resolves: NEO-11731
2024-06-14 12:02:34 +02:00
Bartosz Dunajski
da9c009b88
feature: assign unique interrupt to queue
...
Related-To: NEO-8179
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-07 10:06:31 +02:00
Mateusz Jablonski
01c36a38c6
feature: add support for generic id acronyms in ocloc
...
Related-To: NEO-11568
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-06-06 10:06:44 +02:00
Mateusz Jablonski
d7c7cb203c
fix: add missing support for acronyms without dashes
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-05-29 17:07:06 +02:00
Aleksander Czerwionka
683eb97d9e
refactor: [ocloc] remove dead code in gen12lp backward compatibility
...
Related-To: NEO-10148
Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-05-29 16:26:31 +02:00
Chodor, Jaroslaw
ad155da67a
feature: Enabling platforms in ocloc by default
...
This feature enables supported platforms in ocloc even
if not enabled for driver.
Allows sharing single ocloc instance for multiple driver-platform
configurations
Related-To: NEO-10531
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2024-05-28 12:09:47 +02:00
Compute-Runtime-Validation
9d312995e2
Revert "feature: split cache_policy_dg2_and_later.inl file"
...
This reverts commit 5df982a85d
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-28 07:03:47 +02:00
Aleksander Czerwionka
251463d77a
feature: [Ocloc] provide backward compatibility for gen12lp family merge
...
Related-To: NEO-10148
Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-05-27 17:35:31 +02:00
Jitendra Sharma
5df982a85d
feature: split cache_policy_dg2_and_later.inl file
...
Move getCachingPolicyOptions method present in existing
cache_policy_dg2_and_later.inl in new
get_caching_policy_options.inl file.
This would help in reusing getCachingPolicyOptions
method in any newly created cache_policy_* file.
Related-To: NEO-8306
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-05-27 12:34:14 +02:00
Aravind Gopalakrishnan
b39e99a17a
feature: Prepare hw info to report CXL type
...
Related-To: NEO-11362
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-05-24 07:09:42 +02:00
Fabian Zwoliński
78bd3da078
fix: ocloc - add bindless mode options if bindless is enabled
...
If release helper is not present,
do not add bindless mode options.
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-05-23 14:10:28 +02:00
Mateusz Jablonski
2577f1123e
refactor: remove not needed code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-05-22 14:01:21 +02:00
Fabian Zwoliński
59829aeb11
fix(ocl): Force rebuild if binary contains bindless kernel
...
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-05-20 14:15:17 +02:00
Maciej Plewka
90df4b298b
fix: cache flush dependency for queue blocked
...
Related-to: NEO-9872, HSD-18038461954
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-20 11:55:24 +02:00
Young Jin Yoon
e204d27190
fix: print to stdout for disable scratch page
...
Modified to print out error messages to stdout when disable scratch page
is used.
Related-To: GSD-7611
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-05-16 15:05:07 +02:00
Aravind Gopalakrishnan
12ab4e5446
feature: Add 3 channel 8/16/32 bit formats
...
Related-To: NEO-11292
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-05-16 13:28:04 +02:00
Dominik Dabek
b4d839fe29
performance(usm): l0, add usm host memory pooling
...
Disabled by default.
Related-To: NEO-11356
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 15:20:51 +02:00
Ratajewski, Andrzej
0b0f7bd4cd
feature: Enable support for cl_intel_subgroup_2d_block_io
...
This change enables support for cl_intel_subgroup_2d_block_io extension for
PVC and later.
Related-To: GSD-7825
Signed-off-by: Ratajewski, Andrzej <andrzej.ratajewski@intel.com>
2024-05-14 15:16:49 +02:00
Maciej Plewka
e39893485c
fix: add cache flush as dependency for bcs ccs synchronization
...
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-09 13:43:39 +02:00
Mateusz Hoppe
76af952fcb
feature: Bindless sampled image support
...
Related-To: NEO-10352
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-05-08 17:50:37 +02:00
Aleksander Czerwionka
4667f55ea2
fix: add missing checks to return proper OpenCL error from API
...
changes affect clEnqueueNDCountKernelINTEL and
clGetKernelMaxConcurrentWorkGroupCountINTEL
Related-To: NEO-9688
Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-05-07 20:32:36 +02:00
Bartosz Dunajski
e5882e0d31
feature: pass GraphicsAllocation to fence wait
...
Related-To: NEO-8179
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-07 17:59:28 +02:00
Andrzej Koska
e44c7f2627
fix: BCS MMIO programming
...
When remap is enabled, we must set different base offset for copy engines.
Copy engines must use BCS0 base.
Related-To: NEO-10678
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-05-07 14:08:28 +02:00
Katarzyna Cencelewska
e9f7df6ae6
refactor: create helper for maxPtssIndex
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-05-07 12:47:52 +02:00
Bartosz Dunajski
ce36812f8d
feature: obtain GraphicsAllocation from external CB event
...
Related-To: NEO-8179
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-06 22:52:36 +02:00
Compute-Runtime-Validation
8342c0ae2f
Revert "fix: add cache flush as dependency for bcs ccs synchronization"
...
This reverts commit 5e57bb2a32
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-01 03:05:47 +02:00
Maciej Plewka
5e57bb2a32
fix: add cache flush as dependency for bcs ccs synchronization
...
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-04-30 11:57:49 +02:00
Bartosz Dunajski
2a2596c13b
refactor: pass additional data to ioctl helper
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-26 14:53:14 +02:00
John Falkowski
f1c2ea0b69
fix: kernel access to multiple stateful virtual regions
...
Related-to: NEO-8350
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2024-04-26 11:18:24 +02:00
Michal Mrozek
4aa7c6c99e
performance: optimize counter based waiting schemes
...
- store latest waited counter value.
- do not wait on values that are already completed.
- disable mechanism when counter overflows.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-04-25 17:20:29 +02:00
Katarzyna Cencelewska
eb256f1ffd
refactor: use encode surface state to set pitch for scratch
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-04-25 11:57:14 +02:00
Mateusz Hoppe
d38ac4806b
feature: global bindless sampler offsets
...
- use global heap base for DSH when bindlessHeapsHelper enabled
Related-To: NEO-10505
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-04-24 21:12:17 +02:00
Andrzej Koska
625d2d7f50
refactor: Passing information about the engine
...
Part 2
Related-To: NEO-10678
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-04-24 09:08:35 +02:00
Compute-Runtime-Validation
04c32d9e4e
Revert "fix: allow kernel access across multiple virtual regions"
...
This reverts commit b9c1ef65dd
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-04-24 04:00:15 +02:00
John Falkowski
b9c1ef65dd
fix: allow kernel access across multiple virtual regions
...
Related to: NEO-8350
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2024-04-23 18:20:52 +02:00
Andrzej Koska
ae139aeffd
refactor: Passing information about the engine
...
Extension of the interface with information
about the engine type passed to the function
Related-To: NEO-10678
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-04-22 13:33:56 +02:00