Commit Graph

4838 Commits

Author SHA1 Message Date
Fabian Zwoliński
d2ce3badfc fix: bindlessHeapsHelper handle unavailable external heap
This PR handles the situation in which a component
has reserved a front window space for itself in the external heap,
so that the Compute Runtime cannot access this area.

In such a situation, we perform the following steps:
1. reserve 4GB chunk in heapStandard
2. split our chunk into 2 parts: heapFrontWindow, heapRegular
3. from this point on, map all linearStream allocations in reserved 4GB
chunk

Patch applies to Windows and WSL.
Patch only applies when the bindless global allocator is enabled.

Related-To: HSD-16025889919
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-12-05 14:18:01 +01:00
Chandio, Bibrak Qamar
ab2e831a4a fix: zeDeviceGetGlobalTimestamp to use submisison
Related-To: GSD-10253, GSD-9467, GSD-9381, NEO-11908

When EnableGlobalTimestampViaSubmission is set then
zeDeviceGetGlobalTimestamp uses immediate cmd submission
method to get GPU time.

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2024-12-04 19:10:07 +01:00
Lukasz Jobczyk
d40a804bca performance: Allocate by KMD on BMG
Related-To: NEO-10526

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-12-04 14:41:25 +01:00
Dominik Dabek
1252b10ba9 fix: limit usm device reuse based on used memory
Calculate available memory for usm device reuse based as (total device
memory - used memory) * fraction for reuse.

Use sys mem allocs for devices without local memory.

Related-To: NEO-12902

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-12-04 08:11:23 +01:00
Compute-Runtime-Validation
d4bfa0f758 Revert "performance: Allocate by KMD on BMG"
This reverts commit 331fffaeea.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-04 07:03:32 +01:00
Compute-Runtime-Validation
b1fa7a0e24 Revert "performance: Enable timestamp wait for queues on Xe2"
This reverts commit 2789c50090.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-04 02:53:31 +01:00
Chodor, Jaroslaw
49e904df74 feature: Parse actual_kernel_start_offset zeinfo entry
This is a deprecated and redundant entry but needs to
be preserved for compatibility reasons.

Related-To: GSD-10402
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2024-12-03 18:57:06 +01:00
Lukasz Jobczyk
331fffaeea performance: Allocate by KMD on BMG
Related-To: NEO-10526

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-12-03 15:25:56 +01:00
Lukasz Jobczyk
2789c50090 performance: Enable timestamp wait for queues on Xe2
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-12-03 14:56:34 +01:00
Szymon Morek
d5e5c8e59e fix: call flushMonitorFence on BCS CSR
Related-To: NEO-12477

Also, make sure that global fence is always resident

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-12-03 12:48:54 +01:00
Szymon Morek
e6d11eb04b performance: stop ULLS for BCS during migration
Related-To: NEO-13340

When regular copy CSR has enabled direct submission,
stop it before migration on internal CSR.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-12-02 17:57:12 +01:00
Bartosz Dunajski
9629ab3cc3 fix: disable fence wait if not supported on given CSR type
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-02 15:32:23 +01:00
Compute-Runtime-Validation
f5cda5ba53 Revert "fix: call flushMonitorFence on BCS CSR"
This reverts commit fc8aa8246b.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-11-30 22:31:30 +01:00
Zbigniew Zdanowicz
56b15f17f7 refactor: unify further calculation to get max work group count
- move available device calculcation into common helper
- change interface to have code available where no descriptor is available
- expand unit test for implementation of new inteface

Related-To: NEO-13350

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-29 17:51:05 +01:00
Dominik Dabek
99a353a15a feature: flags for logging indirect detection
Add flag to log information for indirect detection debugging.
Add flag to disable indirect detection by kernel name.
Add flag to force indirect detection enable/disable for CM kernels.

Related-To: NEO-13372

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-11-29 16:37:47 +01:00
Szymon Morek
fc8aa8246b fix: call flushMonitorFence on BCS CSR
Related-To: NEO-12477

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-29 16:31:40 +01:00
Mateusz Jablonski
b1f7a3d125 test: remove not used usings/typedefs in shared tests
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-29 16:31:25 +01:00
Mateusz Jablonski
d5b5592450 test: remove code duplication of genCmdCast<MI_ARB_CHECK *>
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-29 14:34:31 +01:00
Vysochyn, Illia
c40f015249 refactor: Modify DISPATCH_WALKER naming
Modifies dispatch walker enumerator naming to be unified among compute
walkers.

Related-To: NEO-13147

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2024-11-29 14:00:47 +01:00
Zbigniew Zdanowicz
c5ed6bf73c refactor: split sync buffer and region allocation creation code
- split the allocation code from command list or kernel
- allow to call allocation code in all parts of the driver

Related-To: NEO-13350

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-29 11:41:39 +01:00
Mateusz Jablonski
f2b0dad964 test: remove not used usings/typedefs in shared tests
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-29 10:46:25 +01:00
Mateusz Jablonski
d4e201db86 test: remove not used usings/typedefs/variables in shared tests
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-29 10:46:10 +01:00
Bartosz Dunajski
5e1fa75676 refactor: adjust code to compile with c++20
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-11-29 10:27:29 +01:00
Vysochyn, Illia
afd22999cc refactor: Adjust RENDER_SURFACE_STATE structures naming
Performs minor renaming (mostly capitalization) in order to align with
specification.

Renames L1_CACHE_POLICY to L1_CACHE_CONTROL.

Related-To: NEO-13147

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2024-11-29 09:43:11 +01:00
Mateusz Jablonski
f86d2cee41 test: remove not used usings/typedefs/variables in OCL tests
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-29 07:33:36 +01:00
Compute-Runtime-Validation
bced7e4621 Revert "performance: stop ULLS for BCS during migration"
This reverts commit 81ba52aac4.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-11-29 04:42:26 +01:00
Vysochyn, Illia
89c3aab321 refactor: Extract PostSyncType to outer abstract layer
This modification serves to simplify the integration of generated
compute walker structures.

Related-To: NEO-13147

Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>
2024-11-28 18:09:21 +01:00
Mateusz Jablonski
fa58073095 refactor: remove not used usings/typedefs/variables
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-28 16:19:39 +01:00
Mateusz Jablonski
2039b1c41b refactor: remove not needed code
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-28 16:16:30 +01:00
Szymon Morek
81ba52aac4 performance: stop ULLS for BCS during migration
Related-To: NEO-13340

When regular copy CSR has enabled direct submission,
stop it before migration on internal CSR.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-27 20:06:50 +01:00
Zbigniew Zdanowicz
6b7235cd6c refactor: change parameter names into more meaningful
- change additional size into local region size
- change walk order into dispatch walk order to distinguish for local id walk

Related-To: NEO-13350

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-27 16:15:02 +01:00
Zbigniew Zdanowicz
6453a5ec31 fix: correct sequence of estimates to get correct size for start command
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-27 09:38:39 +01:00
Mateusz Jablonski
974e8ae63f performance: don't call evict with empty handle list
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-27 09:25:36 +01:00
Dominik Dabek
e55aa958b7 fix: track usm reuse usage in multiple contexts
Add tracking of memory used for usm reuse mechanism when multiple cl
contexts are used.
Tracking for device added to NEO::Device, for host added to
NEO::MemoryManager.

This fixes usm reuse using x% of memory per each context instead of
globally.

Related-To: NEO-13308

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-11-26 16:00:45 +01:00
Mateusz Jablonski
db6fe7892c fix: remove destroyed allocations from eviction lists
mark explicitly made resident allocations

Related-To: NEO-13246, GSD-10319
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-25 22:19:54 +01:00
Bartosz Dunajski
1ddcb92f1d refactor: pass device to encoder functions
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-11-25 16:25:04 +01:00
Oskar Hubert Weber
5dc5c839a6 test: allowing neo ULT build with optimization enabled 2/n
Fixes mismatched-new-delete warnings

Related-To: NEO-8116

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2024-11-25 16:21:40 +01:00
Bartosz Dunajski
5862cbcb9f refactor: add max local region size query
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-11-22 17:33:22 +01:00
Wenbin Lu
d03ba275c5 test: use explicit value in NumSubSlicesPerSlice ULT
Related-To: NEO-10060

Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-11-22 17:17:33 +01:00
Bellekallu Rajkiran
0c3a767ae6 fix: Enable immediate binding for make resident on linux
Related-To: NEO-11977

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2024-11-22 15:56:18 +01:00
Maciej Plewka
0454a9480a fix: do not expose render engine by default on xe2
Related-To: HSD-18040606360
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-11-22 11:58:26 +01:00
Wenbin Lu
47383d752f fix: report correct number of XeCore per Cluster
Related-To: NEO-10060

Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-11-22 09:40:28 +01:00
Compute-Runtime-Validation
fc5cab4445 Revert "fix: do not expose render engine by default on xe2"
This reverts commit ff324424eb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-11-22 02:50:41 +01:00
Bellekallu Rajkiran
8fdada84d6 test: Move product matcher to common matchers header
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2024-11-21 15:47:45 +01:00
Alicja Lukaszewicz
789efc8909 fix: remove number of RT stacks from capability table
Related-To: NEO-10830

Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com>
2024-11-21 14:46:50 +01:00
Maciej Plewka
ff324424eb fix: do not expose render engine by default on xe2
Related-To: HSD-18040606360
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-11-21 13:13:32 +01:00
Katarzyna Cencelewska
4ad8c17db9 feature: add debug flags for timestamps
PrintCalculatedTimestamps - print ts in level zero paths
PrintTimestampPacketContents - add logging also to level zero paths
ForceUseOnlyGlobalTimestamps - force using a global ts

Related-To: HSD-14023527252
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-11-21 11:28:08 +01:00
Maciej Plewka
46c345789d refactor: move depth limitation from release helper to image_hw
Related-To: NEO-8390, HSD-16021488507
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-11-21 11:24:06 +01:00
Szymon Morek
d8ac8641e5 performance: enable ULLS on copy engine for BMG
Related-To: NEO-12477

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-21 10:05:24 +01:00
Bartosz Dunajski
dab4166837 fix: add missing aub polls on sync points
Related-To: HSD-14023925176

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-11-21 09:17:54 +01:00