Commit Graph

1066 Commits

Author SHA1 Message Date
Filip Hazubski
f09fd527f1 Remove redundant using declarations
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-04-04 17:10:21 +02:00
Michal Mrozek
9016772351 No dynamic allocations on hot calls [2/n]
- Prevent FULL vector copy.

Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2022-04-01 16:22:09 +02:00
Mateusz Hoppe
beff0019d1 SBA tracking for single address space
Related-To: NEO-6539


Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-04-01 15:24:11 +02:00
Mateusz Hoppe
4374197c9d Fixes for bindless configuration
- enhance ults code, do not inject memory manager
- fix some issues related to bindless global heap

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-04-01 15:08:43 +02:00
Lukasz Jobczyk
6fee7c9de1 Unify the appendWaitOnEvents method
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-31 14:48:19 +02:00
Dominik Dabek
308f9ccfff Do not allocate dynamic state heap if not needed
Dynamic state heap is only used for sampler data.

Related-To: NEO-6821

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-03-31 12:13:13 +02:00
Bartosz Dunajski
08e3853982 Debug flag to add extra MI_MEM_FENCE for DirectSubmission
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-03-30 16:07:25 +02:00
Zbigniew Zdanowicz
a6d99f416d Use context end offset for partitioned events
Related-To: NEO-6811

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-30 14:20:03 +02:00
Mateusz Hoppe
681c09d314 Extend LinearStream with gpuBase
- added methods to set and query gpu base address

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-03-29 15:15:53 +02:00
Zbigniew Zdanowicz
9858438121 Limit multiple partition count to compute command lists
Related-To: NEO-6811

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-29 07:29:08 +02:00
Compute-Runtime-Validation
90c6d7d9c9 Revert "Expose copy engines in parent device with implicit scaling"
This reverts commit 6154fb8ea7.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-28 22:06:10 +02:00
Jaime Arteaga
6154fb8ea7 Expose copy engines in parent device with implicit scaling
When using implicit scaling, expose the copy engines from
sub-device 0 in the root device. This to facilitate
programming models of layers above.

Related-To: NEO-6815

Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-03-28 16:48:04 +02:00
Dominik Dabek
fccda460d6 Optimize zeCommandListCreate
Do not allocate heap if command list is copy only.

Related-To: NEO-6821

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>

Co-authored-by: Michal Mrozek <michal.mrozek@intel.com>
2022-03-28 16:31:10 +02:00
Konstanty Misiak
174c27eb31 Fix CFEFusedEUDispatch debug flag
Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>
2022-03-28 12:32:05 +02:00
Compute-Runtime-Validation
34a5c6cd28 Revert "Remove programming first SCM from the command list"
This reverts commit 9c06af79ad.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-28 11:15:35 +02:00
Filip Hazubski
586e8510de Remove unneeded include
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-26 09:55:25 +01:00
Jim Snow
0a926c7d12 Add Windows image memory export functionality
Adds zeImageGetAllocPropertiesExt function, implementation code shared
shared with zeMemGetAllocProperties moved into common helper function.

Related-To: LOCI-2665

Signed-off-by: Jim Snow <jim.m.snow@intel.com>
2022-03-26 05:11:35 +01:00
Jaime Arteaga
e8a6842b7e Add method to read kernel base address
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-03-25 21:49:22 +01:00
Zbigniew Zdanowicz
e4d8dfcdd9 Implement fence creation flag
Related-To: NEO-6242

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-25 12:04:43 +01:00
Zbigniew Zdanowicz
bb753784cc Optimize post sync operations for fence and task count update
Related-To: NEO-6242

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-25 10:57:10 +01:00
Lukasz Jobczyk
a230f267e1 Poll task count indefinitely on high throttle command queue
Resolves: NEO-6781

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-03-25 10:06:16 +01:00
Patryk Wrobel
14954acd12 Detect GPU hang in command list immediate
This change introduces checking of values returned
by blocking calls used in cmdlist_hw_immediate.inl.

Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
Related-To: NEO-6681
2022-03-24 12:42:19 +01:00
Krzysztof Gibala
ebc006ad53 Move SBA related WAs logic from CSR to EncodeWA
Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com>
2022-03-24 12:24:56 +01:00
Zbigniew Zdanowicz
814b0ecb87 Improve wait function for Fence object
Related-To: NEO-6242

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-24 10:48:49 +01:00
Kacper Nowak
1b6a907b13 Code refactor
This commit:
- Optimizes usage of std::unordered map
- Corrects variables naming and output messages
- Minor changes in using std::string methods
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2022-03-23 23:12:21 +01:00
Milczarek, Slawomir
f03f530327 Extend zeCommandListAppendMemoryPrefetch to migrate to associated device
Related-To: NEO-6740

Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2022-03-23 14:21:17 +01:00
Baj, Tomasz
1f7fd45251 Move device UUID from L0 to shared
Related-To: NEO-5681

Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2022-03-23 12:52:24 +01:00
Jitendra Sharma
f52f3df274 Add platform specific getter of debug surface size
For different platforms based on number of available threads
and debug surface layout, calculate max debug surface size.

Related-To: NEO-6676
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2022-03-22 12:18:40 +01:00
Bartosz Dunajski
05a0664720 Change default lod preclamp mode in sampler_state
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-03-22 11:24:09 +01:00
Filip Hazubski
9c06af79ad Remove programming first SCM from the command list
For first kernel in command list the required state described with SCM
command is stored and programmed when the command list is executed, only
if current stream state requires reprogramming.

Resolves: NEO-6728

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-21 11:24:07 +01:00
Zbigniew Zdanowicz
46f4472965 Refactor Fence class interface and implementation
Related-To: NEO-6242

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-21 10:35:26 +01:00
Filip Hazubski
35240e8533 Correct DispatchFlags initialization for immediate command list execution
With executeCommandListImmediateWithFlushTask set flag
containsAnyKernel to false to properly handle next kernel being added to
the immediate command list.

Related-To: NEO-6728

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-18 13:25:56 +01:00
Zbigniew Zdanowicz
1540d06c25 Correct infinite timeout argument for event synchronization
Related-To: NEO-6242

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-18 13:07:09 +01:00
Dominik Dabek
b03119b964 Add setArgBuffer caching to l0
When trying to set the same values we don't need to set it again.
The pointer and allocId need to be the same.

Related-To: NEO-6737

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-03-17 10:39:45 +01:00
Compute-Runtime-Validation
963536253f Revert "Correct infinite timeout argument for event synchronization"
This reverts commit 878466a1ea.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-17 10:31:55 +01:00
Joshua Santosh Ranjan
05a150f49f L0 Core Add Support For pci_speed_ext
This patch adds support for reading PCI bandwidth, generation
and linkwidth information from sysfs nodes for the linux
platform.

Related-To: LOCI-2969

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-03-16 22:47:59 +01:00
Zbigniew Zdanowicz
878466a1ea Correct infinite timeout argument for event synchronization
Related-To: NEO-6242

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-16 17:58:36 +01:00
Filip Hazubski
a5ac961726 Do not program thread arbitration policy on command list init and reset
This logic is now redundant and can be removed.

Related-To: NEO-6728

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-16 12:09:42 +01:00
Filip Hazubski
cd95572443 Reuse common logic of programming SCM fields for gen 9 and gen 11
Logic related to programming non coherent and thread arbitration policy for
gens 9 and 11 has been moved to EncodeComputeMode object, where similar
logic for gens gen12lp and newer is located.

Functions PreambleHelper::programThreadArbitration and
PreambleHelper::getThreadArbitrationCommandsSize have been removed.

Redundant setForceNonCoherent call has been removed from XE HPG

Related-To: NEO-6728

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-16 10:04:32 +01:00
Filip Hazubski
32b0f7b014 Remove redundant value CsrSizeRequestFlags::numGrfRequiredChanged
Related-To: NEO-5995

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-15 15:08:15 +01:00
Jaime Arteaga
5038cf6cfc Refactor getMemoryAccessProperties()
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-03-14 17:15:21 +01:00
Daria Hinz
452050ae40 Refactoring the use of PVC device ids
Replacing the old device id implementation
& clearing PVC XT temporary.

Related-To: NEO-6742
Signed-off-by: Daria Hinz <daria.hinz@intel.com>
2022-03-14 15:54:52 +01:00
Filip Hazubski
b79d9a8e10 Correct structs to explicitly initialize members
Affected structs are DebugAreaHeader, Ras and APITracerImp.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-14 15:40:28 +01:00
Compute-Runtime-Validation
cee7ded064 Revert "Add setArgBuffer caching to l0"
This reverts commit e6460e5534.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-13 00:50:42 +01:00
Spruit, Neil R
c8e60ce3ba Support Circular and chained deps in L0 Dynamic Link
- Added support for Dynamic Module link to allow usage of
symbols that exist thru a circular dependency from the export->import
module & chained dependencies with importModule -> ExportModule ->
ExportModule2.

- Adds a dependency walker which collects all dependent modules whose
export surface state must be made resident for a given kernel
execution to successfully access each function resolved on different
modules.

- Expanded the Dynamic Link Black Box test to allow for testing either
simple linking and circular + chained linkage.

Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
2022-03-11 21:15:08 +01:00
Jaime Arteaga
0e9e0ff5e9 Add ZE_experimental_image extensions to list of supported ones
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-03-11 17:00:14 +01:00
Bartosz Dunajski
c843d152ce Enable prepatcher WA for L0
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-03-11 15:20:32 +01:00
Dominik Dabek
e6460e5534 Add setArgBuffer caching to l0
When trying to set the same values we don't need to set it again.
The pointer and allocId need to be the same.

Related-To: NEO-6737

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-03-11 14:20:15 +01:00
Filip Hazubski
3eab7009ac Move SCM related WAs logic from CSR to EncodeComputeMode
This will help with unifying the logic between APIs and GENs.

Related-To: NEO-6728

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-11 14:00:53 +01:00
Milczarek, Slawomir
c0b7f05897 Add memory prefetch for kmd migrated shared allocations
This feature is disabled by default, controlled with the knob
AppendMemoryPrefetchForKmdMigratedSharedAllocations

Related-To: NEO-6740

Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2022-03-09 16:02:18 +01:00