Commit Graph

1924 Commits

Author SHA1 Message Date
Jitendra Sharma
f52f3df274 Add platform specific getter of debug surface size
For different platforms based on number of available threads
and debug surface layout, calculate max debug surface size.

Related-To: NEO-6676
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2022-03-22 12:18:40 +01:00
Bartosz Dunajski
05a0664720 Change default lod preclamp mode in sampler_state
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-03-22 11:24:09 +01:00
Filip Hazubski
9c06af79ad Remove programming first SCM from the command list
For first kernel in command list the required state described with SCM
command is stored and programmed when the command list is executed, only
if current stream state requires reprogramming.

Resolves: NEO-6728

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-21 11:24:07 +01:00
Artur Harasimiuk
2c16ac9355 code cleanup
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2022-03-21 10:51:43 +01:00
Zbigniew Zdanowicz
46f4472965 Refactor Fence class interface and implementation
Related-To: NEO-6242

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-21 10:35:26 +01:00
Filip Hazubski
35240e8533 Correct DispatchFlags initialization for immediate command list execution
With executeCommandListImmediateWithFlushTask set flag
containsAnyKernel to false to properly handle next kernel being added to
the immediate command list.

Related-To: NEO-6728

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-18 13:25:56 +01:00
Zbigniew Zdanowicz
1540d06c25 Correct infinite timeout argument for event synchronization
Related-To: NEO-6242

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-18 13:07:09 +01:00
Dominik Dabek
b03119b964 Add setArgBuffer caching to l0
When trying to set the same values we don't need to set it again.
The pointer and allocId need to be the same.

Related-To: NEO-6737

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-03-17 10:39:45 +01:00
Compute-Runtime-Validation
963536253f Revert "Correct infinite timeout argument for event synchronization"
This reverts commit 878466a1ea.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-17 10:31:55 +01:00
Jaime Arteaga
c0e2251ceb Skip adding allocations to remote devices if not allocated there
When making graphics allocations resident in multi-GPU scenarios,
we should make them resident only if there's an allocation for that
device. So return appropriate null pointer and skip it.

Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-03-16 23:58:07 +01:00
Joshua Santosh Ranjan
05a150f49f L0 Core Add Support For pci_speed_ext
This patch adds support for reading PCI bandwidth, generation
and linkwidth information from sysfs nodes for the linux
platform.

Related-To: LOCI-2969

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-03-16 22:47:59 +01:00
Bellekallu Rajkiran
061af9c284 Sysman : Add check on engine handle creation
Add check whether init succeeded on handle creation.

Related-To: LOCI-3005

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-16 18:08:42 +01:00
Zbigniew Zdanowicz
878466a1ea Correct infinite timeout argument for event synchronization
Related-To: NEO-6242

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-16 17:58:36 +01:00
Filip Hazubski
a5ac961726 Do not program thread arbitration policy on command list init and reset
This logic is now redundant and can be removed.

Related-To: NEO-6728

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-16 12:09:42 +01:00
Filip Hazubski
cd95572443 Reuse common logic of programming SCM fields for gen 9 and gen 11
Logic related to programming non coherent and thread arbitration policy for
gens 9 and 11 has been moved to EncodeComputeMode object, where similar
logic for gens gen12lp and newer is located.

Functions PreambleHelper::programThreadArbitration and
PreambleHelper::getThreadArbitrationCommandsSize have been removed.

Redundant setForceNonCoherent call has been removed from XE HPG

Related-To: NEO-6728

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-16 10:04:32 +01:00
Aravind Gopalakrishnan
5d90e2ab1d Find copy engines correctly before using in test cases
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2022-03-15 18:49:14 +01:00
Filip Hazubski
32b0f7b014 Remove redundant value CsrSizeRequestFlags::numGrfRequiredChanged
Related-To: NEO-5995

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-15 15:08:15 +01:00
Mateusz Jablonski
c7e76ce109 Add support for a new relocation name for implicit args
Related-To: NEO-5081
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-03-15 12:03:51 +01:00
Mayank Raghuwanshi
fac75222a3 Move Pci bdf parsing to os specific layer
Related-To: LOCI-2879

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-03-15 08:58:30 +01:00
Jaime Arteaga
5038cf6cfc Refactor getMemoryAccessProperties()
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-03-14 17:15:21 +01:00
Bellekallu Rajkiran
604f2c625e Sysman: Utilize Ioctl helpers for memory module
Memory module translates memory regions without any
runtime query on underlying hardware/kernel. Use
ioctl helpers for memory region translation.

Related-To: LOCI-3006

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-14 16:46:29 +01:00
Daria Hinz
452050ae40 Refactoring the use of PVC device ids
Replacing the old device id implementation
& clearing PVC XT temporary.

Related-To: NEO-6742
Signed-off-by: Daria Hinz <daria.hinz@intel.com>
2022-03-14 15:54:52 +01:00
Filip Hazubski
b79d9a8e10 Correct structs to explicitly initialize members
Affected structs are DebugAreaHeader, Ras and APITracerImp.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-14 15:40:28 +01:00
Compute-Runtime-Validation
cee7ded064 Revert "Add setArgBuffer caching to l0"
This reverts commit e6460e5534.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-13 00:50:42 +01:00
Spruit, Neil R
c8e60ce3ba Support Circular and chained deps in L0 Dynamic Link
- Added support for Dynamic Module link to allow usage of
symbols that exist thru a circular dependency from the export->import
module & chained dependencies with importModule -> ExportModule ->
ExportModule2.

- Adds a dependency walker which collects all dependent modules whose
export surface state must be made resident for a given kernel
execution to successfully access each function resolved on different
modules.

- Expanded the Dynamic Link Black Box test to allow for testing either
simple linking and circular + chained linkage.

Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
2022-03-11 21:15:08 +01:00
Jaime Arteaga
0e9e0ff5e9 Add ZE_experimental_image extensions to list of supported ones
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-03-11 17:00:14 +01:00
Bartosz Dunajski
c843d152ce Enable prepatcher WA for L0
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-03-11 15:20:32 +01:00
Dominik Dabek
e6460e5534 Add setArgBuffer caching to l0
When trying to set the same values we don't need to set it again.
The pointer and allocId need to be the same.

Related-To: NEO-6737

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2022-03-11 14:20:15 +01:00
Filip Hazubski
3eab7009ac Move SCM related WAs logic from CSR to EncodeComputeMode
This will help with unifying the logic between APIs and GENs.

Related-To: NEO-6728

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-11 14:00:53 +01:00
Kamil Kopryk
7d6bee26c7 Move pvc helpers to pvc files
Related-To: NEO-6631
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-03-11 13:46:59 +01:00
Joshua Santosh Ranjan
c7ce397b17 Metrics IP Sampling Add Support for Streamer APIs
This Patch adds support for collecting IP Metrics using
StreamerOpen, StreamerClose and StreamerReadData

Related-To: LOCI-2755
Related-To: LOCI-2756

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-03-11 03:03:22 +01:00
Grzegorz Choinski
6133d8c4b5 fix paths to prebuilt kernels
It otherwise looks for different path to copy kernel than it is generated into

Signed-off-by: Grzegorz Choinski <grzegorz.choinski@intel.com>
2022-03-09 18:55:44 +01:00
Milczarek, Slawomir
c0b7f05897 Add memory prefetch for kmd migrated shared allocations
This feature is disabled by default, controlled with the knob
AppendMemoryPrefetchForKmdMigratedSharedAllocations

Related-To: NEO-6740

Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
2022-03-09 16:02:18 +01:00
Mateusz Jablonski
3e8a6680e2 Allow undefined symbols in symbol table for OCL
Related-To: NEO-5081
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-03-09 10:32:15 +01:00
Kamil Kopryk
038d1d54fa Correct xe_hpc tests
Related-To: NEO-6631


Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-03-09 09:21:30 +01:00
Filip Hazubski
dd01cff879 Unify logic determining thread arbitration policy value
Related-To: NEO-6728

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-08 13:14:56 +01:00
Lisanna Dettwyler
e6e6ae2d2b Don't use low-priority CSR for copy-only cmdq
getCsrForLowPriority will always return a CSR with a compute engine
regardless of the ordinal used. When a copy-only ordinal is used to
create a command queue or command list, batch buffers are written using
BCS style commands rather than CCS style. This caused BCS commands to be
submitted to a low-priority compute engine which was invalid. This patch
ensures a copy engine is selected when a copy-only ordinal is used even
if ZE_COMMAND_QUEUE_PRIORITY_PRIORITY_LOW is specified.

Related-To: LOCI-2980

Signed-off-by: Lisanna Dettwyler <lisanna.dettwyler@intel.com>
2022-03-07 21:38:12 +01:00
Zbigniew Zdanowicz
8f5dd3cff5 Refactor immediate command list flushing
Related-To: NEO-6242

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-07 20:37:28 +01:00
Filip Hazubski
80b520bc9b Change ThreadArbitrationPolicy enum type to int32_t
Change ThreadArbitrationPolicy::NotPresent value to -1
Update initial values to ThreadArbitrationPolicy::NotPresent

Related-To: NEO-6728

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-07 20:04:24 +01:00
Filip Hazubski
0f54a57910 Update programming STATE_COMPUTE_MODE
Remove function clearComputeModePropertiesIfNeeded.
If a field has to be programmed unconditionally, ignore isDirty flag.

Related-To: NEO-6728

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-07 16:23:59 +01:00
Compute-Runtime-Validation
1a823356a3 Revert "Add prelim support for ras diagnostics and firmware"
This reverts commit 5a2145ad8d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-06 11:31:15 +01:00
Jaime Arteaga
a010fb3634 Dont close shared handle on imported allocations
Related-To: LOCI-2272

Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-03-06 04:35:47 +01:00
Bellekallu Rajkiran
7ec8d8ef91 Add prelim support for frequency events and fabric port
Related-To: LOCI-2864

Source of prelim headers:
https://github.com/intel-gpu/drm-uapi-helper/tree/master/i915-shared-headers

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-06 02:47:20 +01:00
Bellekallu Rajkiran
01db5ef22a Add prelim support for memory, engine and scheduler
Related-To: LOCI-2864

Source of prelim headers:
https://github.com/intel-gpu/drm-uapi-helper/tree/master/i915-shared-headers

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-04 19:56:30 +01:00
Krystian Chmielewski
0ccce5a6d7 Zebin: set kernel barriers based on ext funcs
This change allows for modifying kernel's barrier count
based on called external functions metadata passed
via zeInfo section in zebin.

Added parsing external functions metadata.
Added resolving external functions call graph.
Added updating kernel barriers based on called external functions.
Added support for L0 dynamic link.

Signed-off-by: Krystian Chmielewski <krystian.chmielewski@intel.com>
2022-03-04 14:21:50 +01:00
Bellekallu Rajkiran
65fd55a893 Add prelim performance factor support
Related-To: LOCI-2864

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-04 08:44:10 +01:00
Compute-Runtime-Validation
d06bbd07e5 Revert "Add ze_eu_count_t to get total number of EUs"
This reverts commit 30ea8ea48e.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-04 06:55:17 +01:00
Joshua Santosh Ranjan
41988fc429 Metrics IP Sampling Enumeration Support
This patch adds enumeration of metric group for
IP sampling.

Related-To: LOCI-2754


Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-03-04 00:22:18 +01:00
Bellekallu Rajkiran
5a2145ad8d Add prelim support for ras diagnostics and firmware
Related-To: LOCI-2864

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-03 18:51:21 +01:00
Bellekallu Rajkiran
922a224cc9 Add prelim support for temperature, power and global operations
Related-To: LOCI-2864

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-03 18:11:34 +01:00