The following modifications were done as part of the fix
for warm reset.
1. Release sysman resources before quiscenting the GPU.
2. Add additional checks to confirm quiscenting of the GPU
before launching the diagnostics tests.
3. Fixed warm reset with wait time to allow the changes to be
propagated to the entire GPU PCI tree.
4. Modified the ULT's completely to avoid the usage of MOCKS.
5. Made Diagnostics handle creation per-device from per-SubDevice.
Related-To: LOCI-3053
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
Define single .clang-tidy configuration with all used checks and use
NOLINT to selectively silence tool. That way cleanup should be easier.
third_part/ has its own configuration that disables clang-tidy for this
folder.
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
Add support for getCardPowerDomain API.
Modify check to verify whether hwmon dir exists.
Modify ULTs to verify updated power interface.
Update power control/query interface.
Related-To: LOCI-2735, LOCI-3046
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
warm and cold reset are common functionality,
the code is being moved to the common sysman implementation
from diagnostics specific files.
Related-To: LOCI-1908
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
This patch adds support for reading PCI bandwidth, generation
and linkwidth information from sysfs nodes for the linux
platform.
Related-To: LOCI-2969
Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
This change uses value of cpuAddress from monitored fence
to detect GPU hang.
Related-To: NEO-5313
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
Related-To: NEO-6575
This is needed to fix accessing IoctlHelper
after driver detach.
This way we are also reducing accessing
sysfs file in Drm::getPrelimVersion
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
Fixed by avoiding library function access if library is unavailable.
Related-To: LOCI-2719
Signed-off-by: Ranjan, Joshua Santhosh <joshua.santosh.ranjan@intel.com>
Use pread sys call instead of mmap and munmap
to get telemetry info.
Related-To: LOCI-2634
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
Before performing gpu device reset, first all level zero resources
and gpu device specific resources have to be cleaned up. Also as
after device reset, state of gpu device would be lost.
Hence after performing gpu device reset, level zero device have
to be reinitialized by querying gpu device again.
This change is aimed at reinitializing the level zero resources
after gpu device reset, so that user could continue using level zero
devices after device reset.
Related-To: LOCI-2627
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
firmware Utils was always enumerating the same firmware
device handle for all sysman devices.
Related-To:LOCI-2609
Signed-off-by: T J Vivek Vilvaraj <t.j.vivek.vilvaraj@intel.com>
This change introduce
- new method deviceReinit, which could be used, to reinitialize
existing level zero device.
Related-To: LOCI-2612
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
This change updates Temperature APIs to get correct current
temperature based on updated PMT interface.
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
Netlink and generic netlink are standard linux
interfaces to allow management traffic between
user space and kernel subsystems over sockets.
This patch adds a loader for the generic netlink
library to allow Level 0 Sysman to manage
linux hardware that implements the generic
netlink interface. ULTs updated.
Signed-off-by: William Jordan <bill.jordan@intel.com>