The following modifications were made as part of the fix
for warm reset.
1. Release sysman resources before quiescing the GPU.
2. Add additional checks to confirm that the GPU is quiesced
before launching the diagnostics tests.
3. Fix warm reset by adding a wait time so that the changes
propagate to the entire GPU PCI tree.
4. Rewrite the ULTs completely to avoid the use of mocks.
5. Change Diagnostics handle creation from per-SubDevice to per-device.
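The ordering in items 1 to 3 can be sketched as below. This is only an
illustration under stated assumptions, not the code from this change: every
name (wait_until, run_diagnostics_after_quiesce, and the four callbacks) is
hypothetical, and the timeout and polling interval are placeholder values.

```python
import time


def wait_until(predicate, timeout_s=5.0, interval_s=0.1,
               sleep=time.sleep, clock=time.monotonic):
    """Poll predicate() until it returns True or timeout_s elapses."""
    deadline = clock() + timeout_s
    while True:
        if predicate():
            return True
        if clock() >= deadline:
            return False
        sleep(interval_s)


def run_diagnostics_after_quiesce(release_sysman, quiesce_gpu, is_quiesced,
                                  run_tests, timeout_s=5.0, interval_s=0.1):
    """Hypothetical flow: release, quiesce, confirm, then run the tests."""
    release_sysman()  # item 1: free sysman resources before quiescing
    quiesce_gpu()
    # item 2: do not start the tests until the GPU is confirmed quiesced
    if not wait_until(is_quiesced, timeout_s, interval_s):
        raise TimeoutError("GPU did not quiesce in time")
    return run_tests()
```

The same wait_until helper could be reused for item 3, with a predicate that
reports whether the device is visible again after the warm reset.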
Related-To: LOCI-3053
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
Add support for getCardPowerDomain API.
Modify check to verify whether hwmon dir exists.
Modify ULTs to verify updated power interface.
Update power control/query interface.
Related-To: LOCI-2735, LOCI-3046
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
Previously, the sysman memory module used the logical subdeviceId
exposed by core to retrieve memory telemetry data. Replace
the logical subdeviceId with the actual subdeviceId when collecting
telemetry data.
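To illustrate why the two ids can differ, here is a minimal sketch that
assumes the logical id is an index into the set of sub-devices currently
exposed to the application, described by a bitmask. Both the bitmask and the
function name physical_subdevice_id are assumptions made for this example,
not the mechanism used by this change.

```python
def physical_subdevice_id(logical_id, exposed_mask):
    """Return the bit position of the logical_id-th set bit in exposed_mask."""
    if logical_id < 0:
        raise ValueError("logical_id must be non-negative")
    seen = 0  # number of exposed sub-devices encountered so far
    bit = 0   # physical sub-device index currently being examined
    mask = exposed_mask
    while mask:
        if mask & 1:
            if seen == logical_id:
                return bit
            seen += 1
        mask >>= 1
        bit += 1
    raise IndexError("logical_id exceeds the number of exposed sub-devices")
```

With only physical sub-device 1 exposed (mask 0b10), logical id 0 maps to
physical id 1, so telemetry read with the logical id would come from the
wrong sub-device.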
Related-To: LOCI-2828
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
Before performing a GPU device reset, all Level Zero resources
and GPU device-specific resources have to be cleaned up. In addition,
the state of the GPU device is lost after a device reset.
Hence, after performing a GPU device reset, the Level Zero device has
to be reinitialized by querying the GPU device again.
This change reinitializes the Level Zero resources
after a GPU device reset, so that the user can continue using Level Zero
devices after the reset.
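The release, reset, re-query ordering described above can be sketched as
follows. DeviceSession, enumerate_devices, reset_device and release() are
hypothetical names chosen for this example; the real driver code is
structured differently.

```python
class DeviceSession:
    """Owns the device handles so they can be dropped and re-created."""

    def __init__(self, enumerate_devices):
        self._enumerate_devices = enumerate_devices
        self.devices = enumerate_devices()

    def reset(self, reset_device):
        # 1. Release every handle first; they are invalid after the reset.
        for device in self.devices:
            device.release()
        self.devices = []
        # 2. Reset the GPU; its previous state is lost at this point.
        reset_device()
        # 3. Query the GPU again to rebuild usable handles.
        self.devices = self._enumerate_devices()
        return self.devices
```

Callers that cached a handle before the reset must fetch it again from
session.devices, since the old objects have been released.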
Related-To: LOCI-2627
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
Dates corrected in copyright headers to reflect original publication date
(2018 for OpenCL, 2020 for Level Zero).
Signed-off-by: lgotszal <lukasz.gotszald@intel.com>
This change updates the Temperature APIs to report the correct current
temperature based on the updated PMT interface.
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
Add boilerplate for all scheduler APIs listed in the spec.
Change-Id: I0ba7e4d969f409e1a3360204dbc16037898d64c8
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
Implemented function level reset.
Implementation is:
Make sure we are root (otherwise, return insufficient permissions)
Make sure no one has the device open
(otherwise, return handle object in use)
Close our file handle
Unbind the device from the kernel driver
Make sure no one still has the device open
(otherwise, kill them)
Perform function level reset (FLR)
Rebind the device to the kernel driver
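The sequence above can be sketched against the standard Linux PCI sysfs
files (drivers/<driver>/unbind, devices/<bdf>/reset, drivers/<driver>/bind).
This is an illustration, not the code in this change: the function name, the
exception type and the find_openers, kill_process and close_own_handle
callbacks are hypothetical, and find_openers is assumed to return the PIDs of
other processes that hold the device open.

```python
import os
from pathlib import Path


class DeviceInUseError(RuntimeError):
    """Raised when another process still has the device open."""


def function_level_reset(bdf, driver, find_openers, kill_process,
                         close_own_handle, sysfs_root="/sys",
                         is_root=lambda: os.geteuid() == 0):
    pci = Path(sysfs_root) / "bus" / "pci"
    # Make sure we are root.
    if not is_root():
        raise PermissionError("function level reset requires root")
    # Make sure no one else has the device open.
    if find_openers():
        raise DeviceInUseError("device is open in another process")
    # Close our own file handle.
    close_own_handle()
    # Unbind the device from the kernel driver.
    (pci / "drivers" / driver / "unbind").write_text(bdf)
    # Anyone who opened the device in the meantime is killed.
    for pid in find_openers():
        kill_process(pid)
    # Perform the function level reset (FLR).
    (pci / "devices" / bdf / "reset").write_text("1")
    # Rebind the device to the kernel driver.
    (pci / "drivers" / driver / "bind").write_text(bdf)
```

Taking sysfs_root and is_root as parameters is a choice made for this sketch
so that the sequence can be exercised against a temporary directory without
touching real hardware.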
Change-Id: Ic57b95487e73b5a5f2d03e619d813bf4199adf40
Signed-off-by: Bill Jordan <bill.jordan@intel.com>