Compute-Runtime-Validation
b91e630bb5
Revert "Update zesMemoryGetBandWidth for PVC steppings"
...
This reverts commit 81544445aa.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2022-04-22 10:31:59 +02:00
Mayank Raghuwanshi
c637903132
Modify getSupportedRasErrorTypes function for gt Ras errors
...
Related-To: LOCI-2934
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com >
2022-04-22 08:26:07 +02:00
Mayank Raghuwanshi
81544445aa
Update zesMemoryGetBandWidth for PVC steppings
...
Related-To: LOCI-2999
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com >
2022-04-22 07:59:11 +02:00
Bellekallu Rajkiran
6f2efa4d54
Update fabric id based on new sysfs entry
...
Added new iaf directory path to query
fabric id.
Related-To: LOCI-3034
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2022-04-20 09:05:41 +02:00
Bellekallu Rajkiran
f7e04b32aa
Update throttle reason status sysfs names
...
Related-To: LOCI-1843
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2022-04-14 08:45:17 +02:00
Bellekallu Rajkiran
51988148de
Sysman: Remove redundant files
...
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2022-04-11 18:32:57 +02:00
Mayank Raghuwanshi
5a804841cb
Add pmt xml offsets for ATS-M
...
Related-To: LOCI-2960
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com >
2022-04-11 07:29:44 +02:00
Vilvaraj, T J Vivek
47f7b4f509
sysman: clean up code duplication for reset
...
Warm and cold reset are common functionality, so
the code is moved to the common sysman implementation
from the diagnostics-specific files.
Related-To: LOCI-1908
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com >
2022-03-31 14:10:39 +02:00
Bellekallu Rajkiran
cdf3d00919
Update throttle reason status sysfs name
...
Related-To: LOCI-1843
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2022-03-29 11:02:47 +02:00
Bellekallu Rajkiran
cf9a5ed7d7
Add prelim support for ras diagnostics and firmware
...
Related-To: LOCI-2864
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2022-03-24 06:58:25 +01:00
Bellekallu Rajkiran
061af9c284
Sysman : Add check on engine handle creation
...
Add check whether init succeeded on handle creation.
Related-To: LOCI-3005
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2022-03-16 18:08:42 +01:00
Mayank Raghuwanshi
fac75222a3
Move Pci bdf parsing to os specific layer
...
Related-To: LOCI-2879
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com >
2022-03-15 08:58:30 +01:00
Bellekallu Rajkiran
604f2c625e
Sysman: Utilize Ioctl helpers for memory module
...
Memory module translates memory regions without any
runtime query on underlying hardware/kernel. Use
ioctl helpers for memory region translation.
Related-To: LOCI-3006
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2022-03-14 16:46:29 +01:00
Filip Hazubski
b79d9a8e10
Correct structs to explicitly initialize members
...
Affected structs are DebugAreaHeader, Ras and APITracerImp.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2022-03-14 15:40:28 +01:00
Compute-Runtime-Validation
1a823356a3
Revert "Add prelim support for ras diagnostics and firmware"
...
This reverts commit 5a2145ad8d.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2022-03-06 11:31:15 +01:00
Bellekallu Rajkiran
7ec8d8ef91
Add prelim support for frequency events and fabric port
...
Related-To: LOCI-2864
Source of prelim headers:
https://github.com/intel-gpu/drm-uapi-helper/tree/master/i915-shared-headers
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2022-03-06 02:47:20 +01:00
Bellekallu Rajkiran
01db5ef22a
Add prelim support for memory, engine and scheduler
...
Related-To: LOCI-2864
Source of prelim headers:
https://github.com/intel-gpu/drm-uapi-helper/tree/master/i915-shared-headers
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2022-03-04 19:56:30 +01:00
Bellekallu Rajkiran
65fd55a893
Add prelim performance factor support
...
Related-To: LOCI-2864
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2022-03-04 08:44:10 +01:00
Bellekallu Rajkiran
5a2145ad8d
Add prelim support for ras diagnostics and firmware
...
Related-To: LOCI-2864
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2022-03-03 18:51:21 +01:00
Bellekallu Rajkiran
922a224cc9
Add prelim support for temperature, power and global operations
...
Related-To: LOCI-2864
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2022-03-03 18:11:34 +01:00
Bellekallu Rajkiran
b6d3b4cca6
Sysman: Fix incorrect frequency request value
...
Sysman queries frequency request value from an incorrect
sysFs node which results in incorrect frequency request
value.
Modify sysFs node to query from correct node.
Related-To: LOCI-2887
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2022-02-24 05:15:18 +01:00
T.J.Vivek Vilvaraj
1e6a38035e
sysman: serialize access to libigsc.
...
In a multithreaded environment, access to the external
library needs to be synchronized.
Resolves: LOCI-2871, LOCI-2873
Signed-off-by: T.J.Vivek Vilvaraj <t.j.vivek.vilvaraj@intel.com >
2022-02-15 08:08:44 +01:00
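A minimal sketch of the serialization idea described in this commit, assuming a single mutex guarding every call into the external library. `ExternalLibrary`, `SerializedLibrary`, and `runConcurrentQueries` are hypothetical names used for illustration only; they are not the libigsc API or the sysman implementation.

```cpp
#include <mutex>
#include <thread>
#include <vector>

// Hypothetical stand-in for an external library that is not thread-safe.
struct ExternalLibrary {
    int callCount = 0;
    void query() { ++callCount; } // unsynchronized read-modify-write
};

// Hypothetical wrapper that funnels every library access through one mutex.
class SerializedLibrary {
  public:
    void query() {
        std::lock_guard<std::mutex> lock(accessMutex);
        library.query();
    }
    int callCount() {
        std::lock_guard<std::mutex> lock(accessMutex);
        return library.callCount;
    }

  private:
    std::mutex accessMutex;
    ExternalLibrary library;
};

// Issues queries from several threads and returns the total number of calls
// that reached the library.
int runConcurrentQueries(int threadCount, int queriesPerThread) {
    SerializedLibrary library;
    std::vector<std::thread> workers;
    for (int t = 0; t < threadCount; ++t) {
        workers.emplace_back([&library, queriesPerThread]() {
            for (int i = 0; i < queriesPerThread; ++i) {
                library.query();
            }
        });
    }
    for (auto &worker : workers) {
        worker.join();
    }
    return library.callCount();
}
```

Because every call takes the same lock, no two threads are inside the library at once, so the counter is never updated concurrently.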
Bellekallu Rajkiran
0bd60e524a
Initialize telemetry device entry variable
...
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2022-01-31 20:33:53 +01:00
Mayank Raghuwanshi
90963b95ad
Update mechanism for getting subdeviceId and onSubdevice for memory
...
Earlier, the sysman memory module used the logical subdeviceId
exposed by core to retrieve memory telemetry data. Replace
the logical subdeviceId with the actual subdeviceId for collecting
telemetry data.
Related-To: LOCI-2828
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com >
2022-01-28 07:52:47 +01:00
Ayush Pandey
715b9d31d2
Find sscanf alternative.
...
Used strtol() to write sscanfUtil, which extracts BDF info from the pcipath.
Related-To: LOCI-1002
Signed-off-by: Ayush Pandey <ayush.pandey@intel.com >
2022-01-21 09:02:48 +01:00
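A minimal sketch of parsing a PCI address with strtol() instead of sscanf(), assuming the usual `domain:bus:device.function` hexadecimal layout (for example `0000:3a:00.1`). `PciBdf` and `parseBdf` are hypothetical names; the real sscanfUtil helper may look different.

```cpp
#include <cstdlib>
#include <string>

// Hypothetical container for the four fields of a PCI address.
struct PciBdf {
    long domain = 0;
    long bus = 0;
    long device = 0;
    long function = 0;
};

// Parses "domain:bus:device.function" (hex fields) with strtol().
// Returns false if a field is missing or a delimiter does not match.
// Note: strtol() itself tolerates leading whitespace, a sign, and "0x".
bool parseBdf(const std::string &text, PciBdf &out) {
    const char delimiters[4] = {':', ':', '.', '\0'};
    long fields[4] = {0, 0, 0, 0};
    const char *cursor = text.c_str();

    for (int i = 0; i < 4; ++i) {
        char *end = nullptr;
        fields[i] = std::strtol(cursor, &end, 16);
        // No digits consumed, or the wrong character follows the field.
        if (end == cursor || *end != delimiters[i]) {
            return false;
        }
        if (i < 3) {
            cursor = end + 1; // skip the delimiter
        }
    }

    out.domain = fields[0];
    out.bus = fields[1];
    out.device = fields[2];
    out.function = fields[3];
    return true;
}
```

Checking the end pointer after each strtol() call is what replaces the format-string matching that sscanf() would otherwise do.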
Szymon Morek
26a24e8fde
Query engine info with distances
...
If a prelim kernel is being used, query distances
and correctly set the number of available engines.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2022-01-10 13:30:26 +01:00
Szymon Morek
6258575e5e
Use queryEngineInfo with prelim ioctls
...
If prelim kernel is being used, query engines
with prelim ioctls.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2022-01-04 12:15:04 +01:00
Mayank Raghuwanshi
25403cf85d
Add support for device level performance factor
...
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com >
2021-12-28 07:13:51 +01:00
T J Vivek Vilvaraj
9a39cad07d
sysman:add reinitialization code to diagnostics
...
Signed-off-by: T J Vivek Vilvaraj <t.j.vivek.vilvaraj@intel.com >
2021-12-23 18:27:06 +01:00
Vilvaraj, T J Vivek
15f102a7cb
sysman:modify diagnostics tests
...
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com >
2021-12-22 14:53:38 +01:00
Szymon Morek
2647d563c7
Remove i915 structs from MemoryInfo
...
Use structs defined in ioctl_helper.h instead of
i915 dependent ones to avoid conflicts between
different kernels
Related-To: NEO-6149
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2021-12-22 12:46:59 +01:00
T J Vivek Vilvaraj
b95428078e
sysman: filter diagnostics related accesses
...
prevent diagnostics related calls on unsupported platforms
Signed-off-by: T J Vivek Vilvaraj <t.j.vivek.vilvaraj@intel.com >
2021-12-20 12:49:49 +01:00
Ranjan, Joshua Santhosh
5a2a19fa1a
Sysman Fix FirmwareUtil Cleanup
...
Fixed by avoiding library function access if library is unavailable.
Related-To: LOCI-2719
Signed-off-by: Ranjan, Joshua Santhosh <joshua.santosh.ranjan@intel.com >
2021-12-15 09:18:18 +01:00
Bellekallu Rajkiran
4ae2f6e111
Sysman: Add support for device level energy counters
...
Related-To: LOCI-2724
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2021-12-14 01:33:59 +01:00
Daniel Enriquez
cf70a57efb
Sysman Windows: Fix Max Memory Bandwidth.
...
Signed-off-by: Daniel Enriquez <daniel.enriquez.montanez@intel.com >
2021-12-10 00:36:15 +01:00
Mayank Raghuwanshi
94d09f75b7
Get RAS HBM errors count using firmware interface
...
-- master-commit
Add functionality to retrieve memory errors from Firmware
-- master-commit
Related-To: LOCI-2491, LOCI-2726
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com >
2021-12-08 18:57:24 +01:00
Joshua Santosh Ranjan
feae44bce8
Added Fabric RAS error support
...
fabric error counts are read from sysfs nodes
Related-To: LOCI-2613
Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com >
2021-12-08 07:00:39 +01:00
Bellekallu Rajkiran
a1121ccb6b
Sysman: Replace mmap with pread
...
Use pread sys call instead of mmap and munmap
to get telemetry info.
Related-To: LOCI-2634
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2021-12-06 17:44:29 +01:00
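A minimal sketch of reading one value at a known offset with the POSIX pread() call, which avoids the mmap()/munmap() pair. `readValueAtOffset` is a hypothetical helper; the path, the offset, and the 32-bit value width are assumptions for illustration and do not reflect the actual telemetry layout.

```cpp
#include <cstdint>
#include <fcntl.h>
#include <string>
#include <sys/types.h>
#include <unistd.h>

// Reads a 32-bit value located `offset` bytes into the file at `path`.
// Returns false if the file cannot be opened or fewer bytes are available.
bool readValueAtOffset(const std::string &path, off_t offset, uint32_t &value) {
    int fd = open(path.c_str(), O_RDONLY);
    if (fd < 0) {
        return false;
    }
    uint32_t buffer = 0;
    // pread() reads at an absolute offset and does not move the file position.
    ssize_t bytesRead = pread(fd, &buffer, sizeof(buffer), offset);
    close(fd);
    if (bytesRead != static_cast<ssize_t>(sizeof(buffer))) {
        return false;
    }
    value = buffer;
    return true;
}
```

A single pread() call per value needs no mapping to set up or tear down, and the offset is passed explicitly, so no separate seek is required.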
Pichika Uday Kiran
7764924387
sysman: Avoid creating the IGSC library handle in ULTs
...
- Contains the changes to avoid invoking IGSC library
during ULT execution.
Related-To: LOCI-2719
Signed-off-by: Pichika Uday Kiran <pichika.uday.kiran@intel.com >
2021-12-02 19:30:35 +01:00
Mayank Raghuwanshi
2ec2d514ec
Update create Handle mechanism for sysman RAS
...
Use set instead of vector to get the supported error types.
Using vector may cause duplication of error types when querying
supported error types from different interfaces, which in turn
may cause duplication of handles.
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com >
2021-12-02 12:39:30 +01:00
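A minimal sketch of why a set avoids duplicate handles, assuming several interfaces each report the error types they support. `RasErrorType` and `collectSupportedTypes` are hypothetical names for illustration, not the sysman RAS types.

```cpp
#include <set>
#include <vector>

// Hypothetical error categories; the real enumeration may differ.
enum class RasErrorType { Correctable, Uncorrectable };

// Merges the error types reported by each interface. A std::set keeps one
// entry per type even if several interfaces report the same type.
std::set<RasErrorType> collectSupportedTypes(
    const std::vector<std::vector<RasErrorType>> &typesPerInterface) {
    std::set<RasErrorType> supported;
    for (const auto &types : typesPerInterface) {
        supported.insert(types.begin(), types.end());
    }
    return supported;
}
```

Creating one handle per element of the returned set then yields exactly one handle per error type, whereas appending to a vector would have produced one handle per report.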
Vilvaraj, T J Vivek
0d86842780
Sysman: add Cold Reset to diagnostics API
...
the diagnostics API expects the device to be correctly reset based on
the type of diagnostics result. Cold reset is expected when there is
some repair scheduled on the Device.
Relates-to:LOCI-2508
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com >
2021-11-30 20:12:34 +01:00
Bellekallu Rajkiran
ede0123561
Update sysfs path for setting standby mode
...
Related-To: LOCI-2734
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2021-11-29 23:54:06 +01:00
Szymon Morek
12777bd758
Move MemoryInfoImpl logic to MemoryInfo
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2021-11-29 13:13:11 +01:00
Daniel Enriquez
0ce5c6c9c9
Windows Sysman: Updating VRAM memory Type.
...
Adding support for the complete range of memory types.
Signed-off-by: Daniel Enriquez <daniel.enriquez.montanez@intel.com >
2021-11-27 11:51:44 +01:00
Vilvaraj, T J Vivek
35607e7830
sysman: add warm reset capability to diagnostics.
...
Relates-to:LOCI-2507
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com >
2021-11-25 21:00:19 +01:00
Daniel Enriquez
f131b75d39
Events Windows:Fix corner case for the exit handle.
...
Corner case where the signal state is not restored after registering events.
Signed-off-by: Daniel Enriquez <daniel.enriquez.montanez@intel.com >
2021-11-22 15:19:26 +01:00
Vilvaraj, T J Vivek
b91cec5655
sysman: mock firmware utility in sysman
...
firmware utility needs to be mocked to prevent file access while
executing ULTs
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com >
2021-11-17 07:57:46 +01:00
T J Vivek Vilvaraj
e1a1e96110
sysman:close diagnostics handles before reset
...
Relates-to:LOCI-2650
Signed-off-by: T J Vivek Vilvaraj <t.j.vivek.vilvaraj@intel.com >
2021-11-15 21:30:13 +01:00
Jitendra Sharma
1186c1aae3
zesSysmanDeviceReset: Reinitialize device after device reset
...
Before performing a gpu device reset, all level zero resources
and gpu device specific resources have to be cleaned up first. The
state of the gpu device is lost after a device reset.
Hence, after performing a gpu device reset, the level zero device has
to be reinitialized by querying the gpu device again.
This change reinitializes the level zero resources
after a gpu device reset, so that the user can continue using level zero
devices after the reset.
Related-To: LOCI-2627
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com >
2021-11-07 23:43:48 +01:00
Daniel Enriquez
dbe0ba2a0f
Sysman for Windows: Multiple fixes and optimizations
...
Signed-off-by: Daniel Enriquez <daniel.enriquez.montanez@intel.com >
2021-11-04 20:11:39 +01:00