Commit Graph

496 Commits

Author SHA1 Message Date
Vilvaraj, T J Vivek 4d5d31d80d sysman: simplify ULT for device reset
device reset is heavily dependant on gmock for ULT
reduce depencency on GMock.

Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2022-01-04 13:26:56 +01:00
Szymon Morek 6258575e5e Use queryEngineInfo with prelim ioctls
If prelim kernel is being used, query engines
with prelim ioctls.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2022-01-04 12:15:04 +01:00
Szymon Morek d9f6757378 Change return type from unique_ptr to vector
In some of the drm functions there is a pattern
to store array in unique_ptr and pass it's length
as an argument. This commit simplifies this.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2021-12-28 10:33:53 +01:00
Mayank Raghuwanshi 25403cf85d Add support for device level performance factor
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2021-12-28 07:13:51 +01:00
Joshua Santosh Ranjan e1ef9ac79a Revert "Metric Detect Paranoid mode and fail gracefully"
This reverts commit a720282358dff08fb36b95eaf9bf184efa315f48.

This revert is to avoid suggesting to disable paranoid mode.
This revert also avoids L0 metrics mandating paranoid mode
setting.

Related-To: LOCI-2822

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2021-12-23 19:30:47 +01:00
T J Vivek Vilvaraj 9a39cad07d sysman:add reinitialization code to diagnostics
Signed-off-by: T J Vivek Vilvaraj <t.j.vivek.vilvaraj@intel.com>
2021-12-23 18:27:06 +01:00
Vilvaraj, T J Vivek 15f102a7cb sysman:modify diagnostics tests
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2021-12-22 14:53:38 +01:00
Filip Hazubski eec857932a Split test file
Split test_metric_streamer_2.cpp file

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-22 14:40:43 +01:00
Szymon Morek 2647d563c7 Remove i915 structs from MemoryInfo
Use structs defined in ioctl_helper.h instead of
i915 dependent ones to avoid conflicts between
different kernels

Related-To: NEO-6149

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2021-12-22 12:46:59 +01:00
T J Vivek Vilvaraj b95428078e sysman: filter diagnostics related accesses
prevent diagnostics related calls on unsupported platforms


Signed-off-by: T J Vivek Vilvaraj <t.j.vivek.vilvaraj@intel.com>
2021-12-20 12:49:49 +01:00
Michal Mrozek 62faecf6d5 Optimize virtual calls #2.
Optimize frequently used virtual cost.
Compiler cannot inline those which causes overhead.

Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2021-12-16 16:26:59 +01:00
Mateusz Jablonski ee18008750 Remove duplicated test.h header
use full path to include test.h
add missing headers to source list


Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-12-15 15:34:45 +01:00
Mateusz Jablonski ad184642a6 Use full path to include test.h 3/n
fix files in level_zero

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-12-15 13:42:33 +01:00
Ranjan, Joshua Santhosh 5a2a19fa1a Sysman Fix FirmwareUtil Cleanup
Fixed by avoiding library function access if library is unavailable.


Related-To: LOCI-2719

Signed-off-by: Ranjan, Joshua Santhosh <joshua.santosh.ranjan@intel.com>
2021-12-15 09:18:18 +01:00
Bellekallu Rajkiran 4ae2f6e111 Sysman: Add support for device level energy counters
Related-To: LOCI-2724

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2021-12-14 01:33:59 +01:00
Aleksei Keisel 1e2a57d533 Update MDAPI headers
Resolves: LOCI-2692
Signed-off-by: Aleksei Keisel aleksei.keisel@intel.com
2021-12-10 21:17:33 +01:00
Daniel Enriquez cf70a57efb Sysman WindowS: Fix Max Memory Bandwidth.
Signed-off-by: Daniel Enriquez <daniel.enriquez.montanez@intel.com>
2021-12-10 00:36:15 +01:00
Filip Hazubski cf4ce308d9 Rename function
Rename multiDeviceCapable to implicitScalingCapable
Rename isMultiDeviceCapable to isImplicitScalingCapable

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-09 11:14:08 +01:00
Mayank Raghuwanshi 94d09f75b7 Get RAS HBM errors count using firmware interface
-- master-commit
Add functionality to retrieve memory errors from Firmware
-- master-commit

Related-To: LOCI-2491, LOCI-2726

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2021-12-08 18:57:24 +01:00
Joshua Santosh Ranjan feae44bce8 Added Fabric RAS error support
fabric error counts are read from sysfs nodes

Related-To: LOCI-2613

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2021-12-08 07:00:39 +01:00
Bellekallu Rajkiran a1121ccb6b Sysman: Replace mmap with pread
Use pread sys call instead of mmap and munmap
to get telemetry info.

Related-To: LOCI-2634


Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2021-12-06 17:44:29 +01:00
Pichika Uday Kiran 7764924387 sysman: Avoid creating the IGSC libary handle in ULTs
- Contains the changes to avoid invoking IGSC library
during ULT execution.


Related-To: LOCI-2719
Signed-off-by: Pichika Uday Kiran <pichika.uday.kiran@intel.com>
2021-12-02 19:30:35 +01:00
Mayank Raghuwanshi 2ec2d514ec Update create Handle mechanism for sysman RAS
Use set instead of vector to get the supported error types,
using vector may cause duplication of error types when quering
supported error types from different interfaces which in turn
may cause duplication of handles.

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2021-12-02 12:39:30 +01:00
Vilvaraj, T J Vivek 0d86842780 Sysman: add Cold Reset to diagnostics API
the diagnostics API expects the device to be correctly reset based on
the type of diagnostics result. Cold reset is expected when there is
some repair scheduled on the Device.

Relates-to:LOCI-2508


Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2021-11-30 20:12:34 +01:00
Mateusz Jablonski 8128a5593f Add missing include to sysman memory test
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2021-11-30 10:39:46 +01:00
Bellekallu Rajkiran ede0123561 Update sysfs path for setting standby mode
Related-To: LOCI-2734

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2021-11-29 23:54:06 +01:00
Szymon Morek 12777bd758 Move MemoryInfoImpl logic to MemoryInfo
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2021-11-29 13:13:11 +01:00
Daniel Enriquez 0ce5c6c9c9 Windows Sysman: Updating VRAM memory Type.
Adding support for the complete range of memory types.

Signed-off-by: Daniel Enriquez <daniel.enriquez.montanez@intel.com>
2021-11-27 11:51:44 +01:00
Vilvaraj, T J Vivek 35607e7830 sysman: add warm reset capability to diagnostics.
Relates-to:LOCI-2507

Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2021-11-25 21:00:19 +01:00
Joshua Santosh Ranjan ed6b30af12 Metrics Library Release For Query Case
Release Metrics Library after Query related objects are released

Related-To: LOCI-2656

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2021-11-24 19:24:31 +01:00
Daniel Enriquez f131b75d39 Events Windows:Fix corner case for the exit handle.
Corner case where the signal state is not restored after registering events.

Signed-off-by: Daniel Enriquez <daniel.enriquez.montanez@intel.com>
2021-11-22 15:19:26 +01:00
Ayush Pandey 8130727232 Reduce the event listen timeout in sysman ULTs
Long timeout for event listen API is resulting in higher ULT execution time
for sysman ULTs. Hence reducing this timeout.

Related-To: NEO-6412

Signed-off-by: Ayush Pandey <ayush.pandey@intel.com>
2021-11-17 14:06:43 +01:00
Joshua Santosh Ranjan d15eed035b Metrics Restore addressOffsetCCSOffset after query programming
Related-To: LOCI-2711

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2021-11-17 11:57:51 +01:00
Vilvaraj, T J Vivek b91cec5655 sysman: mock firmware utility in sysman
firmware utility needs to be mocked to prevent file access while
executing ULT's


Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2021-11-17 07:57:46 +01:00
Mateusz Hoppe 35795357e9 DebugSession - add printBitmask()
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2021-11-15 21:40:27 +01:00
T J Vivek Vilvaraj e1a1e96110 sysman:close diagnostics handles before reset
Relates-to:LOCI-2650

Signed-off-by: T J Vivek Vilvaraj <t.j.vivek.vilvaraj@intel.com>
2021-11-15 21:30:13 +01:00
Zbigniew Zdanowicz f90932cca7 Use references instead copy ctors
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-11-15 14:42:33 +01:00
Artur Harasimiuk ea373d2664 cmake: cleanup variable names
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2021-11-10 21:31:17 +01:00
Artur Harasimiuk 39cb34047b temporarily skip sysman tests
Related-To: NEO-6412

Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2021-11-09 20:28:25 +01:00
Artur Harasimiuk 9d3ea387aa refactor test_loader to not include sysman
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2021-11-08 17:23:32 +01:00
Jitendra Sharma 1186c1aae3 zesSysmanDeviceReset: Reinitialize device after device reset
Before performing gpu device reset, first all level zero resources
and gpu device specific resources have to be cleaned up. Also as
after device reset, state of gpu device would be lost.
Hence after performing gpu device reset, level zero device have
to be reinitialized by querying gpu device again.
This change is aimed at reinitializing the level zero resources
after gpu device reset, so that user could continue using level zero
devices after device reset.

Related-To: LOCI-2627

Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2021-11-07 23:43:48 +01:00
Zbigniew Zdanowicz 3d3881e7ad Remove not needed virtual function from tests
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-11-05 12:04:14 +01:00
Mateusz Hoppe ee418efadf Per-thread scratch offset calculation
Related-To: NEO-6404

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2021-11-05 11:03:17 +01:00
Daniel Enriquez dbe0ba2a0f Sysman for Windows: Multiple fixes and optimizations
Signed-off-by: Daniel Enriquez <daniel.enriquez.montanez@intel.com>
2021-11-04 20:11:39 +01:00
Igor Venevtsev a3c3fe18ce Cleanup IGC includes
- Remove IGC_COMMON_PARENT_DIR WA

Signed-off-by: Igor Venevtsev <igor.venevtsev@intel.com>
2021-11-04 14:44:59 +01:00
Artur Harasimiuk 3b47a9766a cmake: L0 black_box_tests refactor
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2021-11-04 10:26:13 +01:00
Pichika Uday Kiran 99cbd1bb92 firmware: Read new firmware version from sysfs
- Read the newly added firmware version from sysfs node


Related-To: LOCI-2557
Signed-off-by: Pichika Uday Kiran <pichika.uday.kiran@intel.com>
2021-11-03 14:03:12 +01:00
Mateusz Hoppe 4d6a2d0a57 Enhance logs in EuThread
- do not overwrite memoryHandle in resumeThread() when thread is
not stopped
- add SIP Commands

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2021-11-03 11:04:14 +01:00
Pichika Uday Kiran d9f47d6d7d firmware: Add canControl property to firmware properties
Related-To: LOCI-2679
Signed-off-by: Pichika Uday Kiran <pichika.uday.kiran@intel.com>
2021-11-03 10:04:25 +01:00
Pichika Uday Kiran 52cf6bf383 firmware: Enhance the firmware utility module
- There are functions exposed through Firmware Util interface to get
the firmware version and perform firmware flash based on firmware type.
Instead this functionality is being handled with generic functions. Hence
this commit removes the redundant code.

Related-To: LOCI-2568
Signed-off-by: Pichika Uday Kiran <pichika.uday.kiran@intel.com>
2021-11-02 11:42:18 +01:00