Commit Graph

48 Commits

Author SHA1 Message Date
Bari, Pratik
b55e5ddc87 feature(sysman): Add L3 Fabric Error in Sysman Dir
- Added the Display error category in the tools code as well.
- Added the index increment logic for the tools code as well.

Related-To: NEO-10362

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2024-02-28 08:48:08 +01:00
Bari, Pratik
00d36b5cee feature(sysman): Added L3 Fabric Errors
Related-To: NEO-8560

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2024-02-06 10:18:47 +01:00
Mateusz Jablonski
368070069e refactor: remove intel_hwconfig_types file from third party
remove not needed querying of vs/hs/ds/ps/gs threads

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-27 15:32:23 +01:00
Bellekallu Rajkiran
b39aafec26 feature(sysman): Add Support for ras exp APIs
Related-To: NEO-8839, NEO-8873

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-12-22 10:09:21 +01:00
Bellekallu Rajkiran
35ead06765 fix: Fix Sysman ULT failures
- Merge Ras and globalOps prelim files with non-prelim
files.

Related-To: NEO-9521

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-12-01 15:11:22 +01:00
Mateusz Jablonski
c9664e6bad refactor: rename global debug manager to debugManager
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-30 13:00:59 +01:00
Compute-Runtime-Validation
a3daca1e70 Revert "fix: Fix Sysman ULT failures"
This reverts commit 4398e4297f.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-11-17 10:52:28 +01:00
Bellekallu Rajkiran
4398e4297f fix: Fix Sysman ULT failures
- Merge Ras and globalOps prelim files with non-prelim
files.

Related-To: NEO-9521

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-11-14 11:28:56 +01:00
Mayank Raghuwanshi
a69110a7ec feature: Add support for RAS mdfi errors
Related-To: LOCI-4479

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-06-13 10:14:36 +02:00
Devarinti, Puneeth Kumar Reddy
c03867b55c feature: Add debug logs for RAS module
Related-To: LOCI-3880

Signed-off-by: Devarinti, Puneeth Kumar Reddy <puneeth.kumar.reddy.devarinti@intel.com>
2023-05-09 08:12:06 +02:00
Mayank Raghuwanshi
9cc5763800 fix: Revert spec 1.5 RAS changes from Sysman
Related-To: LOCI-4351

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-04-27 05:29:33 +02:00
Mayank Raghuwanshi
3816b85fa0 Add check for memory type before calculating ras hbm errors
Related-To: LOCI-3500

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-03-31 13:47:41 +02:00
Mayank Raghuwanshi
065232eac7 Add support for ras l3 fabric errors
Related-To: LOCI-3966

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-03-30 12:47:45 +02:00
Mateusz Jablonski
659cacf2c9 refactor l0 cmake: reduce include directories
Related-To: NEO-7507
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-17 13:41:55 +01:00
Mateusz Jablonski
cb7437b6b7 refactor l0 tools: cleanup cmake files
append sources in current directory

Related-To: NEO-7507
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-16 14:05:14 +01:00
Mayank Raghuwanshi
29ed6ea077 Add support l3_bank and subslice Ras errors
Related-To: LOCI-3662

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-02-22 09:14:15 +01:00
Mayank Raghuwanshi
07d3353b1f Add support for sysman zesFabricPortGetFabricErrorCounters API
Related-To: LOCI-3398

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-02-13 06:50:23 +01:00
Mayank Raghuwanshi
5a833e2c08 Add support for RAS CSC HW errors
Related-To: LOCI-3699

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-02-03 18:36:12 +01:00
Mayank Raghuwanshi
9968857c29 Change category for some sysman ras errors
Related-To: LOCI-3648

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-12-23 18:43:41 +01:00
Joshua Santosh Ranjan
522076cf82 Avoid adding subdevice flag if ReturnSubDevicesAsApiDevices is set
Related-To: LOCI-3656

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-12-19 05:15:51 +01:00
Joshua Santosh Ranjan
7c050291bf Fix fabric ras errors accumulated to all devices
This patch fixes the issue that fabric ras errors
from all devices are reported for all devices.

Related-To: LOCI-3548

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-11-16 12:03:50 +01:00
Mayank Raghuwanshi
ffcca3ba53 Use physical subdeviceId for sysman ras, freq and standby module
Related-To: LOCI-2925, LOCI-2926, LOCI-3236
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-11-14 14:10:23 +01:00
Joshua Santosh Ranjan
436ec1234b Sysman Add support for auxiliary bus for fabric Ras
Related-To: LOCI-3531

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-10-28 18:18:33 +02:00
Kamil Kopryk
582ed0565b Use memcpy_s instead of memcpy
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-08-01 12:43:29 +02:00
Mateusz Hoppe
5956aea18d Limit header includes from level_zero device.h
- remove including debugger_l0.h from device.h
- add getL0Debugger() to shared NEO Device

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-07-06 16:41:17 +02:00
Mayank Raghuwanshi
281c98dcf9 Add firmware util interface for sysman windows
Related-To: LOCI-3132

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-06-24 08:42:48 +02:00
Artur Harasimiuk
e9be9b64c6 clang-tidy configuration cleanup
Define single .clang-tidy configuration with all used checks and use
NOLINT to selectively silence tool. That way cleanup should be easier.
third_party/ has its own configuration that disables clang-tidy for this
folder.

Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2022-05-11 14:02:04 +02:00
Mayank Raghuwanshi
c637903132 Modify getSupportedRasErrorTypes function for gt Ras errors
Related-To: LOCI-2934

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-04-22 08:26:07 +02:00
Bellekallu Rajkiran
cf9a5ed7d7 Add prelim support for ras diagnostics and firmware
Related-To: LOCI-2864

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-24 06:58:25 +01:00
Compute-Runtime-Validation
1a823356a3 Revert "Add prelim support for ras diagnostics and firmware"
This reverts commit 5a2145ad8d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-06 11:31:15 +01:00
Bellekallu Rajkiran
5a2145ad8d Add prelim support for ras diagnostics and firmware
Related-To: LOCI-2864

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-03 18:51:21 +01:00
Mayank Raghuwanshi
2ec2d514ec Update create Handle mechanism for sysman RAS
Use set instead of vector to get the supported error types,
using vector may cause duplication of error types when querying
supported error types from different interfaces which in turn
may cause duplication of handles.

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2021-12-02 12:39:30 +01:00
Kamil Kopryk
9ccf43e441 Correct branch_dir_suffix in cmake
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-6245
2021-09-14 16:00:20 +02:00
lgotszal
5c43c6fd94 Update MIT copyright headers to always use SPDX
Related-to: IGC-4296

Signed-off-by: lgotszal <lukasz.gotszald@intel.com>
2021-06-23 14:00:21 +02:00
Mayank Raghuwanshi
0f973f146e Implement zesRasGetConfig and zesRasSetConfig
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2021-03-02 16:07:01 +01:00
Mayank Raghuwanshi
5cd5705239 Implement clear option for zesRasGetState
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2021-01-21 15:00:13 +01:00
mraghuwa
978003e96e Add subdevice support for RAS module
Change-Id: Iced5aeed86d6b19a4710992155257e420ae1296f
Signed-off-by: mraghuwa <mayank.raghuwanshi@intel.com>
2020-10-27 17:54:29 +01:00
mraghuwa
a8a013b0c3 Implement zesRasGetState to retrieve cache errors
Change-Id: I9fbba505db6551f510cb20ea71604af53db61960
Signed-off-by: mraghuwa <mayank.raghuwanshi@intel.com>
2020-10-20 15:45:36 +02:00
mraghuwa
2643346b48 Update Sysman RAS Module
Change-Id: I2b99dae4336811ea4b539da48c1434657a9cf62a
Signed-off-by: mraghuwa <mayank.raghuwanshi@intel.com>
2020-10-09 08:23:19 +02:00
Pawel Cieslak
fb821f21f5 Cmake format script
Related-To: NEO-1157

Change-Id: Ie1b907e838cfb9ad0d75cc8971d415f7c77103c9
Signed-off-by: Pawel Cieslak <pawel.cieslak@intel.com>
2020-08-19 16:36:30 +02:00
mraghuwa
220c575850 Update Ras APIs as per latest spec
Change-Id: I29b77eee0832fcca6d989f9ef41b01b17232a91e
Signed-off-by: mraghuwa <mayank.raghuwanshi@intel.com>
2020-08-06 08:31:03 +02:00
Jaime Arteaga
902fc2f6c4 level-zero v1.0 (2/N)
Change-Id: I1419231a721fab210e166d26a264cae04d661dcd
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
Signed-off-by: macabral <matias.a.cabral@intel.com>
Signed-off-by: davidoli <david.olien@intel.com>
Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>
Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
Signed-off-by: Latif, Raiyan <raiyan.latif@intel.com>
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2020-08-03 13:11:13 +02:00
Bill Jordan
e8bd440773 Don't allow copy or moving Sysman related objects.
Change-Id: I70dd97bffa1c4d08f05eb796c6d6a2eb66f06f4b
Signed-off-by: Bill Jordan <bill.jordan@intel.com>
2020-07-10 21:05:15 +02:00
Vilvaraj, T J Vivek
3c50e1ede6 fix uninitialized class member in Ctor.
Change-Id: Idc3e8a2ddccaf9c94639a3f499824e86de830fd4
2020-07-07 21:48:18 +02:00
T.J. Vivek Vilvaraj
96a7b1e066 add rules to install RAS udev rules
- create rules to install Udev rules in configurable location
- create files relating to RAS counters

Change-Id: Iebd57ba2dd09494ea4586b305cd56c86a71fb8b0
2020-07-02 10:25:21 +02:00
Vilvaraj, T J Vivek
0c9c55cd17 add counter support for RAS.
- added dual handle support for RAS Correctable and Uncorrectable Errors.
- added reset counter for RAS.
- added Os Specific ULT for RAS

Change-Id: Ia10115bf6720ab211f549571e810ec0d6c0801ec
2020-06-25 08:48:11 +02:00
Vilvaraj, T J Vivek
5c029b4471 Implement RAS APIs
- Implemented zetSysmanGetProperties
- Implemented zetSysmanGetState

Change-Id: Id40a1cb036693156c31c6e9ee78c3113ef35e5c2
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2020-04-29 18:03:05 +02:00
Vilvaraj, T J Vivek
324b1f5c60 RAS APIs boilerplate for Level Zero Sysman.
-Implement zetSysmanRasGet API

Change-Id: Ib06e4a5d087a1e684c918413c3d98da2082e4f91
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2020-04-21 20:04:53 +02:00