Commit Graph

29 Commits

Author SHA1 Message Date
c9664e6bad refactor: rename global debug manager to debugManager
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-30 13:00:59 +01:00
943ffb31ef feature(sysman): Add support for device reset extension
Related-To: NEO-8654

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-10-31 23:39:18 +01:00
6994106c1f fix: Read pci config space on demand.
Storing config space during sysman init might result in
providing inappropriate properties as config space can be
modified.

Read config space whenever properties are queried.

Related-To: LOCI-4513

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-07-01 00:37:38 +02:00
d3a31957db feature(sysman): Add delay for HBM diagnostics
Add debug variable to set sleep duration for HBM
IFR to complete

Related-To: LOCI-4298

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-05-03 20:27:21 +02:00
659cacf2c9 refactor l0 cmake: reduce include directories
Related-To: NEO-7507
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-17 13:41:55 +01:00
cb7437b6b7 refactor l0 tools: cleanup cmake files
append sources in current directory

Related-To: NEO-7507
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-16 14:05:14 +01:00
5fbb10df2d Debug: Add debug logs for diagnostics module
Related-To: LOCI-3882

Signed-off-by: Devarinti, Puneeth Kumar Reddy <puneeth.kumar.reddy.devarinti@intel.com>
2023-02-27 04:40:11 +01:00
43b790957d style: format code using clang-format 15.0.6
Related-To: NEO-7500
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-01-05 10:33:47 +01:00
8d39ab7bcd Diagnostics: Remove duplicate code
gpuProcessCleanUp is defined in multile places,
Reuse common code for cleaning up all processes.

Related-To: LOCI-3746

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-12-20 09:58:29 +01:00
9ad3f6190f do not sleep in ULTs
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2022-10-21 19:37:52 +02:00
de06d91db8 Sysman: Fix few memory leaks
Invoking FwDeviceInit from several modules without closing
igsc device result in memory leak.

Add support to invoke FwDeviceInit only during creation of fw util
interface.

Related-To: LOCI-3204

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-09-14 16:16:30 +02:00
ceff16084d Fix temperature handle enumeration issue on single tile devices
Add platform check to read pmt offsets corresponding to
tile instead of root node for single tile devices.

Related-To: LOCI-2575

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-09-06 10:57:35 +02:00
281c98dcf9 Add firmware util interface for sysman windows
Related-To: LOCI-3132

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-06-24 08:42:48 +02:00
c0121eb824 SysMan: fix issues in execution environment restoration.
the scpoe of the restorer is till the LinuxSysmanImp is deleted.
Ideally the scope of restorer needs to be function level.

Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2022-06-15 11:38:23 +02:00
973bcb9dbc Sysman: cleanup execution env referrals
convert the present system of calling Inc/Dec of
execution environment to a more elegant solution

Resolves: LOCI-3165

Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2022-06-06 18:40:23 +02:00
da52303e6e SysMan: Diagnostics warm reset fix.
The following modifications were done as part of the fix
for warm reset.
1. Release sysman resources before quiscenting the GPU.
2. Add additional checks to confirm quiscenting of the GPU
before launching the diagnostics tests.
3. Fixed warm reset with wait time to allow the changes to be
propagated to the entire GPU PCI tree.
4. Modified the ULT's completely to avoid the usage of MOCKS.
5. Made Diagnostics handle creation per-device from per-SubDevice.

Related-To: LOCI-3053

Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2022-06-06 10:09:47 +02:00
47f7b4f509 sysman: clean up code duplication for reset
warm and cold reset are common functionality,
the code is being moved to the common sysman implementation
from diagnostics specific files.

Related-To: LOCI-1908
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2022-03-31 14:10:39 +02:00
cf9a5ed7d7 Add prelim support for ras diagnostics and firmware
Related-To: LOCI-2864

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-24 06:58:25 +01:00
1a823356a3 Revert "Add prelim support for ras diagnostics and firmware"
This reverts commit 5a2145ad8d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-06 11:31:15 +01:00
5a2145ad8d Add prelim support for ras diagnostics and firmware
Related-To: LOCI-2864

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-03 18:51:21 +01:00
9a39cad07d sysman:add reinitialization code to diagnostics
Signed-off-by: T J Vivek Vilvaraj <t.j.vivek.vilvaraj@intel.com>
2021-12-23 18:27:06 +01:00
b95428078e sysman: filter diagnostics related accesses
prevent diagnostics related calls on unsupported platforms


Signed-off-by: T J Vivek Vilvaraj <t.j.vivek.vilvaraj@intel.com>
2021-12-20 12:49:49 +01:00
0d86842780 Sysman: add Cold Reset to diagnostics API
the diagnostics API expects the device to be correctly reset based on
the type of diagnostics result. Cold reset is expected when there is
some repair scheduled on the Device.

Relates-to:LOCI-2508


Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2021-11-30 20:12:34 +01:00
35607e7830 sysman: add warm reset capability to diagnostics.
Relates-to:LOCI-2507

Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2021-11-25 21:00:19 +01:00
9ccf43e441 Correct branch_dir_suffix in cmake
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-6245
2021-09-14 16:00:20 +02:00
d1e8f859d4 sysman: add multi-Device Support for diagnostics
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2021-05-27 11:48:16 +02:00
3bd4bca911 Copyright header update
Dates corrected in copyright headers to reflect original publication date
(2018 for OpenCL, 2020 for Level Zero).

Signed-off-by: lgotszal <lukasz.gotszald@intel.com>
2021-05-17 20:38:19 +02:00
24a745f4bd sysman:add support for Diagnostis API.
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2021-05-04 20:37:38 +02:00
457d05420d sysman: add diagnostics boilerplate code
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2021-03-19 05:33:20 +01:00