Commit Graph

407 Commits

Author SHA1 Message Date
Mateusz Hoppe bcbe3278a9 Metrics: Fix potential divide by zero
- return error when frequency is zero

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-04-25 14:49:08 +02:00
Compute-Runtime-Validation b91e630bb5 Revert "Update zesMemoryGetBandWidth for PVC steppings"
This reverts commit 81544445aa.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-04-22 10:31:59 +02:00
Mayank Raghuwanshi c637903132 Modify getSupportedRasErrorTypes function for gt Ras errors
Related-To: LOCI-2934

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-04-22 08:26:07 +02:00
Mayank Raghuwanshi 81544445aa Update zesMemoryGetBandWidth for PVC steppings
Related-To: LOCI-2999
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-04-22 07:59:11 +02:00
davidoli 5dfd68a729 Add Ip sampling metrics calculate values Support
Related-To: LOCI-2757

Signed-off-by: davidoli <david.olien@intel.com>
2022-04-21 20:02:40 +02:00
Kamil Kopryk 10be59cb15 Improve isIpSamplingSupported helper
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-6738
2022-04-20 17:02:20 +02:00
Bellekallu Rajkiran 6f2efa4d54 Update fabric id based on new sysfs entry
Added new iaf directory path to query
fabric id.

Related-To: LOCI-3034

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-04-20 09:05:41 +02:00
Bellekallu Rajkiran f7e04b32aa Update throttle reason status sysfs names
Related-To: LOCI-1843

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-04-14 08:45:17 +02:00
Joshua Santosh Ranjan 0441295e21 Metrics: Move Workload Partition Setting to Metric Group Enumeration
This patch fixes an issue about incorrect setting
of WorkloadPartionEnabled flag to ClientOptions.

Related-To: LOCI-3021

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-04-14 08:36:37 +02:00
Bellekallu Rajkiran 51988148de Sysman: Remove redundant files
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-04-11 18:32:57 +02:00
Mayank Raghuwanshi 5a804841cb Add pmt xml offsets for ATS-M
Related-To: LOCI-2960
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-04-11 07:29:44 +02:00
Mateusz Hoppe 933fbd4f63 Fix scratch offset calculation in DebugSession
- account for ThreadEuRatio when calculating per-thread
scratch offset

Related-To: NEO-6677

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-04-05 10:33:49 +02:00
Vilvaraj, T J Vivek 47f7b4f509 sysman: clean up code duplication for reset
warm and cold reset are common functionality,
the code is being moved to the common sysman implementation
from diagnostics specific files.

Related-To: LOCI-1908
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2022-03-31 14:10:39 +02:00
Bellekallu Rajkiran cdf3d00919 Update throttle reason status sysfs name
Related-To: LOCI-1843

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-29 11:02:47 +02:00
Bellekallu Rajkiran cf9a5ed7d7 Add prelim support for ras diagnostics and firmware
Related-To: LOCI-2864

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-24 06:58:25 +01:00
Joshua Santosh Ranjan 6e14df9b84 Metrics Ip sampling Avoid retries when starting measurement
This patch avoids retrying if ioctl fails.

Related-To: LOCI-3010

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-03-24 02:24:43 +01:00
Bellekallu Rajkiran 061af9c284 Sysman : Add check on engine handle creation
Add check whether init succeeded on handle creation.

Related-To: LOCI-3005

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-16 18:08:42 +01:00
Mayank Raghuwanshi fac75222a3 Move Pci bdf parsing to os specific layer
Related-To: LOCI-2879

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-03-15 08:58:30 +01:00
Bellekallu Rajkiran 604f2c625e Sysman: Utilize Ioctl helpers for memory module
Memory module translates memory regions without any
runtime query on underlying hardware/kernel. Use
ioctl helpers for memory region translation.

Related-To: LOCI-3006

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-14 16:46:29 +01:00
Filip Hazubski b79d9a8e10 Correct structs to explicitly initialize members
Affected structs are DebugAreaHeader, Ras and APITracerImp.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-14 15:40:28 +01:00
Joshua Santosh Ranjan c7ce397b17 Metrics IP Sampling Add Support for Streamer APIs
This Patch adds support for collecting IP Metrics using
StreamerOpen, StreamerClose and StreamerReadData

Related-To: LOCI-2755
Related-To: LOCI-2756

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-03-11 03:03:22 +01:00
Compute-Runtime-Validation 1a823356a3 Revert "Add prelim support for ras diagnostics and firmware"
This reverts commit 5a2145ad8d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-06 11:31:15 +01:00
Bellekallu Rajkiran 7ec8d8ef91 Add prelim support for frequency events and fabric port
Related-To: LOCI-2864

Source of prelim headers:
https://github.com/intel-gpu/drm-uapi-helper/tree/master/i915-shared-headers

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-06 02:47:20 +01:00
Bellekallu Rajkiran 01db5ef22a Add prelim support for memory, engine and scheduler
Related-To: LOCI-2864

Source of prelim headers:
https://github.com/intel-gpu/drm-uapi-helper/tree/master/i915-shared-headers

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-04 19:56:30 +01:00
Bellekallu Rajkiran 65fd55a893 Add prelim performance factor support
Related-To: LOCI-2864

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-04 08:44:10 +01:00
Joshua Santosh Ranjan 41988fc429 Metrics IP Sampling Enumeration Support
This patch adds enumeration of metric group for
IP sampling.

Related-To: LOCI-2754


Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-03-04 00:22:18 +01:00
Bellekallu Rajkiran 5a2145ad8d Add prelim support for ras diagnostics and firmware
Related-To: LOCI-2864

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-03 18:51:21 +01:00
Bellekallu Rajkiran 922a224cc9 Add prelim support for temperature, power and global operations
Related-To: LOCI-2864

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-03 18:11:34 +01:00
Mateusz Hoppe 7a2c5e28c1 Add getLastCounter() to EuThread
Related-To: NEO-6447

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-02-25 17:14:01 +01:00
Bellekallu Rajkiran b6d3b4cca6 Sysman: Fix incorrect frequency request value
Sysman queries frequency request value from an incorrect
sysFs node which results in incorrect frequency request
value.
Modify sysFs node to query from correct node.

Related-To: LOCI-2887

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-02-24 05:15:18 +01:00
Joshua Santosh Ranjan d24c6cedfb Metrics Ip Sampling Fix Inclusions
This patch fixes isolation build issues
due to inclusions.

Related-To: LOCI-2707

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-23 10:02:21 +01:00
Joshua Santosh Ranjan 10f98b45db Metrics Add Os specific implementation Structure for IP Sampling
This patch adds OS specific implementation for IP Sampling.
Implementation for linux is provided as part of this patch.

Related-To: LOCI-2787

--- master-files
level_zero/tools/source/metrics/linux/os_metric_ip_sampling_imp_linux.cpp
level_zero/tools/source/metrics/os_metric_ip_sampling.h
level_zero/tools/source/metrics/windows/os_metric_ip_sampling_imp_windows.cpp
level_zero/tools/test/unit_tests/sources/metrics/linux/test_metric_ip_sampling_linux_prelim.cpp
level_zero/tools/test/unit_tests/sources/metrics/linux/test_metric_ip_sampling_linux_upstream.cpp
level_zero/tools/test/unit_tests/sources/metrics/windows/test_metric_ip_sampling_windows.cpp
--- master-files

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-18 23:15:44 +01:00
T.J.Vivek Vilvaraj 1e6a38035e sysman: serialize access to libigsc.
In a multi thread environment the access to the external
library needs to be synchronized.

Resolves: LOCI-2871, LOCI-2873

Signed-off-by: T.J.Vivek Vilvaraj <t.j.vivek.vilvaraj@intel.com>
2022-02-15 08:08:44 +01:00
Joshua Santosh Ranjan cec0ea2809 Metrics Rename OA specific files
Rename OA specific files with _oa so that
implementation of other metric sources
could be added seamlessly.

Related-To: LOCI-2945

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-14 18:35:19 +01:00
Joshua Santosh Ranjan 596fe02dd3 Metrics Refactor Metric Streamer
This patch moves OA specific Metric Streamer implementation
to OA specific classes.

Related-To: LOCI-2905

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-08 22:42:05 +01:00
Joshua Santosh Ranjan 82ad3d61be Metrics Refactor MetricQuery and Metric Query Pool
This patch is to refactor MetricQuery and MetricQueryPool
so that Stall sampling metric could be integrated seamlessly

Related-To: LOCI-2904

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-07 19:34:22 +01:00
Joshua Santosh Ranjan 93e117fa9e Metrics Refactor Metric Group
Refactor Metric Group Implementation to move OA specific
implementation to OA specific classes.
This is so that stall sampling specific Metric Group
implementation could be done seamlessly.

Related-To: LOCI-2753

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-04 18:28:39 +01:00
Bartosz Dunajski 4b0d986876 Move AllocationType enum out of GraphicsAllocation class
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-02-04 17:49:09 +01:00
Joshua Santosh Ranjan f1c50a8c89 Metric Refactor Metric Context
Rename Metric Context and move OA specific functions
and members from Metric Context to OA specific class(MetricSource).
This refactoring is done so that additional Metric Source
like Stall Sampling could be Seamlessly implemented.

Related-To: LOCI-2753


Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-03 21:34:14 +01:00
Joshua Santosh Ranjan 78fa21f31a Metrics Refactor Rename Metric Source Specific Classes
Renaming Oa Specific classes.

Related-To: LOCI-2753

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-01 21:42:57 +01:00
Bellekallu Rajkiran 0bd60e524a Initialize telemetry device entry variable
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-01-31 20:33:53 +01:00
Mayank Raghuwanshi 90963b95ad Update mechanism for getting subdeviceId and onSubdevice for memory
Earlier sysman memory module was using logical subdeviceId
exposed by core to retrieve memory telmetry data, replace
the logical subdeviceId with actual subdeviceId for collecting
telemetry data.

Related-To: LOCI-2828

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-01-28 07:52:47 +01:00
Ayush Pandey 715b9d31d2 Find sscanf alternative.
Used strtol() to write sscanfUtil to extraxt info of BDF pcipath.

Related-To: LOCI-1002

Signed-off-by: Ayush Pandey <ayush.pandey@intel.com>
2022-01-21 09:02:48 +01:00
Robert Krzemien c724f35abb Fixed offsets in calculation for multidevices. Fixed metric types.
Related-To: LOCI-2870
Signed-off-by: Robert Krzemien <robert.krzemien@intel.com>
2022-01-19 17:10:09 +01:00
Szymon Morek 26a24e8fde Query engine info with distances
If prelim kernel is being used, query distances
and set correctly number of available engines

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2022-01-10 13:30:26 +01:00
Szymon Morek 6258575e5e Use queryEngineInfo with prelim ioctls
If prelim kernel is being used, query engines
with prelim ioctls.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2022-01-04 12:15:04 +01:00
Mayank Raghuwanshi 25403cf85d Add support for device level performance factor
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2021-12-28 07:13:51 +01:00
Joshua Santosh Ranjan e1ef9ac79a Revert "Metric Detect Paranoid mode and fail gracefully"
This reverts commit a720282358dff08fb36b95eaf9bf184efa315f48.

This revert is to avoid suggesting to disable paranoid mode.
This revert also avoids L0 metrics mandating paranoid mode
setting.

Related-To: LOCI-2822

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2021-12-23 19:30:47 +01:00
T J Vivek Vilvaraj 9a39cad07d sysman:add reinitialization code to diagnostics
Signed-off-by: T J Vivek Vilvaraj <t.j.vivek.vilvaraj@intel.com>
2021-12-23 18:27:06 +01:00
Vilvaraj, T J Vivek 15f102a7cb sysman:modify diagnostics tests
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2021-12-22 14:53:38 +01:00