Commit Graph

427 Commits

Author SHA1 Message Date
davidoli
5dfd68a729 Add Ip sampling metrics calculate values Support
Related-To: LOCI-2757

Signed-off-by: davidoli <david.olien@intel.com>
2022-04-21 20:02:40 +02:00
Bellekallu Rajkiran
f7e04b32aa Update throttle reason status sysfs names
Related-To: LOCI-1843

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-04-14 08:45:17 +02:00
Joshua Santosh Ranjan
0441295e21 Metrics: Move Workload Partition Setting to Metric Group Enumeration
This patch fixes an issue about incorrect setting
of WorkloadPartionEnabled flag to ClientOptions.

Related-To: LOCI-3021

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-04-14 08:36:37 +02:00
Brandon Yates
d70b1a2e2a Filter L0 Debugger support by platfom
Related-To: NEO-6678
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2022-04-13 13:03:40 +02:00
Joshua Santosh Ranjan
16af941f03 Metrics IpSampling Add Blackbox Test
This patch adds blackbox test for IpSampling
feature.
This patch also adds support to use custom
workloads.

Related-To: LOCI-3009

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-04-08 07:56:54 +02:00
Mateusz Hoppe
933fbd4f63 Fix scratch offset calculation in DebugSession
- account for ThreadEuRatio when calculating per-thread
scratch offset

Related-To: NEO-6677

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-04-05 10:33:49 +02:00
Vilvaraj, T J Vivek
8f88d375ab Sysman: remove gmocks from standby module
Related-To: LOCI-2030
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2022-04-04 10:41:12 +02:00
Vilvaraj, T J Vivek
47f7b4f509 sysman: clean up code duplication for reset
warm and cold reset are common functionality,
the code is being moved to the common sysman implementation
from diagnostics specific files.

Related-To: LOCI-1908
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2022-03-31 14:10:39 +02:00
Bellekallu Rajkiran
cf9a5ed7d7 Add prelim support for ras diagnostics and firmware
Related-To: LOCI-2864

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-24 06:58:25 +01:00
Joshua Santosh Ranjan
6e14df9b84 Metrics Ip sampling Avoid retries when starting measurement
This patch avoids retrying if ioctl fails.

Related-To: LOCI-3010

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-03-24 02:24:43 +01:00
Joshua Santosh Ranjan
05a150f49f L0 Core Add Support For pci_speed_ext
This patch adds support for reading PCI bandwidth, generation
and linkwidth information from sysfs nodes for the linux
platform.

Related-To: LOCI-2969

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-03-16 22:47:59 +01:00
Mayank Raghuwanshi
fac75222a3 Move Pci bdf parsing to os specific layer
Related-To: LOCI-2879

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-03-15 08:58:30 +01:00
Daria Hinz
452050ae40 Refactoring the use of PVC device ids
Replacing the old device id implementation
& clearing PVC XT temporary.

Related-To: NEO-6742
Signed-off-by: Daria Hinz <daria.hinz@intel.com>
2022-03-14 15:54:52 +01:00
Joshua Santosh Ranjan
c7ce397b17 Metrics IP Sampling Add Support for Streamer APIs
This Patch adds support for collecting IP Metrics using
StreamerOpen, StreamerClose and StreamerReadData

Related-To: LOCI-2755
Related-To: LOCI-2756

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-03-11 03:03:22 +01:00
Compute-Runtime-Validation
1a823356a3 Revert "Add prelim support for ras diagnostics and firmware"
This reverts commit 5a2145ad8d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-06 11:31:15 +01:00
Bellekallu Rajkiran
7ec8d8ef91 Add prelim support for frequency events and fabric port
Related-To: LOCI-2864

Source of prelim headers:
https://github.com/intel-gpu/drm-uapi-helper/tree/master/i915-shared-headers

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-06 02:47:20 +01:00
Bellekallu Rajkiran
01db5ef22a Add prelim support for memory, engine and scheduler
Related-To: LOCI-2864

Source of prelim headers:
https://github.com/intel-gpu/drm-uapi-helper/tree/master/i915-shared-headers

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-04 19:56:30 +01:00
Bellekallu Rajkiran
65fd55a893 Add prelim performance factor support
Related-To: LOCI-2864

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-04 08:44:10 +01:00
Joshua Santosh Ranjan
41988fc429 Metrics IP Sampling Enumeration Support
This patch adds enumeration of metric group for
IP sampling.

Related-To: LOCI-2754


Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-03-04 00:22:18 +01:00
Bellekallu Rajkiran
5a2145ad8d Add prelim support for ras diagnostics and firmware
Related-To: LOCI-2864

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-03 18:51:21 +01:00
Bellekallu Rajkiran
922a224cc9 Add prelim support for temperature, power and global operations
Related-To: LOCI-2864

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-03 18:11:34 +01:00
Bellekallu Rajkiran
f9d271f6cf Sysman: Fix Performance factor ULT
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-03-03 14:29:52 +01:00
Mateusz Hoppe
7a2c5e28c1 Add getLastCounter() to EuThread
Related-To: NEO-6447

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-02-25 17:14:01 +01:00
Bellekallu Rajkiran
b6d3b4cca6 Sysman: Fix incorrect frequency request value
Sysman queries frequency request value from an incorrect
sysFs node which results in incorrect frequency request
value.
Modify sysFs node to query from correct node.

Related-To: LOCI-2887

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-02-24 05:15:18 +01:00
Joshua Santosh Ranjan
d24c6cedfb Metrics Ip Sampling Fix Inclusions
This patch fixes isolation build issues
due to inclusions.

Related-To: LOCI-2707

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-23 10:02:21 +01:00
Joshua Santosh Ranjan
10f98b45db Metrics Add Os specific implementation Structure for IP Sampling
This patch adds OS specific implementation for IP Sampling.
Implementation for linux is provided as part of this patch.

Related-To: LOCI-2787

--- master-files
level_zero/tools/source/metrics/linux/os_metric_ip_sampling_imp_linux.cpp
level_zero/tools/source/metrics/os_metric_ip_sampling.h
level_zero/tools/source/metrics/windows/os_metric_ip_sampling_imp_windows.cpp
level_zero/tools/test/unit_tests/sources/metrics/linux/test_metric_ip_sampling_linux_prelim.cpp
level_zero/tools/test/unit_tests/sources/metrics/linux/test_metric_ip_sampling_linux_upstream.cpp
level_zero/tools/test/unit_tests/sources/metrics/windows/test_metric_ip_sampling_windows.cpp
--- master-files

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-18 23:15:44 +01:00
Joshua Santosh Ranjan
cec0ea2809 Metrics Rename OA specific files
Rename OA specific files with _oa so that
implementation of other metric sources
could be added seamlessly.

Related-To: LOCI-2945

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-14 18:35:19 +01:00
Patryk Wrobel
18cafd3a52 Implement GPU hang detection on Windows
This change uses value of cpuAddress from monitored fence
to detect GPU hang.

Related-To: NEO-5313
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
2022-02-09 17:22:52 +01:00
Joshua Santosh Ranjan
82ad3d61be Metrics Refactor MetricQuery and Metric Query Pool
This patch is to refactor MetricQuery and MetricQueryPool
so that Stall sampling metric could be integrated seamlessly

Related-To: LOCI-2904

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-07 19:34:22 +01:00
Joshua Santosh Ranjan
93e117fa9e Metrics Refactor Metric Group
Refactor Metric Group Implementation to move OA specific
implementation to OA specific classes.
This is so that stall sampling specific Metric Group
implementation could be done seamlessly.

Related-To: LOCI-2753

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-04 18:28:39 +01:00
Daniel Chabrowski
f2c4231a27 Extend DRM mock and queryTopology tests - part 1
Signed-off-by: Daniel Chabrowski daniel.chabrowski@intel.com
Related-To: NEO-6591
2022-02-04 12:02:49 +01:00
Joshua Santosh Ranjan
f1c50a8c89 Metric Refactor Metric Context
Rename Metric Context and move OA specific functions
and members from Metric Context to OA specific class(MetricSource).
This refactoring is done so that additional Metric Source
like Stall Sampling could be Seamlessly implemented.

Related-To: LOCI-2753


Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-03 21:34:14 +01:00
Joshua Santosh Ranjan
78fa21f31a Metrics Refactor Rename Metric Source Specific Classes
Renaming Oa Specific classes.

Related-To: LOCI-2753

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-01 21:42:57 +01:00
Szymon Morek
d9886f67ee Make IoctlHelper member of Drm
Related-To: NEO-6575

This is needed to fix accessing IoctlHelper
after driver detach.
This way we are also reducing accessing
sysfs file in Drm::getPrelimVersion

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2022-02-01 08:26:59 +01:00
Mayank Raghuwanshi
90963b95ad Update mechanism for getting subdeviceId and onSubdevice for memory
Earlier sysman memory module was using logical subdeviceId
exposed by core to retrieve memory telmetry data, replace
the logical subdeviceId with actual subdeviceId for collecting
telemetry data.

Related-To: LOCI-2828

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-01-28 07:52:47 +01:00
Robert Krzemien
c724f35abb Fixed offsets in calculation for multidevices. Fixed metric types.
Related-To: LOCI-2870
Signed-off-by: Robert Krzemien <robert.krzemien@intel.com>
2022-01-19 17:10:09 +01:00
Ayush Pandey
5f8913f291 ULT fix for pmt handle
Fixed failing ULTs  for the  pmt object created in map were not deleted

Related-To: LOCI-2835

Signed-off-by: Ayush Pandey <ayush.pandey@intel.com>
2022-01-18 16:35:28 +01:00
Zbigniew Zdanowicz
4238679078 Refactor implicit scaling device support
Related-To: NEO-6589

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-01-18 13:08:43 +01:00
Szymon Morek
26a24e8fde Query engine info with distances
If prelim kernel is being used, query distances
and set correctly number of available engines

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2022-01-10 13:30:26 +01:00
Spruit, Neil R
02f075c541 Power Saving Hint Support for Level Zero in Windows
- Added Functionality to pass ze_power_saving_hint_type_t to zeContextCreate
included in the pNext extensions in ze_context_desc_t.
- Enables handling a hint value 0-100 with 0 being no power savings
and 100 being maximum power savings.
- ZE_RESULT_ERROR_INVALID_ENUMERATION is returned given an invalid hint.

Related-To: LOCI-2567

Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
2022-01-06 23:56:59 +01:00
Vilvaraj, T J Vivek
4d5d31d80d sysman: simplify ULT for device reset
device reset is heavily dependant on gmock for ULT
reduce depencency on GMock.

Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2022-01-04 13:26:56 +01:00
Szymon Morek
6258575e5e Use queryEngineInfo with prelim ioctls
If prelim kernel is being used, query engines
with prelim ioctls.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2022-01-04 12:15:04 +01:00
Szymon Morek
d9f6757378 Change return type from unique_ptr to vector
In some of the drm functions there is a pattern
to store array in unique_ptr and pass it's length
as an argument. This commit simplifies this.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2021-12-28 10:33:53 +01:00
Mayank Raghuwanshi
25403cf85d Add support for device level performance factor
Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2021-12-28 07:13:51 +01:00
Joshua Santosh Ranjan
e1ef9ac79a Revert "Metric Detect Paranoid mode and fail gracefully"
This reverts commit a720282358dff08fb36b95eaf9bf184efa315f48.

This revert is to avoid suggesting to disable paranoid mode.
This revert also avoids L0 metrics mandating paranoid mode
setting.

Related-To: LOCI-2822

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2021-12-23 19:30:47 +01:00
T J Vivek Vilvaraj
9a39cad07d sysman:add reinitialization code to diagnostics
Signed-off-by: T J Vivek Vilvaraj <t.j.vivek.vilvaraj@intel.com>
2021-12-23 18:27:06 +01:00
Vilvaraj, T J Vivek
15f102a7cb sysman:modify diagnostics tests
Signed-off-by: Vilvaraj, T J Vivek <t.j.vivek.vilvaraj@intel.com>
2021-12-22 14:53:38 +01:00
Filip Hazubski
eec857932a Split test file
Split test_metric_streamer_2.cpp file

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2021-12-22 14:40:43 +01:00
Szymon Morek
2647d563c7 Remove i915 structs from MemoryInfo
Use structs defined in ioctl_helper.h instead of
i915 dependent ones to avoid conflicts between
different kernels

Related-To: NEO-6149

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2021-12-22 12:46:59 +01:00
T J Vivek Vilvaraj
b95428078e sysman: filter diagnostics related accesses
prevent diagnostics related calls on unsupported platforms


Signed-off-by: T J Vivek Vilvaraj <t.j.vivek.vilvaraj@intel.com>
2021-12-20 12:49:49 +01:00