Commit Graph

748 Commits

Author SHA1 Message Date
Mayank Raghuwanshi
a69110a7ec feature: Add support for RAS mdfi errors
Related-To: LOCI-4479

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-06-13 10:14:36 +02:00
Jitendra Sharma
365ea7a178 feature: Add support for enabling compute unit debug mode
Add support for enabling compute unit debug mode in sysman
which is not initialized by core.

Related-To: LOCI-4370

Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2023-06-05 12:37:35 +02:00
Bellekallu Rajkiran
3c072a6cd1 fix: WA for VF bar resource allocation post Warm reset
On Warm reset, With default bar size set by bios, VF bar
allocation is getting failed because of bug in pci driver
which impacts SRIOV functionality.

Resize VF bar size for succesful allocation of VF bar
post warm reset.

Related-To: LOCI-4481

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-06-02 13:16:34 +02:00
Mateusz Jablonski
82ed9e9a96 refactor: simplify L0 tools/sysman headers
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-05-29 10:59:25 +02:00
Mayank Raghuwanshi
d0d5a03be7 feature: Add support for Experimental zexSysmanMemoryGetBandwidth
Related-To: LOCI-3838

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-05-25 00:46:07 +02:00
Mateusz Hoppe
c4f36a6808 fix(debugger): ack pending events before closing session
- ack module / isa vm bind events before close(fd)

Related-To: NEO-7960

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-05-11 17:21:23 +02:00
Brandon Yates
273635e8e7 test: Fix ULT for zetDebugGetThreadRegisterSetProperties
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2023-05-10 08:28:21 +02:00
Brandon Yates
713f166d17 feature(debugger): Implement zetDebugGetThreadRegisterSetProperties (1/n)
Current SIP implementation has constant reg descs for all threads.
Initial implementation of this API always returns same reg descs as the
non-thread version of API. When SIP exposes per-thread reg descs,
this API will be updated to expose them.

Related-to: NEO-7370
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2023-05-08 14:46:48 +02:00
Singh, Prasoon
d21430b17e fix(sysman): Replace normal pointers with smart pointers (15/n)
Replacing normal pointers by smart pointers in diagnostics module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-05-04 16:50:39 +02:00
Singh, Prasoon
531779ffaf fix(sysman): Replace normal pointers with smart pointers (14/n)
Replacing normal pointers by smart pointers in performance module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-05-04 11:18:00 +02:00
Bellekallu Rajkiran
d3a31957db feature(sysman): Add delay for HBM diagnostics
Add debug variable to set sleep duration for HBM
IFR to complete

Related-To: LOCI-4298

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-05-03 20:27:21 +02:00
Bellekallu Rajkiran
c55c1b4276 refactor(sysman): Remove support for Array and scan diagnostics
Related-To: LOCI-4323

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-05-01 05:30:36 +02:00
Fabian Zwolinski
e351a90f81 refactor: Rename member variables to camelCase 2/n
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-04-27 20:39:22 +02:00
Mayank Raghuwanshi
9cc5763800 fix: Revert spec 1.5 RAS changes from Sysman
Related-To: LOCI-4351

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-04-27 05:29:33 +02:00
Fabian Zwolinski
c441e9e971 refactor: Rename member variables to camelCase
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-04-26 16:05:07 +02:00
Kulkarni, Ashwin Kumar
c93b084a4b Change return code for APIs with no support in windows
Related-To: LOCI-3863

Signed-off-by: Kulkarni, Ashwin Kumar <ashwin.kumar.kulkarni@intel.com>
2023-04-21 07:28:20 +02:00
Mateusz Jablonski
51b8dc66a3 fix ocloc/ult: set default PVC device to pvc xt C0
ensure default hw ip version matches the value from helper
change pvc ult execution to revision 3

Related-To: NEO-7738
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-04-18 13:48:48 +02:00
Mateusz Hoppe
bfd32d6284 fix: read state save area once for all threads for resume(ALL)
- when resume(all) is called - all threads' sr counter needs to be
verified. Reading state save area separately for all threads takes
longer than reading whole state save area once. State save area is
only read again if sr counter wasn't updated
- fail while reading state save area means threads might have completed
execution
- this fix optimizes time spent in resume(all), that may be called before
debugger detaches

Related-To: NEO-7897

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-04-17 15:41:13 +02:00
Compute-Runtime-Validation
e79fb5f39b Revert "fix ocloc/ult: set default PVC device id to pvc xt device id"
This reverts commit bd84ba819b.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-04-15 11:43:21 +02:00
Bellekallu Rajkiran
007f5d70bf [Fix, Sysman] Map uevent to device based on device path
Related-To: LOCI-4307

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-04-14 23:25:39 +02:00
Mateusz Hoppe
079105a5c2 fix: optimize ATT handling - read state save area once for all threads
- reading state save area for every threads takes too long when all
application threads have completed and there are stale ATT events to
process
- on detach gdb seemed to be frozen waiting for ATT event to be handled
- fix is to read state save area once - and check SIP counter for every
thread in ATT bitmask

Related-To: NEO-7897

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-04-14 10:57:18 +02:00
Mateusz Jablonski
bd84ba819b fix ocloc/ult: set default PVC device id to pvc xt device id
ensure default hw ip version matches the value from helper
change pvc ult execution to revision 3

Related-To: NEO-7738
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-04-13 10:54:28 +02:00
Konstanty Misiak
1f37e69fd2 Refactor of IO functions
Related-To: NEO-4562

Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>
2023-04-13 10:46:47 +02:00
Bellekallu Rajkiran
24f73f4686 fix(Sysman): Support for fabric port health change event.
Related-To: LOCI-4053

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-04-12 06:46:19 +02:00
Mateusz Hoppe
e26ebfc51b fix: check exception reason for stopped threads
- before marking interrupt request check exception reason. If there is
exception other than forced exception or forced external halt treat
thread as stopped and generate distinct event for it.

Related-To: NEO-7869

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-04-05 12:01:20 +02:00
Singh, Prasoon
71fe65b327 [Sysman] Replace normal pointers with smart pointers (10/n)
Replacing normal pointers by smart pointers in fan module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-04-05 11:05:44 +02:00
Bellekallu Rajkiran
310380119a feat(sysman): Support for registering events during listen
Related-To: LOCI-3682

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-04-04 07:07:19 +02:00
Singh, Prasoon
42a422d9f4 [Sysman] Replace normal pointers with smart pointers (8/n)
Replacing normal pointers by smart pointers in fabric_port module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-04-03 09:34:52 +02:00
Mayank Raghuwanshi
d0fd7c1667 Improve Sysman windows ult coverage
Related-To: LOCI-4141

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-03-31 15:00:15 +02:00
Mayank Raghuwanshi
3816b85fa0 Add check for memory type before calculating ras hbm errors
Related-To: LOCI-3500

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-03-31 13:47:41 +02:00
Mateusz Hoppe
bfb59cc573 fix: do not resume threads that have pending THREAD_STOPPED events
- threads with pending events cannot be resumed by resume(all) as they
they were not yet discovered by debugger as stopped.
- set reportAsStopped when THREAD_STOPPED event is read
- when resuming, only resume a thread when it was reported

Related-To: NEO-7817

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-03-31 12:42:25 +02:00
Singh, Prasoon
1a6f03c36e [Sysman] Replace normal pointers with smart pointers (4/n).
Replacing normal pointers by smart pointers in temperature module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-03-30 13:07:02 +02:00
Mayank Raghuwanshi
065232eac7 Add support for ras l3 fabric errors
Related-To: LOCI-3966

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-03-30 12:47:45 +02:00
Singh, Prasoon
ac929eaf61 (Sysman):Changes made in standby API.
Replacing normal pointers by smart pointers in standby module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-03-30 06:45:00 +02:00
Singh, Prasoon
49827b7122 (Sysman):Changes made in engine API.
Replacing normal pointers by smart pointers in engine module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-03-30 06:41:26 +02:00
Bari, Pratik
b161af5886 Improve coverage in Fs Access
- Added ULTS to improve the code coverage in fs_access.
- Used the shared functions in the ULTs.
- Added the function for pwrite in the shared code.
Related-To: LOCI-2117

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-03-30 04:21:18 +02:00
Singh, Prasoon
fa2bb2509b [Sysman] Replace normal pointers with smart pointers (6/n)
Replacing normal pointers by smart pointers in scheduler module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-03-29 16:01:08 +02:00
Singh, Prasoon
4475e2228d [Sysman] Replace normal pointers with smart pointers (7/n)
Replacing normal pointers by smart pointers in memory  module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-03-29 10:59:40 +02:00
Yates, Brandon
0f6140deb0 fix(l0debug): windows topology map should always be created
Topology map was only being created when ZET_ENABLE_PROGAM_DEBUGGING was
set.  This was not correct.  Now it is unconditionally created at init,
and debug attach will fail if it is not valid.

Related-to: LOCI-3937

Signed-off-by: Yates, Brandon <brandon.yates@intel.com>
2023-03-27 16:24:57 +02:00
Bari, Pratik
fb82af429b Improve ULT function coverage in Fan Component
As the fan component is not supported on Linux, hence for the ULTs a
fan handle is created and used which improves the coverage for the Fan
component.
Related-To: LOCI-2113

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-03-27 10:06:32 +02:00
Bari, Pratik
e813ae0f4a Added changes for maxBandwidth computation in Memory Component.
The memoryBusWidth used for the computation of maxBandwidth has been
changed to 128 bytes from 128 bits. Hence the final maxBandwidth need
not be further divided by 8 to convert it into bytes/sec.

Related-To: LOCI-3833

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-03-23 20:20:59 +01:00
Kulkarni, Ashwin Kumar
9d866dac9e Support maxBandwidth in zesDevicePciGetState for windows
Fix to update zes_pci_speed_t.maxBandwidth

Related-To: LOCI-4056

Signed-off-by: Kulkarni, Ashwin Kumar <ashwin.kumar.kulkarni@intel.com>
2023-03-20 06:12:44 +01:00
Philip Meyer
2606788250 Sysman - Modify Consumption of KMD Energy Counter Timestamp
Signed-off-by: Philip Meyer philip.meyer@intel.com

Resolves: HSD-14018885283
2023-03-20 06:05:24 +01:00
Mateusz Jablonski
659cacf2c9 refactor l0 cmake: reduce include directories
Related-To: NEO-7507
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-17 13:41:55 +01:00
Mateusz Jablonski
a7830eb478 refactor l0 cmake: add CMakeLists.txt files to solution
Related-To: NEO-7507
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-17 12:09:03 +01:00
Mateusz Jablonski
cb7437b6b7 refactor l0 tools: cleanup cmake files
append sources in current directory

Related-To: NEO-7507
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-16 14:05:14 +01:00
Filip Hazubski
0025cb25f4 test: Correct logic of disabling slices in gtSystemInfo
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2023-03-16 09:13:47 +01:00
Mateusz Hoppe
f2275caf3a performance: L0 debug - improve single-step time
- check threads with ATT only when more than one thread is resumed
- if single thread is resumed - check only that thread
- delay thread state check - allowing thread to reenter SIP

Related-To: NEO-7750

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-03-15 09:48:35 +01:00
Joshua Santosh Ranjan
e2177f44f3 [Metrics]Fix OA buffer size request for streamer
This patch uses maximum OA buffer size for the
platform when opening an io stream.

Related-To: LOCI-4103

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2023-03-14 11:10:11 +01:00
Kamil Kopryk
fa8579602f refactor: rename product helper files n/n
Related-To: NEO-7703
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-03-10 13:24:38 +01:00