Commit Graph

1008 Commits

Author SHA1 Message Date
Mayank Raghuwanshi
a69110a7ec feature: Add support for RAS mdfi errors
Related-To: LOCI-4479

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-06-13 10:14:36 +02:00
Kumar, Shubham
c99a08bea8 feature: Support for Fan module using zesinint on windows
Added ULTs for Fan module in new sysman implementation
Added support for Fan in zello_sysman

Related-To: LOCI-4409, LOCI-4412

Signed-off-by: Kumar, Shubham <shubham.kumar@intel.com>
2023-06-13 02:12:20 +02:00
Jitendra Sharma
365ea7a178 feature: Add support for enabling compute unit debug mode
Add support for enabling compute unit debug mode in sysman
which is not initialized by core.

Related-To: LOCI-4370

Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2023-06-05 12:37:35 +02:00
Bellekallu Rajkiran
3c072a6cd1 fix: WA for VF bar resource allocation post Warm reset
On Warm reset, With default bar size set by bios, VF bar
allocation is getting failed because of bug in pci driver
which impacts SRIOV functionality.

Resize VF bar size for succesful allocation of VF bar
post warm reset.

Related-To: LOCI-4481

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-06-02 13:16:34 +02:00
Mateusz Jablonski
82ed9e9a96 refactor: simplify L0 tools/sysman headers
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-05-29 10:59:25 +02:00
Mayank Raghuwanshi
d0d5a03be7 feature: Add support for Experimental zexSysmanMemoryGetBandwidth
Related-To: LOCI-3838

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-05-25 00:46:07 +02:00
Bari, Pratik
af2c61c54b feature(sysman): Added support for Performance APIs
Added support for the Performance APIs in the new sysman design.
Added ULTs for the Performance APIs in the new sysman design.
In the black box test for performance, the API to compute the number
of sub devices has been changed in order to get the sub device count
regardless of the sysman initialization using zeInit or zesInit.

Related-To: LOCI-4294

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-05-16 06:59:29 +02:00
Mateusz Hoppe
c4f36a6808 fix(debugger): ack pending events before closing session
- ack module / isa vm bind events before close(fd)

Related-To: NEO-7960

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-05-11 17:21:23 +02:00
Devarinti, Puneeth Kumar Reddy
52afb9fc85 feature: Add debug logs for scheduler module
Related-To: LOCI-3880

Signed-off-by: Devarinti, Puneeth Kumar Reddy <puneeth.kumar.reddy.devarinti@intel.com>
2023-05-11 12:14:34 +02:00
Dunajski, Bartosz
41478c5972 feature: Experimental support of immediate cmd list in-order execution [6/n]
Related-To: LOCI-4332

- Signal appendWaitOnEvents API call
- Signal appendBarrier call
- Handle sync allocation residency

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-05-11 11:41:49 +02:00
Brandon Yates
273635e8e7 test: Fix ULT for zetDebugGetThreadRegisterSetProperties
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2023-05-10 08:28:21 +02:00
Devarinti, Puneeth Kumar Reddy
c03867b55c feature: Add debug logs for RAS module
Related-To: LOCI-3880

Signed-off-by: Devarinti, Puneeth Kumar Reddy <puneeth.kumar.reddy.devarinti@intel.com>
2023-05-09 08:12:06 +02:00
Brandon Yates
713f166d17 feature(debugger): Implement zetDebugGetThreadRegisterSetProperties (1/n)
Current SIP implementation has constant reg descs for all threads.
Initial implementation of this API always returns same reg descs as the
non-thread version of API. When SIP exposes per-thread reg descs,
this API will be updated to expose them.

Related-to: NEO-7370
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2023-05-08 14:46:48 +02:00
Singh, Prasoon
d21430b17e fix(sysman): Replace normal pointers with smart pointers (15/n)
Replacing normal pointers by smart pointers in diagnostics module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-05-04 16:50:39 +02:00
Singh, Prasoon
531779ffaf fix(sysman): Replace normal pointers with smart pointers (14/n)
Replacing normal pointers by smart pointers in performance module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-05-04 11:18:00 +02:00
Bellekallu Rajkiran
d3a31957db feature(sysman): Add delay for HBM diagnostics
Add debug variable to set sleep duration for HBM
IFR to complete

Related-To: LOCI-4298

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-05-03 20:27:21 +02:00
Bellekallu Rajkiran
c55c1b4276 refactor(sysman): Remove support for Array and scan diagnostics
Related-To: LOCI-4323

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-05-01 05:30:36 +02:00
Devarinti, Puneeth Kumar Reddy
1c44e3ab3e feature: Add debug logs for ECC module
Related-To: LOCI-3878

Signed-off-by: Devarinti, Puneeth Kumar Reddy <puneeth.kumar.reddy.devarinti@intel.com>
2023-04-28 09:57:22 +02:00
Fabian Zwolinski
e351a90f81 refactor: Rename member variables to camelCase 2/n
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-04-27 20:39:22 +02:00
Bari, Pratik
4647564cc9 feature(sysman): Added support for Temperature APIs
Added support for the Temperature APIs in the new sysman design.
Added ULTs for the Temperature APIs in the new sysman design.

Related-To: LOCI-4293

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-04-27 16:45:23 +02:00
Mayank Raghuwanshi
9cc5763800 fix: Revert spec 1.5 RAS changes from Sysman
Related-To: LOCI-4351

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-04-27 05:29:33 +02:00
Fabian Zwolinski
c441e9e971 refactor: Rename member variables to camelCase
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-04-26 16:05:07 +02:00
Fabian Zwolinski
e2e00413a8 Apply CamelCase for class and struct names
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-04-24 15:36:27 +02:00
Mayank Raghuwanshi
b733d56a36 Make calls to igsc from Sysman thread safe
Related-To: LOCI-4325

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-04-21 15:51:27 +02:00
Kulkarni, Ashwin Kumar
c93b084a4b Change return code for APIs with no support in windows
Related-To: LOCI-3863

Signed-off-by: Kulkarni, Ashwin Kumar <ashwin.kumar.kulkarni@intel.com>
2023-04-21 07:28:20 +02:00
Mateusz Jablonski
51b8dc66a3 fix ocloc/ult: set default PVC device to pvc xt C0
ensure default hw ip version matches the value from helper
change pvc ult execution to revision 3

Related-To: NEO-7738
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-04-18 13:48:48 +02:00
Mateusz Hoppe
bfd32d6284 fix: read state save area once for all threads for resume(ALL)
- when resume(all) is called - all threads' sr counter needs to be
verified. Reading state save area separately for all threads takes
longer than reading whole state save area once. State save area is
only read again if sr counter wasn't updated
- fail while reading state save area means threads might have completed
execution
- this fix optimizes time spent in resume(all), that may be called before
debugger detaches

Related-To: NEO-7897

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-04-17 15:41:13 +02:00
Devarinti, Puneeth Kumar Reddy
239ce79f43 Debug: Add debug logs for global module
Related-To: LOCI-3876

Signed-off-by: Devarinti, Puneeth Kumar Reddy <puneeth.kumar.reddy.devarinti@intel.com>
2023-04-17 11:30:12 +02:00
Compute-Runtime-Validation
e79fb5f39b Revert "fix ocloc/ult: set default PVC device id to pvc xt device id"
This reverts commit bd84ba819b.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-04-15 11:43:21 +02:00
Bellekallu Rajkiran
007f5d70bf [Fix, Sysman] Map uevent to device based on device path
Related-To: LOCI-4307

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-04-14 23:25:39 +02:00
Mateusz Hoppe
079105a5c2 fix: optimize ATT handling - read state save area once for all threads
- reading state save area for every threads takes too long when all
application threads have completed and there are stale ATT events to
process
- on detach gdb seemed to be frozen waiting for ATT event to be handled
- fix is to read state save area once - and check SIP counter for every
thread in ATT bitmask

Related-To: NEO-7897

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-04-14 10:57:18 +02:00
Devarinti, Puneeth Kumar Reddy
94dc789212 Debug: Add debug logs for pci module
Related-To: LOCI-3876

Signed-off-by: Devarinti, Puneeth Kumar Reddy <puneeth.kumar.reddy.devarinti@intel.com>
2023-04-13 13:00:01 +02:00
Mateusz Jablonski
bd84ba819b fix ocloc/ult: set default PVC device id to pvc xt device id
ensure default hw ip version matches the value from helper
change pvc ult execution to revision 3

Related-To: NEO-7738
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-04-13 10:54:28 +02:00
Devarinti, Puneeth Kumar Reddy
a684e0ffc6 Debug: Add debug logs for fabricport module
Related-To: LOCI-3882

Signed-off-by: Devarinti, Puneeth Kumar Reddy <puneeth.kumar.reddy.devarinti@intel.com>
2023-04-13 10:47:04 +02:00
Konstanty Misiak
1f37e69fd2 Refactor of IO functions
Related-To: NEO-4562

Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>
2023-04-13 10:46:47 +02:00
Bellekallu Rajkiran
24f73f4686 fix(Sysman): Support for fabric port health change event.
Related-To: LOCI-4053

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-04-12 06:46:19 +02:00
Bari, Pratik
f8623fadaf Added support for Standby APIs
- Added support for the Standby APIs in the new sysman design.
- Added ULTs for the Standby APIs in the new sysman design.

Related-To: LOCI-4097

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-04-10 06:54:50 +02:00
Jitendra Sharma
d29ed25f8b Add support for global_operations in new sysman design
Related-To: LOCI-4135
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2023-04-05 17:25:03 +02:00
Mateusz Hoppe
d1393e08d3 refactor: remove debug break when EU CONTROL ioctl fails
- when no threads are executing, interrupt all may fail and debug break
fires - although error is handled and correct event is returned. To
prevent abort, debug break has to be removed

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-04-05 13:07:31 +02:00
Mateusz Hoppe
e26ebfc51b fix: check exception reason for stopped threads
- before marking interrupt request check exception reason. If there is
exception other than forced exception or forced external halt treat
thread as stopped and generate distinct event for it.

Related-To: NEO-7869

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-04-05 12:01:20 +02:00
Singh, Prasoon
71fe65b327 [Sysman] Replace normal pointers with smart pointers (10/n)
Replacing normal pointers by smart pointers in fan module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-04-05 11:05:44 +02:00
Bellekallu Rajkiran
310380119a feat(sysman): Support for registering events during listen
Related-To: LOCI-3682

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-04-04 07:07:19 +02:00
Joshua Santosh Ranjan
5dbe905e74 [Sysman]Update Firmware module to support zesInit
Related-To: LOCI-4227

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2023-04-04 06:48:52 +02:00
Singh, Prasoon
42a422d9f4 [Sysman] Replace normal pointers with smart pointers (8/n)
Replacing normal pointers by smart pointers in fabric_port module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-04-03 09:34:52 +02:00
Mayank Raghuwanshi
d0fd7c1667 Improve Sysman windows ult coverage
Related-To: LOCI-4141

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-03-31 15:00:15 +02:00
Mayank Raghuwanshi
3816b85fa0 Add check for memory type before calculating ras hbm errors
Related-To: LOCI-3500

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-03-31 13:47:41 +02:00
Mateusz Hoppe
bfb59cc573 fix: do not resume threads that have pending THREAD_STOPPED events
- threads with pending events cannot be resumed by resume(all) as they
they were not yet discovered by debugger as stopped.
- set reportAsStopped when THREAD_STOPPED event is read
- when resuming, only resume a thread when it was reported

Related-To: NEO-7817

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-03-31 12:42:25 +02:00
Joshua Santosh Ranjan
0f280dbadf [Sysman] Update Scheduler module for zesInit
Related-To: LOCI-4119

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2023-03-31 10:43:05 +02:00
Singh, Prasoon
1a6f03c36e [Sysman] Replace normal pointers with smart pointers (4/n).
Replacing normal pointers by smart pointers in temperature module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-03-30 13:07:02 +02:00
Mayank Raghuwanshi
065232eac7 Add support for ras l3 fabric errors
Related-To: LOCI-3966

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-03-30 12:47:45 +02:00