943 Commits

Author SHA1 Message Date
Brandon Yates 9db08001c4 fix(debugger): Set flags for MME regset
Related-to: NEO-8286

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2023-08-23 16:38:35 +02:00
Bari, Pratik f9e4381c1e feature: Add FsAccess and Derived class in Sysman Kmd Interface
- The FsAccess, ProcFsAccess and SysfsAccess classes have been added in
a file in the Sysman Shared directory. The instances of these classes
are maintained in the Sysman Kmd Interface classes.
- Added functions for opendir, readdir and closedir in the shared code.
- Added a ULT in the sysman directory to cover these new functions from
the shared code.

Related-To: LOCI-4689

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-08-23 11:38:57 +02:00
Spruit, Neil R 5c75449508 fix: Track Sub Device Hierarchy for use by Metrics Library
Related-To: LOCI-4819

- When communicating with the Metrics Libraries, if one is using a
specific sub device, then the correct physical sub device index and
number of sub devices must be used.

- When Affinity Mask is set, this hierarchy information is lost,
therefore the sub device hierarchy data is stored in a map to each
"device" to be returned to the user allowing for the correct sub device
information to be passed to the metrics library.

Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
2023-08-22 21:37:25 +02:00
Kacper Kasper 991febcdf4 fix: extend cache hash with compiler commit sha, lib size and mtime
Related-To: NEO-4262

Signed-off-by: Kacper Kasper <kacper.k.kasper@intel.com>
2023-08-22 15:28:57 +02:00
Bellekallu Rajkiran 81d037f59b feature(sysman): Add support for fabric error counters
Related-To: LOCI-4620

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-08-16 15:05:32 +02:00
Kacper Nowak 1afaf37f78 refactor: Unify GTPin initialization logic between APIs
Add support for new GTPin loading logic in OCL path, similar to existing
in L0 - invoking exposed, dedicated API call (OpenGTPinOCL).
- Move logic to shared, including unit tests
- Check whether instrumentation is required on
  context creation and if yes,
  make a call to OpenGTPinOCL function.
  Handle potential errors gracefully without exiting.
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-08-09 17:15:24 +02:00
Compute-Runtime-Validation 65df34bbc7 Revert "refactor: Unify GTPin initialization logic between APIs"
This reverts commit 68a5108e05.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-08-07 04:31:52 +02:00
Kacper Nowak 68a5108e05 refactor: Unify GTPin initialization logic between APIs
Add support for new GTPin loading logic in OCL path, similar to existing
in L0 - invoking exposed, dedicated API call (OpenGTPinOCL).
- Move logic to shared, including unit tests
- Check whether instrumentation is required on
  clGetPlatformIDs API call, and if yes,
  make a call to OpenGTPinOCL function.
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-08-05 18:22:08 +02:00
Kulkarni, Ashwin Kumar 5cd39dd155 fix(sysman): supports return of correct temperature domains on igfx
Related-To: LOCI-4741

Signed-off-by: Kulkarni, Ashwin Kumar <ashwin.kumar.kulkarni@intel.com>
2023-08-04 12:28:50 +02:00
Bellekallu Rajkiran 63fa05e776 fix: Proper update of device count that generated events
The numDeviceEvents out param is not updated when there are no
events, and the device events map is not overridden on multiple
register calls, which results in failure.

Related-To: LOCI-4751

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-08-04 11:58:56 +02:00
Bari, Pratik 3f083360a2 feature(sysman): Added sysfs filenames for the memory module
- The sysfs filenames have been added in the sysfsNameToFileMap of the
SysmanKmdInterface classes.
- The functions returning the sysfs filenames have been removed from the
shared directory.
- The ULTs have been added to return the sysfs filenames.

Related-To: LOCI-4699

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-08-03 22:36:17 +02:00
Bari, Pratik b85222b4f3 feature(sysman): Added changes for Porting Standby API
The corresponding sysfs file names used by the standby module have been
added in the sysfsNameToFileMap.
The ULTs have been added in the standby module covering the new
filename.
The base path for the sysfs filenames corresponding to the Xe driver
has been modified.

Related-To: LOCI-4406

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-08-02 16:02:47 +02:00
Bari, Pratik 64bf895cf7 feature(sysman): Create boilerplate for spec 1.7 in sysman
- Added new sysman APIs introduced in spec1.7
- Added ULTs to cover the new APIs

Related-To: LOCI-4667

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-08-02 13:51:09 +02:00
Mateusz Jablonski 1e95ed33f9 refactor: remove redundant parameter from HardwareInfo ctor
hw ip version is queried from KMD or set based on device id and rev id

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-07-31 16:12:57 +02:00
Mateusz Jablonski c22482f6f7 test: unify main function for test targets
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-07-28 20:00:45 +02:00
Bellekallu Rajkiran 8778a0acca feature(sysman): Optimize Sysfs reads
Maintain cache of file names and file descriptor to
avoid invoking open and close system calls on every
read call.

Related-To: LOCI-4556

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-07-27 10:08:10 +02:00
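The caching idea in the commit above can be sketched roughly as follows. This is a minimal illustration, not the driver's implementation: the class name `CachedFileReader`, its members, and the 4096-byte buffer are hypothetical. It keeps one descriptor per path and uses POSIX `pread()` so that repeated reads do not need `open()` and `close()` each time.

```cpp
#include <fcntl.h>
#include <unistd.h>

#include <string>
#include <unordered_map>

// Hypothetical helper (not the driver's actual class): caches one file
// descriptor per path so repeated reads skip open()/close().
class CachedFileReader {
  public:
    ~CachedFileReader() {
        // Close every cached descriptor when the reader goes away.
        for (auto &entry : fdCache) {
            ::close(entry.second);
        }
    }

    // Reads up to 4096 bytes from the start of the file into 'out'.
    // Returns false if the file cannot be opened or read.
    bool read(const std::string &path, std::string &out) {
        int fd = getFd(path);
        if (fd < 0) {
            return false;
        }
        char buffer[4096];
        // pread() takes an explicit offset, so the cached descriptor's
        // file position never has to be rewound between calls.
        ssize_t bytesRead = ::pread(fd, buffer, sizeof(buffer), 0);
        if (bytesRead < 0) {
            return false;
        }
        out.assign(buffer, static_cast<size_t>(bytesRead));
        return true;
    }

    int openCalls = 0; // exposed only to make the caching observable

  private:
    // Returns the cached descriptor, opening the file on first use.
    int getFd(const std::string &path) {
        auto it = fdCache.find(path);
        if (it != fdCache.end()) {
            return it->second;
        }
        int fd = ::open(path.c_str(), O_RDONLY);
        ++openCalls;
        if (fd >= 0) {
            fdCache.emplace(path, fd);
        }
        return fd;
    }

    std::unordered_map<std::string, int> fdCache;
};
```

Reading the same path twice through one `CachedFileReader` should leave `openCalls` at 1. A real implementation would also need to decide when stale descriptors are dropped, which this sketch does not address.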
Matias Cabral 6946dd561f refactor: metrics enums rename for L0 spec version 1.7
Resolves: LOCI-4642

Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>
2023-07-26 21:07:46 +02:00
Bellekallu Rajkiran 424784f7cf feature(sysman): Add power control support with xe driver
Support to get power related sysfs names based on drm
driver installed.

Related-To: LOCI-4403

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-07-26 12:50:25 +02:00
Brandon Yates 3637f519c3 test: fix DebugSession ULT
Use size of actual SIP SSA header as input to test

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2023-07-25 17:12:15 +02:00
Mayank Raghuwanshi 171e63409c feature: Add common Interface for timestamp in sysman
Related-To: LOCI-4511

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-07-25 09:32:51 +02:00
Mayank Raghuwanshi 175ceb9bd1 feature: Add support for media freq domain on linux
Related-To: LOCI-4552

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-07-25 08:12:33 +02:00
Mayank Raghuwanshi af6798f257 feature: Add support for 64 bit memory counters for zesInit
Related-To: LOCI-4613, LOCI-4612

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-07-20 11:04:24 +02:00
Compute-Runtime-Validation 5c60b4d973 Revert "feature: Add support for Media frequency Domain on linux"
This reverts commit 085293b230.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-07-20 03:04:52 +02:00
Mayank Raghuwanshi 085293b230 feature: Add support for Media frequency Domain on linux
Related-To: LOCI-4552

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-07-19 16:43:27 +02:00
Artur Harasimiuk 9524b65b51 refactor: remove unused code
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2023-07-19 12:00:40 +02:00
Bari, Pratik 16b56b9a6f feature(sysman): New shared directory in the sysman code.
The files containing the new 'SysmanKmdInterface' have been moved into
the new 'shared' directory in the existing sysman directory. More files
are to be added in this directory which will be used by all the sysman
modules in the sysman and tools directories.

Related-To: LOCI-4649

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-07-18 16:06:31 +02:00
Artur Harasimiuk faa8907344 refactor: remove unused code
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2023-07-18 14:52:43 +02:00
Mateusz Hoppe 8e07dd30cb performance(debugger): optimize ATT handling - minimize mem allocs
- do not allocate state save area every time attention event
is handled
- keep allocated memory for subsequent events
- remove unneeded DEBUG_BREAK

Related-To: NEO-8183

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-07-18 08:37:04 +02:00
Bellekallu Rajkiran 461ecc9b3b fix(sysman): Align board number string sequence
Related-To: LOCI-4631

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-07-17 16:07:43 +02:00
Mateusz Hoppe 9a53f20e6c performance(debugger): optimize ATT event handling
- check threads with attention state before reading SR_IDENT
- memory read is costly, for threads already in stopped state
there is no need to read sr_ident and check thread state again
- performance of single-stepping one thread is substantially improved,
by a factor of 6

Related-To: NEO-8183

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-07-17 10:42:34 +02:00
Bari, Pratik b9837ef068 feature(sysman): Added changes for Porting Frequency API
The new classes SysmanKmdInterface, SysmanKmdInterfaceI915 and
SysmanKmdInterfaceXe have been introduced.
A map is maintained in the SysmanKmdInterfaceI915 and
SysmanKmdInterfaceXe class for the sysfs file names.
The access specifier of the function getDrmVersion has been changed from
protected to public so as to use it in the sysman code. This is required
for the SysmanKmdInterface pointer to point to the
SysmanKmdInterfaceI915 and SysmanKmdInterfaceXe accordingly.
The ULTs have been added for the new sysfs file path corresponding to
the i915 and the Xe driver.

Related-To: LOCI-4399

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-07-13 08:41:05 +02:00
Mateusz Hoppe 9f466a9967 test: fix DebugSession ULT
- use size of state save area header when comparing contents

Related-To: NEO-7630

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-07-10 11:11:04 +02:00
Mayank Raghuwanshi ddd7e6087f fix: Add guid check while calculating sysman memory Bandwidth
Related-To: LOCI-4597

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-07-05 19:36:44 +02:00
Compute-Runtime-Validation 7b45b205e3 Revert "fix: Add guid check while calculating Sysman memory Bandwidth"
This reverts commit c3e2e145c5.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-07-04 14:23:50 +02:00
Mayank Raghuwanshi c3e2e145c5 fix: Add guid check while calculating Sysman memory Bandwidth
Related-To: LOCI-4597

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-07-03 19:32:17 +02:00
Bellekallu Rajkiran 6994106c1f fix: Read pci config space on demand.
Storing config space during sysman init might result in
providing inappropriate properties as config space can be
modified.

Read config space whenever properties are queried.

Related-To: LOCI-4513

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-07-01 00:37:38 +02:00
Mayank Raghuwanshi d02885767d feature: Add Support for 64 bit aggregated read/write counters
Related-To: LOCI-4529

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-06-28 18:14:47 +02:00
Brandon Yates bef89a3a7c feature(debugger): Force GRF count to 128
System Routine changes for implementing large-grf debugging
will result in GRF count of 256 being reported for kernels
in all GRF modes. Applications using L0 Debug API do not
yet have a way to determine actual GRF count for kernel.

This commit is a temporary adjustment to avoid breaking debug
API users after new system routine is enabled.

This problem will be fully solved in follow up patch with switch to
zetDebugGetThreadRegisterSetProperties

Related-to: NEO-7370

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2023-06-27 14:24:52 +02:00
Joshua Santosh Ranjan 0853cb71b7 feature: initial support for MetricGroupGetExportData
Related-To: LOCI-4356, LOCI-4357

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2023-06-27 00:09:45 +02:00
Aravind Gopalakrishnan ba4c4a5af0 feature(sysman): Retrieve uuid for Sysman device properties
- Abstracts product helpers logic for uuid
- Add UUID retrieval for Linux for Sysman via zesInit path

Related-To: LOCI-4137

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2023-06-26 19:17:21 +02:00
Matias Cabral 6f068c4a84 test: minor fix in metrics IP sampling test fixture
Add override to Setup/TearDown fixture methods

Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>
2023-06-23 07:57:30 +02:00
Kamil Kopryk 082d33bb7c fix: correct query topology on xe
Related-To: NEO-7996
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-06-22 13:24:52 +02:00
Matias Cabral 79d600d767 test: minor fix in metrics test variables names
Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>
2023-06-21 10:09:42 +02:00
Matias Cabral 96517a08aa feature: Implement zetMetricGroupGetGlobalTimestampsExp()
Resolves: LOCI-3072

Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>
2023-06-21 09:48:41 +02:00
Bellekallu Rajkiran fe4330e588 feature(sysman): Add support for events module
- Port events module to new sysman design
- Add events ULTs for new sysman interface

Related-To: LOCI-4120

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-06-21 07:06:43 +02:00
Bari, Pratik a15e8a9679 feature: Added changes for Porting Memory API with XE driver
The Memory Info object is used in the getState function for memory.
Some of the ULTs in the memory module have been modified.
A function to return the sysfs nodes for the Memory address range has
been added in the IoctlHelper class corresponding to the XE and i915
driver.

Related-To: LOCI-4397

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-06-20 21:38:17 +02:00
Rafal Maziejuk 1b54e61f24 refactor: unify engine handle context mock structs
Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com>
Related-To: NEO-6904
2023-06-19 08:34:54 +02:00
Mayank Raghuwanshi ac01d02247 feature: Add support for frequency media domain for windows
Related-To: LOCI-4287

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-06-19 06:19:10 +02:00
Matias Cabral cfa187aec6 feature: Support for metrics group exp extension
Support zet_metric_global_timestamps_resolution_exp_t

Resolves: LOCI-4350

Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>
2023-06-16 07:48:32 +02:00
shubham kumar c824c48f21 feature(sysman): Support for PowerLimitExt APIs in windows
Related-To: LOCI-3407

Signed-off-by: shubham kumar <shubham.kumar@intel.com>
2023-06-14 19:16:25 +02:00
Mayank Raghuwanshi a69110a7ec feature: Add support for RAS mdfi errors
Related-To: LOCI-4479

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-06-13 10:14:36 +02:00
Kumar, Shubham c99a08bea8 feature: Support for Fan module using zesInit on windows
Added ULTs for Fan module in new sysman implementation
Added support for Fan in zello_sysman

Related-To: LOCI-4409, LOCI-4412

Signed-off-by: Kumar, Shubham <shubham.kumar@intel.com>
2023-06-13 02:12:20 +02:00
Jitendra Sharma 365ea7a178 feature: Add support for enabling compute unit debug mode
Add support for enabling compute unit debug mode in sysman
which is not initialized by core.

Related-To: LOCI-4370

Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2023-06-05 12:37:35 +02:00
Bellekallu Rajkiran 3c072a6cd1 fix: WA for VF bar resource allocation post Warm reset
On warm reset, with the default bar size set by the BIOS, VF bar
allocation fails because of a bug in the pci driver,
which impacts SRIOV functionality.

Resize the VF bar for successful allocation of the VF bar
post warm reset.

Related-To: LOCI-4481

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-06-02 13:16:34 +02:00
Mateusz Jablonski 82ed9e9a96 refactor: simplify L0 tools/sysman headers
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-05-29 10:59:25 +02:00
Mayank Raghuwanshi d0d5a03be7 feature: Add support for Experimental zexSysmanMemoryGetBandwidth
Related-To: LOCI-3838

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-05-25 00:46:07 +02:00
Bari, Pratik af2c61c54b feature(sysman): Added support for Performance APIs
Added support for the Performance APIs in the new sysman design.
Added ULTs for the Performance APIs in the new sysman design.
In the black box test for performance, the API to compute the number
of sub devices has been changed in order to get the sub device count
regardless of the sysman initialization using zeInit or zesInit.

Related-To: LOCI-4294

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-05-16 06:59:29 +02:00
Mateusz Hoppe c4f36a6808 fix(debugger): ack pending events before closing session
- ack module / isa vm bind events before close(fd)

Related-To: NEO-7960

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-05-11 17:21:23 +02:00
Brandon Yates 273635e8e7 test: Fix ULT for zetDebugGetThreadRegisterSetProperties
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2023-05-10 08:28:21 +02:00
Brandon Yates 713f166d17 feature(debugger): Implement zetDebugGetThreadRegisterSetProperties (1/n)
Current SIP implementation has constant reg descs for all threads.
Initial implementation of this API always returns same reg descs as the
non-thread version of API. When SIP exposes per-thread reg descs,
this API will be updated to expose them.

Related-to: NEO-7370
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2023-05-08 14:46:48 +02:00
Singh, Prasoon d21430b17e fix(sysman): Replace normal pointers with smart pointers (15/n)
Replacing normal pointers by smart pointers in diagnostics module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-05-04 16:50:39 +02:00
Singh, Prasoon 531779ffaf fix(sysman): Replace normal pointers with smart pointers (14/n)
Replacing normal pointers by smart pointers in performance module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-05-04 11:18:00 +02:00
Bellekallu Rajkiran d3a31957db feature(sysman): Add delay for HBM diagnostics
Add debug variable to set sleep duration for HBM
IFR to complete

Related-To: LOCI-4298

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-05-03 20:27:21 +02:00
Bellekallu Rajkiran c55c1b4276 refactor(sysman): Remove support for Array and scan diagnostics
Related-To: LOCI-4323

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-05-01 05:30:36 +02:00
Fabian Zwolinski e351a90f81 refactor: Rename member variables to camelCase 2/n
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-04-27 20:39:22 +02:00
Mayank Raghuwanshi 9cc5763800 fix: Revert spec 1.5 RAS changes from Sysman
Related-To: LOCI-4351

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-04-27 05:29:33 +02:00
Fabian Zwolinski c441e9e971 refactor: Rename member variables to camelCase
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-04-26 16:05:07 +02:00
Kulkarni, Ashwin Kumar c93b084a4b Change return code for APIs with no support in windows
Related-To: LOCI-3863

Signed-off-by: Kulkarni, Ashwin Kumar <ashwin.kumar.kulkarni@intel.com>
2023-04-21 07:28:20 +02:00
Mateusz Jablonski 51b8dc66a3 fix ocloc/ult: set default PVC device to pvc xt C0
ensure default hw ip version matches the value from helper
change pvc ult execution to revision 3

Related-To: NEO-7738
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-04-18 13:48:48 +02:00
Mateusz Hoppe bfd32d6284 fix: read state save area once for all threads for resume(ALL)
- when resume(all) is called - all threads' sr counter needs to be
verified. Reading state save area separately for all threads takes
longer than reading whole state save area once. State save area is
only read again if sr counter wasn't updated
- a failure while reading state save area means threads might have completed
execution
- this fix optimizes time spent in resume(all), that may be called before
debugger detaches

Related-To: NEO-7897

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-04-17 15:41:13 +02:00
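The trade-off described in the commit above, one large read instead of one read per thread, can be sketched as follows. Everything here is assumed for illustration: `FakeMemory`, the 64-byte slot size, and the counter sitting at offset 0 of each slot are hypothetical and do not reflect the real state save area layout.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for debugger memory access. It counts reads so the
// cost difference between the two approaches is visible.
struct FakeMemory {
    std::vector<uint8_t> data;
    int readCalls = 0;

    std::vector<uint8_t> read(size_t offset, size_t size) {
        ++readCalls;
        auto first = data.begin() + static_cast<std::ptrdiff_t>(offset);
        return std::vector<uint8_t>(first, first + static_cast<std::ptrdiff_t>(size));
    }
};

// Assumed layout for illustration only: fixed-size slot per thread, with a
// one-byte counter at the start of each slot.
constexpr size_t slotSize = 64;
constexpr size_t counterOffset = 0;

// Per-thread approach: one memory read for every thread.
std::vector<uint8_t> readCountersPerThread(FakeMemory &mem, size_t threadCount) {
    std::vector<uint8_t> counters;
    for (size_t i = 0; i < threadCount; ++i) {
        auto slot = mem.read(i * slotSize, slotSize);
        counters.push_back(slot[counterOffset]);
    }
    return counters;
}

// Single-read approach: fetch the whole area once, then index into it.
std::vector<uint8_t> readCountersOnce(FakeMemory &mem, size_t threadCount) {
    auto area = mem.read(0, threadCount * slotSize);
    std::vector<uint8_t> counters;
    for (size_t i = 0; i < threadCount; ++i) {
        counters.push_back(area[i * slotSize + counterOffset]);
    }
    return counters;
}
```

Both functions return the same counters; the second issues a single read regardless of thread count, which is the property the commit message relies on.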
Compute-Runtime-Validation e79fb5f39b Revert "fix ocloc/ult: set default PVC device id to pvc xt device id"
This reverts commit bd84ba819b.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-04-15 11:43:21 +02:00
Bellekallu Rajkiran 007f5d70bf [Fix, Sysman] Map uevent to device based on device path
Related-To: LOCI-4307

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-04-14 23:25:39 +02:00
Mateusz Hoppe 079105a5c2 fix: optimize ATT handling - read state save area once for all threads
- reading state save area for every thread takes too long when all
application threads have completed and there are stale ATT events to
process
- on detach gdb seemed to be frozen waiting for ATT event to be handled
- fix is to read state save area once - and check SIP counter for every
thread in ATT bitmask

Related-To: NEO-7897

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-04-14 10:57:18 +02:00
Mateusz Jablonski bd84ba819b fix ocloc/ult: set default PVC device id to pvc xt device id
ensure default hw ip version matches the value from helper
change pvc ult execution to revision 3

Related-To: NEO-7738
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-04-13 10:54:28 +02:00
Konstanty Misiak 1f37e69fd2 Refactor of IO functions
Related-To: NEO-4562

Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>
2023-04-13 10:46:47 +02:00
Bellekallu Rajkiran 24f73f4686 fix(Sysman): Support for fabric port health change event.
Related-To: LOCI-4053

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-04-12 06:46:19 +02:00
Mateusz Hoppe e26ebfc51b fix: check exception reason for stopped threads
- before marking interrupt request check exception reason. If there is
an exception other than forced exception or forced external halt, treat
the thread as stopped and generate a distinct event for it.

Related-To: NEO-7869

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-04-05 12:01:20 +02:00
Singh, Prasoon 71fe65b327 [Sysman] Replace normal pointers with smart pointers (10/n)
Replacing normal pointers by smart pointers in fan module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-04-05 11:05:44 +02:00
Bellekallu Rajkiran 310380119a feat(sysman): Support for registering events during listen
Related-To: LOCI-3682

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-04-04 07:07:19 +02:00
Singh, Prasoon 42a422d9f4 [Sysman] Replace normal pointers with smart pointers (8/n)
Replacing normal pointers by smart pointers in fabric_port module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-04-03 09:34:52 +02:00
Mayank Raghuwanshi d0fd7c1667 Improve Sysman windows ult coverage
Related-To: LOCI-4141

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-03-31 15:00:15 +02:00
Mayank Raghuwanshi 3816b85fa0 Add check for memory type before calculating ras hbm errors
Related-To: LOCI-3500

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-03-31 13:47:41 +02:00
Mateusz Hoppe bfb59cc573 fix: do not resume threads that have pending THREAD_STOPPED events
- threads with pending events cannot be resumed by resume(all) as
they were not yet discovered by debugger as stopped.
- set reportAsStopped when THREAD_STOPPED event is read
- when resuming, only resume a thread when it was reported

Related-To: NEO-7817

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-03-31 12:42:25 +02:00
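The rule in the commit above (resume a thread only after its THREAD_STOPPED event has been read) can be sketched as a small state check. The struct, field, and function names below are hypothetical and only mirror the wording of the commit message; the actual debug session code is not shown in this log.

```cpp
#include <vector>

// Hypothetical per-thread bookkeeping, named after the commit message.
struct ThreadState {
    bool stopped = false;           // thread is halted on the device
    bool reportedAsStopped = false; // its THREAD_STOPPED event was read
};

// Called when the debugger reads the THREAD_STOPPED event for a thread.
void onThreadStoppedEventRead(ThreadState &thread) {
    thread.reportedAsStopped = true;
}

// resume(all): resumes only threads whose stop was already reported.
// Threads with a pending, unread event stay stopped. Returns the number
// of threads resumed.
int resumeAll(std::vector<ThreadState> &threads) {
    int resumed = 0;
    for (auto &thread : threads) {
        if (thread.stopped && thread.reportedAsStopped) {
            thread.stopped = false;
            thread.reportedAsStopped = false;
            ++resumed;
        }
    }
    return resumed;
}
```

With two stopped threads of which only one has had its event read, `resumeAll` resumes one thread and leaves the other stopped until its event is consumed.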
Singh, Prasoon 1a6f03c36e [Sysman] Replace normal pointers with smart pointers (4/n).
Replacing normal pointers by smart pointers in temperature module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-03-30 13:07:02 +02:00
Mayank Raghuwanshi 065232eac7 Add support for ras l3 fabric errors
Related-To: LOCI-3966

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-03-30 12:47:45 +02:00
Singh, Prasoon ac929eaf61 (Sysman):Changes made in standby API.
Replacing normal pointers by smart pointers in standby module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-03-30 06:45:00 +02:00
Singh, Prasoon 49827b7122 (Sysman):Changes made in engine API.
Replacing normal pointers by smart pointers in engine module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-03-30 06:41:26 +02:00
Bari, Pratik b161af5886 Improve coverage in Fs Access
- Added ULTS to improve the code coverage in fs_access.
- Used the shared functions in the ULTs.
- Added the function for pwrite in the shared code.
Related-To: LOCI-2117

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-03-30 04:21:18 +02:00
Singh, Prasoon fa2bb2509b [Sysman] Replace normal pointers with smart pointers (6/n)
Replacing normal pointers by smart pointers in scheduler module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-03-29 16:01:08 +02:00
Singh, Prasoon 4475e2228d [Sysman] Replace normal pointers with smart pointers (7/n)
Replacing normal pointers by smart pointers in memory module of L0 sysman.

Related-To: LOCI-2810

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-03-29 10:59:40 +02:00
Yates, Brandon 0f6140deb0 fix(l0debug): windows topology map should always be created
Topology map was only being created when ZET_ENABLE_PROGRAM_DEBUGGING was
set.  This was not correct.  Now it is unconditionally created at init,
and debug attach will fail if it is not valid.

Related-to: LOCI-3937

Signed-off-by: Yates, Brandon <brandon.yates@intel.com>
2023-03-27 16:24:57 +02:00
Bari, Pratik fb82af429b Improve ULT function coverage in Fan Component
As the fan component is not supported on Linux, a fan handle is created
and used in the ULTs, which improves the coverage for the Fan
component.
Related-To: LOCI-2113

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-03-27 10:06:32 +02:00
Bari, Pratik e813ae0f4a Added changes for maxBandwidth computation in Memory Component.
The memoryBusWidth used for the computation of maxBandwidth has been
changed to 128 bytes from 128 bits. Hence the final maxBandwidth need
not be further divided by 8 to convert it into bytes/sec.

Related-To: LOCI-3833

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-03-23 20:20:59 +01:00
Kulkarni, Ashwin Kumar 9d866dac9e Support maxBandwidth in zesDevicePciGetState for windows
Fix to update zes_pci_speed_t.maxBandwidth

Related-To: LOCI-4056

Signed-off-by: Kulkarni, Ashwin Kumar <ashwin.kumar.kulkarni@intel.com>
2023-03-20 06:12:44 +01:00
Philip Meyer 2606788250 Sysman - Modify Consumption of KMD Energy Counter Timestamp
Signed-off-by: Philip Meyer <philip.meyer@intel.com>

Resolves: HSD-14018885283
2023-03-20 06:05:24 +01:00
Mateusz Jablonski 659cacf2c9 refactor l0 cmake: reduce include directories
Related-To: NEO-7507
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-17 13:41:55 +01:00
Mateusz Jablonski a7830eb478 refactor l0 cmake: add CMakeLists.txt files to solution
Related-To: NEO-7507
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-17 12:09:03 +01:00
Mateusz Jablonski cb7437b6b7 refactor l0 tools: cleanup cmake files
append sources in current directory

Related-To: NEO-7507
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-16 14:05:14 +01:00
Filip Hazubski 0025cb25f4 test: Correct logic of disabling slices in gtSystemInfo
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2023-03-16 09:13:47 +01:00
Mateusz Hoppe f2275caf3a performance: L0 debug - improve single-step time
- check threads with ATT only when more than one thread is resumed
- if single thread is resumed - check only that thread
- delay thread state check - allowing thread to reenter SIP

Related-To: NEO-7750

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-03-15 09:48:35 +01:00
Bari, Pratik adbda5906a Added support for Power APIs
- Added support for the Power APIs in the new sysman design.
- Added ULTs for the Power APIs in the new sysman design.
- Added change to measure the average power in the black box test for
Power.

Related-To: LOCI-3891

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-03-15 08:23:45 +01:00
Joshua Santosh Ranjan e2177f44f3 [Metrics]Fix OA buffer size request for streamer
This patch uses maximum OA buffer size for the
platform when opening an io stream.

Related-To: LOCI-4103

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2023-03-14 11:10:11 +01:00
Kamil Kopryk fa8579602f refactor: rename product helper files n/n
Related-To: NEO-7703
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-03-10 13:24:38 +01:00
Mateusz Jablonski fbe01e534b fix l0 debug: correct UNRECOVERABLE_IF condition
correct debug API tests to respect more than 8 threads per eu

Related-To: NEO-7442

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-10 12:54:31 +01:00
Yates, Brandon 11bec57d10 fix(L0 debug): Fix singleThread conversion for some hw configs
Configs with low slices disabled produced wrong EuThread vector

Signed-off-by: Yates, Brandon <brandon.yates@intel.com>
2023-03-10 10:30:49 +01:00
Jitendra Sharma 266a495837 [Sysman]: Add support for Fabric APIs
Add support for Fabric APIs in new sysman design.
This design is independent of level zero core.

Related-To: LOCI-4091
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2023-03-10 07:31:21 +01:00
Bellekallu Rajkiran f0c7eef667 Sysman: Fix and refactor event implementation
Remove unnecessary prelim implementation for events.
Modify uevent property name to check memory health
events.

Related-To: LOCI-4089

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-03-09 09:34:11 +01:00
Mateusz Hoppe caecbb12f0 fix: Add ATT scan after interrupt in handling ATT event
- make sure to capture all threads after interrupt, also those that
could be lagging to enter SIP

Related-To: NEO-7776

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-03-07 11:53:25 +01:00
Zbigniew Zdanowicz 49def723b7 Unify layout of command list class
Related-To: NEO-5055

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-03-07 11:38:47 +01:00
Bellekallu Rajkiran 2282f26734 feature(sysman): Support events for multiple devices
Related-To: LOCI-3683

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-03-07 09:50:32 +01:00
Bari, Pratik 9acba4ea2a Added changes in the Temperature Component
- The 'paramInfo' for the sensor type ZES_TEMP_SENSORS_MEMORY has been
corrected from TemperatureDomainPackage to TemperatureDomainHBM.
- The ULTs for temperature have been enabled for windows.

Related-To: LOCI-3149

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-03-07 09:47:34 +01:00
Mateusz Jablonski 5b2e1f619c fix hw config linux: set slice info based on slice count
Related-To: NEO-7622
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-03-06 15:47:40 +01:00
Mateusz Hoppe 802848a43f fix: L0Debug - allow access only for reported stopped threads
- read/write registers/memory only allowed for threads reported as
stopped by events
- threads newly stopped, accidentally, that are resumed immediately
are not allowed register/memory access

Related-To: NEO-7776

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-03-03 13:34:57 +01:00
Bari, Pratik 5d04c15297 Added alternate way to initialize sysman
A new environment variable ZELLO_SYSMAN_USE_ZESINIT is used to
distinguish the sysman initialization with zeInit and zesInit.
For sysman initialization using zeInit, ZES_ENABLE_SYSMAN is set
if it is not set initially.

Related-To: LOCI-3949

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-03-02 09:08:29 +01:00
Mateusz Jablonski 5af9509877 Correct scope of WA for pipecontrol before nonpipelined state
- for DG2 platforms it is valid only for G10/G11/G12
- for MTL platforms it is valid only for 12.70.0 and 12.71.0

Additionally:
- setup default hw ip version for each platform
- merge dg2 specific product helper tests to single file

Related-To: HSD-14015808183, HSD-14015812625, HSD-14016015202
Related-To: HSD-14015812559, HSD-14015816823
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-02-24 09:27:12 +01:00
Mateusz Hoppe 6947220298 perf(debugger): Optimize registers access during debugging
The register read/write implementation was unnecessarily checking magic
values, which took significant time. Removing those checks speeds up
register access by a factor of 3.

Resolves: NEO-7754

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-02-23 12:14:03 +01:00
Mateusz Hoppe e8b4841577 perf(debugger): Optimize single-stepping of eu threads
Single-stepping threads is based on resume() and THREAD_STOPPED events.
To speed up single-stepping, stopped threads must be discovered
immediately after resume() is called.
This change adds an ATT scan and event generation in resume().

Related-To: NEO-7750

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-02-23 11:06:13 +01:00
Mayank Raghuwanshi 29ed6ea077 Add support for l3_bank and subslice RAS errors
Related-To: LOCI-3662

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-02-22 09:14:15 +01:00
Bellekallu Rajkiran ac5d719e25 Fix (sysman): Modify device wedged status retrieval mechanism
Related-To: LOCI-3551

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-02-20 23:21:24 +01:00
Mayank Raghuwanshi eea0ff8e16 Fix clang tidy issues in fabric ults
Related-To: LOCI-4054

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-02-20 11:59:12 +01:00
Ranjan, Joshua Santhosh 8ae2e3dac9 Add per tile metric data support for stall sampling
Related-To: LOCI-3869

Signed-off-by: Ranjan, Joshua Santhosh <joshua.santosh.ranjan@intel.com>
2023-02-19 12:16:13 +01:00
Warchulski, Jaroslaw b485c025d0 Cleanup includes 57
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-02-17 11:19:59 +01:00
Warchulski, Jaroslaw 0556d543a3 Cleanup includes 56
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-02-16 14:42:44 +01:00
Warchulski, Jaroslaw 89ddfbb83c Cleanup includes 54
Cleaned up files:
level_zero/tools/test/unit_tests/sources/sysman/windows/mock_sysman_fixture.h
shared/test/common/os_interface/windows/wddm_fixture.h
shared/test/unit_test/command_stream/compute_mode_tests.h
shared/test/unit_test/encoders/walker_partition_fixture_xehp_and_later.h
shared/test/unit_test/fixtures/preemption_fixture.h
shared/test/unit_test/helpers/blit_commands_helper_tests.inl
shared/test/unit_test/helpers/simd_helper_tests.inl

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-02-15 13:59:56 +01:00
Warchulski, Jaroslaw 8c17313c8b Cleanup includes 53
Cleaned up files:
opencl/source/mem_obj/image.inl
shared/offline_compiler/source/decoder/zebin_manipulator.h
shared/source/aub_mem_dump/aub_alloc_dump.h
shared/source/compiler_interface/intermediate_representations.h
shared/source/helpers/blit_commands_helper_base.inl
shared/source/utilities/debug_file_reader.h
shared/source/utilities/software_tags.h
shared/source/xe_hpc_core/hw_cmds_pvc.h

Related-To: NEO-5548

Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-02-15 13:21:14 +01:00
Bellekallu Rajkiran 1c63753ebe feature(sysman): Support for reset event upon repair
Related-To: LOCI-3720

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-02-14 08:12:22 +01:00
Warchulski, Jaroslaw 5ec9de90ee Cleanup includes 52
Cleaned up files:
level_zero/core/source/driver/driver.h
level_zero/tools/source/sysman/fabric_port/windows/os_fabric_port_imp.h
level_zero/tools/source/sysman/pci/os_pci.h
shared/source/debug_settings/debug_settings_manager.h
shared/source/gmm_helper/page_table_mngr.h
shared/source/gmm_helper/windows/gmm_memory_base.h
shared/source/kernel/kernel_arg_metadata.h
shared/test/common/libult/linux/drm_mock.h
shared/test/unit_test/fixtures/command_container_fixture.h
shared/test/unit_test/fixtures/product_config_fixture.h
shared/test/unit_test/helpers/simd_helper_tests_pvc_and_later.inl
shared/test/unit_test/os_interface/hw_info_config_tests.h

Related-To: NEO-5548

Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-02-13 11:39:34 +01:00
Lukasz Jobczyk e6181d7933 Reuse builtin module and init selected builtin on device init
-start async thread at device initialization which initializes selected
builtins and exits
-share module across builtins using same binary

Resolves: NEO-7644

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-02-13 10:55:57 +01:00
Mayank Raghuwanshi 07d3353b1f Add support for sysman zesFabricPortGetFabricErrorCounters API
Related-To: LOCI-3398

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-02-13 06:50:23 +01:00
Warchulski, Jaroslaw b224ec947e Cleanup includes 50
Cleaned up files:
shared/source/helpers/hw_info.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-02-10 20:26:13 +01:00
Compute-Runtime-Validation 526297fa8f Revert "Reuse builtin module and init selected builtin on device init"
This reverts commit 3be46ef645.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-02-09 15:40:38 +01:00
Lukasz Jobczyk 3be46ef645 Reuse builtin module and init selected builtin on device init
-start async thread at device initialization which initializes selected
builtins and exits
-share module across builtins using same binary

Resolves: NEO-7644

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-02-08 17:04:42 +01:00
Mayank Raghuwanshi 5a833e2c08 Add support for RAS CSC HW errors
Related-To: LOCI-3699

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-02-03 18:36:12 +01:00
Lukasz Jobczyk 43ff955199 Revert "Reuse builtin module and init selected builtin on device init"
This reverts commit bdc3499585b954c674b24c5bf21aeefd68f6fee2.

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-02-03 13:53:59 +01:00
Mateusz Jablonski 24c5352350 refactor: remove redundant including of compiler_cache.h
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-02-03 11:16:31 +01:00
Bellekallu Rajkiran 47486ca55a Add support for memory health information
Get Memory health information from igsc.

Related-To: LOCI-3395

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-02-02 21:50:36 +01:00
Kamil Kopryk 524d945e8c refactor: rename l0 hw_helpers dir to gfx_core_helpers
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-02-02 11:15:56 +01:00
Kamil Kopryk 2484c7ceb2 refactor: rename hw_helper files to gfx_core_helper files
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-02-01 19:37:51 +01:00
Yates, Brandon f53d9103ad Fix(L0Debug): Fix thread creation on dg2 128EU
Not enough EUThread objects were being created,
resulting in a crash during breakpoint
processing

Related-to: LOCI-3937

Signed-off-by: Yates, Brandon <brandon.yates@intel.com>
2023-01-31 19:48:50 +01:00
Lukasz Jobczyk d4fe981fa9 Reuse builtin module and init selected builtin on device init
-start async thread at device initialization which initializes selected
builtins and exits
-share module across builtins using same binary

Resolves: NEO-7644

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-01-31 09:57:31 +01:00
Compute-Runtime-Validation c726d3b3fe Revert "Reuse builtin module and init selected builtin on device init"
This reverts commit 2845414845.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-01-31 05:50:36 +01:00
Lukasz Jobczyk 2845414845 Reuse builtin module and init selected builtin on device init
-start async thread at device initialization which initializes selected
builtins and exits
-share module across builtins using same binary

Resolves: NEO-7644

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-01-30 20:40:18 +01:00
Mayank Raghuwanshi d7292a6b0d Add Support for sysman Performance factor on windows
Related-To: LOCI-3342

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-01-30 15:55:07 +01:00
Kamil Kopryk d2628babbc refactor: don't use global ProductHelper getter 20/n
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-30 09:29:18 +01:00
Matias Cabral 2f09a07445 Increase min version of SIP with SLM support
Resolves: NEO-7506

Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>
2023-01-27 19:06:58 +01:00
shubham kumar df9069c7bb Fix related to windows TDR issue
Related-To: LOCI-3540

Signed-off-by: shubham kumar <shubham.kumar@intel.com>
2023-01-27 06:27:58 +01:00
Yates, Brandon b3eb1561f8 feat(L0debug): Enable MTL
Turn on the runtime capability flag for MTL debug support

Related-to: NEO-7659
Signed-off-by: Yates, Brandon <brandon.yates@intel.com>
2023-01-26 20:42:29 +01:00
Bari, Pratik fb8c844e00 Removal of productFamily condition checks in the ULTs
The condition checks which included the IGFX_PVC and IGFX_XE_HP_SDV
flags have been removed and the necessary changes have been made in the
test cases.

Related-To: LOCI-3801

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-01-26 08:09:45 +01:00
Warchulski, Jaroslaw e21cf516d3 Cleanup includes 44
Cleaned up files:
level_zero/tools/source/sysman/windows/os_sysman_imp.h
opencl/source/command_queue/command_queue.h
shared/source/command_container/cmdcontainer.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-25 16:05:22 +01:00
Warchulski, Jaroslaw c43233dabf Cleanup includes 42
Cleaned up files:
level_zero/core/source/kernel/kernel_hw.h
shared/source/helpers/common_types.h
shared/test/common/libult/linux/drm_mock.h
shared/test/common/libult/ult_command_stream_receiver.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-25 09:16:39 +01:00