Commit Graph

407 Commits

Author SHA1 Message Date
Bellekallu Rajkiran
2282f26734 feature(sysman): Support events for multiple devices
Related-To: LOCI-3683

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-03-07 09:50:32 +01:00
Bari, Pratik
9acba4ea2a Added changes in the Temperature Component
- The 'paramInfo' for the sensor type ZES_TEMP_SENSORS_MEMORY has been
corrected from TemperatureDomainPackage to TemperatureDomainHBM.
- The ULTs for temperature have been enabled for windows.

Related-To: LOCI-3149

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-03-07 09:47:34 +01:00
Compute-Runtime-Validation
8bce514403 Revert "[Sysman]: Remove not used functions in fsAccess"
This reverts commit d47c1de7aa.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-03-02 10:27:05 +01:00
Jitendra Sharma
d47c1de7aa [Sysman]: Remove not used functions in fsAccess
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2023-03-02 07:49:34 +01:00
Devarinti, Puneeth Kumar Reddy
535debff2d Debug: Add debug logs for temperature module
Related-To: LOCI-3881

Signed-off-by: Devarinti, Puneeth Kumar Reddy <puneeth.kumar.reddy.devarinti@intel.com>
2023-03-01 22:32:00 +01:00
Devarinti, Puneeth Kumar Reddy
996d73b768 Debug: Add debug logs for firmware module
Related-To: LOCI-3877

Signed-off-by: Devarinti, Puneeth Kumar Reddy <puneeth.kumar.reddy.devarinti@intel.com>
2023-03-01 22:27:39 +01:00
Devarinti, Puneeth Kumar Reddy
cc3245378c Debug: Add debug logs for power module
Related-To: LOCI-3879

Signed-off-by: Devarinti, Puneeth Kumar Reddy <puneeth.kumar.reddy.devarinti@intel.com>
2023-02-27 04:43:35 +01:00
Devarinti, Puneeth Kumar Reddy
5fbb10df2d Debug: Add debug logs for diagnostics module
Related-To: LOCI-3882

Signed-off-by: Devarinti, Puneeth Kumar Reddy <puneeth.kumar.reddy.devarinti@intel.com>
2023-02-27 04:40:11 +01:00
Mayank Raghuwanshi
29ed6ea077 Add support l3_bank and subslice Ras errors
Related-To: LOCI-3662

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-02-22 09:14:15 +01:00
Bellekallu Rajkiran
ac5d719e25 Fix (sysman): Modify device wedged status retrieval mechanism
Related-To: LOCI-3551

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-02-20 23:21:24 +01:00
Warchulski, Jaroslaw
b485c025d0 Cleanup includes 57
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-02-17 11:19:59 +01:00
Jitendra Sharma
0bc4294af4 Add support for Sysman init and sysman driver init
With this change we are adding support for initializing
Sysman and also providing support to get Sysman driver
handles and device handles.

Related-To: LOCI-3950

Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2023-02-16 08:10:39 +01:00
Bellekallu Rajkiran
1c63753ebe feature(sysman): Support for reset event upon repair
Related-To: LOCI-3720

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-02-14 08:12:22 +01:00
Warchulski, Jaroslaw
5ec9de90ee Cleanup includes 52
Cleaned up files:
level_zero/core/source/driver/driver.h
level_zero/tools/source/sysman/fabric_port/windows/os_fabric_port_imp.h
level_zero/tools/source/sysman/pci/os_pci.h
shared/source/debug_settings/debug_settings_manager.h
shared/source/gmm_helper/page_table_mngr.h
shared/source/gmm_helper/windows/gmm_memory_base.h
shared/source/kernel/kernel_arg_metadata.h
shared/test/common/libult/linux/drm_mock.h
shared/test/unit_test/fixtures/command_container_fixture.h
shared/test/unit_test/fixtures/product_config_fixture.h
shared/test/unit_test/helpers/simd_helper_tests_pvc_and_later.inl
shared/test/unit_test/os_interface/hw_info_config_tests.h

Related-To: NEO-5548

Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-02-13 11:39:34 +01:00
Mayank Raghuwanshi
07d3353b1f Add support for sysman zesFabricPortGetFabricErrorCounters API
Related-To: LOCI-3398

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-02-13 06:50:23 +01:00
Warchulski, Jaroslaw
64f735481d Cleanup includes 48
Cleaned up files:
shared/source/command_container/command_encoder.inl
shared/source/os_interface/hw_info_config.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-02-10 17:23:02 +01:00
Warchulski, Jaroslaw
a2e6a8284b Cleanup includes 47
Cleaned up files:
level_zero/tools/source/debug/windows/debug_session.h
level_zero/tools/source/sysman/memory/windows/os_memory_imp.h
level_zero/tools/source/sysman/windows/kmd_sys_manager.h
opencl/test/unit_test/aub_tests/command_stream/copy_engine_aub_tests_xehp_and
shared/source/command_container/command_encoder.inl
shared/source/command_stream/command_stream_receiver_hw_xehp_and_later.inl
shared/source/helpers/blit_commands_helper_base.inl
shared/test/unit_test/image/image_surface_state_fixture.h
shared/test/unit_test/os_interface/windows/os_interface_win_tests.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-02-10 17:07:30 +01:00
Mayank Raghuwanshi
5a833e2c08 Add support for RAS CSC HW errors
Related-To: LOCI-3699

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-02-03 18:36:12 +01:00
Bellekallu Rajkiran
47486ca55a Add support for memory health information
Get Memory health information from igsc.

Related-To: LOCI-3395

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2023-02-02 21:50:36 +01:00
Kamil Kopryk
2484c7ceb2 refactor: rename hw_helper files to gfx_core_helper files
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-02-01 19:37:51 +01:00
Mayank Raghuwanshi
d7292a6b0d Add Support for sysman Performance factor on windows
Related-To: LOCI-3342

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-01-30 15:55:07 +01:00
Kamil Kopryk
d2628babbc refactor: don't use global ProductHelper getter 20/n
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-30 09:29:18 +01:00
shubham kumar
df9069c7bb Fix related to windows TDR issue
Related-To: LOCI-3540

Signed-off-by: shubham kumar <shubham.kumar@intel.com>
2023-01-27 06:27:58 +01:00
Warchulski, Jaroslaw
e21cf516d3 Cleanup includes 44
Cleaned up files:
level_zero/tools/source/sysman/windows/os_sysman_imp.h
opencl/source/command_queue/command_queue.h
shared/source/command_container/cmdcontainer.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-25 16:05:22 +01:00
Warchulski, Jaroslaw
bc13db734d Cleanup includes 41
Cleaned up files:
shared/source/command_stream/aub_command_stream_receiver_hw.h
shared/source/helpers/common_types.h
shared/source/os_interface/linux/drm_neo.h
shared/source/os_interface/windows/hw_device_id.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-24 14:37:36 +01:00
Warchulski, Jaroslaw
11764dd9bf Cleanup includes 40
Cleaned up files:
shared/source/os_interface/linux/drm_neo.h
shared/source/os_interface/windows/wddm/um_km_data_translator.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-23 16:19:35 +01:00
shubham kumar
01eb8daea9 Fix to use enum CurrentTotalAllocableMem for size in zesmemoryGetState.
Related-To: LOCI-3532

Signed-off-by: shubham kumar <shubham.kumar@intel.com>
2023-01-23 02:20:29 +01:00
Warchulski, Jaroslaw
286c672ef4 Cleanup includes 37
Cleaned up files:
level_zero/core/source/event/event.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-20 12:34:39 +01:00
Mayank Raghuwanshi
52593fd4f4 Update mechanism for retrieving timestamp for Sysman memory
Related-To: LOCI-3679

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2023-01-17 00:15:44 +01:00
Warchulski, Jaroslaw
c275008e51 Cleanup includes 32
Cleaned up files:
level_zero/core/source/cmdlist/cmdlist_hw.h
level_zero/core/source/cmdqueue/cmdqueue.h
level_zero/core/source/event/event.h
opencl/source/helpers/get_info_status_mapper.h
opencl/source/helpers/hardware_commands_helper.h
shared/source/helpers/per_thread_data.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-16 20:41:37 +01:00
Mateusz Jablonski
91a9b925f7 fix: Append device id to ambigous device names
Related-To: NEO-7537
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-01-13 13:31:18 +01:00
Warchulski, Jaroslaw
bd81b5546d Cleanup includes 25
Cleaned up files:
level_zero/core/source/device/device.h
opencl/source/helpers/dispatch_info.h
shared/source/os_interface/os_interface.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-10 12:54:45 +01:00
Mateusz Jablonski
43b790957d style: format code using clang-format 15.0.6
Related-To: NEO-7500
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-01-05 10:33:47 +01:00
Warchulski, Jaroslaw
0eac749fcc Cleanup includes 19
Cleaned up files:
opencl/source/api/cl_types.h
shared/source/compiler_interface/external_functions.h
shared/source/compiler_interface/linker.h
shared/source/device_binary_format/elf/elf.h
shared/source/helpers/preamble.h
shared/source/memory_manager/definitions/storage_info.h
shared/source/memory_manager/memory_manager.h
shared/source/memory_manager/os_agnostic_memory_manager.h
shared/source/program/program_info.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-03 11:02:05 +01:00
Singh, Prasoon
93ca711d23 Adding debug logs in standby module
Related-To: LOCI-3471

Signed-off-by: Singh, Prasoon prasoon.singh@intel.com
2023-01-02 19:47:28 +01:00
Singh, Prasoon
6254dd057d Enabled debug logs in frequency module
Related-To: LOCI-3471

Signed-off-by: Singh, Prasoon <prasoon.singh@intel.com>
2023-01-02 19:47:13 +01:00
Mayank Raghuwanshi
9968857c29 Change category for some sysman ras errors
Related-To: LOCI-3648

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-12-23 18:43:41 +01:00
Warchulski, Jaroslaw
f275eea6ec Cleanup includes 14
Cleaned up files:
shared/source/device/device.h

Related-To: NEO-5548

Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2022-12-23 10:46:34 +01:00
Bellekallu Rajkiran
8d39ab7bcd Diagnostics: Remove duplicate code
gpuProcessCleanUp is defined in multile places,
Reuse common code for cleaning up all processes.

Related-To: LOCI-3746

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2022-12-20 09:58:29 +01:00
Puneeth Kumar Reddy D
893c3cdb53 Added debug logs for memory module
Related-To: LOCI-3473

Signed-off-by: Puneeth Kumar Reddy D <puneeth.kumar.reddy.devarinti@intel.com>
2022-12-19 18:22:26 +01:00
Puneeth Kumar Reddy D
d92bca4ebb Added debug logs for engine modules
Related-To: LOCI-3473

Signed-off-by: Puneeth Kumar Reddy D <puneeth.kumar.reddy.devarinti@intel.com>
2022-12-19 18:06:37 +01:00
Joshua Santosh Ranjan
522076cf82 Avoid adding subdevice flag if ReturnSubDevicesAsApiDevices is set
Related-To: LOCI-3656

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-12-19 05:15:51 +01:00
Mayank Raghuwanshi
5edbca1aa2 Use physical subdevice for sysman engine module
Related-To: LOCI-3231

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-12-16 07:11:57 +01:00
Kamil Kopryk
232b886056 Rename HwInfoConfig to ProductHelper
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-14 14:39:52 +01:00
Jitendra Sharma
391941c447 Sysman: Enhance Scheduler compute unit debug mode implementation
This change helps in achieving the following:
- Moves the OS specific code from scheduler_imp.cpp to os specific
files.
- Frees any drm resource, including level zero's before enabling/dis
-abling Debug mode. And once Debug mode is toggled, reinitialize of
level zero occurs.
- If current mode is Debug mode and any other mode is requested by user,
then new mode will be made effective by unsetting debug mode.

Related-To: LOCI-866

Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2022-12-12 17:58:28 +01:00
Kamil Kopryk
03b687881f Rename HwHelper -> GfxCoreHelper
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2022-12-09 10:29:06 +01:00
Warchulski, Jaroslaw
be647d42d9 Cleanup includes 12
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2022-12-07 13:14:15 +01:00
Jitendra Sharma
5baf75b9a8 Sysman: Redesign event API to effectively use uevents
Earlier implementation of sysman events API was based on file
creation in the filesystem. Whenever a uevent for some event
which needs to be monitored arrive, at that time a file was
created in the filesystem based on some preinstalled udev rules.
This approach was inefficient as it heavily depends over file
system and second with this approach losing events is always a
possibility.

Now with this change, we are removing our dependency over file
creation in filesystem. Rather we will be using libudev library
to monitor the uevents. This approach could also be extended,
when we want to listen to all the uevents for all the gpu
devices present in the system.

Related-To: LOCI-2140
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2022-12-07 07:29:57 +01:00
Warchulski, Jaroslaw
c10aa90815 Cleanup includes 11
Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2022-12-06 12:25:30 +01:00
Mayank Raghuwanshi
69e6c8b1c6 Add support for retrieving max b/w for DG2
Related-To: LOCI-3425

Signed-off-by: Mayank Raghuwanshi <mayank.raghuwanshi@intel.com>
2022-12-01 10:45:32 +01:00