Commit Graph

230 Commits

Author SHA1 Message Date
Joshua Santosh Ranjan
6292a3304a Fix Metric Activation Procedure
This patch implements Deactivate previously activated metric groups,
if they are not part of the current list

Related-To: LOCI-3160

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-07-08 22:18:42 +02:00
Mateusz Hoppe
5956aea18d Limit header includes from level_zero device.h
- remove including debugger_l0.h from device.h
- add getL0Debugger() to shared NEO Device

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-07-06 16:41:17 +02:00
Joshua Santosh Ranjan
e8494abbe8 Add support for ze_device_memory_ext_properties_t
Related-To: LOCI-3099

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-07-06 08:18:22 +02:00
Lukasz Jobczyk
bcdeeb22b5 Add forward declaration on EngineGroupT
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-07-05 13:05:04 +02:00
Lukasz Jobczyk
0d2d41691c Bind BCS2-8 statically
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2022-07-05 08:52:52 +02:00
Jaime Arteaga
5cd2e098d6 Add ze_eu_count_t to get total number of EUs
Related-To: LOCI-2667

Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-07-01 23:37:09 +02:00
Mateusz Hoppe
673bf3b553 Move DebuggerL0 to shared
Related-To: NEO-7075

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-06-23 10:05:45 +02:00
Compute-Runtime-Validation
c45ddab1c3 Revert "Move DebuggerL0 to shared"
This reverts commit 7eba4db2e7.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-06-22 04:38:32 +02:00
Mateusz Hoppe
7eba4db2e7 Move DebuggerL0 to shared
Related-To: NEO-7075

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-06-21 17:10:46 +02:00
Katarzyna Cencelewska
c58657378e Fix set engine index by flag NodeOrdinal for l0
Related-To: LOCI-3154

Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-06-10 12:04:16 +02:00
Katarzyna Cencelewska
461a9c3fdf Set proper desc.index when flag NodeOrdinal
Resolves: LOCI-3154

Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-06-06 23:09:13 +02:00
Mateusz Jablonski
5bf1c84ed3 Cleanup includes in sampler, image, builtin and event
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-06-03 10:21:59 +02:00
Mateusz Jablonski
747c2d3d83 Cleanup includes in cmdlist, cmdqueue, context, device, driver and module
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-06-03 02:54:41 +02:00
Mateusz Jablonski
2a4c68dc38 Remove not needed dependencies from device_imp.h
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2022-06-02 18:35:34 +02:00
Katarzyna Cencelewska
8ff34fea29 Fix typo
-engineGroupTyp to engineGroupType
-remove gap between commits to use same desc.ordinal in createCommandQueue

Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-05-24 12:15:12 +02:00
Katarzyna Cencelewska
637767500d Add implementation for flag NodeOrdinal to l0
create method adjustCommandQueueDesc

Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-05-24 11:10:59 +02:00
Pawel Wilma
d9858bf206 Add debug variable to override device name
This commit introduces debug variable to override device name reported
by CL_DEVICE_NAME property in OpenCL and ze_device_properties_t.name in
level_zero

Signed-off-by: Pawel Wilma <pawel.wilma@intel.com>
2022-05-19 15:16:13 +02:00
Krystian Chmielewski
c59abbe495 Add option for extending cmdlist
Signed-off-by: Krystian Chmielewski <krystian.chmielewski@intel.com>
2022-05-18 14:49:50 +02:00
Katarzyna Cencelewska
96e1eb7467 Move variables baseDieRev and baseDieA0Masked from xe_hpc to pvc
Pvc specific variables should be located in pvc struct

Related-To: NEO-6738
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2022-05-17 12:19:16 +02:00
Maciej Plewka
b0d84f88d7 Return correct device name in L0
With this commit L0 driver will report the same device name as OpenCL
driver

Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2022-05-12 10:53:33 +02:00
Artur Harasimiuk
e9be9b64c6 clang-tidy configuration cleanup
Define single .clang-tidy configuration with all used checks and use
NOLINT to selectively silence tool. That way cleanup should be easier.
third_part/ has its own configuration that disables clang-tidy for this
folder.

Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2022-05-11 14:02:04 +02:00
Jaime Arteaga
3c3dab8fe0 Expose copy engines in parent device with implicit scaling
When using implicit scaling, expose the copy engines from
sub-device 0 in the root device. This to facilitate
programming models of layers above.

Related-To: NEO-6815

Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-04-14 00:26:14 +02:00
Brandon Yates
d70b1a2e2a Filter L0 Debugger support by platfom
Related-To: NEO-6678
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2022-04-13 13:03:40 +02:00
Patryk Wrobel
19dded25ef Detect GPU hang in remaining calls of command queue and list
This change introduces checking of waits status in
CommandQueue and CommandList classes.

Related-To: NEO-6681
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
2022-04-11 13:31:44 +02:00
Jaime Arteaga
8b88f920ac Use Internal Copy engine from Sub-device for Shared Migrations
This when we have implicit scaling, so fallback to the sub-device
for BCS migration support.

Related-To: LOCI-3014

Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-04-11 09:51:55 +02:00
Zbigniew Zdanowicz
9858438121 Limit multiple partition count to compute command lists
Related-To: NEO-6811

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-03-29 07:29:08 +02:00
Compute-Runtime-Validation
90c6d7d9c9 Revert "Expose copy engines in parent device with implicit scaling"
This reverts commit 6154fb8ea7.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-28 22:06:10 +02:00
Jaime Arteaga
6154fb8ea7 Expose copy engines in parent device with implicit scaling
When using implicit scaling, expose the copy engines from
sub-device 0 in the root device. This to facilitate
programming models of layers above.

Related-To: NEO-6815

Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-03-28 16:48:04 +02:00
Baj, Tomasz
1f7fd45251 Move device UUID from L0 to shared
Related-To: NEO-5681

Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2022-03-23 12:52:24 +01:00
Jitendra Sharma
f52f3df274 Add platform specific getter of debug surface size
For different platforms based on number of available threads
and debug surface layout, calculate max debug surface size.

Related-To: NEO-6676
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2022-03-22 12:18:40 +01:00
Joshua Santosh Ranjan
05a150f49f L0 Core Add Support For pci_speed_ext
This patch adds support for reading PCI bandwidth, generation
and linkwidth information from sysfs nodes for the linux
platform.

Related-To: LOCI-2969

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-03-16 22:47:59 +01:00
Jaime Arteaga
5038cf6cfc Refactor getMemoryAccessProperties()
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-03-14 17:15:21 +01:00
Lisanna Dettwyler
e6e6ae2d2b Don't use low-priority CSR for copy-only cmdq
getCsrForLowPriority will always return a CSR with a compute engine
regardless of the ordinal used. When a copy-only ordinal is used to
create a command queue or command list, batch buffers are written using
BCS style commands rather than CCS style. This caused BCS commands to be
submitted to a low-priority compute engine which was invalid. This patch
ensures a copy engine is selected when a copy-only ordinal is used even
if ZE_COMMAND_QUEUE_PRIORITY_PRIORITY_LOW is specified.

Related-To: LOCI-2980

Signed-off-by: Lisanna Dettwyler <lisanna.dettwyler@intel.com>
2022-03-07 21:38:12 +01:00
Filip Hazubski
80b520bc9b Change ThreadArbitrationPolicy enum type to int32_t
Change ThreadArbitrationPolicy::NotPresent value to -1
Update initial values to ThreadArbitrationPolicy::NotPresent

Related-To: NEO-6728

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2022-03-07 20:04:24 +01:00
Compute-Runtime-Validation
d06bbd07e5 Revert "Add ze_eu_count_t to get total number of EUs"
This reverts commit 30ea8ea48e.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-03-04 06:55:17 +01:00
Jaime Arteaga
30ea8ea48e Add ze_eu_count_t to get total number of EUs
Related-To: LOCI-2667

Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-03-03 01:30:59 +01:00
Joshua Santosh Ranjan
10f98b45db Metrics Add Os specific implementation Structure for IP Sampling
This patch adds OS specific implementation for IP Sampling.
Implementation for linux is provided as part of this patch.

Related-To: LOCI-2787

--- master-files
level_zero/tools/source/metrics/linux/os_metric_ip_sampling_imp_linux.cpp
level_zero/tools/source/metrics/os_metric_ip_sampling.h
level_zero/tools/source/metrics/windows/os_metric_ip_sampling_imp_windows.cpp
level_zero/tools/test/unit_tests/sources/metrics/linux/test_metric_ip_sampling_linux_prelim.cpp
level_zero/tools/test/unit_tests/sources/metrics/linux/test_metric_ip_sampling_linux_upstream.cpp
level_zero/tools/test/unit_tests/sources/metrics/windows/test_metric_ip_sampling_windows.cpp
--- master-files

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-18 23:15:44 +01:00
Compute-Runtime-Validation
83025da96f Revert "Add ze_eu_count_t to get total number of EUs"
This reverts commit 24c18beada.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-02-17 09:42:22 +01:00
Jaime Arteaga
24c18beada Add ze_eu_count_t to get total number of EUs
Related-To: LOCI-2667

Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-02-16 17:25:26 +01:00
Bartosz Dunajski
4b0d986876 Move AllocationType enum out of GraphicsAllocation class
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2022-02-04 17:49:09 +01:00
Joshua Santosh Ranjan
f1c50a8c89 Metric Refactor Metric Context
Rename Metric Context and move OA specific functions
and members from Metric Context to OA specific class(MetricSource).
This refactoring is done so that additional Metric Source
like Stall Sampling could be Seamlessly implemented.

Related-To: LOCI-2753


Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2022-02-03 21:34:14 +01:00
Compute-Runtime-Validation
900e63102c Revert "Add ze_eu_count_t to get total number of EUs"
This reverts commit efdfdeba7f.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-02-02 02:59:25 +01:00
Jaime Arteaga
4aef9925b0 Add support for zeDeviceGetP2PProperties
Related-to: LOCI-2784

Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-02-01 17:06:43 +01:00
Mateusz Hoppe
4d8d2412f9 Add subslice ID remapping for single slice configs
Related-To: NEO-6565

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2022-02-01 16:38:57 +01:00
Jaime Arteaga
efdfdeba7f Add ze_eu_count_t to get total number of EUs
Related-To: LOCI-2667

Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-02-01 00:19:08 +01:00
Zbigniew Zdanowicz
8aaa927869 Return default context for multi-tile device in low-priority queue
Related-To: NEO-6589

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-01-20 14:48:11 +01:00
Zbigniew Zdanowicz
4238679078 Refactor implicit scaling device support
Related-To: NEO-6589

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2022-01-18 13:08:43 +01:00
Compute-Runtime-Validation
b249c10e09 Revert "Add ze_eu_count_t to get total number of EUs"
This reverts commit 635c02e1ff.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2022-01-10 20:06:52 +01:00
Jaime Arteaga
635c02e1ff Add ze_eu_count_t to get total number of EUs
Related-To: LOCI-2667

Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2022-01-06 17:38:56 +01:00
Young Jin Yoon
32ccf30c61 Add support for pci_ext_properties_t
Added API definition in ze_device.cpp, and added function declaration of
getPciProperties() in device.h and device_imp.cpp
Initially returns -1 for all values of ze_pci_speed_ext_t for now, simply
because we do not have function to retrieve the information of the PCI
speed.

Related-To: LOCI-2669
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2022-01-05 10:43:54 +01:00