246 Commits

Author SHA1 Message Date
Wenbin Lu
a0faad6558 feature: report multi-hop fabric connections
In addition to physical connections, we should also report multi-hop
logical connections (MDFI + XeLink), as they have positive bandwidth.

Use a modified BFS algorithm to try to find a path between fabric
vertices that are not directly connected, because the KMD always
tries to use the MDFI link first and then goes to XeLink.

Multi-hop connections are bi-directional but might not be symmetric, so
for every pair of vertices A & B that are not directly connected, we
need to try to find both `A -> B` and `B -> A`.

Related-To: GSD-7126

Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-03-06 06:36:48 +01:00
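
The multi-hop search described in commit a0faad6558 can be sketched roughly as follows. This is a minimal illustration and not the driver's code: `FabricGraph`, `Link`, `LinkType`, and `findPath` are hypothetical names, and the two-queue "MDFI first, then XeLink" ordering is only one assumed reading of what "modified BFS" means in the message. Links are stored as directed edges, so `A -> B` and `B -> A` have to be searched separately.

```cpp
#include <algorithm>
#include <cstddef>
#include <deque>
#include <vector>

// Hypothetical link kinds; the commit only names MDFI and XeLink.
enum class LinkType { mdfi, xeLink };

struct Link {
    std::size_t to;  // index of the destination vertex
    LinkType type;
};

// Directed adjacency list: graph[v] holds the outgoing links of vertex v.
using FabricGraph = std::vector<std::vector<Link>>;

// Returns the vertices on a path from src to dst (inclusive), or an empty
// vector when no path exists. Assumption: vertices reached over MDFI are
// expanded before vertices reached over XeLink.
std::vector<std::size_t> findPath(const FabricGraph &graph, std::size_t src, std::size_t dst) {
    const std::size_t n = graph.size();
    if (src >= n || dst >= n) {
        return {};
    }
    constexpr std::size_t none = static_cast<std::size_t>(-1);
    std::vector<std::size_t> parent(n, none);
    std::vector<bool> visited(n, false);
    std::deque<std::size_t> viaMdfi;
    std::deque<std::size_t> viaXeLink;

    visited[src] = true;
    viaMdfi.push_back(src);
    while (!viaMdfi.empty() || !viaXeLink.empty()) {
        // Prefer the MDFI frontier; fall back to XeLink only when it is empty.
        std::deque<std::size_t> &queue = !viaMdfi.empty() ? viaMdfi : viaXeLink;
        const std::size_t v = queue.front();
        queue.pop_front();
        if (v == dst) {
            break;
        }
        for (const Link &link : graph[v]) {
            if (link.to >= n || visited[link.to]) {
                continue;
            }
            visited[link.to] = true;
            parent[link.to] = v;
            (link.type == LinkType::mdfi ? viaMdfi : viaXeLink).push_back(link.to);
        }
    }
    if (!visited[dst]) {
        return {};
    }
    // Walk the parent chain back from dst, then reverse it.
    std::vector<std::size_t> path;
    for (std::size_t v = dst; v != none; v = parent[v]) {
        path.push_back(v);
    }
    std::reverse(path.begin(), path.end());
    return path;
}
```

Because the edges are directed, calling `findPath(graph, a, b)` and `findPath(graph, b, a)` can give different answers, which matches the message's point that multi-hop connections might not be symmetric.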
Fabian Zwolinski
6b2281e66c fix: make sure builtins async init is complete when finishing driver init
Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-02-20 21:15:57 +01:00
Mateusz Hoppe
d9b662a735 fix: ignore ZET_ENABLE_PROGRAM_DEBUGGING when system misconfigured
- if debugging is not enabled in the system, ignore the env var that
enables debugging

Resolves: NEO-10370

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-02-19 15:54:17 +01:00
Dunajski, Bartosz
f404f3ceb1 refactor: initial support for per platform extension string
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-19 14:47:30 +01:00
Mateusz Hoppe
22802d8680 feature: add ZE_experimental_bindless_image extension
- add definitions and functions
- Level Zero spec commit: 49c972463796e614597b028bd344e1b2f64a63b7

Related-To: NEO-10352

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-02-13 15:06:41 +01:00
Dunajski, Bartosz
e86a860127 refactor: remove unordered_map with extensions functions from Driver class
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-09 10:24:54 +01:00
Dominik Dabek
371788210d performance: limit usm host allocation recycle
Query system total memory size and limit usm host allocation recycle to
use at most x%.
x is read from ExperimentalEnableDeviceAllocationCache for device and
ExperimentalEnableHostAllocationCache for host.

Related-To: GSD-7497

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-07 17:45:41 +01:00
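
The cap described in commit 371788210d can be pictured with a small budget tracker. This is a sketch under stated assumptions: `RecycleBudget` and its methods are hypothetical names, and the percentage is passed in directly instead of being read from the debug flags the message mentions.

```cpp
#include <cstdint>

// Hypothetical helper: tracks how many bytes a recycle cache may hold,
// capped at a percentage of total system memory.
class RecycleBudget {
  public:
    RecycleBudget(std::uint64_t totalSystemMemory, std::uint32_t percent)
        // Divide first so the product cannot overflow for large memory sizes.
        : limit(totalSystemMemory / 100 * percent) {}

    // Returns true and records the bytes if the allocation fits in the cap.
    bool tryReserve(std::uint64_t size) {
        if (size > limit - used) {  // used never exceeds limit
            return false;
        }
        used += size;
        return true;
    }

    // Gives bytes back when a cached allocation is reused or freed.
    void release(std::uint64_t size) {
        used -= (size <= used) ? size : used;
    }

    std::uint64_t bytesUsed() const { return used; }

  private:
    std::uint64_t limit;
    std::uint64_t used = 0;
};
```

An allocation that does not fit the budget would simply be freed rather than kept for recycling.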
John Falkowski
10ef2a28ff fix: correct for default hierarchy
Resolves: NEO-10006

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2024-01-15 22:15:49 +01:00
Mateusz Jablonski
27fbdde4c5 refactor: correct naming of unified memory enums
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 15:58:21 +01:00
Mateusz Jablonski
432142c574 refactor: correct naming of enum class constants 4/n
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 08:08:51 +01:00
Dunajski, Bartosz
9121209c41 feature: api support for Event sync mode
Related-To: NEO-8179

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-12 09:42:57 +01:00
Mateusz Jablonski
b182917d9d refactor: correct naming of allocation types
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-11 16:23:37 +01:00
Compute-Runtime-Validation
33a36e5e56 Revert "feature: api support for Event sync mode"
This reverts commit 156154813a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-12-11 14:08:19 +01:00
Dunajski, Bartosz
156154813a feature: api support for Event sync mode
Related-To: NEO-8179

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-08 19:17:13 +01:00
Dunajski, Bartosz
81aba9362a feature: support to query wait on memory data size
Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-12-08 18:00:42 +01:00
Mateusz Jablonski
c9664e6bad refactor: rename global debug manager to debugManager
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-30 13:00:59 +01:00
Mateusz Jablonski
36194c4e7d refactor: correct variable namings
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-29 23:49:03 +01:00
Dunajski, Bartosz
470ab44d35 feature: add Counter Based Events to extension list
Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-11-17 11:10:56 +01:00
Mateusz Jablonski
78b01e25d3 fix: correct number of returned L0 drivers
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-15 10:14:11 +01:00
Lukasz Jobczyk
9a8138725a fix: Deferred SVM allocations look up by gpu address
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-11-14 15:27:01 +01:00
Baj, Tomasz
10a97548c2 refactor: remove ReturnSubDevicesAsApiDevices from OCL and L0
Related-To: NEO-9437

Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-11-09 15:43:06 +01:00
Young Jin Yoon
3782b4bd0e feature: add functions to get/set VA space
Added an extension version of zeContextGetVirtualAddressSpaceIpcHandle
and zeContextPutVirtualAddressSpaceIpcHandle, which return
ZE_RESULT_ERROR_UNSUPPORTED_FEATURE for now.

Related-To: NEO-8259

Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2023-11-06 20:08:45 +01:00
Compute-Runtime-Validation
ec24eb0a4c Revert "fix: correct number of returned L0 drivers"
This reverts commit dcb351e9dd.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-11-04 02:24:26 +01:00
Mateusz Jablonski
dcb351e9dd fix: correct number of returned L0 drivers
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-03 13:53:25 +01:00
Compute-Runtime-Validation
bcc5a8502b Revert "feature: add functions to get/set VA space"
This reverts commit 0550c80d4f.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-11-03 02:06:36 +01:00
Young Jin Yoon
0550c80d4f feature: add functions to get/set VA space
Added an extension version of zeContextGetVirtualAddressSpaceIpcHandle
and zeContextPutVirtualAddressSpaceIpcHandle, which return
ZE_RESULT_ERROR_UNSUPPORTED_FEATURE for now.

Related-To: NEO-8259

Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2023-11-02 18:42:27 +01:00
Aravind Gopalakrishnan
99abb40a45 feature: Add extensions to expose dot product capabilities
- Add new extension property for user to query platform DP support.
- Add new blackbox and ULTs for extension.

Related-To: NEO-8786

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2023-10-31 21:54:39 +01:00
Compute-Runtime-Validation
c9d6615231 Revert "feature: Add extensions to expose dot product capabilities"
This reverts commit e988fe66e3.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-10-31 05:06:06 +01:00
Aravind Gopalakrishnan
e988fe66e3 feature: Add extensions to expose dot product capabilities
- Add new extension property for user to query platform DP support.
- Add new blackbox and ULTs for extension.

Related-To: NEO-8786

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2023-10-30 15:15:51 +01:00
Mateusz Jablonski
a420e34b10 fix: explicitly remove assign operators when not needed
When a class defines a copy/move ctor, the corresponding assignment
operator(s) should be defined or deleted.

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-12 10:25:15 +02:00
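
The rule stated in commit a420e34b10 can be illustrated with a small class. `Snapshot` is a hypothetical example and does not come from the driver; it declares copy and move constructors and explicitly deletes both assignment operators so that the intent is visible at the declaration.

```cpp
#include <type_traits>
#include <utility>
#include <vector>

// Hypothetical class: copy- and move-constructible, but not assignable.
class Snapshot {
  public:
    explicit Snapshot(std::vector<int> initial) : values(std::move(initial)) {}

    Snapshot(const Snapshot &) = default;
    Snapshot(Snapshot &&) noexcept = default;

    // Assignment is not needed, so it is deleted explicitly rather than
    // left to the compiler's implicit rules.
    Snapshot &operator=(const Snapshot &) = delete;
    Snapshot &operator=(Snapshot &&) = delete;

    const std::vector<int> &get() const { return values; }

  private:
    std::vector<int> values;
};

static_assert(std::is_copy_constructible<Snapshot>::value, "copy ctor is available");
static_assert(!std::is_copy_assignable<Snapshot>::value, "copy assignment is deleted");
static_assert(!std::is_move_assignable<Snapshot>::value, "move assignment is deleted");
```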
Spruit, Neil R
c8674c16c6 feature: Track IPC Handles in DriverHandle vs Context
Related-To: NEO-9116

- To allow IPC handles to be shared between contexts, IPC handle
tracking is moved to the driver handle so that handles are tracked globally.

Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
2023-10-11 19:17:47 +02:00
Filip Hazubski
08e92d154f fix: Add getDefaultDeviceHierarchy call to GfxCoreHelper
Added getDefaultDeviceHierarchy call that describes default device
hierarchy for a gfx core. Refactored L0 and OCL paths to use this
value by default and override this value when user sets
ZE_FLAT_DEVICE_HIERARCHY environment variable or
ReturnSubDevicesAsApiDevices debug key.

Updated ReturnSubDevicesAsApiDevices to force COMPOSITE device hierarchy
when set to 0.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2023-10-06 12:32:41 +02:00
Mateusz Jablonski
382fc952f2 refactor: add NonAssignableClass to define classes without assign operator
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-05 15:16:58 +02:00
Lu, Wenbin
e996241063 fix: disable locked ptr memcpy for imported BOs
Related-To: GSD-6300

Signed-off-by: Lu, Wenbin <wenbin.lu@intel.com>
2023-10-04 11:22:17 +02:00
Hoppe, Mateusz
5c565efe28 feature: bindless global heap with debugger
- program debugSurface's SurfaceState at the beginning of Bindless Surface
State Heap - SPECIAL_SSH
- ensure SPECIAL_SSH is resident

Related-To: NEO-7063

Signed-off-by: Hoppe, Mateusz <mateusz.hoppe@intel.com>
2023-09-29 13:13:46 +02:00
Mateusz Jablonski
64fd270d56 refactor: pass reference to pointer as arg to findAllocationDataForRange
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-27 12:20:23 +02:00
Latif, Raiyan
9bb559d468 feature: Ray Tracing Acceleration Structure (RTAS) Support
Related-To: LOCI-3419

Signed-off-by: Latif, Raiyan <raiyan.latif@intel.com>
2023-09-20 19:57:56 +02:00
Compute-Runtime-Validation
47dac196d9 Revert "feature: Ray Tracing Acceleration Structure (RTAS) Support"
This reverts commit 179abf00de.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-16 04:37:55 +02:00
Latif, Raiyan
179abf00de feature: Ray Tracing Acceleration Structure (RTAS) Support
Related-To: LOCI-3419

Signed-off-by: Latif, Raiyan <raiyan.latif@intel.com>
2023-09-15 19:05:20 +02:00
Compute-Runtime-Validation
c46abf149a Revert "feature: Ray Tracing Acceleration Structure (RTAS) Support"
This reverts commit faf9d3bcab.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-07 02:13:52 +02:00
Latif, Raiyan
faf9d3bcab feature: Ray Tracing Acceleration Structure (RTAS) Support
Related-To: LOCI-3419

Signed-off-by: Latif, Raiyan <raiyan.latif@intel.com>
2023-09-05 22:21:09 +02:00
Spruit, Neil R
ea7b7d7009 feature: Support for Combined Device Hierarchy
Related-To: LOCI-4662

Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
2023-08-21 10:26:20 +02:00
Zhang, Winston
8962c95000 feature: zeDriverGetLastErrorDescription error messages (module)
When there is a PRINT_DEBUG_MESSAGE message in module that is
applicable to the user, it is now also set to
Driver::zeDriverGetLastErrorDescription.
ULTs are also added to verify that setErrorDescription successfully
stores the error message.

Related-To: LOCI-4653

Signed-off-by: Zhang, Winston <winston.zhang@intel.com>
2023-08-16 00:14:49 +02:00
Kacper Nowak
1afaf37f78 refactor: Unify GTPin initialization logic between APIs
Add support for new GTPin loading logic in OCL path, similar to existing
in L0 - invoking exposed, dedicated API call (OpenGTPinOCL).
- Move logic to shared, including unit tests
- Check whether instrumentation is required on
  context creation and if yes,
  make a call to OpenGTPinOCL function.
  Handle potential errors gracefully without exiting.
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-08-09 17:15:24 +02:00
Compute-Runtime-Validation
65df34bbc7 Revert "refactor: Unify GTPin initialization logic between APIs"
This reverts commit 68a5108e05.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-08-07 04:31:52 +02:00
Kacper Nowak
68a5108e05 refactor: Unify GTPin initialization logic between APIs
Add support for new GTPin loading logic in OCL path, similar to existing
in L0 - invoking exposed, dedicated API call (OpenGTPinOCL).
- Move logic to shared, including unit tests
- Check whether instrumentation is required on
  clGetPlatformIDs API call, and if yes,
  make a call to OpenGTPinOCL function.
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-08-05 18:22:08 +02:00
Michal Mrozek
a0ba92a202 fix: fix indirect allocation handling during ipc import.
Allocation counter was not updated, which resulted in residency being skipped.
Resolves: NEO-8198
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2023-07-21 06:44:36 +02:00
Neil R Spruit
8efd8853fb feature: L0 Device Hierarchy support for Composite & Flat modes
Related-To: LOCI-4615

- Added Support for users to set ZE_FLAT_DEVICE_HIERARCHY to either FLAT
or COMPOSITE to change how devices are returned in zeDeviceGet and
clGetDeviceIDs.
- COMPOSITE is default behavior that exists today.
- FLAT returns all sub devices which have no sub devices and all root
devices that have no sub devices in zeDeviceGet, i.e. with all devices
flattened out in order.
- Added zeDeviceGetRootDevice to retrieve the Root Device for
any SubDevice.

Signed-off-by: Neil R Spruit <neil.r.spruit@intel.com>
2023-07-20 01:00:54 +02:00
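
The difference between the two modes in commit 8efd8853fb can be sketched with a toy device tree. `Device`, `Hierarchy`, `parseHierarchy`, and `exposedDevices` are hypothetical names for illustration only. The sketch assumes that any value other than `FLAT` falls back to COMPOSITE, which is a simplification.

```cpp
#include <string>
#include <vector>

// Hypothetical device node: a root device or a sub device.
struct Device {
    std::string name;
    std::vector<Device> subDevices;
};

enum class Hierarchy { composite, flat };

// Maps the value of ZE_FLAT_DEVICE_HIERARCHY to a mode. Assumption:
// an unset variable or any value other than "FLAT" means COMPOSITE.
Hierarchy parseHierarchy(const char *value) {
    return (value != nullptr && std::string(value) == "FLAT") ? Hierarchy::flat
                                                              : Hierarchy::composite;
}

// Appends the names of all devices under `device` that have no sub devices.
void collectLeaves(const Device &device, std::vector<std::string> &out) {
    if (device.subDevices.empty()) {
        out.push_back(device.name);
        return;
    }
    for (const Device &sub : device.subDevices) {
        collectLeaves(sub, out);
    }
}

// COMPOSITE lists root devices; FLAT lists leaf devices in order.
std::vector<std::string> exposedDevices(const std::vector<Device> &roots, Hierarchy mode) {
    std::vector<std::string> out;
    for (const Device &root : roots) {
        if (mode == Hierarchy::composite) {
            out.push_back(root.name);
        } else {
            collectLeaves(root, out);
        }
    }
    return out;
}
```

A caller could obtain the mode with `parseHierarchy(std::getenv("ZE_FLAT_DEVICE_HIERARCHY"))` after including `<cstdlib>`.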
Young Jin Yoon
c5d675570a feature: support for zeDriverGetLastErrorDescription
Added setErrorDescription() and getErrorDescription() in DriverHandle
to record and retrieve the custom string for errors.

Related-To: LOCI-4619
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2023-06-30 17:12:32 +02:00
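
The record-and-retrieve pattern from commit c5d675570a might look like the following in its simplest form. `LastErrorStore` is a hypothetical stand-in for the relevant part of DriverHandle, and `clearErrorDescription` is an added illustrative helper. The real driver may differ, for example by tracking the string per thread.

```cpp
#include <string>

// Hypothetical holder for the most recent error description.
class LastErrorStore {
  public:
    // Records a custom description for the latest error.
    void setErrorDescription(const std::string &description) { last = description; }

    // Hands out a pointer to the stored string; it stays valid until the
    // next set or clear call on this object.
    void getErrorDescription(const char **outString) const { *outString = last.c_str(); }

    // Resets the description, e.g. after a successful call.
    void clearErrorDescription() { last.clear(); }

  private:
    std::string last;
};
```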
Zbigniew Zdanowicz
1067167637 test: adding testing debug flag overriding driver version
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-06-27 17:58:03 +02:00