Commit Graph

512 Commits

Author SHA1 Message Date
Bartosz Dunajski
db3f268a97 refactor: reorganize l0 api include directories [2/n]
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-20 17:16:53 +01:00
Bartosz Dunajski
47f74b8a7b refactor: reorganize l0 api include directories [1/n]
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-20 12:59:08 +01:00
Compute-Runtime-Validation
124e755b9d Revert "fix: regression caused by tbx fault mngr"
This reverts commit 9a14fe2478.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-19 17:35:03 +01:00
Lukasz Jobczyk
085988c5e9 performance: Signal inOrder counter with pipe control
When on DC flush platform signal inOrder counter directly with pipe
control. Skip not needed inOrder timestamp with its reset and semaphore.
Currently only for non profiling immediate cmd list case.

Related-To: NEO-13441

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-12-19 08:02:56 +01:00
Jack Myers
9a14fe2478 fix: regression caused by tbx fault mngr
Addresses regressions from the reverted merge
of the tbx fault manager for host memory.

This fixes attempts by the tbx fault manager
to protect/unprotect host buffer memory, even
if the host ptr was not driver-allocated.

In the case of the smoke test that triggered
the critical regression, clCreateBuffer was
called with the CL_MEM_USE_HOST_PTR flag.
The subsequent `mprotect` calls on the
provided host ptr then failed.

Related-To: NEO-12268
Signed-off-by: Jack Myers <jack.myers@intel.com>
2024-12-18 23:16:36 +01:00
Bartosz Dunajski
b1dea19fbd refactor: move tag initialization to allocator [1/n]
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-17 17:53:13 +01:00
Compute-Runtime-Validation
6c5d9a6ed7 Revert "feature: extend TBX page fault manager from CPU implementation"
This reverts commit 51c0e80299.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-12 12:30:22 +01:00
Chandio, Bibrak Qamar
8cf4804fcd fix: Overhead in zeDeviceGetGlobalTimestamps
Related-To: NEO-11908

There is overhead when submission method is used for
zeDeviceGetGlobalTimestamps. This fixes it.

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2024-12-12 08:54:19 +01:00
Jack Myers
51c0e80299 feature: extend TBX page fault manager from CPU implementation
In TBX mode, the host could not write to host buffers after access from device
code due to the lack of a migration mechanism post-initial TBX upload.
Migration is unnecessary with real hardware, but required for TBX.

This patch introduces a new page fault manager type that extends the original
CPU fault manager, enabling automatic migration of host buffers in TBX mode.

Refactoring was necessary to avoid diamond inheritance, achieved by using a
template parameter as the base class for OS-specific fault managers.

Related-To: NEO-12268
Signed-off-by: Jack Myers <jack.myers@intel.com>
2024-12-11 09:09:50 +01:00
Chandio, Bibrak Qamar
ab2e831a4a fix: zeDeviceGetGlobalTimestamp to use submisison
Related-To: GSD-10253, GSD-9467, GSD-9381, NEO-11908

When EnableGlobalTimestampViaSubmission is set then
zeDeviceGetGlobalTimestamp uses immediate cmd submission
method to get GPU time.

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2024-12-04 19:10:07 +01:00
shubham kumar
69f81d5d57 fix: Check for root device flag in multi-device usecase
Related-To: NEO-12381

Signed-off-by: shubham kumar <shubham.kumar@intel.com>
2024-12-03 07:03:39 +01:00
Bartosz Dunajski
c6accb8d53 fix: select correct CSR for submission if copy offload is not allowed
Related-To: GSD-10194

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-11-29 11:41:53 +01:00
Bartosz Dunajski
7bf22ed33e feature: counter based allocation peer sharing
Related-To: NEO-13079

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-11-13 15:01:32 +01:00
Bartosz Dunajski
2d6fb1af4b fix: in-order relaxed ordering handling for signalEvent
Related-To: NEO-13082

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-10-25 15:30:33 +02:00
Mateusz Jablonski
8fbe873e1b build: correct setting for i915/xe prelim setting in package scripts
rename i915 flag to uppercase, for consistency
disable i915 / xe prelim by default in package scripts

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-09 15:55:59 +02:00
Brandon Yates
013c49db02 fix: change getBindlessDebugSipKernel to also get heapless sip
Rename function to getDebugSipKernel and select either bindless
or heapless depending on mode

Related-to: NEO-8396

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2024-10-03 20:47:40 +02:00
Bartosz Dunajski
b8fd1bda36 feature: use sysInfo helper to detect memory type
Related-To: NEO-12807

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-30 18:19:42 +02:00
Bartosz Dunajski
2618f586fa fix: copy offload mmio programming
Related-To: NEO-12771

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-24 19:37:58 +02:00
Bartosz Dunajski
d7ce841081 fix: InOrderExecInfo upload to tbx
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-20 16:03:09 +02:00
Filip Hazubski
beaddb55e0 fix: Minor code changes
Remove needless checks for module and cmdList.
Add explicit check for surfaceState in setArgImageWithMipLevel.
Explicitly initialize WddmResidencyController::csr.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-09-10 13:31:15 +02:00
Mateusz Hoppe
9d5f8eb587 refactor: save hpCopyEngine in Device
- remove loop iteration to get hpCopyEngine

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-06 17:56:53 +02:00
Mateusz Jablonski
14fd9f3f24 fix: correct using L0 loader functions
use zelLoaderTranslateHandle for translating handle to internal handle
get pointer to zelSetDriverTeardown during global ctor
don't load loader library by name
get loader function pointers directly from current process

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-04 17:30:25 +02:00
Mateusz Jablonski
43e3957e66 fix: add nullptr check in cache reservation API
in case of AUB/TBX mode there is no os interface

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-03 15:43:35 +02:00
Mateusz Jablonski
d45c16dfc2 fix: add fallback for invalid handles in extension functions
handle context, commandlist, driver, device, event, image and kernel handles

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-28 17:22:35 +02:00
Mateusz Hoppe
482e077252 feature: add regular and hp contexts in group without dedicated hp engine
- if no hp copy engine available, create group with regular and hp
contexts

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-08-28 14:01:10 +02:00
Mateusz Hoppe
ce07b86d87 fix: return valid LP csr from root device
Related-To: NEO-12257

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-08-27 23:33:57 +02:00
Bartosz Dunajski
1795642da9 fix: disallow enabling synchronized dispatch on non-ioq queues
Related-To: HSD-18039265269

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-23 19:30:24 +02:00
Mateusz Jablonski
5b51146673 fix: fail L0 device creation when invalid state save area header was delivered
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-22 16:24:40 +02:00
Aravind Gopalakrishnan
cb8063f71d feature: Append recorded command list into immediate (3/N)
- Use correct stream for dispatch
- Add capability to append signal event
- Check available space globally in immediate append call

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-08-16 17:40:28 +02:00
Jack Myers
f5d00b2616 feature: 2d-block-load-transpose query
Implemented device property query API for determining
support capabilities regarding 2d-block-load-tranpose
features for which not all Intel devices support.

Related-To: NEO-11592
Signed-off-by: Jack Myers <jack.myers@intel.com>
2024-07-30 18:21:07 +02:00
Mateusz Hoppe
a734a738fa fix: use regular engine for HP copy engine if no HP engine
- map HIGH PRIORTY queues to regular engine for copy engine if there is
no HP copy engine available

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-26 17:09:31 +02:00
Maciej Bielski
2165154530 feature: add support for L3-cache reservation
Resolves: NEO-7849

Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-07-24 10:10:28 +02:00
Bartosz Dunajski
c3312f21f7 feature: dont initialize in-order TS nodes
Related-To: NEO-11925

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-23 13:31:23 +02:00
Bartosz Dunajski
3758e99cbf feature: initial support for standalone CB Events Timestamps allocator
Related-To: NEO-11925

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-22 10:13:55 +02:00
Mateusz Hoppe
b6299b8a21 feature: add support for HP copy engine context
- add support for contect group with HP copy engine
- choose HP copy engine when available

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-19 12:23:03 +02:00
Compute-Runtime-Validation
7ad15639fc Revert "feature: add support for HP copy engine context"
This reverts commit 3fbcbcaef2.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-18 21:02:14 +02:00
Mateusz Hoppe
3fbcbcaef2 feature: add support for HP copy engine context
- add support for contect group with HP copy engine
- choose HP copy engine when available

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-18 16:07:07 +02:00
Bartosz Dunajski
024c015dab feature: allow creating L0 BCS LowPriority Queues
Related-To: NEO-7824

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-03 11:20:37 +02:00
Bartosz Dunajski
b4903a9a0f refactor: unify Queue CSR assignment methods
Related-To: NEO-7824

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-02 12:45:12 +02:00
Jitendra Sharma
51fc9059c2 feature: program debug surface address
Related-To: NEO-7986
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-07-01 16:30:14 +02:00
Wenbin Lu
cf71353d84 fix: report P2P atomic access for XeLink or MDFI only
Related-To: NEO-11753

Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-06-24 17:28:22 +02:00
Filip Hazubski
2f6eaf149a fix: Update SIP kernel initialization logic
Initialize SIP kernel when shared device is being initialized
instead of api-specific device.

Initialize debugger when shared device is being initialized
instead of during platform or driver initialization.

Add missing makeResident calls for SIP kernel in heapless paths.

Related-To: HSD-18038645398, HSD-18038819112

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-06-20 18:30:46 +02:00
Aravind Gopalakrishnan
510a51b9f4 feature: Add support for d3d12 handle types for import
Related-To: NEO-11292

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-06-17 08:30:10 +02:00
Bartosz Dunajski
31f3e3e81d feature: initial support for media query api
Related-To: NEO-9777

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-14 17:28:49 +02:00
Aravind Gopalakrishnan
b7b45e1410 feature: Add capability to query additional memory properties ext
Related-To: NEO-11362

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-06-14 17:03:23 +02:00
Bartosz Dunajski
703fabdffd refactor: improve extracting Queue properties
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-07 13:13:53 +02:00
Bartosz Dunajski
da9c009b88 feature: assign unique interrupt to queue
Related-To: NEO-8179

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-07 10:06:31 +02:00
Bartosz Dunajski
e88127dcc3 refactor: remove not used multi context per engine code
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-06 11:41:15 +02:00
Bartosz Dunajski
88ed909e57 feature: assign secondary BCS engines at queue creation
Related-To: NEO-7824

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-03 09:41:43 +02:00
Aravind Gopalakrishnan
0eaed66334 feature: Append recorded command list into immediate (1/N)
Related-To: NEO-10356

- Refactoring wait on event to skip flush when needed
- Refactor execute call interface to accept linear stream
- Program wait on events in new interface

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-05-29 05:59:32 +02:00