Commit Graph

3967 Commits

Author SHA1 Message Date
Lukasz Jobczyk
83ebbb01d3 performance: Add flag to mitigate host visible signal in CB events
Related-To: NEO-13441

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-12-24 12:35:55 +01:00
Bartosz Dunajski
fc1dd3b178 fix: ensure regular event completion in in-order mode 3
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-24 12:10:11 +01:00
Mateusz Hoppe
3abd99030a refactor: remove not needed code in BuiltinFunctionsLibImpl
- builtins are released with the object, no need for explicit release of
just first builtin in array

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-12-24 09:58:35 +01:00
Oskar Hubert Weber
1c1b2db9a9 fix: allowing neo ULT build with optimization enabled 5/n
- fix mismatched-new-delete warnings
- fix initialization warnings
- fix attempt to free a non-heap object warnings

Related-To: NEO-8116

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2024-12-23 13:07:55 +01:00
Aravind Gopalakrishnan
acfeffe274 feature: Append recorded commandlist into immediate (8/N)
Add primary dispatch capability for immediate command list

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-12-20 18:52:33 +01:00
Bartosz Dunajski
db3f268a97 refactor: reorganize l0 api include directories [2/n]
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-20 17:16:53 +01:00
Bartosz Dunajski
54fb82f689 fix: improve event timestamp completion tracking
Related-To: HSD-18041011542

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-20 16:07:27 +01:00
Lukasz Jobczyk
363f2becd4 performance: Signal inOrder counter with pipe control, part 2
When on DC flush platform signal inOrder counter directly with pipe
control. Obtain profiling from CW. Only for immediate in order cmd
lists.

Related-To: NEO-13441

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-12-20 14:21:34 +01:00
Bartosz Dunajski
47f74b8a7b refactor: reorganize l0 api include directories [1/n]
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-20 12:59:08 +01:00
Kamil Kopryk
c96ad9dcbf fix: patch 64bit row and slice pitch for builtins in L0 heapless
Related-To: NEO-12149
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-12-20 10:55:07 +01:00
Compute-Runtime-Validation
61aa75d6a9 Revert "performance: Signal inOrder counter with pipe control, part 2"
This reverts commit 136e5b4f6c.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-20 04:38:46 +01:00
Compute-Runtime-Validation
f35d620e06 Revert "feature: Append recorded commandlist into immediate (8/N)"
This reverts commit d8d7419aee.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-20 03:35:27 +01:00
Compute-Runtime-Validation
630cbe8e41 Revert "fix: ensure regular event completion in in-order mode 2"
This reverts commit 9c2e5f5568.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-20 03:09:27 +01:00
Aravind Gopalakrishnan
7ea8b59a1d fix: Add immediate command list append API to reported extensions list
Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-12-19 21:22:19 +01:00
Aravind Gopalakrishnan
d8d7419aee feature: Append recorded commandlist into immediate (8/N)
Add primary dispatch capability for immediate command list

Related-To: NEO-10356

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-12-19 19:02:14 +01:00
Compute-Runtime-Validation
124e755b9d Revert "fix: regression caused by tbx fault mngr"
This reverts commit 9a14fe2478.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-19 17:35:03 +01:00
Lukasz Jobczyk
136e5b4f6c performance: Signal inOrder counter with pipe control, part 2
When on DC flush platform signal inOrder counter directly with pipe
control. Obtain profiling from CW. Only for immediate in order cmd
lists.

Related-To: NEO-13441

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-12-19 14:21:36 +01:00
Mateusz Jablonski
6843632298 refactor: move SWTagsManager::currentCallCount to protected scope
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-12-19 12:31:44 +01:00
Bartosz Dunajski
9c2e5f5568 fix: ensure regular event completion in in-order mode 2
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-19 12:31:29 +01:00
Lukasz Jobczyk
085988c5e9 performance: Signal inOrder counter with pipe control
When on DC flush platform signal inOrder counter directly with pipe
control. Skip not needed inOrder timestamp with its reset and semaphore.
Currently only for non profiling immediate cmd list case.

Related-To: NEO-13441

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-12-19 08:02:56 +01:00
Jack Myers
9a14fe2478 fix: regression caused by tbx fault mngr
Addresses regressions from the reverted merge
of the tbx fault manager for host memory.

This fixes attempts by the tbx fault manager
to protect/unprotect host buffer memory, even
if the host ptr was not driver-allocated.

In the case of the smoke test that triggered
the critical regression, clCreateBuffer was
called with the CL_MEM_USE_HOST_PTR flag.
The subsequent `mprotect` calls on the
provided host ptr then failed.

Related-To: NEO-12268
Signed-off-by: Jack Myers <jack.myers@intel.com>
2024-12-18 23:16:36 +01:00
Aravind Gopalakrishnan
6c08454fea test: Resolve clang-tidy issue in semaphore test
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-12-18 21:15:41 +01:00
Bartosz Dunajski
e8cfb38db4 performance: improve relaxed ordering task count tracking
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-18 11:36:22 +01:00
Kamil Kopryk
b724c964fc performance: reserve vector before push_back's
Related-To: NEO-12742
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-12-18 10:07:56 +01:00
Kamil Kopryk
46a0c87168 test: add sampler with 1D image level zero black box test
Related-To: NEO-12742
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-12-18 09:32:13 +01:00
Kamil Kopryk
3be52fa1d0 refactor: Add ult with check if bindless ss is patched correctly
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-12-18 09:30:13 +01:00
Kulkarni, Ashwin Kumar
7986e68dfc feature: adds support for 3 channel
Related-To: NEO-12887

Signed-off-by: Kulkarni, Ashwin Kumar <ashwin.kumar.kulkarni@intel.com>
2024-12-18 07:04:14 +01:00
Raiyan Latif
06c94d47f9 feature: Add External Semaphore experimental [3/N]
Related-To: NEO-11488

Signed-off-by: Raiyan Latif <raiyan.latif@intel.com>
2024-12-18 05:29:33 +01:00
Bartosz Dunajski
b1dea19fbd refactor: move tag initialization to allocator [1/n]
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-17 17:53:13 +01:00
Bartosz Dunajski
f6df5c1dce performance: optimize event host writes
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-17 17:36:03 +01:00
Filip Hazubski
a0cc124b2e performance: Pass RootDeviceIndicesContainer by reference
Additionally pass std::map by reference in UsmMemAllocPoolsManager c-tor.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-12-17 14:18:30 +01:00
shubham kumar
3c3724a9df refactor: prework for adding eu stall support on xe2 and later
Related-To: NEO-13407

Signed-off-by: shubham kumar <shubham.kumar@intel.com>
2024-12-14 08:32:01 +01:00
Compute-Runtime-Validation
37eeef5ba4 Revert "fix: Remove the check for ze-take-global-address and update errors"
This reverts commit 156e219e7f.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-14 04:50:41 +01:00
Wenbin Lu
17e537b694 feature: enable event synchronization GPU status check by default
Related-To: GSD-10187

Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-12-13 20:36:15 +01:00
Mateusz Jablonski
8f7bacdd95 feature: add eudebug interface class
The eudebug interface is now hidden under the EuDebugInterface class;
shared code uses generic object and param values.

layout of structs is guarded by static asserts

eudebug support is guarded by cmake flags:
- NEO_ENABLE_XE_EU_DEBUG_SUPPORT - enables eudebug in general
- NEO_USE_XE_EU_DEBUG_EXP_UPSTREAM - registers exp upstream uAPI support
- NEO_ENABLE_XE_PRELIM_DETECTION - registers prelim uAPI support

This way we can support two different xe-eudebug interfaces within
a single binary.

In unit tests, a mock eudebug interface is enabled (even if no
eudebug support is enabled by a cmake flag).

Related-To: NEO-13472
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-12-13 17:07:12 +01:00
Neil R. Spruit
156e219e7f fix: Remove the check for ze-take-global-address and update errors
Related-to: NEO-13458

- With new IGC, ze-take-global-address is not needed to ensure global
pointers are allowed. Update the error message to explain correctly
why the query failed, and remove the requirement check for the flag.

Signed-off-by: Neil R. Spruit <neil.r.spruit@intel.com>
2024-12-13 15:52:36 +01:00
Zbigniew Zdanowicz
98fd7c9432 test: refactor aub tests to add fixture into separate file
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-12-13 14:35:32 +01:00
Maciej Plewka
8151224501 fix: add microsecond resolution for timeout
Related-To: NEO-13445
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-12-13 09:32:18 +01:00
Wenbin Lu
c19df80bd8 feature: add key to force GPU status check in event synchronization
Related-To: GSD-10187

Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-12-12 22:24:36 +01:00
Bartosz Dunajski
084ea8b687 fix: unset InOrderExecInfo after appendEventReset
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-12 16:26:53 +01:00
Bartosz Dunajski
071712b835 feature: enable new relaxed ordering heuristic
Related-To: NEO-13431

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-12 14:49:04 +01:00
Zbigniew Zdanowicz
a6ad9caf6c test: add option to create cooperative queues and command lists in tests
Related-To: NEO-13350

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-12-12 12:37:51 +01:00
Compute-Runtime-Validation
6c5d9a6ed7 Revert "feature: extend TBX page fault manager from CPU implementation"
This reverts commit 51c0e80299.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-12 12:30:22 +01:00
Bartosz Dunajski
5f6b2de3dc feature: clear standalone timestamp before submission
Related-To: HSD-18040896547

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-12 10:55:10 +01:00
Chandio, Bibrak Qamar
8cf4804fcd fix: Overhead in zeDeviceGetGlobalTimestamps
Related-To: NEO-11908

There is overhead when the submission method is used for
zeDeviceGetGlobalTimestamps. This fixes it.

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2024-12-12 08:54:19 +01:00
Compute-Runtime-Validation
b060cb6c8c Revert "fix: ensure regular event completion in in-order mode"
This reverts commit 27c36c4998.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-12-12 03:18:13 +01:00
Bartosz Dunajski
27c36c4998 fix: ensure regular event completion in in-order mode
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-11 17:46:34 +01:00
Lukasz Jobczyk
f2725f217e refactor: Introduce debug flags to manipulate event's signal visibility
- Add AbortHostSyncOnNonHostVisibleEvent, which aborts when the host waits
on a non-host-visible event
- Add ForceHostSignalScope, which forces adding or clearing the host scope
in the event's signal scope

Related-To: NEO-13441

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-12-11 14:07:43 +01:00
Bartosz Dunajski
eca3d5a677 feature: debug flag to clear timestamp before submission
Related-To: HSD-18040896547

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-12-11 13:56:49 +01:00
Jack Myers
51c0e80299 feature: extend TBX page fault manager from CPU implementation
In TBX mode, the host could not write to host buffers after access from device
code due to the lack of a migration mechanism post-initial TBX upload.
Migration is unnecessary with real hardware, but required for TBX.

This patch introduces a new page fault manager type that extends the original
CPU fault manager, enabling automatic migration of host buffers in TBX mode.

Refactoring was necessary to avoid diamond inheritance, achieved by using a
template parameter as the base class for OS-specific fault managers.

Related-To: NEO-12268
Signed-off-by: Jack Myers <jack.myers@intel.com>
2024-12-11 09:09:50 +01:00
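
The message of 51c0e80299 says diamond inheritance was avoided by using a template parameter as the base class for the OS-specific fault managers. The following is a minimal C++ sketch of that general pattern, not the actual NEO code: every class, method, and return string here is hypothetical and chosen only for illustration.

```cpp
#include <cassert>
#include <string>

// All names below are hypothetical; none are taken from the NEO sources.

// Common base: generic CPU fault handling plus an OS-specific hook.
class CpuFaultManager {
  public:
    virtual ~CpuFaultManager() = default;
    virtual std::string handleFault() { return "cpu-migrate"; }
    virtual std::string protect() = 0; // supplied by the OS-specific layer
};

// TBX variant: extends the CPU manager with extra migration behaviour.
class TbxFaultManager : public CpuFaultManager {
  public:
    std::string handleFault() override {
        return "tbx-migrate+" + CpuFaultManager::handleFault();
    }
};

// OS-specific layer: its base class is a template parameter, so it can sit
// on top of either manager in one linear inheritance chain (no diamond).
template <typename BaseFaultManager>
class LinuxFaultManager : public BaseFaultManager {
  public:
    std::string protect() override { return "mprotect"; }
};

using LinuxCpuFaultManager = LinuxFaultManager<CpuFaultManager>;
using LinuxTbxFaultManager = LinuxFaultManager<TbxFaultManager>;

// Both variants stay usable through the single common base.
inline std::string describe(CpuFaultManager &manager) {
    return manager.handleFault() + "/" + manager.protect();
}

inline void checkSketch() {
    LinuxCpuFaultManager cpu;
    LinuxTbxFaultManager tbx;
    assert(describe(cpu) == "cpu-migrate/mprotect");
    assert(describe(tbx) == "tbx-migrate+cpu-migrate/mprotect");
}
```

Without the template parameter, a combined Linux TBX manager would have to derive from both a TBX manager and a Linux manager, each of which derives from the CPU manager, which is the diamond the commit message refers to. With it, each concrete type has exactly one CpuFaultManager subobject and no virtual inheritance is needed.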