Commit Graph

2420 Commits

Author SHA1 Message Date
Kamil Kopryk
9b7d54ef66 fix: fix exported function address for heapless mode in L0
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-08 09:45:42 +01:00
Yoon, Young Jin
97ef964bc4 feature: Add keys to override sync mode for immediate command list
Added OverrideImmediateCmdListSynchronousMode to override synchronous
mode for immediate command list

Related-To: NEO-10316
Signed-off-by: Yoon, Young Jin <young.jin.yoon@intel.com>
2024-02-08 08:35:32 +01:00
Dominik Dabek
371788210d performance: limit usm host allocation recycle
Query system total memory size and limit usm host allocation recycle to
use at most x%.
x is read from ExperimentalEnableDeviceAllocationCache for device and
ExperimentalEnableHostAllocationCache for host.

Related-To: GSD-7497

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-07 17:45:41 +01:00
Fabian Zwolinski
87990b8cfc fix: add ISA cache flush prior to cmdlists execution
Add sizeForFullCacheFlush and
sizeForInstructionCacheFlush before
estimateLinearStreamSizeComplementary call
to take into account the state of globalInit.

Set ctx.globalInit = true when cache flush is required.

Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-02-07 17:32:15 +01:00
Yoon, Young Jin
22f761b4eb feature: Add introspection APIs for event
Added entrypoints for all event-related introspection APIs
and implemented corresponding functions in Event and EventPool.

Modified Event to record its EventPool upon creation.

Related-To: NEO-10265
Signed-off-by: Yoon, Young Jin <young.jin.yoon@intel.com>
2024-02-07 11:48:28 +01:00
Kacper Nowak
999ec9b2ca refactor: Unify logic for getting atomic FP caps 1/n
- Separate logic for fp16/32/46 caps.
- Add aggregated constexprs for local & global caps of given type
- Pass arguments by reference
- Add hwInfo as argument for future refactors
- Add static_asserts in L0 to ensure there is no mismatch between
internal/external caps
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2024-02-07 11:39:36 +01:00
Yoon, Young Jin
dac2af299b feature: Add introspection APIs for cmdlist
Added entrypoints for all command-list-related introspection APIs
and implemented corresponding functions in command list.

Modified device to record ordinal when creating command lists.

Related-To: NEO-10265
Signed-off-by: Yoon, Young Jin <young.jin.yoon@intel.com>
2024-02-07 07:22:26 +01:00
Yoon, Young Jin
7b12735ade feature: Add introspection APIs for cmdqueue
Added entrypoints for zeCommandQueueGetOrdinal and
zeCommandQueueGetIndex and implemented getOrdinal and getIndex
in command queue.

Related-To: NEO-10265
Signed-off-by: Yoon, Young Jin <young.jin.yoon@intel.com>
2024-02-06 07:38:17 +01:00
Fabian Zwolinski
f55cb596c7 fix: remove unnecessary allocation add to residencyContainer
Image implicit args allocation is not related to redescribed image
and is already added to the residencyContainer in setArgImage.

Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-02-02 16:26:33 +01:00
Dominik Dabek
2cad595a0d performance: debug flag for usm host alloc recycle
set ExperimentalEnableHostAllocationCache=1 to recycle host usm
allocations

Related-To: GSD-7497

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-02-01 16:47:59 +01:00
Compute-Runtime-Validation
fb46066abc Revert "fix: enable cache env variables for level-zero"
This reverts commit 743904d2df.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-31 08:33:05 +01:00
Fabian Zwolinski
743904d2df fix: enable cache env variables for level-zero
Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-01-30 14:01:42 +01:00
Mateusz Jablonski
2db441a0e0 fix: allow query kernel timestamp after launch cooperative kernel within cmdlist
Related-To: NEO-10191
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-30 12:55:50 +01:00
Fabian Zwolinski
c51b656d2c fix: request instruction cache invalidation on module destroy
Invalidation is requested on both linux and windows,
on Csr's that used Isa allocation.

Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-01-29 16:00:34 +01:00
Michal Mrozek
1003594294 Revert "fix: choose proper csr for low priority immediate command lists"
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-01-29 14:36:28 +01:00
Lukasz Jobczyk
4704cd49a3 fix: Remove fence handling when reuse cmd buffer
Resolves: NEO-10163
Related-To: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-29 09:08:19 +01:00
Compute-Runtime-Validation
63a5b64c7e Revert "fix: Remove fence handling when reuse cmd buffer"
This reverts commit f3bbd70a58.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-27 08:28:57 +01:00
Lukasz Jobczyk
f3bbd70a58 fix: Remove fence handling when reuse cmd buffer
Resolves: NEO-10163
Related-To: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-26 10:06:24 +01:00
Michal Mrozek
473b892132 fix: choose proper csr for low priority immediate command lists
Resolves: NEO-10168

Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-01-25 15:42:32 +01:00
Kamil Kopryk
fdb5a53f85 fix: fix scratch programming in heapless mode l0
Related-To: NEO-10107
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-25 10:50:37 +01:00
Kamil Kopryk
bf4cf57001 fix: fix exported function address when heapless mode is enabled in L0
Related-To: IGC-8472
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-24 17:12:21 +01:00
Mateusz Jablonski
a697a3f718 refactor: create new members for storing spill and private memory in scratch
rename private scratch space into scratch space slot 1 as it can be generic

Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-23 12:42:25 +01:00
Compute-Runtime-Validation
f9f9035b95 Revert "refactor: create new members for storing spill and private memory in ...
This reverts commit 87eb5f554a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-23 09:13:00 +01:00
Mateusz Jablonski
87eb5f554a refactor: create new members for storing spill and private memory in scratch
rename private scratch space into scratch space slot 1 as it can be generic

Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-22 19:48:48 +01:00
Maciej Bielski
f7b3b21efe fix: update build log when program build fails
Regression fix.

Resolves: GSD-7621

Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-01-22 17:35:29 +01:00
Dunajski, Bartosz
b77e1a6a71 feature: improve reporting max cooperative group count
Related-To: NEO-8210

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-22 14:56:15 +01:00
Lukasz Jobczyk
7bd33af394 fix: use std::thread detach instead of astd::async for init builtin async
Resolves: GSD-7023

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-22 13:58:29 +01:00
Dunajski, Bartosz
6f4ed10919 refactor: unify getMaxWorkGroupCount logic
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-22 11:16:45 +01:00
Maciej Bielski
134c718a25 fix: support zeEventPool IPC handles from single process
For all execution paths where needed, use `BufferObjectHandleWrapper`
instances for registration of BO-handles and try to obtain shared
ownership. This allows both sides of IPC communication to be implemented
in the same process and avoid the double-free problem on a BufferObject
of the same handle.

Currently there are two pairs of such calls:
* `zeEventPoolGetIpcHandle()` + `zeEventPoolOpenIpcHandle()`
* `zeMemGetIpcHandle()` + `zeMemOpenIpcHandle()`

The capability of executing both sides from the same process is useful
for testing but not only.

Related-To: NEO-9837
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-01-19 14:49:38 +01:00
Dominik Dabek
e0ccf22557 fix: indirect access in external functions
Read indirect_stateless_count in module external functions.

If greater than 0, mark all kernels that have the has_stack_calls flag
set from this module as having indirect accesses.

Related-To: NEO-7712

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-01-18 15:29:41 +01:00
Dunajski, Bartosz
e354c01acc feature: allow waiting for standalone CB Event
Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-16 11:59:46 +01:00
John Falkowski
10ef2a28ff fix: correct for default hierachry
Resolves: NEO-10006

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2024-01-15 22:15:49 +01:00
Dunajski, Bartosz
dfc69c746a feature: create InOrderExecInfo for standalone CB Event
Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-15 16:56:52 +01:00
Dunajski, Bartosz
9309d7d865 feature: initial support of zexCounterBasedEventCreate 2
Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-15 11:47:36 +01:00
Compute-Runtime-Validation
1cc4c6c10f Revert "feature: initial support of zexCounterBasedEventCreate"
This reverts commit d8f8b70dd7.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-15 05:41:10 +01:00
Dunajski, Bartosz
d8f8b70dd7 feature: initial support of zexCounterBasedEventCreate
Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-12 15:11:00 +01:00
Warchulski, Jaroslaw
d1061cd923 fix: don't unregister csr clients in EventImp::reset()
Related-To: GSD-7129
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2024-01-12 14:35:20 +01:00
Dominik Dabek
b3b402cad2 feature(zebin): read indirect detection version
Read indirect detection version intel compat notes from zebin.

This is to prepare for enabling indirect access optimization in AOT
built kernels.

Related-To: NEO-7712

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-01-12 14:20:52 +01:00
Fabian Zwolinski
a3bd629aff fix: store image implicit args in bindless slot 1
bindless slot 0: image
bindless slot 1: image implicit args
bindless slot 2: redescribed image

Redescribed image moved one slot higher.
Separate allocation is created for new bindless slot.

Related-To: NEO-9740
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-01-12 13:09:06 +01:00
Dunajski, Bartosz
aaef241111 refactor: allow creating Event without pool object
Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-12 10:08:17 +01:00
Lu, Wenbin
0db5b630c9 fix: do not force root device when opening IPC handles
Related-To: NEO-9422

Signed-off-by: Lu, Wenbin <wenbin.lu@intel.com>
2024-01-12 09:38:57 +01:00
Fabian Zwolinski
903e581b5f fix: add support for bindless implicit args
Support for:
global_base and const_base in bindless addressing mode.

Related-To: NEO-9855
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-01-12 01:27:17 +01:00
Dunajski, Bartosz
6abce2b29d feature: zexEventGetDeviceAddress support
Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-11 16:22:11 +01:00
Dunajski, Bartosz
b9fe6a3f18 feature: allow patching CB Events on non-inOrder regular cmd lists
Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-11 15:19:45 +01:00
Dunajski, Bartosz
b18acc26b7 feature: new API to create and export counter-based Event
Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-11 09:42:38 +01:00
Mateusz Jablonski
a73fb4d2fe fix: correct reporing kernel private size on L0
unify the logic across APIs

Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-09 09:54:05 +01:00
Dunajski, Bartosz
2082038475 fix: fallback empty counter based pool flags to immediate mode
Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-05 19:05:40 +01:00
Mateusz Hoppe
425ae43ed1 feature: use high priority csr from secondary contexts
Related-To: NEO-7824

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-01-05 08:26:11 +01:00
Dunajski, Bartosz
57900c5455 refactor: improve single slice dispatch mode handling
Related-To: NEO-8210

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-01-04 15:24:09 +01:00
Lukasz Jobczyk
d472cf0a5d fix: Restore handle fence completion when reusing cmd buffer
Related-To: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-04 12:11:39 +01:00