939 Commits

Author SHA1 Message Date
Szymon Morek 39ec7facee performance: use BCS for transfers if CCS is busy
Related-To: NEO-11501

Also, if device is iGPU, don't use staging buffers
in that case.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-22 15:36:26 +02:00
Falkowski, John 85c8c09990 fix: correct latestSentTaskCount in flushBcsTask
Resolves: NEO-12075

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2024-07-19 09:43:31 +02:00
Szymon Morek 09cbbc2625 fix: don't evict always resident allocations
Related-To: NEO-12008

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-15 11:39:00 +02:00
Mateusz Jablonski 778645c11e feature: add initial support for Xe2 platforms
Related-To: NEO-8188, NEO-10774
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-09 10:30:27 +02:00
Mateusz Jablonski 0db66bb708 feature: add new flag to StateComputeModeProperties
Related-To: NEO-8188, NEO-10774

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-03 14:05:02 +02:00
Jitendra Sharma 51fc9059c2 feature: program debug surface address
Related-To: NEO-7986
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-07-01 16:30:14 +02:00
Maciej Bielski 3849bad719 fix: block copy operations CommandStreamReceiver
The implicit ones are not sufficient and user-provided does not seem to
be needed.

Related-To: NEO-11825

Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-06-27 11:45:15 +02:00
Lukasz Jobczyk fed90f5c8e fix: Add infrastructure to force dc flush when mitigate dc
-force dc on next tag update after RT kernel
-force dc when release shared object

Related-To: NEO-10556

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-24 17:15:03 +02:00
Filip Hazubski 2f6eaf149a fix: Update SIP kernel initialization logic
Initialize SIP kernel when shared device is being initialized
instead of api-specific device.

Initialize debugger when shared device is being initialized
instead of during platform or driver initialization.

Add missing makeResident calls for SIP kernel in heapless paths.

Related-To: HSD-18038645398, HSD-18038819112

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-06-20 18:30:46 +02:00
Jaroslaw Warchulski 63da79798d refactor: reduce the number of GraphicsAllocation constructors and derivatives
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-06-18 21:31:37 +02:00
Dominik Dabek 296d2bb148 performance: adjust ulls timeout on power saving
Use l0 power saving setting to also adjust ulls timeout.

Related-To: NEO-10800

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-06-17 16:51:55 +02:00
Mateusz Hoppe b3d72ddd3d fix: write memory for resident allocations in simulation mode
- refactor and call processFlushResidency() on memoryOperationsHandler
- call free() to remove allocation from resident allocations when
graphics allocation is released

Related-To: NEO-11719

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-06-14 18:49:01 +02:00
Kamil Kopryk 0cb48901dc refactor: add AUBDumpCaptureDirPath debug flag
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-13 14:17:50 +02:00
Mateusz Hoppe 8c48dbe39e feature: pass lowPriority flag when creating context with aubstream
Related-To: NEO-10168

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-06-07 13:14:08 +02:00
Bartosz Dunajski da9c009b88 feature: assign unique interrupt to queue
Related-To: NEO-8179

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-07 10:06:31 +02:00
Bartosz Dunajski fe6809ac04 feature: prework to initialize BCS state in heapless mode
Related-To: NEO-7824

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-31 10:06:58 +02:00
Mrozek, Michal 38d7e02c28 refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-05-29 13:24:02 +02:00
Szymon Morek 2c2ff5e369 fix: update task count when object is always resident
Related-To: NEO-11537

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-27 15:18:00 +02:00
Lukasz Jobczyk a9269939f6 fix: Defer MOCS to PAT
Related-To: NEO-10556

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-05-27 10:26:26 +02:00
Compute-Runtime-Validation ce0ccacef6 Revert "fix: Defer MOCS to PAT"
This reverts commit 6c75ec3116.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-25 06:22:22 +02:00
Lukasz Jobczyk 6c75ec3116 fix: Defer MOCS to PAT
Related-To: NEO-10556

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-05-24 15:30:49 +02:00
Mateusz Hoppe a2a154ba4a fix: initialize context with first submission in heapless execution
- early return from initializeDeviceWithFirstSubmission if flush was
already called

Related-To: NEO-11219

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-05-24 14:56:04 +02:00
Mateusz Hoppe e9b81037a9 refactor: add heaplessStateInitialized flag
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-05-15 15:39:52 +02:00
Compute-Runtime-Validation ebf09afc42 Revert "performance: Stop direct submission before removing host ptrs"
This reverts commit 4fd219cdbb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-13 10:10:51 +02:00
Kamil Kopryk 1707434591 refactor: remove unused variable
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-05-09 15:41:50 +02:00
Maciej Plewka e39893485c fix: add cache flush as dependency for bcs ccs synchronization
Related-To: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-09 13:43:39 +02:00
Compute-Runtime-Validation 902b611179 Revert "fix: Fix front end programming for cooperative dispatch"
This reverts commit 36ddfaaf4d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-08 11:14:37 +02:00
Bartosz Dunajski e5882e0d31 feature: pass GraphicsAllocation to fence wait
Related-To: NEO-8179

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-07 17:59:28 +02:00
Lukasz Jobczyk 4fd219cdbb performance: Stop direct submission before removing host ptrs
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-05-07 13:25:09 +02:00
Mateusz Hoppe d35d8727e5 fix: use primaryCsr allocations when csr has primaryCsr set
- global fence allocation
- global stateless heap allocation
- preemption allocation
- debug surface allocation

all above are shared from primary csr

Related-To: NEO-7824


Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-05-07 02:41:24 +02:00
Lukasz Jobczyk c1004b77bf performance: Limit tlb flush in state cache flush to wddm
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-05-06 12:45:26 +02:00
Compute-Runtime-Validation 8342c0ae2f Revert "fix: add cache flush as dependency for bcs ccs synchronization"
This reverts commit 5e57bb2a32.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-01 03:05:47 +02:00
Maciej Plewka 5e57bb2a32 fix: add cache flush as dependency for bcs ccs synchronization
Related-To: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-04-30 11:57:49 +02:00
Bartosz Dunajski 806da85ec6 refactor: prework to pass interrupt hint
Related-To: NEO-8179

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-29 11:14:53 +02:00
Lukasz Jobczyk 9df35ac9bb fix: Add tlb invalidation to render state cache flush
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-04-26 10:37:51 +02:00
Lukasz Jobczyk 36ddfaaf4d fix: Fix front end programming for cooperative dispatch
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-04-25 12:55:46 +02:00
Mateusz Hoppe d38ac4806b feature: global bindless sampler offsets
- use global heap base for DSH when bindlessHeapsHelper enabled

Related-To: NEO-10505

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-04-24 21:12:17 +02:00
Andrzej Koska ae139aeffd refactor: Passing information about the engine
Extension of the interface with information
about the engine type passed to the function

Related-To: NEO-10678

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-04-22 13:33:56 +02:00
Bartosz Dunajski e47b88f6e4 feature: program fence during bcs tag update
Related-To: NEO-10417

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-11 13:43:04 +02:00
Mateusz Jablonski cb2b572e94 feature: add support for null aub mode
In this mode an AUB csr will be created, but no aub file will be created

Related-To: NEO-11097
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-04-09 16:59:42 +02:00
Bartosz Dunajski ef35c5cb62 feature: program device to host fence in ocl path
Related-To: NEO-10417

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-08 14:57:16 +02:00
Zbigniew Zdanowicz 73d558058c feature: add heapless and global stateless scratch address patching
Related-To: NEO-10381

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-04-03 17:04:35 +02:00
Kamil Kopryk 4eae28bd64 feature: introduce heapless state init in L0
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-04-02 12:34:53 +02:00
Bartosz Dunajski fce016ee7a refactor: add new semaphore param
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-03-26 15:46:47 +01:00
Mateusz Jablonski 78a4a92b44 refactor: reorder members to reduce internal padding in structs
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-25 15:50:00 +01:00
Dominik Dabek 2b964254d6 performance: debug key for adjust ULLS on battery
ULLS controller timeout settings will be adjusted based on ac line
status and lowest queue throttle from submissions.

Lowest queue throttle is reset when controller stops ULLS.

Related-To: NEO-10800

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-03-22 14:24:00 +01:00
Aravind Gopalakrishnan 04b99de4d6 refactor: Force tlb flush during TC after copy
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-03-21 07:25:46 +01:00
Mrozek, Michal f71f6d2b72 refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 18:18:55 +01:00
Mrozek, Michal 13a347cbde refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 15:10:06 +01:00
Kamil Kopryk 168445784e feature: introduce states programming at driver init heapless ocl
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-03-08 12:29:44 +01:00
Mrozek, Michal c72b9ec448 refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-08 08:53:13 +01:00
Mrozek, Michal 10313b7b84 refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-03-07 18:50:16 +01:00
Michal Mrozek e08668b982 refactor: remove not needed code.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-03-07 13:02:29 +01:00
Filip Hazubski d25026b263 refactor: Add getTotalMemBankSize function to ReleaseHelper
Minor refactor of ULTs to not use hard coded banks size.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-03-06 09:53:56 +01:00
Filip Hazubski 5f7e56e78b refactor: Unify aub config generation
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-03-05 11:09:34 +01:00
Mateusz Jablonski 80aa55a3cb fix: don't program dummy blit prior to MI_FLUSH_DW without postsync
add missing dummy blits before MI_FLUSH_DW with postsync

Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-03-01 12:36:13 +01:00
Compute-Runtime-Validation 5fce046877 Revert "fix: don't program dummy blit prior to MI_FLUSH_DW without postsync"
This reverts commit d796fb559d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-03-01 01:43:35 +01:00
Mateusz Jablonski d796fb559d fix: don't program dummy blit prior to MI_FLUSH_DW without postsync
add missing dummy blits before MI_FLUSH_DW with postsync

Related-To: NEO-9996
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-02-29 13:30:09 +01:00
Michal Mrozek 64232ec370 fix: choose proper csr for low priority immediate command lists
Resolves: NEO-10168

Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-02-28 12:45:02 +01:00
Kamil Kopryk 7729eb8127 refactor: move flush task submission to a function
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-23 18:11:57 +01:00
Kamil Kopryk 10d610d163 refactor: move process barrier with post sync to a function
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-23 09:32:48 +01:00
Kamil Kopryk 01a721df3e refactor: move preparing flush task batch buffer to a function
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-23 09:18:58 +01:00
Kamil Kopryk a4ed483238 refactor: move update task count and completion stamp to function
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-22 14:56:25 +01:00
Kamil Kopryk 7b689aa464 refactor: move handle batched dispatch implicit flush code to function
Related-To: NEO-7824
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-02-22 14:56:10 +01:00
Dunajski, Bartosz f31c88d05f fix: size estimation for tlb flush before copy
Related-To: HSD-18036669673

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-19 13:35:35 +01:00
Dunajski, Bartosz 88c5872682 feature: debug flag to flush tlb before copy
Related-To: HSD-18036669673

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2024-02-14 20:05:57 +01:00
Fabian Zwolinski c51b656d2c fix: request instruction cache invalidation on module destroy
Invalidation is requested on both linux and windows,
on Csr's that used Isa allocation.

Related-To: NEO-10045
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2024-01-29 16:00:34 +01:00
Kamil Kopryk 87d13fcb6e fix: fix scratch programming in heapless mode
Related-To: NEO-10107

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-23 15:26:56 +01:00
Mateusz Jablonski a697a3f718 refactor: create new members for storing spill and private memory in scratch
rename private scratch space into scratch space slot 1 as it can be generic

Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-23 12:42:25 +01:00
Compute-Runtime-Validation f9f9035b95 Revert "refactor: create new members for storing spill and private memory in ...
This reverts commit 87eb5f554a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-01-23 09:13:00 +01:00
Mateusz Jablonski 87eb5f554a refactor: create new members for storing spill and private memory in scratch
rename private scratch space into scratch space slot 1 as it can be generic

Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-22 19:48:48 +01:00
Kamil Kopryk 3a09ea3436 Revert "refactor: reduce batch buffer struct size"
This reverts commit 97ecf58577.

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-19 16:06:57 +01:00
Kamil Kopryk d7d3a6b76d fix: use full address to sip if needed
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-19 12:28:46 +01:00
Mateusz Jablonski a673347378 fix: don't allocate preemption surface for BCS engines
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-01-19 07:59:24 +01:00
Kamil Kopryk 696be48cd4 refactor: add preemption xe only file to xehpg and xehpc sources
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-17 18:32:37 +01:00
Kamil Kopryk 97ecf58577 refactor: reduce batch buffer struct size
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-01-17 14:04:00 +01:00
Mateusz Hoppe 31edeb0765 feature: set HP flag when creating HardwareContextController
- for highPriority context

Related-To: NEO-7824

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-01-10 12:46:29 +01:00
Lukasz Jobczyk d472cf0a5d fix: Restore handle fence completion when reusing cmd buffer
Related-To: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-01-04 12:11:39 +01:00
Mateusz Hoppe 31e9b5e9fa feature: add support for secondary contexts in group
Related-To: NEO-7824

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-12-28 13:31:08 +01:00
Mateusz Jablonski de93bc6928 refactor: correct naming of enum class constants 10/n
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 11:30:39 +01:00
Mateusz Jablonski a6458433dc refactor: correct naming of enum class constants 9/n
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 10:36:25 +01:00
Mateusz Jablonski dd1b9d6abc refactor: correct naming of enum class constants 8/n
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-19 08:18:18 +01:00
Mateusz Jablonski 739d181026 refactor: correct naming of enum class constants 6/n
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 14:48:52 +01:00
Mateusz Jablonski 432142c574 refactor: correct naming of enum class constants 4/n
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 08:08:51 +01:00
Mateusz Jablonski 01dd503e47 refactor: correct naming of MemoryPool enum values
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-13 07:51:39 +01:00
Mateusz Jablonski b182917d9d refactor: correct naming of allocation types
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-11 16:23:37 +01:00
Mateusz Jablonski da957d1a37 refactor: correct naming of enum class constants 1/n
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-05 14:26:42 +01:00
Mateusz Jablonski 6849d33326 refactor: remove redundant definitions KB/MB/GB
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-05 08:55:18 +01:00
Mateusz Jablonski c48e57e2de refactor: rename register offsets and constants
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-04 18:31:39 +01:00
Mateusz Jablonski 9a28317ac4 refactor: rename variables from MemoryBanks namespace
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-04 16:46:46 +01:00
Mateusz Jablonski e57d372608 refactor: correct naming of enum class constants
EngineGroupType
BlitDirection
PostBlitMode
WaitStatus

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 11:53:51 +01:00
Mateusz Jablonski c3ac7b78bd refactor: correct variable naming
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-12-01 02:18:46 +01:00
Mateusz Jablonski c9664e6bad refactor: rename global debug manager to debugManager
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-30 13:00:59 +01:00
Mateusz Jablonski 36194c4e7d refactor: correct variable namings
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-29 23:49:03 +01:00
Fabian Zwolinski 06ec4c3a82 fix: aub residency in HW+AUB mode
Related-To: NEO-9230
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-11-29 11:54:59 +01:00
Mateusz Hoppe 83ac95d293 fix: L0 - remove synchronization with events on appends in tbx mode
Related-To: NEO-9400

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-11-27 10:39:55 +01:00
Kamil Kopryk 982c2d665b feature: add programming heapless mode in SBA
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-11-24 15:29:19 +01:00
Mateusz Jablonski 220b1e10cf refactor: remove copy/move ctors when class has user-defined dtor
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-11-24 13:13:07 +01:00
Kamil Kopryk ce7298d512 feature: Add heapless mode programming in ocl
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-11-24 12:53:39 +01:00
Maciej Plewka 560b588cbe fix: Update residency fence value after ring buffer switch
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-11-17 20:46:14 +01:00
Dominik Dabek 6562828095 performance: prealloc internal heap on mtl
Preallocate 1 internal heap allocation per csr on mtl

Related-To: NEO-8152

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-11-17 13:36:21 +01:00
Dominik Dabek 7a6fc209dd performance: prealloc cmdbuffer on mtl
Preallocate 2 command buffers allocations per command queue initialized
on MTL.

Related-To: NEO-8152

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-11-08 17:21:13 +01:00
Michal Mrozek 80e3632c41 performance: do not free deferred allocations when not needed
Level Zero doesn't use deferred allocations so no point in paying the
price to check for them in cleanAllocationList.

Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2023-10-27 18:02:43 +02:00
Dominik Dabek 39cf653959 performance(ocl): cmd buffer prealloc per cmdqueue
Add mechanism to preallocate cmd buffer allocations in command stream
receiver reusable allocations list per command queue initialized.

This should limit additional allocations during hot loop.

Needs to be enabled in subsequent commits by setting product helper
method.

Related-To: NEO-8152

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-10-27 16:56:29 +02:00
Mateusz Hoppe 5d572b9c8f feature: allow freeing memory in aubstream
Related-To: NEO-2707

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-10-26 17:16:23 +02:00
Compute-Runtime-Validation fca2159430 Revert "fix: if device hierarchy is flat then getSubDevicesCount return 1u"
This reverts commit cb0bb57f49.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-10-26 15:40:29 +02:00
Baj, Tomasz cb0bb57f49 fix: if device hierarchy is flat then getSubDevicesCount return 1u
Related-To: NEO-9167

Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-10-25 15:51:52 +02:00
Jablonski, Mateusz ac5f64f5c6 fix: fix compilation error in clang on Windows (2/n)
Signed-off-by: Jablonski, Mateusz <mateusz.jablonski@intel.com>
2023-10-24 15:59:06 +02:00
Zbigniew Zdanowicz 0b42510f41 fix: add l1 cache flush to barrier operations when required
Related-To: NEO-8395

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-10-23 18:18:22 +02:00
Mateusz Hoppe 52b0f32688 fix: offset cpu address when writing chunk in simulated csr
- not only gpuAddress is offset but also cpu address with data needs
to be offset while writing memory.

Related-To: GSD-6604

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-10-23 17:01:25 +02:00
Dunajski, Bartosz 25195ebc96 fix: capability to write memory chunk in aub/tbx mode
Related-To: GSD-6604

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-10-19 19:13:11 +02:00
Mateusz Jablonski fd7c750cf7 fix: ensure local variable address is not exposed outside of function
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-06 15:59:16 +02:00
Mateusz Jablonski 72182c28e9 refactor: add NonCopyableOrMovableClass to classes that don't need copy ctors
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-05 13:11:01 +02:00
Mateusz Jablonski fc508212de refactor: pass big parameters as reference instead of by value
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-04 14:53:13 +02:00
Mateusz Jablonski 6b3ad5b3e3 fix: add unrecoverable in case of nullptr
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-04 14:38:05 +02:00
Zbigniew Zdanowicz 0a99384936 fix: set flushed task count for all cases of post sync task count operations
- set monitor fence dispatch for all cases task count post sync operation
- stand alone flush task count will not happen when already flushed and so
monitor fence
- monitor fence then must be dispatched together with task count post sync

Related-To: NEO-8395

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-10-03 12:31:45 +02:00
Mateusz Jablonski 09044dfbaa refactor: remove not needed code
Related-To: NEO-7527

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-27 14:35:49 +02:00
Dominik Dabek eebf2bbd26 performance(ocl): timestamp packet count per gfx
Add support for different timestamp packet counts per gfx family.
Change all packet counts to 1 except for xe-hpc.

Related-To: NEO-8154

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-09-25 20:34:58 +02:00
Dunajski, Bartosz 80d0c74605 fix: track registered CSR clients 2
Related-To: NEO-8884

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-25 09:35:55 +02:00
Compute-Runtime-Validation ade538ce54 Revert "fix: track registered CSR clients"
This reverts commit 53f635e392.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-24 10:07:26 +02:00
Zbigniew Zdanowicz d725d454ea performance: emit fence on immediate copy engine only for blocking calls
- this change handles level zero immediate command lists on copy engine
- monitor fence will be dispatched for blocking calls
- asynchronous mode will dispatch monitor fence only on host synchronization

Related-To: NEO-8395

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-09-22 13:14:37 +02:00
Dunajski, Bartosz 53f635e392 fix: track registered CSR clients
In L0 it's not possible to track object relations. For example CmdList
may be removed before Event.
In such case, Event needs to safely skip unregister call, without
accessing CmdList/CmdQueue object.

Related-To: NEO-8884

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-22 09:42:55 +02:00
Mateusz Hoppe 69f5ca6345 feature: bindless addressing - flush state cache after reusing SS slot
- when Surface State is reused for new resource, State Cache needs to be
invalidated

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-09-20 12:53:32 +02:00
Zbigniew Zdanowicz 40e169f8e2 refactor: add batch buffer attribute for explicit monitor fence dispatch
Related-To: NEO-8395

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-09-20 12:18:32 +02:00
Dominik Dabek 1b7e178b25 performance(ocl): program barrier pc in taskStream
Program barrier to task stream, before next enqueue kernel.
This will reduce the number of batch buffer starts for sequences of
enqueue, barrier, enqueue, ... .

Related-To: NEO-8147

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-09-19 11:48:02 +02:00
Dunajski, Bartosz 7562842a58 refactor: remove LogicalStateHelper
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-13 10:29:53 +02:00
Mrozek, Michal d9f938f3db refactor: remove not needed code
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2023-09-12 14:25:04 +02:00
Dunajski, Bartosz 6648065703 feature: add indirect semaphore mode
Related-To: NEO-8242

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-12 13:15:51 +02:00
Dunajski, Bartosz def3f2e9ad refactor: improve semaphore programming
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-12 11:24:11 +02:00
Compute-Runtime-Validation b5e9c10f64 Revert "performance(ocl): program barrier pc in taskStream"
This reverts commit 839c2d6737.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-12 01:32:28 +02:00
Dominik Dabek 839c2d6737 performance(ocl): program barrier pc in taskStream
Program barrier immediately to task stream.
This will reduce the number of batch buffer starts.

Related-To: NEO-8147

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-09-11 13:23:26 +02:00
Maciej Plewka 3b3e17e738 performance: Use vector for private allocs to reuse
Related-To: HSD-18033105655, HSD-18033153203

Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-09-04 13:34:38 +02:00
Maciej Plewka 5807d512b3 fix: Reuse private allocations during cmdList dispatch
Related-To: NEO-8201

Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-08-31 14:40:55 +02:00
Zbigniew Zdanowicz 54fce64583 fix: set the indirect object address stream property when not set already
Related-To: NEO-8281

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-08-30 15:51:24 +02:00
Mateusz Hoppe f5cb7df7cd fix: do not download event allocation in TBX mode
- only download when allocation was used - indicated by taskCount
Resolves: NEO-8312

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-08-29 16:27:33 +02:00
Zbigniew Zdanowicz 873b3d4241 fix: do not process scratch space when no surface heap pointer provided
Related-To: NEO-8281

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-08-22 17:44:35 +02:00
Lukasz Jobczyk 6fcb0f1407 fix: flush dc when blocking flag set
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-08-17 10:10:56 +02:00
Fabian Zwolinski 6fca8ee195 refactor: Remove SourceLevelDebugger
Removed:
- SourceLevelDebugger (with tests)
- DebuggerLibrary
- DebuggerLibraryRestore
- debuggerSupported field from hwInfo.capabilityTable
- HasSourceLevelDebuggerSupport matcher
- ExperimentalEnableSourceLevelDebugger debug var
- EnableMockSourceLevelDebugger debug var
- DebuggerOptDisable debug var
- lib_names.h.in file
- third_party/source_level_debugger/igfx_debug_interchange_types.h

Related-To: NEO-7213
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-08-10 11:14:02 +02:00
Lukasz Jobczyk fc1f45b630 feature: Dispatch monitor fence from wait
Resolves: NEO-8240
Related-To: NEO-8067

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-08-08 16:42:27 +02:00
Mateusz Hoppe bcba74f839 feature: bindless global allocator with scratch
- allocate SSH in cmdContainer when scratch allocation used with
private heaps
- scratch SurfaceStates are addressed relative to
SurfaceStateBaseAddress and have to be placed on SSH
- remove not used SCRATCH_SSH heap type from bindlessHeapHelper

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-08-03 13:24:59 +02:00
Dominik Dabek 12ab74fe96 performance: flag to program barrier in task cs
Add debug flag ProgramBarrierInCommandStreamTask to program barrier
pipe control in task command stream instead of csr command stream.
This will reduce the number of batch buffer starts.

Related-To: NEO-8147

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-08-02 10:26:34 +02:00
Mateusz Hoppe 997b599168 fix(debugger): pass correct sipAllocation to makeResident
- sipAllocation for context must be resident in Offline mode

Related-To: NEO-7630

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-07-28 20:51:12 +02:00
Dunajski, Bartosz cd9ad1f04c fix: decanonize GPU VA during TBX memory read.
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-07-26 19:44:19 +02:00
Dunajski, Bartosz a241099dff feature: use WaitUserFence on zeEventHostSynchronize
Disabled by default. Debug flag is required.

Related-To: NEO-7966

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-07-26 19:41:09 +02:00
Dunajski, Bartosz 2c50fd9486 fix: waiting for completion in TBX mode
- use testTaskCountReady method to check TaskCount value
- download all allocations when TaskCount is ready

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-07-25 11:54:09 +02:00
Baj, Tomasz 4ca213d4d7 fix: commandContainer is nullptr in LinearStream for immediate cmdList
Related-To: GSD-4084
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>
2023-07-24 15:06:18 +02:00
Lukasz Jobczyk b06f605895 fix: Correctly flush DC when reuse heap
Related-To: NEO-8067

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-07-20 14:50:37 +02:00
Dominik Dabek 622a3ed89c performance(ocl): flag to not dcFlush on no event
If waitForBarrier is not passed outEvent then do
dcFlush on the next synchronize call.

Related-To: NEO-8147

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-07-18 15:38:54 +02:00
Zbigniew Zdanowicz 1c0285a156 fix: correct alignment of per thread scratch size
Related-To: NEO-5288

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-07-12 12:31:47 +02:00
Zbigniew Zdanowicz 3f7269d401 fix: make sip state programming once for all level zero command queues
Related-To: NEO-7828

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-07-11 11:34:21 +02:00