Mateusz Jablonski
c3ac7b78bd
refactor: correct variable naming
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-01 02:18:46 +01:00
Mateusz Jablonski
c9664e6bad
refactor: rename global debug manager to debugManager
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-11-30 13:00:59 +01:00
Mateusz Jablonski
36194c4e7d
refactor: correct variable namings
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-11-29 23:49:03 +01:00
Fabian Zwolinski
06ec4c3a82
fix: aub residency in HW+AUB mode
...
Related-To: NEO-9230
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com >
2023-11-29 11:54:59 +01:00
Mateusz Hoppe
83ac95d293
fix: L0 - remove synchronization with events on appends in tbx mode
...
Related-To: NEO-9400
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-11-27 10:39:55 +01:00
Kamil Kopryk
982c2d665b
feature: add programming heapless mode in SBA
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2023-11-24 15:29:19 +01:00
Mateusz Jablonski
220b1e10cf
refactor: remove copy/move ctors when class has user-defined dtor
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-11-24 13:13:07 +01:00
Kamil Kopryk
ce7298d512
feature: Add heapless mode programming in ocl
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2023-11-24 12:53:39 +01:00
Maciej Plewka
560b588cbe
fix: Update residency fence value after ring buffer switch
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2023-11-17 20:46:14 +01:00
Dominik Dabek
6562828095
performance: prealloc internal heap on mtl
...
Preallocate 1 internal heap allocation per csr on mtl
Related-To: NEO-8152
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-11-17 13:36:21 +01:00
Dominik Dabek
7a6fc209dd
performance: prealloc cmdbuffer on mtl
...
Preallocate 2 command buffers allocations per command queue initialized
on MTL.
Related-To: NEO-8152
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-11-08 17:21:13 +01:00
Michal Mrozek
80e3632c41
performance: do not free deferred allocations when not needed
...
Level Zero doesn't use deferred allocations so no point in paying the
price to check for them in cleanAllocationList.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2023-10-27 18:02:43 +02:00
Dominik Dabek
39cf653959
performance(ocl): cmd buffer prealloc per cmdqueue
...
Add mechanism to preallocate cmd buffer allocations in command stream
receiver reusable allocations list per command queue initialized.
This should limit additional allocations during hot loop.
Needs to be enabled in subsequent commits by setting product helper
method.
Related-To: NEO-8152
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-10-27 16:56:29 +02:00
Mateusz Hoppe
5d572b9c8f
feature: allow freeing memory in aubstream
...
Related-To: NEO-2707
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-10-26 17:16:23 +02:00
Compute-Runtime-Validation
fca2159430
Revert "fix: if device hierarchy is flat then getSubDevicesCount return 1u"
...
This reverts commit cb0bb57f49 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-10-26 15:40:29 +02:00
Baj, Tomasz
cb0bb57f49
fix: if device hierarchy is flat then getSubDevicesCount return 1u
...
Related-To: NEO-9167
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com >
2023-10-25 15:51:52 +02:00
Jablonski, Mateusz
ac5f64f5c6
fix: fix compilation error in clang on Windows (2/n)
...
Signed-off-by: Jablonski, Mateusz <mateusz.jablonski@intel.com >
2023-10-24 15:59:06 +02:00
Zbigniew Zdanowicz
0b42510f41
fix: add l1 cache flush to barrier operations when required
...
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-10-23 18:18:22 +02:00
Mateusz Hoppe
52b0f32688
fix: offset cpu address when writing chunk in simulated csr
...
- not only gpuAddress is offset but also cpu address with data needs
to be offset while writing memory.
Related-To: GSD-6604
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-10-23 17:01:25 +02:00
Dunajski, Bartosz
25195ebc96
fix: capability to write memory chunk in aub/tbx mode
...
Related-To: GSD-6604
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-10-19 19:13:11 +02:00
Mateusz Jablonski
fd7c750cf7
fix: ensure local variable address is not exposed outside of function
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-06 15:59:16 +02:00
Mateusz Jablonski
72182c28e9
refactor: add NonCopyableOrMovableClass to classes that don't need copy ctors
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-05 13:11:01 +02:00
Mateusz Jablonski
fc508212de
refactor: pass big parameters as reference instead of by value
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-04 14:53:13 +02:00
Mateusz Jablonski
6b3ad5b3e3
fix: add unrecoverable in case of nullptr
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-04 14:38:05 +02:00
Zbigniew Zdanowicz
0a99384936
fix: set flushed task count for all cases of post sync task count operations
...
- set monitor fence dispatch for all cases task count post sync operation
- stand alone flush task count will not happen when already flushed and so
monitor fence
- monitor fence then must be dispatched together with task count post sync
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-10-03 12:31:45 +02:00
Mateusz Jablonski
09044dfbaa
refactor: remove not needed code
...
Related-To: NEO-7527
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-09-27 14:35:49 +02:00
Dominik Dabek
eebf2bbd26
performance(ocl): timestamp packet count per gfx
...
Add support for different timestamp packet counts per gfx family.
Change all packet counts to 1 except for xe-hpc.
Related-To: NEO-8154
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-25 20:34:58 +02:00
Dunajski, Bartosz
80d0c74605
fix: track registered CSR clients 2
...
Related-To: NEO-8884
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-25 09:35:55 +02:00
Compute-Runtime-Validation
ade538ce54
Revert "fix: track registered CSR clients"
...
This reverts commit 53f635e392 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-24 10:07:26 +02:00
Zbigniew Zdanowicz
d725d454ea
performance: emit fence on immediate copy engine only for blocking calls
...
- this change handles level zero immediate command lists on copy engine
- monitor fence will be dispatched for blocking calls
- asynchronous mode will dispatch monitor fence only on host synchronization
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-09-22 13:14:37 +02:00
Dunajski, Bartosz
53f635e392
fix: track registered CSR clients
...
In L0 its not possible to track objects relations. For example CmdList
may be removed before Event.
In such case, Event needs to safely skip unregister call, without
accessing CmdList/CmdQueue object.
Related-To: NEO-8884
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-22 09:42:55 +02:00
Mateusz Hoppe
69f5ca6345
feature: bindless addressing - flush state cache after reusing SS slot
...
- when Surface State is reused for new resource, State Cache needs to be
invalidated
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-09-20 12:53:32 +02:00
Zbigniew Zdanowicz
40e169f8e2
refactor: add batch buffer attribute for explicit monitor fence dispatch
...
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-09-20 12:18:32 +02:00
Dominik Dabek
1b7e178b25
performance(ocl): program barrier pc in taskStream
...
Program barrier to task stream, before next enqueue kernel.
This will reduce the number of batch buffer starts for sequences of
enqueue, barrier, enqueue, ... .
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-19 11:48:02 +02:00
Dunajski, Bartosz
7562842a58
refactor: remove LogicalStateHelper
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-13 10:29:53 +02:00
Mrozek, Michal
d9f938f3db
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com >
2023-09-12 14:25:04 +02:00
Dunajski, Bartosz
6648065703
feature: add indirect semaphore mode
...
Related-To: NEO-8242
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-12 13:15:51 +02:00
Dunajski, Bartosz
def3f2e9ad
refactor: improve semaphore programming
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-12 11:24:11 +02:00
Compute-Runtime-Validation
b5e9c10f64
Revert "performance(ocl): program barrier pc in taskStream"
...
This reverts commit 839c2d6737 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-12 01:32:28 +02:00
Dominik Dabek
839c2d6737
performance(ocl): program barrier pc in taskStream
...
Program barrier immediately to task stream.
This will reduce the number of batch buffer starts.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-11 13:23:26 +02:00
Maciej Plewka
3b3e17e738
performance: Use vector for private allocs to reuse
...
Related-To: HSD-18033105655, HSD-18033153203
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2023-09-04 13:34:38 +02:00
Maciej Plewka
5807d512b3
fix: Reuse private allocations during cmdList dispatch
...
Related-To: NEO-8201
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2023-08-31 14:40:55 +02:00
Zbigniew Zdanowicz
54fce64583
fix: set the indirect object address stream property when not set already
...
Related-To: NEO-8281
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-08-30 15:51:24 +02:00
Mateusz Hoppe
f5cb7df7cd
fix: do not download event allocation in TBX mode
...
- only download when allocation was used - inidcated by taskCount
Resolves: NEO-8312
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-08-29 16:27:33 +02:00
Zbigniew Zdanowicz
873b3d4241
fix: do not process scratch space when no surface heap pointer provided
...
Related-To: NEO-8281
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-08-22 17:44:35 +02:00
Lukasz Jobczyk
6fcb0f1407
fix: flush dc when blocking flag set
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-08-17 10:10:56 +02:00
Fabian Zwolinski
6fca8ee195
refactor: Remove SourceLevelDebugger
...
Removed:
- SourceLevelDebugger (with tests)
- DebuggerLibrary
- DebuggerLibraryRestore
- debuggerSupported field from hwInfo.capabilityTable
- HasSourceLevelDebuggerSupport matcher
- ExperimentalEnableSourceLevelDebugger debug var
- EnableMockSourceLevelDebugger debug var
- DebuggerOptDisable debug var
- lib_names.h.in file
- third_party/source_level_debugger/igfx_debug_interchange_types.h
Related-To: NEO-7213
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com >
2023-08-10 11:14:02 +02:00
Lukasz Jobczyk
fc1f45b630
feature: Dispatch monitor fence from wait
...
Resolves: NEO-8240
Related-To: NEO-8067
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-08-08 16:42:27 +02:00
Mateusz Hoppe
bcba74f839
feature: bindless global allocator with scratch
...
- allocate SSH in cmdContainer when scratch allocation used with
private heaps
- scratch SurfaceStates are addressed relative to
SurfaceStateBaseAddress and have to be placed on SSH
- remove not used SCRATCH_SSH heap type from bindelssHeapHelper
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-08-03 13:24:59 +02:00
Dominik Dabek
12ab74fe96
performance: flag to program barrier in task cs
...
Add debug flag ProgramBarrierInCommandStreamTask to program barrier
pipe control in task command stream instead of csr command stream.
This will reduce the number of batch buffer starts.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-08-02 10:26:34 +02:00