Mateusz Hoppe
5d572b9c8f
feature: allow freeing memory in aubstream
...
Related-To: NEO-2707
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-10-26 17:16:23 +02:00
Compute-Runtime-Validation
fca2159430
Revert "fix: if device hierarchy is flat then getSubDevicesCount return 1u"
...
This reverts commit cb0bb57f49 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-10-26 15:40:29 +02:00
Baj, Tomasz
cb0bb57f49
fix: if device hierarchy is flat then getSubDevicesCount return 1u
...
Related-To: NEO-9167
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com >
2023-10-25 15:51:52 +02:00
Jablonski, Mateusz
ac5f64f5c6
fix: fix compilation error in clang on Windows (2/n)
...
Signed-off-by: Jablonski, Mateusz <mateusz.jablonski@intel.com >
2023-10-24 15:59:06 +02:00
Zbigniew Zdanowicz
0b42510f41
fix: add l1 cache flush to barrier operations when required
...
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-10-23 18:18:22 +02:00
Mateusz Hoppe
52b0f32688
fix: offset cpu address when writing chunk in simulated csr
...
- not only gpuAddress is offset but also cpu address with data needs
to be offset while writing memory.
Related-To: GSD-6604
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-10-23 17:01:25 +02:00
Dunajski, Bartosz
25195ebc96
fix: capability to write memory chunk in aub/tbx mode
...
Related-To: GSD-6604
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-10-19 19:13:11 +02:00
Mateusz Jablonski
fd7c750cf7
fix: ensure local variable address is not exposed outside of function
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-06 15:59:16 +02:00
Mateusz Jablonski
72182c28e9
refactor: add NonCopyableOrMovableClass to classes that don't need copy ctors
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-05 13:11:01 +02:00
Mateusz Jablonski
fc508212de
refactor: pass big parameters as reference instead of by value
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-04 14:53:13 +02:00
Mateusz Jablonski
6b3ad5b3e3
fix: add unrecoverable in case of nullptr
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-04 14:38:05 +02:00
Zbigniew Zdanowicz
0a99384936
fix: set flushed task count for all cases of post sync task count operations
...
- set monitor fence dispatch for all cases task count post sync operation
- stand alone flush task count will not happen when already flushed and so
monitor fence
- monitor fence then must be dispatched together with task count post sync
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-10-03 12:31:45 +02:00
Mateusz Jablonski
09044dfbaa
refactor: remove not needed code
...
Related-To: NEO-7527
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-09-27 14:35:49 +02:00
Dominik Dabek
eebf2bbd26
performance(ocl): timestamp packet count per gfx
...
Add support for different timestamp packet counts per gfx family.
Change all packet counts to 1 except for xe-hpc.
Related-To: NEO-8154
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-25 20:34:58 +02:00
Dunajski, Bartosz
80d0c74605
fix: track registered CSR clients 2
...
Related-To: NEO-8884
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-25 09:35:55 +02:00
Compute-Runtime-Validation
ade538ce54
Revert "fix: track registered CSR clients"
...
This reverts commit 53f635e392 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-24 10:07:26 +02:00
Zbigniew Zdanowicz
d725d454ea
performance: emit fence on immediate copy engine only for blocking calls
...
- this change handles level zero immediate command lists on copy engine
- monitor fence will be dispatched for blocking calls
- asynchronous mode will dispatch monitor fence only on host synchronization
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-09-22 13:14:37 +02:00
Dunajski, Bartosz
53f635e392
fix: track registered CSR clients
...
In L0 its not possible to track objects relations. For example CmdList
may be removed before Event.
In such case, Event needs to safely skip unregister call, without
accessing CmdList/CmdQueue object.
Related-To: NEO-8884
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-22 09:42:55 +02:00
Mateusz Hoppe
69f5ca6345
feature: bindless addressing - flush state cache after reusing SS slot
...
- when Surface State is reused for new resource, State Cache needs to be
invalidated
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-09-20 12:53:32 +02:00
Zbigniew Zdanowicz
40e169f8e2
refactor: add batch buffer attribute for explicit monitor fence dispatch
...
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-09-20 12:18:32 +02:00
Dominik Dabek
1b7e178b25
performance(ocl): program barrier pc in taskStream
...
Program barrier to task stream, before next enqueue kernel.
This will reduce the number of batch buffer starts for sequences of
enqueue, barrier, enqueue, ... .
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-19 11:48:02 +02:00
Dunajski, Bartosz
7562842a58
refactor: remove LogicalStateHelper
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-13 10:29:53 +02:00
Mrozek, Michal
d9f938f3db
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com >
2023-09-12 14:25:04 +02:00
Dunajski, Bartosz
6648065703
feature: add indirect semaphore mode
...
Related-To: NEO-8242
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-12 13:15:51 +02:00
Dunajski, Bartosz
def3f2e9ad
refactor: improve semaphore programming
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-12 11:24:11 +02:00
Compute-Runtime-Validation
b5e9c10f64
Revert "performance(ocl): program barrier pc in taskStream"
...
This reverts commit 839c2d6737 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-12 01:32:28 +02:00
Dominik Dabek
839c2d6737
performance(ocl): program barrier pc in taskStream
...
Program barrier immediately to task stream.
This will reduce the number of batch buffer starts.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-11 13:23:26 +02:00
Maciej Plewka
3b3e17e738
performance: Use vector for private allocs to reuse
...
Related-To: HSD-18033105655, HSD-18033153203
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2023-09-04 13:34:38 +02:00
Maciej Plewka
5807d512b3
fix: Reuse private allocations during cmdList dispatch
...
Related-To: NEO-8201
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2023-08-31 14:40:55 +02:00
Zbigniew Zdanowicz
54fce64583
fix: set the indirect object address stream property when not set already
...
Related-To: NEO-8281
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-08-30 15:51:24 +02:00
Mateusz Hoppe
f5cb7df7cd
fix: do not download event allocation in TBX mode
...
- only download when allocation was used - inidcated by taskCount
Resolves: NEO-8312
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-08-29 16:27:33 +02:00
Zbigniew Zdanowicz
873b3d4241
fix: do not process scratch space when no surface heap pointer provided
...
Related-To: NEO-8281
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-08-22 17:44:35 +02:00
Lukasz Jobczyk
6fcb0f1407
fix: flush dc when blocking flag set
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-08-17 10:10:56 +02:00
Fabian Zwolinski
6fca8ee195
refactor: Remove SourceLevelDebugger
...
Removed:
- SourceLevelDebugger (with tests)
- DebuggerLibrary
- DebuggerLibraryRestore
- debuggerSupported field from hwInfo.capabilityTable
- HasSourceLevelDebuggerSupport matcher
- ExperimentalEnableSourceLevelDebugger debug var
- EnableMockSourceLevelDebugger debug var
- DebuggerOptDisable debug var
- lib_names.h.in file
- third_party/source_level_debugger/igfx_debug_interchange_types.h
Related-To: NEO-7213
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com >
2023-08-10 11:14:02 +02:00
Lukasz Jobczyk
fc1f45b630
feature: Dispatch monitor fence from wait
...
Resolves: NEO-8240
Related-To: NEO-8067
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-08-08 16:42:27 +02:00
Mateusz Hoppe
bcba74f839
feature: bindless global allocator with scratch
...
- allocate SSH in cmdContainer when scratch allocation used with
private heaps
- scratch SurfaceStates are addressed relative to
SurfaceStateBaseAddress and have to be placed on SSH
- remove not used SCRATCH_SSH heap type from bindelssHeapHelper
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-08-03 13:24:59 +02:00
Dominik Dabek
12ab74fe96
performance: flag to program barrier in task cs
...
Add debug flag ProgramBarrierInCommandStreamTask to program barrier
pipe control in task command stream instead of csr command stream.
This will reduce the number of batch buffer starts.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-08-02 10:26:34 +02:00
Mateusz Hoppe
997b599168
fix(debugger): pass correct sipAllocation to makeResident
...
- sipAllocation for context must be resident in Offline mode
Related-To: NEO-7630
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-07-28 20:51:12 +02:00
Dunajski, Bartosz
cd9ad1f04c
fix: decanonize GPU VA during TBX memory read.
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-26 19:44:19 +02:00
Dunajski, Bartosz
a241099dff
feature: use WaitUserFence on zeEventHostSynchronize
...
Disabled by default. Debug flag is required.
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-26 19:41:09 +02:00
Dunajski, Bartosz
2c50fd9486
fix: waiting for completion in TBX mode
...
- use testTaskCountReady method to check TaskCount value
- download all allocations when TaskCount is ready
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-25 11:54:09 +02:00
Baj, Tomasz
4ca213d4d7
fix: commandContainer is nullptr in LinearStream for immediate cmdList
...
Related-To: GSD-4084
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com >
2023-07-24 15:06:18 +02:00
Lukasz Jobczyk
b06f605895
fix: Correctly flush DC when reuse heap
...
Related-To: NEO-8067
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-07-20 14:50:37 +02:00
Dominik Dabek
622a3ed89c
performance(ocl): flag to not dcFlush on no event
...
If waitForBarrier is not passed outEvent then do
dcFlush on the next synchronize call.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-07-18 15:38:54 +02:00
Zbigniew Zdanowicz
1c0285a156
fix: correct alignment of per thread scratch size
...
Related-To: NEO-5288
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-07-12 12:31:47 +02:00
Zbigniew Zdanowicz
3f7269d401
fix: make sip state programing once for all level zero command queues
...
Related-To: NEO-7828
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-07-11 11:34:21 +02:00
Mateusz Hoppe
4aba0f0340
feature: global bindless surface state base support
...
- program global bindless ssba when external allocator used (
UseExternalAllocatorForSshAndDsh)
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-07-06 18:31:49 +02:00
Zbigniew Zdanowicz
c892b8c6f3
fix: remove redundant check
...
Related-To: NEO-7828
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-07-06 14:58:18 +02:00
Zbigniew Zdanowicz
8836838c7c
performance: add one time context init sip state to immediate flush task
...
Related-To: NEO-7808
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-07-06 14:25:35 +02:00
Zbigniew Zdanowicz
59949bc833
performance: add one time context init csr surface to immediate flush task
...
Related-To: NEO-7808
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-07-05 16:18:21 +02:00