Dunajski, Bartosz
2e8cf5fdf5
refactor: split CpuInaccessible MemoryPool types to Device and System
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-10-16 12:47:13 +02:00
Compute-Runtime-Validation
30b066c40e
Revert "fix: synchronize host and device timers to avoid device timer overflow"
...
This reverts commit dae8c34f81 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-10-16 11:16:23 +02:00
Mateusz Jablonski
dae8c34f81
fix: synchronize host and device timers to avoid device timer overflow
...
Related-To: NEO-8394
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-13 17:40:45 +02:00
Dominik Dabek
eebf2bbd26
performance(ocl): timestamp packet count per gfx
...
Add support for different timestamp packet counts per gfx family.
Change all packet counts to 1 except for xe-hpc.
Related-To: NEO-8154
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-25 20:34:58 +02:00
Dunajski, Bartosz
80d0c74605
fix: track registered CSR clients 2
...
Related-To: NEO-8884
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-25 09:35:55 +02:00
Compute-Runtime-Validation
ade538ce54
Revert "fix: track registered CSR clients"
...
This reverts commit 53f635e392 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-24 10:07:26 +02:00
Dunajski, Bartosz
53f635e392
fix: track registered CSR clients
...
In L0 its not possible to track objects relations. For example CmdList
may be removed before Event.
In such case, Event needs to safely skip unregister call, without
accessing CmdList/CmdQueue object.
Related-To: NEO-8884
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-22 09:42:55 +02:00
Dominik Dabek
ace5e67b51
test(ocl): negative test, split barrier required
...
Test for splitBarrierRequired not being set on
enqueueMarkerWithWaitList.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-21 18:16:36 +02:00
Dunajski, Bartosz
d3d5da1f72
feature: initial 64b in-order CmdList support
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-19 15:23:37 +02:00
Dominik Dabek
1b7e178b25
performance(ocl): program barrier pc in taskStream
...
Program barrier to task stream, before next enqueue kernel.
This will reduce the number of batch buffer starts for sequences of
enqueue, barrier, enqueue, ... .
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-19 11:48:02 +02:00
Dominik Dabek
ee767822b8
performance(ocl): add bcsEngineCount to cmdq
...
Related-To: NEO-8155
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-15 15:49:56 +02:00
Mateusz Jablonski
d4f80fb957
test: correct scope of unit test
...
ensure that getHwRevIdFromStepping is not defined for MTL+
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-09-13 11:56:56 +02:00
Dunajski, Bartosz
7562842a58
refactor: remove LogicalStateHelper
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-13 10:29:53 +02:00
Mrozek, Michal
15a0bf9d7b
refactor: remove not needed code.
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com >
2023-09-12 13:52:47 +02:00
Dunajski, Bartosz
2a6be2fccd
feature: update conditional bb start to use qword data
...
Related-To: NEO-8242
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-12 11:24:28 +02:00
Mrozek, Michal
ae4757711e
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com >
2023-09-12 09:39:01 +02:00
Compute-Runtime-Validation
b5e9c10f64
Revert "performance(ocl): program barrier pc in taskStream"
...
This reverts commit 839c2d6737 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-12 01:32:28 +02:00
Dominik Dabek
839c2d6737
performance(ocl): program barrier pc in taskStream
...
Program barrier immediately to task stream.
This will reduce the number of batch buffer starts.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-11 13:23:26 +02:00
Lukasz Jobczyk
a38ac3557b
performance: Disable blit enqueue on xe_lpg
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-09-11 13:10:23 +02:00
Compute-Runtime-Validation
7b5ef1b2f4
Revert "performance: Disable blit enqueue on xe_lpg"
...
This reverts commit ac97b1c493 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-11 10:10:21 +02:00
Michal Mrozek
65188fc90b
refactor: remove not needed code
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2023-09-08 17:22:18 +02:00
Lukasz Jobczyk
ac97b1c493
performance: Disable blit enqueue on xe_lpg
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-09-08 14:04:24 +02:00
Dunajski, Bartosz
8f6183e758
fix: check queue post post completion resources when releasing Event
...
Related-To: NEO-8316
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-04 08:23:56 +02:00
Maciej Plewka
2e249d819e
fix: submit marker for cpu transfer with event on IOQ queue
...
Related-To: NEO-8081
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2023-08-21 15:07:37 +02:00
Dunajski, Bartosz
3cf1f5c462
fix: unregister CSR client on OCL sync points
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-08-16 13:35:07 +02:00
Dunajski, Bartosz
a42e88d55b
fix: stall RelaxedOrdering scheduler when programming OCL dependencies
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-08-14 10:47:35 +02:00
Fabian Zwolinski
6fca8ee195
refactor: Remove SourceLevelDebugger
...
Removed:
- SourceLevelDebugger (with tests)
- DebuggerLibrary
- DebuggerLibraryRestore
- debuggerSupported field from hwInfo.capabilityTable
- HasSourceLevelDebuggerSupport matcher
- ExperimentalEnableSourceLevelDebugger debug var
- EnableMockSourceLevelDebugger debug var
- DebuggerOptDisable debug var
- lib_names.h.in file
- third_party/source_level_debugger/igfx_debug_interchange_types.h
Related-To: NEO-7213
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com >
2023-08-10 11:14:02 +02:00
Mateusz Jablonski
db12dc78a8
test: reduce gpu hang check period in test expecting gpu hang
...
this reduces test execution time by 500 ms
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-08-04 08:49:51 +02:00
Filip Hazubski
12af65a970
fix: Change default value of EnableCpuCacheForResources debug toggle
...
This change disables CPU caching for resources
not accessed by CPU for MTL devices.
Related-To: NEO-7194
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2023-07-31 09:15:43 +02:00
Mateusz Hoppe
997b599168
fix(debugger): pass correct sipAllocation to makeResident
...
- sipAllocation for context must be resident in Offline mode
Related-To: NEO-7630
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-07-28 20:51:12 +02:00
Mateusz Jablonski
f84f22d23c
refactor: remove unused isSharedContext variable
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-07-26 12:14:23 +02:00
Dunajski, Bartosz
6656e23b86
refactor: unify completion check methods
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-25 14:17:52 +02:00
Dunajski, Bartosz
2c50fd9486
fix: waiting for completion in TBX mode
...
- use testTaskCountReady method to check TaskCount value
- download all allocations when TaskCount is ready
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-25 11:54:09 +02:00
Cencelewska, Katarzyna
6f45574ff9
fix: add proper bcs synchronization when marker
...
Related-To: NEO-8105
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-07-24 10:57:41 +02:00
Compute-Runtime-Validation
8c155a2e89
Revert "performance: Memory handling improvements"
...
This reverts commit 5b80bd4d7c .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-07-20 11:37:09 +02:00
Filip Hazubski
5b80bd4d7c
performance: Memory handling improvements
...
By default prefer allocating memory first by KMD, instead of malloc first.
By default prefer not caching allocations on MTL devices. This results
in allocations being handled with non-coherent pat index.
For integrated devices when caching is not preferred do not allow
direct memory access in CPU domain. For map/unmap operations create
a dedicated memory allocation for CPU access, instead of accessing it
directly, reusing the same logic as when mapping/unmapping local memory.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2023-07-19 19:21:44 +02:00
Dominik Dabek
622a3ed89c
performance(ocl): flag to not dcFlush on no event
...
If waitForBarrier is not passed outEvent then do
dcFlush on the next synchronize call.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-07-18 15:38:54 +02:00
Dunajski, Bartosz
815b37bf3a
performance: allow waiting for OOQ timestamps in clEnqueueWaitForEvents
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-14 11:33:10 +02:00
Zbigniew Zdanowicz
1c0285a156
fix: correct alignment of per thread scratch size
...
Related-To: NEO-5288
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-07-12 12:31:47 +02:00
Cencelewska, Katarzyna
aa0beb8191
fix: Unify logic calculating threads per work group part 4
...
- also use helper when checking that is simd1 to have same flow
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-07-07 15:34:59 +02:00
Dunajski, Bartosz
00bae2c827
fix: add missing nullptr check
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-07 11:34:49 +02:00
Cencelewska, Katarzyna
61f701aba5
fix: Unify logic calculating threads per work group part 3
...
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-07-04 15:27:44 +02:00
Cencelewska, Katarzyna
7cb3278eb3
fix: add function to calculate number of threads per tg
...
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-06-13 14:02:24 +02:00
Mateusz Jablonski
4f72835b7d
fix: create dedicated class for root device indices to store unique values
...
remove method to removing duplicates from StackVec as the method
implicitly sorted the vector
Related-To: GSD-4692
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-06-12 22:24:06 +02:00
Dunajski, Bartosz
5fe9d70066
feature: new multitile post sync layout for immediate write [1/n]
...
No functional changes in this commit. This is prework.
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-07 13:11:10 +02:00
Kacper Nowak
d05fee8087
fix(ocl): Use GPU address of SVM alloc. when using builtins
...
Use GPU address of SVM allocation instead of CPU one. It fixes a bug
visible on platforms, where potentially GPU VA does not have to match
CPU VA.
Related-To: NEO-7970
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com >
2023-05-26 13:39:14 +02:00
Lukasz Jobczyk
0e758e4bb5
performance: Add debug flag to set BCS split minimal size
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-05-17 08:07:43 +02:00
Lukasz Jobczyk
9f755b54ad
performance: Bidirectional BCS split implementation for OCL
...
Related-To: NEO-7877
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-05-16 13:34:18 +02:00
Mateusz Jablonski
425a2a6fa2
fix: set NotLockable flag when resource does not need to be lockable
...
disable compression preference when resource is lockable
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-05-15 16:47:21 +02:00
Naklicki, Mateusz
71e5c76d67
fix: remove explicit AgeBased Thread Arbitration policy
...
Related-To: NEO-7913
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com >
2023-05-15 09:58:04 +02:00