Dunajski, Bartosz
7ac825e74b
refactor: add debug flag to synchronize Event before reset
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-26 17:38:37 +02:00
Joshua Santosh Ranjan
d2b935b300
fix: fix uninitialized variables in test
...
Related-To: LOCI-4171
Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com >
2023-06-26 16:35:59 +02:00
Cencelewska, Katarzyna
68d81c82a7
fix: Use proper value about hw local id generations
...
- remove useless flag ForceNumberOfThreadsInGpgpuThreadGroup
- add new flag "RemoveRestrictionsOnNumberOfThreadsInGpgpuThreadGroup"
to restore old path without restrictions about number of threads in
thread group
- fix forwarding information about hw local ids generations to
calculate numOfThreadsInThreadGroup correctly
Related-To: NEO-7952, NEO-7982
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-06-26 16:35:42 +02:00
Dunajski, Bartosz
aea5f435db
feature: unregister CSR client on Event host synchronize
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-26 12:02:14 +02:00
Dunajski, Bartosz
61fb19caab
feature: bring back counter based in-order tracking
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-26 10:01:18 +02:00
Lukasz Jobczyk
bc0a3a7eb5
fix: Consider slm size in suggest work group cache
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-06-26 09:12:54 +02:00
Joshua Santosh Ranjan
97b4d8bab5
feature: add initial support for host mapped timestamps
...
Related-To: LOCI-4171
Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com >
2023-06-26 08:29:58 +02:00
Lukasz Jobczyk
0bc5eead84
fix: Remove not needed BCS split helper
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-06-22 14:36:22 +02:00
Zbigniew Zdanowicz
ddffb8a67f
fix: add missing unrecoverable macro
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-06-22 10:47:18 +02:00
Dunajski, Bartosz
eda13e675a
feature: enable dynamic post sync allocation layout
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-22 08:37:40 +02:00
Lukasz Jobczyk
a61b39c47b
fix: Invalidate constant cache when ULLS enabled
...
Related-To: NEO-8067
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-06-21 15:17:14 +02:00
Dunajski, Bartosz
662146ea17
feature: adjust ULTs for future dynamic post sync layout testing [2/2]
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-21 11:57:46 +02:00
Matias Cabral
96517a08aa
feature: Implement zetMetricGroupGetGlobalTimestampsExp()
...
Resolves: LOCI-3072
Signed-off-by: Matias Cabral <matias.a.cabral@intel.com >
2023-06-21 09:48:41 +02:00
Mateusz Hoppe
111b112729
feature: add assertBufferPtr to ImplicitArgs
...
Related-To: NEO-5753, NEO-8078
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-06-20 20:43:57 +02:00
Dunajski, Bartosz
6544a1defa
feature: adjust unit tests for future dynamic post sync allocation testing
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-20 16:22:33 +02:00
Mateusz Jablonski
1be48965ef
fix: parse all extensions passed to zeMemAllocShared
...
reuse common pNext parser
allow to create raytracing allocation with relaxed size
Related-To: NEO-8075
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-06-20 14:32:16 +02:00
Dunajski, Bartosz
46e8c3f5dd
fix: reenable RelaxedOrdering for OCL
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-20 13:05:25 +02:00
Lukasz Jobczyk
c5f61cce85
fix: Ensure flat ring buffer disabled on regular cmd lists
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-06-19 16:27:57 +02:00
Mateusz Hoppe
313fb84fda
feature: bindless addressing mode support
...
- allow bindless kernels to execute
- bindless addressing kernels are using private heaps mode
- do not differentiate bindful and bindless surface state base addresses
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-06-19 12:41:03 +02:00
Zbigniew Zdanowicz
1a80ab2589
performance: store values instead calling virtual methods in command list
...
Related-To: NEO-7828
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-06-16 16:45:16 +02:00
Lukasz Jobczyk
0cf975605b
performance: Cache suggest group size
...
Resolves: NEO-7968
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-06-16 13:26:55 +02:00
Spruit, Neil R
907f41529c
feature: zeModuleInspectLinkageExt to read Linkage of modules
...
Related-To: LOCI-2914
Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com >
2023-06-15 18:13:43 +02:00
Zhang, Winston
44a8610ebb
feature: Implement support for zeDeviceGetStatus
...
Added support for zeDeviceGetStatus API.
Added ULTs for zeDeviceGetStatus API.
Related-To: LOCI-1558
Signed-off-by: Zhang, Winston <winston.zhang@intel.com >
2023-06-15 06:36:14 +02:00
Mateusz Jablonski
3b981331c9
fix: correct handling ZE_ENABLE_PCI_ID_DEVICE_ORDER flag
...
- by default ZE_ENABLE_PCI_ID_DEVICE_ORDER is disabled
- by default devices are sorted by type (discrete first), then by pci order
- when ZE_ENABLE_PCI_ID_DEVICE_ORDER is enabled, devices are sorted by pci id
Related-To: LOCI-4520
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-06-14 16:27:55 +02:00
Compute-Runtime-Validation
02436b8877
Revert "fix: support alignments in host and shared UnifiedMemoryAllocation"
...
This reverts commit c11809e002 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-06-14 06:32:40 +02:00
Lukasz Jobczyk
694e731a01
fix: disable OOOS
...
Related-To: NEO-7458
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-06-13 14:10:27 +02:00
Cencelewska, Katarzyna
7cb3278eb3
fix: add function to calculate number of threads per tg
...
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-06-13 14:02:24 +02:00
Lu, Wenbin
c11809e002
fix: support alignments in host and shared UnifiedMemoryAllocation
...
Related-To: LOCI-4334
Signed-off-by: Lu, Wenbin <wenbin.lu@intel.com >
2023-06-13 10:01:11 +02:00
Mateusz Jablonski
4f72835b7d
fix: create dedicated class for root device indices to store unique values
...
remove method to removing duplicates from StackVec as the method
implicitly sorted the vector
Related-To: GSD-4692
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-06-12 22:24:06 +02:00
Kamil Kopryk
52651991c2
feature: don't allow to build program with stateful accesses on PVC
...
Related-To: NEO-6075
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2023-06-12 19:31:32 +02:00
Mateusz Hoppe
646c8985e8
refactor: store number of stateful args in KernelDescriptor
...
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-06-12 14:15:43 +02:00
Kamil Kopryk
6a0f7afd64
feature: verify stateful information only when binary is generated by IGC
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
Related-To: NEO-6075
Ngen binaries contain stateful information, however they are
not used in isa on Pvc. Therefore, we can just ignore them.
2023-06-12 11:45:41 +02:00
Dunajski, Bartosz
37f04dc670
feature: in-order CmdList support for BCS split
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-09 15:15:45 +02:00
Dunajski, Bartosz
3d49658f50
feature: new multitile post sync layout for immediate write [2/n]
...
No functional changes in this commit. This is prework.
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-09 14:20:34 +02:00
Joshua Santosh Ranjan
64bd9e58a8
fix: correctly handle timeouts in zeCommandListHostSynchronize
...
Related-To: LOCI-4446
Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com >
2023-06-09 02:33:41 +02:00
Dunajski, Bartosz
5fe9d70066
feature: new multitile post sync layout for immediate write [1/n]
...
No functional changes in this commit. This is prework.
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-07 13:11:10 +02:00
Rafal Maziejuk
6abdc64d73
refactor: move gen_kernel.cmake file to test_modules directory
...
Related-To: NEO-7975
Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com >
2023-06-07 12:28:58 +02:00
Neil R Spruit
a5ca2f44fe
fix: check L0 driver init success before setting teardown in L0 Loader
...
Related-To: LOCI-4499
Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com >
2023-06-07 08:19:21 +02:00
Neil R Spruit
ba6d447b4d
feature: Support for using Reserved address with multiple mappings
...
Related-To: LOCI-4381
- Enabled support for customers to use full Virtual reservation range
with multiple physical mappings with additional allocations implicitly
included in residency.
- Buffer Surface state size extended for first allocation to stretch to
the bufferSize requested.
Signed-off-by: Neil R Spruit <neil.r.spruit@intel.com >
2023-06-07 03:12:29 +02:00
Mateusz Hoppe
0844ca0ac8
refactor: cleanup getBindlessMode() usage
...
- getGlobalBindlessHeapConfiguration() should be used to choose global
alloctor for SSH
- remove not needed and incorrect unit tests
- remove not needed branches
- bindless mode controls bindless compilation only
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-06-06 17:23:13 +02:00
Dunajski, Bartosz
d2c9122ad3
feature: signal in-order sync allocation in Copy only path
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-05 16:40:01 +02:00
Dunajski, Bartosz
5f559ec2d5
feature: signal sync alloc in in-order CmdList appendSignalEvent path
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-05 16:11:16 +02:00
Jaime Arteaga
8085652467
fix: Correctly resize edge vector on ULT
...
Related-To: LOCI-4491
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com >
2023-06-03 02:23:27 +02:00
Zbigniew Zdanowicz
9ed942e552
fix: remove possible deadlock during state cache invalidation
...
Heaps must be invalidated before returning allocation to the pool.
Pipeline can be blocked by any outstanding events, so there is no need to
wait for the state cache flush completion. In case heap allocation is not
ready after state cache flush, command list will create new allocation and
eventually driver will reach state state of heap allocation reuse.
Related-To: NEO-7963
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-06-02 08:34:17 +02:00
Dunajski, Bartosz
fb7fff7442
feature: skip Events from previous append in in-order CL path
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-01 11:35:08 +02:00
Dunajski, Bartosz
808ff8c2e4
refactor: remove unused EncodeDispatchKernelArgs field
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-01 10:42:22 +02:00
Milczarek, Slawomir
ac3a2b6756
feature: extend regkey to force prefetch of shared memory for cmd list immediate
...
Related-To: NEO-7841
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com >
2023-06-01 10:05:20 +02:00
Lukasz Jobczyk
b43847e1f8
performance: Skip already completed events
...
Resolves: NEO-7587
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-05-31 15:11:14 +02:00
Zbigniew Zdanowicz
7d7ef2f104
performance: Add function pointer for immediate command list flush method
...
Related-To: NEO-7808
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-05-30 18:33:18 +02:00
Mateusz Jablonski
61055478d4
fix: adjust scope of disable L3 for debug WA
...
Related-To: HSD-1609398399
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-05-30 14:23:16 +02:00