Dunajski, Bartosz
a241099dff
feature: use WaitUserFence on zeEventHostSynchronize
...
Disabled by default. Debug flag is required.
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-26 19:41:09 +02:00
Joshua Santosh Ranjan
b6e76b9118
fix: Move event reference time tracking into event class
...
This would avoid recalculating reference timestamps
when event is used with different command lists.
Related-To: LOCI-4563
Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com >
2023-07-25 08:44:47 +02:00
Mateusz Hoppe
e52712b800
feature(ocl): enable "cl_khr_external_memory" extension
...
- report extension string
- report supported memory handle types
Related-To: NEO-6757
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-07-24 14:22:39 +02:00
Dominik Dabek
0a4d0917d4
performance(ocl): skip dcFlush on no event
...
Skip dcFlush on waitForBarrier without event by default.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-07-20 14:57:37 +02:00
Compute-Runtime-Validation
8c155a2e89
Revert "performance: Memory handling improvements"
...
This reverts commit 5b80bd4d7c .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-07-20 11:37:09 +02:00
Filip Hazubski
5b80bd4d7c
performance: Memory handling improvements
...
By default prefer allocating memory first by KMD, instead of malloc first.
By default prefer not caching allocations on MTL devices. This results
in allocations being handled with non-coherent pat index.
For integrated devices when caching is not preferred do not allow
direct memory access in CPU domain. For map/unmap operations create
a dedicated memory allocation for CPU access, instead of accessing it
directly, reusing the same logic as when mapping/unmapping local memory.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2023-07-19 19:21:44 +02:00
Wilma, Pawel
39b25abf0e
feature: debug flag to enable/disable AIL
...
Related-to: NEO-8049
Signed-off-by: Wilma, Pawel <pawel.wilma@intel.com >
2023-07-19 12:10:05 +02:00
Mateusz Jablonski
01990e8bd7
feature(internal): add debug flag to control preferred allocation method on Wddm
...
Related-To: NEO-7194
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-07-18 16:46:17 +02:00
Dominik Dabek
622a3ed89c
performance(ocl): flag to not dcFlush on no event
...
If waitForBarrier is not passed outEvent then do
dcFlush on the next synchronize call.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-07-18 15:38:54 +02:00
Lukasz Jobczyk
83bd33befc
refactor: Add flag to control BCS split for pageable memory
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-07-11 15:12:40 +02:00
Jaime Arteaga
23eeaf816d
feature: Add debug keys for chunking allocation and size
...
Related-to: NEO-7695
New debug keys added:
EnableBOChunking is now a mask
0 = no chunking (default).
1 = shared allocations only
2 = device allocations only
3 = shared and device allocations
MinimalAllocationSizeForChunking sets the minimum allocation
size to apply chunking. Default is 2MB.
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com >
2023-07-07 23:39:43 +02:00
Michal Mrozek
5eadedc36e
refactor: Remove not used logic.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2023-06-30 10:58:35 +02:00
Zbigniew Zdanowicz
21823af419
performance: add skeleton method to cmdlist immediate flush task
...
Related-To: NEO-7808
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-06-30 10:46:20 +02:00
Zbigniew Zdanowicz
1067167637
test: adding testing debug flag overriding driver version
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-06-27 17:58:03 +02:00
Dunajski, Bartosz
7ac825e74b
refactor: add debug flag to synchronize Event before reset
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-26 17:38:37 +02:00
Cencelewska, Katarzyna
68d81c82a7
fix: Use proper value about hw local id generations
...
- remove useless flag ForceNumberOfThreadsInGpgpuThreadGroup
- add new flag "RemoveRestrictionsOnNumberOfThreadsInGpgpuThreadGroup"
to restore old path without restrictions about number of threads in
thread group
- fix forwarding information about hw local ids generations to
calculate numOfThreadsInThreadGroup correctly
Related-To: NEO-7952, NEO-7982
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-06-26 16:35:42 +02:00
Dunajski, Bartosz
aea5f435db
feature: unregister CSR client on Event host synchronize
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-26 12:02:14 +02:00
Joshua Santosh Ranjan
97b4d8bab5
feature: add initial support for host mapped timestamps
...
Related-To: LOCI-4171
Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com >
2023-06-26 08:29:58 +02:00
Dunajski, Bartosz
b004a27e4e
refactor: Debug flag to print TSP usage
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-22 14:47:39 +02:00
Mateusz Jablonski
26ad315207
feature: enable allocating shared usm in heap extended host by default
...
Related-To: NEO-7665
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-06-20 15:45:25 +02:00
Cencelewska, Katarzyna
9f7374da6e
fix: Change default setting flag EnableCpuCacheForResources to true on mtl
...
Related-To: HSD-18030829682
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-06-19 12:22:42 +02:00
Cencelewska, Katarzyna
7cb3278eb3
fix: add function to calculate number of threads per tg
...
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-06-13 14:02:24 +02:00
Dunajski, Bartosz
3d49658f50
feature: new multitile post sync layout for immediate write [2/n]
...
No functional changes in this commit. This is prework.
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-09 14:20:34 +02:00
Cencelewska, Katarzyna
baa4ba9c56
fix: set default value of EnableCpuCacheForResources to false
...
- this flag is affecting only mtl
Related-To: NEO-7194
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-06-05 13:42:56 +02:00
Jaime Arteaga
2efd6e547a
feature: Add support for chunking in the UMD (1/N)
...
Read if support for chunking is available in the KMD.
If available, KMD will create a BO with 1 or more chunks,
depending on the chunk size selected.
Related-To: NEO-7695
Sync to
https://github.com/intel-gpu/drm-uapi-helper/releases/tag/v2.0-rc18
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com >
Signed-off-by: John Falkowski <john.falkowski@intel.com >
2023-06-02 23:27:40 +02:00
Bellekallu Rajkiran
3c072a6cd1
fix: WA for VF bar resource allocation post Warm reset
...
On Warm reset, With default bar size set by bios, VF bar
allocation is getting failed because of bug in pci driver
which impacts SRIOV functionality.
Resize VF bar size for succesful allocation of VF bar
post warm reset.
Related-To: LOCI-4481
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2023-06-02 13:16:34 +02:00
Warchulski, Jaroslaw
03d9a20559
feature: add debug flag to wait for release memory
...
Related-To: NEO-6766
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com >
2023-06-02 09:57:27 +02:00
Cencelewska, Katarzyna
115d6de350
fix: add debug key to verify device state before submit
...
- new debug key EnableDeviceStateVerification to check device state not
ony in debug mode
Related-To: NEO-7669
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-05-31 14:31:23 +02:00
Compute-Runtime-Validation
9cc7028025
Revert "feature: enable allocating shared usm in heap extended host by default"
...
This reverts commit 5b178e68e9 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-05-31 09:39:11 +02:00
Mateusz Jablonski
5b178e68e9
feature: enable allocating shared usm in heap extended host by default
...
Related-To: NEO-7665
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-05-29 11:28:18 +02:00
Joshua Santosh Ranjan
29682a4f8d
feature: print global timestamp
...
Related-To: LOCI-4285
Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com >
2023-05-25 09:45:13 +02:00
Daria Hinz
331f167cfe
feature: Add debug flag for setting hw ip version
...
Signed-off-by: Daria Hinz <daria.hinz@intel.com >
Related-To: NEO-7954
2023-05-23 15:32:46 +02:00
Compute-Runtime-Validation
d390ec6e8d
Revert "fix: set default value of flag EnableCpuCacheForResources to false"
...
This reverts commit 305cc00b0f .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-05-19 11:40:47 +02:00
Katarzyna Cencelewska
305cc00b0f
fix: set default value of flag EnableCpuCacheForResources to false
...
when flag disabled, gmm flag Cacheable won't set on xe_hp and later
Related-To: NEO-7194
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2023-05-18 10:40:01 +02:00
Cencelewska, Katarzyna
71ec4c528f
fix: set default value of flag EnableCpuCacheForResources to true
...
Related-To: HSD-18030023426, HSD-18030026101, HSD-18030022460
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-05-17 11:26:36 +02:00
Lukasz Jobczyk
0e758e4bb5
performance: Add debug flag to set BCS split minimal size
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-05-17 08:07:43 +02:00
Katarzyna Cencelewska
004a3d875c
fix: Remove default setting of gmm flag Cacheable to true
...
- add debug flag EnableCpuCacheForResources to be able to allow coherency when
resources could be cacheable
Resolves: NEO-7194
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2023-05-16 09:17:29 +02:00
Dunajski, Bartosz
cfacbbd811
refactor: Simplify OverrideBlitterMocs usage
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-05-09 19:22:57 +02:00
Warchulski, Jaroslaw
7fdf4985a3
feature: add support for cl_khr_external_memory extension
...
Related-To: NEO-7069
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com >
2023-05-05 15:51:39 +02:00
Bellekallu Rajkiran
d3a31957db
feature(sysman): Add delay for HBM diagnostics
...
Add debug variable to set sleep duration for HBM
IFR to complete
Related-To: LOCI-4298
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com >
2023-05-03 20:27:21 +02:00
Mateusz Jablonski
74205f3f37
Revert "feature: enable allocating shared usm in heap extended host by default"
...
This reverts commit 26f16f4e98 .
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-05-02 09:12:26 +02:00
Aravind Gopalakrishnan
1883161e1e
fix: Add debug key to Force Tlb flush
...
Related-To: GSD-4457
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com >
2023-05-01 17:52:22 +02:00
Mateusz Jablonski
26f16f4e98
feature: enable allocating shared usm in heap extended host by default
...
Related-To: NEO-7665
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-04-28 13:18:20 +02:00
Mateusz Jablonski
5a5c20f99c
fix: create separate heap for host and shared usm in 48-56b VA
...
Related-To: NEO-7665
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-04-28 10:09:38 +02:00
Cencelewska, Katarzyna
861ec524c6
fix: check icbe version only once when patchtoken
...
- set by default flag ZebinIgnoreIcbeVersion to true
- for zebin icbe version check is only inside flag
- only when use patchtoken then check icbe version is mandatory
Resolves: NEO-7904
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-04-28 09:26:02 +02:00
Dunajski, Bartosz
14c3777409
feature: Experimental support of immediate cmd list in-order execution [1/n]
...
Related-To: LOCI-4332
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-04-26 13:15:59 +02:00
Mateusz Jablonski
06bd405e88
feature: add debug flag to control usage of heap extended for USM Host
...
Related-To: NEO-7665
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-04-25 15:39:49 +02:00
Dunajski, Bartosz
6e9257c623
Debug flag to force early exit
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-04-25 09:44:44 +02:00
Lukasz Jobczyk
853a65aae9
Add PCI barrier implementation
...
Resolves: NEO-7850
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-04-24 10:33:03 +02:00
Kacper Nowak
c7adbc2140
Add debug key for dumping ELF to file
...
Add "DumpZEBin" debug flag. When this flag is enabled, Zebin will be
dumped to a .elf file (with appropiate suffix, in case such file has
been dumped before).
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com >
Related-To: NEO-7895
2023-04-18 20:40:25 +02:00