Chandio, Bibrak Qamar
8cf4804fcd
fix: Overhead in zeDeviceGetGlobalTimestamps
...
Related-To: NEO-11908
There is overhead when the submission method is used for
zeDeviceGetGlobalTimestamps. This change removes it.
Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2024-12-12 08:54:19 +01:00
Lukasz Jobczyk
c2093990d4
fix: Flush monitor fence only to context where needed
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-12-11 10:06:43 +01:00
Fabian Zwoliński
d2ce3badfc
fix: bindlessHeapsHelper handle unavailable external heap
...
This PR handles the situation in which a component
has reserved a front window space for itself in the external heap,
so that the Compute Runtime cannot access this area.
In such a situation, we perform the following steps:
1. reserve 4GB chunk in heapStandard
2. split our chunk into 2 parts: heapFrontWindow, heapRegular
3. from this point on, map all linearStream allocations in reserved 4GB
chunk
Patch applies to Windows and WSL.
Patch only applies when the bindless global allocator is enabled.
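The three steps above can be sketched roughly as follows. This is a minimal illustration, not the Compute Runtime implementation: `AddressRange`, `ReservedExternalHeap`, `splitReservedChunk`, and the `frontWindowSize` parameter are hypothetical names introduced here, and the actual front window size is not stated in this message.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper type; not part of the Compute Runtime API.
struct AddressRange {
    uint64_t base;
    uint64_t size;

    uint64_t end() const { return base + size; }

    // True when [address, address + length) lies entirely inside this range.
    bool contains(uint64_t address, uint64_t length) const {
        return address >= base && length <= size && address - base <= size - length;
    }
};

// Hypothetical result of step 2: the two parts of the reserved chunk.
struct ReservedExternalHeap {
    AddressRange frontWindow;
    AddressRange regular;
};

// Step 1 reserves a 4GB chunk in heapStandard; only its size is modelled here.
constexpr uint64_t reservedChunkSize = 4ull << 30;

// Step 2: split the chunk into heapFrontWindow and heapRegular.
// frontWindowSize is an assumed parameter, chosen by the caller.
inline ReservedExternalHeap splitReservedChunk(uint64_t chunkBase, uint64_t frontWindowSize) {
    assert(frontWindowSize < reservedChunkSize);
    ReservedExternalHeap heap{};
    heap.frontWindow = {chunkBase, frontWindowSize};
    heap.regular = {chunkBase + frontWindowSize, reservedChunkSize - frontWindowSize};
    return heap;
}
```

Step 3 would then place every linearStream allocation at an address for which `frontWindow.contains()` or `regular.contains()` holds, instead of using the external heap.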
Related-To: HSD-16025889919
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-12-05 14:18:01 +01:00
Chandio, Bibrak Qamar
ab2e831a4a
fix: zeDeviceGetGlobalTimestamps to use submission
...
Related-To: GSD-10253, GSD-9467, GSD-9381, NEO-11908
When EnableGlobalTimestampViaSubmission is set,
zeDeviceGetGlobalTimestamps uses the immediate command submission
method to get the GPU time.
Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2024-12-04 19:10:07 +01:00
Szymon Morek
d5e5c8e59e
fix: call flushMonitorFence on BCS CSR
...
Related-To: NEO-12477
Also, make sure that the global fence is always resident.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-12-03 12:48:54 +01:00
Compute-Runtime-Validation
f5cda5ba53
Revert "fix: call flushMonitorFence on BCS CSR"
...
This reverts commit fc8aa8246b.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-11-30 22:31:30 +01:00
Szymon Morek
fc8aa8246b
fix: call flushMonitorFence on BCS CSR
...
Related-To: NEO-12477
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-29 16:31:40 +01:00
Mateusz Jablonski
d4e201db86
test: remove not used usings/typedefs/variables in shared tests
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-29 10:46:10 +01:00
Mateusz Jablonski
2039b1c41b
refactor: remove not needed code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-28 16:16:30 +01:00
Mateusz Jablonski
974e8ae63f
performance: don't call evict with empty handle list
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-27 09:25:36 +01:00
Mateusz Jablonski
db6fe7892c
fix: remove destroyed allocations from eviction lists
...
mark explicitly made resident allocations
Related-To: NEO-13246, GSD-10319
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-25 22:19:54 +01:00
Wenbin Lu
2ba80ce114
feature: support physical host memory
...
Related-To: NEO-11981
Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-11-20 08:19:52 +01:00
Lukasz Jobczyk
7f3896d05f
performance: Ensure hostptrs removed before creating new one
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-11-12 13:55:15 +01:00
Marcel Skierkowski
49d999abe6
fix: remove defaultProfilingTimerResolution from RuntimeCapabilityTable
...
Related-To: NEO-12275
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2024-11-04 12:02:11 +01:00
Lukasz Jobczyk
e687e11ab1
performance: Add CCS Optimization
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-23 11:35:57 +02:00
Maciej Bielski
45e78fea76
fix: use productHelper in getPatIndexInfoString() on Windows
...
Fix the PAT-index reporting in the logger; the values currently reported
on Windows are simply wrong.
The changed logic depends on `RootDeviceEnvironment`. To avoid
introducing such dependencies into logger.[ch], `logAllocation()` is no
longer a member of `FileLogger` but a free function in a separate .cpp
file. This matters because the source files `logger.[ch]` are also used
by the ocloc library, and there is no point in contaminating the ocloc
code structure with unnecessary dependencies.
Related-To: NEO-9421
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-10-22 19:27:13 +02:00
Wenbin Lu
a8a40d2afd
feature: support SVM heap in reserveVirtualMem
...
Related-To: NEO-11981
Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-10-22 16:47:14 +02:00
Compute-Runtime-Validation
e10998db45
Revert "performance: Add CCS Optimization"
...
This reverts commit e7b3a40aa7.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-22 05:52:14 +02:00
Michal Mrozek
18d828421d
performance: add debug flag to control huge chunk size on wddm.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-10-21 16:51:03 +02:00
Lukasz Jobczyk
8a647f6a39
Revert "performance: Ensure hostptrs removed before creating new one"
...
This reverts commit 5b2f2f3d83.
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-21 10:36:06 +02:00
Lukasz Jobczyk
e7b3a40aa7
performance: Add CCS Optimization
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-21 09:59:14 +02:00
Mateusz Jablonski
4154e6666b
fix: correct loading L0 loader functions
...
- don't load ze_loader.dll from the file system
- to perform self-open on Windows, use getModuleHandleA with the proper module name
- don't free a library loaded with getModuleHandleA
- as the loader may not be available during runtime teardown:
  - load the translate handle function during global setup
  - load the setDriverTeardown function during global teardown
  - when the loader is not available during teardown, unset the translate
    handle function
Related-To: GSD-10147
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-17 15:39:20 +02:00
Compute-Runtime-Validation
f117b5fb68
Revert "fix: correct loading L0 loader functions"
...
This reverts commit 9c7b3c5e19.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-17 08:20:19 +02:00
Mateusz Jablonski
9c7b3c5e19
fix: correct loading L0 loader functions
...
on Windows use getModuleHandleA with proper module name
don't load ze_loader.dll from file system
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-16 19:53:05 +02:00
Mateusz Jablonski
2ce3022030
refactor: remove not needed code
...
Related-To: NEO-12949
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-15 15:33:12 +02:00
Compute-Runtime-Validation
a55a846284
Revert "fix: update slm size based on gt system info in wddm path"
...
This reverts commit d433fd585a.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-11 03:43:37 +02:00
Mateusz Jablonski
d433fd585a
fix: update slm size based on gt system info in wddm path
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-10 11:29:19 +02:00
Wojciech Konior
6b40f9bc5a
refactor: engineInstancedType removed
...
Related-To: NEO-12594
Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2024-10-09 16:30:48 +02:00
Lukasz Jobczyk
5b2f2f3d83
performance: Ensure hostptrs removed before creating new one
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-08 13:04:56 +02:00
Compute-Runtime-Validation
60afb83b3b
Revert "performance: Ensure hostptrs removed before creating new one"
...
This reverts commit a890ed5648.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-08 07:45:55 +02:00
Szymon Morek
a915ef4b7b
fix: Don't program redundant paging fence semaphores
...
Related-To: NEO-12197
Don't program a semaphore to wait for the paging fence if one was
already programmed with the same value.
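The idea can be illustrated with a small sketch that remembers the last paging fence value a wait was programmed for and skips the wait when the value repeats. `CommandStreamSketch` and `programPagingFenceWait` are hypothetical names used only for illustration; the real command stream programming in the runtime is more involved.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a command stream; it only records which
// paging fence values a semaphore wait was emitted for.
struct CommandStreamSketch {
    std::vector<uint64_t> semaphoreWaits;
    uint64_t lastProgrammedPagingFence = 0;
    bool anyWaitProgrammed = false;

    // Emits a wait for pagingFenceValue unless the previous wait already
    // used the same value. Returns true when a semaphore was emitted.
    bool programPagingFenceWait(uint64_t pagingFenceValue) {
        if (anyWaitProgrammed && pagingFenceValue == lastProgrammedPagingFence) {
            return false; // redundant: same value was already programmed
        }
        semaphoreWaits.push_back(pagingFenceValue);
        lastProgrammedPagingFence = pagingFenceValue;
        anyWaitProgrammed = true;
        return true;
    }
};
```

With this bookkeeping, repeated submissions that observe an unchanged paging fence value add no extra semaphore commands.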
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-10-07 11:42:06 +02:00
Mateusz Hoppe
e5658e5327
fix: fix freeing allocations in aub mode
...
- free in the operations handler with aub dump must be called to evict
allocations from the residentAllocations list
- prior to this change, stale allocations were left in the container,
leading to a crash in aub modes
Related-To: NEO-12869
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-10-03 17:14:07 +02:00
Compute-Runtime-Validation
41df1a6f47
Revert "feature: support SVM heap in reserveVirtualMem"
...
This reverts commit bfaeeb01d6.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-03 14:53:50 +02:00
Mateusz Jablonski
9819b1e2e4
refactor: remove not needed flag ftrGpGpuMidThreadLevelPreempt
...
the flag was used for mid thread preemption support on pre-gen12 platforms
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-01 12:52:12 +02:00
Szymon Morek
468e309e11
fix: call evict under lock
...
Related-To: NEO-12657
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-09-25 10:26:01 +02:00
Maciej Plewka
80f75ceace
fix: submit dummy exec to pin memory during zeContextMakeMemoryResident call
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-09-23 14:43:59 +02:00
Lukasz Jobczyk
a890ed5648
performance: Ensure hostptrs removed before creating new one
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-09-19 14:34:19 +02:00
Compute-Runtime-Validation
e4d2f16632
Revert "performance: Ensure hostptrs removed before creating new one"
...
This reverts commit ac1d203555.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-09-18 19:31:34 +02:00
Lukasz Jobczyk
ac1d203555
performance: Ensure hostptrs removed before creating new one
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-09-18 14:23:16 +02:00
Lukasz Jobczyk
64c935a594
fix: Do not defer external hostptr destroy
...
Resolves: NEO-12567
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-09-17 14:54:01 +02:00
Mateusz Jablonski
7e218a5f70
test: simplify IsAtLeastGen12lp and IsAtMostGen12lp matchers
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-17 13:42:19 +02:00
Michal Mrozek
dd631610b3
refactor: move memory tracking to memory manager
...
- remove wddm specific code
- report the total size in decimal
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-09-12 17:32:38 +02:00
Michal Mrozek
da59b88122
refactor: improve logging of allocations
...
add capability to measure total amount of allocated memory
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-09-12 11:00:27 +02:00
Kamil Kopryk
d18d58d4f7
test: correct expectations in ults if heapless enabled
...
Related-To: NEO-10681
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-09-12 09:20:47 +02:00
Michal Mrozek
7cba813c81
refactor: improve logging by adding size.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-09-11 20:33:55 +02:00
Wenbin Lu
bfaeeb01d6
feature: support SVM heap in reserveVirtualMem
...
Related-To: NEO-11981
Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-09-09 23:22:04 +02:00
Fabian Zwoliński
38e1614f4a
fix: create and use new allocation type for syncBuffer
...
Related-To: NEO-11533
Related-To: HSD-18039788811
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-09-06 14:11:33 +02:00
Szymon Morek
80ef56ef4a
fix: Fix residency handling when out-of-memory occurs
...
Related-To: NEO-12434, NEO-11755
When OOM is triggered from the KMD, iterate over the allocations again,
since allocations that should be resident could have been evicted during
the trim process.
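A rough sketch of the retry idea described above, under stated assumptions: `AllocationSketch`, `ResidencyResult`, `makeAllResident`, the `kmdMakeResident` callback, and the `maxRetries` bound are all hypothetical and only model the control flow. The sketch conservatively treats every allocation as possibly evicted after an OOM; the actual runtime may track this differently.

```cpp
#include <functional>
#include <vector>

// Hypothetical result of a single make-resident request to the KMD.
enum class ResidencyResult { success, outOfMemory };

// Hypothetical allocation record; only the residency flag is modelled.
struct AllocationSketch {
    bool resident = false;
};

// Makes all allocations resident. When the KMD reports out-of-memory, the
// trim that follows may have evicted allocations that were already made
// resident, so every flag is cleared and the whole list is walked again.
inline bool makeAllResident(std::vector<AllocationSketch> &allocations,
                            const std::function<ResidencyResult(AllocationSketch &)> &kmdMakeResident,
                            int maxRetries) {
    for (int attempt = 0; attempt <= maxRetries; ++attempt) {
        bool outOfMemoryHit = false;
        for (auto &allocation : allocations) {
            if (allocation.resident) {
                continue;
            }
            if (kmdMakeResident(allocation) == ResidencyResult::outOfMemory) {
                outOfMemoryHit = true;
                break;
            }
            allocation.resident = true;
        }
        if (!outOfMemoryHit) {
            return true;
        }
        // Assume anything may have been trimmed; start over from scratch.
        for (auto &allocation : allocations) {
            allocation.resident = false;
        }
    }
    return false;
}
```

The retry bound keeps the loop from spinning forever when memory pressure does not go away.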
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-09-05 13:23:37 +02:00
Compute-Runtime-Validation
d842f65cf1
Revert "fix: submit dummy exec to pin memory during zeContextMakeMemoryReside...
...
This reverts commit f9b87d53e6.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-09-05 03:28:03 +02:00
Maciej Plewka
f9b87d53e6
fix: submit dummy exec to pin memory during zeContextMakeMemoryResident call
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
Related-To: NEO-11879
2024-09-04 14:07:29 +02:00