Dominik Dabek
a236171f0d
performance(ocl): enable device usm alloc reuse
...
Enabling on MTL+
Limited to use max 2% of global device memory.
Related-To: NEO-6893, NEO-11463
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-05-17 13:32:45 +02:00
Bartosz Dunajski
cb9977b8f4
feature: create copy offload queue under debug flag
...
Related-To: NEO-11376
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-05-17 11:04:35 +02:00
Compute-Runtime-Validation
34f53d5d94
Revert "performance: Reuse GPU timestamps by default"
...
This reverts commit 7aceed58ca .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-05-16 21:52:10 +02:00
Young Jin Yoon
e204d27190
fix: print to stdout for disable scratch page
...
Modified to print out error messages to stdout when disable scratch page
is used.
Related-To: GSD-7611
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2024-05-16 15:05:07 +02:00
Kamil Kopryk
44ef4c1ef2
refactor: add defaultThreadArbitrationPolicy in cmdlist
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-05-16 13:28:20 +02:00
Maciej Bielski
dc32da2507
feature: prepare StorageInfo for Drm-specific customization
...
Get the local memory regions count from the primary source (MemoryInfo)
and store for further use when using DrmMemoryManager.
Add a point of dispatch (virtual `createStorageInfoFromProperties`) for
further Drm-specific customizations related to StorageInfo. As the
function became virtual, move one of its callers
(`isLocalMemoryUsedForIsa()`) from the constructor of `MemoryManager` to
respective constructors of all derivative classes. This prevents
bypassing the virtual call dispatch.
Related-To: NEO-9754
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2024-05-16 12:24:48 +02:00
Pawel Cieslak
84ba633098
refactor: use legacy versions of injectMMIOList and setTbxServerIp
...
Related-To: NEO-7539
Signed-off-by: Pawel Cieslak <pawel.cieslak@intel.com >
2024-05-16 10:47:35 +02:00
Mateusz Jablonski
03d87d27ef
fix: generate per process aub file name by default
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-05-16 09:03:21 +02:00
Bari, Pratik
6b04cd4a1e
fix: Release the Dummy Allocation before Root Device Environment
...
Related-To: GSD-6398, GSD-9027
Signed-off-by: Bari, Pratik <pratik.bari@intel.com >
2024-05-15 18:08:41 +02:00
Szymon Morek
7aceed58ca
performance: Reuse GPU timestamps by default
...
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-05-15 17:51:42 +02:00
Mateusz Hoppe
e9b81037a9
refactor: add heaplessStateInitialized flag
...
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2024-05-15 15:39:52 +02:00
Dominik Dabek
b4d839fe29
performance(usm): l0, add usm host memory pooling
...
Disabled by default.
Related-To: NEO-11356
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-05-15 15:20:51 +02:00
Dominik Dabek
ae8c7589dc
refactor: move implicit arg has indirect access
...
Move implicit arg has indirect access boolean to kernelAttributes
Related-To: NEO-11396
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-05-15 13:11:04 +02:00
Krzysztof Gibala
a70aaa72ed
refactor: add debug message about the zero engine info size
...
Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com >
2024-05-15 09:07:40 +02:00
Young Jin Yoon
06faaab5bb
refactor: read scratch page options during init
...
Change scratch page logic to initialize during Drm::create.
Related-To: GSD-7742
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2024-05-15 08:56:14 +02:00
Dominik Dabek
fd47030ad6
fix: use igc indirect detection v3
...
Update to use igc indirect detection v3. Fix for not detecting indirects
passed as implicit arguments.
Related-To: NEO-11396
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-05-15 06:35:42 +02:00
Pawel Cieslak
f10439aea2
fix: include <algorithm> where std::find is used
...
Related-To: NEO-11375
Signed-off-by: Pawel Cieslak <pawel.cieslak@intel.com >
2024-05-14 16:14:41 +02:00
Ratajewski, Andrzej
0b0f7bd4cd
feature: Enable support for cl_intel_subgroup_2d_block_io
...
This change enables support for cl_intel_subgroup_2d_block_io extension for
PVC and later.
Related-To: GSD-7825
Signed-off-by: Ratajewski, Andrzej <andrzej.ratajewski@intel.com >
2024-05-14 15:16:49 +02:00
Maciej Bielski
014720fc29
refactor: use DeviceBitfield where appropriate
...
Replace less type-safe uint32_t and reduce conversions between both
types.
Related-To: NEO-9754
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2024-05-14 09:13:11 +02:00
Young Jin Yoon
2c488d9e84
fix: check reset status after completion
...
Added a logic to check the reset status after the completion to make
sure we go through the logic at least once
Related-To: GSD-8902
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2024-05-14 06:38:07 +02:00
Compute-Runtime-Validation
ebf09afc42
Revert "performance: Stop direct submission before removing host ptrs"
...
This reverts commit 4fd219cdbb .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-05-13 10:10:51 +02:00
Compute-Runtime-Validation
741474208e
Revert "refactor: command container tests"
...
This reverts commit 02e2a13c84 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-05-11 11:46:44 +02:00
Slawomir Milczarek
b37c2970ce
test: Rename regkey BcsNumberOverride to BlitterEnableMaskOverride
...
BlitterEnableMaskOverride is a bitmask with BCS engines available on device
Related-To: NEO-11152
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com >
2024-05-10 21:18:44 +02:00
Szymon Morek
e35b951a00
performance: Allow indirect allocs as pack on OpenCL
...
Related-To: NEO-11228
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-05-10 17:57:42 +02:00
Kamil Kopryk
553cb16964
refactor: correct spelling reseting -> resetting
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-05-10 11:38:42 +02:00
Katarzyna Cencelewska
02e2a13c84
refactor: command container tests
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2024-05-10 11:23:58 +02:00
Compute-Runtime-Validation
6a24610ed1
Revert "fix: Make makeResident explicitly blocking on linux"
...
This reverts commit 9095c3ed8a .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-05-10 11:03:13 +02:00
Mateusz Hoppe
77b817a15c
fix: fail when invalid zebin is decoded
...
- bindless + bindful accesses in single kernel are not supported
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2024-05-09 15:53:00 +02:00
Szymon Morek
9095c3ed8a
fix: Make makeResident explicitly blocking on linux
...
Currently this call is blocking on Windows and
xe kmd, this commit also makes it blocking on i915.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-05-09 14:13:42 +02:00
Maciej Plewka
e39893485c
fix: add cache flush as dependency for bcs ccs synchronization
...
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2024-05-09 13:43:39 +02:00
Szymon Morek
10ed479b16
performance: share inter-module ISA allocations
...
Related-To: NEO-10258
Currently each module has it's own GA
for kernel ISA's. This change allows new modules to
reuse existing allocation.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-05-09 08:43:55 +02:00
Mateusz Hoppe
76af952fcb
feature: Bindless sampled image support
...
Related-To: NEO-10352
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2024-05-08 17:50:37 +02:00
Compute-Runtime-Validation
902b611179
Revert "fix: Fix front end programming for cooperative dispatch"
...
This reverts commit 36ddfaaf4d .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-05-08 11:14:37 +02:00
Slawomir Milczarek
2473c38e31
test: Add regkey to override number of BCS engines on platform
...
New regkey BcsNumberOverride for use in TBX and AUB mode
Related-To: NEO-11082
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com >
2024-05-07 18:42:01 +02:00
Bartosz Dunajski
e5882e0d31
feature: pass GraphicsAllocation to fence wait
...
Related-To: NEO-8179
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-05-07 17:59:28 +02:00
Andrzej Koska
e44c7f2627
fix: BCS MMIO programming
...
When remap is enabled, we must set different base offset for copy engines.
Copy engines must use BCS0 base.
Related-To: NEO-10678
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com >
2024-05-07 14:08:28 +02:00
Lukasz Jobczyk
4fd219cdbb
performance: Stop direct submission before removing host ptrs
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-05-07 13:25:09 +02:00
Katarzyna Cencelewska
e9f7df6ae6
refactor: create helper for maxPtssIndex
...
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2024-05-07 12:47:52 +02:00
Mateusz Hoppe
d35d8727e5
fix: use primaryCsr allocations when csr has primaryCsr set
...
- global fence allocation
- global stateless heap allocation
- preemption allocation
- debug surface allocation
all above are shared from primary csr
Related-To: NEO-7824
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2024-05-07 02:41:24 +02:00
Bartosz Dunajski
ce36812f8d
feature: obtain GraphicsAllocation from external CB event
...
Related-To: NEO-8179
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-05-06 22:52:36 +02:00
Weronika Kapusta
d6c16c1640
fix: remove compiler cache legacy implementation
...
Related-To: NEO-10679
Signed-off-by: Weronika Kapusta <weronika.kapusta@intel.com >
2024-05-06 18:28:35 +02:00
Szymon Morek
6df46aa062
performance: Iterate over indirect allocations once
...
Related-To: NEO-11228
Iterate only on new allocations when making indirect
allocations resident.
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-05-06 15:51:37 +02:00
Szymon Morek
92344bfce5
fix: Cast variable to correct type
...
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-05-06 15:40:11 +02:00
Szymon Morek
d708c5f300
performance: Allow indirect allocs as pack on windows
...
Related-To: NEO-11228
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-05-06 15:03:12 +02:00
Szymon Morek
83e8ae4a20
performance: Reuse GPU timestamp instead of KMD escape
...
This can be enabled only if related
debug flag will be set.
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-05-06 14:46:30 +02:00
Lukasz Jobczyk
c1004b77bf
performance: Limit tlb flush in state cache flush to wddm
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-05-06 12:45:26 +02:00
Young Jin Yoon
07aa53fd87
fix: disable scratch page by default only on PVC
...
Disabled scratch paged by default only on PVC with productHelper.
Related-To: GSD-7742
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2024-05-01 23:44:48 +02:00
Bartosz Dunajski
806da85ec6
refactor: prework to pass interrupt hint
...
Related-To: NEO-8179
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-04-29 11:14:53 +02:00
Maciej Bielski
ff494d5c50
feature: add tile-to-lmem-region map in MemoryInfo
...
Arguments names refreshed to be more explanatory.
Related-To: NEO-9754
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2024-04-26 15:16:03 +02:00
Bartosz Dunajski
2a2596c13b
refactor: pass additional data to ioctl helper
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-04-26 14:53:14 +02:00