Commit Graph

4004 Commits

Author SHA1 Message Date
Dominik Dabek
a236171f0d performance(ocl): enable device usm alloc reuse
Enabling on MTL+
Limited to use max 2% of global device memory.

Related-To: NEO-6893, NEO-11463

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-17 13:32:45 +02:00
Bartosz Dunajski
cb9977b8f4 feature: create copy offload queue under debug flag
Related-To: NEO-11376

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-17 11:04:35 +02:00
Compute-Runtime-Validation
34f53d5d94 Revert "performance: Reuse GPU timestamps by default"
This reverts commit 7aceed58ca.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-16 21:52:10 +02:00
Young Jin Yoon
e204d27190 fix: print to stdout for disable scratch page
Modified to print out error messages to stdout when disable scratch page
is used.

Related-To: GSD-7611
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-05-16 15:05:07 +02:00
Kamil Kopryk
44ef4c1ef2 refactor: add defaultThreadArbitrationPolicy in cmdlist
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-05-16 13:28:20 +02:00
Maciej Bielski
dc32da2507 feature: prepare StorageInfo for Drm-specific customization
Get the local memory regions count from the primary source (MemoryInfo)
and store for further use when using DrmMemoryManager.
Add a point of dispatch (virtual `createStorageInfoFromProperties`) for
further Drm-specific customizations related to StorageInfo. As the
function became virtual, move one of its callers
(`isLocalMemoryUsedForIsa()`) from the constructor of `MemoryManager` to
respective constructors of all derivative classes. This prevents
bypassing the virtual call dispatch.

Related-To: NEO-9754
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-05-16 12:24:48 +02:00
Pawel Cieslak
84ba633098 refactor: use legacy versions of injectMMIOList and setTbxServerIp
Related-To: NEO-7539
Signed-off-by: Pawel Cieslak <pawel.cieslak@intel.com>
2024-05-16 10:47:35 +02:00
Mateusz Jablonski
03d87d27ef fix: generate per process aub file name by default
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-05-16 09:03:21 +02:00
Bari, Pratik
6b04cd4a1e fix: Release the Dummy Allocation before Root Device Environment
Related-To: GSD-6398, GSD-9027

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2024-05-15 18:08:41 +02:00
Szymon Morek
7aceed58ca performance: Reuse GPU timestamps by default
Related-To: NEO-10615

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-15 17:51:42 +02:00
Mateusz Hoppe
e9b81037a9 refactor: add heaplessStateInitialized flag
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-05-15 15:39:52 +02:00
Dominik Dabek
b4d839fe29 performance(usm): l0, add usm host memory pooling
Disabled by default.

Related-To: NEO-11356

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 15:20:51 +02:00
Dominik Dabek
ae8c7589dc refactor: move implicit arg has indirect access
Move implicit arg has indirect access boolean to kernelAttributes

Related-To: NEO-11396

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 13:11:04 +02:00
Krzysztof Gibala
a70aaa72ed refactor: add debug message about the zero engine info size
Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com>
2024-05-15 09:07:40 +02:00
Young Jin Yoon
06faaab5bb refactor: read scratch page options during init
Change scratch page logic to initialize during Drm::create.

Related-To: GSD-7742
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-05-15 08:56:14 +02:00
Dominik Dabek
fd47030ad6 fix: use igc indirect detection v3
Update to use igc indirect detection v3. Fix for not detecting indirects
passed as implicit arguments.

Related-To: NEO-11396

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 06:35:42 +02:00
Pawel Cieslak
f10439aea2 fix: include <algorithm> where std::find is used
Related-To: NEO-11375
Signed-off-by: Pawel Cieslak <pawel.cieslak@intel.com>
2024-05-14 16:14:41 +02:00
Ratajewski, Andrzej
0b0f7bd4cd feature: Enable support for cl_intel_subgroup_2d_block_io
This change enables support for cl_intel_subgroup_2d_block_io extension for
PVC and later.

Related-To: GSD-7825
Signed-off-by: Ratajewski, Andrzej <andrzej.ratajewski@intel.com>
2024-05-14 15:16:49 +02:00
Maciej Bielski
014720fc29 refactor: use DeviceBitfield where appropriate
Replace less type-safe uint32_t and reduce conversions between both
types.

Related-To: NEO-9754
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-05-14 09:13:11 +02:00
Young Jin Yoon
2c488d9e84 fix: check reset status after completion
Added a logic to check the reset status after the completion to make
sure we go through the logic at least once

Related-To: GSD-8902
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-05-14 06:38:07 +02:00
Compute-Runtime-Validation
ebf09afc42 Revert "performance: Stop direct submission before removing host ptrs"
This reverts commit 4fd219cdbb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-13 10:10:51 +02:00
Compute-Runtime-Validation
741474208e Revert "refactor: command container tests"
This reverts commit 02e2a13c84.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-11 11:46:44 +02:00
Slawomir Milczarek
b37c2970ce test: Rename regkey BcsNumberOverride to BlitterEnableMaskOverride
BlitterEnableMaskOverride is a bitmask with BCS engines available on device

Related-To: NEO-11152

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2024-05-10 21:18:44 +02:00
Szymon Morek
e35b951a00 performance: Allow indirect allocs as pack on OpenCL
Related-To: NEO-11228

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-10 17:57:42 +02:00
Kamil Kopryk
553cb16964 refactor: correct spelling reseting -> resetting
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-05-10 11:38:42 +02:00
Katarzyna Cencelewska
02e2a13c84 refactor: command container tests
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-05-10 11:23:58 +02:00
Compute-Runtime-Validation
6a24610ed1 Revert "fix: Make makeResident explicitly blocking on linux"
This reverts commit 9095c3ed8a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-10 11:03:13 +02:00
Mateusz Hoppe
77b817a15c fix: fail when invalid zebin is decoded
- bindless + bindful accesses in single kernel are not supported

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-05-09 15:53:00 +02:00
Szymon Morek
9095c3ed8a fix: Make makeResident explicitly blocking on linux
Currently this call is blocking on Windows and
xe kmd, this commit also makes it blocking on i915.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-09 14:13:42 +02:00
Maciej Plewka
e39893485c fix: add cache flush as dependency for bcs ccs synchronization
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-09 13:43:39 +02:00
Szymon Morek
10ed479b16 performance: share inter-module ISA allocations
Related-To: NEO-10258

Currently each module has it's own GA
for kernel ISA's. This change allows new modules to
reuse existing allocation.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-09 08:43:55 +02:00
Mateusz Hoppe
76af952fcb feature: Bindless sampled image support
Related-To: NEO-10352

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-05-08 17:50:37 +02:00
Compute-Runtime-Validation
902b611179 Revert "fix: Fix front end programming for cooperative dispatch"
This reverts commit 36ddfaaf4d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-08 11:14:37 +02:00
Slawomir Milczarek
2473c38e31 test: Add regkey to override number of BCS engines on platform
New regkey BcsNumberOverride for use in TBX and AUB mode

Related-To: NEO-11082

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2024-05-07 18:42:01 +02:00
Bartosz Dunajski
e5882e0d31 feature: pass GraphicsAllocation to fence wait
Related-To: NEO-8179

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-07 17:59:28 +02:00
Andrzej Koska
e44c7f2627 fix: BCS MMIO programming
When remap is enabled, we must set different base offset for copy engines.
Copy engines must use BCS0 base.

Related-To: NEO-10678

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-05-07 14:08:28 +02:00
Lukasz Jobczyk
4fd219cdbb performance: Stop direct submission before removing host ptrs
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-05-07 13:25:09 +02:00
Katarzyna Cencelewska
e9f7df6ae6 refactor: create helper for maxPtssIndex
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-05-07 12:47:52 +02:00
Mateusz Hoppe
d35d8727e5 fix: use primaryCsr allocations when csr has primaryCsr set
- global fence allocation
- global stateless heap allocation
- preemption allocation
- debug surface allocation

all above are shared from primary csr

Related-To: NEO-7824


Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-05-07 02:41:24 +02:00
Bartosz Dunajski
ce36812f8d feature: obtain GraphicsAllocation from external CB event
Related-To: NEO-8179

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-06 22:52:36 +02:00
Weronika Kapusta
d6c16c1640 fix: remove compiler cache legacy implementation
Related-To: NEO-10679

Signed-off-by: Weronika Kapusta <weronika.kapusta@intel.com>
2024-05-06 18:28:35 +02:00
Szymon Morek
6df46aa062 performance: Iterate over indirect allocations once
Related-To: NEO-11228

Iterate only on new allocations when making indirect
allocations resident.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-06 15:51:37 +02:00
Szymon Morek
92344bfce5 fix: Cast variable to correct type
Related-To: NEO-10615

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-06 15:40:11 +02:00
Szymon Morek
d708c5f300 performance: Allow indirect allocs as pack on windows
Related-To: NEO-11228

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-06 15:03:12 +02:00
Szymon Morek
83e8ae4a20 performance: Reuse GPU timestamp instead of KMD escape
This can be enabled only if related
debug flag will be set.

Related-To: NEO-10615

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-06 14:46:30 +02:00
Lukasz Jobczyk
c1004b77bf performance: Limit tlb flush in state cache flush to wddm
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-05-06 12:45:26 +02:00
Young Jin Yoon
07aa53fd87 fix: disable scratch page by default only on PVC
Disabled scratch paged by default only on PVC with productHelper.

Related-To: GSD-7742
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-05-01 23:44:48 +02:00
Bartosz Dunajski
806da85ec6 refactor: prework to pass interrupt hint
Related-To: NEO-8179

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-29 11:14:53 +02:00
Maciej Bielski
ff494d5c50 feature: add tile-to-lmem-region map in MemoryInfo
Arguments names refreshed to be more explanatory.

Related-To: NEO-9754

Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-04-26 15:16:03 +02:00
Bartosz Dunajski
2a2596c13b refactor: pass additional data to ioctl helper
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-26 14:53:14 +02:00