Commit Graph

6206 Commits

Author SHA1 Message Date
Joshua Santosh Ranjan
d294d71f95 feature: make programmable metrics enabled by default
Related-To: NEO-13011

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2024-11-13 09:42:23 +01:00
Zbigniew Zdanowicz
d7f310dd5a refactor: reorganize command encode classes 5/n
- move various methods from specialization in cpp into correct inl files
- EncodeAtomic, EncodeMiFlushDW, EncodeMemoryPrefetch, EncodeDispatchKernel
- move methods from generic inl files that are used on single platform to cpp
- EncodeMiFlushDW

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-13 09:18:30 +01:00
Jaroslaw Warchulski
72efceb8a3 fix: try to lower alignment in heap allocator when it is too big
Related-To: GSD-8948
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-11-13 09:14:12 +01:00
Mateusz Jablonski
1350441a72 build: fix multiple definitions issue
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-12 15:54:12 +01:00
Dominik Dabek
0a12817664 performance: flag, force zero copy for host ptr
When debug flag ForceZeroCopyForUseHostPtr is set, add
CL_MEM_FORCE_HOST_MEMORY_INTEL flag to buffers created with
CL_MEM_USE_HOST_PTR.
This makes the buffers use zero copy.

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-11-12 15:43:17 +01:00
Mateusz Jablonski
f55ad93baf fix: remove L3 config from release helper
L3 bank count should be queried from KMD
L3 bank size should be queried from device blob

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-12 14:28:36 +01:00
Lukasz Jobczyk
7f3896d05f performance: Ensure hostptrs removed before creating new one
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-11-12 13:55:15 +01:00
Mateusz Jablonski
33d2a40cc8 fix: setup threads per EU configs based on real threads per eu count
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-12 12:53:45 +01:00
Zbigniew Zdanowicz
051648e215 refactor: reorganize command encode classes 4/n
- share xe hpc methods in generic way
- move gen12lp and xe hpg methods into dedicated inl

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-12 12:15:54 +01:00
Compute-Runtime-Validation
5b35887bd1 Revert "feature: enable wmtp on BMG"
This reverts commit f95a6339a5.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-11-10 11:21:14 +01:00
Slawomir Milczarek
a331ffcb55 feature: Add ioctl helper functions to mmap/unmap and acquire/releaseGpuRange
Related-To: NEO-11817

Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2024-11-08 15:22:44 +01:00
Zbigniew Zdanowicz
ead001360b refactor: reorganize command encode classes 3/n
- shift xe hpg inl file methods into specialization and generic inl files
- remove gen12lp specific implementation from main inl file
- remove not needed xe hpg inl file
- remove not needed compression inl files
- remove not used raytracing inl file

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-08 14:38:31 +01:00
Zbigniew Zdanowicz
eb435acb22 refactor: reorganize command encode classes 2/n
- add new inl files for selected platform families
- split image compression methods into correct inl files
- remove dedicated compression inl files

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-08 13:07:03 +01:00
Mateusz Jablonski
2054322041 build: update aot config header
stop using BMG_G21_A1 / BMG_G21_B0

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-07 14:46:50 +01:00
Mateusz Jablonski
f95a6339a5 feature: enable wmtp on BMG
Related-To: NEO-12814
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-07 14:28:25 +01:00
Szymon Morek
1199e19df5 performance: enable image writes through staging chunks
Related-To: NEO-12968

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-07 13:43:25 +01:00
Zbigniew Zdanowicz
9536510c5b refactor: reorganize command encode classes 1/n
- remove obsolete file
- move methods from redundant compute mode file into dedicated platform files
- group same implementation into platform specific inl files

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-07 13:06:28 +01:00
Mateusz Jablonski
daa53cd8ca fix: change default bmg revision to 0
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-11-06 18:06:40 +01:00
Szymon Morek
a25e973205 performance: write image through staging chunks
Related-To: NEO-12968

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-06 16:45:22 +01:00
Compute-Runtime-Validation
fd1a000ae4 Revert "feature: enable recoverable pagefault"
This reverts commit c6c2eef063.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-11-06 07:39:04 +01:00
Mrozek, Michal
2febf0597e fix: remove incorrect code
- freed pointer cannot overlap with freed chunks, therefore there is no way
to merge it with freed chunks if they overlap

Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
2024-11-05 18:58:41 +01:00
Kamil Kopryk
99238f4dee refactor: simplify getBuiltinResourceNames
remove not needed lambda function,
use string_view and remove string creation with .str()

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-11-05 14:46:46 +01:00
Lukasz Jobczyk
fe82425fa9 performance: Prefer to assign new gpu va rather than reuse
Resolves: NEO-10734

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-11-05 10:16:03 +01:00
Zbigniew Zdanowicz
89b40b7ec4 refactor: change additional walker fields encoder 6/n
- change order of members of additional walker struct

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-05 08:28:57 +01:00
Szymon Morek
5d62be2bea performance: enable staging buffer for write image
Related-To: NEO-12968

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-05 06:06:07 +01:00
Young Jin Yoon
c6c2eef063 feature: enable recoverable pagefault
Enabled recoverable pagefault

Related-To: NEO-13116
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-11-04 21:44:04 +01:00
Wojciech Konior
c65b45471b feature: support binary compatibility across multiple HW targets
- EnableCompatibilityMode flag added
- validateTergetDevice func modified to take into account the flag

Related-To: NEO-11568

Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2024-11-04 16:53:57 +01:00
Szymon Morek
8aa5331bc1 fix: wait for latest known usage of indirect usm
Related-To: GSD-9989

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-11-04 16:24:30 +01:00
Zbigniew Zdanowicz
2e2b7a473a refactor: change additional walker fields encoder 5/n
- move compute dispatch all walker into dedicated encoder
- group same implementations into single file

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-11-04 13:11:59 +01:00
Marcel Skierkowski
49d999abe6 fix: remove defaultProfilingTimerResolution from RuntimeCapabilityTable
Related-To: NEO-12275
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2024-11-04 12:02:11 +01:00
Zbigniew Zdanowicz
32fd00e150 refactor: change additional walker fields encoder 4/n
- move post sync system fence into dedicated encoder

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-31 14:41:55 +01:00
Naklicki, Mateusz
64061b623b feature: xekmd vm prefetch
Related-To: HSD-18040382956
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2024-10-31 13:49:25 +01:00
Compute-Runtime-Validation
aca7875dfd Revert "performance: Prefer to assign new gpu va rather than reuse"
This reverts commit 0f2f3c3764.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-31 05:06:43 +01:00
Compute-Runtime-Validation
45a26c22dd Revert "performance: limit tlb flush scope to DG2"
This reverts commit 10d123ae3e.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-30 22:38:17 +01:00
Szymon Morek
10d123ae3e performance: limit tlb flush scope to DG2
Related-To: NEO-7116

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-10-30 18:27:19 +01:00
Szymon Morek
cf58be4142 performance: use staging buffer when writing to an image
Related-To: NEO-12968

Also, don't import usm/mapped allocations for image
operations

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-10-30 17:02:13 +01:00
Lukasz Jobczyk
0f2f3c3764 performance: Prefer to assign new gpu va rather than reuse
Resolves: NEO-10734

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-30 16:48:49 +01:00
Szymon Morek
1f60935930 fix: don't return csr as busy if gpu hang is detected
Related-To: NEO-13071

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-10-30 16:17:38 +01:00
Zbigniew Zdanowicz
a0ce4b78d3 refactor: change additional walker fields encoder 2/n
- remove not needed release helper method

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-30 14:56:23 +01:00
Zbigniew Zdanowicz
6f4994c269 refactor: change additional walker fields encoder 1/n
- move encoding l3 prefetch field into dedicated function

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-29 21:37:38 +01:00
Zbigniew Zdanowicz
2dccdd886a refactor: change encoder for thread group over dispatch 2/n
- bind algorithms to input arguments
- use thread group count array for regular kernels

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-29 21:20:59 +01:00
Lukasz Jobczyk
1f6eaf2525 refactor: Add debug flags to set PATs for dc flush mitigation
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-29 21:12:59 +01:00
Alicja Lukaszewicz
654fdc1345 feature: add query for additional device properties
Related-To: NEO-12590

Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com>
2024-10-29 20:40:27 +01:00
Compute-Runtime-Validation
022f9e642d Revert "refactor: change encoder for thread group over dispatch 2/n"
This reverts commit 046631767b.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-29 01:51:52 +01:00
Dominik Dabek
7f811798c0 refactor: AIL for ocl buffer pooling
Use dedicated app list for DG2

Related-To: NEO-12974

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-10-28 18:02:38 +01:00
Bartosz Dunajski
7f5e6b4124 Revert "fix: Enable 64k pages for TSB allocation"
This reverts commit eed69f45ed.

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-10-28 16:02:34 +01:00
Oskar Hubert Weber
fd45b2883d fix: neo ULT build with optimizations disabled 1/n
Related-To: NEO-8116

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2024-10-28 15:38:28 +01:00
Zbigniew Zdanowicz
046631767b refactor: change encoder for thread group over dispatch 2/n
- bind algorithms to input arguments

Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-28 10:11:13 +01:00
Young Jin Yoon
ebdded1bb9 fix: change error message for GPU page fault
Change the error message for GPU page fault to match
with the message from gdb output

Related-To: NEO-13093
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-10-26 01:29:50 +02:00
Dominik Dabek
741101551e fix: add infrastructure to limit device usm reuse max memory used
Related-To: NEO-12924

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-10-25 21:54:41 +02:00