Joshua Santosh Ranjan
d294d71f95
feature: make programmable metrics enabled by default
...
Related-To: NEO-13011
Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com >
2024-11-13 09:42:23 +01:00
Zbigniew Zdanowicz
d7f310dd5a
refactor: reorganize command encode classes 5/n
...
- move various methods from specialization in cpp into correct inl files
- EncodeAtomic, EncodeMiFlushDW, EncodeMemoryPrefetch, EncodeDispatchKernel
- move methods from generic inl files that are used on single platform to cpp
- EncodeMiFlushDW
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-11-13 09:18:30 +01:00
Jaroslaw Warchulski
72efceb8a3
fix: try to lower alignment in heap allocator when it is too big
...
Related-To: GSD-8948
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com >
2024-11-13 09:14:12 +01:00
Mateusz Jablonski
1350441a72
build: fix multiple definitions issue
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-11-12 15:54:12 +01:00
Dominik Dabek
0a12817664
performance: flag, force zero copy for host ptr
...
When debug flag ForceZeroCopyForUseHostPtr is set, add
CL_MEM_FORCE_HOST_MEMORY_INTEL flag to buffers created with
CL_MEM_USE_HOST_PTR.
This makes the buffers use zero copy.
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-11-12 15:43:17 +01:00
Mateusz Jablonski
f55ad93baf
fix: remove L3 config from release helper
...
L3 bank count should be queried from KMD
L3 bank size should be queried from device blob
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-11-12 14:28:36 +01:00
Lukasz Jobczyk
7f3896d05f
performance: Ensure hostptrs removed before creating new one
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-11-12 13:55:15 +01:00
Mateusz Jablonski
33d2a40cc8
fix: setup threads per EU configs based on real threads per eu count
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-11-12 12:53:45 +01:00
Zbigniew Zdanowicz
051648e215
refactor: reorganize command encode classes 4/n
...
- share xe hpc methods in generic way
- move gen12lp and xe hpg methods into dedicated inl
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-11-12 12:15:54 +01:00
Compute-Runtime-Validation
5b35887bd1
Revert "feature: enable wmtp on BMG"
...
This reverts commit f95a6339a5 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-11-10 11:21:14 +01:00
Slawomir Milczarek
a331ffcb55
feature: Add ioctl helper functions to mmap/unmap and acquire/releaseGpuRange
...
Related-To: NEO-11817
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com >
2024-11-08 15:22:44 +01:00
Zbigniew Zdanowicz
ead001360b
refactor: reorganize command encode classes 3/n
...
- shift xe hpg inl file methods into specialization and generic inl files
- remove gen12lp specific implementation from main inl file
- remove not needed xe hpg inl file
- remove not needed compression inl files
- remove not used raytracing inl file
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-11-08 14:38:31 +01:00
Zbigniew Zdanowicz
eb435acb22
refactor: reorganize command encode classes 2/n
...
- add new inl files for selected platform families
- split image compression methods into correct inl files
- remove dedicated compression inl files
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-11-08 13:07:03 +01:00
Mateusz Jablonski
2054322041
build: update aot config header
...
stop using BMG_G21_A1 / BMG_G21_B0
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-11-07 14:46:50 +01:00
Mateusz Jablonski
f95a6339a5
feature: enable wmtp on BMG
...
Related-To: NEO-12814
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-11-07 14:28:25 +01:00
Szymon Morek
1199e19df5
performance: enable image writes through staging chunks
...
Related-To: NEO-12968
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-11-07 13:43:25 +01:00
Zbigniew Zdanowicz
9536510c5b
refactor: reorganize command encode classes 1/n
...
- remove obsolete file
- move methods from redundant compute mode file into dedicated platform files
- group same implementation into platform specific inl files
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-11-07 13:06:28 +01:00
Mateusz Jablonski
daa53cd8ca
fix: change default bmg revision to 0
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-11-06 18:06:40 +01:00
Szymon Morek
a25e973205
performance: write image through staging chunks
...
Related-To: NEO-12968
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-11-06 16:45:22 +01:00
Compute-Runtime-Validation
fd1a000ae4
Revert "feature: enable recoverable pagefault"
...
This reverts commit c6c2eef063 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-11-06 07:39:04 +01:00
Mrozek, Michal
2febf0597e
fix: remove incorrect code
...
- freed pointer cannot overlap with freed chunks, therefore there is no way
to merge it with freed chunks if they overlap
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com >
2024-11-05 18:58:41 +01:00
Kamil Kopryk
99238f4dee
refactor: simplify getBuiltinResourceNames
...
remove not needed lambda function,
use string_view and remove string creation with .str()
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-11-05 14:46:46 +01:00
Lukasz Jobczyk
fe82425fa9
performance: Prefer to assign new gpu va rather than reuse
...
Resolves: NEO-10734
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-11-05 10:16:03 +01:00
Zbigniew Zdanowicz
89b40b7ec4
refactor: change additional walker fields encoder 6/n
...
- change order of members of additional walker struct
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-11-05 08:28:57 +01:00
Szymon Morek
5d62be2bea
performance: enable staging buffer for write image
...
Related-To: NEO-12968
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-11-05 06:06:07 +01:00
Young Jin Yoon
c6c2eef063
feature: enable recoverable pagefault
...
Enabled recoverable pagefault
Related-To: NEO-13116
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2024-11-04 21:44:04 +01:00
Wojciech Konior
c65b45471b
feature: support binary compatibility across multiple HW targets
...
- EnableCompatibilityMode flag added
- validateTergetDevice func modified to take into account the flag
Related-To: NEO-11568
Signed-off-by: Wojciech Konior <wojciech.konior@intel.com >
2024-11-04 16:53:57 +01:00
Szymon Morek
8aa5331bc1
fix: wait for latest known usage of indirect usm
...
Related-To: GSD-9989
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-11-04 16:24:30 +01:00
Zbigniew Zdanowicz
2e2b7a473a
refactor: change additional walker fields encoder 5/n
...
- move compute dispatch all walker into dedicated encoder
- group same implementations into single file
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-11-04 13:11:59 +01:00
Marcel Skierkowski
49d999abe6
fix: remove defaultProfilingTimerResolution from RuntimeCapabilityTable
...
Related-To: NEO-12275
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com >
2024-11-04 12:02:11 +01:00
Zbigniew Zdanowicz
32fd00e150
refactor: change additional walker fields encoder 4/n
...
- move post sync system fence into dedicated encoder
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-10-31 14:41:55 +01:00
Naklicki, Mateusz
64061b623b
feature: xekmd vm prefetch
...
Related-To: HSD-18040382956
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com >
2024-10-31 13:49:25 +01:00
Compute-Runtime-Validation
aca7875dfd
Revert "performance: Prefer to assign new gpu va rather than reuse"
...
This reverts commit 0f2f3c3764 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-10-31 05:06:43 +01:00
Compute-Runtime-Validation
45a26c22dd
Revert "performance: limit tlb flush scope to DG2"
...
This reverts commit 10d123ae3e .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-10-30 22:38:17 +01:00
Szymon Morek
10d123ae3e
performance: limit tlb flush scope to DG2
...
Related-To: NEO-7116
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-10-30 18:27:19 +01:00
Szymon Morek
cf58be4142
performance: use staging buffer when writing to an image
...
Related-To: NEO-12968
Also, don't import usm/mapped allocations for image
operations
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-10-30 17:02:13 +01:00
Lukasz Jobczyk
0f2f3c3764
performance: Prefer to assign new gpu va rather than reuse
...
Resolves: NEO-10734
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-10-30 16:48:49 +01:00
Szymon Morek
1f60935930
fix: don't return csr as busy if gpu hang is detected
...
Related-To: NEO-13071
Signed-off-by: Szymon Morek <szymon.morek@intel.com >
2024-10-30 16:17:38 +01:00
Zbigniew Zdanowicz
a0ce4b78d3
refactor: change additional walker fields encoder 2/n
...
- remove not needed release helper method
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-10-30 14:56:23 +01:00
Zbigniew Zdanowicz
6f4994c269
refactor: change additional walker fields encoder 1/n
...
- move encoding l3 prefetch field into dedicated function
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-10-29 21:37:38 +01:00
Zbigniew Zdanowicz
2dccdd886a
refactor: change encoder for thread group over dispatch 2/n
...
- bind algorithms to input arguments
- use thread group count array for regular kernels
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-10-29 21:20:59 +01:00
Lukasz Jobczyk
1f6eaf2525
refactor: Add debug flags to set PATs for dc flush mitigation
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-10-29 21:12:59 +01:00
Alicja Lukaszewicz
654fdc1345
feature: add query for additional device properties
...
Related-To: NEO-12590
Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com >
2024-10-29 20:40:27 +01:00
Compute-Runtime-Validation
022f9e642d
Revert "refactor: change encoder for thread group over dispatch 2/n"
...
This reverts commit 046631767b .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-10-29 01:51:52 +01:00
Dominik Dabek
7f811798c0
refactor: AIL for ocl buffer pooling
...
Use dedicated app list for DG2
Related-To: NEO-12974
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-10-28 18:02:38 +01:00
Bartosz Dunajski
7f5e6b4124
Revert "fix: Enable 64k pages for TSB allocation"
...
This reverts commit eed69f45ed .
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com >
2024-10-28 16:02:34 +01:00
Oskar Hubert Weber
fd45b2883d
fix: neo ULT build with optimizations disabled 1/n
...
Related-To: NEO-8116
Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com >
2024-10-28 15:38:28 +01:00
Zbigniew Zdanowicz
046631767b
refactor: change encoder for thread group over dispatch 2/n
...
- bind algorithms to input arguments
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-10-28 10:11:13 +01:00
Young Jin Yoon
ebdded1bb9
fix: change error message for GPU page fault
...
Change the error message for GPU page fault to match
with the message from gdb output
Related-To: NEO-13093
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2024-10-26 01:29:50 +02:00
Dominik Dabek
741101551e
fix: add infrastructure to limit device usm reuse max memory used
...
Related-To: NEO-12924
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-10-25 21:54:41 +02:00