Szymon Morek
cf58be4142
performance: use staging buffer when writing to an image
...
Related-To: NEO-12968
Also, don't import usm/mapped allocations for image
operations
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-10-30 17:02:13 +01:00
Lukasz Jobczyk
0f2f3c3764
performance: Prefer to assign new gpu va rather than reuse
...
Resolves: NEO-10734
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-30 16:48:49 +01:00
Szymon Morek
1f60935930
fix: don't return csr as busy if gpu hang is detected
...
Related-To: NEO-13071
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-10-30 16:17:38 +01:00
Zbigniew Zdanowicz
a0ce4b78d3
refactor: change additional walker fields encoder 2/n
...
- remove unneeded release helper method
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-30 14:56:23 +01:00
Zbigniew Zdanowicz
6f4994c269
refactor: change additional walker fields encoder 1/n
...
- move encoding l3 prefetch field into dedicated function
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-29 21:37:38 +01:00
Zbigniew Zdanowicz
2dccdd886a
refactor: change encoder for thread group over dispatch 2/n
...
- bind algorithms to input arguments
- use thread group count array for regular kernels
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-29 21:20:59 +01:00
Lukasz Jobczyk
1f6eaf2525
refactor: Add debug flags to set PATs for dc flush mitigation
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-29 21:12:59 +01:00
Alicja Lukaszewicz
654fdc1345
feature: add query for additional device properties
...
Related-To: NEO-12590
Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com>
2024-10-29 20:40:27 +01:00
Compute-Runtime-Validation
022f9e642d
Revert "refactor: change encoder for thread group over dispatch 2/n"
...
This reverts commit 046631767b.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-29 01:51:52 +01:00
Dominik Dabek
7f811798c0
refactor: AIL for ocl buffer pooling
...
Use dedicated app list for DG2
Related-To: NEO-12974
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-10-28 18:02:38 +01:00
Bartosz Dunajski
7f5e6b4124
Revert "fix: Enable 64k pages for TSB allocation"
...
This reverts commit eed69f45ed.
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-10-28 16:02:34 +01:00
Oskar Hubert Weber
fd45b2883d
fix: neo ULT build with optimizations disabled 1/n
...
Related-To: NEO-8116
Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2024-10-28 15:38:28 +01:00
Zbigniew Zdanowicz
046631767b
refactor: change encoder for thread group over dispatch 2/n
...
- bind algorithms to input arguments
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-28 10:11:13 +01:00
Young Jin Yoon
ebdded1bb9
fix: change error message for GPU page fault
...
Change the error message for GPU page fault to match
the message in the gdb output
Related-To: NEO-13093
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-10-26 01:29:50 +02:00
Dominik Dabek
741101551e
fix: add infrastructure to limit device usm reuse max memory used
...
Related-To: NEO-12924
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-10-25 21:54:41 +02:00
Mateusz Hoppe
81644a46cc
fix: disable global bindless when NEO_L0_SYSMAN_NO_CONTEXT_MODE set
...
Related-To: HSD-13012436157
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-10-25 21:25:26 +02:00
Kamil Kopryk
3891e887c1
feature: use heapless builtins for images
...
Related-To: NEO-12744
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-10-25 16:42:27 +02:00
Zbigniew Zdanowicz
bbdf1ac7b6
refactor: change encoder for thread group over dispatch 1/n
...
- change method name to a more meaningful one
- add all inputs of the algorithm as explicit function arguments
- position all implementations accordingly
- rename unit test names to fit new method name
- fix unit test to have correct initial command values
- fix unit test to have consistent input values with command values
- fix unit test to change input values together with command values
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-25 16:31:39 +02:00
Slawomir Milczarek
2b2bd19845
feature: Add branch dir suffix for ioctl helper xe prelim in cmake
...
Related-To: NEO-12846
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2024-10-25 12:20:43 +02:00
Wenbin Lu
1c46ed9b40
feature: enable reservation from SVM range by default
...
Related-To: NEO-11981
Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-10-24 20:01:18 +02:00
Dominik Dabek
22a3ec948b
fix: ulls controller sleep, windows
...
Request higher resolution for Windows periodic timers used for ULLS
controller sleep.
This allows the controller thread to sleep with a granularity of 1 ms.
Related-To: NEO-10800
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-10-24 18:07:04 +02:00
Jitendra Sharma
b51be4e2dd
refactor: fix description of debug variables
...
Fix descriptions of debug variables DebugUmdInterruptTimeout
and DebugUmdMaxReadWriteRetry.
Related-To: NEO-13046
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-10-24 09:35:57 +02:00
Joshua Santosh Ranjan
c9e48d0d2b
refactor: support oa programmable metric group
...
Related-To: NEO-12184
Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2024-10-24 08:35:12 +02:00
Zbigniew Zdanowicz
a4f34e6fcc
refactor: unify programming of preferred slm size 8/n
...
- remove unneeded release helper method
Related-To: NEO-12639
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-10-24 08:32:17 +02:00
Naklicki, Mateusz
a429f520fb
fix: use proper cpu caching for USM allocations
...
When using a USM allocation, the placement of a BO can be set to both
local and system memory at the same time. This indicates that migration
between them is allowed, so CPU caching must be disabled to prevent corruption.
Related-To: HSD-18040382956
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2024-10-24 03:24:29 +02:00
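The rule described in the commit above (a5a... is not implied; this refers to a429f520fb) can be sketched as a small predicate. This is only an illustration under stated assumptions: the enum, the mask layout and the function name are hypothetical and are not taken from the NEO sources. It covers only the case the commit message names (both placements set) and says nothing about the caching policy in other cases.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical placement bits; not the actual NEO or kernel definitions.
enum MemoryPlacement : std::uint32_t {
    placementSystem = 1u << 0, // buffer object may reside in system memory
    placementLocal = 1u << 1   // buffer object may reside in device-local memory
};

// Mask with both placements set, i.e. migration between them is allowed.
constexpr std::uint32_t migratablePlacement = placementSystem | placementLocal;

// Returns true when the buffer object may migrate between local and system
// memory, in which case CPU caching must not be used for the mapping.
constexpr bool mustDisableCpuCaching(std::uint32_t placementMask) {
    return (placementMask & migratablePlacement) == migratablePlacement;
}

static_assert(mustDisableCpuCaching(placementSystem | placementLocal),
              "both placements set: caching must be disabled");
static_assert(!mustDisableCpuCaching(placementLocal),
              "single placement: this rule does not apply");
```

A caller would consult the predicate when choosing the CPU caching mode for a USM buffer object and fall back to its existing policy whenever it returns false.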
Chodor, Jaroslaw
5f908ce092
feature: adding support for custom compiler backends
...
This adds the ability to load different versions of the backend
compiler based on the underlying device.
Related-To: NEO-12747
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2024-10-23 19:55:36 +02:00
Compute-Runtime-Validation
69046526bd
Revert "feature: Add branch dir suffix for ioctl helper xe prelim in cmake"
...
This reverts commit ac9386e190.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-23 19:25:11 +02:00
Bartosz Dunajski
9d76158c1f
feature: debug flag to change ULLS BCS timeout
...
Related-To: HSD-18040119232
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-10-23 18:18:42 +02:00
Slawomir Milczarek
ac9386e190
feature: Add branch dir suffix for ioctl helper xe prelim in cmake
...
Related-To: NEO-12846
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2024-10-23 14:30:58 +02:00
Szymon Morek
fca544b178
performance: enable idle csr detection in ULLS controller
...
Related-To: NEO-12991
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-10-23 12:10:45 +02:00
Lukasz Jobczyk
e687e11ab1
performance: Add CCS Optimization
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-23 11:35:57 +02:00
Jitendra Sharma
171f1e27a3
fix: Add debug variables for configurable timeouts in debugger
...
Related-To: NEO-13046
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-10-23 10:30:03 +02:00
Jaroslaw Warchulski
2ba5ee2f6b
Revert "fix: use full size for HEAP_EXTENDED initialization"
...
This reverts commit 5afc63df93.
Related-To: GSD-8948
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-10-22 19:27:31 +02:00
Maciej Bielski
45e78fea76
fix: use productHelper in getPatIndexInfoString() on Windows
...
Fix PAT-index reporting in the logger; the values currently reported
on Windows are simply wrong.
The changed logic depends on `RootDeviceEnvironment`, and to
avoid introducing such dependencies into logger.[ch],
`logAllocation()` is no longer a member of `FileLogger` but
a free function (in a separate .cpp file). This is important
because the source files `logger.[ch]` are also used by the ocloc library,
and there is no point in contaminating the ocloc code structure with
unnecessary dependencies.
Related-To: NEO-9421
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-10-22 19:27:13 +02:00
Wenbin Lu
a8a40d2afd
feature: support SVM heap in reserveVirtualMem
...
Related-To: NEO-11981
Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-10-22 16:47:14 +02:00
Bartosz Dunajski
eed69f45ed
fix: Enable 64k pages for TSB allocation
...
Related-To: HSD-18040274716
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-10-22 15:12:42 +02:00
Robert Krzemień
fe1fd22371
fix: stop direct submission before oa configuration activation
...
Related-To: NEO-12778
Signed-off-by: Robert Krzemień <robert.krzemien@intel.com>
2024-10-22 14:26:44 +02:00
Szymon Morek
01a0b8e7f7
performance: improve ULLS controller timeout detection
...
Related-To: NEO-12991
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-10-22 13:53:25 +02:00
Compute-Runtime-Validation
e10998db45
Revert "performance: Add CCS Optimization"
...
This reverts commit e7b3a40aa7.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-22 05:52:14 +02:00
Michal Mrozek
18d828421d
performance: add debug flag to control huge chunk size on wddm.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-10-21 16:51:03 +02:00
Naklicki, Mateusz
a5a11f4a0e
feature: add debug key to set MaxSubSlicesSupported
...
Related-To: HSD-16025421624
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2024-10-21 13:34:43 +02:00
Lukasz Jobczyk
8a647f6a39
Revert "performance: Ensure hostptrs removed before creating new one"
...
This reverts commit 5b2f2f3d83.
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-21 10:36:06 +02:00
Bartosz Dunajski
ff80a02fcb
refactor: parse extra zebin params
...
Related-To: NEO-12591
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-10-21 09:59:33 +02:00
Lukasz Jobczyk
e7b3a40aa7
performance: Add CCS Optimization
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-10-21 09:59:14 +02:00
Jitendra Sharma
26709ba124
fix: Implement polling of SW FIFO
...
Related-To: NEO-12955
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-10-21 07:19:42 +02:00
Jaroslaw Warchulski
5afc63df93
fix: use full size for HEAP_EXTENDED initialization
...
Related-To: GSD-8948
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-10-18 18:14:34 +02:00
Tomasz Biernacik
46f3d27339
feature: initialize regionCount
...
Related-To: NEO-12138
Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2024-10-18 13:33:14 +02:00
Mateusz Hoppe
c72ebb263d
feature: enable global bindless allocator on xe2hpg
...
Related-To: NEO-7063, NEO-11533
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-10-18 09:37:37 +02:00
Compute-Runtime-Validation
d7a4dab4ca
Revert "performance: Enable timestamp wait for queues on Xe2"
...
This reverts commit 9185760a0e.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-18 08:46:03 +02:00
Mateusz Jablonski
4154e6666b
fix: correct loading L0 loader functions
...
- don't load ze_loader.dll from file system
- to perform self-open on Windows use getModuleHandleA with proper module name
- don't free library loaded with getModuleHandleA
- as the loader may not be available during runtime teardown:
  - load translate handle function during global setup
  - load setDriverTeardown function during global teardown
  - when loader is not available during teardown, unset translate handle
    function
Related-To: GSD-10147
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-10-17 15:39:20 +02:00