Kamil Kopryk
b7681a2e7b
refactor: redirect flush L3 host usm to external by default
...
Related-To: NEO-13163
Rename ForceL3FlushAfterPostSync to EnableL3FlushAfterPostSync
Rename DisableFlushL3ForHostUsm to RedirectFlushL3HostUsmToExternal
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-05-28 08:50:01 +02:00
Filip Hazubski
3dc32cec88
feature: Add debug key to override number of threads per EU
...
Related-To: HSD-22021012755, NEO-14749
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-05-23 16:20:40 +02:00
Mateusz Jablonski
98bf872fdd
fix: return error when cannot obtain debug surface size
...
remove default max debug surface size
check state save area size only for debug scenarios
reduce state save area size in unit tests - rely on values from mock
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-05-23 14:48:08 +02:00
Mateusz Jablonski
8353f68307
feature(internal): add debug key to override max debug surface size
...
reduce the value in unit tests
Related-To: NEO-14097
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-05-21 14:57:41 +02:00
Bartosz Dunajski
91940d2a79
fix: initial support for single temporary allocations list
...
Related-To: NEO-14641
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-05-21 13:16:15 +02:00
Szymon Morek
3ba25459e8
performance: set timestamps as non-coherent since xe2
...
Related-To: NEO-14360
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-21 07:49:56 +02:00
Bartosz Dunajski
7f7aa36c52
feature: allow enabling copy offload on default cmd list
...
Related-To: NEO-7067
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-05-19 13:28:46 +02:00
Szymon Morek
b74dc9b7e1
performance: enable low latency hint on Xe
...
Related-To: NEO-14708
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-14 15:47:21 +02:00
Young Jin Yoon
5221b5b00e
fix: patch counter values with additional blit properties
...
Modified BlitCommandsHelper and CommandListHw to patch counter values
when using additional blit properties and in order command list is
enabled.
Related-To: NEO-13003
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2025-05-13 04:32:31 +02:00
Lukasz Jobczyk
df2c776aab
refactor: Remove unused cmdq round robin engine assign
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-09 11:03:57 +02:00
Lukasz Jobczyk
e964ccdf57
refactor: remove unused ulls diagnostic mode
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-09 10:01:51 +02:00
Lukasz Jobczyk
1d1414febc
refactor: remove unused dc flush mitigation
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-09 08:26:26 +02:00
Lukasz Jobczyk
d590172b65
refactor: Remove unused events tracker
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-08 23:53:47 +02:00
Michal Mrozek
52daf63a6f
refactor: remove not needed debug variable
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-05-08 14:29:27 +02:00
Filip Hazubski
ba85f7417d
fix: Add debug toggle to disable flush L3 for host usm
...
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-05-07 16:57:31 +02:00
Lukasz Jobczyk
ac7bd757a6
refactor: deallocate via gmm
...
Related-To: NEO-13051
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-06 18:12:46 +02:00
Bartosz Dunajski
f3ad230c45
feature: initial support for different copy offload submission modes
...
Related-To: NEO-7067
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-04-30 09:18:02 +02:00
Bartosz Dunajski
c2266fc69e
refactor: remove not used CB event TS logic
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-04-28 14:31:34 +02:00
Michal Mrozek
c41ac0ad4c
refactor: remove not needed code
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-04-25 16:44:01 +02:00
Mateusz Hoppe
41efee1e7c
feature: add mode to expose single root device
...
- simplify API to expose and use only single root device. Disallow
returning subdevices. This is experimental mode.
Related-To: NEO-14559
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-04-25 13:20:56 +02:00
Compute-Runtime-Validation
30f6ddb6c7
Revert "refactor: deallocate via gmm"
...
This reverts commit bf861d8d49
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-04-23 03:50:43 +02:00
Lukasz Jobczyk
bf861d8d49
refactor: deallocate via gmm
...
Related-To: NEO-13051
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-22 16:46:36 +02:00
Aravind Gopalakrishnan
b4f95a1a26
feature: Add pidfd option for IPC handle import
...
Related-To: NEO-10380
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-04-18 13:01:27 +02:00
Szymon Morek
3596522637
refactor: remove unused logic in ULLS controller
...
Related-To: NEO-13843
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-17 18:35:20 +02:00
Chodor, Jaroslaw
eb58e1f964
feature: Allow NEOReadDebugKeys and PrintDebugMessages envs in ocloc
...
This enables NEO's NEOReadDebugKeys and PrintDebugMessages in ocloc
in the same scenarios as in NEO OCL/L0 RT
Note :
* NEOReadDebugKeys=1 should not be prefixed with NEO_OCLOC_
* PrintDebugMessages=1 should be prefixed with NEO_OCLOC_
(to distinguish from OCL/L0 RT usage), so the full env
is NEO_OCLOC_PrintDebugMessages=1
Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-16 18:17:15 +02:00
Chodor, Jaroslaw
b423084861
refactor: Change FinalizerInputType debug var to string
...
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-16 10:33:19 +02:00
Chodor, Jaroslaw
2ce694c52b
feature: Enabling support for redirecting FCL to IGC for OCLOC
...
This allows for invoking IGC as OCL C translator in OCLOC paths
Accepted debug env :
NEO_OCLOC_UseIgcAsFcl=1 ; forces IGC
NEO_OCLOC_UseIgcAsFcl=2 ; forces FCL
NEO_OCLOC_UseIgcAsFcl=0 ; use defaults
Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-15 16:44:35 +02:00
Chodor, Jaroslaw
7bded13638
feature: Enabling support for redirecting FCL to IGC for OCL
...
This allows for invoking IGC as OCL C translator in OpenCL paths
Accepted debug env :
UseIgcAsFcl=1 ; forces IGC
UseIgcAsFcl=2 ; forces FCL
UseIgcAsFcl=0 ; uses defaults
Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-14 18:41:48 +02:00
Chodor, Jaroslaw
a079a2654d
feature: Enabling support for DisableDeepBind debug env in ocloc
...
Accepted debug env :
NEO_OCLOC_DisableDeepBind=1
Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-12 23:21:19 +02:00
Chodor, Jaroslaw
9e14eaaed1
feature: Adding support for scoped debug variables
...
This allows for limitting debug variables to specific
components.
Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-11 17:52:17 +02:00
Fabian Zwoliński
7ef3880793
feature: implement pool allocator for gpuTimestampDeviceBuffer
...
The patch applies to Level Zero.
Only allocations < 2MB will be fetched from the pool.
Allocations are shared and reused within a given device.
Additionally, I added a new debug flag to control the allocator:
EnableTimestampPoolAllocator
Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-02 14:28:56 +02:00
Dominik Dabek
915d657420
fix: flag to limit usm reuse based on memory usage
...
Host usm and device usm for igfx checks system memory usage.
Device usm for dgfx checks local memory usage.
If used memory is above limit threshold:
- no new allocations will be saved for reuse
- cleaner will use shorter hold time of 2 seconds
- cleaner will free all eligible allocations, regardless of async
deleter thread having work
Motivation: in case of gfx memory being full, making resident new
allocations will require evictions which leads to massive slowdown on
enqueue calls.
This change aims to minimize cases where extra memory usage from usm
reuse mechanism leads to above situation.
Related-To: NEO-6893, NEO-14160
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-27 10:25:19 +01:00
Lukasz Jobczyk
8a85a96ed2
feature: Add 3-level wait scheme with tpause intrinsic
...
Related-To: NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-21 12:12:57 +01:00
Michal Mrozek
6a9716b81c
refactor: remove not needed code
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-03-19 14:53:57 +01:00
Jitendra Sharma
dda7876d3a
feature: Introduce SipClassType::externalLib
...
Related-To: NEO-13737
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2025-03-19 11:06:03 +01:00
Maciej Bielski
a7c46b8213
feature: support L2 cache reservation
...
Related-To: NEO-12837
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-03-17 19:41:55 +01:00
Kamil Kopryk
2c77d2c698
refactor: add SetMaxBVHLevels debug flag
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-14 16:59:48 +01:00
Jaroslaw Warchulski
413194bd2a
Revert "fix: do not prefer image compression on xe_lpg for linux and WSL"
...
This reverts commit 8814b6ac4f
.
Resolves: NEO-14286
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-03-07 11:38:46 +01:00
Dominik Dabek
9eb8e1812c
feature: flag to log usm reuse operations
...
If flag "LogUsmReuse" is set, usm reuse will log operations to csv file.
Each line will contain: timestamp, host/device, operation type,
allocation size, true/false whether operation succeeded.
This data can then be used to produce graphs and help in analyzing
usm reuse in a particular workload.
Related-To: NEO-6893
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-03-06 11:06:27 +01:00
Szymon Morek
82fba79d9d
performance: set 1ms timeout for ulls controller on LNL and PTL
...
Related-To: NEO-13843
Limit scope to Windows only.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-03-06 09:13:58 +01:00
Kamil Kopryk
4c795027e3
refactor: add check if event L3 flush is needed
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-03-05 18:25:29 +01:00
Compute-Runtime-Validation
98d1d629a8
Revert "feature: Enable defer backing by default"
...
This reverts commit fffe8f8027
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-03-05 03:35:03 +01:00
Bellekallu Rajkiran
fffe8f8027
feature: Enable defer backing by default
...
Related-To: NEO-13403
Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2025-03-04 14:29:53 +01:00
Jaroslaw Warchulski
8814b6ac4f
fix: do not prefer image compression on xe_lpg for linux and WSL
...
Related-To: HSD-18034872015
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-02-28 14:20:57 +01:00
Mateusz Jablonski
374863ba08
refactor: remove cl pipe related logic
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-25 17:56:53 +01:00
Jitendra Sharma
d480cc093c
feature: Add method to determine if Sip should be get from external library
...
Related-To: NEO-13735
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2025-02-25 15:14:54 +01:00
Compute-Runtime-Validation
f907c7dcb2
Revert "performance: set 1ms timeout for ulls controller since xe2"
...
This reverts commit 28c008e438
.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-02-24 15:04:54 +01:00
Mateusz Jablonski
3e45cf2a6c
feature: add debug key to expose L0 DDI Handles extension
...
Related-To: NEO-13121, NEO-13917
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-20 17:24:33 +01:00
Mateusz Jablonski
70f3342c2d
refactor: remove vme specific code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-20 17:08:52 +01:00
Szymon Morek
28c008e438
performance: set 1ms timeout for ulls controller since xe2
...
Related-To: NEO-13843
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-02-19 13:48:26 +01:00