Lukasz Jobczyk
a96f2ea13a
performance: disable blit enqueue on LNL
...
Resolves: NEO-11471
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-04 05:41:31 +02:00
Szymon Morek
daeb6e897a
performance: limit barrier usage in non-usm copies
...
Related-To: NEO-11501
Don't emit a barrier when a non-usm copy through staging
buffers on an OOQ issued only a single transfer.
With a single transfer there is nothing to aggregate,
so no barrier is needed.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-03 13:45:52 +02:00
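The condition this commit describes can be sketched as below. This is a minimal illustration, not the runtime's code: `needsBarrierAfterStagingCopy`, `isOutOfOrderQueue`, and `transferCount` are hypothetical names, and the sketch assumes the barrier exists only to aggregate several staging transfers on an out-of-order queue.

```cpp
#include <cstddef>

// Hypothetical helper: decides whether a barrier should follow a non-USM copy
// that was split into staging-buffer transfers.
// Assumption: an in-order queue never needs this barrier.
constexpr bool needsBarrierAfterStagingCopy(bool isOutOfOrderQueue, std::size_t transferCount) {
    // A single transfer leaves nothing to aggregate, so the barrier is skipped.
    return isOutOfOrderQueue && transferCount > 1;
}
```

With this shape, a copy that fits in one staging chunk pays no barrier cost, while multi-chunk copies on an out-of-order queue still get one.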
Compute-Runtime-Validation
b8246d806f
Revert "performance: change buffer type for new coherency model"
...
This reverts commit c979495265.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-03 02:11:19 +02:00
Morek, Szymon
aed96cede4
fix: flush barrier when profiling enabled
...
Related-To: NEO-10615
Signed-off-by: Morek, Szymon <szymon.morek@intel.com>
2024-07-02 13:25:07 +02:00
Szymon Morek
c979495265
performance: change buffer type for new coherency model
...
Related-To: NEO-11882
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-02 12:49:41 +02:00
Szymon Morek
53f9a48fce
fix: set start TS on cpu if TS packet write disabled
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-01 17:14:10 +02:00
Oskar Hubert Weber
9a486dd5a1
fix: Add missing fp64 extensions in caps initialization
...
In a scenario where fp64 emulation is both supported and enabled,
we want to include 2 additional extensions:
- cl_khr_fp64 to deviceExtensions
- __opencl_c_fp64 to openclCFeatures (OpenCL ver. 3.0)
Related-To: NEO-7611
Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2024-07-01 13:09:13 +02:00
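A minimal sketch of the behavior this commit message describes, under stated assumptions: `DeviceCapsSketch` and `addFp64EmulationCaps` are hypothetical names invented for illustration, and the real capability structures in the runtime differ. Only the two strings `cl_khr_fp64` and `__opencl_c_fp64` come from the commit message.

```cpp
#include <string>
#include <vector>

// Hypothetical, simplified capability container; not the runtime's real type.
struct DeviceCapsSketch {
    std::vector<std::string> deviceExtensions;
    std::vector<std::string> openclCFeatures;
};

// Adds the two fp64 entries named in the commit message, but only when fp64
// emulation is both supported and enabled.
inline void addFp64EmulationCaps(DeviceCapsSketch &caps, bool emulationSupported, bool emulationEnabled) {
    if (emulationSupported && emulationEnabled) {
        caps.deviceExtensions.push_back("cl_khr_fp64");
        caps.openclCFeatures.push_back("__opencl_c_fp64"); // OpenCL C 3.0 feature name
    }
}
```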
Dominik Dabek
76e8be5c39
fix(ocl): handle gl sharing displayable textures
...
Displayable textures always need dc flush.
Related-To: NEO-11694
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-06-27 13:43:20 +02:00
Kamil Kopryk
70e52ce4d8
refactor: add separate enablers for compute walker and gpgpu walker
...
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-27 10:27:48 +02:00
Szymon Morek
8ee92b840f
fix: don't set start timestamp on cpu for bcs
...
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-06-26 13:30:58 +02:00
Kamil Kopryk
fbc29bb43d
refactor: use indirectDataAlignment from gen cmds
...
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-25 21:02:36 +02:00
Michal Mrozek
a3c3b6533a
performance: improve pool handling
...
Related-To: NEO-11731
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-06-25 17:04:17 +02:00
Kamil Kopryk
2ec2f8c085
refactor: add heaplessStateInitEnabled bool to dispatch walker args
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-25 15:53:19 +02:00
Lukasz Jobczyk
1cba900ad9
refactor: Add dc flush mitigation infrastructure
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-25 14:53:24 +02:00
Artur Harasimiuk
bbe11205e3
build: add cmake flags to control platform support
...
Related-To: NEO-6683
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-06-25 09:08:03 +02:00
Szymon Morek
29e3eb512c
performance: non-usm copy through staging buffers
...
Related-To: NEO-11501
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-06-25 07:18:53 +02:00
Lukasz Jobczyk
fed90f5c8e
fix: Add infrastructure to force dc flush when mitigate dc
...
-force dc on next tag update after RT kernel
-force dc when release shared object
Related-To: NEO-10556
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-24 17:15:03 +02:00
Lukasz Jobczyk
773da10099
fix: Copy hostptr using BCS when mitigate dc flush
...
Related-To: NEO-10556
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-24 16:26:44 +02:00
Compute-Runtime-Validation
7136dfbd38
Revert "performance: improve pool handling"
...
This reverts commit 5f0b9efd2b.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-06-24 12:47:19 +02:00
Filip Hazubski
9513df52a6
refactor: Explicitly delete unused functions of BarrierCommand
...
Comply with the rule of five.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-06-24 11:18:17 +02:00
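For readers unfamiliar with the rule of five, the pattern this commit refers to might look like the sketch below. `BarrierCommandSketch` is a hypothetical stand-in, not the runtime's `BarrierCommand`, and the assumption that the deleted functions are the copy and move operations is an inference from the commit message, not something it states.

```cpp
#include <type_traits>

// Hypothetical stand-in for a command type that is never copied or moved.
class BarrierCommandSketch {
  public:
    BarrierCommandSketch() = default;
    ~BarrierCommandSketch() = default;

    // Rule of five: the destructor is declared, so the remaining four special
    // member functions are spelled out too. Here they are deleted because
    // (by assumption) nothing uses them.
    BarrierCommandSketch(const BarrierCommandSketch &) = delete;
    BarrierCommandSketch &operator=(const BarrierCommandSketch &) = delete;
    BarrierCommandSketch(BarrierCommandSketch &&) = delete;
    BarrierCommandSketch &operator=(BarrierCommandSketch &&) = delete;
};
```

Deleting the operations explicitly turns an accidental copy or move into a compile-time error instead of relying on implicitly generated behavior.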
Kamil Kopryk
0c019c9f2b
performance: don't check multiple times if gpgpu is initialized
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-24 10:12:46 +02:00
Filip Hazubski
2f6eaf149a
fix: Update SIP kernel initialization logic
...
Initialize SIP kernel when shared device is being initialized
instead of api-specific device.
Initialize debugger when shared device is being initialized
instead of during platform or driver initialization.
Add missing makeResident calls for SIP kernel in heapless paths.
Related-To: HSD-18038645398, HSD-18038819112
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-06-20 18:30:46 +02:00
Kamil Kopryk
78c7d8878d
refactor: change method to get indirect data address
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-20 14:24:51 +02:00
Lukasz Jobczyk
ce0047de39
fix: Use proper GA object to check shared handle in debug break
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-19 11:08:37 +02:00
Jaroslaw Warchulski
daec249430
refactor: remove not needed createGraphicsAllocationFromNTHandle method
...
Related-To: NEO-11498
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-06-18 14:25:00 +02:00
Dominik Dabek
b6d86d2648
refactor: tests for buffer pool
...
add support for future AIL
Related-To: NEO-11694
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-06-18 13:55:21 +02:00
Mrozek, Michal
5f0b9efd2b
performance: improve pool handling
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
Resolves: NEO-11731
2024-06-14 12:02:34 +02:00
Alicja Lukaszewicz
730fed3206
fix: add missing tracing to all functions in api.cpp
...
Related-To: NEO-11576
Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com>
2024-06-13 14:55:13 +02:00
Jaroslaw Warchulski
76a05c1cab
refactor: pass arrayIndex to Wddm::openNTHandle function
...
Related-To: NEO-11498
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-06-12 12:11:35 +02:00
Filip Hazubski
5ae2709e6e
fix: Remove allow-zebin and enable-zebin compile options
...
Zebin is enabled by default.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-06-10 14:08:40 +02:00
Bartosz Dunajski
da9c009b88
feature: assign unique interrupt to queue
...
Related-To: NEO-8179
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-07 10:06:31 +02:00
Jaroslaw Warchulski
0ac1be7669
refactor: pass arrayIndex to Wddm::openSharedHandle function
...
Related-To: NEO-11498
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-06-07 09:51:26 +02:00
Alicja Lukaszewicz
b4c31492d3
fix: add missing tracing to all functions in api.cpp
...
Related-To: NEO-11576
Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com>
2024-06-06 12:14:21 +02:00
Bartosz Dunajski
e88127dcc3
refactor: remove not used multi context per engine code
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-06 11:41:15 +02:00
Bartosz Dunajski
bc37e14e87
refactor: fallback paths for secondary context debug flags
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-05 12:02:16 +02:00
Bartosz Dunajski
88ed909e57
feature: assign secondary BCS engines at queue creation
...
Related-To: NEO-7824
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-03 09:41:43 +02:00
Mateusz Hoppe
8e79e7a174
refactor: fix gpu address formatting in logs
...
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-05-31 13:55:26 +02:00
Szymon Morek
5e92d530de
performance: Reuse GPU timestamps by default on Windows
...
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-28 12:25:30 +02:00
Szymon Morek
387ec34207
fix: correct timestamps if cpu diff below resolution
...
Related-To: NEO-10615
Currently, if the cpu diff is equal to zero, the gpu timestamp
is decreased relative to the submit timestamp.
This commit fixes that so the timestamps are aligned.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-28 11:12:37 +02:00
Compute-Runtime-Validation
0b2c9e92e7
Revert "performance: Reuse GPU timestamps by default on Windows"
...
This reverts commit bca3fecaa0.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-25 07:59:00 +02:00
Szymon Morek
bca3fecaa0
performance: Reuse GPU timestamps by default on Windows
...
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-24 20:11:45 +02:00
Bartosz Dunajski
0a74d718f2
refactor: allow using different engine types for secondary contexts
...
Related-To: NEO-7824
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-24 09:39:46 +02:00
Fabian Zwoliński
59829aeb11
fix(ocl): Force rebuild if binary contains bindless kernel
...
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-05-20 14:15:17 +02:00
Maciej Plewka
90df4b298b
fix: cache flush dependency for queue blocked
...
Related-to: NEO-9872, HSD-18038461954
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-20 11:55:24 +02:00
Szymon Morek
8989bf4078
fix: set timestamp diff correctly if lower than resolution
...
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-17 20:15:58 +02:00
Dominik Dabek
a236171f0d
performance(ocl): enable device usm alloc reuse
...
Enabling on MTL+
Limited to use max 2% of global device memory.
Related-To: NEO-6893, NEO-11463
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-17 13:32:45 +02:00
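The 2% cap mentioned in this commit could be expressed roughly as below. This is a sketch under assumptions: `maxReusableUsmBytes` and `fitsReuseBudget` are hypothetical names, the integer rounding is a choice made here, and the commit message does not say how the runtime actually tracks the budget.

```cpp
#include <cstdint>

// Hypothetical helper: upper bound on device USM memory kept for reuse,
// taken as 2% of global device memory (rounded down).
constexpr std::uint64_t maxReusableUsmBytes(std::uint64_t globalMemBytes) {
    return globalMemBytes / 50;
}

// Hypothetical helper: true if an allocation of allocBytes can be added to a
// reuse pool that already holds cachedBytes without exceeding the cap.
// Written to avoid unsigned overflow in cachedBytes + allocBytes.
constexpr bool fitsReuseBudget(std::uint64_t cachedBytes, std::uint64_t allocBytes, std::uint64_t globalMemBytes) {
    return allocBytes <= maxReusableUsmBytes(globalMemBytes) &&
           cachedBytes <= maxReusableUsmBytes(globalMemBytes) - allocBytes;
}
```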
Compute-Runtime-Validation
4c5d567b40
Revert "fix: remove not needed check in OCL CPU copy path 2"
...
This reverts commit 0a27aa9f7b.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-17 10:52:17 +02:00
Compute-Runtime-Validation
34f53d5d94
Revert "performance: Reuse GPU timestamps by default"
...
This reverts commit 7aceed58ca.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-16 21:52:10 +02:00
Bartosz Dunajski
0a27aa9f7b
fix: remove not needed check in OCL CPU copy path 2
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-16 12:38:39 +02:00
Szymon Morek
7aceed58ca
performance: Reuse GPU timestamps by default
...
Related-To: NEO-10615
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-15 17:51:42 +02:00