2362 Commits

Author SHA1 Message Date
Lukasz Jobczyk
a96f2ea13a performance: disable blit enqueue on LNL
Resolves: NEO-11471

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-04 05:41:31 +02:00
Szymon Morek
daeb6e897a performance: limit barrier usage in non-usm copies
Related-To: NEO-11501

Don't emit a barrier when a non-usm copy through staging
buffers with OOQ issued only a single transfer.
With a single transfer there is nothing to aggregate,
so the barrier is not needed.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-03 13:45:52 +02:00
Compute-Runtime-Validation
b8246d806f Revert "performance: change buffer type for new coherency model"
This reverts commit c979495265.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-03 02:11:19 +02:00
Morek, Szymon
aed96cede4 fix: flush barrier when profiling enabled
Related-To: NEO-10615

Signed-off-by: Morek, Szymon <szymon.morek@intel.com>
2024-07-02 13:25:07 +02:00
Szymon Morek
c979495265 performance: change buffer type for new coherency model
Related-To: NEO-11882

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-02 12:49:41 +02:00
Szymon Morek
53f9a48fce fix: set start TS on cpu if TS packet write disabled
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-01 17:14:10 +02:00
Oskar Hubert Weber
9a486dd5a1 fix: Add missing fp64 extensions in caps initialization
In a scenario where fp64 emulation is both supported and enabled,
we want to include 2 additional extensions:
- cl_khr_fp64 to deviceExtensions
- __opencl_c_fp64 to openclCFeatures (OpenCL ver. 3.0)

Related-To: NEO-7611

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2024-07-01 13:09:13 +02:00
Dominik Dabek
76e8be5c39 fix(ocl): handle gl sharing displayable textures
Displayable textures always need dc flush.

Related-To: NEO-11694

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-06-27 13:43:20 +02:00
Kamil Kopryk
70e52ce4d8 refactor: add separate enablers for compute walker and gpgpu walker
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-27 10:27:48 +02:00
Szymon Morek
8ee92b840f fix: don't set start timestamp on cpu for bcs
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-06-26 13:30:58 +02:00
Kamil Kopryk
fbc29bb43d refactor: use indirectDataAlignment from gen cmds
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-25 21:02:36 +02:00
Michal Mrozek
a3c3b6533a performance: improve pool handling
Related-To: NEO-11731
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-06-25 17:04:17 +02:00
Kamil Kopryk
2ec2f8c085 refactor: add heaplessStateInitEnabled bool to dispatch walker args
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-25 15:53:19 +02:00
Lukasz Jobczyk
1cba900ad9 refactor: Add dc flush mitigation infrastructure
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-25 14:53:24 +02:00
Artur Harasimiuk
bbe11205e3 build: add cmake flags to control platform support
Related-To: NEO-6683
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-06-25 09:08:03 +02:00
Szymon Morek
29e3eb512c performance: non-usm copy through staging buffers
Related-To: NEO-11501

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-06-25 07:18:53 +02:00
Lukasz Jobczyk
fed90f5c8e fix: Add infrastructure to force dc flush when mitigate dc
- force dc flush on the next tag update after an RT kernel
- force dc flush when releasing a shared object

Related-To: NEO-10556

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-24 17:15:03 +02:00
Lukasz Jobczyk
773da10099 fix: Copy hostptr using BCS when mitigate dc flush
Related-To: NEO-10556

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-24 16:26:44 +02:00
Compute-Runtime-Validation
7136dfbd38 Revert "performance: improve pool handling"
This reverts commit 5f0b9efd2b.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-06-24 12:47:19 +02:00
Filip Hazubski
9513df52a6 refactor: Explicitly delete unused functions of BarrierCommand
Comply with the rule of five.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-06-24 11:18:17 +02:00
Kamil Kopryk
0c019c9f2b performance: don't check multiple times if gpgpu is initialized
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-24 10:12:46 +02:00
Filip Hazubski
2f6eaf149a fix: Update SIP kernel initialization logic
Initialize SIP kernel when shared device is being initialized
instead of api-specific device.

Initialize debugger when shared device is being initialized
instead of during platform or driver initialization.

Add missing makeResident calls for SIP kernel in heapless paths.

Related-To: HSD-18038645398, HSD-18038819112

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-06-20 18:30:46 +02:00
Kamil Kopryk
78c7d8878d refactor: change method to get indirect data address
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-20 14:24:51 +02:00
Lukasz Jobczyk
ce0047de39 fix: Use proper GA object to check shared handle in debug break
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-06-19 11:08:37 +02:00
Jaroslaw Warchulski
daec249430 refactor: remove not needed createGraphicsAllocationFromNTHandle method
Related-To: NEO-11498
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-06-18 14:25:00 +02:00
Dominik Dabek
b6d86d2648 refactor: tests for buffer pool
add support for future AIL

Related-To: NEO-11694

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-06-18 13:55:21 +02:00
Mrozek, Michal
5f0b9efd2b performance: improve pool handling
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
Resolves: NEO-11731
2024-06-14 12:02:34 +02:00
Alicja Lukaszewicz
730fed3206 fix: add missing tracing to all functions in api.cpp
Related-To: NEO-11576

Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com>
2024-06-13 14:55:13 +02:00
Jaroslaw Warchulski
76a05c1cab refactor: pass arrayIndex to Wddm::openNTHandle function
Related-To: NEO-11498
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-06-12 12:11:35 +02:00
Filip Hazubski
5ae2709e6e fix: Remove allow-zebin and enable-zebin compile options
Zebin is enabled by default.

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-06-10 14:08:40 +02:00
Bartosz Dunajski
da9c009b88 feature: assign unique interrupt to queue
Related-To: NEO-8179

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-07 10:06:31 +02:00
Jaroslaw Warchulski
0ac1be7669 refactor: pass arrayIndex to Wddm::openSharedHandle function
Related-To: NEO-11498
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2024-06-07 09:51:26 +02:00
Alicja Lukaszewicz
b4c31492d3 fix: add missing tracing to all functions in api.cpp
Related-To: NEO-11576

Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com>
2024-06-06 12:14:21 +02:00
Bartosz Dunajski
e88127dcc3 refactor: remove not used multi context per engine code
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-06 11:41:15 +02:00
Bartosz Dunajski
bc37e14e87 refactor: fallback paths for secondary context debug flags
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-05 12:02:16 +02:00
Bartosz Dunajski
88ed909e57 feature: assign secondary BCS engines at queue creation
Related-To: NEO-7824

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-03 09:41:43 +02:00
Mateusz Hoppe
8e79e7a174 refactor: fix gpu address formatting in logs
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-05-31 13:55:26 +02:00
Szymon Morek
5e92d530de performance: Reuse GPU timestamps by default on Windows
Related-To: NEO-10615

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-28 12:25:30 +02:00
Szymon Morek
387ec34207 fix: correct timestamps if cpu diff below resolution
Related-To: NEO-10615

Currently, if the cpu diff equals zero, the gpu timestamp
is decreased relative to the submit timestamp.
This commit fixes that so the timestamps are aligned.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-28 11:12:37 +02:00
Compute-Runtime-Validation
0b2c9e92e7 Revert "performance: Reuse GPU timestamps by default on Windows"
This reverts commit bca3fecaa0.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-25 07:59:00 +02:00
Szymon Morek
bca3fecaa0 performance: Reuse GPU timestamps by default on Windows
Related-To: NEO-10615

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-24 20:11:45 +02:00
Bartosz Dunajski
0a74d718f2 refactor: allow using different engine types for secondary contexts
Related-To: NEO-7824

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-24 09:39:46 +02:00
Fabian Zwoliński
59829aeb11 fix(ocl): Force rebuild if binary contains bindless kernel
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-05-20 14:15:17 +02:00
Maciej Plewka
90df4b298b fix: cache flush dependency for queue blocked
Related-to: NEO-9872, HSD-18038461954
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-20 11:55:24 +02:00
Szymon Morek
8989bf4078 fix: set timestamp diff correctly if lower than resolution
Related-To: NEO-10615

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-17 20:15:58 +02:00
Dominik Dabek
a236171f0d performance(ocl): enable device usm alloc reuse
Enabled on MTL+.
Limited to at most 2% of global device memory.

Related-To: NEO-6893, NEO-11463

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-17 13:32:45 +02:00
Compute-Runtime-Validation
4c5d567b40 Revert "fix: remove not needed check in OCL CPU copy path 2"
This reverts commit 0a27aa9f7b.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-17 10:52:17 +02:00
Compute-Runtime-Validation
34f53d5d94 Revert "performance: Reuse GPU timestamps by default"
This reverts commit 7aceed58ca.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-16 21:52:10 +02:00
Bartosz Dunajski
0a27aa9f7b fix: remove not needed check in OCL CPU copy path 2
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-16 12:38:39 +02:00
Szymon Morek
7aceed58ca performance: Reuse GPU timestamps by default
Related-To: NEO-10615

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-05-15 17:51:42 +02:00