5085 Commits

Author SHA1 Message Date
Lukasz Jobczyk
d6076941a8 refactor: Add dc flush mitigation infrastructure
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-12 14:45:51 +02:00
Fabian Zwoliński
359f4d5b56 feature: add SUPPORTED_DEVICES query to ocloc
New query option
ocloc query SUPPORTED_DEVICES
generates a YAML file containing
information about supported devices for:
- the current version of ocloc on Windows
- the current and previous versions of ocloc on Linux

Each ocloc build needs to set the
NEO_OCLOC_CURRENT_LIB_NAME
NEO_OCLOC_FORMER_LIB_NAME
cmake defines so that ocloc is able to
find the previous lib and query its supported devices.

Example of correct format:
NEO_OCLOC_FORMER_LIB_NAME=libocloc-1.0.so
NEO_OCLOC_CURRENT_LIB_NAME=libocloc-2.0.so

Related-To: NEO-9630
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-07-11 17:38:33 +02:00
Bartosz Dunajski
e188de2489 fix: initialize page tables before access for TSP allocation in TBX mode
Related-To: NEO-8340

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-11 16:03:19 +02:00
Lukasz Jobczyk
b0a5f2cced fix: Stop direct submission before signal GL event
Related-To: NEO-10556

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-11 13:45:42 +02:00
Dominik Dabek
bdeccab7aa fix: bcs enqueue after marker properly waits
For an example sequence of:
IOQ_1 -> enqueue copy, enqueue marker with waitlist (out event)
IOQ_2 -> enqueue marker with waitlist (event), enqueue copy

Add missing synchronization between the enqueue copies

Related-To: NEO-11694

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-07-11 10:36:18 +02:00
Michal Mrozek
05eb4e7a0d performance: add debug flag to disable l1 flush
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-11 10:09:46 +02:00
Mateusz Hoppe
52f5a9d6f0 refactor: remove SourceLevelDebugger from test and function name
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-10 16:32:26 +02:00
Szymon Morek
35cbbfe43a performance: Don't wait for taskCount for indirect allocs
Related-To: GSD-9385

In case of indirect allocations, we don't really know
their task count because we can't track their true usage
on GPU.
In case of non-blocking free, don't wait for latestSentTaskCount.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-10 15:51:04 +02:00
Mateusz Jablonski
cd983d63de test: correct test matchers in multi tile aub tests
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-10 10:54:20 +02:00
Szymon Morek
dbd96372be performance: adjust staging buffer usage
Related-To: NEO-11928

Don't copy through a staging buffer if the dst USM allocation
was not used before and the transfer would be split.
Also, don't use staging buffers for mapped OCL buffers.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-10 10:19:18 +02:00
Maciej Bielski
005b273b1e test: stop abusing ctors in DrmMockCustom and derived types
Calling virtual functions from within constructors is incorrect: virtual
dispatch does not reach derived overrides until the constructor completes.
So, instead of performing many operations in constructors, make them
small and use static `create()` methods to construct the object properly
and only then perform additional setup.

Mocks are a prominent example of how such practice degrades the code.
A function called from a constructor is basically unmockable. Despite
being overridden by a derived type (to alter the execution path as per
the needs of a given test case), the parent versions are always called.

This was making test development more and more painful, as at some point
no hack can compensate for the lack of the standard mechanism.

Related-To: NEO-9754
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-07-10 08:34:09 +02:00
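
The constructor issue described in 005b273b1e can be illustrated with a minimal C++ sketch. All type and function names below (EagerBase, EagerMock, LazyBase, LazyMock, queryName) are hypothetical and are not taken from DrmMockCustom. The sketch only shows why an override is skipped during construction and how a static `create()` sidesteps that.

```cpp
#include <cassert>
#include <memory>
#include <string>

// Hypothetical types for illustration only; not the real DrmMockCustom hierarchy.

// Problem: setup performed in the constructor calls a virtual function.
struct EagerBase {
    EagerBase() { name = queryName(); } // resolves to EagerBase::queryName, never an override
    virtual ~EagerBase() = default;
    virtual std::string queryName() { return "base"; }
    std::string name;
};

struct EagerMock : EagerBase {
    std::string queryName() override { return "mock"; } // ignored during construction
};

// Remedy: keep the constructor small and finish the setup in a static create().
struct LazyBase {
    virtual ~LazyBase() = default;
    virtual std::string queryName() { return "base"; }
    std::string name;

    template <typename T = LazyBase>
    static std::unique_ptr<T> create() {
        std::unique_ptr<T> obj(new T());
        obj->name = obj->queryName(); // object is complete, so the override is used
        return obj;
    }

  protected:
    LazyBase() = default;
};

struct LazyMock : LazyBase {
    std::string queryName() override { return "mock"; }
};

// Small self-check showing the difference between the two approaches.
inline void demoConstructorDispatch() {
    assert(EagerMock().name == "base");
    assert(LazyBase::create<LazyMock>()->name == "mock");
    assert(LazyBase::create()->name == "base");
}
```

With the constructor-based variant the mock cannot alter the setup path; with `create()` the derived override takes effect because setup runs after construction has finished.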
Michal Mrozek
4cabc9e4d2 performance: remove not needed code.
events are already created with queued state.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-07-10 07:35:05 +02:00
Mateusz Jablonski
5d09dd8273 test: optimize test excludes
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-09 19:30:12 +02:00
Bartosz Dunajski
90d1a210d8 refactor: remove not needed check
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-09 16:11:33 +02:00
Mateusz Jablonski
09a2b10c48 test: remove not needed test excludes
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-09 15:01:19 +02:00
Mateusz Jablonski
778645c11e feature: add initial support for Xe2 platforms
Related-To: NEO-8188, NEO-10774
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-09 10:30:27 +02:00
Szymon Morek
7d25965a78 performance: change buffer type for new coherency model
Related-To: NEO-11882

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-09 09:33:53 +02:00
Szymon Morek
457cb005de performance: iterate over indirect allocations once
Related-To: NEO-11921

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-09 09:31:52 +02:00
Mateusz Jablonski
ff917f69b3 test: correct including source file with excludes
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-08 15:23:50 +02:00
Mateusz Jablonski
812ba03fab test: remove not needed test excludes in aub tests
correct namespace of excluded tests

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-08 13:13:05 +02:00
Lukasz Jobczyk
bbeb5224d8 fix: Add IOH alignment to estimation
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-08 12:29:54 +02:00
Mateusz Jablonski
a14d5c52cb fix: Extend scope of OverrideSlmSize debug key - apply key during initialization
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-05 18:03:45 +02:00
Mateusz Jablonski
783ceec1c8 test: remove not needed test excludes
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-05 11:23:02 +02:00
Mateusz Jablonski
c207e3aadc test: remove not needed test excludes
correct namespace of excluded tests

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-05 10:27:14 +02:00
Mateusz Jablonski
4f4b8fed15 refactor: remove not needed code related to deprecated device ip version
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-05 09:54:48 +02:00
Mateusz Jablonski
aa3e2ccfe8 refactor: remove not needed test exclude
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-05 09:06:32 +02:00
Compute-Runtime-Validation
38872b7e1b Revert "refactor: Add dc flush mitigation infrastructure"
This reverts commit 1cba900ad9.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-04 08:20:18 +02:00
Lukasz Jobczyk
a96f2ea13a performance: disable blit enqueue on LNL
Resolves: NEO-11471

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-07-04 05:41:31 +02:00
Szymon Morek
daeb6e897a performance: limit barrier usage in non-usm copies
Related-To: NEO-11501

Don't emit a barrier if a non-USM copy through staging
buffers was issued on an OOQ with a single transfer.
With a single transfer there is nothing to aggregate,
so there is no need to send a barrier.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-03 13:45:52 +02:00
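
The barrier decision described in daeb6e897a might be sketched roughly as follows. The helper names (`transferCount`, `needsAggregatingBarrier`) are hypothetical and do not come from the driver. Returning `false` for in-order queues is an assumption made for this sketch, since the commit message only discusses the OOQ case.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helpers; names and the in-order-queue behaviour are assumptions.

// Number of staging-buffer chunks needed to copy `size` bytes.
inline std::size_t transferCount(std::size_t size, std::size_t chunkSize) {
    assert(chunkSize > 0);
    return (size + chunkSize - 1) / chunkSize; // round up
}

// A barrier only aggregates several transfers, so one transfer needs none.
inline bool needsAggregatingBarrier(bool outOfOrderQueue, std::size_t transfers) {
    return outOfOrderQueue && transfers > 1;
}

inline void demoBarrierDecision() {
    assert(transferCount(10, 4) == 3);
    assert(!needsAggregatingBarrier(true, transferCount(4, 4)));
    assert(needsAggregatingBarrier(true, transferCount(10, 4)));
}
```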
Mateusz Jablonski
f171a22c0a refactor: include reg_configs_common using full path
add missing register offset value

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-03 13:30:46 +02:00
Szymon Morek
3dd051c3ee performance: adjust compression handling
Related-To: NEO-11882

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-03 09:37:11 +02:00
Compute-Runtime-Validation
b8246d806f Revert "performance: change buffer type for new coherency model"
This reverts commit c979495265.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-03 02:11:19 +02:00
Morek, Szymon
aed96cede4 fix: flush barrier when profiling enabled
Related-To: NEO-10615

Signed-off-by: Morek, Szymon <szymon.morek@intel.com>
2024-07-02 13:25:07 +02:00
Szymon Morek
c979495265 performance: change buffer type for new coherency model
Related-To: NEO-11882

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-02 12:49:41 +02:00
Szymon Morek
53f9a48fce fix: set start TS on cpu if TS packet write disabled
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-01 17:14:10 +02:00
Oskar Hubert Weber
9a486dd5a1 fix: Add missing fp64 extensions in caps initialization
In a scenario where fp64 emulation is both supported and enabled,
we want to include 2 additional extensions:
- cl_khr_fp64 to deviceExtensions
- __opencl_c_fp64 to openclCFeatures (OpenCL ver. 3.0)

Related-To: NEO-7611

Signed-off-by: Oskar Hubert Weber <oskar.hubert.weber@intel.com>
2024-07-01 13:09:13 +02:00
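
The condition described in 9a486dd5a1 could look roughly like the sketch below. `DeviceCaps`, `addFp64EmulationCaps` and the space-separated extension string are hypothetical simplifications, not the driver's real structures. Only the two extension names and the "supported and enabled" condition come from the commit message. Gating the feature on a major version of at least 3 is an assumption based on the "OpenCL ver. 3.0" remark.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical, simplified capability record; not the driver's real type.
struct DeviceCaps {
    std::string deviceExtensions;             // space-separated extension names
    std::vector<std::string> openclCFeatures; // OpenCL C 3.0 feature names
};

// Adds the fp64 entries only when emulation is both supported and enabled.
inline void addFp64EmulationCaps(DeviceCaps &caps, bool supported, bool enabled,
                                 unsigned oclMajorVersion) {
    if (!(supported && enabled)) {
        return;
    }
    caps.deviceExtensions += "cl_khr_fp64 ";
    if (oclMajorVersion >= 3) { // assumption: the feature list exists from 3.0 on
        caps.openclCFeatures.push_back("__opencl_c_fp64");
    }
}

inline void demoFp64Caps() {
    DeviceCaps caps;
    addFp64EmulationCaps(caps, true, false, 3); // supported but not enabled
    assert(caps.deviceExtensions.empty());
    addFp64EmulationCaps(caps, true, true, 3);
    assert(caps.deviceExtensions == "cl_khr_fp64 ");
    assert(caps.openclCFeatures.size() == 1);
}
```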
Kamil Kopryk
c8c6115d1a test: use getInterfaceDescriptor if required
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-28 15:38:31 +02:00
Alicja Lukaszewicz
3db2bfc235 fix: generate SPIR-V for first device with -spv_only and multiple devices
If the family is provided as a device with the "-spv_only" flag, then
the SPIR-V file will be generated for the first device in the family.

Related-to: NEO-11550
Signed-off-by: Alicja Lukaszewicz <alicja.lukaszewicz@intel.com>
2024-06-28 09:30:26 +02:00
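
The selection rule described in 3db2bfc235 can be sketched as below. `FamilyTable`, `resolveTargets` and the placeholder names "family-a" and "dev-a1" are hypothetical and do not reflect ocloc's real data structures. The sketch only shows the stated behaviour: a family given together with `-spv_only` resolves to its first device.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// Hypothetical mapping from a family name to its devices, in listed order.
using FamilyTable = std::map<std::string, std::vector<std::string>>;

// Returns the devices to build for, given the device argument and the spv_only flag.
inline std::vector<std::string> resolveTargets(const FamilyTable &families,
                                               const std::string &deviceArg,
                                               bool spvOnly) {
    auto it = families.find(deviceArg);
    if (it == families.end()) {
        return {deviceArg}; // not a family: treat it as a single device name
    }
    if (spvOnly && !it->second.empty()) {
        return {it->second.front()}; // SPIR-V only: first device in the family
    }
    return it->second; // otherwise every device in the family
}

inline void demoResolveTargets() {
    FamilyTable families;
    families["family-a"] = {"dev-a1", "dev-a2"};
    auto spv = resolveTargets(families, "family-a", true);
    assert(spv.size() == 1 && spv.front() == "dev-a1");
    assert(resolveTargets(families, "family-a", false).size() == 2);
}
```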
Kamil Kopryk
940f23ddbf test: use typename from walker
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-27 14:33:40 +02:00
Dominik Dabek
76e8be5c39 fix(ocl): handle gl sharing displayable textures
Displayable textures always need dc flush.

Related-To: NEO-11694

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-06-27 13:43:20 +02:00
Kamil Kopryk
3d6470281f test: improve XeHPAndLaterDispatchWalkerBasicTest tests
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-27 11:14:17 +02:00
Kamil Kopryk
70e52ce4d8 refactor: add separate enablers for compute walker and gpgpu walker
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-27 10:27:48 +02:00
Artur Harasimiuk
4c3c89adb1 test: cleanup not needed cout
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2024-06-27 10:08:02 +02:00
Kamil Kopryk
0b00c5deba test: force some ults to use explicit COMPUTE_WALKER type 2/n
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-26 15:32:01 +02:00
Szymon Morek
8ee92b840f fix: don't set start timestamp on cpu for bcs
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-06-26 13:30:58 +02:00
Kamil Kopryk
e74c8c516a refactor: use getInitGpuWalker function in ults
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-26 09:31:12 +02:00
Kamil Kopryk
0f8d956592 refactor: use getKernelStartPointer return type
Related-To: NEO-10641

Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-26 02:36:20 +02:00
Kamil Kopryk
fbc29bb43d refactor: use indirectDataAlignment from gen cmds
Related-To: NEO-10641
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-25 21:02:36 +02:00
Michal Mrozek
a3c3b6533a performance: improve pool handling
Related-To: NEO-11731
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-06-25 17:04:17 +02:00
Kamil Kopryk
2ec2f8c085 refactor: add heaplessStateInitEnabled bool to dispatch walker args
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-25 15:53:19 +02:00