Commit Graph

339 Commits

Author SHA1 Message Date
Szymon Morek
701bcdaeab performance: reorder checks for staging buffer
Related-To: NEO-16052

defer registerHostPtr call to the last check.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-09-08 16:48:05 +02:00
Szymon Morek
03357b6089 performance: adjust size threshold for staging on linux
Related-To: NEO-15973

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-09-05 18:09:49 +02:00
Michal Mrozek
6ef0af7e22 refactor: remove redundant code
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-08-25 16:14:42 +02:00
Fabian Zwoliński
a2f60af5c6 fix: change global Var/Const Buffer type to SharedPoolAllocation
This is prep work for the future implementation of pooling these allocations.

Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-08-19 17:29:34 +02:00
Jakub Nowacki
259271f59d fix: disallow copy and move
Related-To: NEO-15630

Signed-off-by: Jakub Nowacki <jakub.nowacki@intel.com>
2025-08-14 12:19:18 +02:00
Chandio, Bibrak Qamar
6acfcb83b9 Revert "Two commits related to pStart feature"
Related-To: NEO-15156, GSD-9939

Revert "test: fix minor errors"

This reverts commit 712dc8221e.

Revert "feature: Support for pStart"

This reverts commit 47caeda487.

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-08-13 07:10:40 +02:00
Naklicki, Mateusz
c36f6cadb0 feature: dump graph visualisation
Related-To: NEO-15377
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2025-08-11 19:24:36 +02:00
Michal Mrozek
9c3904583b performance: improve container lookup
- check if incoming pointer is page aligned, if so do non offset based lookup
- make allowOffset compile time parameter
- in case non offset based lookup fails, re-try with offset allowed

Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-08-09 10:01:38 +02:00
Aleksander Czerwionka
6bc9ea3ddb feature: move svmAllocsManager and staggingBufferManager to platform
Related-To: NEO-13247

Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2025-08-08 14:01:44 +02:00
Compute-Runtime-Validation
917032f2ac Revert "feature: dump graph visualisation"
This reverts commit 86d270633c.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-08-07 10:12:20 +02:00
Naklicki, Mateusz
86d270633c feature: dump graph visualisation
Related-To: NEO-15377
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2025-08-06 17:14:50 +02:00
Chandio, Bibrak Qamar
47caeda487 feature: Support for pStart
Related-To: NEO-15156, GSD-9939

Support for start address hint in zeVirtualMemReserve.

If it fails to find pStart then it defaults to the base line
allocateWithCustomAlignment(...)

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-08-06 12:40:29 +02:00
Compute-Runtime-Validation
c9456cda14 Revert "feature: Support for pStart"
This reverts commit a50c0dbecf.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-08-06 02:26:13 +02:00
Chandio, Bibrak Qamar
a50c0dbecf feature: Support for pStart
Related-To: NEO-15156, GSD-9939

Support for start address hint in zeVirtualMemReserve.

If it fails to find pStart then it defaults to the base line
allocateWithCustomAlignment(...)

Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-08-05 10:14:01 +02:00
Chodor, Jaroslaw
2d63adca99 refactor: Explicit move ctor/asign and dtor for Ext and Clonable
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-29 15:25:58 +02:00
Chodor, Jaroslaw
ad78ea7da4 refactor: Common ext pattern in KernelExecutionEnv
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-29 04:57:23 +02:00
Chodor, Jaroslaw
0c7ca82b93 refactor: Introducing common ext pattern
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-28 18:01:22 +02:00
Fabian Zwoliński
74d12b7f27 fix: enable TimestampPoolAllocator on hw mode
Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-07-28 14:58:11 +02:00
Chodor, Jaroslaw
e396b7e5b8 feature: Adding clonable/ext utilities
Related-To: NEO-14899

Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-28 11:17:14 +02:00
Chodor, Jaroslaw
7e8cbf749b refactor: Don't force presence of copy assign for stacvec::resize
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-24 19:19:01 +02:00
Chodor, Jaroslaw
fcc2038def refactor: modernizing NEO::Range->std::span
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-17 01:27:11 +02:00
Jaroslaw Warchulski
2375ff89bb refactor: cleanup headers
Related-To: NEO-5548
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-06-23 11:23:49 +02:00
Aravind Gopalakrishnan
8e45fb880f fix: Add check if packet count is exceeded
Related-To: NEO-13003

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-06-16 20:24:29 +02:00
Compute-Runtime-Validation
4e0c765d72 Revert "performance: reorder checks for staging buffer"
This reverts commit e030086cf2.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-06-10 23:35:58 +02:00
Szymon Morek
6a7f6f1ad6 fix: add noexcept to move operators
Without it, copy operator is selected, even if move is explicitly defined.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-06-09 10:32:23 +02:00
Szymon Morek
e030086cf2 performance: reorder checks for staging buffer
defer registerHostPtr call to the last check.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-06-09 10:18:35 +02:00
Fabian Zwoliński
e2228201ce fix: Avoid redundant padding in ISA allocations
- Add isaPaddingIncluded flag to AllocationProperties to control ISA
padding
- Modify MemoryManager to skip automatic padding when flag is set
- Update ISAPoolAllocator to manage padding explicitly and set the
flag
- Use actual buffer size for HeapAllocator instead of requested size
- Add test covering different padding scenarios

This prevents double padding addition which was wasting memory when
ISAPoolAllocator managed its own padding on top of MemoryManager's
automatic padding.

Related-To: NEO-12287, HSD-18042276431
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-05-30 15:40:38 +02:00
Lukasz Jobczyk
0e3397547c fix: Adjust waitpkg counter for non ulls light
Resolves: NEO-14866, HSD-18042153820

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-23 15:58:21 +02:00
Szymon Morek
a2c41e4749 fix: add mock for staging buffer size
No need to allocate 2MB in unit tests.
This commit changes this size to 4KB.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-22 08:36:01 +02:00
Daniel Bermond
e0362a7c39 build: add missing headers for GCC 15
For using fixed width integer types[1], the `<cstdint>`[2]
C++ header needs to be explicitly included with GCC 15 due
to changes[3] in libstdc++.

For details, see the documentation[4] about porting to GCC 15.

[1] https://en.cppreference.com/w/cpp/types/integer
[2] https://en.cppreference.com/w/cpp/header/cstdint
[3] https://gcc.gnu.org/git/?p=gcc.git;a=commit;h=3a817a4a5a6d94da9127af3be9f84
[4] https://gcc.gnu.org/gcc-15/porting_to.html#cxx

Signed-off-by: Daniel Bermond <dbermond@archlinux.org>
2025-05-19 15:31:32 +02:00
Lukasz Jobczyk
5cd5bbafc5 refactor: Split waitpkg params for ulls light and default
Related-To: NEO-14866

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-09 12:40:36 +02:00
Andrzej Koska
41c0546c45 fix: reset timestamps when userEvent is terminated
Related-To: NEO-14448

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2025-05-06 10:28:35 +02:00
Compute-Runtime-Validation
3fa9229483 Revert "performance: copy misaligned leftover in staging"
This reverts commit 13610d3473.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-05-06 02:12:48 +02:00
Szymon Morek
13610d3473 performance: copy misaligned leftover in staging
Related-To: NEO-14646

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-05 16:56:27 +02:00
Lukasz Jobczyk
97358acabe fix: use mfence instead of sfence on discrete devices
Related-To: NEO-14642

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-25 13:32:25 +02:00
Lukasz Jobczyk
5d2b673ef6 fix: Adjust waitpkg threshold to 20 us
Resolves: HSD-13012958900

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-23 10:46:22 +02:00
Fabian Zwoliński
204577a91f fix: disable TimestampPoolAllocator
Related-To: NEO-12287, HSD-13012945876
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-18 11:30:35 +02:00
Chodor, Jaroslaw
a855ce019e refactor: pass underlying allocator by ref
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-17 10:43:21 +02:00
Chodor, Jaroslaw
7744251995 refactor: explicit copy/move operations in OpaqueElementAllocator
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-16 10:10:05 +02:00
Jaroslaw Warchulski
3e1aa33924 refactor: cleanup headers
Related-To: NEO-5548
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-14 14:59:40 +02:00
Marcel Skierkowski
b75fbe8e2c refactor: mock filesystem in ocloc ult pt.1
Mocked IO operations in ./ocloc_tests application

Mocked gtest stdout capture in ocloc tests

Related-To: NEO-14084
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-08 16:08:42 +02:00
Szymon Morek
ff1dcc1a8a performance: adust copy from image to host logic
Related-To: NEO-14569

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-08 10:02:00 +02:00
Szymon Morek
95e0244f70 fix: properly pass info about 3D image
Related-To: NEO-14538

It's valid for 3D image to copy 2D region.
Current checks for mip map do not consider that.
This change correctly checks for mip mapped 3D image.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 16:33:57 +02:00
Szymon Morek
6ea83f322d fix: do not override user data beyond slice region
Related-To: NEO-14538

If user passes slice pitch which is larger than region
to copy, do not override memory beyond region but within
that slice pitch.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 12:37:35 +02:00
Fabian Zwoliński
7ef3880793 feature: implement pool allocator for gpuTimestampDeviceBuffer
The patch applies to Level Zero.
Only allocations < 2MB will be fetched from the pool.
Allocations are shared and reused within a given device.

Additionally, I added a new debug flag to control the allocator:
EnableTimestampPoolAllocator

Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-02 14:28:56 +02:00
Szymon Morek
8836f6df0b fix: forward mip map level for 3D images
Related-To: NEO-14539

1D and 2D images have already mip map level set correctly.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-02 11:46:41 +02:00
Szymon Morek
3010af596e performance: add infrastructure for staging with 3D images
Related-To: NEO-14026

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-01 15:30:30 +02:00
Lukasz Jobczyk
60b551758c performance: Adjust waitpkg threshold for discrete devices
Related-To: NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-26 14:59:19 +01:00
Bartosz Dunajski
8c21a9634f refactor: add missing file to cmake list
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-24 15:19:03 +01:00
Lukasz Jobczyk
8fbde30432 fix: Adjust waitpkg threshold to 12 us
Resolves: NEO-14434, NEO-14435
Related-To: NEO-14336

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-24 11:43:54 +01:00