Szymon Morek
701bcdaeab
performance: reorder checks for staging buffer
...
Related-To: NEO-16052
defer registerHostPtr call to the last check.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-09-08 16:48:05 +02:00
Szymon Morek
03357b6089
performance: adjust size threshold for staging on linux
...
Related-To: NEO-15973
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-09-05 18:09:49 +02:00
Michal Mrozek
6ef0af7e22
refactor: remove redundant code
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-08-25 16:14:42 +02:00
Fabian Zwoliński
a2f60af5c6
fix: change global Var/Const Buffer type to SharedPoolAllocation
...
This is prep work for the future implementation of pooling these allocations.
Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-08-19 17:29:34 +02:00
Jakub Nowacki
259271f59d
fix: disallow copy and move
...
Related-To: NEO-15630
Signed-off-by: Jakub Nowacki <jakub.nowacki@intel.com>
2025-08-14 12:19:18 +02:00
Chandio, Bibrak Qamar
6acfcb83b9
Revert "Two commits related to pStart feature"
...
Related-To: NEO-15156, GSD-9939
Revert "test: fix minor errors"
This reverts commit 712dc8221e.
Revert "feature: Support for pStart"
This reverts commit 47caeda487.
Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-08-13 07:10:40 +02:00
Naklicki, Mateusz
c36f6cadb0
feature: dump graph visualisation
...
Related-To: NEO-15377
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2025-08-11 19:24:36 +02:00
Michal Mrozek
9c3904583b
performance: improve container lookup
...
- check whether the incoming pointer is page-aligned; if so, do a non-offset-based lookup
- make allowOffset a compile-time parameter
- if the non-offset-based lookup fails, retry with offset allowed
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2025-08-09 10:01:38 +02:00
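The lookup order described in 9c3904583b can be sketched roughly as follows. This is a minimal illustration, not the NEO container: `AllocationMap`, `Allocation`, the 4 KiB page size, and the `std::map` storage are all assumptions made for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>

// Hypothetical allocation record; the real NEO types are not shown in this log.
struct Allocation {
    std::uintptr_t base;
    std::size_t size;
};

class AllocationMap {
  public:
    void insert(std::uintptr_t base, std::size_t size) {
        assert(size > 0);
        allocs[base] = Allocation{base, size};
    }

    // allowOffset is a compile-time parameter, so each instantiation contains
    // only one of the two code paths and no runtime branch on the flag.
    template <bool allowOffset>
    const Allocation *lookup(std::uintptr_t ptr) const {
        if constexpr (!allowOffset) {
            // Exact match on the base address only.
            auto it = allocs.find(ptr);
            return it != allocs.end() ? &it->second : nullptr;
        } else {
            // Find the last allocation starting at or before ptr, then check
            // that ptr falls inside its range.
            auto it = allocs.upper_bound(ptr);
            if (it == allocs.begin()) {
                return nullptr;
            }
            --it;
            return (ptr - it->second.base < it->second.size) ? &it->second : nullptr;
        }
    }

    const Allocation *get(std::uintptr_t ptr) const {
        // Page-aligned pointers are likely allocation bases: try exact lookup first.
        if (ptr % pageSize == 0) {
            if (const Allocation *hit = lookup<false>(ptr)) {
                return hit;
            }
        }
        // Unaligned pointer, or the exact lookup missed: allow an offset.
        return lookup<true>(ptr);
    }

  private:
    static constexpr std::uintptr_t pageSize = 4096; // assumed page size
    std::map<std::uintptr_t, Allocation> allocs;
};
```

In this sketch a page-aligned pointer into the middle of a multi-page allocation misses the exact lookup and is then resolved by the offset-based retry, which is the fallback the third bullet describes.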
Aleksander Czerwionka
6bc9ea3ddb
feature: move svmAllocsManager and stagingBufferManager to platform
...
Related-To: NEO-13247
Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2025-08-08 14:01:44 +02:00
Compute-Runtime-Validation
917032f2ac
Revert "feature: dump graph visualisation"
...
This reverts commit 86d270633c.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-08-07 10:12:20 +02:00
Naklicki, Mateusz
86d270633c
feature: dump graph visualisation
...
Related-To: NEO-15377
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>
2025-08-06 17:14:50 +02:00
Chandio, Bibrak Qamar
47caeda487
feature: Support for pStart
...
Related-To: NEO-15156, GSD-9939
Support for start address hint in zeVirtualMemReserve.
If it fails to find pStart, it falls back to the baseline
allocateWithCustomAlignment(...)
Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-08-06 12:40:29 +02:00
Compute-Runtime-Validation
c9456cda14
Revert "feature: Support for pStart"
...
This reverts commit a50c0dbecf.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-08-06 02:26:13 +02:00
Chandio, Bibrak Qamar
a50c0dbecf
feature: Support for pStart
...
Related-To: NEO-15156, GSD-9939
Support for start address hint in zeVirtualMemReserve.
If it fails to find pStart, it falls back to the baseline
allocateWithCustomAlignment(...)
Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>
2025-08-05 10:14:01 +02:00
Chodor, Jaroslaw
2d63adca99
refactor: Explicit move ctor/assign and dtor for Ext and Clonable
...
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-29 15:25:58 +02:00
Chodor, Jaroslaw
ad78ea7da4
refactor: Common ext pattern in KernelExecutionEnv
...
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-29 04:57:23 +02:00
Chodor, Jaroslaw
0c7ca82b93
refactor: Introducing common ext pattern
...
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-28 18:01:22 +02:00
Fabian Zwoliński
74d12b7f27
fix: enable TimestampPoolAllocator on hw mode
...
Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-07-28 14:58:11 +02:00
Chodor, Jaroslaw
e396b7e5b8
feature: Adding clonable/ext utilities
...
Related-To: NEO-14899
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-28 11:17:14 +02:00
Chodor, Jaroslaw
7e8cbf749b
refactor: Don't force presence of copy assign for StackVec::resize
...
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-24 19:19:01 +02:00
Chodor, Jaroslaw
fcc2038def
refactor: modernizing NEO::Range->std::span
...
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-07-17 01:27:11 +02:00
Jaroslaw Warchulski
2375ff89bb
refactor: cleanup headers
...
Related-To: NEO-5548
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-06-23 11:23:49 +02:00
Aravind Gopalakrishnan
8e45fb880f
fix: Add check if packet count is exceeded
...
Related-To: NEO-13003
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2025-06-16 20:24:29 +02:00
Compute-Runtime-Validation
4e0c765d72
Revert "performance: reorder checks for staging buffer"
...
This reverts commit e030086cf2.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-06-10 23:35:58 +02:00
Szymon Morek
6a7f6f1ad6
fix: add noexcept to move operators
...
Without it, the copy operator is selected even if a move operator is explicitly defined.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-06-09 10:32:23 +02:00
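One common place where the effect described in 6a7f6f1ad6 shows up is `std::vector` reallocation, which moves existing elements only when the move constructor is `noexcept` and otherwise copies them to keep its exception guarantee. The sketch below demonstrates that standard-library behavior; `Tracked`, `Counts`, and `growOnce` are hypothetical names, and the log does not say which container or type the commit actually touched.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical element type that counts how std::vector relocates it.
template <bool noexceptMove>
struct Tracked {
    static inline std::size_t copies = 0;
    static inline std::size_t moves = 0;
    Tracked() = default;
    Tracked(const Tracked &) { ++copies; }
    Tracked(Tracked &&) noexcept(noexceptMove) { ++moves; }
};

struct Counts {
    std::size_t copies;
    std::size_t moves;
    std::size_t relocated; // elements that existed before the reallocation
};

// Fills a vector to capacity, then appends one more element to force a
// reallocation, and reports how the existing elements were relocated.
template <bool noexceptMove>
Counts growOnce() {
    using T = Tracked<noexceptMove>;
    std::vector<T> v;
    v.emplace_back();
    while (v.size() < v.capacity()) {
        v.emplace_back();
    }
    assert(v.size() == v.capacity());
    T::copies = 0;
    T::moves = 0;
    const std::size_t relocated = v.size();
    v.emplace_back(); // exceeds capacity, so existing elements are relocated
    return Counts{T::copies, T::moves, relocated};
}
```

With `growOnce<true>()` every existing element is moved; with `growOnce<false>()` the same move constructor exists but the vector copies instead.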
Szymon Morek
e030086cf2
performance: reorder checks for staging buffer
...
defer registerHostPtr call to the last check.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-06-09 10:18:35 +02:00
Fabian Zwoliński
e2228201ce
fix: Avoid redundant padding in ISA allocations
...
- Add isaPaddingIncluded flag to AllocationProperties to control ISA padding
- Modify MemoryManager to skip automatic padding when the flag is set
- Update ISAPoolAllocator to manage padding explicitly and set the flag
- Use actual buffer size for HeapAllocator instead of requested size
- Add test covering different padding scenarios
This prevents padding from being added twice, which wasted memory when
ISAPoolAllocator managed its own padding on top of MemoryManager's
automatic padding.
Related-To: NEO-12287, HSD-18042276431
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-05-30 15:40:38 +02:00
Lukasz Jobczyk
0e3397547c
fix: Adjust waitpkg counter for non ulls light
...
Resolves: NEO-14866, HSD-18042153820
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-23 15:58:21 +02:00
Szymon Morek
a2c41e4749
fix: add mock for staging buffer size
...
No need to allocate 2MB in unit tests.
This commit changes this size to 4KB.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-22 08:36:01 +02:00
Daniel Bermond
e0362a7c39
build: add missing headers for GCC 15
...
For using fixed width integer types[1], the `<cstdint>`[2]
C++ header needs to be explicitly included with GCC 15 due
to changes[3] in libstdc++.
For details, see the documentation[4] about porting to GCC 15.
[1] https://en.cppreference.com/w/cpp/types/integer
[2] https://en.cppreference.com/w/cpp/header/cstdint
[3] https://gcc.gnu.org/git/?p=gcc.git;a=commit;h=3a817a4a5a6d94da9127af3be9f84
[4] https://gcc.gnu.org/gcc-15/porting_to.html#cxx
Signed-off-by: Daniel Bermond <dbermond@archlinux.org>
2025-05-19 15:31:32 +02:00
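The kind of change e0362a7c39 describes can be illustrated with a small header-style fragment that includes `<cstdint>` itself instead of relying on another standard header to pull it in. `alignUp` is a hypothetical helper used only for illustration; it is not taken from the NEO sources.

```cpp
#include <cassert>
#include <cstdint> // included explicitly; do not rely on transitive includes

// Hypothetical helper: rounds value up to a power-of-two alignment.
inline std::uint32_t alignUp(std::uint32_t value, std::uint32_t alignment) {
    assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
    return (value + alignment - 1) & ~(alignment - 1);
}
```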
Lukasz Jobczyk
5cd5bbafc5
refactor: Split waitpkg params for ulls light and default
...
Related-To: NEO-14866
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-09 12:40:36 +02:00
Andrzej Koska
41c0546c45
fix: reset timestamps when userEvent is terminated
...
Related-To: NEO-14448
Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2025-05-06 10:28:35 +02:00
Compute-Runtime-Validation
3fa9229483
Revert "performance: copy misaligned leftover in staging"
...
This reverts commit 13610d3473.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-05-06 02:12:48 +02:00
Szymon Morek
13610d3473
performance: copy misaligned leftover in staging
...
Related-To: NEO-14646
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-05 16:56:27 +02:00
Lukasz Jobczyk
97358acabe
fix: use mfence instead of sfence on discrete devices
...
Related-To: NEO-14642
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-25 13:32:25 +02:00
Lukasz Jobczyk
5d2b673ef6
fix: Adjust waitpkg threshold to 20 us
...
Resolves: HSD-13012958900
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-04-23 10:46:22 +02:00
Fabian Zwoliński
204577a91f
fix: disable TimestampPoolAllocator
...
Related-To: NEO-12287, HSD-13012945876
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-18 11:30:35 +02:00
Chodor, Jaroslaw
a855ce019e
refactor: pass underlying allocator by ref
...
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-17 10:43:21 +02:00
Chodor, Jaroslaw
7744251995
refactor: explicit copy/move operations in OpaqueElementAllocator
...
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-16 10:10:05 +02:00
Jaroslaw Warchulski
3e1aa33924
refactor: cleanup headers
...
Related-To: NEO-5548
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-04-14 14:59:40 +02:00
Marcel Skierkowski
b75fbe8e2c
refactor: mock filesystem in ocloc ult pt.1
...
Mocked IO operations in ./ocloc_tests application
Mocked gtest stdout capture in ocloc tests
Related-To: NEO-14084
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-08 16:08:42 +02:00
Szymon Morek
ff1dcc1a8a
performance: adjust copy from image to host logic
...
Related-To: NEO-14569
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-08 10:02:00 +02:00
Szymon Morek
95e0244f70
fix: properly pass info about 3D image
...
Related-To: NEO-14538
Copying a 2D region from a 3D image is valid.
The current mip map checks do not account for that.
This change correctly checks for a mip-mapped 3D image.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 16:33:57 +02:00
Szymon Morek
6ea83f322d
fix: do not overwrite user data beyond slice region
...
Related-To: NEO-14538
If the user passes a slice pitch that is larger than the region
to copy, do not overwrite memory that lies beyond the region but
within that slice pitch.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-03 12:37:35 +02:00
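The rule stated in 6ea83f322d can be sketched as a pitched copy that writes only the bytes of the requested region and leaves any row or slice padding in the user's buffer untouched. This is one plausible reading of the commit, not the NEO staging code: `Region`, `copyRegionToHost`, and the tightly packed source layout are assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>

// Hypothetical region description: width in bytes, height in rows, depth in slices.
struct Region {
    std::size_t widthBytes;
    std::size_t height;
    std::size_t depth;
};

// Copies a tightly packed source region into a pitched host destination.
// Only widthBytes per row and height rows per slice are written, so bytes
// that fall inside the row or slice pitch but outside the region are kept.
void copyRegionToHost(const unsigned char *src, unsigned char *dst, const Region &r,
                      std::size_t dstRowPitch, std::size_t dstSlicePitch) {
    assert(dstRowPitch >= r.widthBytes);
    assert(dstSlicePitch >= r.height * dstRowPitch);
    for (std::size_t z = 0; z < r.depth; ++z) {
        for (std::size_t y = 0; y < r.height; ++y) {
            const unsigned char *from = src + (z * r.height + y) * r.widthBytes;
            unsigned char *to = dst + z * dstSlicePitch + y * dstRowPitch;
            std::memcpy(to, from, r.widthBytes);
        }
    }
}
```

Copying a whole slice pitch per slice in a single `memcpy` would be simpler, but it would clobber whatever the user keeps in the gap between the end of the region and the start of the next slice.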
Fabian Zwoliński
7ef3880793
feature: implement pool allocator for gpuTimestampDeviceBuffer
...
The patch applies to Level Zero.
Only allocations < 2MB will be fetched from the pool.
Allocations are shared and reused within a given device.
Additionally, I added a new debug flag to control the allocator:
EnableTimestampPoolAllocator
Related-To: NEO-12287
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2025-04-02 14:28:56 +02:00
Szymon Morek
8836f6df0b
fix: forward mip map level for 3D images
...
Related-To: NEO-14539
1D and 2D images already have the mip map level set correctly.
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-02 11:46:41 +02:00
Szymon Morek
3010af596e
performance: add infrastructure for staging with 3D images
...
Related-To: NEO-14026
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-04-01 15:30:30 +02:00
Lukasz Jobczyk
60b551758c
performance: Adjust waitpkg threshold for discrete devices
...
Related-To: NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-26 14:59:19 +01:00
Bartosz Dunajski
8c21a9634f
refactor: add missing file to cmake list
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2025-03-24 15:19:03 +01:00
Lukasz Jobczyk
8fbde30432
fix: Adjust waitpkg threshold to 12 us
...
Resolves: NEO-14434, NEO-14435
Related-To: NEO-14336
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-03-24 11:43:54 +01:00