Mateusz Jablonski
2ab4aeda85
fix: add static casts to destination type to avoid overflows in multiplication
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-06 14:55:24 +02:00
Mateusz Jablonski
83c306e927
fix: avoid memory leak
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-06 13:16:56 +02:00
Filip Hazubski
08e92d154f
fix: Add getDefaultDeviceHierarchy call to GfxCoreHelper
...
Added getDefaultDeviceHierarchy call that describes default device
hierarchy for a gfx core. Refactored L0 and OCL paths to use this
value by default and override this value when user sets
ZE_FLAT_DEVICE_HIERARCHY environment variable or
ReturnSubDevicesAsApiDevices debug key.
Updated ReturnSubDevicesAsApiDevices to force COMPOSITE device hierarchy
when set to 0.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com >
2023-10-06 12:32:41 +02:00
Mateusz Hoppe
9134a4e1bc
fix: call heapFree with gpu address for front window allocations
...
- when heapAllocate is used to get gpu address, heapFree should be
called on memory free
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-10-05 16:40:35 +02:00
Mateusz Jablonski
382fc952f2
refactor: add NonAssignableClass to define classes without assign operator
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-05 15:16:58 +02:00
Mateusz Jablonski
712ba60452
fix: add unrecoverable to avoid nullptr access
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-05 13:20:21 +02:00
Mateusz Jablonski
72182c28e9
refactor: add NonCopyableOrMovableClass to classes that don't need copy ctors
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-05 13:11:01 +02:00
Mateusz Jablonski
038c287656
fix: avoid double free when wddm initialization fails
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-05 13:03:49 +02:00
Mrozek, Michal
90e24a433d
Revert "performance: Change thread group dispatch size algorithm"
...
This reverts commit ac7cd9c4c5 .
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com >
2023-10-05 12:32:09 +02:00
Mateusz Hoppe
e676ac49bb
fix: front window allocations in device pool with correct gpu address
...
- use correct FRONT_WINDOW heap address when flag use32BitFrontWindow is
set while allocating from Device Pool
- applies to Windows OS
Related-To: NEO-4550, NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-10-05 12:19:20 +02:00
Mateusz Jablonski
6d259ac4b7
fix: add unrecoverable to avoid out of bound access
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-05 12:03:28 +02:00
Compute-Runtime-Validation
d5f90ae155
Revert "fix: query drm info to 8 byte aligned storage"
...
This reverts commit 9b344280d6 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-10-05 06:07:27 +02:00
John Falkowski
ffab8fd9cc
fix: add concurrent atomic access with KMD migration
...
Related-To: NEO-8219
Signed-off-by: John Falkowski <john.falkowski@intel.com >
2023-10-05 03:13:22 +02:00
Mateusz Jablonski
ad2701ad26
fix: add unrecoverable to avoid out of bound access
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-04 15:59:18 +02:00
Mateusz Jablonski
95eb5df978
refactor: use reference in for-range loops
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-04 15:32:26 +02:00
Zbigniew Zdanowicz
ec9fa23b2e
refactor: change order of fields of EncodeDispatchKernelArgs structure
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-10-04 15:00:29 +02:00
Mateusz Jablonski
fc508212de
refactor: pass big parameters as reference instead of by value
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-04 14:53:13 +02:00
Mateusz Jablonski
6b3ad5b3e3
fix: add unrecoverable in case of nullptr
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-04 14:38:05 +02:00
Mateusz Jablonski
747d44f8fc
fix: add unrecoverable in case of allocation type unknown
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-04 14:02:36 +02:00
Compute-Runtime-Validation
8042df8bb4
Revert "fix: support alignments in device and shared memory allocation on WSL2"
...
This reverts commit dd46bf1e90 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-10-04 02:25:25 +02:00
Lu, Wenbin
dd46bf1e90
fix: support alignments in device and shared memory allocation on WSL2
...
Related-To: LOCI-4334
Signed-off-by: Lu, Wenbin <wenbin.lu@intel.com >
2023-10-03 19:22:23 +02:00
Mrozek, Michal
ac7cd9c4c5
performance: Change thread group dispatch size algorithm
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com >
2023-10-03 19:04:22 +02:00
Mateusz Jablonski
895a586083
refactor: use reference in for-range loops
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-03 18:56:53 +02:00
Mateusz Jablonski
9b344280d6
fix: query drm info to 8 byte aligned storage
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-03 18:43:53 +02:00
Jitendra Sharma
8a01619310
refactor: Enable CSR heap sharing on Older Generation platforms
...
Related-To: LOCI-4312
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com >
2023-10-03 18:19:50 +02:00
Mateusz Jablonski
5e4ea627f7
refactor: correct printf formats
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-03 18:14:20 +02:00
Fabian Zwolinski
d7d577ba48
feature: enable cl_cache by default on Windows
...
Related-To: NEO-8092
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com >
2023-10-03 16:57:10 +02:00
Mateusz Jablonski
110164a52a
fix: remove invalid std::forward
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-03 16:41:41 +02:00
Fabian Zwolinski
1f1af5bb36
refactor: windows cl_cache eviction mechanism
...
Refactored eviction mechanism works as follows:
- eviction is needed only if
total size of cache binaries + size of the new binary exceed cache limit
- single evition call removes files with a summed size of 1/3 of the cache limit
- if new binary can not fit in the cache size limit
even after eviction, it will not be saved
- cache limit applies only to
files in cache directory with .cl_cache/.l0_cache extension.
Only these files are counted and only these files are removed
Related-To: NEO-8092
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com >
2023-10-03 16:20:28 +02:00
Compute-Runtime-Validation
8fa0b90f35
Revert "fix: query drm info to 8 byte aligned storage"
...
This reverts commit d0e615820c .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-10-03 15:41:11 +02:00
Jitendra Sharma
5a018385aa
feature: Add BCS engine as internal engine for dg1
...
Related-To: NEO-9033
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com >
2023-10-03 14:39:31 +02:00
Zbigniew Zdanowicz
0a99384936
fix: set flushed task count for all cases of post sync task count operations
...
- set monitor fence dispatch for all cases task count post sync operation
- stand alone flush task count will not happen when already flushed and so
monitor fence
- monitor fence then must be dispatched together with task count post sync
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-10-03 12:31:45 +02:00
Mateusz Jablonski
d0e615820c
fix: query drm info to 8 byte aligned storage
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-03 11:13:38 +02:00
Mateusz Jablonski
7531ced0e3
refactor: remove not needed error handling for make_unique
...
make_unique does not return empty object on failure
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-03 10:56:15 +02:00
Mateusz Jablonski
1963ea69e5
fix: add missing member initializations
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-03 08:41:46 +02:00
Naklicki, Mateusz
0461af492d
fix: unify path for getting number of grfs per thread
...
Related-To: NEO-8043
Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com >
2023-10-03 08:17:46 +02:00
Mateusz Jablonski
9337911742
fix: add self-assign check in operator=
...
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-10-03 08:09:16 +02:00
Fabian Zwolinski
7c80f49176
fix: refactor Windows cl_cache and add extra safety layers
...
What's changed:
- Add early return when open existing config
fails due to different error than `ERROR_FILE_NOT_FOUND`
- Swich `ReadFileEx` to `ReadFile`
- Add `SetFilePointer` to make sure we're reading
from the beginning of the file
- Pass `overlapped` to `WriteFile` to make sure
we're writing from the beginning of the file
Related-To: NEO-8092
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com >
2023-10-02 19:04:14 +02:00
Zbigniew Zdanowicz
d27d81f206
fix: dispatch monitor fence when stalling command dispatched
...
Resolves: NEO-9034
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-10-02 16:47:02 +02:00
Mateusz Hoppe
dc32e2b7da
refactor: extract ApiSpecificConfig methods to separate cpp file
...
- split ult config and ApiSpecificConfig used in ULTs
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-10-02 16:30:42 +02:00
John Falkowski
56f05303c9
feature: add support for zeMemGet/SetAtomicAccessAttributeExp
...
Resolves: NEO-8219
Signed-off-by: John Falkowski <john.falkowski@intel.com >
2023-10-02 15:59:17 +02:00
Compute-Runtime-Validation
1ac37d4a49
Revert "refactor: Enable CSR heap sharing on Older Generation platforms"
...
This reverts commit 58ff9c6d94 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-10-02 15:13:23 +02:00
Baj, Tomasz
cb903cbc03
refactor: Move disableRcs to releaseHelper
...
Related-To: NEO-8296
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com >
2023-09-29 14:19:02 +02:00
Hoppe, Mateusz
8da92f56c7
refactor: code cleanup
...
- remove not needed branch and message
Signed-off-by: Hoppe, Mateusz <mateusz.hoppe@intel.com >
2023-09-29 13:40:51 +02:00
Hoppe, Mateusz
5c565efe28
feature: bindless global heap with debugger
...
- program debugSurface's SurfaceState at the beginning of Bindless Surface
State Heap - SPECIAL_SSH
- ensure SPECIAL_SSH is resident
Related-To: NEO-7063
Signed-off-by: Hoppe, Mateusz <mateusz.hoppe@intel.com >
2023-09-29 13:13:46 +02:00
Mateusz Jablonski
ae90fd9e57
refactor: remove dead code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-09-29 12:23:22 +02:00
Mateusz Jablonski
a033df33ff
fix: remove preferSmallWorkgroupSizeForKernel method
...
Related-To: HSD-18033866078
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-09-29 11:55:09 +02:00
Jitendra Sharma
58ff9c6d94
refactor: Enable CSR heap sharing on Older Generation platforms
...
Related-To: LOCI-4312
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com >
2023-09-29 11:54:51 +02:00
Dominik Dabek
b2a8fa6e57
performance: allocate cmd buffer by umd on mtl
...
Default allocation by kmd is slower, this improves enqueue times.
Related-To: NEO-8152
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-09-29 11:30:56 +02:00
Mateusz Hoppe
0f2a04fef4
feature: linux support for global ssh heap
...
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-09-29 11:18:01 +02:00