Compute-Runtime-Validation
e949ba7144
Revert "refactor: Add GDI profiling"
...
This reverts commit 8d56f8fb6b .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2024-01-23 06:13:02 +01:00
Mateusz Jablonski
87eb5f554a
refactor: create new members for storing spill and private memory in scratch
...
rename private scratch space into scratch space slot 1 as it can be generic
Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-01-22 19:48:48 +01:00
Dunajski, Bartosz
b77e1a6a71
feature: improve reporting max cooperative group count
...
Related-To: NEO-8210
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2024-01-22 14:56:15 +01:00
Lukasz Jobczyk
8d56f8fb6b
refactor: Add GDI profiling
...
Resolves: NEO-9236
Related-To: NEO-10036
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2024-01-22 14:24:08 +01:00
Dunajski, Bartosz
6f4ed10919
refactor: unify getMaxWorkGroupCount logic
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2024-01-22 11:16:45 +01:00
Dominik Dabek
7a09c51f3d
fix(ocl): querying memobj offset
...
Querying buffer offset via clGetMemObjectInfo should not modify its
value.
Related-To: NEO-9690
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-01-19 15:48:46 +01:00
Dunajski, Bartosz
b266f1f3cc
refactor: improve implicit scaling interface
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2024-01-19 14:09:04 +01:00
Dominik Dabek
e0ccf22557
fix: indirect access in external functions
...
Read indirect_stateless_count in module external functions.
If greater than 0, mark all kernels that have the has_stack_calls flag
set from this module as having indirect accesses.
Related-To: NEO-7712
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-01-18 15:29:41 +01:00
Kamil Kopryk
36dcfc0b42
feature: add stateless heapless builtins
...
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-01-16 11:11:31 +01:00
Zbigniew Zdanowicz
c49695fa6e
fix: flush L3 cache and wait for flush to complete when event is completed
...
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2024-01-16 07:05:35 +01:00
Dominik Dabek
b3b402cad2
feature(zebin): read indirect detection version
...
Read indirect detection version intel compat notes from zebin.
This is to prepare for enabling indirect access optimization in AOT
built kernels.
Related-To: NEO-7712
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-01-12 14:20:52 +01:00
Kamil Kopryk
f99edf72e1
refactor: Add common HardwareCommandsHelper enablers
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-01-12 09:39:12 +01:00
Mateusz Jablonski
a73fb4d2fe
fix: correct reporing kernel private size on L0
...
unify the logic across APIs
Related-To: NEO-9944
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2024-01-09 09:54:05 +01:00
Dominik Dabek
9b52d52062
performance(ocl): enable usm pool allocator
...
Enable on xe hpg and lpg platforms
Related-To: NEO-9700
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-01-05 16:17:48 +01:00
Dominik Dabek
af1620a308
fix(ocl): allocation info from pool svm ptr
...
Fix querying allocation info from pooled svm ptr.
Handle requested allocation alignment.
Refactor sorted vector usage.
Do not associate device with host pool allocation.
Related-To: NEO-9700
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2024-01-05 15:20:01 +01:00
Kamil Kopryk
2a46350ba3
fix: fix programming indirect data pointer in heapless mode
...
Indirect data pointer should be offsetted by used offset.
Related-To: NEO-7621
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2024-01-04 07:51:31 +01:00
Compute-Runtime-Validation
5535ef3049
Revert "performance(ocl): enable usm pool allocator"
...
This reverts commit 7bc8424a69 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-12-29 05:54:07 +01:00
Mateusz Hoppe
31e9b5e9fa
feature: add support for secondary contexts in group
...
Related-To: NEO-7824
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-12-28 13:31:08 +01:00
Dominik Dabek
7bc8424a69
performance(ocl): enable usm pool allocator
...
Enable opencl usm pool allocator by default
Related-To: NEO-9700
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-12-28 13:14:41 +01:00
Dominik Dabek
d238a68bae
fix(ocl): usm pool allocator correct size
...
Wrong debug flag was used for setting host allocation pool size
Related-To: NEO-9700
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-12-27 23:14:28 +01:00
Dunajski, Bartosz
df66a0276f
refactor: remove not used logic to check dynamic postsync layout
...
Related-To: NEO-8210
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-12-27 13:12:11 +01:00
Dominik Dabek
2fe3804cc2
performance(ocl): add usm allocation pooling flag
...
EnableDeviceUsmAllocationPool and EnableHostUsmAllocationPool for device
and host allocations respectively.
Pool size will be set to flag value * MB.
Allocation size threshold to be pooled is 1MB.
Pools are created per context.
Related-To: NEO-9700
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-12-27 11:42:01 +01:00
Dominik Dabek
239831f097
fix: return error for 0 size usm allocations
...
according to both level zero and opencl specs, usm allocations with
size=0 should return invalid/unsupported buffer size errors
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-12-22 17:26:39 +01:00
Kacper Nowak
18fb432552
refactor: Stop using hard-coded BCS engine as default one
...
Do not rely on presence of BCS0 engine.
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com >
2023-12-21 17:34:57 +01:00
Mateusz Jablonski
a4888b39c6
build: add clang-tidy restriction for Enum case
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-21 08:58:51 +01:00
Mateusz Jablonski
bf60d77e10
refactor: correct naming of enum class constants n/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-20 10:56:37 +01:00
Mateusz Jablonski
138fb65401
refactor: correct naming of enum class constants 11/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-19 14:52:57 +01:00
Mateusz Jablonski
de93bc6928
refactor: correct naming of enum class constants 10/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-19 11:30:39 +01:00
Mateusz Jablonski
a6458433dc
refactor: correct naming of enum class constants 9/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-19 10:36:25 +01:00
Mateusz Jablonski
dd1b9d6abc
refactor: correct naming of enum class constants 8/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-19 08:18:18 +01:00
Chodor, Jaroslaw
7e795cd3c1
feature: Adding support for OCL C support queries to ocloc
...
Feature needed for automated OCL C compilation with ocloc as backend.
Added queries :
* CL_DEVICE_EXTENSIONS
* CL_DEVICE_EXTENSIONS_WITH_VERSION
* CL_DEVICE_PROFILE
* CL_DEVICE_OPENCL_C_ALL_VERSIONS
* CL_DEVICE_OPENCL_C_FEATURES
Sample command line:
ocloc query -device skl CL_DEVICE_OPENCL_C_FEATURES
Related-To: GSD-7420
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com >
2023-12-18 15:19:16 +01:00
Dunajski, Bartosz
d99104d5bf
refactor: improve ImplicitArg struct handling
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-12-18 15:19:00 +01:00
Mateusz Jablonski
fca28a930f
refactor: correct naming of enum class constants 7/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-18 13:05:56 +01:00
Ben Ashbaugh
976c6d3acc
feature: initial register count query implementation
...
Adds a per-kernel and per-device query to determine the
number of GRF registers that a kernel was compiled for.
This is an informal query for now, but may be added to
a formally supported extension in the future.
Related-To: NEO-9807
Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com >
2023-12-18 11:45:01 +01:00
Mateusz Jablonski
27fbdde4c5
refactor: correct naming of unified memory enums
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-13 15:58:21 +01:00
Mateusz Jablonski
739d181026
refactor: correct naming of enum class constants 6/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-13 14:48:52 +01:00
Mateusz Jablonski
8c90350f3c
refactor: correct naming of enum class constants 5/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-13 10:06:02 +01:00
Mateusz Jablonski
432142c574
refactor: correct naming of enum class constants 4/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-13 08:08:51 +01:00
Mateusz Jablonski
895519db38
refactor: correct naming of NEOImageType enum values
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-12 11:15:28 +01:00
Mateusz Jablonski
b182917d9d
refactor: correct naming of allocation types
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-11 16:23:37 +01:00
Mateusz Jablonski
beafea9b39
refactor: correct naming of enum class constants 2/n
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-12-11 13:13:35 +01:00
Dunajski, Bartosz
58323bfb12
refactor: improve PostSync helper methods
...
Related-To: NEO-8210
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-12-11 10:41:30 +01:00
Compute-Runtime-Validation
a2994e9b29
Revert "performance(ocl): set pool allocator threshold 1MB"
...
This reverts commit fc1d93af8e .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-12-09 07:02:42 +01:00
Milczarek, Slawomir
bec9874487
fix: Improved event profiling setup for bcs split enqueue
...
Fixes OpenCL Khronos failures in test profiling with bcs split
Related-To: NEO-8927
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com >
2023-12-07 21:29:55 +01:00
Dunajski, Bartosz
aab3e7f7c5
refactor: improve getting postsync_data struct type
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-12-07 15:41:44 +01:00
Kamil Kopryk
7a14db7f9b
refactor: add max front end threads in EncodeWalkerArgs
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
2023-12-07 11:38:46 +01:00
Lukasz Jobczyk
c8c3f862f4
refactor: Add key to force zero copy without coherency
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-12-07 07:56:54 +01:00
Katarzyna Cencelewska
39ccf0297f
fix: Skip rebuild binary forced by flag when lack of ir
...
when flag RebuildPrecompiledKernels set and call clCreateProgramWithBinary
skip rebuild if ir binary not available
Resolves: HSD-18035219734
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com >
2023-12-06 20:00:05 +01:00
Dominik Dabek
fc1d93af8e
performance(ocl): set pool allocator threshold 1MB
...
Increase pool allocator threshold to 1MB
Remove stack allocations based on threshold in tests.
Related-To: NEO-9690
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com >
2023-12-06 19:55:48 +01:00
Dunajski, Bartosz
9ce3713ace
refactor: improve Walker args handling
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-12-06 16:10:26 +01:00