This patch adds FP64 emulation support for ATS-M.
Introducing new environment variable - NEO_FP64_EMULATION - which provides
an option to allow the opt-in emulation of FP64.
When emulation is enabled, we pass -cl-fp64-gen-emu (ocl) /
-ze-fp64-gen-emu (L0) as an internal option to IGC.
Related-To: NEO-7611
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
This commit adds support for handling local symbols.
* Added 2 fields to SymbolInfo - binding, and associated
instructions segment id.
* Simplified code for decoding elf symbols and relocations.
* Simplified code for patching instruction segments.
* Changed logic of decodeElfSymbolTableAndRelocations:
* Add every global symbol to symbol map.
* Add any local symbol used by relocation to symbol map.
* Changed logic of link:
* After performing relocations remove local symbols from map.
* Replaced UNRECOVERABLE_IF with returning error.
* Removed LocalSymbolInfo structure used before for local kernel jumps.
* Removed old tests.
Signed-off-by: Krystian Chmielewski <krystian.chmielewski@intel.com>
Confirm the allocations used in an appendMemoryCopy operation
belong to the same context as the list.
Related-To: LOCI-1996
Resolves: NEO-6162
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
Confirm the allocations used in an appendMemoryCopy operation
belong to the same context as the list.
Related-To: LOCI-1996
Resolves: NEO-6162
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
By default, cooperative dispatch is not supported
for platforms not supporting cooperative engine.
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
Use correct offset for subBuffer created with clCreateSubBuffer
Make calls to clGetMemObjectInfo for CL_MEM_OFFSET and
CL_MEM_ASSOCIATED_MEMOBJECT return values like with pool buffer disabled
Related-To: NEO-7332
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
Allow creating subBuffer from buffer from buffer pool allocator
by redirecting the call to the pool buffer and adjusting offset
Related-To: NEO-7332
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
Instead of just returning proper error code in case of exceeding
available Shared Local Memory size we also want to print error message
to make debugging easier.
Related-To: NEO-7280
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
clEnqueueSVMMemFill() is utilized by clEnqueueMemFillINTEL().
The latter may receive USM device pointers. If anyone passed
such pointer then SEGMENTATION_FAULT would occur.
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
Remove error for unused parameter
from clGetKernelMaxConcurrentWorkGroupCountINTEL
Resolves: NEO-7044
Signed-off-by: Neumann, Marta <marta.neumann@intel.com>
Given a following sequence of setting kernel arg svm pointers: A, nullptr, A
the kernel arg was not being set to A
Related-To: NEO-6895
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
Check allocId earlier and also reuse if allocationsCounter did not
change from last call.
Related-To: NEO-6737
Co-authored-by: Michal Mrozek <michal.mrozek@intel.com>
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
Forces extended buffer size by adding pageSize specify by number when
debug flag is >=1 in:
- clHostMemAllocINTEL
- clDeviceMemAllocINTEL
- clSharedMemAllocINTEL
Usage:
ForceExtendedUSMBufferSize=2
size += (2 * pageSize)
Signed-off-by: Krzysztof Gibala <krzysztof.gibala@intel.com>