- do not mark interrupt as complete when thread was stopped
before handling ATT event
- if no newly stopped threads are reported in the ATT event, the interrupt
triggers a thread unavailable event
Related-To: NEO-7501
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
There is no need to keep the isTimestampWaitSupportedForEvents function
in both hwHelper and hwInfoConfig, so move it to hwInfoConfig.
When available, PRELIM_DRM_I915_QUERY_FABRIC_INFO is used to query
connectivity between two devices. Otherwise, a copy is performed.
Signed-off-by: Jaime A Arteaga Molina <jaime.a.arteaga.molina@intel.com>
OpenCL expects argument type names without additional information (type size).
- Use only the part up to the delimiter, without quotes
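A minimal sketch of the trimming described above. The helper name `extractArgTypeName` is hypothetical, and the delimiter is passed in as a parameter because this message does not name it:

```cpp
#include <cassert>
#include <string>
#include <string_view>

// Hypothetical helper, not the actual NEO function: returns the type name
// stored before `delimiter`, with surrounding double quotes removed.
std::string extractArgTypeName(std::string_view raw, char delimiter) {
    // Assumption: the delimiter is never the quote character itself.
    assert(delimiter != '"');
    // Keep only the part before the delimiter (the whole string if it is absent).
    std::string_view name = raw.substr(0, raw.find(delimiter));
    // Drop one leading and one trailing quote, if present.
    if (!name.empty() && name.front() == '"') {
        name.remove_prefix(1);
    }
    if (!name.empty() && name.back() == '"') {
        name.remove_suffix(1);
    }
    return std::string(name);
}
```

The size suffix and the quotes are discarded, so only the bare type name is left for the OpenCL query.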
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
In case of recompilation from IR, apart from updating the build log,
print a debug message with a recompilation warning.
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Add a common getHelper getter for hwInfoConfig and hwHelper
and use it as CoreHelper (HwHelper) and ProductHelper (HwInfoConfig)
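A common getter of this kind could be sketched as a member template selected by helper type. The owning class `HelperOwnerSketch` and the stand-in helper structs below are assumptions made for illustration; they are not the real interfaces:

```cpp
#include <cassert>
#include <memory>

// Stand-in types: the real CoreHelper/ProductHelper interfaces are not shown here.
struct CoreHelper { int id = 1; };
struct ProductHelper { int id = 2; };

// Hypothetical owner of both helpers.
class HelperOwnerSketch {
  public:
    // One getter for every helper kind; the template argument picks the helper.
    template <typename HelperT>
    HelperT &getHelper() const;

  private:
    std::unique_ptr<CoreHelper> coreHelper = std::make_unique<CoreHelper>();
    std::unique_ptr<ProductHelper> productHelper = std::make_unique<ProductHelper>();
};

template <>
inline CoreHelper &HelperOwnerSketch::getHelper<CoreHelper>() const {
    assert(coreHelper != nullptr);
    return *coreHelper;
}

template <>
inline ProductHelper &HelperOwnerSketch::getHelper<ProductHelper>() const {
    assert(productHelper != nullptr);
    return *productHelper;
}
```

A call site would then read `owner.getHelper<CoreHelper>()` or `owner.getHelper<ProductHelper>()`. Requesting any other type fails at link time, because only these two specializations are defined.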
Add missing allocation of kernel private memory for the scenario when
the private memory is not allocated within `KernelImp::initialize()` but
deferred until `appendLaunchKernelWithParams()` instead.
A single kernel can never allocate more private/scratch memory than
`globalMemorySize`; attempting to do so results in `ZE_RESULT_ERROR_OUT_OF_DEVICE_MEMORY`
being returned. However, several separate kernels together can exceed
`globalMemorySize`, and then the private region of each such kernel is
allocated at a later stage, in `appendLaunchKernelWithParams()`.
Such a mechanism was present on pre-xehp platforms and is now added to
xehp-and-later.
See:
* ModuleImp::checkIfPrivateMemoryPerDispatchIsNeeded()
* Module::shouldAllocatePrivateMemoryPerDispatch()
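The decision described above can be illustrated with a small sketch. It assumes the check compares the summed per-kernel private sizes against `globalMemorySize`. The function name and signature are hypothetical and do not reproduce the functions listed above:

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical sketch: returns true when the private memory of all kernels
// taken together would not fit in global memory, so each kernel's private
// region should be allocated per dispatch instead of at initialization.
bool privateMemoryPerDispatchNeeded(const std::vector<uint64_t> &perKernelPrivateSizes,
                                    uint64_t globalMemorySize) {
    uint64_t total = 0;
    for (uint64_t size : perKernelPrivateSizes) {
        // Simplification: a single oversized kernel is treated as a
        // precondition violation here; the driver reports an error instead.
        assert(size <= globalMemorySize);
        total += size;
        if (total > globalMemorySize) {
            return true; // the kernels together exceed global memory
        }
    }
    return false;
}
```

With a 1000-byte budget, two kernels of 600 bytes each would take the deferred path, while kernels of 100 and 200 bytes would not.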
Related-To: NEO-7398
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
Limit the number of times compare_exchange_weak is called
to avoid contention issues when multiple CPU cores request
the same address.
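One way to cap the retries is sketched below. The helper name, the attempt limit, and the early exit on a genuine mismatch are assumptions for illustration, not necessarily what this change implements:

```cpp
#include <atomic>
#include <cassert>
#include <cstdint>

// Hypothetical helper: tries to replace `expected` with `desired`, calling
// compare_exchange_weak at most `maxAttempts` times.
bool boundedCompareExchange(std::atomic<uint64_t> &target, uint64_t expected,
                            uint64_t desired, int maxAttempts) {
    assert(maxAttempts > 0);
    for (int attempt = 0; attempt < maxAttempts; ++attempt) {
        uint64_t observed = expected; // overwritten with the current value on failure
        if (target.compare_exchange_weak(observed, desired)) {
            return true;
        }
        if (observed != expected) {
            return false; // genuine mismatch, retrying cannot help
        }
        // Otherwise the failure was spurious; retry while attempts remain.
    }
    return false; // gave up after repeated spurious failures
}
```

The caller decides what to do when the helper returns false, for example backing off before trying again.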
Related-To: NEO-7030
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
- cases with null lws should fail only when the computed
lws sizes result in too many workgroups
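A rough sketch of such a check follows. It assumes the limit applies to the total workgroup count across three dimensions; the function name and the `maxGroupCount` parameter are hypothetical:

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical check: true when the workgroup count implied by the global
// sizes `gws` and the computed local sizes `lws` stays within `maxGroupCount`.
bool workgroupCountFits(const std::array<uint64_t, 3> &gws,
                        const std::array<uint64_t, 3> &lws,
                        uint64_t maxGroupCount) {
    uint64_t total = 1;
    for (std::size_t dim = 0; dim < 3; ++dim) {
        assert(lws[dim] != 0); // computed local sizes are expected to be non-zero
        // Round up so a partially filled group is still counted.
        const uint64_t groups = (gws[dim] + lws[dim] - 1) / lws[dim];
        if (groups == 0) {
            return true; // empty dispatch, nothing to launch
        }
        if (groups > maxGroupCount / total) {
            return false; // total * groups would exceed the limit
        }
        total *= groups;
    }
    return true;
}
```

The division-based comparison avoids overflowing the running product for very large global sizes.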
Related-To: NEO-6976
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
- Set initial value for ArgIndexT index in
KernelArgMiscInfoT struct
- Add check for arg index in extended metadata in order to avoid OOB
accesses
- Populate the previously missing ArgTypeTraits and refactor ULTs in order to
check it
- Add missing ULTs for Program::callPopulateZebinExtendedArgsMetadataOnce
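The first two items can be sketched as follows. The struct layout, the sentinel used as the initial value, and the helper `applyArgTypeName` are assumptions made for illustration; only the names `ArgIndexT` and `KernelArgMiscInfoT` come from this message:

```cpp
#include <cstdint>
#include <limits>
#include <string>
#include <vector>

using ArgIndexT = uint32_t; // assumption: the real underlying type may differ

// Assumed sentinel meaning "no index parsed yet".
constexpr ArgIndexT invalidArgIndex = std::numeric_limits<ArgIndexT>::max();

// Simplified stand-in for KernelArgMiscInfoT.
struct KernelArgMiscInfoSketch {
    ArgIndexT index = invalidArgIndex; // explicit initial value, never indeterminate
    std::string typeName;
};

// Hypothetical helper: copies the type name only when the index points at an
// existing argument, which avoids out-of-bounds writes.
bool applyArgTypeName(std::vector<std::string> &argTypeNames,
                      const KernelArgMiscInfoSketch &info) {
    if (info.index >= argTypeNames.size()) {
        return false; // unset or out-of-range index: skip this entry
    }
    argTypeNames[info.index] = info.typeName;
    return true;
}
```

Because the sentinel is larger than any realistic argument count, an entry whose index was never set is rejected by the same bounds check.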
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
This commit switches the device ID logic from the deprecated implementation
to the new one, so that if the user passes a hex value to the -device
parameter, ocloc will use the new implementation in the product config
helper.
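For illustration only, recognizing a hex value could look like the sketch below. It assumes a hex value is written with a `0x` prefix; the function name is hypothetical and this is not ocloc's actual parser:

```cpp
#include <cctype>
#include <cstddef>
#include <cstdint>
#include <string>

// Hypothetical helper: parses `arg` as a 0x-prefixed hex number that fits in
// 32 bits. Returns false for anything else, for example a product name.
bool parseHexDeviceId(const std::string &arg, uint32_t &outValue) {
    if (arg.size() < 3 || arg[0] != '0' || (arg[1] != 'x' && arg[1] != 'X')) {
        return false;
    }
    uint32_t value = 0;
    for (std::size_t i = 2; i < arg.size(); ++i) {
        const unsigned char c = static_cast<unsigned char>(arg[i]);
        if (!std::isxdigit(c)) {
            return false; // not a hex digit
        }
        if (value > (UINT32_MAX >> 4)) {
            return false; // one more digit would overflow 32 bits
        }
        const uint32_t digit = std::isdigit(c) ? c - '0' : std::tolower(c) - 'a' + 10;
        value = (value << 4) | digit;
    }
    outValue = value;
    return true;
}
```

A caller could route the argument to the hex path when this returns true and treat it as a name otherwise.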
Signed-off-by: Daria Hinz <daria.hinz@intel.com>
Related-To: NEO-7487
This change extends the logic of opening IPC
memory handles and fills the gap between this
logic and allocation of USM device buffers.
When HEAP_EXTENDED is available, it is
preferred.
Signed-off-by: Wrobel, Patryk <patryk.wrobel@intel.com>
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
- Add HwInfoConfig getter in RootDeviceEnvironment,
which temporarily takes HwInfoConfig from the global array
- Use HwInfoConfig from RootDeviceEnvironment to
call the ConfigureHardwareCustom function
- Add getHwInfoConfig in DeviceFixture
- Make the ConfigureHardwareCustom function and a few others const
- Small code cleanup