- new flag ExperimentalEnableSourceLevelDebugger that
allows communication with debugger library
Related-To: NEO-6514
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
Related-To: NEO-6510
Check which prelim version is being used.
Select proper IoctlHelper based on that version.
If no version found, switch to upstream instead.
Source of prelim headers:
https://github.com/intel-gpu/drm-uapi-helper
Signed-off-by: Szymon Morek <szymon.morek@intel.com>
Currently only supported scenario is one in order queue.
Instead of resolving dependencies via semaphores, do this with pipe controls.
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
Itroduces new parameter "-exclude_ir" to ocloc CLI.
This parameter can be used to reduce output binary size
when IR is not needed.
Related-To: NEO-6477
Signed-off-by: Patryk Wrobel <patryk.wrobel@intel.com>
If a kernel has ray tracing calls, we allocate and initialize
per-device RTDispatchGlobals if needed, and hand off pointer to
the same into a running kernel via an implicit parameter.
Related-To: NEO-5384
Signed-off-by: Jim Snow <jim.m.snow@intel.com>
- update crossthreaddata size according to argument offsets
when processing patchtoken binary when DATA PARAMETER STREAM SIZE
is lower than size required for arguments
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
Introduce the debug regkey OverrideMocsIndexForScratchSpace
to control MOCS index in surface state for scratch space
Related-To: NEO-6509
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
- Clear dependencies even if last engine changed
- Do no program semaphore waiting for blit when blit is submitted with gpgpu
- Track barrier timestamps to correctly synchronize blits in OOQ
Related-To: NEO-6444
Signed-off-by: Maciej Dziuban <maciej.dziuban@intel.com>
Fix problem with not enabled slices in mode SetCommandStreamReceiver=1
Related-To: NEO-6507
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
Introduce the regkey OverrideL1CacheControlInSurfaceStateForScratchSpace
to control cache policy in surface state for scratch space
Related-To: NEO-3227
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
Simplify logic around engine groups.
Remove no longer needed code.
Ensure correct device is used when verifying engine groups.
Related-To: NEO-6219
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-6075
After this change driver will fail clBuildProgram/zeModuleCreate api calls
whenever stateful access is discovered and device has shared system usm caps
enabled.This is required since in this case allocation greater than 4GB
will not work.
If user still wants to use stateful addressing mode,
-cl-opt-smaller-than-4GB-buffers-only / -ze-opt-smaller-than-4GB-buffers-only
build option should be passed as build option, but then user can not use
buffers greater than 4GB.
Crash on an attempt to read from tag address when ring is not started
Related-to: NEO-5869
Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>
Related-To: NEO-6448
- add new IGC compilation flag when bindless mode used
- fix SBA programming of BindlessSurfaceStateSize -
always set maximum surface state count
- fix residency of global DSH heap on gen9 - gen12lp
in bindless mode
- add L0 aub test with bindless kernel - disabled
- partial fixes in OCL aub tests
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>