compute-runtime

Commit Graph

Author	SHA1	Message	Date
Wang, Elaine	c4728b312c	feature: Add ARGB support for openCL VA surface Support VA surface format VA_FOURCC_ARGB Related-To: HSD-22018705184 Signed-off-by: Wang, Elaine <elaine.wang@intel.com>	2023-08-24 08:37:28 +02:00
Dominik Dabek	5c5c718af3	performance: detect indirect access in kernel, PVC Enabling on pvc after patch in igc. Enabling only for JIT kernels because AOT could have been compiled with IGC older than required. Related-To: NEO-7712 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-08-24 02:15:11 +02:00
John Falkowski	d49190f4ae	feature: Add debug/release variables prefixes Add debug/release variables with prefixes for Level Zero, OpenCL and NEO Resolves: NEO-6357 Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-08-22 15:15:45 +02:00
Maciej Plewka	2e249d819e	fix: submit marker for cpu transfer with event on IOQ queue Related-To: NEO-8081 Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>	2023-08-21 15:07:37 +02:00
Artur Harasimiuk	f6e0c0cf89	Revert "feature: Add debug/release variable prefixes" This reverts commit `ec95d9314a`. Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>	2023-08-18 12:42:39 +02:00
Dunajski, Bartosz	59233d9597	performance: skip queue state check when waiting for latest IOQ TSP Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-08-18 08:17:11 +02:00
Andrzej Koska	0a3b135673	fix: Taking into account variable ReturnSubDevicesAsApiDevices Taking into account variable ReturnSubDevicesAsApiDevices during Retain and Release Device Related-To: NEO-8161 Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>	2023-08-16 16:14:42 +02:00
Dunajski, Bartosz	3cf1f5c462	fix: unregister CSR client on OCL sync points Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-08-16 13:35:07 +02:00
Dunajski, Bartosz	a42e88d55b	fix: stall RelaxedOrdering scheduler when programming OCL dependencies Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-08-14 10:47:35 +02:00
Fabian Zwolinski	85c30457b9	fix: restore createDebugZebin Related-To: NEO-7213 Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2023-08-11 14:57:32 +02:00
John Falkowski	ec95d9314a	feature: Add debug/release variable prefixes Resolves: NEO-6357 Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-08-10 14:01:09 +02:00
Fabian Zwolinski	6fca8ee195	refactor: Remove SourceLevelDebugger Removed: - SourceLevelDebugger (with tests) - DebuggerLibrary - DebuggerLibraryRestore - debuggerSupported field from hwInfo.capabilityTable - HasSourceLevelDebuggerSupport matcher - ExperimentalEnableSourceLevelDebugger debug var - EnableMockSourceLevelDebugger debug var - DebuggerOptDisable debug var - lib_names.h.in file - third_party/source_level_debugger/igfx_debug_interchange_types.h Related-To: NEO-7213 Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2023-08-10 11:14:02 +02:00
Kacper Nowak	1afaf37f78	refactor: Unify GTPin initialization logic between APIs Add support for new GTPin loading logic in OCL path, similar to existing in L0 - invoking exposed, dedicated API call (OpenGTPinOCL). - Move logic to shared, including unit tests - Check whether instrumentation is required on context creation and if yes, make a call to OpenGTPinOCL function. Handle potential errors gracefully without exiting. Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>	2023-08-09 17:15:24 +02:00
Compute-Runtime-Validation	65df34bbc7	Revert "refactor: Unify GTPin initialization logic between APIs" This reverts commit `68a5108e05`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-08-07 04:31:52 +02:00
Kacper Nowak	68a5108e05	refactor: Unify GTPin initialization logic between APIs Add support for new GTPin loading logic in OCL path, similar to existing in L0 - invoking exposed, dedicated API call (OpenGTPinOCL). - Move logic to shared, including unit tests - Check whether instrumentation is required on clGetPlatformIDs API call, and if yes, make a call to OpenGTPinOCL function. Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>	2023-08-05 18:22:08 +02:00
Lukasz Jobczyk	e89e20c013	fix: Disable timestamp wait for events on WSL Related-To: NEO-8067 Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2023-08-04 09:23:30 +02:00
Compute-Runtime-Validation	f4d05a8e6b	Revert "fix: Add missing fp64 extensions in caps initialization" This reverts commit `22a719d62f`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-07-31 13:43:23 +02:00
Fabian Zwolinski	22a719d62f	fix: Add missing fp64 extensions in caps initialization In a scenario where fp64 emulation is both supported and enabled, we want to include 2 additional extensions: - cl_khr_fp64 to deviceExtensions - __opencl_c_fp64 to openclCFeatures (OpenCL ver. 3.0) Related-To: NEO-7611 Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2023-07-28 21:29:30 +02:00
Filip Hazubski	8dd23f4b4d	feature: Add logic around cpu side allocations Group allocation types related to cpu side allocations in function to query gmm usage type. These types will have caching enabled even if CPU caching is not preferred by GPU. Add logic to query whether the cpu access is allowed for an allocation (in cases when it is not preffered by GPU). Related-To: NEO-7194 Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>	2023-07-28 21:04:24 +02:00
Mateusz Hoppe	997b599168	fix(debugger): pass correct sipAllocation to makeResident - sipAllocation for context must be resident in Offline mode Related-To: NEO-7630 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-07-28 20:51:12 +02:00
Mateusz Jablonski	f84f22d23c	refactor: remove unused isSharedContext variable Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-07-26 12:14:23 +02:00
Milczarek, Slawomir	027c51d396	feature: Add CPU side USM allocation to trim candidate list on page fault Enable eviction of CPU side USM allocation for UMD migrations on Windows. Reverts incorrect auto-revert commit 218de586a4f28b1de3e983b9006e7a99d3a4d10e. Related-To: NEO-8015 Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>	2023-07-25 15:21:12 +02:00
Dunajski, Bartosz	6656e23b86	refactor: unify completion check methods Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-07-25 14:17:52 +02:00
Dunajski, Bartosz	2c50fd9486	fix: waiting for completion in TBX mode - use testTaskCountReady method to check TaskCount value - download all allocations when TaskCount is ready Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-07-25 11:54:09 +02:00
Cencelewska, Katarzyna	6f45574ff9	fix: add proper bcs synchronization when marker Related-To: NEO-8105 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-07-24 10:57:41 +02:00
Compute-Runtime-Validation	918b41d26d	Revert "feature: Add CPU side USM allocation to trim candidate list on page f... This reverts commit `60a4448a07`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-07-24 08:44:22 +02:00
Milczarek, Slawomir	60a4448a07	feature: Add CPU side USM allocation to trim candidate list on page fage fault Enable eviction of CPU side USM allocation for UMD migrations on Windows. Related-To: NEO-8015 Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>	2023-07-23 10:24:28 +02:00
Compute-Runtime-Validation	4a562e352b	Revert "feature: Add CPU side USM allocation to trim candidate list on page f... This reverts commit `cce2cc920d`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-07-21 16:40:59 +02:00
Milczarek, Slawomir	cce2cc920d	feature: Add CPU side USM allocation to trim candidate list on page fault Enable eviction of CPU side USM allocation for UMD migrations on Windows. Related-To: NEO-8015 Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>	2023-07-21 14:18:38 +02:00
Dominik Dabek	0a4d0917d4	performance(ocl): skip dcFlush on no event Skip dcFlush on waitForBarrier without event by default. Related-To: NEO-8147 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-07-20 14:57:37 +02:00
Compute-Runtime-Validation	8c155a2e89	Revert "performance: Memory handling improvements" This reverts commit `5b80bd4d7c`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-07-20 11:37:09 +02:00
Neil R Spruit	8efd8853fb	feature: L0 Device Hierarchy support for Composite & Flat modes Related-To: LOCI-4615 - Added Support for users to set ZE_FLAT_DEVICE_HIERARCHY to either FLAT or COMPOSITE to change how devices are returned in zeDeviceGet and clGetDeviceIDs. - COMPOSITE is default behavior that exists today. - FLAT returns all sub devices which have no sub devices and all root devices that have no sub devices in zeDeviceGet ie with all devices flattened out in order. - Added zeDeviceGetRootDevice for one to retrieve the Root Device for any SubDevice. Signed-off-by: Neil R Spruit <neil.r.spruit@intel.com>	2023-07-20 01:00:54 +02:00
Filip Hazubski	5b80bd4d7c	performance: Memory handling improvements By default prefer allocating memory first by KMD, instead of malloc first. By default prefer not caching allocations on MTL devices. This results in allocations being handled with non-coherent pat index. For integrated devices when caching is not preferred do not allow direct memory access in CPU domain. For map/unmap operations create a dedicated memory allocation for CPU access, instead of accessing it directly, reusing the same logic as when mapping/unmapping local memory. Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>	2023-07-19 19:21:44 +02:00
Dominik Dabek	622a3ed89c	performance(ocl): flag to not dcFlush on no event If waitForBarrier is not passed outEvent then do dcFlush on the next synchronize call. Related-To: NEO-8147 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-07-18 15:38:54 +02:00
Mateusz Hoppe	776917159d	feature: cl_khr_external_memory - return platform info - return supported memory handle types when querying CL_PLATFORM_EXTERNAL_MEMORY_IMPORT_HANDLE_TYPES_KHR Related-To: NEO-6757 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-07-18 14:21:20 +02:00
Mateusz Jablonski	33e0eabe91	performance(ocl): verify tracingInProgress after checking for tracing support Related-To: NEO-7958 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-07-18 11:42:30 +02:00
Dunajski, Bartosz	815b37bf3a	performance: allow waiting for OOQ timestamps in clEnqueueWaitForEvents Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-07-14 11:33:10 +02:00
Dunajski, Bartosz	712e059ace	performance: check completion alloc only once when waiting for Event Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-07-14 08:32:50 +02:00
Maciej Bielski	c7a971a28f	feature: add optional onChunkFree callback to AbstractBuffersPool Instances returned by `getAllocationsVector()` in some cases cannot be freed (in the `malloc/new` sense) until the `drain()` function invokes `allocInUse()` on them. Plus, the `chunksToFree` container operates on pairs `{offset, size}`, not pointers, so such pair cannot be used to release allocations either. Provide an optional callback, which can be implemented by the custom pool derived from `AbstractBuffersPool`. This callback can be used, for example, to perform actual release of an allocation related to the currently processed chunk. Additionally, provide the `drain()` and `tryFreeFromPoolBuffer()` functions with pool-independent versions and keep the previous versions as defaults (for allocators with a single pool). The new versions allow reusing the code for cases when allocator has multiple pools. In both cases, there was no such needs so far but it arose when working on `IsaBuffersAllocator`. The latter is coming with future commits, but the shared code modifications are extracted as an independent step. Related-To: NEO-7788 Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>	2023-07-13 17:26:51 +02:00
Mateusz Hoppe	9fd7f9cf05	fix: set ImplicitArgs size to size of defined fields Resolves: NEO-8169 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-07-12 21:30:32 +02:00
Dunajski, Bartosz	40c7b2842f	fix: check engines completion before releasing deferred TSP nodes Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com> Related-To: NEO-8146	2023-07-11 16:11:51 +02:00
Cencelewska, Katarzyna	aa0beb8191	fix: Unify logic calculating threads per work group part 4 - also use helper when checking that is simd1 to have same flow Related-To: NEO-8087 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-07-07 15:34:59 +02:00
Dunajski, Bartosz	00bae2c827	fix: add missing nullptr check Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-07-07 11:34:49 +02:00
Compute-Runtime-Validation	9c7950cd22	Revert "feature: add optional onChunkFree callback to AbstractBuffersPool" This reverts commit `b7ecf99abb`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-07-07 04:31:30 +02:00
Dunajski, Bartosz	d96cf5846a	fix: dont allocate TSP for OOQ without Event Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-07-06 15:12:22 +02:00
Maciej Bielski	b7ecf99abb	feature: add optional onChunkFree callback to AbstractBuffersPool Instances returned by `getAllocationsVector()` in some cases cannot be freed (in the `malloc/new` sense) until the `drain()` function invokes `allocInUse()` on them. Plus, the `chunksToFree` container operates on pairs `{offset, size}`, not pointers, so such pair cannot be used to release allocations either. Provide an optional callback, which can be implemented by the custom pool derived from `AbstractBuffersPool`. This callback can be used, for example, to perform actual release of an allocation related to the currently processed chunk. Additionally, provide the `drain()` and `tryFreeFromPoolBuffer()` functions with pool-independent versions and keep the previous versions as defaults (for allocators with a single pool). The new versions allow reusing the code for cases when allocator has multiple pools. In both cases, there was no such needs so far but it arose when working on `IsaBuffersAllocator`. The latter is coming with future commits, but the shared code modifications are extracted as an independent step. Related-To: NEO-7788 Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>	2023-07-06 10:38:55 +02:00
Cencelewska, Katarzyna	61f701aba5	fix: Unify logic calculating threads per work group part 3 Related-To: NEO-8087 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-07-04 15:27:44 +02:00
Cencelewska, Katarzyna	2e17c21728	fix: Unify logic calculating threads per work group part 2 - use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to have same flow and proper values of threads per work groups Related-To: NEO-8087 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-07-04 10:34:02 +02:00
Compute-Runtime-Validation	39740da9d1	Revert "fix: Unify logic calculating threads per work group part 2" This reverts commit `1e8a53bd53`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-07-02 07:09:14 +02:00
Cencelewska, Katarzyna	1e8a53bd53	fix: Unify logic calculating threads per work group part 2 - use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to have same flow and proper values of threads per work groups Related-To: NEO-8087 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-06-30 14:16:08 +02:00

1 2 3 4 5 ...

2054 Commits