compute-runtime

Commit Graph

Author	SHA1	Message	Date
Szymon Morek	5e92d530de	performance: Reuse GPU timestamps by default on Windows Related-To: NEO-10615 Signed-off-by: Szymon Morek <szymon.morek@intel.com>	2024-05-28 12:25:30 +02:00
Lukasz Jobczyk	8217b76cef	refactor: Add key to not register pagefault handler on migration Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2024-05-28 08:45:34 +02:00
Compute-Runtime-Validation	0b2c9e92e7	Revert "performance: Reuse GPU timestamps by default on Windows" This reverts commit `bca3fecaa0`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2024-05-25 07:59:00 +02:00
Szymon Morek	bca3fecaa0	performance: Reuse GPU timestamps by default on Windows Related-To: NEO-10615 Signed-off-by: Szymon Morek <szymon.morek@intel.com>	2024-05-24 20:11:45 +02:00
Bartosz Dunajski	913d5dc3b1	feature: create secondary contexts for different engine types Related-To: NEO-7824 Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>	2024-05-24 15:14:24 +02:00
Zbigniew Zdanowicz	294c3b77ba	refactor: add level zero console logging for kernel buffer arguments Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2024-05-23 11:01:38 +02:00
Bartosz Dunajski	cb9977b8f4	feature: create copy offload queue under debug flag Related-To: NEO-11376 Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>	2024-05-17 11:04:35 +02:00
Compute-Runtime-Validation	34f53d5d94	Revert "performance: Reuse GPU timestamps by default" This reverts commit `7aceed58ca`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2024-05-16 21:52:10 +02:00
Mateusz Jablonski	03d87d27ef	fix: generate per process aub file name by default Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2024-05-16 09:03:21 +02:00
Szymon Morek	7aceed58ca	performance: Reuse GPU timestamps by default Related-To: NEO-10615 Signed-off-by: Szymon Morek <szymon.morek@intel.com>	2024-05-15 17:51:42 +02:00
Slawomir Milczarek	b37c2970ce	test: Rename regkey BcsNumberOverride to BlitterEnableMaskOverride BlitterEnableMaskOverride is a bitmask with BCS engines available on device Related-To: NEO-11152 Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>	2024-05-10 21:18:44 +02:00
Slawomir Milczarek	2473c38e31	test: Add regkey to override number of BCS engines on platform New regkey BcsNumberOverride for use in TBX and AUB mode Related-To: NEO-11082 Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>	2024-05-07 18:42:01 +02:00
Szymon Morek	83e8ae4a20	performance: Reuse GPU timestamp instead of KMD escape This can be enabled only if related debug flag will be set. Related-To: NEO-10615 Signed-off-by: Szymon Morek <szymon.morek@intel.com>	2024-05-06 14:46:30 +02:00
Bartosz Dunajski	8e5f9e72c8	refactor: simplify waiting for fence logic Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>	2024-04-25 22:36:19 +02:00
Slawomir Milczarek	6d15c248ec	feature: Add regkey to control AUB/TBX writable for buffer host memory The new regkey SetBufferHostMemoryAlwaysAubWritable=0/1 allows controlling if buffer host memory allocation is one-time AUB / TBX writable. Related-To: NEO-11158 Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>	2024-04-24 21:47:02 +02:00
Kulkarni, Ashwin Kumar	8c1f0836ae	feature: enables basic framework for spdlogs Related-To: NEO-10510 Signed-off-by: Kulkarni, Ashwin Kumar <ashwin.kumar.kulkarni@intel.com>	2024-04-23 07:23:46 +02:00
Compute-Runtime-Validation	da9df9f0e7	Revert "performance: Reuse GPU timestamp instead of KMD escape" This reverts commit `9ca2091725`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2024-04-18 10:25:15 +02:00
Morek, Szymon	9ca2091725	performance: Reuse GPU timestamp instead of KMD escape Resolves: NEO-10615 Signed-off-by: Morek, Szymon <szymon.morek@intel.com>	2024-04-17 09:39:29 +02:00
Mateusz Jablonski	cb2b572e94	feature: add support for null aub mode In this mode AUB csr will be created, however, no aub file will be created Related-To: NEO-11097 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2024-04-09 16:59:42 +02:00
Young Jin Yoon	d6a14d4ed5	feature: support explicit memory locking Added lockMemory in context to explicitly locking memory, Added a boolean flag in graphics_allocation to indicate the allocation is locked, and modified memory_operations_handler to add lock(). Related-To: NEO-8277 Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>	2024-03-29 07:31:22 +01:00
Maciej Plewka	b722f3b579	feature: Add interface to bind resources as readonly Related-To: NEO-10398 Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>	2024-03-27 14:24:58 +01:00
Compute-Runtime-Validation	df164174b4	Revert "fix: pass FtrTile64Optimization as-is" This reverts commit `22d08dabc4`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2024-03-22 23:56:28 +01:00
Bartosz Dunajski	9aa81bae75	feature: initial support to enable synchronized dispatch Related-To: NEO-8171 Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>	2024-03-22 17:23:58 +01:00
Dominik Dabek	2b964254d6	performance: debug key for adjust ULLS on battery ULLS controller timeout settings will be adjusted based on ac line status and lowest queue throttle from submissions. Lowest queue throttle is reset when controller stops ULLS. Related-To: NEO-10800 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2024-03-22 14:24:00 +01:00
Zbigniew Zdanowicz	12affba420	feature: add override key to change command list update capability Related-To: NEO-10062 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2024-03-21 17:11:27 +01:00
Mateusz Jablonski	22d08dabc4	fix: pass FtrTile64Optimization as-is Related-To: NEO-10623 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2024-03-21 16:52:27 +01:00
Aravind Gopalakrishnan	04b99de4d6	refactor: Force tlb flush during TC after copy Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>	2024-03-21 07:25:46 +01:00
Mateusz Hoppe	27b930cabc	refactor: allow default setting for UseExternalAllocatorForSshAndDsh - value of -1 selects driver default setting for external allocator Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2024-03-20 12:29:56 +01:00
Joshua Santosh Ranjan	06fcdd28f3	feature: add debug flag for metrics logs Related-To: NEO-10125 Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>	2024-03-19 12:33:26 +01:00
Mateusz Jablonski	1e1d675606	fix: disable passing FtrTile64Optimization to gmmlib add debug key to control if the value should be passed Related-To: NEO-10785 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2024-03-15 17:42:53 +01:00
Young Jin Yoon	82728ff394	feature: add logic to iterate for all contexts to check GPU pagefault Implemented to go through entire contexts in the process and then query reset status to check the unexpected GPU segfault. Added a new debug variable GpuFaultCheckThreshold to change the checking frequency for each hang check for performance analysis. Related-To: GSD-5673 Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>	2024-03-15 07:48:39 +01:00
Aravind Gopalakrishnan	3f20dd3b49	refactor: Add optional user fence during unbind Add optional fence and wait operations after unbind operation. Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>	2024-03-13 12:47:44 +01:00
Lukasz Jobczyk	c3f1eba24a	refactor: Add flag to control DC flush Related-To: NEO-10556 Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2024-03-12 14:54:16 +01:00
Mrozek, Michal	ee1a225a41	refactor: remove not used debug variables Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>	2024-03-12 10:12:23 +01:00
Dominik Dabek	5ba9308804	performance: debug flag for localPreferred Add flag for setting localPreferred (implicit when gmm localOnly=0 and NonLocalOnly=0) when allocating buffer, svmGpu and image. Related-To: NEO-9695 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2024-03-11 10:51:49 +01:00
Mateusz Hoppe	cb7ac1ada0	feature: add debug key to generate sip header file - header file can be used with LoadBinarySipFromFile Related-To: GSD-8253 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2024-03-08 19:03:43 +01:00
Bartosz Dunajski	fcd57f94cf	refactor: capability to print mmap and munmap calls Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>	2024-03-06 14:29:01 +01:00
Lukasz Jobczyk	cfd3edfb2c	fix: Align IOH entry Related-To: NEO-10036 Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2024-02-26 14:36:31 +01:00
Dominik Dabek	07639401c5	performance: enable pat index, mtl linux Enable programming pat indexes on mtl linux by default. Related-To: NEO-7896 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2024-02-16 18:31:21 +01:00
Dominik Dabek	0120d8a58d	performance: program pat index on mtl linux Enable programming pat indexes on mtl linux for device buffers. Change DrmMemoryManager::allocateMemoryByKMD to use gemCreateExt. Set mmap flags based on coherency. Map as write back on legacy and coherent. On non-coherent map as write combined. Changes currently disabled, to enable use debug keys: DisableGemCreateExtSetPat=0 UseGemCreateExtInAllocateMemoryByKMD=1 Reorder BufferObject to decrease padding. Related-To: NEO-7896 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2024-02-16 17:33:07 +01:00
Compute-Runtime-Validation	7b340775c6	Revert "performance: program pat index on mtl linux" This reverts commit `8e0b23db84`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2024-02-15 02:06:03 +01:00
Dunajski, Bartosz	88c5872682	feature: debug flag to flush tlb before copy Related-To: HSD-18036669673 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2024-02-14 20:05:57 +01:00
Dominik Dabek	8e0b23db84	performance: program pat index on mtl linux Enable programming pat indexes on mtl linux for device buffers. Change DrmMemoryManager::allocateMemoryByKMD to use gemCreateExt. Related-To: NEO-7896 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2024-02-14 18:42:04 +01:00
Yoon, Young Jin	97ef964bc4	feature: Add keys to override sync mode for immediate command list Added OverrideImmediateCmdListSynchronousMode to override synchronous mode for immediate command list Related-To: NEO-10316 Signed-off-by: Yoon, Young Jin <young.jin.yoon@intel.com>	2024-02-08 08:35:32 +01:00
Lukasz Jobczyk	486cc71b76	refactor: Add GDI profiling Resolves: NEO-9236 Related-To: NEO-10036 Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2024-02-07 18:44:11 +01:00
Dominik Dabek	371788210d	performance: limit usm host allocation recycle Query system total memory size and limit usm host allocation recycle to use at most x%. x is read from ExperimentalEnableDeviceAllocationCache for device and ExperimentalEnableHostAllocationCache for host. Related-To: GSD-7497 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2024-02-07 17:45:41 +01:00
Dunajski, Bartosz	f31fafb1e2	refactor: improve debug flag to override bcs mocs Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2024-02-05 20:23:54 +01:00
Kamil Kopryk	a4f7dda98f	refactor: Add xe print debug key Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>	2024-02-02 16:39:51 +01:00
Katarzyna Cencelewska	e6ba9766bd	feature: add debug flags to force pat index for cached recouces: OverridePatIndexForCachedTypes for uncached resouces: OverridePatIndexForUncachedTypes Related-To: NEO-10157 Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>	2024-02-02 16:11:34 +01:00
Dominik Dabek	2cad595a0d	performance: debug flag for usm host alloc recycle set ExperimentalEnableHostAllocationCache=1 to recycle host usm allocations Related-To: GSD-7497 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2024-02-01 16:47:59 +01:00
Compute-Runtime-Validation	fb46066abc	Revert "fix: enable cache env variables for level-zero" This reverts commit `743904d2df`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2024-01-31 08:33:05 +01:00
Fabian Zwolinski	743904d2df	fix: enable cache env variables for level-zero Related-To: NEO-10045 Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2024-01-30 14:01:42 +01:00
Compute-Runtime-Validation	fa9c79fb63	Revert "refactor: Add GDI profiling" This reverts commit `524ae7713a`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2024-01-30 10:47:34 +01:00
Lukasz Jobczyk	524ae7713a	refactor: Add GDI profiling Resolves: NEO-9236 Related-To: NEO-10036 Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2024-01-29 11:36:04 +01:00
Zbigniew Zdanowicz	a25eedb5ac	feature: add print of cpu flags and address size upon detection Related-To: NEO-9737 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2024-01-24 11:03:30 +01:00
Compute-Runtime-Validation	e949ba7144	Revert "refactor: Add GDI profiling" This reverts commit `8d56f8fb6b`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2024-01-23 06:13:02 +01:00
Lukasz Jobczyk	8d56f8fb6b	refactor: Add GDI profiling Resolves: NEO-9236 Related-To: NEO-10036 Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2024-01-22 14:24:08 +01:00
Mateusz Jablonski	7b40b01f54	feature: add debug key for toggling bit in 57bit GPU VA for specific allocations Related-To: NEO-9419 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2024-01-15 19:37:00 +01:00
Dominik Dabek	997bdfa010	performance: add windows thread priority debug key Set windows thread priority to "above normal" on wddm init if flag is set. Related-To: NEO-8215 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2024-01-15 08:14:46 +01:00
Young Jin Yoon	4ccae1dbb4	feature: support memory policy for GEM_CREATE Modified ioctl_helper_prelim to support the extension of gem_create_ext, i.e. prelim_drm_i915_gem_create_ext_mempolicy. Added two debug variables to be used for the mempolicy extension. Modified functions in memory_info and drm_memory_manager to support extension Added numaif.h from https://github.com/numactl/numactl/tree/master, v2.0.14 Related-To: NEO-8276 Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>	2024-01-04 23:49:10 +01:00
Mateusz Hoppe	31e9b5e9fa	feature: add support for secondary contexts in group Related-To: NEO-7824 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-12-28 13:31:08 +01:00
John Falkowski	138f22f684	fix: correct calculation for chunking size Resolves: NEO-9562 Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-12-27 16:27:09 +01:00
Dunajski, Bartosz	df66a0276f	refactor: remove not used logic to check dynamic postsync layout Related-To: NEO-8210 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-12-27 13:12:11 +01:00
Dominik Dabek	2fe3804cc2	performance(ocl): add usm allocation pooling flag EnableDeviceUsmAllocationPool and EnableHostUsmAllocationPool for device and host allocations respectively. Pool size will be set to flag value * MB. Allocation size threshold to be pooled is 1MB. Pools are created per context. Related-To: NEO-9700 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-12-27 11:42:01 +01:00
Naklicki, Mateusz	08f7e7be18	fix: align NEO to new Xe KMD header Align to the new PAT and cache coherency support There is an issue with coherency=non_coh, which is default option for some platforms. Add temporary W/A until this issue is resolved. xe_drm.h header is generated from the series "PAT and cache coherency support" from https://patchwork.freedesktop.org/series/123027/ Related-To: NEO-9421, NEO-8324 Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>	2023-12-22 16:44:26 +01:00
Zbigniew Zdanowicz	7418cff844	feature: add debug flags and instrumentation of waitpkg calls Related-To: NEO-9737 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-12-22 08:34:13 +01:00
Compute-Runtime-Validation	570b4d3d39	Revert "fix: align NEO to new Xe KMD header" This reverts commit `f68b8a2c97`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-12-14 10:23:31 +01:00
Naklicki, Mateusz	f68b8a2c97	fix: align NEO to new Xe KMD header Align to the new PAT and cache coherency support xe_drm.h header is generated from the series "PAT and cache coherency support" from https://patchwork.freedesktop.org/series/123027/ Related-To: NEO-9421, NEO-8324 Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>	2023-12-13 14:14:35 +01:00
Dunajski, Bartosz	8b58cbbad8	feature: create duplicated storage for in-order counter Related-To: NEO-8145 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-12-08 18:19:03 +01:00
Lukasz Jobczyk	c8c3f862f4	refactor: Add key to force zero copy without coherency Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2023-12-07 07:56:54 +01:00
Mateusz Jablonski	da957d1a37	refactor: correct naming of enum class constants 1/n Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-05 14:26:42 +01:00
Dominik Dabek	6cf6a8def8	performance: add power throttling debug key Set windows process power throttling hint to HIGH on wddm init Related-To: NEO-8215 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-12-04 15:03:33 +01:00
John Falkowski	911acd81a2	feature: add SetBOChunkingSize debug variable Resolves: NEO-9562 Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-12-04 06:21:01 +01:00
Compute-Runtime-Validation	9add9f12dc	Revert "feature: add SetBOChunkingSize debug variable" This reverts commit `e1df8f9112`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-12-03 03:28:16 +01:00
Dunajski, Bartosz	2c921ec940	feature: support to use mi_atomic for signalling in-order counter Related-To: NEO-7966 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-12-01 15:35:12 +01:00
John Falkowski	e1df8f9112	feature: add SetBOChunkingSize debug variable Resolves: NEO-9562 Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-12-01 09:39:47 +01:00
Mateusz Jablonski	c9664e6bad	refactor: rename global debug manager to debugManager Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-11-30 13:00:59 +01:00
Mateusz Jablonski	36194c4e7d	refactor: correct variable namings Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-11-29 23:49:03 +01:00
Dunajski, Bartosz	5772b17924	refactor: Add debug flag to check Device State on failed Wddm submission Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-11-29 18:44:25 +01:00
Dunajski, Bartosz	aba1cd8f9c	feature: improve waiting and signaling Events via KMD calls Related-To: NEO-8179 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-11-23 14:53:27 +01:00
Baj, Tomasz	c49a9b9787	refactor: remove ReturnSubDevicesAsApiDevices from shared code Related-To: NEO-9437 Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>	2023-11-22 15:13:29 +01:00
Kacper Nowak	1b932bf119	fix: allow legacy device binary validation logic for Blender on DG2 and MTL Temporarily opt-out from additional compatibility checks on DG2 and MTL for Blender and its derivatives AOT-compiled kernels. This prevents a long kernel recompilation. Additionally, same behavior can be enforced for other applications manually via NEO debug key named DoNotUseProductConfigForValidationWa. Signed-off-by: Kacper Nowak <kacper.nowak@intel.com> Related-To: NEO-9240	2023-11-21 16:05:17 +01:00
Dominik Dabek	6562828095	performance: prealloc internal heap on mtl Preallocate 1 internal heap allocation per csr on mtl Related-To: NEO-8152 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-11-17 13:36:21 +01:00
Dunajski, Bartosz	a0beb96db8	feature: initial support for implicit convertion to CounterBased Events Related-To: NEO-8145 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-11-08 14:59:12 +01:00
John Falkowski	f0175b3916	feature: set device allocation chunking as default Device allocation chunking only applies for multi-tile mode for implicit scaling Related-To: NEO-9051 Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-11-07 10:58:17 +01:00
Zbigniew Zdanowicz	20c3f45998	refactor: add ulls diagnostic flag to select monitor fence input Related-To: NEO-8395 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-11-06 15:09:36 +01:00
Zbigniew Zdanowicz	e0ce08bb77	fix: detect gpu hang or page fault at direct submission flush to gpu Related-To: NEO-8395 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-11-06 14:22:02 +01:00
Michal Mrozek	ed897c302d	performance: Implement V2 version of tg dispatch size algorithm. Signed-off-by: Michal Mrozek <michal.mrozek@intel.com> Related-To: NEO-6989 -Prevent imbalance in multi dimensional dispatches -Make sure to utilize as much Eus as possible -Prefer highest possible tg dspatch count possible -Make sure that xe_core doesn't have uneven workgroups	2023-11-03 15:54:04 +01:00
Zbigniew Zdanowicz	19586277ca	refactor: add debug flag to control delay after waiting for paging fence on cpu Related-To: NEO-8395 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-11-03 12:49:39 +01:00
Dominik Dabek	39cf653959	performance(ocl): cmd buffer prealloc per cmdqueue Add mechanism to preallocate cmd buffer allocations in command stream receiver reusable allocations list per command queue initialized. This should limit additional allocations during hot loop. Needs to be enabled in subsequent commits by setting product helper method. Related-To: NEO-8152 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-10-27 16:56:29 +02:00
Mateusz Hoppe	5d572b9c8f	feature: allow freeing memory in aubstream Related-To: NEO-2707 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-10-26 17:16:23 +02:00
Compute-Runtime-Validation	69f614a8c2	Revert "fix: allow legacy device binary validation logic for Blender on DG2 p... This reverts commit `d3d15542fb`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-10-24 21:00:19 +02:00
Kacper Nowak	d3d15542fb	fix: allow legacy device binary validation logic for Blender on DG2 platforms Temporarily opt-out from additional compatibility checks on DG2 for Blender AOT-compiled kernels. This prevents a long kernel recompilation. Additionally, same behavior can be enforced for other applications manually via NEO debug key named DoNotUseProductConfigForValidationWa. Signed-off-by: Kacper Nowak <kacper.nowak@intel.com> Related-To: NEO-9240	2023-10-23 18:20:37 +02:00
Mateusz Jablonski	8da4a9cbc7	fix: add debug flag to control non walker signalling in in-order cmdlist set to false by default Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-10-23 14:43:40 +02:00
John Falkowski	f156a74f54	fix: split chunking prefetch flags Related-To: NEO-9120 Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-10-18 19:20:42 +02:00
Dominik Dabek	75c4844987	feature(internal): logging kernel dispatch params Use debug flag PrintKernelDispatchParameters to print params used in thread group dispatch size heuristic when encoding kernel dispatch. Related-To: NEO-6989 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-10-17 17:31:54 +02:00
Compute-Runtime-Validation	30b066c40e	Revert "fix: synchronize host and device timers to avoid device timer overflow" This reverts commit `dae8c34f81`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-10-16 11:16:23 +02:00
Dunajski, Bartosz	0592390e2b	refactor: print gmm compression settings Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-10-16 09:14:52 +02:00
Mateusz Jablonski	dae8c34f81	fix: synchronize host and device timers to avoid device timer overflow Related-To: NEO-8394 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-10-13 17:40:45 +02:00
Dunajski, Bartosz	06a02552ce	refactor: debug flag to override PAT index for given memory type Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-10-12 15:47:28 +02:00
Filip Hazubski	08e92d154f	fix: Add getDefaultDeviceHierarchy call to GfxCoreHelper Added getDefaultDeviceHierarchy call that describes default device hierarchy for a gfx core. Refactored L0 and OCL paths to use this value by default and override this value when user sets ZE_FLAT_DEVICE_HIERARCHY environment variable or ReturnSubDevicesAsApiDevices debug key. Updated ReturnSubDevicesAsApiDevices to force COMPOSITE device hierarchy when set to 0. Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>	2023-10-06 12:32:41 +02:00
Mateusz Jablonski	110164a52a	fix: remove invalid std::forward Related-To: NEO-9038 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-10-03 16:41:41 +02:00
Mateusz Jablonski	a033df33ff	fix: remove preferSmallWorkgroupSizeForKernel method Related-To: HSD-18033866078 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-09-29 11:55:09 +02:00
Dunajski, Bartosz	4e8600d8d0	feature: initial support for RelaxedOrdering of in-order Events chaining Disabled by default. Related-To: NEO-7966 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-09-27 16:45:20 +02:00
Dunajski, Bartosz	42496ac96d	feature: initial support for patching regular in-order CmdList Related-To: NEO-7966 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-09-21 14:20:50 +02:00
Dunajski, Bartosz	b94f58abaa	feature: debug flag to enable in-order events Related-To: NEO-7966 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-09-21 11:22:48 +02:00
Dominik Dabek	1b7e178b25	performance(ocl): program barrier pc in taskStream Program barrier to task stream, before next enqueue kernel. This will reduce the number of batch buffer starts for sequences of enqueue, barrier, enqueue, ... . Related-To: NEO-8147 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-09-19 11:48:02 +02:00
Mrozek, Michal	451c48fc2f	refactor: remove not needed code. Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>	2023-09-12 10:51:35 +02:00
Mateusz Jablonski	46288b8efd	fix: setup correct non-release key name in getStringWithFlags unify function for getting env Related-To: NEO-8347 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-09-08 15:39:36 +02:00
Joshua Santosh Ranjan	91784a87cc	fix: Return success for system address in setArg This patch avoids returning error for system addresses in setArg Related-To: GSD-3597 Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>	2023-09-08 05:27:55 +02:00
Zbigniew Zdanowicz	cb641226b5	fix: add debug key to provide alternative directory for wddm residency logs Related-To: NEO-8211 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-09-01 10:15:09 +02:00
Mateusz Hoppe	9e89704624	feature: debug flag to disable DriverStore path enforcement Resolves: NEO-8320 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-08-31 08:20:53 +02:00
John Falkowski	d49190f4ae	feature: Add debug/release variables prefixes Add debug/release variables with prefixes for Level Zero, OpenCL and NEO Resolves: NEO-6357 Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-08-22 15:15:45 +02:00
Dunajski, Bartosz	7e6e0da978	feature: flush task count on cmd list hostSynchronize if needed Related-To: NEO-7966 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-08-22 14:29:14 +02:00
Dunajski, Bartosz	f3b2458a9c	fix: Use immediate command queue instead of CSR to obtain TaskCount. Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-08-21 15:04:46 +02:00
Artur Harasimiuk	f6e0c0cf89	Revert "feature: Add debug/release variable prefixes" This reverts commit `ec95d9314a`. Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>	2023-08-18 12:42:39 +02:00
John Falkowski	2403212dcd	fix: chunking prefetch add USER_FENCE Add USER_FENCE before PREFETCH call and after the BIND Related-To: NEO-8098 Signed-off by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com> Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-08-17 21:32:47 +02:00
John Falkowski	ec95d9314a	feature: Add debug/release variable prefixes Resolves: NEO-6357 Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-08-10 14:01:09 +02:00
Fabian Zwolinski	6fca8ee195	refactor: Remove SourceLevelDebugger Removed: - SourceLevelDebugger (with tests) - DebuggerLibrary - DebuggerLibraryRestore - debuggerSupported field from hwInfo.capabilityTable - HasSourceLevelDebuggerSupport matcher - ExperimentalEnableSourceLevelDebugger debug var - EnableMockSourceLevelDebugger debug var - DebuggerOptDisable debug var - lib_names.h.in file - third_party/source_level_debugger/igfx_debug_interchange_types.h Related-To: NEO-7213 Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2023-08-10 11:14:02 +02:00
Dominik Dabek	12ab74fe96	performance: flag to program barrier in task cs Add debug flag ProgramBarrierInCommandStreamTask to program barrier pipe control in task command stream instead of csr command stream. This will reduce the number of batch buffer starts. Related-To: NEO-8147 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-08-02 10:26:34 +02:00
Filip Hazubski	12af65a970	fix: Change default value of EnableCpuCacheForResources debug toggle This change disables CPU caching for resources not accessed by CPU for MTL devices. Related-To: NEO-7194 Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>	2023-07-31 09:15:43 +02:00
Filip Hazubski	7ea22d0369	feature: Add pat index programming to gem create ext call When upstream ioctl helper is created it will try to create small allocation, adding I915_GEM_CREATE_EXT_SET_PAT extension. If it succeeds, for all resources with valid pat index value it will then explicitly program pat index value with gem create ext call. PrintBOCreateDestroyResult value can be used to: - print whether the set pat extension is supported by the kernel, when ioctl helper is created - print whether set pat extension was added for a given gem create ext call and what pat index value was programmed Note: introduced changes are disabled by defualt. Toggle DisableGemCreateExtSetPat can be used to enable new functionality. Related-To: NEO-7896 Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>	2023-07-31 09:00:04 +02:00
Dunajski, Bartosz	e1e9907973	feature: debug flag to signal user interrupts. Related-To: NEO-7966 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-07-28 18:56:28 +02:00
Dunajski, Bartosz	a241099dff	feature: use WaitUserFence on zeEventHostSynchronize Disabled by default. Debug flag is required. Related-To: NEO-7966 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-07-26 19:41:09 +02:00
Joshua Santosh Ranjan	b6e76b9118	fix: Move event reference time tracking into event class This would avoid recalculating reference timestamps when event is used with different command lists. Related-To: LOCI-4563 Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>	2023-07-25 08:44:47 +02:00
Mateusz Hoppe	e52712b800	feature(ocl): enable "cl_khr_external_memory" extension - report extension string - report supported memory handle types Related-To: NEO-6757 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-07-24 14:22:39 +02:00
Dominik Dabek	0a4d0917d4	performance(ocl): skip dcFlush on no event Skip dcFlush on waitForBarrier without event by default. Related-To: NEO-8147 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-07-20 14:57:37 +02:00
Compute-Runtime-Validation	8c155a2e89	Revert "performance: Memory handling improvements" This reverts commit `5b80bd4d7c`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-07-20 11:37:09 +02:00
Filip Hazubski	5b80bd4d7c	performance: Memory handling improvements By default prefer allocating memory first by KMD, instead of malloc first. By default prefer not caching allocations on MTL devices. This results in allocations being handled with non-coherent pat index. For integrated devices when caching is not preferred do not allow direct memory access in CPU domain. For map/unmap operations create a dedicated memory allocation for CPU access, instead of accessing it directly, reusing the same logic as when mapping/unmapping local memory. Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>	2023-07-19 19:21:44 +02:00
Wilma, Pawel	39b25abf0e	feature: debug flag to enable/disable AIL Related-to: NEO-8049 Signed-off-by: Wilma, Pawel <pawel.wilma@intel.com>	2023-07-19 12:10:05 +02:00
Mateusz Jablonski	01990e8bd7	feature(internal): add debug flag to control preferred allocation method on Wddm Related-To: NEO-7194 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-07-18 16:46:17 +02:00
Dominik Dabek	622a3ed89c	performance(ocl): flag to not dcFlush on no event If waitForBarrier is not passed outEvent then do dcFlush on the next synchronize call. Related-To: NEO-8147 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-07-18 15:38:54 +02:00
Lukasz Jobczyk	83bd33befc	refactor: Add flag to control BCS split for pageable memory Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2023-07-11 15:12:40 +02:00
Jaime Arteaga	23eeaf816d	feature: Add debug keys for chunking allocation and size Related-to: NEO-7695 New debug keys added: EnableBOChunking is now a mask 0 = no chunking (default). 1 = shared allocations only 2 = device allocations only 3 = shared and device allocations MinimalAllocationSizeForChunking sets the minimum allocation size to apply chunking. Default is 2MB. Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>	2023-07-07 23:39:43 +02:00
Michal Mrozek	5eadedc36e	refactor: Remove not used logic. Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>	2023-06-30 10:58:35 +02:00
Zbigniew Zdanowicz	21823af419	performance: add skeleton method to cmdlist immediate flush task Related-To: NEO-7808 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-06-30 10:46:20 +02:00
Zbigniew Zdanowicz	1067167637	test: adding testing debug flag overriding driver version Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-06-27 17:58:03 +02:00
Dunajski, Bartosz	7ac825e74b	refactor: add debug flag to synchronize Event before reset Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-06-26 17:38:37 +02:00
Cencelewska, Katarzyna	68d81c82a7	fix: Use proper value about hw local id generations - remove useless flag ForceNumberOfThreadsInGpgpuThreadGroup - add new flag "RemoveRestrictionsOnNumberOfThreadsInGpgpuThreadGroup" to restore old path without restrictions about number of threads in thread group - fix forwarding information about hw local ids generations to calculate numOfThreadsInThreadGroup correctly Related-To: NEO-7952, NEO-7982 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-06-26 16:35:42 +02:00
Dunajski, Bartosz	aea5f435db	feature: unregister CSR client on Event host synchronize Related-To: NEO-7458 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-06-26 12:02:14 +02:00
Joshua Santosh Ranjan	97b4d8bab5	feature: add initial support for host mapped timestamps Related-To: LOCI-4171 Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>	2023-06-26 08:29:58 +02:00
Dunajski, Bartosz	b004a27e4e	refactor: Debug flag to print TSP usage Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-06-22 14:47:39 +02:00
Mateusz Jablonski	26ad315207	feature: enable allocating shared usm in heap extended host by default Related-To: NEO-7665 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-06-20 15:45:25 +02:00
Cencelewska, Katarzyna	9f7374da6e	fix: Change default setting flag EnableCpuCacheForResources to true on mtl Related-To: HSD-18030829682 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-06-19 12:22:42 +02:00
Mateusz Jablonski	3b981331c9	fix: correct handling ZE_ENABLE_PCI_ID_DEVICE_ORDER flag - by default ZE_ENABLE_PCI_ID_DEVICE_ORDER is disabled - by default devices are sorted by type (discrete first), then by pci order - when ZE_ENABLE_PCI_ID_DEVICE_ORDER is enabled, devices are sorted by pci id Related-To: LOCI-4520 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-06-14 16:27:55 +02:00
Cencelewska, Katarzyna	7cb3278eb3	fix: add function to calculate number of threads per tg Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-06-13 14:02:24 +02:00
Dunajski, Bartosz	3d49658f50	feature: new multitile post sync layout for immediate write [2/n] No functional changes in this commit. This is prework. Related-To: NEO-7966 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-06-09 14:20:34 +02:00
Cencelewska, Katarzyna	baa4ba9c56	fix: set default value of EnableCpuCacheForResources to false - this flag is affecting only mtl Related-To: NEO-7194 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-06-05 13:42:56 +02:00
Jaime Arteaga	2efd6e547a	feature: Add support for chunking in the UMD (1/N) Read if support for chunking is available in the KMD. If available, KMD will create a BO with 1 or more chunks, depending on the chunk size selected. Related-To: NEO-7695 Sync to https://github.com/intel-gpu/drm-uapi-helper/releases/tag/v2.0-rc18 Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com> Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-06-02 23:27:40 +02:00
Bellekallu Rajkiran	3c072a6cd1	fix: WA for VF bar resource allocation post Warm reset On Warm reset, With default bar size set by bios, VF bar allocation is getting failed because of bug in pci driver which impacts SRIOV functionality. Resize VF bar size for succesful allocation of VF bar post warm reset. Related-To: LOCI-4481 Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>	2023-06-02 13:16:34 +02:00
Warchulski, Jaroslaw	03d9a20559	feature: add debug flag to wait for release memory Related-To: NEO-6766 Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>	2023-06-02 09:57:27 +02:00
Cencelewska, Katarzyna	115d6de350	fix: add debug key to verify device state before submit - new debug key EnableDeviceStateVerification to check device state not ony in debug mode Related-To: NEO-7669 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-05-31 14:31:23 +02:00
Compute-Runtime-Validation	9cc7028025	Revert "feature: enable allocating shared usm in heap extended host by default" This reverts commit `5b178e68e9`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-05-31 09:39:11 +02:00
Mateusz Jablonski	5b178e68e9	feature: enable allocating shared usm in heap extended host by default Related-To: NEO-7665 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-05-29 11:28:18 +02:00
Joshua Santosh Ranjan	29682a4f8d	feature: print global timestamp Related-To: LOCI-4285 Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>	2023-05-25 09:45:13 +02:00
Daria Hinz	331f167cfe	feature: Add debug flag for setting hw ip version Signed-off-by: Daria Hinz <daria.hinz@intel.com> Related-To: NEO-7954	2023-05-23 15:32:46 +02:00
Compute-Runtime-Validation	d390ec6e8d	Revert "fix: set default value of flag EnableCpuCacheForResources to false" This reverts commit `305cc00b0f`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-05-19 11:40:47 +02:00
Katarzyna Cencelewska	305cc00b0f	fix: set default value of flag EnableCpuCacheForResources to false when flag disabled, gmm flag Cacheable won't set on xe_hp and later Related-To: NEO-7194 Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>	2023-05-18 10:40:01 +02:00
Cencelewska, Katarzyna	71ec4c528f	fix: set default value of flag EnableCpuCacheForResources to true Related-To: HSD-18030023426, HSD-18030026101, HSD-18030022460 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-05-17 11:26:36 +02:00
Lukasz Jobczyk	0e758e4bb5	performance: Add debug flag to set BCS split minimal size Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2023-05-17 08:07:43 +02:00
Katarzyna Cencelewska	004a3d875c	fix: Remove default setting of gmm flag Cacheable to true - add debug flag EnableCpuCacheForResources to be able to allow coherency when resources could be cacheable Resolves: NEO-7194 Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>	2023-05-16 09:17:29 +02:00
Dunajski, Bartosz	cfacbbd811	refactor: Simplify OverrideBlitterMocs usage Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-05-09 19:22:57 +02:00
Warchulski, Jaroslaw	7fdf4985a3	feature: add support for cl_khr_external_memory extension Related-To: NEO-7069 Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>	2023-05-05 15:51:39 +02:00
Bellekallu Rajkiran	d3a31957db	feature(sysman): Add delay for HBM diagnostics Add debug variable to set sleep duration for HBM IFR to complete Related-To: LOCI-4298 Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>	2023-05-03 20:27:21 +02:00
Mateusz Jablonski	74205f3f37	Revert "feature: enable allocating shared usm in heap extended host by default" This reverts commit `26f16f4e98`. Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-05-02 09:12:26 +02:00
Aravind Gopalakrishnan	1883161e1e	fix: Add debug key to Force Tlb flush Related-To: GSD-4457 Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>	2023-05-01 17:52:22 +02:00
Mateusz Jablonski	26f16f4e98	feature: enable allocating shared usm in heap extended host by default Related-To: NEO-7665 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-04-28 13:18:20 +02:00
Mateusz Jablonski	5a5c20f99c	fix: create separate heap for host and shared usm in 48-56b VA Related-To: NEO-7665 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-04-28 10:09:38 +02:00
Cencelewska, Katarzyna	861ec524c6	fix: check icbe version only once when patchtoken - set by default flag ZebinIgnoreIcbeVersion to true - for zebin icbe version check is only inside flag - only when use patchtoken then check icbe version is mandatory Resolves: NEO-7904 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-04-28 09:26:02 +02:00
Dunajski, Bartosz	14c3777409	feature: Experimental support of immediate cmd list in-order execution [1/n] Related-To: LOCI-4332 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-04-26 13:15:59 +02:00
Mateusz Jablonski	06bd405e88	feature: add debug flag to control usage of heap extended for USM Host Related-To: NEO-7665 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-04-25 15:39:49 +02:00
Fabian Zwolinski	2022592f3d	Apply CamelCase for class and struct names 2/2 Additionally change .clang-tidy not to ignore struct names. Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2023-04-25 13:10:23 +02:00
Dunajski, Bartosz	6e9257c623	Debug flag to force early exit Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-04-25 09:44:44 +02:00
Lukasz Jobczyk	853a65aae9	Add PCI barrier implementation Resolves: NEO-7850 Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2023-04-24 10:33:03 +02:00
Kacper Nowak	c7adbc2140	Add debug key for dumping ELF to file Add "DumpZEBin" debug flag. When this flag is enabled, Zebin will be dumped to a .elf file (with appropiate suffix, in case such file has been dumped before). Signed-off-by: Kacper Nowak <kacper.nowak@intel.com> Related-To: NEO-7895	2023-04-18 20:40:25 +02:00
Dominik Dabek	411ed1c643	feat: direct submission variable timeout Add mechanism to increase direct submission timeout up to a maximum value when no new submissions were made since last sleep. This should help in workloads that have delays between iterations larger than current direct submission controller timeout. Related-To: NEO-7878 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-04-18 17:33:55 +02:00
Fabian Zwolinski	b909b03b02	Rename OpenCL Platform Name - Rename "Intel(R) OpenCL HD Graphics" -> "Intel(R) OpenCL Graphics" - Add and implement new DebugVariable - OverridePlatformName - for overriding Platform Name in OpenCL Related-To: NEO-7826 Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2023-04-17 11:09:32 +02:00
Kacper Nowak	e19e006370	feat(zebin): Add debug flag for logging ZE Info Add debug key LogZEInfo for logging ZE Info from zebin elf. ZE Info will be dumped to a file (default igdrcl.log) Related-To: NEO-7895 Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>	2023-04-14 17:14:07 +02:00
Mateusz Jablonski	e4a446df58	feature usm: add debug flag to allocate shared USM in heap extended Related-To: NEO-7665 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-04-13 11:30:09 +02:00
Konstanty Misiak	1f37e69fd2	Refactor of IO functions Related-To: NEO-4562 Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>	2023-04-13 10:46:47 +02:00
Milczarek, Slawomir	8e04a7a83f	Access counters mode to not rely on KMD cross-tile migrations (by default) Add new regkey KMDSupportForCrossTileMigrationPolicy (disabled by default, in absence of KMD suppport for cross-tile migrations) to control placement of shared allocation and memory prefetch behavior. Related-To: NEO-7885 Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>	2023-04-11 15:56:14 +02:00
Zbigniew Zdanowicz	1fcf564cc1	Enable state base address tracking Related-To: NEO-5055 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-04-07 11:22:24 +02:00
Compute-Runtime-Validation	e1af516c25	Revert "Enable state base address tracking" This reverts commit `6a08d29869`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-04-04 11:37:19 +02:00
Zbigniew Zdanowicz	a5179aae0b	[perf] add debug key and control variable to command list primary buffer Related-To: NEO-7807 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-04-04 10:58:11 +02:00
Zbigniew Zdanowicz	6a08d29869	Enable state base address tracking Related-To: NEO-5055 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-04-03 15:26:09 +02:00
Dunajski, Bartosz	78cad1e3c0	Fix debug variable data type Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-04-03 11:34:38 +02:00
Milczarek, Slawomir	50da94dc56	Add regkey to force prefetch of shared memory in cmd list execute Add the regkey ForceMemoryPrefetchForKmdMigratedSharedAllocations to force meory prefetch of kmd-migrated shared allocation in zeCommandQueueExecuteCommandLists(). Related-To: NEO-7841 Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>	2023-04-03 11:14:18 +02:00
Milczarek, Slawomir	4e6995bc4c	Set VM advise with preferred location to device by default Apply the KMD advise with preferred device location for KMD-migrated shared allocation to migrate to lmem on every GPU page fault (default KMD migration policy). Related-To: NEO-7851 Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>	2023-03-30 17:04:23 +02:00
Milczarek, Slawomir	5936734550	Add regkey to set preferred location for kmd-migrated shared allocation The regkey SetVmAdvisePreferredLocation sets the KMD VM advise with preferred location for KMD-migrated shared allocation (default - none, 1 - system, 2 - device memory). Related-To: NEO-7252 Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>	2023-03-27 12:48:46 +02:00
Dunajski, Bartosz	b3c2fa41c5	OCL: Optimize IOQ barriers handling Related-To: NEO-7458 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-03-24 12:52:47 +01:00
Zbigniew Zdanowicz	b4cce380c8	Revert "Enable state base address tracking" This reverts commit `6fb905acb2`. Resolves: HSD-18028477709 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-03-24 10:20:36 +01:00
Cencelewska, Katarzyna	1624ad911b	wa: set flag ForceDummyBlitWa to -1 to apply wa properly on mtl Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-03-22 13:32:15 +01:00
Compute-Runtime-Validation	7b5897d585	Revert "wa: set flag ForceDummyBlitWa to -1 to apply wa properly" This reverts commit `095f5a773a`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-03-22 10:42:46 +01:00
Zbigniew Zdanowicz	6fb905acb2	Enable state base address tracking Related-To: NEO-5055 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-03-21 15:53:24 +01:00
Cencelewska, Katarzyna	095f5a773a	wa: set flag ForceDummyBlitWa to -1 to apply wa properly Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-03-20 09:41:31 +01:00
Fabian Zwolinski	1e4c91fb08	Do not disable scratch pages when dbgr is enabled Related-To: NEO-7990 Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2023-03-14 15:03:18 +01:00
Compute-Runtime-Validation	e3a80f0bc1	Revert "Enable state base address tracking" This reverts commit `8b9078127f`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-03-12 07:38:57 +01:00
Zbigniew Zdanowicz	8b9078127f	Enable state base address tracking Related-To: NEO-5055 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-03-10 17:32:40 +01:00
Dominik Dabek	69a16fd3ed	feature: check indirect access for kernel Do not make indirect allocations resident if kernel does not use indirect access. For both level zero and opencl. Currently disabled by default, enable with debug flag DetectIndirectAccessInKernel Related-To: NEO-7712 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-03-08 16:58:26 +01:00
Spruit, Neil R	9aa4275fda	Check for valid stype before reading Device Properties pNext Related-To: LOCI-3884 - Added check for valid device properties stype to remove the feature specific debug vars that enabled/disabled reading of the pNext. - Requires applications to properly set the device properties stype in order for the pNext to be read for extensions. Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>	2023-03-07 18:20:10 +01:00

... 2 3 4 5 6 ...

829 Commits