compute-runtime

Commit Graph

Author	SHA1	Message	Date
Chodor, Jaroslaw	7e795cd3c1	feature: Adding support for OCL C support queries to ocloc Feature needed for automated OCL C compilation with ocloc as backend. Added queries : * CL_DEVICE_EXTENSIONS * CL_DEVICE_EXTENSIONS_WITH_VERSION * CL_DEVICE_PROFILE * CL_DEVICE_OPENCL_C_ALL_VERSIONS * CL_DEVICE_OPENCL_C_FEATURES Sample command line: ocloc query -device skl CL_DEVICE_OPENCL_C_FEATURES Related-To: GSD-7420 Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>	2023-12-18 15:19:16 +01:00
Mateusz Jablonski	fca28a930f	refactor: correct naming of enum class constants 7/n Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-18 13:05:56 +01:00
Dunajski, Bartosz	1219c48620	refactor: improve post sync handling and in-order unit tests Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-12-15 11:17:56 +01:00
Lu, Wenbin	67fa39c9a1	fix: get right page size when malloc uses 0 alignment Related-To: GSD-7103 Signed-off-by: Lu, Wenbin <wenbin.lu@intel.com>	2023-12-13 15:00:56 +01:00
Mateusz Jablonski	739d181026	refactor: correct naming of enum class constants 6/n Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-13 14:48:52 +01:00
Mateusz Jablonski	261c4d3e72	refactor: add NOLINT for hardware info padding warning Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-13 12:18:11 +01:00
Mateusz Jablonski	432142c574	refactor: correct naming of enum class constants 4/n Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-13 08:08:51 +01:00
Mateusz Jablonski	0428c0acd1	refactor: correct naming of HeapIndex enum values Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-12 11:46:17 +01:00
Dunajski, Bartosz	717bc0c03f	refactor: pass InOrderExecInfo to encode method Related-To: NEO-7966 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-12-11 16:43:30 +01:00
Mateusz Jablonski	b182917d9d	refactor: correct naming of allocation types Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-11 16:23:37 +01:00
Mateusz Jablonski	beafea9b39	refactor: correct naming of enum class constants 2/n Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-11 13:13:35 +01:00
Dunajski, Bartosz	9ce3713ace	refactor: improve Walker args handling Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-12-06 16:10:26 +01:00
Mateusz Jablonski	1bbc7a57ab	refactor: rename VFE_STATE_TYPE -> FrontEndStateCommand Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-05 14:51:01 +01:00
Mateusz Jablonski	da957d1a37	refactor: correct naming of enum class constants 1/n Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-05 14:26:42 +01:00
Mateusz Jablonski	6849d33326	refactor: remove redundant definitions KB/MB/GB Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-05 08:55:18 +01:00
Mateusz Jablonski	c48e57e2de	refactor: rename register offsets and constants Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-04 18:31:39 +01:00
Mateusz Jablonski	9486cd2a26	refactor: rename WALKER_TYPE to DefaultWalkerType Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-04 16:29:32 +01:00
Mateusz Jablonski	c3d3a4db1f	refactor: correct variable naming Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-04 13:45:53 +01:00
Mateusz Jablonski	e57d372608	refactor: correct naming of enum class constants EngineGroupType BlitDirection PostBlitMode WaitStatus Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-01 11:53:51 +01:00
Mateusz Jablonski	b92b5892b8	refactor: correct variable naming Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-01 08:34:19 +01:00
Mateusz Jablonski	c3ac7b78bd	refactor: correct variable naming Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-12-01 02:18:46 +01:00
Mateusz Jablonski	c9664e6bad	refactor: rename global debug manager to debugManager Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-11-30 13:00:59 +01:00
Mateusz Jablonski	36194c4e7d	refactor: correct variable namings Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-11-29 23:49:03 +01:00
Kamil Kopryk	982c2d665b	feature: add programming heapless mode in SBA Related-To: NEO-7621 Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>	2023-11-24 15:29:19 +01:00
Kamil Kopryk	ce7298d512	feature: Add heapless mode programming in ocl Related-To: NEO-7621 Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>	2023-11-24 12:53:39 +01:00
Baj, Tomasz	c49a9b9787	refactor: remove ReturnSubDevicesAsApiDevices from shared code Related-To: NEO-9437 Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>	2023-11-22 15:13:29 +01:00
Mateusz Jablonski	8dd80efbb1	refactor: move getting thread per eu configs to release helper Related-To: HSD-18034098647 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-11-21 09:44:32 +01:00
Baj, Tomasz	97842d8eab	fix: program PIPE_CONTROl with RT Flush and CS Stall prior to PIPE_SELECT Related-To: NEO-9194 Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>	2023-11-20 16:43:59 +01:00
Mateusz Jablonski	35c1f34672	refactor: move number of threads per eu to release helper Related-To: HSD-18034098647 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-11-20 12:16:33 +01:00
Mateusz Jablonski	a0f8e8accc	fix: initialize max capability values for ARL setup max values when overriding hw info with debug key Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-11-15 13:44:11 +01:00
Dunajski, Bartosz	30777d4d4c	feature: use indirect semaphore for 64b values Related-To: NEO-8145 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-11-09 16:58:45 +01:00
Mateusz Hoppe	f36e86f121	fix: program bindless surface state size from sba properties - surfaceStateSize is in pages, bindless size needs to be programmed in surface state units Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-11-07 17:45:54 +01:00
Kamil Kopryk	73bb139e17	refactor: add applyExtraInternalOptions function Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>	2023-11-03 09:43:37 +01:00
Michal Mrozek	060fc40938	performance: move constants out of the loop. Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>	2023-10-30 18:16:35 +01:00
Mateusz Jablonski	1e18bf0392	fix: define timestamp mask based on timer resolution Related-To: HSD-18034577551 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-10-27 11:55:20 +02:00
Compute-Runtime-Validation	fca2159430	Revert "fix: if device hierarchy is flat then getSubDevicesCount return 1u" This reverts commit `cb0bb57f49`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-10-26 15:40:29 +02:00
Baj, Tomasz	cb0bb57f49	fix: if device hierarchy is flat then getSubDevicesCount return 1u Related-To: NEO-9167 Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com>	2023-10-25 15:51:52 +02:00
Mateusz Jablonski	6d2d16d68e	fix: avoid overflow of gpu time stamp in ns Related-To: NEO-8394 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-10-24 14:56:15 +02:00
Zbigniew Zdanowicz	2e09b5ff66	refactor: reposition preamble helper implementation methods Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-10-20 13:18:29 +02:00
Mateusz Hoppe	97faeae16f	feature: heapAssigner per root device - create heapAssigner per root device in memory manager to allow per device config Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-10-19 19:52:24 +02:00
Mateusz Jablonski	4dfa12c8eb	fix: add mechanism to detect gpu timestamp overflows unify naming CpuGpu to GpuCpu Related-To: NEO-8394 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-10-19 16:31:06 +02:00
Lukasz Jobczyk	750b5ba89a	fix: flush necessary caches when dispatch pipe control Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2023-10-19 15:09:31 +02:00
Mateusz Hoppe	5a4fa180d6	feature: control bindless compilation mode based on release - check releaseHelper support when selecting bindless mode, if not disabled, prefer bindless mode in L0 API - bindless mode can be forced with DebugVariable: UseBindlessMode Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-10-19 10:11:11 +02:00
John Falkowski	f156a74f54	fix: split chunking prefetch flags Related-To: NEO-9120 Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-10-18 19:20:42 +02:00
Mateusz Jablonski	a420e34b10	fix: explicitly remove assign operators when not needed when class defines copy/move ctor then corresponding assign operator(s) should be defined or deleted Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-10-12 10:25:15 +02:00
Mateusz Jablonski	3fdcf049bf	fix: set default device hierarchy to composite for all platforms except xe hpc Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-10-12 10:09:50 +02:00
Latif, Raiyan	dee5ecfdf3	fix: ReturnSubDevicesAsApiDevices flag being ignored Proper subdevice count being returned now in GfxCoreHelper path, as previous method ignored the usage of the ReturnSubDevicesAsApiDevices flag. Related-To: LOCI-4859 Signed-off-by: Latif, Raiyan <raiyan.latif@intel.com>	2023-10-10 17:05:00 +02:00
Mateusz Jablonski	420f273a6c	fix: don't wait on condition in unit tests Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-10-10 15:14:30 +02:00
Mateusz Hoppe	c2d69e5857	feature: allocate SPECIAL_SSH heap in front window from EXTERNAL heap - SPECIAL_SSH is used for debug surface SurfaceState which must be located at bindless offset zero - limit size of external front window Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-10-09 14:54:39 +02:00
Filip Hazubski	0c8a514349	fix: Switch default device hierarchy to FLAT Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>	2023-10-06 15:24:50 +02:00
Filip Hazubski	08e92d154f	fix: Add getDefaultDeviceHierarchy call to GfxCoreHelper Added getDefaultDeviceHierarchy call that describes default device hierarchy for a gfx core. Refactored L0 and OCL paths to use this value by default and override this value when user sets ZE_FLAT_DEVICE_HIERARCHY environment variable or ReturnSubDevicesAsApiDevices debug key. Updated ReturnSubDevicesAsApiDevices to force COMPOSITE device hierarchy when set to 0. Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>	2023-10-06 12:32:41 +02:00
Mateusz Jablonski	382fc952f2	refactor: add NonAssignableClass to define classes without assign operator Related-To: NEO-9038 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-10-05 15:16:58 +02:00
Mateusz Jablonski	712ba60452	fix: add unrecoverable to avoid nullptr access Related-To: NEO-9038 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-10-05 13:20:21 +02:00
Mateusz Jablonski	6d259ac4b7	fix: add unrecoverable to avoid out of bound access Related-To: NEO-9038 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-10-05 12:03:28 +02:00
Mateusz Jablonski	ad2701ad26	fix: add unrecoverable to avoid out of bound access Related-To: NEO-9038 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-10-04 15:59:18 +02:00
John Falkowski	56f05303c9	feature: add support for zeMemGet/SetAtomicAccessAttributeExp Resolves: NEO-8219 Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-10-02 15:59:17 +02:00
Hoppe, Mateusz	5c565efe28	feature: bindless global heap with debugger - program debugSurface's SurfaceState at the beginning of Bindless Surface State Heap - SPECIAL_SSH - ensure SPECIAL_SSH is resident Related-To: NEO-7063 Signed-off-by: Hoppe, Mateusz <mateusz.hoppe@intel.com>	2023-09-29 13:13:46 +02:00
Mateusz Jablonski	a033df33ff	fix: remove preferSmallWorkgroupSizeForKernel method Related-To: HSD-18033866078 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-09-29 11:55:09 +02:00
Mateusz Jablonski	3a21b3b228	refactor: remove not needed code Related-To: NEO-7527 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-09-28 07:52:39 +02:00
Mateusz Jablonski	5f846d8a13	refactor: remove not needed code Related-To: NEO-7527 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-09-27 18:17:04 +02:00
Mateusz Jablonski	03874b8815	refactor: remove not needed code Related-To: NEO-7527 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-09-27 17:45:54 +02:00
Mateusz Jablonski	09044dfbaa	refactor: remove not needed code Related-To: NEO-7527 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-09-27 14:35:49 +02:00
Kacper Nowak	160303924d	refactor: Correct logic for SIMD1 - For calculating number of threads per workgroup, for SIMD 1, return local work size (each software thread should be mapped into a whole hardware thread). - Correct logic of calculating space for per thread data for SIMD 1. - Minor: unit tests refactor. - Corrected naming. Related-To: NEO-8261 Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>	2023-09-26 15:28:37 +02:00
Dominik Dabek	eebf2bbd26	performance(ocl): timestamp packet count per gfx Add support for different timestamp packet counts per gfx family. Change all packet counts to 1 except for xe-hpc. Related-To: NEO-8154 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-09-25 20:34:58 +02:00
Maciej Plewka	8658fdb04e	fix: Use stack vec for api specific prefix Related-To: NEO-8388, GSD-6296 Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>	2023-09-21 16:10:54 +02:00
Maciej Bielski	97e7cda912	feature: Optimize intra-module kernel ISA allocations So far, there is a separate page allocated for each kernel's ISA within `KernelImmutableData::initialize()`. Apparently the ISA blocks are often much smaller than a 64k page, which leads to poor memory utilization and was even observed to cause the device OOM error if a single module has several keys. Improve the situation by reusing the parent allocation (owned by the module instance) for modules, which kernel ISAs can fit together within a single 64k page. This improves the memory utilization on a single module level. Related-To: NEO-7788 Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>	2023-09-21 13:55:45 +02:00
Katarzyna Cencelewska	d7d46a9fc5	refactor: use initialized variable in getHighestEnabledDualSubSlice Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>	2023-09-20 14:49:56 +02:00
Mateusz Jablonski	b1808f7830	fix: correct suggested number of work groups for concurrent kernels on PVC value depends on CCS count: - single CCS mode (default) - 50% available - two CCS mode - 25% available - four CCS mode - 12.5% available Related-To: NEO-8377 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-09-20 13:40:22 +02:00
Mateusz Hoppe	69f5ca6345	feature: bindless addressing - flush state cache after reusing SS slot - when Surface State is reused for new resource, State Cache needs to be invalidated Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-09-20 12:53:32 +02:00
Dunajski, Bartosz	d3d5da1f72	feature: initial 64b in-order CmdList support Related-To: NEO-7966 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-09-19 15:23:37 +02:00
Compute-Runtime-Validation	913a926fd4	Revert "feature: Optimize intra-module kernel ISA allocations" This reverts commit `c348831470`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-09-19 14:16:05 +02:00
Maciej Bielski	c348831470	feature: Optimize intra-module kernel ISA allocations So far, there is a separate page allocated for each kernel's ISA within `KernelImmutableData::initialize()`. Apparently the ISA blocks are often much smaller than a 64k page, which leads to poor memory utilization and was even observed to cause the device OOM error if a single module has several keys. Improve the situation by reusing the parent allocation (owned by the module instance) for modules, which kernel ISAs can fit together within a single 64k page. This improves the memory utilization on a single module level. Related-To: NEO-7788 Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>	2023-09-19 12:05:09 +02:00
Compute-Runtime-Validation	73731d3be5	Revert "fix: correct suggested number of work groups for concurrent kernels o... This reverts commit `6fc673b0fe`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-09-15 04:21:58 +02:00
Mateusz Jablonski	6fc673b0fe	fix: correct suggested number of work groups for concurrent kernels on PVC value depends on CCS count: - single CCS mode (default) - no limitations - two CCS mode - 25% available - four CCS mode - 12.5% available Related-To: NEO-8377 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-09-14 16:13:54 +02:00
Mateusz Jablonski	2f7c33c1fd	refactor: move xe hpg specific appendBlitCommandsBlockCopy to xe hpg file Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-09-13 10:49:28 +02:00
Dunajski, Bartosz	7562842a58	refactor: remove LogicalStateHelper Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-09-13 10:29:53 +02:00
Compute-Runtime-Validation	413365a7bf	Revert "fix: Correct logic for SIMD1" This reverts commit `fc099ead2e`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-09-13 08:23:59 +02:00
Kacper Nowak	fc099ead2e	fix: Correct logic for SIMD1 - For calculating number of threads per workgroup, treat simd 1 as it was simd 32 - Correct logic of calculating space for per thread data for simd 1 - Minor: unit tests refactor - Corrected naming Related-To: NEO-8261 Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>	2023-09-13 07:03:12 +02:00
Mrozek, Michal	d9f938f3db	refactor: remove not needed code Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>	2023-09-12 14:25:04 +02:00
Dunajski, Bartosz	6648065703	feature: add indirect semaphore mode Related-To: NEO-8242 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-09-12 13:15:51 +02:00
Dunajski, Bartosz	2a6be2fccd	feature: update conditional bb start to use qword data Related-To: NEO-8242 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-09-12 11:24:28 +02:00
Dunajski, Bartosz	def3f2e9ad	refactor: improve semaphore programming Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-09-12 11:24:11 +02:00
Filip Hazubski	d7db6ac467	feature: Add preferredPlatformName field to RuntimeCapabilityTable For all of the devices, preferredPlatformName is initialized with nullptr by default and platform name will be initialized to driver's default platform name, at the moment this is "Intel(R) OpenCL Graphics". When Platform is initialized and preferredPlatformName is not nullptr then Platform name will be set to the value stored in preferredPlatformName. Add ENABLE_LEGACY_PLATFORM_NAME AIL enum related to added functionality. Move PlatformInfo to NEO namespace. Related-To: HSD-22018809561 Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>	2023-09-12 11:07:14 +02:00
Mateusz Jablonski	c851896482	refactor: move XeHpg specific setExtraAllocationData definition to Xe Hpg file Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-09-11 13:32:00 +02:00
Compute-Runtime-Validation	1579c69316	Revert "performance: allocate timestamp packet tag buffer in local mem on DG2" This reverts commit `819908ec94`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-09-11 12:07:26 +02:00
Fabian Zwolinski	b2ba1fbecf	fix: enforce zebin format by default in Ocloc for ICL/TGL/RKL/ADL Products for which zebin has been set as default format in OCLOC: - ICELAKE_LP - TIGERLAKE_LP - ROCKETLAKE - ALDERLAKE_S - ALDERLAKE_P - ALDERLAKE_N The default format does not override `--format` parameter. Related-To: NEO-8334 Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2023-09-08 16:16:18 +02:00
Fabian Zwolinski	10675134e1	feature: Add process safety to Windows compiler cache Related-To: NEO-8092 Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com> Co-authored-by: Diedrich, Kamil <kamil.diedrich@intel.com>	2023-09-06 15:34:15 +02:00
Maciej Plewka	3b3e17e738	performance: Use vector for private allocs to reuse Related-To: HSD-18033105655, HSD-18033153203 Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>	2023-09-04 13:34:38 +02:00
Mateusz Jablonski	91b26277a4	feature: add method to adjust hw info for igc Related-To: NEO-8203 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-09-04 11:58:13 +02:00
Jitendra Sharma	9818ef61a5	feature: Report correct GRF register count Based on Large GRF enabled or not, report correct GRF register. Related-To: NEO-6788 Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>	2023-09-04 11:42:48 +02:00
Compute-Runtime-Validation	154530ad23	Revert "feature: Report correct GRF register count" This reverts commit `8eb3fe222e`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-09-01 15:12:57 +02:00
Jitendra Sharma	8eb3fe222e	feature: Report correct GRF register count Based on Large GRF enabled or not, report correct GRF register. Related-To: NEO-6788 Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>	2023-08-31 18:48:29 +02:00
Slawomir Milczarek	5833d65bc0	feature: Add support for memadvise with system preferred location Related-To: NEO-7851 Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>	2023-08-31 13:09:55 +02:00
Mateusz Jablonski	819908ec94	performance: allocate timestamp packet tag buffer in local mem on DG2 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-08-30 12:02:04 +02:00
Mateusz Hoppe	00de13939d	feature: bindless addressing allocator - reuse of bindless slots - introduce 2 reuse pools to bindlessHeapHelper - one pool stores slots for reuse, second pool stores released slots - stateCacheDirty flags keep track of state cache - when pools are switched - flags are set indicating flushing caches is needed after old slots have been reused for new allocations Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-08-29 09:51:36 +02:00
Mateusz Jablonski	dbbc57777a	refactor: move MTL device ids to common place Related-To: NEO-8187 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-08-25 00:12:24 +02:00
Mateusz Jablonski	db52486bbf	refactor: move MTL specific definition of ProductHelper to Xe Lpg Related-To: NEO-8187 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-08-24 10:50:26 +02:00
John Falkowski	d49190f4ae	feature: Add debug/release variables prefixes Add debug/release variables with prefixes for Level Zero, OpenCL and NEO Resolves: NEO-6357 Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-08-22 15:15:45 +02:00
Mateusz Hoppe	8435160db4	feature: bindless addressing for images - program surface states for redescribed images correctly. Image copy to/from memory are using redescribed surface states, - refactor state base address programming - program address and size together, set max size at the beginning due to lack of Enable flag - set GpuBase in WddmAllocation when external heap is used - return max ssh required size from kernelInfo or based on stateful args Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-08-18 15:59:20 +02:00
Artur Harasimiuk	f6e0c0cf89	Revert "feature: Add debug/release variable prefixes" This reverts commit `ec95d9314a`. Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>	2023-08-18 12:42:39 +02:00
John Falkowski	ec95d9314a	feature: Add debug/release variable prefixes Resolves: NEO-6357 Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-08-10 14:01:09 +02:00
Fabian Zwolinski	6fca8ee195	refactor: Remove SourceLevelDebugger Removed: - SourceLevelDebugger (with tests) - DebuggerLibrary - DebuggerLibraryRestore - debuggerSupported field from hwInfo.capabilityTable - HasSourceLevelDebuggerSupport matcher - ExperimentalEnableSourceLevelDebugger debug var - EnableMockSourceLevelDebugger debug var - DebuggerOptDisable debug var - lib_names.h.in file - third_party/source_level_debugger/igfx_debug_interchange_types.h Related-To: NEO-7213 Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2023-08-10 11:14:02 +02:00
Mateusz Hoppe	bcba74f839	feature: bindless global allocator with scratch - allocate SSH in cmdContainer when scratch allocation used with private heaps - scratch SurfaceStates are addressed relative to SurfaceStateBaseAddress and have to be placed on SSH - remove not used SCRATCH_SSH heap type from bindelssHeapHelper Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-08-03 13:24:59 +02:00
Mateusz Jablonski	659f7b645f	refactor: move methods from CompilerProductHelper to ReleaseHelper Related-To: NEO-7786 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-08-02 10:40:48 +02:00
Mateusz Jablonski	02aa4b6acc	refactor: unify isMatrixMultiplyAccumulateSupported definition Related-To: NEO-7786 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-08-01 14:39:53 +02:00
Zbigniew Zdanowicz	22ab72ed3d	performance: do not flush caches when writing global timestamp Related-To: HSD-18032167035 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-08-01 10:46:18 +02:00
Mateusz Jablonski	1e95ed33f9	refactor: remove redundant parameter from HardwareInfo ctor hw ip version is queried from KMD or set based on device id and rev id Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-07-31 16:12:57 +02:00
Mateusz Jablonski	27e459dfd0	fix: add missing cache flushes on MTL and later integrated GPUs hdc pipeline / untyped dataport cache flushes were applied only on discrete GPUs Related-To: GSD-5085 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-07-28 19:30:13 +02:00
Mateusz Jablonski	f84f22d23c	refactor: remove unused isSharedContext variable Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-07-26 12:14:23 +02:00
Mateusz Hoppe	67d39f88e6	feature: bindless addressing - store bindlessInfo in allocation - store surface state info for bindless addressing in graphics allocation - remove map in BindlessHeapsHelper - bindlessInfo is constant for the lifetime of an allocation - program bindless offsets and surface states for images when used in bindless kernel - handle ouf of memory on surface state heap - return error Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-07-24 14:48:35 +02:00
Kacper Nowak	b908203001	fix: Compile built-ins per release - Preserve releases on CMake level. - Instead of generating builtins per platform, generate them per-release (+ correct naming accordingly). - Stop using revisions in builtin compilation logic path, as they are already embedded in release (device ip). - Remove platform names & revisions from names for generated files (related to builtins). - Remove unnecessary code, refactor ULT logic. Related-To: NEO-7783 Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>	2023-07-11 16:02:36 +02:00
Andrzej Ratajewski	dc0796c2a1	feature: Add cl_khr_spirv_linkonce_odr to supported extensions Related-To: NEO-8165 Signed-off-by: Andrzej Ratajewski <andrzej.ratajewski@intel.com>	2023-07-11 13:19:55 +02:00
Young Jin Yoon	81822e3716	refactor: rename pageSize2Mb to pageSize2M The previous name "pageSize2Mb" defined in shared/source/helpers/constant.h is inconsistent to other variable, i.e. pageSize64k. Furthermore, it's a bit misleading because the page size is defined in Megabytes (MB), not in Megabits (Mb). Related-to: NEO-7695 Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>	2023-07-10 20:12:09 +02:00
Cencelewska, Katarzyna	aa0beb8191	fix: Unify logic calculating threads per work group part 4 - also use helper when checking that is simd1 to have same flow Related-To: NEO-8087 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-07-07 15:34:59 +02:00
Mateusz Hoppe	4aba0f0340	feature: global bindless surface state base support - program global bindless ssba when external allocator used ( UseExternalAllocatorForSshAndDsh) Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-07-06 18:31:49 +02:00
Dominik Dabek	f1f0f2410c	refactor: remove unused getBaseTimeout Related-To: GSD-3612 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-07-05 16:12:15 +02:00
Mateusz Hoppe	0cc8330a1a	refactor: Unify bindless surface state base programming - program BSSBA in one place when sbaProperties used Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-07-04 16:34:43 +02:00
Cencelewska, Katarzyna	61f701aba5	fix: Unify logic calculating threads per work group part 3 Related-To: NEO-8087 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-07-04 15:27:44 +02:00
Cencelewska, Katarzyna	2e17c21728	fix: Unify logic calculating threads per work group part 2 - use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to have same flow and proper values of threads per work groups Related-To: NEO-8087 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-07-04 10:34:02 +02:00
Compute-Runtime-Validation	39740da9d1	Revert "fix: Unify logic calculating threads per work group part 2" This reverts commit `1e8a53bd53`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-07-02 07:09:14 +02:00
Cencelewska, Katarzyna	1e8a53bd53	fix: Unify logic calculating threads per work group part 2 - use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to have same flow and proper values of threads per work groups Related-To: NEO-8087 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-06-30 14:16:08 +02:00
Zbigniew Zdanowicz	21823af419	performance: add skeleton method to cmdlist immediate flush task Related-To: NEO-7808 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-06-30 10:46:20 +02:00
Dominik Dabek	10ac167bdc	fix(ocl): do not multiply kmdNotify waitTimeout With direct submission disabled this resulted in waitTimeout long enough that kmdWait fallback was rarely used. This caused more CPU spin time. Related-To: GSD-3612 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-06-29 16:06:28 +02:00
Cencelewska, Katarzyna	0d7aefe66b	fix: Unify logic calculating threads per work group part 1 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-06-29 10:43:22 +02:00
Dunajski, Bartosz	ecb415bf62	feature: reenable RelaxedOrdering Related-To: NEO-7458 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-06-28 12:20:17 +02:00
Cencelewska, Katarzyna	68d81c82a7	fix: Use proper value about hw local id generations - remove useless flag ForceNumberOfThreadsInGpgpuThreadGroup - add new flag "RemoveRestrictionsOnNumberOfThreadsInGpgpuThreadGroup" to restore old path without restrictions about number of threads in thread group - fix forwarding information about hw local ids generations to calculate numOfThreadsInThreadGroup correctly Related-To: NEO-7952, NEO-7982 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-06-26 16:35:42 +02:00
Dunajski, Bartosz	2b5e475db9	refactor: use hex values to print TSP usage Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-06-23 11:22:10 +02:00
Dunajski, Bartosz	b004a27e4e	refactor: Debug flag to print TSP usage Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-06-22 14:47:39 +02:00
Lukasz Jobczyk	0bc5eead84	fix: Remove not needed BCS split helper Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2023-06-22 14:36:22 +02:00
Fabian Zwolinski	99d0823e8f	fix: Append extra extensions when FP64 emulation is enabled Related-To: NEO-7611 Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2023-06-22 08:38:53 +02:00
Dunajski, Bartosz	46e8c3f5dd	fix: reenable RelaxedOrdering for OCL Related-To: NEO-7458 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-06-20 13:05:25 +02:00
Cencelewska, Katarzyna	7cb3278eb3	fix: add function to calculate number of threads per tg Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-06-13 14:02:24 +02:00
Dunajski, Bartosz	3d49658f50	feature: new multitile post sync layout for immediate write [2/n] No functional changes in this commit. This is prework. Related-To: NEO-7966 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-06-09 14:20:34 +02:00
Jaime Arteaga	2efd6e547a	feature: Add support for chunking in the UMD (1/N) Read if support for chunking is available in the KMD. If available, KMD will create a BO with 1 or more chunks, depending on the chunk size selected. Related-To: NEO-7695 Sync to https://github.com/intel-gpu/drm-uapi-helper/releases/tag/v2.0-rc18 Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com> Signed-off-by: John Falkowski <john.falkowski@intel.com>	2023-06-02 23:27:40 +02:00
Daria Hinz	68b6cfebd2	feature: Add default ip version values to helper Signed-off-by: Daria Hinz <daria.hinz@intel.com> Related-To: NEO-8021	2023-06-02 12:10:21 +02:00
Cencelewska, Katarzyna	d2436a8231	fix: add limitations for setting gmm flag Cacheable - move isCachingOnCpuAvailable to product helper - isCachingOnCpuAvailable should return false on mtl - if wsl, skip checking method from product helper Related-To: NEO-7194 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-05-30 17:04:57 +02:00
Mateusz Jablonski	61055478d4	fix: adjust scope of disable L3 for debug WA Related-To: HSD-1609398399 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-05-30 14:23:16 +02:00
Dunajski, Bartosz	5aeffbf673	refactor: define initial value for TimestampPacket Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-05-30 12:09:05 +02:00
Mateusz Hoppe	1c196b9f3d	refactor: change ApiSpecificConfig functions names - better description of the meaning of functions Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-05-30 09:20:01 +02:00
Mateusz Jablonski	c544004b8e	fix: move getProductConfigFromHwInfo to CompilerProductHelper add tests for default PVC configs Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-05-29 14:08:52 +02:00
Dunajski, Bartosz	814de81aca	feature: replace in-order sync allocation with TimestampPacket Related-To: NEO-7966 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-05-29 09:32:35 +02:00
Daria Hinz	331f167cfe	feature: Add debug flag for setting hw ip version Signed-off-by: Daria Hinz <daria.hinz@intel.com> Related-To: NEO-7954	2023-05-23 15:32:46 +02:00
Filip Hazubski	d234bc970d	refactor: Move getMaxNumSamplers function to ProductHelper Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>	2023-05-18 09:25:07 +02:00
Cencelewska, Katarzyna	5f22e9eaca	fix: don't set Cacheable on xe_hp and later Related-To: NEO-7194 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-05-18 09:17:32 +02:00
Lukasz Jobczyk	9f755b54ad	performance: Bidirectional BCS split implementation for OCL Related-To: NEO-7877 Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2023-05-16 13:34:18 +02:00
Milczarek, Slawomir	66eb1c9c0a	refactor: Add helpers to control kmd migration support on PVC platform This commit keeps KMD migration still disabled by default on PVC platform. Related-To: NEO-6465 Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>	2023-05-15 13:51:19 +02:00
Daria Hinz	1d64b94f30	feature: Match config based on device & rev ID This change matches the appropriate aot config for the combination of device ID and revision ID. Signed-off-by: Daria Hinz <daria.hinz@intel.com> Related-To: NEO-7905	2023-05-10 08:22:09 +02:00
Dunajski, Bartosz	cfacbbd811	refactor: Simplify OverrideBlitterMocs usage Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-05-09 19:22:57 +02:00
Warchulski, Jaroslaw	7fdf4985a3	feature: add support for cl_khr_external_memory extension Related-To: NEO-7069 Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>	2023-05-05 15:51:39 +02:00
Fabian Zwolinski	cbce863dc2	refactor: Rename member variables to camelCase 3/n Additionally enable clang-tidy check for member variables Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2023-04-28 16:01:14 +02:00
Kamil Kopryk	171e97e71a	feature: introduce release helper Related-To: NEO-7786 Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>	2023-04-27 21:38:08 +02:00
Fabian Zwolinski	e351a90f81	refactor: Rename member variables to camelCase 2/n Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2023-04-27 20:39:22 +02:00
Mateusz Jablonski	2f9135a4e6	fix: change type of container with registered engines per root device use StackVec instead of unordered map resize container at MemoryManager's creation time Related-To: NEO-7925 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-04-27 17:06:42 +02:00
Mateusz Jablonski	32d8a3bc6d	fix: store registered engines per root device in most cases we need to iterate over engines associated to single root device Related-To: NEO-7925 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-04-27 10:54:07 +02:00
Fabian Zwolinski	2022592f3d	Apply CamelCase for class and struct names 2/2 Additionally change .clang-tidy not to ignore struct names. Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2023-04-25 13:10:23 +02:00
Milczarek, Slawomir	bf778be99e	[fix] zeCommandListAppendMemAdvise to clear preferred location correctly The memadvise with preferred location for kmd-migrated shared allocation is set to device associated with cmd list by default to migrate data to lmem on non-atomic gpu page fault as well (for performance reasons). Related-To: NEO-7252 Signed-off-by: Milczarek, Slawomir <slawomir.milczarek@intel.com>	2023-04-24 14:51:49 +02:00
Andrzej Koska	b22f3f0796	Fix for handling incorrect patchSize in patchWithRequiredSize This fix adds additional size control to patched memory Related-To: NEO-7760 Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>	2023-04-20 16:07:45 +02:00
Daria Hinz	2647efc23c	feat: Add support for ip version value in ocloc Ocloc supports passing hw ip version value to -device arg in the form of major.minor.revision. This change adds support for directly passed value as uint32_t as well. Support added for single and fat binary. Signed-off-by: Daria Hinz <daria.hinz@intel.com> Related-To: NEO-7903	2023-04-20 13:20:11 +02:00
Mateusz Jablonski	688d958fe8	fix: handle single storage USM in gtpin helpers use GPU address from gpu allocation instead of CPU allocation check page fault manager presence before migrating to GPU domain Related-To: NEO-7690 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-04-19 14:26:25 +02:00
Mateusz Hoppe	97a8f15fbd	fix: cleanup memory leaks in ocloc - enable memory leaks detection in ocloc tests Related-To: NEO-7467 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-04-19 13:48:49 +02:00
Kacper Nowak	c7adbc2140	Add debug key for dumping ELF to file Add "DumpZEBin" debug flag. When this flag is enabled, Zebin will be dumped to a .elf file (with appropiate suffix, in case such file has been dumped before). Signed-off-by: Kacper Nowak <kacper.nowak@intel.com> Related-To: NEO-7895	2023-04-18 20:40:25 +02:00
Maciej Plewka	730578112f	Report cl_khr_integer_dot_product extension Related-To: NEO-6206 With this commit OpenCL will report cl_khr_integer_dot_product extension in version 2. With all properties enabled. Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>	2023-04-14 14:46:18 +02:00
Konstanty Misiak	1f37e69fd2	Refactor of IO functions Related-To: NEO-4562 Signed-off-by: Konstanty Misiak <konstanty.misiak@intel.com>	2023-04-13 10:46:47 +02:00
Maciej Plewka	095108edd8	Abort when blit resolve operation dispatched for different tilings Related-To: NEO-7872, NEO-7873, NEO-7874, NEO-7875 Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>	2023-04-12 13:17:00 +02:00
Compute-Runtime-Validation	6357ab73ea	Revert "Report cl_khr_integer_dot_product extension" This reverts commit `31983ec043`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-04-06 14:38:45 +02:00
Maciej Plewka	31983ec043	Report cl_khr_integer_dot_product extension Related-To: NEO-6206 With this commir OpenCL will report cl_khr_integer_dot_product extension in version 2. With all properties enabled. Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>	2023-04-05 20:28:00 +02:00
Zbigniew Zdanowicz	4c7bc2ca98	[feature, perf] add alogrithm to chain command buffers in container This feature is part of performance improvement to dispatch and start command buffers as primary batch buffers. When exhausted command buffer is closed, then reserve exact space for chained batch buffer start and bind it to the next command buffer. When closing command buffer, then save ending pointer and reserve aligned space. Related-To: NEO-7807 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-04-05 15:49:01 +02:00
Compute-Runtime-Validation	c927c46495	Revert "Report cl_khr_integer_dot_product extension" This reverts commit `dbd9b19b35`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-04-05 04:16:29 +02:00
Maciej Plewka	dbd9b19b35	Report cl_khr_integer_dot_product extension Releated-to: NEO-6206 With this commir OpenCL will report cl_khr_integer_dot_product extension in version 2. With all properties enabled. Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>	2023-04-04 11:15:38 +02:00
Mateusz Jablonski	ab265bca93	fix blitter programming for media compression on xe hpg platforms - when destination surface is media compressed then disable compression bit - rename command field CompressionType->ControlSurfaceType - program this field only on Xe Hpg platforms Related-To: NEO-7415 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-03-31 17:15:28 +02:00
Rafal Maziejuk	b9828b543e	feature: adjust maxWorkGroupSize value Related-To: NEO-7357 Signed-off-by: Rafal Maziejuk <rafal.maziejuk@intel.com>	2023-03-28 15:19:52 +02:00
Zbigniew Zdanowicz	6437c1a91e	Flush state caches after command list is destroyed When state base address tracking is enabled and command list use private heaps then command list at destroy time must calls all compute CSRs that were using that heap to invalidate state caches. This allows new command list to reuse the same heap allocation for different surface states, so before new use cached states are invalidated. Related-To: NEO-5055 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-03-28 14:52:30 +02:00
Mateusz Jablonski	5d482ffe87	refactor: add constexpr for tera byte Related-To: NEO-7665 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-03-28 12:55:52 +02:00
Dunajski, Bartosz	b3c2fa41c5	OCL: Optimize IOQ barriers handling Related-To: NEO-7458 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-03-24 12:52:47 +01:00
Zbigniew Zdanowicz	179429b25c	Add stateless arguments checker for single kernel Related-To: NEO-5055 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-03-23 10:55:15 +01:00
Mateusz Jablonski	5610eae710	refactor: fix typo Barrierl -> Barrier Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-03-21 15:58:24 +01:00
Cencelewska, Katarzyna	a4a296d59f	wa: enable wa to add additional dummy blits after blit copy - reduce number of dummy blits where are not needed - track if dummy blit required in cmdlist Related-To: NEO-7450 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-03-17 10:43:00 +01:00
Filip Hazubski	0bee81c0c0	refactor: Move isLinearStoragePreferred function from gfx to product helper Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>	2023-03-15 18:51:59 +01:00
Mateusz Jablonski	e3c5cbc22f	fix ocl/ocloc: expose cl_ext_float_atomics for OCL 2.0+ Related-To: NEO-7800 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-03-15 14:48:17 +01:00
Mateusz Jablonski	db719b2acb	refactor: rename function getExtensions->getDeviceExtensions Related-To: NEO-7800 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-03-15 12:46:46 +01:00
Mateusz Jablonski	028e1bb1b6	fix ocloc: define extensions in one place unify exposed device extensions between OCL and ocloc Related-To: NEO-7800 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-03-15 11:14:46 +01:00
Dunajski, Bartosz	60741b05aa	Add missing struct member initialization Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-03-15 10:09:05 +01:00
Zbigniew Zdanowicz	b6d0a64638	Revert: "compute events select device memory for xe hpg family" This reverts commit `072233d170`. Resolves: HSD-18028222329 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-03-14 17:18:59 +01:00
Mateusz Jablonski	faba16f657	refactor: move extensions string creation to CompilerProductHelper Related-To: NEO-7800 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-03-14 16:27:21 +01:00
Dunajski, Bartosz	477448c097	RelaxedOrdering dispatch for OCL 2/n Related-To: NEO-7458 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-03-14 15:48:06 +01:00
Mateusz Jablonski	39eab81c83	refactor: add separate helper for extension support cl_intel_subgroup_split_matrix_multiply_accumulate Related-To: NEO-7800 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-03-14 15:08:29 +01:00
Mateusz Jablonski	340f932ca2	refactor: move GfxCoreHelper::getExtensions to CompilerProductHelper Related-To: NEO-7800 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-03-14 13:56:19 +01:00
Mateusz Jablonski	483f9fe340	refactor: move extensions support logic to compiler product helper - cl_intel_subgroup_local_block_io - cl_intel_dot_accumulate - cl_intel_create_buffer_with_properties - cl_khr_subgroup_named_barrier - cl_intel_subgroup_extended_block_read Related-To: NEO-7800 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-03-14 12:09:26 +01:00
Dunajski, Bartosz	97a35debde	RelaxedOrdering dispatch for OCL 1/n Related-To: NEO-7458 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-03-13 17:37:12 +01:00
Zhenjie Pan	00b675643e	fix: missed error handler of SubmissionStatus::FAILED Related-To: NEO-7802 Signed-off-by: Pan Zhenjie <zhenjie.pan@intel.com>	2023-03-13 16:08:27 +01:00
Raiyan Latif	d5c909c9f9	Fix calculation of number of Ray-Tracing stacks MaxDualSubSlicesSupported is filled inside GT_SYSTEM_INFO structure when querying the KMD appropriately with the number of enabled DualSubSlices. However we need to find the highest index of the last enabled DualSubSlice. For proper allocation of thread scratch space, allocation has to be done based on native die config (including unfused or non-enabled DualSubSlices). Since HW doesn't provide us a way to know the exact native die config, in SW we need to allocate RT stacks with enough size based on the last used DualSubSlice. The IsDynamicallyPopulated field in GT_SYSTEM_INFO is used to indicate if system details are populated either via Fuse reg. or hard-coded. Based on this field's value, we calcuate the numRtStacks appropriately. Related-To: LOCI-3954 Signed-off-by: Raiyan Latif <raiyan.latif@intel.com>	2023-03-13 10:48:10 +01:00
Kamil Kopryk	5a99635232	Don't check addressing mode for gtpin surface If gtpin is used, then don't check addressing mode of the last explicit arg, which is gtpin's surface. Related-To: NEO-6075 Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>	2023-03-10 16:01:24 +01:00
Cencelewska, Katarzyna	398c7b2d29	refactor, remove typo in struct name change name of EncodeSempahore to EncodeSemaphore Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-03-10 15:44:25 +01:00
Mateusz Jablonski	89cd4bd4a5	refactor: move getters for device extension support to compiler product helper isMatrixMultiplyAccumulateSupported isBFloat16ConversionSupported Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-03-10 15:41:26 +01:00
Kamil Kopryk	fa8579602f	refactor: rename product helper files n/n Related-To: NEO-7703 Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>	2023-03-10 13:24:38 +01:00
Zbigniew Zdanowicz	0950f5a23e	Set global heap size to constant value Related-To: NEO-5055 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-03-09 17:17:32 +01:00
Mateusz Jablonski	5080250a64	refactor: remove include from enable_compiler_product_helper files Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-03-09 11:21:37 +01:00
Cencelewska, Katarzyna	c274309d7b	wa: add dummy blits before command MI_FLUSH_DW to guarantee that all subblt got complete for previous copy affect xe hpg temporary changes under flag ForceDummyBlitWa Related-To: NEO-7450 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-03-09 10:40:35 +01:00
Mateusz Hoppe	37dbec305d	feature: add AssertHandler - initial implementation to support assert() on GPU Related-To: NEO-5753 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-03-08 17:55:23 +01:00
Dominik Dabek	69a16fd3ed	feature: check indirect access for kernel Do not make indirect allocations resident if kernel does not use indirect access. For both level zero and opencl. Currently disabled by default, enable with debug flag DetectIndirectAccessInKernel Related-To: NEO-7712 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-03-08 16:58:26 +01:00

... 2 3 4 5 6 ...

1286 Commits