compute-runtime

Commit Graph

Author	SHA1	Message	Date
Mateusz Hoppe	1c37da280c	fix: fix bindless offset patching for images - usingSurfaceStateHeap indicates if any of the args is using local ssh in bindless kernels: without global allocator - ssh is used for all args with global bindless allocator - ssh used only for buffer with offset set in surface state, otherwise not used When any of the args is using ssh - getSurfaceStateHeapDataSize() returns non-zero size. Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-11-07 11:39:49 +01:00
Mateusz Hoppe	02b6b3bbaa	feature: enable illegal opcode exception Related-To: NEO-9088 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-11-06 16:09:29 +01:00
Dominik Dabek	75c4844987	feature(internal): logging kernel dispatch params Use debug flag PrintKernelDispatchParameters to print params used in thread group dispatch size heuristic when encoding kernel dispatch. Related-To: NEO-6989 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-10-17 17:31:54 +02:00
Zbigniew Zdanowicz	ec9fa23b2e	refactor: change order of fields of EncodeDispatchKernelArgs structure Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-10-04 15:00:29 +02:00
Naklicki, Mateusz	0461af492d	fix: unify path for getting number of grfs per thread Related-To: NEO-8043 Signed-off-by: Naklicki, Mateusz <mateusz.naklicki@intel.com>	2023-10-03 08:17:46 +02:00
Mateusz Jablonski	5f846d8a13	refactor: remove not needed code Related-To: NEO-7527 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-09-27 18:17:04 +02:00
Mateusz Jablonski	5dc56c221f	refactor: remove not needed function Related-To: NEO-7527 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-09-27 14:44:56 +02:00
Dunajski, Bartosz	480c058cb2	feature: in-order patching for ComputeWalker Related-To: NEO-7966 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-09-22 15:00:44 +02:00
Maciej Bielski	97e7cda912	feature: Optimize intra-module kernel ISA allocations So far, there is a separate page allocated for each kernel's ISA within `KernelImmutableData::initialize()`. Apparently the ISA blocks are often much smaller than a 64k page, which leads to poor memory utilization and was even observed to cause the device OOM error if a single module has several keys. Improve the situation by reusing the parent allocation (owned by the module instance) for modules, which kernel ISAs can fit together within a single 64k page. This improves the memory utilization on a single module level. Related-To: NEO-7788 Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>	2023-09-21 13:55:45 +02:00
Compute-Runtime-Validation	913a926fd4	Revert "feature: Optimize intra-module kernel ISA allocations" This reverts commit `c348831470`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-09-19 14:16:05 +02:00
Maciej Bielski	c348831470	feature: Optimize intra-module kernel ISA allocations So far, there is a separate page allocated for each kernel's ISA within `KernelImmutableData::initialize()`. Apparently the ISA blocks are often much smaller than a 64k page, which leads to poor memory utilization and was even observed to cause the device OOM error if a single module has several keys. Improve the situation by reusing the parent allocation (owned by the module instance) for modules, which kernel ISAs can fit together within a single 64k page. This improves the memory utilization on a single module level. Related-To: NEO-7788 Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>	2023-09-19 12:05:09 +02:00
Dunajski, Bartosz	7562842a58	refactor: remove LogicalStateHelper Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-09-13 10:29:53 +02:00
Dunajski, Bartosz	6648065703	feature: add indirect semaphore mode Related-To: NEO-8242 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-09-12 13:15:51 +02:00
Dunajski, Bartosz	def3f2e9ad	refactor: improve semaphore programming Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-09-12 11:24:11 +02:00
Mateusz Hoppe	93469eaf5d	feature: bindless addressing for buffers with offset - allocate SurfaceStates on kernel's heap for offsetted buffers Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-09-08 12:03:23 +02:00
Fabian Zwolinski	6fca8ee195	refactor: Remove SourceLevelDebugger Removed: - SourceLevelDebugger (with tests) - DebuggerLibrary - DebuggerLibraryRestore - debuggerSupported field from hwInfo.capabilityTable - HasSourceLevelDebuggerSupport matcher - ExperimentalEnableSourceLevelDebugger debug var - EnableMockSourceLevelDebugger debug var - DebuggerOptDisable debug var - lib_names.h.in file - third_party/source_level_debugger/igfx_debug_interchange_types.h Related-To: NEO-7213 Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2023-08-10 11:14:02 +02:00
Cencelewska, Katarzyna	aa0beb8191	fix: Unify logic calculating threads per work group part 4 - also use helper when checking that is simd1 to have same flow Related-To: NEO-8087 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-07-07 15:34:59 +02:00
Mateusz Hoppe	4aba0f0340	feature: global bindless surface state base support - program global bindless ssba when external allocator used ( UseExternalAllocatorForSshAndDsh) Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-07-06 18:31:49 +02:00
Cencelewska, Katarzyna	61f701aba5	fix: Unify logic calculating threads per work group part 3 Related-To: NEO-8087 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-07-04 15:27:44 +02:00
Cencelewska, Katarzyna	2e17c21728	fix: Unify logic calculating threads per work group part 2 - use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to have same flow and proper values of threads per work groups Related-To: NEO-8087 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-07-04 10:34:02 +02:00
Compute-Runtime-Validation	39740da9d1	Revert "fix: Unify logic calculating threads per work group part 2" This reverts commit `1e8a53bd53`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-07-02 07:09:14 +02:00
Cencelewska, Katarzyna	1e8a53bd53	fix: Unify logic calculating threads per work group part 2 - use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to have same flow and proper values of threads per work groups Related-To: NEO-8087 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-06-30 14:16:08 +02:00
Mateusz Jablonski	2d01bdec81	fix: change denorm mode in IDD to FlushToZero denorm support is controlled by IGC, we should just set zero by default Related-To: NEO-8059 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-06-23 09:28:32 +02:00
Dunajski, Bartosz	6544a1defa	feature: adjust unit tests for future dynamic post sync allocation testing Related-To: NEO-7966 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-06-20 16:22:33 +02:00
Mateusz Hoppe	313fb84fda	feature: bindless addressing mode support - allow bindless kernels to execute - bindless addressing kernels are using private heaps mode - do not differentiate bindful and bindless surface state base addresses Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-06-19 12:41:03 +02:00
Compute-Runtime-Validation	995e2a79c6	Revert "fix: change denorm mode in IDD to FlushToZero" This reverts commit `987394b27c`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2023-06-15 11:49:01 +02:00
Mateusz Jablonski	987394b27c	fix: change denorm mode in IDD to FlushToZero denorm support is controlled by IGC, we should just set zero by default Related-To: NEO-8059 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-06-13 13:42:50 +02:00
Mateusz Hoppe	8bc1fb1251	refactor: add function checking bindless addressing - simplify logic to check addressing mode of a kernel Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-06-12 14:42:18 +02:00
Kamil Kopryk	6a0f7afd64	feature: verify stateful information only when binary is generated by IGC Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com> Related-To: NEO-6075 Ngen binaries contain stateful information, however they are not used in isa on Pvc. Therefore, we can just ignore them.	2023-06-12 11:45:41 +02:00
Dunajski, Bartosz	3d49658f50	feature: new multitile post sync layout for immediate write [2/n] No functional changes in this commit. This is prework. Related-To: NEO-7966 Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-06-09 14:20:34 +02:00
Mateusz Hoppe	0844ca0ac8	refactor: cleanup getBindlessMode() usage - getGlobalBindlessHeapConfiguration() should be used to choose global alloctor for SSH - remove not needed and incorrect unit tests - remove not needed branches - bindless mode controls bindless compilation only Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-06-06 17:23:13 +02:00
Mateusz Hoppe	19bb1e334c	feature: enable SW exceptions for kernels with assert and debugging - when debugging is enabled, assert() in gpu kernel will trigger SW exception Related-To: NEO-5753 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-06-01 15:31:36 +02:00
Mateusz Hoppe	1c196b9f3d	refactor: change ApiSpecificConfig functions names - better description of the meaning of functions Related-To: NEO-7063 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2023-05-30 09:20:01 +02:00
Kamil Kopryk	e0d3db3d91	fix: improve release helper Related-To: NEO-7786 Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>	2023-05-15 14:30:15 +02:00
Dunajski, Bartosz	feff1c35cc	feature: Experimental support of immediate cmd list in-order execution [5/n] Related-To: LOCI-4332 - Signal non-timestamp Walkers with in-order CL value - Event host synchronization based on CL signal value Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>	2023-05-09 11:46:14 +02:00
Fabian Zwolinski	cbce863dc2	refactor: Rename member variables to camelCase 3/n Additionally enable clang-tidy check for member variables Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>	2023-04-28 16:01:14 +02:00
Dominik Dabek	c84c7a0c91	performance: adjust thread group dispatch size adjust thread group dispatch size on pvc if chosen size does not evenly divide dimension this is to avoid leftover thread groups Related-To: NEO-7927 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2023-04-27 18:24:53 +02:00
Zbigniew Zdanowicz	a114448792	[feat, perf] Indicate implicit scaling is dispatched from primary batch buffer This change is part of performance feature to start command list batch buffers as primary. Implicit Scaling sometimes require to jump over control section and these jumps must maintain the same level of batch buffer as the whole command list. Related-To: NEO-7807 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-04-11 12:39:25 +02:00
Mateusz Jablonski	31f32cc16e	fix implicit args: generate local ids as for grf size 32 Related-To: IGC-6936 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2023-04-07 11:37:07 +02:00
Kacper Nowak	f1c64adb3c	fix(ocl): Fix potential mem leak + simplify code - Fix potential memleak in case ASSERT returns false and test gets aborted - Remove not needed function argument Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>	2023-03-27 13:31:42 +02:00
Zbigniew Zdanowicz	38e50007f7	[perf] simplify memory layout of command container class Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-03-23 13:31:47 +01:00
Zbigniew Zdanowicz	bc4e540c33	[fix] unify heaps size programing - share same code between csr and cmd container to get default heap size - share handling of debug flag to change heap size - share platform level surface heap size between csr and command list - refactor heap size files - put heap size constant and function into namespace - command list surface heap size increased to 2MB for xehp+ to match csr - command list increased surface heap size only for sba tracking - sba tracking heap consumption increased due to different reset policy Related-To: NEO-5055 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-03-17 08:34:06 +01:00
Cencelewska, Katarzyna	398c7b2d29	refactor, remove typo in struct name change name of EncodeSempahore to EncodeSemaphore Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-03-10 15:44:25 +01:00
Kamil Kopryk	fa8579602f	refactor: rename product helper files n/n Related-To: NEO-7703 Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>	2023-03-10 13:24:38 +01:00
Zbigniew Zdanowicz	0950f5a23e	Set global heap size to constant value Related-To: NEO-5055 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-03-09 17:17:32 +01:00
Cencelewska, Katarzyna	c274309d7b	wa: add dummy blits before command MI_FLUSH_DW to guarantee that all subblt got complete for previous copy affect xe hpg temporary changes under flag ForceDummyBlitWa Related-To: NEO-7450 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-03-09 10:40:35 +01:00
Cencelewska, Katarzyna	3e116ea378	refactor: use same paths when add command mi_semaphore_wait Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-03-07 10:35:26 +01:00
Cencelewska, Katarzyna	50da32ffb1	wa: add dummy blits before command MI_ARB_CHECK to guarantee that all subblt got complete for previous copy affect xe hpg Related-To: NEO-7450 Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>	2023-03-07 10:21:05 +01:00
Zbigniew Zdanowicz	34064811d2	Refactor state base address programing 4/n - This change gets level one cache policy from cached values instead of calling virtual methods Related-To: NEO-5055 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-02-27 17:30:36 +01:00
Zbigniew Zdanowicz	3cb064fe95	Refactor state base address programing 3/n This is small optimization to replace virtual call and retrieved struct with cached value. Related-To: NEO-5055 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2023-02-23 13:08:32 +01:00


174 Commits