intel/llvm - llvm - Gitea: Git with a cup of tea

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-02-08 17:28:30 +08:00

Author	SHA1	Message	Date
Uday Bondhugula	a91cfd1990	[MLIR] Improve op parse error message for AtLeastNOperands trait Improve parse error message for "at least N operands" op trait. Differential Revision: https://reviews.llvm.org/D109747	2021-09-14 15:01:51 +05:30
Matthias Springer	62883459cd	[mlir][linalg] makeTiledShape: No affine.min if tile size == 1 This improves codegen (more static type information) with `scalarize-dynamic-dims`. Differential Revision: https://reviews.llvm.org/D109415	2021-09-14 10:48:20 +09:00
Matthias Springer	fb1def9c66	[mlir][linalg] New tiling option: Scalarize dynamic dims This tiling option scalarizes all dynamic dimensions, i.e., it tiles all dynamic dimensions by 1. This option is useful for linalg ops with partly dynamic tensor dimensions. E.g., such ops can appear in the partial iteration after loop peeling. After scalarizing dynamic dims, those ops can be vectorized. Differential Revision: https://reviews.llvm.org/D109268	2021-09-14 10:40:50 +09:00
Matthias Springer	8faf35c0a5	[mlir][linalg] Add scf.for loop peeling to codegen strategy Only scf.for loops are supported at the moment. linalg.tiled_loop support will be added in a subsequent commit. Only static tensor sizes are supported. Loops for dynamic tensor sizes can be peeled, but the generated code is not optimal due to a missing canonicalization pattern. Differential Revision: https://reviews.llvm.org/D109043	2021-09-14 10:35:01 +09:00
Nicolas Vasilache	181d18ef53	[mlir][Linalg] Insert static buffers as high as possible during ComprehensiveBufferization. This revision allows hoisting static alloc/dealloc pairs as high as possible during ComprehensiveBufferization. This also aligns such allocated buffers to 128B by default. This change exhibited some issues wrt insertion points and a missing copy that are also fixed in this revision; tests are updated accordingly. Differential Revision: https://reviews.llvm.org/D109684	2021-09-13 15:59:03 +00:00
Simon Camphausen	ec92f788f3	[mlir][emitc] Print signed integers properly Previously negative integers were printed as large unsigned values. Reviewed By: marbre Differential Revision: https://reviews.llvm.org/D109690	2021-09-13 15:29:30 +00:00
Matthias Springer	7c9b6a3355	[mlir][linalg] ComprehensiveBufferize: Do not copy InitTensorOps Do not copy InitTensorOps or casts thereof. Differential Revision: https://reviews.llvm.org/D109656	2021-09-13 22:31:54 +09:00
Nicolas Vasilache	b01d223faf	[mlir][Linalg] Use reify for padded op shape derivation. Previously, we would insert a DimOp and rely on later canonicalizations. Unfortunately, reifyShape kind of rewrites are not canonicalizations anymore. This introduces undesirable pass dependencies. Instead, immediately reify the result shape and avoid the DimOp altogether. This is akin to a local folding, which avoids introducing more reliance on `-resolve-shaped-type-result-dims` (similar to compositions of `affine.apply` by construction to avoid chains of size > 1). It does not completely get rid of the reliance on the pass as the process is merely local: calling the pass may still be necessary for global effects. Indeed, one of the tests still requires the pass. Differential Revision: https://reviews.llvm.org/D109571	2021-09-13 11:54:59 +00:00
Valentin Clement	57bf856011	[mlir] Add missing namespace to createInlinerPass One of the createInlinerPass does not have the mlir:: namespace Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D109580	2021-09-13 11:58:27 +02:00
Mehdi Amini	7fb2394a4f	Add sanity check in MLIR ODS to catch case where an arguments/results/regions/successors names overlap This is making a tablegen crash with a more friendly error. Differential Revision: https://reviews.llvm.org/D109474	2021-09-13 06:21:25 +00:00
Kiran Chandramohan	187d9f8cd9	[OpenMP][MLIR] Add a conversion pattern for the master op The conversion pattern is particularly useful for conversion of block arguments in the master op. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D109610	2021-09-12 10:13:40 +00:00
Rob Suderman	b0532286fe	[mlir][tosa] Add shape inference for tosa.while Tosa.while shape inference requires repeatedly running shape inference across the body of the loop until the types become static as we do not know the number of iterations required by the loop body. Once the least specific arguments are known they are propagated to both regions. To determine the final end type, the least restrictive types are determined from all yields. Differential Revision: https://reviews.llvm.org/D108801	2021-09-10 13:11:53 -07:00
Alex Zinenko	61bc6aa5a7	[mlir] spelling and style changes in ReconcileUnrealizedCasts.cpp. NFC.	2021-09-10 14:09:29 +02:00
Stephan Herhut	5e6c170b3f	[mlir][linalg] Fix bufferize pattern to allow unknown operations in body of generic The original version of the bufferization pattern for linalg.generic would manually clone operations within the region to the bufferized clone of the operation. This triggers legality requirements on those operations in the conversion infra. Instead, this now uses the rewriter to inline the region instead, avoiding those legality requirements. Differential Revision: https://reviews.llvm.org/D109581	2021-09-10 13:37:42 +02:00
Matthias Springer	0f3544d185	[mlir][scf] Loop peeling: Use scf.for for partial iteration Generate an scf.for instead of an scf.if for the partial iteration. This is for consistency reasons: The peeling of linalg.tiled_loop also uses another loop for the partial iteration. Note: Canonicalizations patterns may rewrite partial iterations to scf.if afterwards. Differential Revision: https://reviews.llvm.org/D109568	2021-09-10 19:07:09 +09:00
Tobias Gysi	16488dc300	[mlir][linalg] Pass all operands to tile to the tile loop region builder (NFC). Extend the signature of the tile loop nest region builder to take all operand values to use and not just the scf::For iterArgs. This change allows us to pass in all block arguments of TiledLoop and use them directly instead of replacing them after the loop generation. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D109569	2021-09-10 08:35:11 +00:00
Nicolas Vasilache	5f1a1af4bf	[mlir][Linalg] Properly order extract_slice traversal in comprehensive bufferization This revision fixes the traversal order of extract_slice during the inplace analysis. It was previously thought that such ops could be analyzed at the very end. This is unfortunately not true as the AliasInfo for dependents of these ops need to be updated. This change allows the aliases introduced by the bufferization of extract_slice to be properly propagated. Differential Revision: https://reviews.llvm.org/D109519	2021-09-10 07:10:06 +00:00
natashaknk	d4d50e4710	[mlir][tosa] Add lowering for tosa.clz using scf::whileOp Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D109540	2021-09-09 15:57:35 -07:00
Aart Bik	066d786ce0	[mlir][sparse] add folding to sparse_tensor.convert folds conversion between identical types (with tests) Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D109545	2021-09-09 15:45:19 -07:00
Alexander Slepko	89837a0e1b	Adding min(f/s/u) and max(f/s/u) cases for vector reduction This PR adds missing AtomicRMWKind::min/max cases which we would like to use for min/max reduction loop vectorizations. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D104881	2021-09-09 12:00:43 -07:00
Chris Lattner	735f46715d	[APInt] Normalize naming on keep constructors / predicate methods. This renames the primary methods for creating a zero value to `getZero` instead of `getNullValue` and renames predicates like `isAllOnesValue` to simply `isAllOnes`. This achieves two things: 1) This starts standardizing predicates across the LLVM codebase, following (in this case) ConstantInt. The word "Value" doesn't convey anything of merit, and is missing in some of the other things. 2) Calling an integer "null" doesn't make any sense. The original sin here is mine and I've regretted it for years. This moves us to calling it "zero" instead, which is correct! APInt is widely used and I don't think anyone is keen to take massive source breakage on anything so core, at least not all in one go. As such, this doesn't actually delete any entrypoints, it "soft deprecates" them with a comment. Included in this patch are changes to a bunch of the codebase, but there are more. We should normalize SelectionDAG and other APIs as well, which would make the API change more mechanical. Differential Revision: https://reviews.llvm.org/D109483	2021-09-09 09:50:24 -07:00
Aart Bik	e2d3db42e5	[mlir][sparse] add casts to operations to lattice and exp builders Further enhance the set of operations that can be handled by the sparse compiler Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109413	2021-09-09 08:49:50 -07:00
Alex Zinenko	8b58ab8ccd	[mlir] Factor type reconciliation out of Standard-to-LLVM conversion Conversion to the LLVM dialect is being refactored to be more progressive and is now performed as a series of independent passes converting different dialects. These passes may produce `unrealized_conversion_cast` operations that represent pending conversions between built-in and LLVM dialect types. Historically, a more monolithic Standard-to-LLVM conversion pass did not need these casts as all operations were converted in one shot. Previous refactorings have led to the requirement of running the Standard-to-LLVM conversion pass to clean up `unrealized_conversion_cast`s even though the IR had no standard operations in it. The pass must have been also run the last among all to-LLVM passes, in contradiction with the partial conversion logic. Additionally, the way it was set up could produce invalid operations by removing casts between LLVM and built-in types even when the consumer did not accept the uncasted type, or could lead to cryptic conversion errors (recursive application of the rewrite pattern on `unrealized_conversion_cast` as a means to indicate failure to eliminate casts). In fact, the need to eliminate A->B->A `unrealized_conversion_cast`s is not specific to to-LLVM conversions and can be factored out into a separate type reconciliation pass, which is achieved in this commit. While the cast operation itself has a folder pattern, it is insufficient in most conversion passes as the folder only applies to the second cast. Without complex legality setup in the conversion target, the conversion infra will either consider the cast operations valid and not fold them (a separate canonicalization would be necessary to trigger the folding), or consider the first cast invalid upon generation and stop with error. The pattern provided by the reconciliation pass applies to the first cast operation instead. Furthermore, having a separate pass makes it clear when `unrealized_conversion_cast`s could not have been eliminated since it is the only reason why this pass can fail. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109507	2021-09-09 16:51:24 +02:00
Uday Bondhugula	524eafa5b2	[MLIR] Avoid double space print on llvm global op Fix extra space print for llvm global op when the 'unamed_addr' attribute was empty. This led to two spaces being printed in the custom form between non-whitespace chars. A round trip would add an extra space to a typical spaced form. NFC. Differential Revision: https://reviews.llvm.org/D109502	2021-09-09 19:52:38 +05:30
Alex Zinenko	1ce752b741	[mlir] support reductions in SCF to OpenMP conversion OpenMP reductions need a neutral element, so we match some known reduction kinds (integer add/mul/or/and/xor, float add/mul, integer and float min/max) to define the neutral element and the atomic version when possible to express using atomicrmw (everything except float mul). The SCF-to-OpenMP pass becomes a module pass because it now needs to introduce new symbols for reduction declarations in the module. Reviewed By: chelini Differential Revision: https://reviews.llvm.org/D107549	2021-09-09 13:04:27 +02:00
Matthias Springer	c7d569b8f7	[mlir][scf] Fold dim(scf.for) to dim(iter_arg) Fold dim ops of scf.for results to dim ops of the respective iter args if the loop is shape preserving. Differential Revision: https://reviews.llvm.org/D109430	2021-09-09 13:47:13 +09:00
Matthias Springer	e2c8fcb9d0	[mlir][linalg] Fold dim(linalg.tiled_loop) to dim(output_arg) Fold dim ops of linalg.tiled_loop results to dim ops of the respective iter args if the loop is shape preserving. Differential Revision: https://reviews.llvm.org/D109431	2021-09-09 13:37:28 +09:00
Matthias Springer	f7137da174	[mlir][linalg] Fix dim(iter_arg) canonicalization Run a small analysis to see if the runtime type of the iter_arg is changing. Fold only if the runtime type stays the same. (Same as `DimOfIterArgFolder` in SCF.) Differential Revision: https://reviews.llvm.org/D109299	2021-09-09 12:13:05 +09:00
Matthias Springer	c95a7246a3	[mlir][linalg] Tiling: Use loop ub in extract_slice size computation if possible When tiling a LinalgOp, extract_slice/insert_slice pairs are inserted. To avoid going out-of-bounds when the tile size does not divide the shape size evenly (at the boundary), AffineMin ops are inserted. Some ops have assumptions regarding the dimensions of inputs/outputs. E.g., in a `A * B` matmul, `dim(A, 1) == dim(B, 0)`. However, loop bounds use either `dim(A, 1)` or `dim(B, 0)`. With this change, AffineMin ops are expressed in terms of loop bounds instead of tensor sizes. (Both have the same runtime value.) This simplifies canonicalizations. Differential Revision: https://reviews.llvm.org/D109267	2021-09-09 11:06:22 +09:00
Chris Lattner	40a89da65c	[Canonicalize] Don't call isBeforeInBlock in OperationFolder::tryToFold. This patch (`e4635e6328`) fixed a bug where a newly generated/reused constant wouldn't dominate a folded operation. It did so by calling isBeforeInBlock to move the constant around on demand. This introduced a significant compile time regression, because "isBeforeInBlock" is O(n) in the size of a block the first time it is called, and the cache is invalidated any time canonicalize changes something big in the block. This fixes LLVM PR51738 and this CIRCT issue: https://github.com/llvm/circt/issues/1700 This does affect the order of constants left in the top of a block, I staged in the testsuite changes in rG42431b8207a5. Differential Revision: https://reviews.llvm.org/D109454	2021-09-08 13:33:22 -07:00
Kunwar Shaanjeet Singh Grover	dea76ccaf4	[MLIR] FlatAffineConstraints: Refactored computation of explicit representation for identifiers This patch refactors the existing implementation of computing an explicit representation of an identifier as a floordiv in terms of other identifiers and exposes this computation as a public function. The computation of this representation is required to support local identifiers in PresburgerSet subtract, complement and isEqual. Reviewed By: bondhugula, arjunp Differential Revision: https://reviews.llvm.org/D106662	2021-09-08 20:24:46 +05:30
Arnab Dutta	1524b01541	[MLIR] Add loop coalesce utility for affine.for Add loop coalesce utility for affine.for. This expects loops to have been normalized a-priori. This works for both constant as well non constant upper bounds having single/multiple result upper bound affine map. With contributions from Arnab Dutta and Uday Bondhugula. Reviewed By: bondhugula, ayzhuang Differential Revision: https://reviews.llvm.org/D108126	2021-09-08 18:02:23 +05:30
Aart Bik	d02e12fadf	[mlir][sparse] fix typos Perhaps one of these days I will actually learn how to spell opaque.... Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109391	2021-09-07 14:20:05 -07:00
Alex Zinenko	b841ae55e5	[mlir] Fix SplatOp lowering to the LLVM dialect The lowering has been incorrectly using the operands of the original op instead of rewritten operands provided to matchAndRewrite call. This may lead to spurious materializations and generally invalid IR. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D109355	2021-09-07 19:14:28 +02:00
Alex Zinenko	821262eef2	[mlir] Fix GPU LaunchFunc conversion to the LLVM dialect The conversion has been incorrectly using the operands of the original operation instead of the converted operands provided to the matchAndRewrite call. This may lead to spurious materializations and generally invalid IR if the producer of the original operands is deleted in the process of conversion. Reviewed By: csigg Differential Revision: https://reviews.llvm.org/D109356	2021-09-07 16:50:11 +02:00
Matthias Springer	c57c4f888c	[mlir][linalg] linalg.tiled_loop peeling Differential Revision: https://reviews.llvm.org/D108270	2021-09-07 09:50:08 +09:00
Alexander Belyaev	58c188507f	[mlir][linalg] Fix `FoldInitTensorWithDimOp` if dim(init_tensor) is static. It looks like it was a typo. Instead of `maybeConstantIndex`, `initTensorOp.getStaticSize(maybeConstantIndex)` should be used to access the dim size of the tensor. There is a test for that in `canonicalize.mlir`, but it was working correctly because `ReplaceStaticShapeDims` was canonicalizing DimOp before `FoldInitTensorWithDimOp`. So, to make the patterns more "orthogonal", this case is disabled. Differential Revision: https://reviews.llvm.org/D109247	2021-09-06 10:47:26 +02:00
Eugene Zhulenev	fd52b4357a	[mlir] Async: check awaited operand error state after sync await Previously only await inside the async function (coroutine after lowering to async runtime) would check the error state Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D109229	2021-09-04 05:00:17 -07:00
Loren Maggiore	361458b1ce	[mlir] create gpu memset op Create a gpu memset op and corresponding CUDA and ROCm wrappers. Reviewed By: herhut, lorenrose1013 Differential Revision: https://reviews.llvm.org/D107548	2021-09-04 08:13:04 +02:00
William S. Moses	21d43daf8f	[MLIR] Primitive linkage lowering of FuncOp FuncOp always lowers to an LLVM external linkage presently. This makes it impossible to define functions in mlir which are local to the current module. Until MLIR FuncOps have a more formal linkage specification, this commit allows funcop's to have an optionally specified llvm.linkage attribute, whose value will be used as the linkage of the llvm funcop when lowered. Differential Revision: https://reviews.llvm.org/D108524 Support LLVM linkage	2021-09-03 20:41:39 -04:00
Mehdi Amini	78accf9f35	Make LLVM Linkage a first class attribute instead of using an integer attribute This makes the IR more readable, in particular when this will be used on the builtin func outside of the LLVM dialect. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D109209	2021-09-03 21:21:46 +00:00
Aart Bik	eee1f1c8fb	[mlir][sparse] add convenience method for sparse tensor setup This simplifies setting up sparse tensors through C-style data structures. Useful for runtimes that want to interact with MLIR-generated code without knowning about all bufferization details (viz. memrefs). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109251	2021-09-03 13:35:59 -07:00
Alexander Belyaev	5ee5bbd0ff	[mlir][linalg] Extend tiled_loop to SCF conversion to generate scf.parallel. Differential Revision: https://reviews.llvm.org/D109230	2021-09-03 18:05:54 +02:00
Aart Bik	b6d1a31c1b	[mlir][sparse] refine heuristic for iteration graph topsort The sparse index order must always be satisfied, but this may give a choice in topsorts for several cases. We broke ties in favor of any dense index order, since this gives good locality. However, breaking ties in favor of pushing unrelated indices into sparse iteration spaces gives better asymptotic complexity. This revision improves the heuristic. Note that in the long run, we are really interested in using ML for ML to find the best loop ordering as a replacement for such heuristics. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109100	2021-09-03 08:37:15 -07:00
Jean Perier	49af2a6275	[mlir][flang] Do not prevent integer types from being parsed as MLIR keywords DialectAsmParser::parseKeyword is rejecting `'i' digit+` while it is a valid identifier according to mlir/docs/LangRef.md. Integer types actually used to be TOK_KEYWORD a while back before the change: `6af866c58d`. This patch Modifies `isCurrentTokenAKeyword` to return true for tokens that match integer types too. The motivation for this change is the parsing of `!fir.type<{` `component-name: component-type,`+ `}>` type in FIR that represent Fortran derived types. The component-names are parsed as keywords, and can very well be i32 or any ixxx (which are valid Fortran derived type component names). The Quant dialect type parser had to be modified since it relied on `iw` not being parsed as keywords. Differential Revision: https://reviews.llvm.org/D108913	2021-09-03 08:20:49 +02:00
Matthias Springer	4fa6c2734c	[mlir][scf] Allow runtime type of iter_args to change The limitation on iter_args introduced with D108806 is too restricting. Changes of the runtime type should be allowed. Extends the dim op canonicalization with a simple analysis to determine when it is safe to canonicalize. Differential Revision: https://reviews.llvm.org/D109125	2021-09-03 10:03:05 +09:00
Stella Laurenzo	cb7b03819a	[mlir][python] Simplify python extension loading. * Now that packaging has stabilized, removes old mechanisms for loading extensions, preferring direct importing. * Removes _cext_loader.py, _dlloader.py as unnecessary. * Fixes the path where the CAPI dll is written on Windows. This enables that path of least resistance loading behavior to work with no further drama (see: https://bugs.python.org/issue36085). * With this patch, `ninja check-mlir` on Windows with Python bindings works for me, modulo some failures that are actually due to a couple of pre-existing Windows bugs. I think this is the first time the Windows Python bindings have worked upstream. * Downstream changes needed: * If downstreams are using the now removed `load_extension`, `reexport_cext`, etc, then those should be replaced with normal import statements as done in this patch. Reviewed By: jdd, aartbik Differential Revision: https://reviews.llvm.org/D108489	2021-09-03 00:43:28 +00:00
Alex Zinenko	f9be7a7afd	[mlir] speed up construction of LLVM IR constants when possible The translation to LLVM IR used to construct sequential constants by recurring down to individual elements, creating constant values for them, and wrapping them into aggregate constants in post-order. This is highly inefficient for large constants with known data such as DenseElementsAttr. Use LLVM's ConstantData for the innermost dimension instead. LLVM does seem to support data constants for nested sequential constants so the outer dimensions are still handled recursively. Nevertheless, this speeds up the translation of large constants with equal dimensions by up to 30x. Users are advised to rewrite large constants to use flat types before translating to LLVM IR if more efficiency in translation is necessary. This is not done automatically as the translation is not aware of the expectations of the overall compilation flow about type changes and indexing, in particular for global constants with external linkage. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D109152	2021-09-02 23:07:30 +02:00
Marius Brehler	f6063fedb4	[mlir] Add missing dep on MLIRTranslation	2021-09-02 16:54:46 +00:00
Kiran Chandramohan	711aa35759	[MLIR][OpenMP] Add support for declaring critical construct names Add an operation omp.critical.declare to declare names/symbols of critical sections. Named omp.critical operations should use symbols declared by omp.critical.declare. Having a declare operation ensures that the names of critical sections are global and unique. In the lowering flow to LLVM IR, the OpenMP IRBuilder creates unique names for critical sections. Reviewed By: ftynse, jeanPerier Differential Revision: https://reviews.llvm.org/D108713	2021-09-02 14:31:19 +00:00

1 2 3 4 5 ...

6311 Commits