intel/llvm - llvm - Gitea: Git with a cup of tea

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-25 10:55:58 +08:00

Author	SHA1	Message	Date
Matthias Springer	440808faf6	[mlir][linalg] MapCopyToThreadsOp: Support tensor.pad Also return the generated loop op. Differential Revision: https://reviews.llvm.org/D155950	2023-07-21 15:51:46 +02:00
Nikita Popov	98b10727b1	[ValueTracking] Extract isKnownNonZeroFromOperator() (NFC) Split off the primary part of the isKnownNonZero() implementation, in the same way it is done for computeKnownBits(). This makes it easier to reorder different parts of isKnownNonZero().	2023-07-21 15:51:29 +02:00
Maciej Gabka	b172fbff68	Revert "[TLI][AArch64] Add missing SLEEF mappings to scalable vector functions for log2 and log2f" This reverts commit `791c89600a`.	2023-07-21 13:50:10 +00:00
Maciej Gabka	791c89600a	[TLI][AArch64] Add missing SLEEF mappings to scalable vector functions for log2 and log2f In the original commit adding SLEEF mappings, https://reviews.llvm.org/D146839 mappings for log2/log2f were missing. Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D155623	2023-07-21 13:46:03 +00:00
Matthias Springer	a5bba98a58	[mlir][linalg] BufferizeToAllocationOp: Add option to materialize buffers for operands Add an option that does not bufferize the targeted op itself, but just materializes a buffer for the destination operands. This is useful for partial bufferization of complex ops such as `scf.forall`, which need special handling (and an analysis if the region). Differential Revision: https://reviews.llvm.org/D155946	2023-07-21 15:29:59 +02:00
Matthias Springer	20245ed4de	[mlir][transform] Add `apply_cse` option to `transform.apply_patterns` op Applying the canonicalizer and CSE in an interleaved fashion is useful after bufferization (and maybe other transforms) to fold away self copies. Differential Revision: https://reviews.llvm.org/D155933	2023-07-21 15:13:56 +02:00
Daniel Krupp	26b19a67e5	[clang][analyzer]Fix non-effective taint sanitation There was a bug in alpha.security.taint.TaintPropagation checker in Clang Static Analyzer. Taint filtering could only sanitize const arguments. After this patch, taint filtering is effective also on non-const parameters. Differential Revision: https://reviews.llvm.org/D155848	2023-07-21 15:11:13 +02:00
Corentin Jabot	2021910606	[Clang] Diagnose jumps into statement expressions Such jumps are not allowed by GCC and allowing them can lead to situations where we jumps into unevaluated statements. Fixes #63682 Reviewed By: aaron.ballman, #clang-language-wg Differential Revision: https://reviews.llvm.org/D154696	2023-07-21 15:08:51 +02:00
Jie Fu	3fd1790638	[mlir][nvgpu] Ignore -Wunused-function in NVGPUDialect.cpp (NFC) In file included from /Users/jiefu/llvm-project/mlir/lib/Dialect/NVGPU/IR/NVGPUDialect.cpp:363: /Users/jiefu/llvm-project/build-Release/tools/mlir/include/mlir/Dialect/NVGPU/IR/NVGPUAttrDefs.cpp.inc:22:36: error: unused function 'generatedAttributeParser' [-Werror,-Wunused-function] static ::mlir::OptionalParseResult generatedAttributeParser(::mlir::AsmParser &parser, ::llvm::StringRef *mnemonic, ::mlir::Type type, ::mlir::Attribute &value) { ^ /Users/jiefu/llvm-project/build-Release/tools/mlir/include/mlir/Dialect/NVGPU/IR/NVGPUAttrDefs.cpp.inc:46:30: error: unused function 'generatedAttributePrinter' [-Werror,-Wunused-function] static ::mlir::LogicalResult generatedAttributePrinter(::mlir::Attribute def, ::mlir::AsmPrinter &printer) { ^ 2 errors generated.	2023-07-21 20:50:48 +08:00
David Berard	8fa02db8cf	[llvm][SLP] Exit early if inputs to comparator are equal TL;DR: This PR modifies a comparator. The comparator is used in a subsequent call to llvm::stable_sort. Sorting comparators should follow strict weak ordering - in particular, (x < x) should return false. This PR adds a fix to avoid an infinite loop when the inputs to the comparator are equal. Details: Sometimes when two equivalent tensors passed into the comparator, we encounter infinite looping (at `aae2eaae2c/llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp (L4049)`) Although it seems like this comparator will never be called with two equivalent pointers, some sanitizers, e.g. https://chromium.googlesource.com/chromiumos/third_party/gcc/+/refs/heads/stabilize-zako-5712.88.B/libstdc++-v3/include/bits/stl_algo.h#360, will add checks for (x < x). When this sanitizer is used with the current implementation, it triggers a comparator check for (x < x) which runs into the infinite loop Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D155874	2023-07-21 05:40:55 -07:00
Kadir Cetinkaya	0a093f62d1	[clangd] Prefer definitions for gototype and implementation Differential Revision: https://reviews.llvm.org/D133843	2023-07-21 14:38:51 +02:00
Simon Pilgrim	be62041e7e	[X86] matchBinaryShuffle - match PACKUS for v2i64 -> v4i32 shuffle truncation patterns. Handle PACKUSWD on +SSE41 targets, or fallback to PACKUSBW on any +SSE2 target	2023-07-21 13:32:04 +01:00
Simon Pilgrim	c0a1f4624b	[X86] Add packus.ll test coverage Similar to the existing packss.ll tests	2023-07-21 13:32:04 +01:00
Simon Pilgrim	7196eb2541	[X86] packss.ll - add SSE4.2 test coverage	2023-07-21 13:32:03 +01:00
Michael Halkenhaeuser	d82eace1c9	[OpenMP][OMPT] Add 'Initialized' flag We observed some overhead and unnecessary debug output. This can be alleviated by (re-)introduction of a boolean that indicates, if the OMPT initialization has been performed. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D155186	2023-07-21 08:19:03 -04:00
Haojian Wu	209694ded0	[clangd] Make the order of missing-include edits deterministic Fixes https://github.com/llvm/llvm-project/issues/63995	2023-07-21 14:16:16 +02:00
Aaron Ballman	29112a9946	Mark this test as unsupported on Windows systems There is a strange issue happening with command line processing though. The command line argument --export-dynamic-symbol 'f*' does not have the single quotes stripped on some Windows targets (but not all). This causes the glob matching to fail, which means the test fails on some Windows bots and passes on others. This is expected to be a temporary measure to get bots back to green. I've not found a commit that has caused a behavioral change that could be reverted instead, so this could be an issue with lit or test machine configuration.	2023-07-21 08:10:52 -04:00
Benjamin Kramer	c4303920f1	[bazel] Tweak dependency spaghetti after `70c2e0618a`	2023-07-21 13:54:31 +02:00
Benjamin Kramer	977633b26a	[bazel] Port `5d8813dec6`	2023-07-21 13:52:32 +02:00
Matthias Springer	544f0e9161	[mlir] Fix build after D155680	2023-07-21 13:33:54 +02:00
Alexander Belyaev	4114f07c2a	[mlir] Update bazel build after rG70c2e0618a0f3c09ed7149d88b4987b932eb6705	2023-07-21 13:21:49 +02:00
Corentin Jabot	33c2e4ec7c	[Clang] Fix access to an unitinialized variable This fixes the spurious test failure introduced in `f9caa12328`	2023-07-21 12:34:35 +02:00
Shivam Gupta	db04f018bb	Revert "[LIT] Added an option to llvm-lit to emit the necessary test coverage data, divided per test case" This reverts commit `d8e26bccb3`. Test case are meant to run only when LLVM_INDIVIDUAL_TEST_COVERAGE is set.	2023-07-21 15:59:56 +05:30
Michael Halkenhaeuser	453a75dc52	[OpenMP] [OMPT] [6/8] Added callback support for target data operations, target submit, and target regions. This patch adds support for invoking target callbacks but does not yet invoke them. A new structure OmptInterface has been added that tracks thread local states including correlation ids. This structure defines methods that will be called from the device independent target library with information related to a target entry point for which a callback is invoked. These methods in turn use the callback functions maintained by OmptDeviceCallbacksTy to invoke the tool supplied callbacks. Depends on D124652 Patch from John Mellor-Crummey <johnmc@rice.edu> With contributions from: Dhruva Chakrabarti <Dhruva.Chakrabarti@amd.com> Differential Revision: https://reviews.llvm.org/D127365	2023-07-21 06:24:12 -04:00
Matthias Springer	ba745eea40	[mlir][bufferization] Remove cleanup pipeline from bufferization pass To keep the pass simple, users should apply cleanup passes manually when necessary. In particular, `-cse -canonicalize` are often desireable to fold away self-copies that are created by the bufferization. This addresses a comment in D120191. Differential Revision: https://reviews.llvm.org/D155923	2023-07-21 12:11:25 +02:00
Pranav Taneja	c41a62e924	[AMDGPU] [NFC] Fixed a typo in SIShrinkInstructions.cpp Reviewed By: pravinjagtap Differential Revision: https://reviews.llvm.org/D155785	2023-07-21 15:35:19 +05:30
Jay Foad	e45a0c2994	[AMDGPU][RFC] Update isLegalAddressingMode for GFX9 SMEM signed offsets Differential Revision: https://reviews.llvm.org/D155587	2023-07-21 10:56:43 +01:00
Jay Foad	787bef0bee	[AMDGPU] Add tests for SMEM addressing modes in CodeGenPrepare Differential Revision: https://reviews.llvm.org/D155854	2023-07-21 10:56:43 +01:00
Shivam Gupta	d8e26bccb3	[LIT] Added an option to llvm-lit to emit the necessary test coverage data, divided per test case This patch is the first part of https://llvm.org/OpenProjects.html#llvm_patch_coverage. We have first define a new variable LLVM_TEST_COVERAGE which when set, pass --emit-coverage option to llvm-lit which will help in setting a unique value to LLVM_PROFILE_FILE for each RUN. So for example coverage data for test case llvm/test/Analysis/AliasSet/memtransfer.ll will be emitted as build/test/Analysis/AliasSet/memtransfer.profraw Reviewed By: hnrklssn Differential Revision: https://reviews.llvm.org/D154280	2023-07-21 15:23:01 +05:30
Ingo Müller	8fd207fd0d	[mlir][transform][structured][python] Allow str arg in match_op_names. Allow the `names` argument in `MatchOp.match_op_names` to be of type `str` in addition to `Sequence[str]`. In this case, the argument is treated as a list with one name, i.e., it is possible to write `MatchOp.match_op_names(..., "test.dummy")` instead of `MatchOp.match_op_names(..., ["test.dummy"])`. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D155807	2023-07-21 09:36:55 +00:00
Ingo Müller	522831384f	[mlir][linalg][transform] Extend diagnostics of FuseIntoContainingOp. This patch extends the diagnostic output of `FuseIntoContainingOp` when it fails to find the next producer by also provided the location of the affected transform op. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D155803	2023-07-21 09:34:04 +00:00
Guray Ozen	e56d6745f7	[mlir][nvgpu] Add `tma.create.descriptor` to create tensor map descriptor The Op creates a tensor map descriptor object representing tiled memory region. The descriptor is used by Tensor Memory Access (TMA). The `tensor` is the source tensor to be tiled. The `boxDimensions` is the size of the tiled memory region in each dimension. The pattern here lowers `tma.create.descriptor` to a runtime function call that eventually calls calls CUDA Driver's `cuTensorMapEncodeTiled`. For more information see below: https://docs.nvidia.com/cuda/cuda-driver-api/group__CUDA__TENSOR__MEMORY.html Depends on D155453 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D155680	2023-07-21 11:33:04 +02:00
Luke Lau	33a83c5486	[RISCV] Add SDNode patterns for vrol.[vv,vx] and vror.[vv,vx,vi] These correspond to ROTL/ROTR nodes Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D155439	2023-07-21 10:22:46 +01:00
Andrzej Warzynski	5c9db62587	[mlir][test] Add missing LIT config for `mlir-cpu-config` + emulator Similarly to when using `lli`, make sure that when using `mlir-cpu-runner` with an emulator, a full path to `mlir-cpu-runner` is used. Otherwise `mlir-cpu-runner` won't be found and you will get the following error: ``` Error while loading mlir-cpu-runner: No such file or directory ``` This patch should fix: * https://lab.llvm.org/buildbot/#/builders/179 The breakage was originally introduced in https://reviews.llvm.org/D155405. Differential Revision: https://reviews.llvm.org/D155920	2023-07-21 09:20:03 +00:00
Alex Zinenko	8dbddb1718	[mlir] allow region branch spec from parent op to itself RegionBranchOpInterface did not allow the operation with regions to specify itself as successors. Therefore, this implied that the control is always transferred to a region before being transferred back to the parent op. Since the region can only transfer the control back to the parent op from a terminator, this transitively implied that the first block of any region with a RegionBranchOpInterface is always executed until the terminator can transfer the control flow back. This is trivially false for any conditional-like operation that may or may not execute the region, as well as for loop-like operations that may not execute the body. Remove the restriction from the interface description and update the only transform that relied on it. See https://discourse.llvm.org/t/rfc-region-control-flow-interfaces-should-encode-region-not-executed-correctly/72103. Depends On: https://reviews.llvm.org/D155757 Reviewed By: Mogball, springerm Differential Revision: https://reviews.llvm.org/D155822	2023-07-21 09:16:56 +00:00
Alex Zinenko	5d8813dec6	[mlir] allow dense dataflow to customize call and region operations Initial implementations of dense dataflow analyses feature special cases for operations that have region- or call-based control flow by leveraging the corresponding interfaces. This is not necessarily sufficient as these operations may influence the dataflow state by themselves as well we through the control flow. For example, `linalg.generic` and similar operations have region-based control flow and their proper memory effects, so any memory-related analyses such as last-writer require processing `linalg.generic` directly instead of, or in addition to, the region-based flow. Provide hooks to customize the processing of operations with region- cand call-based contol flow in forward and backward dense dataflow analysis. These hooks are trigerred when control flow is transferred between the "main" operation, i.e. the call or the region owner, and another region. Such an apporach allows the analyses to update the lattice before and/or after the regions. In the `linalg.generic` example, the reads from memory are interpreted as happening before the body region and the writes to memory are interpreted as happening after the body region. Using these hooks in generic analysis may require introducing additional interfaces, but for now assume that the specific analysis have spceial cases for the (rare) operaitons with call- and region-based control flow that need additional processing. Reviewed By: Mogball, phisiart Differential Revision: https://reviews.llvm.org/D155757	2023-07-21 09:16:03 +00:00
Luke Lau	f6bdfb0b92	[RISCV] Remove VPatBinaryExtVL_WV_WX multiclass. NFC It's no longer needed now that the sext/zext patterns have been merged. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D155815	2023-07-21 10:13:30 +01:00
Luke Lau	24628a14c4	[RISCV] Add patterns for vnsr[a,l].wx where shift amount has different type than vector element We're currently only matching scalar shift amounts where the type is the same as the vector element type. But because only the bottom log2(2*SEW) bits are used, only 7 bits will be used at most so we can use any scalar type >= i8. This patch adds patterns for the case above, as well as for when the shift amount type is the same as the widened element type and doesn't need extended. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D155698	2023-07-21 10:13:28 +01:00
Luke Lau	418e678ba3	[RISCV] Add tests for vnsr[l,a].wx patterns that could be matched These patterns of ([l,a]shr v, ([s,z]ext splat)) only pick up the cases where the scalar has the same type as the vector element. However since only the low log2(SEW) bits of the scalar are read, we could use any scalar type that has been extended. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D155697	2023-07-21 10:13:26 +01:00
Ilya Leoshkevich	a0eee6c971	[SystemZ] Allow symbols in immediate asm operands Currently mentioning any symbols in immediate asm operands is not supported, for example: error: invalid operand for instruction lghi %r4,foo_end-foo The immediate problem is that isImm() and printOperand() functions do not accept MCExprs, but simply relaxing these checks is not enough: after symbol addresses are computed, range checks need to run against resolved values. Add a number of SystemZ::FixupKind members for each kind of immediate value and process them in SystemZMCAsmBackend::applyFixup(). Only perform the range checks, do not change anything. Adjust the tests: move previously failing cases like the one shown above out of insn-bad.s. Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D154899	2023-07-21 11:09:19 +02:00
Alexander Belyaev	e6c62a2dcc	[mlir] Fix bazel build after `b96bd025b3`	2023-07-21 11:08:44 +02:00
Corentin Jabot	f9caa12328	[Clang] Fix constraint checking of non-generic lambdas. A lambda call operator can be a templated entity - and therefore have constraints while not being a function template template<class T> void f() { []() requires false { }(); } In that case, we would check the constraints of the call operator which is non-viable. However, we would find a viable candidate: the conversion operator to function pointer, and use it to perform a surrogate call. These constraints were not checked because: * We never check the constraints of surrogate functions * The lambda conversion operator has non constraints. From the wording, it is not clear what the intent is but it seems reasonable to expect the constraints of the lambda conversion operator to be checked and it is consistent with GCC and MSVC. This patch also improve the diagnostics for constraint failure on surrogate calls. Fixes #63181 Reviewed By: #clang-language-wg, aaron.ballman Differential Revision: https://reviews.llvm.org/D154368	2023-07-21 10:59:36 +02:00
Nikita Popov	f060f095aa	[X86] Expand constant expressions in test (NFC)	2023-07-21 10:40:47 +02:00
Guray Ozen	9dad32cb90	[mlir][nvgpu] Improve finding module Op to for `mbarrier.create` Current transformation expects module op to be two level higher, however, it is not always the case. This work searches module op in a while loop. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D155825	2023-07-21 10:36:45 +02:00
Guray Ozen	70c2e0618a	[mlir][nvgpu] Add nvgpu.tma.async.load and nvgpu.tma.descriptor This work adds `nvgpu.tma.async.load` Op that requests tma load asyncronusly using mbarrier object. It also creates nvgpu.tma.descriptor type. The type is supposed be created by `cuTensorMapEncodeTiled` cuda drivers api. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D155453	2023-07-21 10:23:25 +02:00
Alex Zinenko	2469cdd156	[mlir] remove RegionBranchOpInterface from linalg ops Linalg structure ops do not implement control flow in the way expected by RegionBranchOpInterface, and the interface implementation isn't actually used anywhere. The presence of this interface without correct implementation is confusing for, e.g., dataflow analyses. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D155841	2023-07-21 08:18:41 +00:00
Nikita Popov	174300a283	[LoopIdiom] Regenerate test checks (NFC)	2023-07-21 10:12:05 +02:00
Nikita Popov	2a11549dc5	[InstCombine] Regenerate test checks (NFC)	2023-07-21 10:11:35 +02:00
Nikita Popov	086ee99564	Reapply [IR] Mark and constant expressions as undesirable Reapply after fixing an issue in canonicalizeLogicFirst() exposed by this change (`218f97578b`). ----- In preparation for removing support for and expressions, mark them as undesirable. As such, we will no longer implicitly create such expressions, but they still exist.	2023-07-21 10:10:50 +02:00
Haojian Wu	163268ada0	[bazel] add missing dep for llvm/unittests:frontend_tests	2023-07-21 10:10:15 +02:00

1 2 3 4 5 ...

468627 Commits