intel/llvm - llvm - Gitea: Git with a cup of tea

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-18 07:57:36 +08:00

Author	SHA1	Message	Date
Michael Kruse	90e5ce0b0d	[PollyACC] Fix implicit function definitions. NFC. The isl_id_* have been in used without including the correspodning isl/id.h header. According to rules in C, a function is defined implicitly when first used with an assumed int return type (32 bits on 64 bit systems). But the implementation returns a pointer (64 bits on 64 bit systems). Is usually has no consequence because the return value is stored in a registers that is 64 bits (RAX) and the optimizer does not truncate its value before using it again as a pointer value. However, LTO optimizers will be rightfull;y confused. Fix by including <isl/id.h> This fixes llvm.org/PR50021	2021-04-21 01:08:00 -05:00
Siva Chandra Reddy	f76fb7d420	[libc] Add fma to the C standard spec.	2021-04-21 06:00:35 +00:00
Serge Pavlov	d20a2376d8	[RISCV] Introduce floating point control and state registers New registers FRM, FFLAGS and FCSR was defined. They represent corresponding system registers. The new registers are necessary to properly order floating point instructions in non-default modes. Differential Revision: https://reviews.llvm.org/D99083	2021-04-21 12:55:30 +07:00
serge-sans-paille	d9806334d1	Use SmallVector instead of std::vector to manage storage of llvm::BitVector This is a follow-up to https://reviews.llvm.org/D100387. std::vector is not the best storage container here. My local benchmark (counting the number of instruction when compiling the sqlite3 amalgamation) yields the following: - std::vector<BitVector> -> 5,860,885,896 - SmallVector<BitWord, 0> -> 5,858,991,997 - SmallVector<BitWord> -> 5,817,679,224 Differential Revision: https://reviews.llvm.org/D100744	2021-04-21 07:31:28 +02:00
Arthur Eubanks	dd56715326	[NFC] Remove redundant InstCombinePass name	2021-04-20 22:23:07 -07:00
Max Kazantsev	0ef7e0041a	[Test] Add a negative unit test	2021-04-21 12:11:05 +07:00
Pushpinder Singh	3194761d27	[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed This patch adds new clang tool named amdgpu-arch which uses HSA to detect installed AMDGPU and report back latter's march. This tool is built only if system has HSA installed. The value printed by amdgpu-arch is used to fill -march when latter is not explicitly provided in -Xopenmp-target. Reviewed By: JonChesterfield, gregrodgers Differential Revision: https://reviews.llvm.org/D99949	2021-04-21 05:05:49 +00:00
Siva Chandra Reddy	653345155a	[libc] Disable fma and fmaf for x86_64. The version of clang installed on the buildbot workers is not able to compile them. However, the version of gcc installed is able to compile them fine. So, this change disables them until we can find a way to compile them using clang on the buildbot workers.	2021-04-21 05:01:15 +00:00
Vitaly Buka	5e9e463e1f	[lsan] Test to show lsan dependency on globals This test from @MaskRay comment on D69428. The patch is looking to break this behavior. If we go with D69428 I hope we will have some workaround for this test or include explicit test update into the patch. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D100906	2021-04-20 22:00:26 -07:00
Jonas Devlieghere	eff4f65afb	[lldb] Simplify check for nill value in breakpoint_function_callback.test	2021-04-20 21:53:30 -07:00
Zi Xuan Wu	ca31b43ae8	[NFC][CSKY] Resort the instruction description in td Resort the instruction description in td to make it easy to upstream more instructions and add predicts later.	2021-04-21 12:36:07 +08:00
Siva Chandra	95934c3a37	[libc] Add hardware implementations of fma and fmaf for x86_64 and aarch64. The current generic implementation of the fmaf function has been moved to the FPUtil directory. This allows one use the fma operation from implementations of other math functions like the trignometric functions without depending on/requiring the fma/fmaf/fmal function targets. If this pattern ends being convenient, we will switch all generic math implementations to this pattern. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D100811	2021-04-21 04:31:27 +00:00
Jonas Devlieghere	05eeed9691	Revert "[Driver] Support default libc++ library location on Darwin" This reverts the following commits because it breaks TestAppleSimulatorOSType.py on GreenDragon [1]. `caff17e503` `f5efe0aa04` `ae8b2cab67` [1] http://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/31346/	2021-04-20 20:42:50 -07:00
Liu, Chen3	72e4bf12ee	[X86] Support some missing intrinsics Support for _mm512_i32logather_pd, _mm512_mask_i32logather_pd, _mm512_i32logather_epi64, _mm512_mask_i32logather_epi64, _mm512_i32loscatter_pd, _mm512_mask_i32loscatter_pd, _mm512_i32loscatter_epi64, _mm512_mask_i32loscatter_epi64. Differential Revision: https://reviews.llvm.org/D100368	2021-04-21 10:50:37 +08:00
Craig Topper	78abad569c	[RISCV] Add missing SEW=64 tests to vmslt-rv32.ll. NFC	2021-04-20 18:31:36 -07:00
Amy Zhuang	9194071626	[mlir] Support hoisting whole affine for loops in LICM Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D100512	2021-04-20 18:07:06 -07:00
George Balatsouras	79b5280a6c	[dfsan] Enable origin tracking with fast8 mode All related instrumentation tests have been updated. Reviewed By: stephan.yichao.zhao Differential Revision: https://reviews.llvm.org/D100903	2021-04-20 18:10:32 -07:00
Fangrui Song	031c40dc3c	[sanitizer] Fix glibc sparc build and add GetTls support sanitizer_linux_libcdep.cpp doesn't build for Linux sparc (with minimum support but can build) after D98926. I wasn't aware because the file didn't mention `__sparc__`. While here, add the relevant support since it does not add complexity (the D99566 approach). Adds an explicit `#error` for unsupported non-Android Linux and FreeBSD architectures. ThreadDescriptorSize is only used by lsan to scan thread-specific data keys in the thread control block. On TLS Variant II architectures (i386/x86_64/s390/sparc), our dl_iterate_phdr based approach can cover the region from the first byte of the static TLS block (static TLS surplus) to the thread pointer. We just need to extend the range to include the first few members of struct pthread. offsetof(struct pthread, specific_used) satisfies the requirement and has not changed since 2007-05-10. We don't need to update ThreadDescriptorSize for each glibc version. Technically we could use the 524/1552 for x86_64 as well but there is potential risk that large applications with thousands of shared object dependency may dislike the time complexity increase if there are many threads, so I don't make the simplification for now. Differential Revision: https://reviews.llvm.org/D100892	2021-04-20 17:42:41 -07:00
Adrian Prantl	81cad0be68	Make sure PHIElimination doesn't copy debug locations across basic blocks. PHIElimination may insert copy instructions in multiple basic blocks. Moving debug locations across basic block boundaries would be misleading as illustrated by the test case. rdar://75463656 Differential Revision: https://reviews.llvm.org/D100886	2021-04-20 17:03:29 -07:00
Sam Clegg	103956170b	[WebAssembly] Update README. NFC. This is just a cleanup of the very high level stuff. I'm sure there is more to update here but I'll leave that to others and/or a followup. Differential Revision: https://reviews.llvm.org/D100888	2021-04-20 16:59:08 -07:00
Jez Ng	7208bd4b32	[lld-macho] Skip platform checks for a few libSystem re-exports XCode 12 ships with mismatched platforms for these libraries, so this hack is necessary... Fixes PR49799. Reviewed By: #lld-macho, gkm, smeenai Differential Revision: https://reviews.llvm.org/D100913	2021-04-20 19:54:53 -04:00
Arthur Eubanks	326da4adcb	[FuncAttrs] Always preserve FunctionAnalysisManagerCGSCCProxy FunctionAnalysisManagerCGSCCProxy should not be preserved if any of its keys may be invalid. Since we are not removing/adding functions in FuncAttrs, it's fine to preserve it. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D100893	2021-04-20 16:37:45 -07:00
Jim Radford	16a0d80912	[CMake][llvm] avoid changing global flags (may be used outside of llvm) Changing global flags can break builds of projects that include/build llvm as a sub-project, as the effect is global. Ideally we would disable this warning at the directory level instead, but the obvious way (disabling warning D9025) isn't supported. At least we can limit the effect to only MSVC. Patch by Jim Radford. Differential Revision: https://reviews.llvm.org/D100900	2021-04-20 16:06:25 -07:00
Reid Kleckner	91f7a4fff7	Revert "[InstCombine] Recognize `((x * y) s/ x) !=/== y` as an signed multiplication overflow check (PR48769)" This reverts commit `13ec913bdf`. This commit introduces new uses of the overflow checking intrinsics that depend on implementations in compiler-rt, which Windows users generally do not link against. I filed an issue (somewhere) to make clang auto-link the builtins library to resolve this situation, but until that happens, it isn't reasonable for the optimizer to introduce new link time dependencies.	2021-04-20 15:53:34 -07:00
Matthias Springer	dd5324467d	[mlir] Disallow broadcast dimensions on TransferWriteOp. The current implementation allows for TransferWriteOps with broadcasts that do not make sense. E.g., a broadcast could write a vector into a single (scalar) memory location, which is effectively the same as writing only the last element of the vector. Differential Revision: https://reviews.llvm.org/D100842	2021-04-21 07:43:45 +09:00
Philip Reames	4824d876f0	Revert "Allow invokable sub-classes of IntrinsicInst" This reverts commit `d87b9b81cc`. Post commit review raised concerns, reverting while discussion happens.	2021-04-20 15:38:38 -07:00
Dávid Bolvanský	9f1e2ee462	[Clang, builtins] Added aligned_alloc, memalign support	2021-04-21 00:11:54 +02:00
Roman Lebedev	5a654bfeab	Revert "[InstCombine] `sext(trunc(x)) --> sext(x)` iff trunc is NSW (PR49543)" I forgot about the case where we sign-extend to width smaller than the original. This reverts commit `1e6ca23ab8`.	2021-04-21 01:11:15 +03:00
Roman Lebedev	1e68d338c1	Revert "[InstCombine] "Bypass" NUW trunc of lshr if we are going to sext the result (PR49543)" I forgot about the case where we sign-extend to width smaller than the original. This reverts commit `41b71f718b`.	2021-04-21 01:11:14 +03:00
Philip Reames	d87b9b81cc	Allow invokable sub-classes of IntrinsicInst It used to be that all of our intrinsics were call instructions, but over time, we've added more and more invokable intrinsics. According to the verifier, we're up to 8 right now. As IntrinsicInst is a sub-class of CallInst, this puts us in an awkward spot where the idiomatic means to check for intrinsic has a false negative if the intrinsic is invoked. This change switches IntrinsicInst from being a sub-class of CallInst to being a subclass of CallBase. This allows invoked intrinsics to be instances of IntrinsicInst, at the cost of requiring a few more casts to CallInst in places where the intrinsic really is known to be a call, not an invoke. After this lands and has baked for a couple days, planned cleanups: Make GCStatepointInst a IntrinsicInst subclass. Merge intrinsic handling in InstCombine and use idiomatic visitIntrinsicInst entry point for InstVisitor. Do the same in SelectionDAG. Do the same in FastISEL. Differential Revision: https://reviews.llvm.org/D99976	2021-04-20 15:03:49 -07:00
Mehdi Chinoune	080d48f279	[flang][msvc] Fix compilation of RuntimeGtest Removes alternate spelling 'not' with '!'. Reviewed by: ashermancinelli, awarzynski, Meinersbur Differential revision: https://reviews.llvm.org/D100442	2021-04-20 15:36:38 -06:00
Roman Lebedev	41b71f718b	[InstCombine] "Bypass" NUW trunc of lshr if we are going to sext the result (PR49543) This is a more convoluted form of the same pattern "sext of NSW trunc", but in this case the operand of trunc was a right-shift, and the truncation chops off just the zero bits that were shifted-in.	2021-04-21 00:31:46 +03:00
Roman Lebedev	0ea464824a	[NFC][InstCombine] Add tests for sext-of-trunc-nuw-of-lshr (PR49543)	2021-04-21 00:31:46 +03:00
Roman Lebedev	ea1a0d7c9a	[InstSimplify] Bypass no-op `and`-mask, using known bits (PR49543) We already special-cased a few interesting patterns, but that is strictly less powerful than using KnownBits. So instead get the known bits for the operand of `and`, and iff all the unset bits of the `and`-mask are known to be zeros in the operand, we can omit said `and`.	2021-04-21 00:31:46 +03:00
Roman Lebedev	8cff391995	[NFC][InstSimplify] Add one more test for unneeded 'and'	2021-04-21 00:31:46 +03:00
Roman Lebedev	1e6ca23ab8	[InstCombine] `sext(trunc(x)) --> sext(x)` iff trunc is NSW (PR49543) If we can tell that trunc only chops off sign bits, and not all of them, then we can simply sign-extend the trunc's source.	2021-04-21 00:31:45 +03:00
Roman Lebedev	4e2c4190be	[NFC][InstCombine] Add test for sign-extending NSW trunc (PR49543)	2021-04-21 00:31:45 +03:00
Sanjay Patel	1e202e8f39	[InstCombine] fold shift-of-srem-by-2 to mask+shift There are several potential srem-by-2 folds because the result is known {-1,0,1}. https://alive2.llvm.org/ce/z/LuVyeK	2021-04-20 17:10:16 -04:00
Sanjay Patel	a2099d6542	[InstCombine] add tests for srem-by-2; NFC	2021-04-20 17:10:16 -04:00
Sam Clegg	d2de2d1724	[WebAssembly] Remove unused known_gcc_test_failures.txt. NFC Differential Revision: https://reviews.llvm.org/D100887	2021-04-20 14:07:25 -07:00
Zequan Wu	aa80955f63	[lld-link] Warn on exported deleting dtor MSVC linker has this [[ https://docs.microsoft.com/en-us/cpp/error-messages/tool-errors/linker-tools-warning-lnk4102?view=msvc-160 \| warning]], so lld-link should also warn on this. Differential Revision: https://reviews.llvm.org/D100606	2021-04-20 14:06:31 -07:00
Jez Ng	bb62ef9943	[lld-macho] Ensure segments are laid out contiguously codesign/libstuff checks that the `__LLVM` segment is directly before `__LINKEDIT` by checking that `fileOff + fileSize == next segment fileOff`. Previously, there would be gaps between the segments due to the fact that their fileOffs are page-aligned but their fileSizes aren't. In order to satisfy codesign, we page-align fileOff before calculating fileSize. (I don't think codesign checks for the relative ordering of other segments, so in theory we could do this just for `__LLVM`, but ld64 seems to do it for all segments.) Note that we don't round up the fileSize of the `__LINKEDIT` segment. Since it's the last segment, it doesn't need to worry about contiguity; in addition, codesign checks that the last (hidden) section in `__LINKEDIT` covers the last byte of the segment, so if we rounded up `__LINKEDIT`'s size we would have to do the same for its last section, which is a bother. While at it, I also addressed a FIXME in the linkedit-contiguity.s test to cover more `__LINKEDIT` sections. Reviewed By: #lld-macho, thakis, alexshap Differential Revision: https://reviews.llvm.org/D100848	2021-04-20 16:58:57 -04:00
Jez Ng	1aa29dffce	[lld-macho] Support subtractor relocations that reference sections The minuend (but not the subtrahend) can reference a section. Note that we do not yet properly validate that the subtrahend isn't referencing a section; I've filed PR50034 to track that. I've also extended the reloc-subtractor.s test to reorder symbols, to make sure that the addends are being associated with the minuend (and not the subtrahend) relocation. Fixes PR49999. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D100804	2021-04-20 16:58:57 -04:00
Alexey Bataev	673e2f1b70	[COST][AARCH64] Improve cost of reverse shuffles for AArch64. Introduced the cost of thre reverse shuffles for AArch64, currently just copied the costs for PermuteSingleSrc. Differential Revision: https://reviews.llvm.org/D100871	2021-04-20 13:47:56 -07:00
Petr Hosek	caff17e503	[Driver] Don't use capture for InstalledDir This is another attempt to address the issue introduced in `ae8b2cab67`. We cannot capture InstalledDir because FileCheck doesn't handle the backslashes correctly, so instead we just consume the entire path prefix which is what other tests are doing.	2021-04-20 13:43:56 -07:00
Petr Hosek	f5efe0aa04	[Driver] Support both slashes This addresses Windows breakage introduced by `ae8b2cab67`.	2021-04-20 13:25:38 -07:00
Philip Reames	6792e26c0d	Reapply "Look through invertible recurrences in isKnownNonEqual" I'd reverted this in commit `3b6acb1797` due to buildbot failures. This patch contains the fix for said issue. I'd forgotten to handle the case where two phis in the same block have different operand order. We canonicalize away from this, but it's still valid IR. The tests included in this change (as opposed to simply having test output changed), crashed without the fix. Original commit message follows... This extends the phi handling in isKnownNonEqual with a special case based on invertible recurrences. If we can prove the recurrence is invertible (which many common ones are), we can recurse through the start operands of the recurrence skipping the phi cycle. (Side note: Instcombine currently does not push back through these cases. I will implement that in a follow up change w/separate review.) Differential Revision: https://reviews.llvm.org/D99912	2021-04-20 12:47:59 -07:00
Peter Steinfeld	d667b96c98	[flang] Fix assignment of parameterized derived types We were erroneously emitting error messages for assignments of derived types where the associated objects were instantiated with non-constant LEN type parameters. I fixed this by adding the member function MightBeAssignmentCompatibleWith() to the class DerivedTypeSpec and calling it to determine whether it's possible that objects of parameterized derived types can be assigned to each other. Its implementation first compares the uninstantiated values of the types. If they are equal, it then compares the values of the constant instantiated type parameters. I added tests to assign04.f90 to exercise this new code. Differential Revision: https://reviews.llvm.org/D100868	2021-04-20 12:41:52 -07:00
Jon Roelofs	167da6c9e8	[AArch64][GlobalISel] Clarify fallback debug print ... to only print when that fallback actually happens.	2021-04-20 12:41:14 -07:00
Thomas Lively	693d767c60	[WebAssembly] More codegen for f64x2.convert_low_i32x4_{s,u} `af7925b4dd` added a custom DAG combine for recognizing fp-to-ints of extract_subvectors that could be lowered to f64x2.convert_low_i32x4_{s,u} instructions. This commit extends the combines to recognize equivalent extract_subvectors of fp-to-ints as well. Differential Revision: https://reviews.llvm.org/D100790	2021-04-20 12:37:13 -07:00

1 2 3 4 5 ...

386095 Commits