intel/llvm - llvm - Gitea: Git with a cup of tea

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-02-04 20:00:11 +08:00

Author	SHA1	Message	Date
Hans Wennborg	369ac75853	[libFuzzer] Fix two typos	2021-10-01 13:15:43 +02:00
Alexander Belyaev	693c61b2e0	[mlir] Enable loop peeling for "reduction" dimensions of tiled_loop. Differential Revision: https://reviews.llvm.org/D110919	2021-10-01 13:07:57 +02:00
Fraser Cormack	fcaa64d947	[RISCV][NFC] Add closing parentheses to frame layout comments	2021-10-01 11:58:55 +01:00
Michał Górny	58b4501eea	[lldb] [Host] Refactor TerminalState Refactor TerminalState to make the code simpler. Move 'struct termios' to a PImpl-style subclass. Add an RAII interface to automatically store and restore the state. Differential revision: https://reviews.llvm.org/D110721	2021-10-01 12:53:21 +02:00
Kadir Cetinkaya	512aa84850	[clangd] Handle members of anon structs in SelectionTree References to fields inside anon structs contain an implicit children for the container, which has the same SourceLocation with the field. This was resulting in SelectionTree always picking the anon-struct rather than the field as the selection. This patch prevents that by claiming the range for the field early. https://github.com/clangd/clangd/issues/877. Differential Revision: https://reviews.llvm.org/D110825	2021-10-01 12:38:18 +02:00
Florian Hahn	413b7ac6b5	[BasicAA] Add test showing 32 bit overflow issue for GEPs. This patch additional tests with i64 GEP indices for 32 bit pointers. @mustalias_overflow_in_32_bit_add_mul_gep highlights a case where BasicAA currently incorrectly determines noalias. Modeled in Alive2 for 32 bit pointers: https://alive2.llvm.org/ce/z/HHjQgb Modeled in Alive2 for 64 bit pointers: https://alive2.llvm.org/ce/z/DoWK2c	2021-10-01 11:37:56 +01:00
Matthew Devereau	f085a9db8b	[AArch64][SVE] Replace fmul, fadd and fsub LLVM IR instrinsics with LLVM IR binary ops Replacing fmul and fadd instrinsics with their binary ops results more succinct AArch64 SVE output, e.g.: 4: 65428041 fmul z1.h, p0/m, z1.h, z2.h 8: 65408020 fadd z0.h, p0/m, z0.h, z1.h -> 4: 65620020 fmla z0.h, p0/m, z1.h, z2.h	2021-10-01 11:24:46 +01:00
Kerry McLaughlin	c1d46d3461	[SLPVectorizer] Fix crash in isShuffle with scalable vectors D104809 changed `buildTree_rec` to check for extract element instructions with scalable types. However, if the extract is extended or truncated, these changes do not apply and we assert later on in isShuffle(), which attempts to cast the type of the extract to FixedVectorType. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D110640	2021-10-01 10:56:44 +01:00
Florian Hahn	57fbb9ed0e	[llvm-reduce] Skip updating calls where OldF isn't the called fn. When replacing function calls, skip call instructions where the old function is not the called function, but e.g. the old function is passed as an argument. This fixes a crash due to trying to construct invalid IR for the test case. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D109759	2021-10-01 10:52:48 +01:00
David Spickett	81d2cea690	Revert "[libcxx][test] Use python specified by build rather than system default python" This reverts commit `9f641c96cb`. The "python" command in gdb uses the python gdb is linked to, not what "python" would give you if you used it directly in the shell.	2021-10-01 09:45:44 +00:00
David Spickett	5fbe9e40d1	Revert "[libcxx] Run u16string tests for gdb pretty printers" This reverts commit `e9564c3698` due to a report of these tests failing.	2021-10-01 09:45:14 +00:00
Krasimir Georgiev	685f1bfd0a	Revert "[LoopVectorize] Permit vectorisation of more select(cmp(), X, Y) reduction patterns" It appears to cause stage2 clang build failures, e.g., https://lab.llvm.org/buildbot/#/builders/74/builds/7145. This reverts commit `1fb37334bd`.	2021-10-01 11:39:43 +02:00
Balázs Kéri	cad9ff531c	[clang][ASTImporter] Import ConstructorUsingShadowDecl correctly. Fix import of ConstructorUsingShadowDecl and add tests. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D110398	2021-10-01 11:41:08 +02:00
David Spickett	3780de4600	[flang][driver] Error if uuidgen is not installed Ubuntu Bionic installs it by default, Focal does not. Differential Revision: https://reviews.llvm.org/D110694	2021-10-01 09:42:58 +01:00
Gabor Marton	a3a0b06626	[clang][ASTImporter] Import InheritedConstructor and ConstructorUsingShadowDecl. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D110395	2021-10-01 10:16:11 +02:00
David Sherwood	1fb37334bd	[LoopVectorize] Permit vectorisation of more select(cmp(), X, Y) reduction patterns This patch adds further support for vectorisation of loops that involve selecting an integer value based on a previous comparison. Consider the following C++ loop: int r = a; for (int i = 0; i < n; i++) { if (src[i] > 3) { r = b; } src[i] += 2; } We should be able to vectorise this loop because all we are doing is selecting between two states - 'a' and 'b' - both of which are loop invariant. This just involves building a vector of values that contain either 'a' or 'b', where the final reduced value will be 'b' if any lane contains 'b'. The IR generated by clang typically looks like this: %phi = phi i32 [ %a, %entry ], [ %phi.update, %for.body ] ... %pred = icmp ugt i32 %val, i32 3 %phi.update = select i1 %pred, i32 %b, i32 %phi We already detect min/max patterns, which also involve a select + cmp. However, with the min/max patterns we are selecting loaded values (and hence loop variant) in the loop. In addition we only support certain cmp predicates. This patch adds a new pattern matching function (isSelectCmpPattern) and new RecurKind enums - SelectICmp & SelectFCmp. We only support selecting values that are integer and loop invariant, however we can support any kind of compare - integer or float. Tests have been added here: Transforms/LoopVectorize/AArch64/sve-select-cmp.ll Transforms/LoopVectorize/select-cmp-predicated.ll Transforms/LoopVectorize/select-cmp.ll Differential Revision: https://reviews.llvm.org/D108136	2021-10-01 08:41:03 +01:00
Sander de Smalen	b62e6f19d7	[SelectionDAG] Handle promotion + widening in getCopyToPartsVector Some vectors require both widening and promotion for their legalization. This case is not yet handled in getCopyToPartsVector and falls back on scalarizing by default. BBecause scalable vectors can't easily be scalarised, we need to implement this in two separate stages: 1. Widen the vector. 2. Promote the vector. As part of this patch, PromoteIntRes_CONCAT_VECTORS also needed to be made scalable aware. Instead of falling back on scalarizing the vector (fixed-width only), each sub-part of the CONCAT vector is promoted, and the operation is performed on the type with the widest element type, finally truncating the result to the promoted result type. Differential Revision: https://reviews.llvm.org/D110646	2021-10-01 08:19:47 +01:00
Valentin Clement	a149b103ca	[fir][NFC] Move fir.select_type builder to cpp file Move the big builder out of the td file to the cpp file. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D110820	2021-10-01 09:19:39 +02:00
Valentin Clement	b04dd35f0e	[fir][NFC] Update doc for pinned attr in fir.alloca Add descritpion for the attribute added in D110815. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D110877	2021-10-01 09:18:09 +02:00
Jean Perier	7a6ab39e71	[flang] Revert 3 commits pushed by mistake along `b7c07ce15f` Revert "[flang][NFC] Add debug dump method to evaluate::Expr and semantics::Symbol" This reverts commit `b0e35fde21`. Revert "[flang] Add a wrapper for Fortran main program" This reverts commit `2c1ce0755e`. Revert "[flang][NFC] Fix header comments in some runtime headers" This reverts commit `a63f57674d`.	2021-10-01 09:01:31 +02:00
Jean Perier	b7c07ce15f	[flang] Improve runtime interface with C99 complex Follow up of https://reviews.llvm.org/D83397. In folding, make pgmath usage conditional to C99 complex support in C++. Disable warning in such case. In lowering, use an empty class type to indicate C99 complex type in runtime interface. Add a unit test enforcing C99 complex can be processed by FIR runtime interface builder. Differential Revision: https://reviews.llvm.org/D110860	2021-10-01 08:45:24 +02:00
Jean Perier	b0e35fde21	[flang][NFC] Add debug dump method to evaluate::Expr and semantics::Symbol Helps debugging when working with symbol/expression issue. The dump method is easy to call in the debugger.	2021-10-01 08:45:20 +02:00
Jean Perier	2c1ce0755e	[flang] Add a wrapper for Fortran main program Add a C wrapper that calls the Fortran runtime initialization and finalization routines as well as the compiled Fortran main program _QQmain. Place it in its own library to satisfy shared library builds since it contains a C main function. - `cc7ac498f9 (diff-fa35a5efa62731fd2845e5e982eca9a2e36439783e11a4e4a463753c2160ec10R53)` - was created in flang/test/Examples/main.c in Eric's branch	2021-10-01 08:45:20 +02:00
Jean Perier	a63f57674d	[flang][NFC] Fix header comments in some runtime headers	2021-10-01 08:45:20 +02:00
Teresa Johnson	d047368149	[MemProf] Loosen matching of profile data to avoid bot flakes Allow for the allocations to have migrated cpus, assuming they wouldn't is causing some bot flakiness, e.g.: https://lab.llvm.org/buildbot/#/builders/37/builds/7197	2021-09-30 21:22:40 -07:00
Koutheir Attouchi	16661b1a3c	Expose `DIBuilder::finalizeSubprogram()` through the LLVM C API The LLVM C API function is called `LLVMDIBuilderFinalizeSubprogram()`. Reviewed By: CodaFi Differential Revision: https://reviews.llvm.org/D104794	2021-09-30 20:59:41 -07:00
Albion Fung	29bb877499	[PowerPC] Fix lharx and lbarx builtin signatures The signatures for the PowerPC builtins lharx and lbarx are incorrect, and causes issues when used in a function that requires the return of the builtin to be promoted. This patch fixes these signatures. Differential revision: https://reviews.llvm.org/D110273	2021-09-30 22:36:13 -05:00
Vitaly Buka	d2df5ce294	[NFC][asan] Remove redundant functions	2021-09-30 19:38:23 -07:00
Vitaly Buka	051d766bae	[NFC][lsan] Expand use StackDepotReverseMap Before StackDepotReverseMap was used only by ProcessPC.	2021-09-30 19:26:47 -07:00
Vitaly Buka	548aa9022e	[NFC][sanitizer] Lazy init in StackDepotReverseMap	2021-09-30 19:26:34 -07:00
LLVM GN Syncbot	fcdefc8575	[gn build] Port `3077bc90de`	2021-10-01 00:43:50 +00:00
Christopher Tetreault	3077bc90de	[NFC] Restore magic and magicu to a globally visible location While these functions are only used in one location in upstream, it has been reused in multiple downstreams. Restore this file to a globally visibile location (outside of APInt.h) to eliminate donwstream breakage and enable potential future reuse. Additionally, this patch renames types and cleans up clang-tidy issues.	2021-09-30 17:43:12 -07:00
ZijunZhao	91bfccf837	add tsan shared library	2021-10-01 00:19:35 +00:00
Vitaly Buka	5c3568d01f	[NFC][sanitizer] Add const into method	2021-09-30 17:16:34 -07:00
Yonghong Song	3562ad3ebe	BPF: implement isLegalAddressingMode() properly Latest upstream llvm caused the kernel bpf selftest emitting the following warnings: In file included from progs/profiler3.c:6: progs/profiler.inc.h:489:2: warning: loop not unrolled: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering [-Wpass-failed=transform-warning] for (int i = 0; i < MAX_PATH_DEPTH; i++) { ^ Further bisecting shows this SimplifyCFG patch ([1]) changed the condition on how to fold branch to common dest. This caused some unroll pragma is not honored in selftests/bpf. The patch [1] test getUserCost() as the condition to perform the certain basic block folding transformation. For the above example, before the loop unroll pass, the control flow looks like: cond_block: branch target: body_block, cleanup_block body_block: branch target: cleanup_block, end_block end_block: branch target: cleanup_block, end10_block end10_block: %add.ptr = getelementptr i8, i8* %payload.addr.0, i64 %call2 %inc = add nuw nsw i32 %i.0, 1 branch target: cond_block In the above, %call2 is an unknown scalar. Before patch [1], end10_block will be folded into end_block, forming the code like cond_block: branch target: body_block, cleanup_block body_block: branch target: cleanup_block, end_block end_block: branch target: cleanup_block, cond_block and the compiler is happy to perform unrolling. With patch [1], getUserCost(), which calls getGEPCost(), which calls isLegalAddressingMode() in TargetLoweringBase.cpp, considers IR %add.ptr = getelementptr i8, i8* %payload.addr.0, i64 %call2 is free, so the above basic block folding transformation is not performed and unrolling does not happen. For BPF target, the IR %add.ptr = getelementptr i8, i8* %payload.addr.0, i64 %call2 is not free and we don't have ld/st instruction address with 'r+r' mode. This patch implemented a BPF hook for isLegalAddressingMode(), which is identical to Mips isLegalAddressingMode() implementation where the address pattern like 'r+r', 'r+r+i' or '2*r' are not allowed. With testing kernel bpf selftests, all loop not unrolled warnings are gone and all selftests run successfully. [1] https://reviews.llvm.org/D108837 Differential Revision: https://reviews.llvm.org/D110789	2021-09-30 16:41:15 -07:00
Philip Reames	bdb5aa65b1	[test] Add tests covering a missing opt in SCEV's isSCEVExprNeverPoison	2021-09-30 16:15:06 -07:00
Leonard Chan	9f641c96cb	[libcxx][test] Use python specified by build rather than system default python As of `e9564c3698`, libcxx/gdb/gdb_pretty_printer_test.sh.cpp fails locally for me because the REQUIRES check for host-has-gdb-with-python uses python, which for me expands to python 2.7.18. This failure does not seem to be caught on any upstream builders, potentially because they don't have gdb, python, or a version of python that makes the test UNSUPPORTED (like python3). This updates the check to use the python specified by the build (which should be the python that runs this code), rather than just python. Differential Revision: https://reviews.llvm.org/D110887	2021-09-30 15:34:30 -07:00
Philip Reames	c5e491e6ee	[SCEV] Modernize code style of isSCEVExprNeverPoison [NFC] Use for-range and all_of to make code easier to read in advance of other changes.	2021-09-30 15:13:43 -07:00
Teresa Johnson	0d8bdc1786	[MemProf] Record accesses for all words touched in mem intrinsic Previously for mem* intrinsics we only incremented the access count for the first word in the range. However, after thinking it through I think it makes more sense to record an access for every word in the range. This better matches the behavior of inlined memory intrinsics, and also allows better analysis of utilization at a future date. Differential Revision: https://reviews.llvm.org/D110799	2021-09-30 15:07:55 -07:00
Rafael Auler	c82f98ba4c	[MC] Fix buildbots with shared lib builds In D109412 I forgot to add a dependency on libObject. Fix that. Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D110886	2021-09-30 14:42:15 -07:00
Amara Emerson	ca8316b704	[GlobalISel] Extend CombinerHelper::matchConstantOp() to match constant splat vectors. This allows the "x op 0 -> x" fold to optimize vector constant RHSs. Differential Revision: https://reviews.llvm.org/D110802	2021-09-30 14:31:25 -07:00
Jean Perier	fdcbb540fc	[flang][NFC] Add debug dump method to evaluate::Expr and semantics::Symbol Helps debugging when working with symbol/expression issue. The dump method is easy to call in the debugger. Co-authored-by: Eric Schweitz <eschweitz@nvidia.com> Differential Revision: https://reviews.llvm.org/D110856	2021-09-30 23:26:46 +02:00
Craig Topper	a21c557955	[RISCV] Remove Zbproposedc extension This consists of 3 compressed instructions, c.not, c.neg, and c.zext.w. I believe these have been picked up by the Zce effort using different encodings. I don't think it makes sense to keep them in bitmanip. It will eventually cause a conflict if/when Zce is implemented in llvm. Differential Revision: https://reviews.llvm.org/D110871	2021-09-30 14:23:05 -07:00
Jean Perier	962e503cc8	[flang] Take into account SubprogramDetails in GetInterfaceSymbol When the ProcRef is Symbol is a SubprogramDetails, the interface is the SubprogramDetails. Do not return nullptr. Differential Revision: https://reviews.llvm.org/D110853	2021-09-30 23:06:32 +02:00
Jon Chesterfield	72e8a4c45d	[openmp][docs] Describe how the internal components are found Add a FAQ entry about the names of openmp offloading components and how they are searched for. Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D109619	2021-09-30 22:05:12 +01:00
Jean Perier	cf1f5fbdfc	[flang][NFC] Fix header comments in some runtime headers Differential Revision: https://reviews.llvm.org/D110850	2021-09-30 23:04:23 +02:00
Petr Hosek	0c4a75f193	[CMake] Remove the LLD LTO check for Darwin LLD now supports LTO on Darwin. Differential Revision: https://reviews.llvm.org/D110881	2021-09-30 14:00:31 -07:00
Gwen Mittertreiner	72e7e15a12	[compiler-rt] Add -fno-omit-frame-pointer check to builtins rG210d72e9d6b4a8e7633921d0bd7186fd3c7a2c8c moved the check from builtin-config-ix to config-ix so that the check would be made even when the builtins are not built. However, now the check is no longer made when the builtins are built standalone which causes the builtins to fail to build. Add the check back to builtins-config-ix so that the check gets performed both when the builtins are not built, and when they are built standalone. Reviewed By: smeenai Differential Revision: https://reviews.llvm.org/D110879	2021-09-30 13:53:13 -07:00
Jon Chesterfield	3247329107	[openmp] Add addrspacecast to getOrCreateIdent Fixes 51982. Adds a missing CreatePointerCast and allocates a global in the correct address space. Test case derived from https://github.com/ROCm-Developer-Tools/aomp/\ blob/aomp-dev/test/smoke/nest_call_par2/nest_call_par2.c by deleting parts while checking the assertion failure still occurred. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D110556	2021-09-30 21:36:31 +01:00
Jon Chesterfield	b75a7481ba	[libomptarget] Apply D110029 to amdgpu Use enum for execution mode. This is partly a port from ROCm and partly a port from D110029. Attempted to make the same choices as ROCm as far as comments etc go to reduce the merge conflicts. There is some cleanup warranted here - in particular I like the cuda patch factoring out the comparisons into named variables - but I'd like to leave that for a follow up patch, keeping this one minimal. Reviewed By: carlo.bertolli Differential Revision: https://reviews.llvm.org/D110845	2021-09-30 21:29:37 +01:00

1 2 3 4 5 ...

400498 Commits