intel/llvm - llvm - Gitea: Git with a cup of tea

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-22 07:01:03 +08:00

Author	SHA1	Message	Date
Sam McCall	c5c3cdb9c9	[clangd] Populate-switch triggers when the whole condition is selected. This allows vscode to find it as a diagnostic quickfix for -Wswitch. While here, group the code into chunks and add a couple more comments.	2021-08-08 21:06:08 +02:00
Kazu Hirata	d9c9d13365	[DWARF] Remove collectChildrenAddressRanges (NFC) The last use was removed on Dec 21, 2018 in commit `c3f30a7fc6`.	2021-08-08 08:57:32 -07:00
Dimitry Andric	400cd6d2f0	[libomptarget][amdgpu] use --allow-shlib-undefined to link on FreeBSD On FreeBSD, the `environ` symbol is undefined at link time for shared libraries, but resolved by the dynamic linker at runtime. Therefore, allow the symbol to be undefined when creating a shared library, by using the `--allow-shlib-undefined` linker flag, instead of `-z defs` (a.k.a `--no-undefined`). Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D107698	2021-08-08 13:52:44 +02:00
Dimitry Andric	ab4b4684a2	[mlir] Avoid including <alloca.h> on FreeBSD and NetBSD Instead, include `<cstdlib>` which is the canonical header containing the declaration of `alloca()`. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D107699	2021-08-08 13:32:35 +02:00
Dorit Nuzman	67278b8a90	[LV] Support Interleaved Store Group With Gaps Teach LV to use masked-store to support interleave-store-group with gaps (instead of scatters/scalarization). The symmetric case of using masked-load to support interleaved-load-group with gaps was introduced a while ago, by https://reviews.llvm.org/D53668; This patch completes the store-scenario leftover from D53668, and solves PR50566. Reviewed by: Ayal Zaks Differential Revision: https://reviews.llvm.org/D104750	2021-08-08 10:32:02 +03:00
Min-Yih Hsu	657bb7262d	[M68k] Separate ADDA from ADD and migrate rest of the arithmetic MC tests Previously ADD & ADDA (as well as SUB & SUBA) instructions are mixed together, which not only violated Motorola assembly's syntax but also made asm parsing more difficult. This patch separates these two kinds of instructions migrate rest of the tests from test/CodeGen/M68k/Encoding/Arithmetic to test/MC/M68k/Arithmetic. Note that we observed minor regressions on codegen quality: Sometimes isel uses ADD instead of ADDA even the latter can lead to shorter sequence of code. This issue implies that some isel patterns might need to be updated.	2021-08-07 17:19:12 -07:00
Min-Yih Hsu	4c0d15f86f	Update `llvm-readobj` command invocation in extract-section.py Change `-elf-output-style` to `--elf-output-style` to reflect the recent changes on short/long CLI options in LLVM binary utils.	2021-08-07 17:19:12 -07:00
Craig Topper	5894134c6e	[RISCV] Autogenerate test. NFC	2021-08-07 17:11:11 -07:00
Craig Topper	d4ee84ceee	[RISCV] Support FP_TO_S/UINT_SAT for i32 and i64. The fcvt fp to integer instructions saturate if their input is infinity or out of range, but the instructions produce a maximum integer for nan instead of 0 required for the ISD opcodes. This means we can use the instructions to do the saturating conversion, but we'll need to fix up the nan case at the end. We can probably improve the i8 and i16 default codegen as well, but I'll leave that for a follow up. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D107230	2021-08-07 16:06:00 -07:00
Jonas Devlieghere	47a889c668	[lldb] Move Objective-C constants into ObjCConstants.h Move Objective-C constants into ObjCConstants.h and share them between Cocoa and AppleObjCTypeEncodingParser. Differential revision: https://reviews.llvm.org/D107679	2021-08-07 16:04:52 -07:00
Nikita Popov	88003cea1c	[MemCpyOpt] Remove MemDepAnalysis-based implementation The MemorySSA-based implementation has been enabled for a few months (since D94376). This patch drops the old MDA-based implementation entirely. I've kept this to only the basic cleanup of dropping various conditions -- the code could be further cleaned up now that there is only one implementation. Differential Revision: https://reviews.llvm.org/D102113	2021-08-07 22:35:44 +02:00
Rainer Orth	a382a74627	[clang] Fix libclang linking on Solaris Linking `libclang.so` is currently broken on Solaris: ld: fatal: option --version-script requires option -z gnu-version-script-compat to be specified While Solaris `ld` supports a considerable subset of `--version-script`, there are some elements of the syntax that aren't. The fix is equivalent to D78510 <https://reviews.llvm.org/D78510>. Additionally, use of C-style comments is a GNU extension that can easily be avoided by using `#` as comment character, which is supported by GNU `ld`, `gold`, and `lld`. Tested on `amd64-pc-solaris2.11`, `sparcv9-sun-solaris2.11`, `x86_64-pc-linux-gnu`. Differential Revision: https://reviews.llvm.org/D107559	2021-08-07 21:14:15 +02:00
Krishna	a9a176ca3b	[InstCombine] Remove nnan requirement for transformation to fabs from select In this patch, the "nnan" requirement is removed for the canonicalization of select with fcmp to fabs. (i) FSub logic: Remove check for nnan flag presence in fsub. Example: https://alive2.llvm.org/ce/z/751svg (fsub). (ii) FNeg logic: Remove check for the presence of nnan and nsz flag in fneg. Example: https://alive2.llvm.org/ce/z/a_fsdp (fneg). Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D106872	2021-08-07 22:38:45 +05:30
Ye Luo	262289c103	[OpenMP] mark target task untied OpenMP specification Tasking Terminology target task :A mergeable and untied task that ... Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D107686	2021-08-07 12:31:20 -04:00
Craig Topper	618543bb12	[clang][NFC] Fix a -Wparentheses warning.	2021-08-07 08:56:31 -07:00
Craig Topper	24dfba8d50	[X86] Teach shouldSinkOperands to recognize pmuldq/pmuludq patterns. The IR for pmuldq/pmuludq intrinsics uses a sext_inreg/zext_inreg pattern on the inputs. Ideally we pattern match these away during isel. It is possible for LICM or other middle end optimizations to separate the extend from the mul. This prevents SelectionDAG from removing it or depending on how the extend is lowered, we may not be able to generate an AssertSExt/AssertZExt in the mul basic block. This will prevent pmuldq/pmuludq from being formed at all. This patch teaches shouldSinkOperands to recognize this so that CodeGenPrepare will clone the extend into the same basic block as the mul. Fixes PR51371. Differential Revision: https://reviews.llvm.org/D107689	2021-08-07 08:45:56 -07:00
Craig Topper	8a2d1b183d	[X86] Add test cases for pr51371. NFC	2021-08-07 08:45:56 -07:00
Kazu Hirata	c21f6dc8a4	[IR] Remove unused declaration InitializeTypeMap (NFC) The corresponding function definition was removed on Apr 23, 2016 in commit `a59d3e5af8`.	2021-08-07 07:37:54 -07:00
Roman Lebedev	0a241e90d4	[NFC][InstCombine] `vector_reduce_xor(?ext(<n x i1>))` --> `?ext(vector_reduce_add(<n x i1>))` Instead of expanding it ourselves, we can just forward to `?ext(vector_reduce_add(<n x i1>))`, as per alive2: https://alive2.llvm.org/ce/z/ymz7zE (self) https://alive2.llvm.org/ce/z/eKu2v2 (skipped zext) https://alive2.llvm.org/ce/z/c3BXgc (skipped sext)	2021-08-07 17:31:33 +03:00
Roman Lebedev	c6ff867f92	[NFC][InstCombine] Simplify emitted IR for `vector_reduce_xor(?ext(<n x i1>))` Now that we canonicalize low bit splatting to the form we were emitting here ourselves, emit simpler IR that will be canonicalized later. See `1e801439be` for proofs: https://alive2.llvm.org/ce/z/MjCm5W (self) https://alive2.llvm.org/ce/z/kgqF4M (skipped zext) https://alive2.llvm.org/ce/z/pgy3HP (skipped sext)	2021-08-07 17:31:24 +03:00
Roman Lebedev	e71870512f	[InstCombine] Prefer `-(x & 1)` as the low bit splatting pattern (PR51305) Both patterns are equivalent (https://alive2.llvm.org/ce/z/jfCViF), so we should have a preference. It seems like mask+negation is better than two shifts.	2021-08-07 17:25:28 +03:00
Roman Lebedev	d88d279e76	[NFC][InstCombine] Add tests for low bit splatting pattern (PR51305)	2021-08-07 17:25:28 +03:00
Roman Lebedev	d05d4e7f7e	[NFC][InstCombine] Autogenerate checklines in a few tests being affected by an upcoming change	2021-08-07 17:25:27 +03:00
Christudasan Devadasan	ffc3fb665d	SROA: Enhance speculateSelectInstLoads Allow the folding even if there is an intervening bitcast. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D106667	2021-08-07 09:09:14 -04:00
Florian Hahn	a00aafc30d	[VPlan] Iterate over phi recipes to detect reductions to fix. After refactoring the phi recipes, we can now iterate over all header phis in a VPlan to detect reductions when it comes to fixing them up when tail folding. This reduces the coupling with the cost model & legal by using the information directly available in VPlan. It also removes a call to getOrAddVPValue, which references the original IR value which may become outdated after VPlan transformations. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D100102	2021-08-07 14:06:50 +01:00
Andrea Di Biagio	45685a1fc4	[MCA] Simplify the rounding logic used in TimelineView::printWaitTimeEntry. This is related to PR51392. Before this patch, the timeline view was rounding doubles to the first decimal, using a logic similar to this: ``` double AverageTime = (double)Input / CumulativeExecutions; double Result = floor((AverageTime * 10) + 0.5) / 10 ``` Here, Input and CumulativeExecutions are both unsigned integers. The last operation is what effectively performs the rounding of AverageTime. PR51392 has been raised because - under specific -m32 configurations of GCC - one of the timeline tests reports slighlty different values (due to a different rounding choice). This patch tries to minimise the propagation of floating-point error by hoisting the multiply by 10, so that it is performed on the unsigned. ``` double AverageTime = (double)(Input * 10) / CumulativeExecutions; floor(AverageTime + 0.5) / 10 ``` So we are trading a floating point multiply for a integer multiply (which can be expanded using a simple MUL or using an `ADD + LEA` sequence). This decrease in floating point operations executed should also help with decreasing the error in the computation.. Strictly speaking, that computation will always be potentially subject to error (depending on what values are passed in input). However, this patch should improve the situation and make bug like PR51392 less frequent.	2021-08-07 11:59:41 +01:00
Simon Atanasyan	454f69bcc1	[LLD] Add required `ppc` target to the test cases. NFC	2021-08-07 13:29:59 +03:00
Simon Atanasyan	c6ebc651b6	[LLD] Support compressed input sections on big-endian targets This patch enables compressed input sections on big-endian targets by checking the target endianness and selecting an appropriate `Chdr` structure. Fixes PR51369 Differential Revision: https://reviews.llvm.org/D107635	2021-08-07 13:20:13 +03:00
Amara Emerson	4c2e01232c	[GlobalISel] Fix a combine causing DBG_VALUE with dangling vregs. We should use MachineInstr::eraseFromParentAndMarkDBGValuesForRemoval() instead of eraseFromParent(). We should probably use that in other places too but fix this issue which affects clang bootstrap builds for now.	2021-08-07 01:41:02 -07:00
Roger Ferrer Ibanez	bfb77364d0	[OpenMP] Fix accidental reuse of VLA size We were using an OpaqueValueExpr allocated on the stack to store the size of a VLA. Because the VLASizeMap in CodegenFunction uses the address of the expression to avoid recomputing VLAs, we were accidentally reusing an earlier llvm::Value. This led to invalid LLVM IR. This is a temporary solution until VLASizeMap can be pushed and popped based on the context. Differential Revision: https://reviews.llvm.org/D107666	2021-08-07 05:55:27 +00:00
Nemanja Ivanovic	62fe3dcf98	Fix PPC buildbot break caused by `4c4093e6e3` This commit adds the isnan intrinsic and provides a default expansion for it in the SDAG. However, it makes the assumption that types it operates on are IEEE-compliant types. This is not always the case. An example of that is PPC "double double" which has a representation that - Does not need to conform to IEEE requirements for isnan as it is not an IEEE-compliant type - Does not have a representation that allows for straightforward reinterpreting as an integer and use of integer operations The result was that this commit broke __builtin_isnan for ppc_fp128 making many valid numeric values report a NaN. This patch simply changes the expansion to always expand to unordered comparison (regardless of whether FP exceptions are tracked). This is inline with previous semantics.	2021-08-06 22:10:20 -05:00
Matt Jacobson	71e71067f3	[AVR][clang] Add '$SYSROOT/avr' to possible avr-libc locations Reviewed by: benshi001 Differential Revision: https://reviews.llvm.org/D107672	2021-08-07 10:24:14 +08:00
Roland McGrath	5a2a179695	[profile][Fuchsia] Add missing system header #include The _zx_vmar_root_self function is not a system call but a libc function declared in a separate header. Reviewed By: gulfem Differential Revision: https://reviews.llvm.org/D107616	2021-08-06 17:59:35 -07:00
Jonas Devlieghere	9d5e95d094	Re-land "[lldb] Upstream support for Foundation constant classes" Upstream support for NSConstantArray, NSConstantIntegerNumber, NSConstant{Float,Double}Number and NSConstantDictionary. We would've upstreamed this earlier but testing it requires -fno-constant-nsnumber-literals, -fno-constant-nsarray-literals and -fno-constant-nsdictionary-literals which haven't been upstreamed yet. As a temporary workaround use the system compiler (xcrun clang) for the constant variant of the tests. I'm just upstreaming this. The patch and the tests were all authored by Fred Riss. Differential revision: https://reviews.llvm.org/D107660	2021-08-06 17:24:47 -07:00
Sterling Augustine	4e5af6ef48	Revert "[lldb] Upstream support for Foundation constant classes" This reverts commit `34d78b6a67`. This breaks build bots witha missing file: /home/worker/2.0.1/lldb-x86_64-debian/llvm-project/lldb/source/Plugins/Language/ObjC/Cocoa.cpp:10:10: fatal error: 'objc/runtime.h' file not found	2021-08-06 16:56:59 -07:00
Jim Ingham	bfeb281fbd	Use LC_DYLD_EXPORTS_TRIE to locate the dyld trie structure if present The pointer to the dyld trie data structure which lldb needs to parse to get "trampoline kinds" on Darwin used to be a field in the LC_DYLD_INFO load command. A new load command was added recently dedicated to this purpose: LC_DYLD_EXPORTS_TRIE. The format of the trie did not change, however. So all we have to do is use the new command if present. The commands are supposed to be mutually exclusive, so I added an lldb_assert to warn if they are not. Differential Revision: https://reviews.llvm.org/D107673	2021-08-06 16:38:34 -07:00
Max Kudryavtsev	0b8cb87e0d	[MLIR][STD] Add safe scalar constant propagation for FPTruncOp Perform scalar constant propagation for FPTruncOp only if the resulting value can be represented without precision loss or rounding. Example: %cst = constant 1.000000e+00 : f32 %0 = fptrunc %cst : f32 to bf16 --> %cst = constant 1.000000e+00 : bf16 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D107518	2021-08-06 16:31:29 -07:00
Dave Airlie	1854db74c5	opencl-c.h: add 3.0 optional extension support for a few more bits These 3 are fairly simple, pipes, workgroups and subgroups. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D105858	2021-08-07 09:25:00 +10:00
Steffen Larsen	1b4c85fc02	[NVPTX] Add NVPTX intrinsics for CUDA PTX 6.5 ldmatrix instructions Adds NVPTX intrinsics for the CUDA PTX `ldmatrix.sync.aligned` instructions added in PTX 6.5. PTX ISA description of `ldmatrix.sync.aligned`: https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#warp-level-matrix-instructions-ldmatrix Authored-by: Steffen Larsen <steffen.larsen@codeplay.com> Reviewed By: tra Differential Revision: https://reviews.llvm.org/D107046	2021-08-06 16:13:35 -07:00
Jonas Devlieghere	34d78b6a67	[lldb] Upstream support for Foundation constant classes Upstream support for NSConstantArray, NSConstantIntegerNumber, NSConstant{Float,Double}Number and NSConstantDictionary. We would've upstreamed this earlier but testing it requires -fno-constant-nsnumber-literals, -fno-constant-nsarray-literals and -fno-constant-nsdictionary-literals which haven't been upstreamed yet. As a temporary workaround use the system compiler (xcrun clang) for the constant variant of the tests. I'm just upstreaming this. The patch and the tests were all authored by Fred Riss. Differential revision: https://reviews.llvm.org/D107660	2021-08-06 16:08:48 -07:00
Stanislav Mekhanoshin	1962b33d3f	[AMDGPU] Added test for MachineLICM reg pressure. NFC. The test shows excessive register pressure after the MachineLICM. This is a pre-commit for the patch fixing it. Differential Revision:	2021-08-06 16:00:35 -07:00
Jim Ingham	f362b05d0d	Add a "current" token to the ThreadID option to break set/modify. This provides a convenient way to limit a breakpoint to the current thread when setting it from the command line w/o having to figure out what the current thread is. Differential Revision: https://reviews.llvm.org/D107015	2021-08-06 15:29:55 -07:00
Paul Robinson	3229c97151	Revert "[lit] Have REQUIRES support the target triple" This reverts commit `100a7b6197`. Speculating that this is the reason behind a sanitizer failure: https://lab.llvm.org/buildbot/#/builders/37/builds/5945	2021-08-06 15:03:49 -07:00
Zequan Wu	2129c4a861	Fix Windows bots failure caused by `8c4208d5c1`	2021-08-06 15:03:00 -07:00
Joseph Huber	41a6b50c25	[OpenMP]Fix PR51349: Remove AlwaysInline for if regions. After D94315 we add the `NoInline` attribute to the outlined function to handle data environments in the OpenMP if clause. This conflicted with the `AlwaysInline` attribute added to the outlined function. for better performance in D106799. The data environments should ideally not require NoInline, but for now this fixes PR51349. Reviewed By: mikerice Differential Revision: https://reviews.llvm.org/D107649	2021-08-06 17:53:04 -04:00
Sanjay Patel	0369714b31	[InstCombine] reduce vector casting before icmp There may be some generalizations (see test comments) of these patterns, but this should handle the cases motivated by: https://llvm.org/PR51315 https://llvm.org/PR51259 The backend may want to transform differently, but at least for the x86 examples that I looked at, there does not appear to be any significant perf diff either way.	2021-08-06 17:09:38 -04:00
Sanjay Patel	67d499445d	[InstCombine] add tests for icmp of casted vector; NFC https://llvm.org/PR51315	2021-08-06 17:09:38 -04:00
Michael Liao	05783e1cfe	[amdgpu] Revise the conversion from i64 to f32. - Replace 'cmp+sel' with 'umin' if possible. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D107507	2021-08-06 17:01:47 -04:00
Zequan Wu	8c4208d5c1	[Profile][NFC] Clean up initializeProfileForContinuousMode Merge two versions of `initializeProfileForContinuousMode` function into one. Differential Revision: https://reviews.llvm.org/D107591	2021-08-06 14:00:36 -07:00
Nick Desaulniers	d238b60285	[Clang][DiagnosticSemaKinds] combine diagnostic texts The diagnostic texts for warning on attributes that don't appear on the initial declaration is generally useful. We'd like to re-use it in D106030, but first let's combine two that already are very similar so we may re-use it a third time in that commit. Also, fix a few places that were using notePreviousDefinition to point to declarations, to instead use diag::note_previous_declaration. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D107613	2021-08-06 13:58:21 -07:00

1 2 3 4 5 ...

396060 Commits