intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-22 23:49:22 +08:00

Author	SHA1	Message	Date
Karl-Johan Karlsson	ff426a6250	[compiler-rt] Fix signed integer overflow in int_mulo_impl.inc When compiling compiler-rt with -fsanitize=undefined and running testcases you end up with the following warning: UBSan:/repo/uabkaka/llvm-project/compiler-rt/lib/builtins/int_mulo_impl.inc:24:23: signed integer overflow: -1 * -2147483648 cannot be represented in type 'si_int' (aka 'long') This can be avoided by doing the multiplication in a matching unsigned variant of the type. This was found in an out of tree target. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D146623	2023-03-24 11:02:34 +01:00
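
The approach described in ff426a6250 (multiplying in the matching unsigned type) can be illustrated with a minimal C++ sketch. This is not the compiler-rt source: `mul_with_overflow_flag` is a hypothetical helper, the 64-bit overflow check is chosen only for brevity, and the sketch assumes a 32-bit `int` so that `std::uint32_t` operands are not promoted to a signed type.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Hypothetical helper, not the compiler-rt implementation.
// Returns the wrapped 32-bit product and reports whether the exact
// product fits in int32_t, without ever overflowing a signed multiply.
inline std::int32_t mul_with_overflow_flag(std::int32_t a, std::int32_t b,
                                           bool &overflow) {
  // Unsigned multiplication wraps modulo 2^32, so it is always defined.
  const std::uint32_t wrapped =
      static_cast<std::uint32_t>(a) * static_cast<std::uint32_t>(b);

  // Illustration only: detect overflow with a wider signed type.
  const std::int64_t wide =
      static_cast<std::int64_t>(a) * static_cast<std::int64_t>(b);
  overflow = wide < std::numeric_limits<std::int32_t>::min() ||
             wide > std::numeric_limits<std::int32_t>::max();

  // Modular conversion back to signed (guaranteed since C++20,
  // implementation-defined but modular on mainstream compilers before).
  return static_cast<std::int32_t>(wrapped);
}
```

With the operands from the UBSan report, `-1` and `-2147483648`, the helper sets the overflow flag and returns the wrapped value instead of triggering undefined behaviour.
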
Ben Shi	2a6e39dbf8	[AVR] Do not emit 'LPM Rd, Z' on devices without FeatureLPMX The 'LPM' instruction has three forms: ------------------------ \| form \| feature \| \| ---------- \| --------\| \| LPM \| hasLPM \| \| LPM Rd, Z \| hasLPMX \| \| LPM Rd, Z+ \| hasLPMX \| ------------------------ The second form is always selected in ISelDAGToDAG, even on devices without FeatureLPMX. This patch emits "LPM + MOV" on devices with only FeatureLPM. Reviewed By: jacquesguan Differential Revision: https://reviews.llvm.org/D141246	2023-03-24 17:47:24 +08:00
David Sherwood	bd0c281fcd	[NFC][LoopVectorize] Change trip counts for some tests to guarantee a scalar tail Quite a few vectoriser tests were using a trip count of 1024, which meant: 1. For fixed-length VFs we would never actually tail-fold, e.g. see Transforms/LoopVectorize/RISCV/uniform-load-store.ll. This is because we can prove at compile-time there will never be a scalar tail. 2. As of D146199 the same optimisation mentioned above will also apply to scalable VFs too. I've changed all such trip counts to be 1025 instead. Differential Revision: https://reviews.llvm.org/D146219	2023-03-24 09:43:50 +00:00
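
Why 1025 rather than 1024 guarantees a scalar tail can be seen from a simplified model. The sketch below assumes a fixed vectorization factor and ignores interleaving and runtime checks; `scalar_tail_iterations` is a hypothetical name, not an LLVM function.

```cpp
#include <cassert>

// Iterations left over for the scalar tail when a loop with a known trip
// count is vectorized with a fixed vectorization factor `vf`.
// Simplified model: no interleaving, no tail folding.
constexpr unsigned scalar_tail_iterations(unsigned trip_count, unsigned vf) {
  return trip_count % vf;
}
```

For any power-of-two `vf` up to 1024, a trip count of 1024 leaves no remainder, while 1025 always leaves exactly one iteration.
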
Akshay Khadse	0c36ab1908	[NFC] Fix auto usage to avoid copies Fixes some usages of the "auto" keyword to avoid creation of copies. Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D146694	2023-03-24 17:29:48 +08:00
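
The kind of copy that 0c36ab1908 avoids can be shown with a small, self-contained sketch. The `CopyCounter` type and both function names are hypothetical and exist only to make the copies observable; they are not taken from the patch.

```cpp
#include <cassert>
#include <vector>

// Hypothetical type that counts how often it is copy-constructed.
struct CopyCounter {
  static int copies;
  CopyCounter() = default;
  CopyCounter(const CopyCounter &) { ++copies; }
};
int CopyCounter::copies = 0;

// `auto item` deduces a value type, so each element is copied.
int copies_when_iterating_by_value(const std::vector<CopyCounter> &items) {
  CopyCounter::copies = 0;
  for (auto item : items) {
    (void)item;
  }
  return CopyCounter::copies;
}

// `const auto &item` binds a reference, so no element is copied.
int copies_when_iterating_by_reference(const std::vector<CopyCounter> &items) {
  CopyCounter::copies = 0;
  for (const auto &item : items) {
    (void)item;
  }
  return CopyCounter::copies;
}
```
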
mydeveloperday	b94175b0ee	[clang-format] NFC Format.h and ClangFormatStyleOptions.rst are out of date Regenerate the style documentation, requires some minor sphinx changes to avoid warnings Reviewed By: klimek Differential Revision: https://reviews.llvm.org/D146704	2023-03-24 09:27:23 +00:00
Stefan Gränitz	5c1d160cd9	Reland "[JITLink] Initial AArch32 backend" This first version lays the foundations for AArch32 support in JITLink. ELFLinkGraphBuilder_aarch32 processes REL-type relocations and populates LinkGraphs from ELF object files for both big- and little-endian systems. The ArmCfg member controls subarchitecture-specific details throughout the linking process (i.e. it's passed to ELFJITLinker_aarch32). Relocation types follow the ABI documentation's division into classes: Data (endian-sensitive), Arm (32-bit little-endian) and Thumb (2x 16-bit little-endian, "Thumb32" in the docs). The implementation of instruction encoding/decoding for relocation resolution is implemented symmetrically and is testable in isolation (see AArch32 category in JITLinkTests). Callable Thumb functions are marked with a ThumbSymbol target-flag and stored in the LinkGraph with their real addresses. The thumb-bit is added back in when the owning JITDylib requests the address for such a symbol. The StubsManager can generate (absolute) Thumb-state stubs for branch range extensions on v7+ targets. Proper GOT/PLT handling is not yet implemented. This patch is based on the backend implementation in ez-clang and has just enough functionality to model the infrastructure and link a Thumb function `main()` that calls `printf()` to dump "Hello Arm!" on Armv7a. It was tested on Raspberry Pi with 32-bit Raspbian OS. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D144083	2023-03-24 10:18:08 +01:00
Luke Lau	410a239996	[RISCV] Add test for shuffles that could be done as vmerges Reviewed By: reames Differential Revision: https://reviews.llvm.org/D146710	2023-03-24 09:15:03 +00:00
Andrzej Warzynski	cf60d3f1a6	[mlir][arith] Extend the `floordivsi` converter This patch extends the `createConst` method so that it can generate constant vectors (it can already generate scalars). This change is required to be able to apply the converter for `arith.floordivsi` (i.e. `FloorDivSIOpConverter`) to vectors. While `arith.floordivsi` is my main motivation for this change, this patch should also allow other Arith ops to be converted in vector cases. In my example, the Linalg vectorizer updates `arith.floordivsi` to operate on vectors and hence the need for this change. Differential Revision: https://reviews.llvm.org/D146741	2023-03-24 08:48:55 +00:00
luxufan	6a7d04c6b4	[NFC] Regenerate test of InstCombine/load-combine-metadata-dominance.ll	2023-03-24 16:47:53 +08:00
Max Kazantsev	0858b5369b	[Test] Regenerate checks in test file	2023-03-24 15:43:02 +07:00
Martin Storsjö	4a5bc791f3	[CMake] Respect variables for specifying host tools even without LLVM_USE_HOST_TOOLS set When LLVM_NATIVE_TOOL_DIR was introduced in `d3da9067d1` / D131052, it consisted of refactoring a couple cases of manual logic for tools in clang-tools-extra/clang-tidy, clang-tools-extra/pseudo/include and mlir/tools/mlir-linalg-ods-gen. The former two had the same consistent behaviour while the latter was slightly different, so the refactoring would end up slightly adjusting one or the other. The difference was that the clang-tools-extra tools respected the external variable for setting the tool name, regardless of the LLVM_USE_HOST_TOOLS variable, while mlir-linalg-ods-gen tool only checked its external variable if LLVM_USE_HOST_TOOLS was set. LLVM_USE_HOST_TOOLS is supposed to be enabled automatically whenever cross compiling, so this shouldn't have been an issue. In https://github.com/llvm/llvm-project/issues/60784, it seems like some users do cross compile LLVM, without CMake knowing about it (without CMAKE_CROSSCOMPILING being set). In these cases, their build broke, as the variables for pointing to external host tools no longer were being respected. The fact that CMAKE_CROSSCOMPILING wasn't set stems from a non-obvious behaviour of CMake; CMAKE_CROSSCOMPILING isn't supposed to be set by the user (and if it was, it gets overridden), but one has to set CMAKE_SYSTEM_NAME to indicate that one is cross compiling, even if the target OS is the same as the current host. Skip the checks for LLVM_USE_HOST_TOOLS and always respect the variables for pointing to external tools (both the old tool specific variables, and the new LLVM_NATIVE_TOOL_DIR), if they're set. This makes the logic within setup_host_tool more exactly match the logic for the clang-tools-extra tools from before the refactoring in `d3da9067d1`. 
This makes the behaviour consistent with that of the tablegen executables, which also respect the externally set variables regardless of LLVM_USE_HOST_TOOLS. This fixes https://github.com/llvm/llvm-project/issues/60784. Differential Revision: https://reviews.llvm.org/D146666	2023-03-24 10:35:24 +02:00
Johannes de Fine Licht	a94d2bd5b5	[MLIR][LLVM] Add debug output to the LLVM inliner. This revealed a test case that wasn't hitting the intended branch because the inlinees had no function definition. Depends on D146628 Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D146633	2023-03-24 09:33:42 +01:00
Martin Storsjö	09aa3f7bb5	[LLD] [test] Add a missing REQUIRES: x86 in a new test This test somewhat unconventionally assembles both aarch64 and x86 object files. This fixes test failures in build configurations with the aarch64 target enabled but x86 target disabled.	2023-03-24 10:28:49 +02:00
luxufan	558b33c5aa	[NFC] Regenerate test NewGVN/metadata-nonnull.ll	2023-03-24 16:26:32 +08:00
Dmitry Chernenkov	3048c9e154	Revert "Recommit [Modules] Remove unnecessary check when generating name lookup table in ASTWriter" This reverts commit `25557aa38a`.	2023-03-24 07:59:04 +00:00
Tobias Gysi	b0cd5b2a47	[mlir][llvm] Switch remaining LLVM dialect tests to opaque pointers. The revision switches the remaining LLVM dialect tests to use opaque pointers. Selected tests are copied to a postfixed test file for the time being. A number of tests disappear once we fully switch to opaque pointers. In particular, all tests that check verify a pointer element type matches another type as well as tests of recursive types. Part of https://discourse.llvm.org/t/rfc-switching-the-llvm-dialect-and-dialect-lowerings-to-opaque-pointers/68179 Reviewed By: Dinistro, zero9178 Differential Revision: https://reviews.llvm.org/D146726	2023-03-24 08:24:58 +01:00
Carlos Galvez	f957b8fe1e	[clang-tidy][NFC] Improve naming convention in google-readability-avoid-underscore-in-googletest-name According to the Google docs, the convention is TEST(TestSuiteName, TestName). Apply that convention to the source code, test and documentation of the check. Differential Revision: https://reviews.llvm.org/D146713	2023-03-24 07:22:04 +00:00
Michael Platings	d30bc9e912	[Driver] Change multilib selection algorithm The new algorithm is: 1. Find all multilibs with flags that are a subset of the requested flags. 2. If more than one multilib matches, choose the last. In addition a new selection mechanism is permitted via an overload of MultilibSet::select() for which multiple multilibs are returned. This allows layering multilibs on top of each other. Since multilibs are now ordered within a list, they no longer need a Priority field. The new algorithm is different to the old algorithm, but in practise the old algorithm was always used in such a way that the effect is the same. The old algorithm was to find the set intersection of the requested flags (with the first character of each removed) with each multilib's flags (ditto), and for that intersection check whether the first character matched. However, ignoring the first characters, the requested flags were always a superset of all the multilibs flags. Therefore the new algorithm can be used as a drop-in replacement. The exception is Fuchsia, which needs adjusting slightly to set both fexceptions and fno-exceptions flags. Differential Revision: https://reviews.llvm.org/D142905	2023-03-24 06:58:07 +00:00
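
The two-step selection rule from d30bc9e912 can be sketched as follows. This is an illustration under stated assumptions, not the Clang driver code: the `Multilib` struct, the `select_multilib` function, and the flag strings used with it are hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-in for a multilib entry: a directory plus its flags.
struct Multilib {
  std::string dir;
  std::set<std::string> flags;
};

// Step 1: keep multilibs whose flags are a subset of the requested flags.
// Step 2: if several match, the last one in the list wins.
// Returns nullptr when nothing matches.
const Multilib *select_multilib(const std::vector<Multilib> &multilibs,
                                const std::set<std::string> &requested) {
  const Multilib *selected = nullptr;
  for (const Multilib &m : multilibs) {
    // std::includes needs sorted ranges; std::set iterates in sorted order.
    if (std::includes(requested.begin(), requested.end(),
                      m.flags.begin(), m.flags.end())) {
      selected = &m;
    }
  }
  return selected;
}
```

Because the list order decides ties, a separate priority value is unnecessary in this model, which matches the commit's remark that the Priority field is no longer needed.
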
Kazu Hirata	1e4325f30c	[X86] Precommit a test This patch precommits a test for: https://github.com/llvm/llvm-project/issues/61365	2023-03-23 23:48:17 -07:00
Dave Lee	abddb83598	[lldb] Fix type of --apply-fixits (NFC)	2023-03-23 22:49:29 -07:00
Xiang1 Zhang	cc86e6b0a8	[BugFix] Fix VSELECT ISel fail Reviewed By: Luo yuanke Differential Revision: https://reviews.llvm.org/D146683	2023-03-24 13:13:35 +08:00
Kazu Hirata	d8efbcf9dc	[AArch64] Add tests for umax(x, 1u) This patch adds tests for umax(x, 1u). This patch fixes: https://github.com/llvm/llvm-project/issues/60233 It turns out that commit `86b4d8645f` on Feb 8, 2023 already performs the instcombine transformation proposed in the issue, so the issue requires no change on the codegen side.	2023-03-23 20:20:20 -07:00
Xiaodong Liu	11674147e4	[LoongArch] Enable LoopDataPrefetch pass Keep `EnableLoopDataPrefetch` option off for now because we need a few more TTIs and ISels. This patch is inspired by http://reviews.llvm.org/D17943. Reviewed By: SixWeining Differential Revision: https://reviews.llvm.org/D146600	2023-03-24 11:09:18 +08:00
Jun Zhang	cea938390e	[InstCombine] Try to recognize bswap pattern when calling funnel shifts Alive2: https://alive2.llvm.org/ce/z/dxxD7B Fixes: https://github.com/llvm/llvm-project/issues/60690 Signed-off-by: Jun Zhang <jun@junz.org> Differential Revision: https://reviews.llvm.org/D146637	2023-03-24 10:51:36 +08:00
Jun Zhang	3ca6e69b6e	Precommit tests for #60690 Differential Revision: https://reviews.llvm.org/D146636 Signed-off-by: Jun Zhang <jun@junz.org>	2023-03-24 10:51:32 +08:00
XinWang10	4950104e24	[NFC][X86]remove trailing space in X86InstrArithmetic.td In this file, most of the lines don't have trailing spaces, but some of them do. To keep the file consistent, remove the trailing spaces. Reviewed By: skan Differential Revision: https://reviews.llvm.org/D146697	2023-03-23 22:32:45 -04:00
Kazu Hirata	231fa27435	[InstCombine] Generate better code for std::bit_ceil Without this patch, std::bit_ceil<uint32_t> is compiled as: %dec = add i32 %x, -1 %lz = tail call i32 @llvm.ctlz.i32(i32 %dec, i1 false) %sub = sub i32 32, %lz %res = shl i32 1, %sub %ugt = icmp ugt i32 %x, 1 %sel = select i1 %ugt, i32 %res, i32 1 With this patch, we generate: %dec = add i32 %x, -1 %ctlz = tail call i32 @llvm.ctlz.i32(i32 %dec, i1 false) %sub = sub nsw i32 0, %ctlz %and = and i32 %1, 31 %sel = shl nuw i32 1, %and ret i32 %sel https://alive2.llvm.org/ce/z/pwezvF This patch recognizes the specific pattern from std::bit_ceil in libc++ and libstdc++ and drops the conditional move. In addition to the LLVM IR generated for std::bit_ceil(X), this patch recognizes variants like: std::bit_ceil(X - 1) std::bit_ceil(X + 1) std::bit_ceil(X + 2) std::bit_ceil(-X) std::bit_ceil(~X) This patch fixes: https://github.com/llvm/llvm-project/issues/60802 Differential Revision: https://reviews.llvm.org/D145299	2023-03-23 19:26:43 -07:00
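
The two IR shapes in 231fa27435 correspond roughly to the two C++ functions below. This is a hedged sketch, not code from libc++, libstdc++, or the patch: all three function names are hypothetical, and a portable loop stands in for `llvm.ctlz`. Inputs above 2^31 are excluded because the result of `std::bit_ceil` is not representable there.

```cpp
#include <cassert>
#include <cstdint>

// Portable leading-zero count; returns 32 for an input of 0, matching
// llvm.ctlz with its second argument set to false.
inline std::uint32_t count_leading_zeros(std::uint32_t v) {
  std::uint32_t n = 0;
  for (std::uint32_t bit = 0x80000000u; bit != 0 && (v & bit) == 0; bit >>= 1)
    ++n;
  return n;
}

// Shape before the patch: shift by (32 - ctlz) guarded by a select.
inline std::uint32_t bit_ceil_with_select(std::uint32_t x) {
  assert(x <= 0x80000000u);
  if (x <= 1)
    return 1;
  return std::uint32_t{1}
         << (32u - count_leading_zeros(static_cast<std::uint32_t>(x - 1)));
}

// Shape after the patch: negate the count and mask it to 5 bits, which
// makes the shift amount 0 for x == 0 and x == 1 without a branch.
inline std::uint32_t bit_ceil_masked(std::uint32_t x) {
  assert(x <= 0x80000000u);
  const std::uint32_t lz =
      count_leading_zeros(static_cast<std::uint32_t>(x - 1));
  return std::uint32_t{1} << ((0u - lz) & 31u);
}
```

Both functions agree on every input from 0 through 2^31, which is the equivalence the Alive2 link in the commit message is meant to establish for the IR.
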
Kazu Hirata	5f48b861f8	[SelectionDAG] Use isOneConstant (NFC)	2023-03-23 19:26:42 -07:00
Rahul Joshi	24657a95c1	[NFC] Fix Windows builds that use MSVC 14.x Differential Revision: https://reviews.llvm.org/D146769	2023-03-23 19:16:44 -07:00
Arthur Eubanks	ccd96b3e03	[builtins][test] Fix divmodti4_test.c on Windows By making the 64 bit integer literals unsigned. Otherwise some of them are unexpectedly sign extended (and the compiler rightly diagnosed this with warnings) Initially added in D80506. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D146667	2023-03-23 19:08:59 -07:00
LiaoChunyu	24847a90aa	[LegalizeTypes][RISCV] Add a special case for (add X, -1) to ExpandIntRes_ADDSUB On targets without ADDCARRY or ADDE, we need to emit a separate SETCC to determine carry from the low half to the high half. The high half is calculated by a series of ADDs. When RHSLo and RHSHi are -1, without this patch, we get: Hi = (add (add LHSHi,(setult Lo, LHSLo), -1) Where as with the patch we get: Hi = (sub LHSHi, (seteq LHSLo, 0)) Only RHSLo is -1 we can instead do (setne Lo, 0). Similar to gcc: https://godbolt.org/z/M83f6rz39 Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D146635	2023-03-24 09:45:43 +08:00
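
The special case in 24847a90aa, where both halves of the right-hand side are -1, can be checked with a small model of a 64-bit add split into 32-bit halves. The `Halves` struct and both function names are hypothetical; the sketch mirrors the two expressions quoted in the commit message rather than the SelectionDAG code.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model of a 64-bit value split into two 32-bit halves.
struct Halves {
  std::uint32_t lo;
  std::uint32_t hi;
};

// Generic expansion: the carry is recovered with an unsigned compare,
// then the high half needs two additions.
inline Halves add_minus_one_generic(Halves x) {
  const std::uint32_t lo = x.lo + 0xFFFFFFFFu;
  const std::uint32_t carry = lo < x.lo ? 1u : 0u;  // (setult Lo, LHSLo)
  const std::uint32_t hi = x.hi + carry + 0xFFFFFFFFu;
  return Halves{lo, hi};
}

// Special case: adding -1 borrows from the high half only when the
// low half is zero, so Hi = LHSHi - (LHSLo == 0).
inline Halves add_minus_one_special(Halves x) {
  const std::uint32_t lo = x.lo - 1u;
  const std::uint32_t borrow = x.lo == 0 ? 1u : 0u;  // (seteq LHSLo, 0)
  const std::uint32_t hi = x.hi - borrow;
  return Halves{lo, hi};
}
```
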
Quinn Dawkins	c82803097f	[mlir][linalg] Refactor convolution to img2col conversion to use gather semantics Following up on the comments in https://reviews.llvm.org/D144108 this patch refactors the im2col conversion patterns for `linalg.conv_2d_nhwc_hwcf` and `linalg.conv_2d_nchw_fchw` convolutions to use gather semantics for the im2col packing `linalg.generic`. Follow up work can include a similar pattern for depthwise convolutions and a generalization of the patterns here to work with any `LinalgOp` as well. Differential Revision: https://reviews.llvm.org/D144678	2023-03-23 19:38:53 -04:00
Bruno Cardoso Lopes	07ef7b1ff2	[Builtins] Add __builtin_assume_separate_storage Plumbing from the language level to the assume intrinsics with separate_storage operand bundles. Patch by David Goldblatt (davidtgoldblatt) Differential Revision: https://reviews.llvm.org/D136515	2023-03-23 16:35:30 -07:00
Fangrui Song	399f313f27	[Driver] Remove remnant mips64el-linux-android code after D146565	2023-03-23 16:00:17 -07:00
Fangrui Song	ccc2f362db	Android.rules: remove mips* rules They have been obsoleted for a long time and D146565 recently removed Clang support.	2023-03-23 15:58:43 -07:00
Fangrui Song	bb0ecb7bf0	[Driver][test] Remove remnant mips*-linux-android tests after `805f51f9fe`	2023-03-23 15:49:38 -07:00
Heejin Ahn	999643f151	[WebAssembly] Tidy up DebugValueManager (NFC) Misc. cleanups for `WebAssemblyDebugValueManager`. - Use `Register` for registers - Simpler for loop iteration - Rename a variable - Reorder methods - Reduce `SmallVector` size for `DBG_VALUE`s to 1; one def usually have a single `DBG_VALUE` attached to it in most cases - Add a few more lines of comments Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D146743	2023-03-23 15:35:26 -07:00
NagaChaitanya Vellanki	c13ccf1fba	[clang][ExtractAPI]Fix Declaration fragments for instancetype in the type position degrade to id Fixes https://github.com/llvm/llvm-project/issues/61481 Reviewed By: dang Differential Revision: https://reviews.llvm.org/D146671	2023-03-23 15:10:27 -07:00
Cyndy Ishida	397486566e	[llvm][TextAPI] Handle implicitly upgraded deployment versions Sometimes the clang driver will receive a target triple where the deployment version is too low to support the platform + arch. In those cases, the compiler upgrades the final minOS which is what gets recorded ultimately by the linker in LC_BUILD_VERSION. TextAPI should also reuse this logic for capturing minOS in recorded TBDv5 files. Reviewed By: ributzka Differential Revision: https://reviews.llvm.org/D145690	2023-03-23 14:58:41 -07:00
Lang Hames	ec2333d885	[JITLink] Add a jitlink::Section::empty operation.	2023-03-23 14:52:17 -07:00
Chia-hung Duan	2e9bcadb7c	Revert "[scudo] Add a Timer class to assist performance measurement" This reverts commit `e0361396c2`.	2023-03-23 21:49:02 +00:00
Leonard Chan	53a9175951	[llvm] Handle duplicate call bases when applying branch funneling It's possible to segfault in `DevirtModule::applyICallBranchFunnel` when attempting to call `getCaller` on a call base that was erased in a prior iteration. This can occur when attempting to find devirtualizable calls via `findDevirtualizableCallsForTypeTest` if the vtable passed to llvm.type.test is a global and not a local. The function works by taking the first argument of the llvm.type.test call (which is a vtable), iterating through all uses of it, and adding any relevant all uses that are calls associated with that intrinsic call to a vector. For most cases where the vtable is actually a local, this wouldn't be an issue. Take for example: ``` define i32 @fn(ptr %obj) #0 { %vtable = load ptr, ptr %obj %p = call i1 @llvm.type.test(ptr %vtable, metadata !"typeid2") call void @llvm.assume(i1 %p) %fptr = load ptr, ptr %vtable %result = call i32 %fptr(ptr %obj, i32 1) ret i32 %result } ``` `findDevirtualizableCallsForTypeTest` will check the call base ` %result = call i32 %fptr(ptr %obj, i32 1)`, find that it is associated with a virtualizable call from `%vtable`, find all loads for `%vtable`, and add any instances those load results are called into a vector. Now consider the case where instead `%vtable` was the global itself rather than a local: ``` define i32 @fn(ptr %obj) #0 { %p = call i1 @llvm.type.test(ptr @vtable, metadata !"typeid2") call void @llvm.assume(i1 %p) %fptr = load ptr, ptr @vtable %result = call i32 %fptr(ptr %obj, i32 1) ret i32 %result } ``` `findDevirtualizableCallsForTypeTest` should work normally and add one unique call instance to a vector. 
However, if there are multiple instances where this same global is used for llvm.type.test, like with: ``` define i32 @fn(ptr %obj) #0 { %p = call i1 @llvm.type.test(ptr @vtable, metadata !"typeid2") call void @llvm.assume(i1 %p) %fptr = load ptr, ptr @vtable %result = call i32 %fptr(ptr %obj, i32 1) ret i32 %result } define i32 @fn2(ptr %obj) #0 { %p = call i1 @llvm.type.test(ptr @vtable, metadata !"typeid2") call void @llvm.assume(i1 %p) %fptr = load ptr, ptr @vtable %result = call i32 %fptr(ptr %obj, i32 1) ret i32 %result } ``` Then each call base `%result = call i32 %fptr(ptr %obj, i32 1)` will be added to the vector twice. This is because for either call base `%result = call i32 %fptr(ptr %obj, i32 1) `, we determine it is associated with a virtualizable call from `@vtable`, and then we iterate through all the uses of `@vtable`, which is used across multiple functions. So when scanning the first `%result = call i32 %fptr(ptr %obj, i32 1)`, then both call bases will be added to the vector, but when scanning the second one, both call bases are added again, resulting in duplicate call bases in the CSInfo.CallSites vector. Note this is actually accounted for in every other instance WPD iterates over CallSites. What everything else does is actually add the call base to the `OptimizedCalls` set and just check if it's already in the set. We can't reuse that particular set since it serves a different purpose marking which calls where devirtualized which `applyICallBranchFunnel` explicitly says it doesn't. For this fix, we can just account for duplicates with a map and do the actual replacements afterwards by iterating over the map. Differential Revision: https://reviews.llvm.org/D146267	2023-03-23 21:44:59 +00:00
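
The deduplication idea from 53a9175951 (collect call sites in a map first, then perform the replacements by iterating over the map) can be sketched in isolation. This is not the `DevirtModule` code: call bases are modelled as plain integer IDs, and `plan_unique_replacements` and the `funnel_` naming are hypothetical.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// Builds one replacement entry per distinct call-site ID. Duplicate IDs in
// the input (the analogue of duplicate call bases in CallSites) are ignored,
// so a later pass over the map touches each call site exactly once.
inline std::map<int, std::string>
plan_unique_replacements(const std::vector<int> &call_site_ids) {
  std::map<int, std::string> plan;
  for (int id : call_site_ids) {
    // emplace is a no-op when the key is already present.
    plan.emplace(id, "funnel_" + std::to_string(id));
  }
  return plan;
}
```

Separating planning from replacement means no call site is visited after it has been erased, which is the failure mode the commit describes.
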
Joseph Huber	9ddc03a17d	[OpenMP] Fix test after updating NVPTX atomic inlines Summary: The previous patch fixed how we handle emitting atomics for targeting NVPTX directly. This is the only other file that really does that and has atomics and I forgot to update it.	2023-03-23 16:41:25 -05:00
Joseph Huber	d11e49f0c8	[libc][NFC] Fix misspelled variable name in cmake message	2023-03-23 16:30:31 -05:00
Joseph Huber	af54d1e852	[NVPTX] Set the atomic inlining threshold when targeting NVPTX directly Since Clang 16.0.0 users can target the `NVPTX` architecture directly via `--target=nvptx64-nvidia-cuda`. However, this does not set the atomic inlining size correctly. This leads to spurious warnings and emission of runtime atomics that are never implemented. This patch ensures that we set this to the appropriate pointer width. This will always be 64 in the future as `nvptx64` will only be supported moving forward. Fixes: https://github.com/llvm/llvm-project/issues/61410 Reviewed By: tra Differential Revision: https://reviews.llvm.org/D146750	2023-03-23 16:30:07 -05:00
Sam Clegg	3111784ff7	[lld][WebAssembly] Initial support for stub libraries See the docs in lld/docs/WebAssembly.rst for more on this. This feature unlocks a lot of simplification in the emscripten toolchain since we can represent the JS libraries to wasm-ld as stub libraries. See https://github.com/emscripten-core/emscripten/issues/18875 Differential Revision: https://reviews.llvm.org/D145308	2023-03-23 14:26:27 -07:00
NagaChaitanya Vellanki	1c9173365a	Fix highlighting issue with _complex and initialization list with more than 2 items Fixes https://github.com/llvm/llvm-project/issues/61518 Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D146503	2023-03-23 14:18:02 -07:00
AdityaK	805f51f9fe	Remove Android-mips related tests Split from: https://reviews.llvm.org/D146565, already reviewed there.	2023-03-23 14:06:50 -07:00
Arthur Eubanks	088da8a0e5	[lldb][NFC] makeArrayRef -> ArrayRef makeArrayRef is deprecated.	2023-03-23 14:05:06 -07:00
Alexander Yermolovich	d557384b43	[LLDB] Fix for D139955 Summary: Fixing a small typo. Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D146659	2023-03-23 14:03:42 -07:00

455668 Commits