intel/llvm - llvm - Gitea: Git with a cup of tea

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-16 21:55:39 +08:00

Author	SHA1	Message	Date
fourdim	3af7bbca4a	[JITLink][RISCV] fix the extractBits behavior and add R_RISCV_JAL relocation. This patch supports the R_RISCV_JAL relocation. Moreover, it will fix the extractBits function's behavior as it extracts Size + 1 bits. In the test ELF_jal.s: Before: ``` Hi: 4294836480 extractBits(Hi, 12, 8): 480 ``` After: ``` Hi: 4294836480 extractBits(Hi, 12, 8): 224 ``` Reviewed By: StephenFan Differential Revision: https://reviews.llvm.org/D117975	2022-02-17 23:03:36 +08:00
Zakk Chen	eeb7754f68	[RISCV] Add the passthru operand for vmv.vv/vmv.vx/vfmv.vf IR intrinsics. Add the passthru operand for VMV_V_X_VL, VFMV_V_F_VL and SPLAT_VECTOR_SPLIT_I64_VL also. The goal is support tail and mask policy in RVV builtins. We focus on IR part first. If the passthru operand is undef, we use tail agnostic, otherwise use tail undisturbed. Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D119688	2022-02-17 06:38:14 -08:00
Alexander Potapenko	be77afe43d	tsan: Add a missing disable_sanitizer_instrumentation attribute Turns out the test was working by accident: we need to ensure TSan instrumentation is not called from the fork() hook, otherwise the tool will deadlock. Previously it worked because alloc_free_blocks() got inlined into __tsan_test_only_on_fork(), but it cannot always be the case. Adding __attribute__((disable_sanitizer_instrumentation)) will prevent TSan from instrumenting alloc_free_blocks(). Reviewed By: dvyukov Differential Revision: https://reviews.llvm.org/D120050	2022-02-17 15:34:41 +01:00
Lei Zhang	c9b36807be	[mlir][spirv] Add a pass to unify aliased resource variables In SPIR-V, resources are represented as global variables that are bound to certain descriptor. SPIR-V requires those global variables to be declared as aliased if multiple ones are bound to the same slot. Such aliased decorations can cause issues for transcompilers like SPIRV-Cross when converting to source shading languages like MSL. So this commit adds a pass to perform analysis of aliased resources and see if we can unify them into one. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D119872	2022-02-17 09:08:58 -05:00
Alexey Bataev	d1cd64ffdd	[SLP][NFC]Fix misprint in function name, NFC.	2022-02-17 05:57:51 -08:00
Nico Weber	a569d6060d	[gn build] (manually) port `f75da0c8e6` (ObjCopy lib)	2022-02-17 08:56:35 -05:00
Shao-Ce SUN	7798ecca9c	[RISCV] add the MC layer support of Zfinx extension This patch added the MC layer support of Zfinx extension. Authored-by: StephenFan Co-Authored-by: Shao-Ce Sun Reviewed By: asb Differential Revision: https://reviews.llvm.org/D93298	2022-02-17 21:54:13 +08:00
Simon Pilgrim	3f22a4962d	[X86] selectLEAAddr - add X86ISD::SMUL/UMULO handling After D118128 relaxed the heuristic to require only one EFLAGS generating operand, it now makes sense to avoid X86ISD::SMUL/UMULO duplication as well. Differential Revision: https://reviews.llvm.org/D119578	2022-02-17 13:51:02 +00:00
Guillaume Chatelet	da5a4f16e8	[libc][automemcpy] Introduce geomean of scores as a tie breaker Differential Revision: https://reviews.llvm.org/D120040	2022-02-17 13:37:05 +00:00
Paul Walker	6457f42bde	[DAGCombiner] Extend ISD::ABDS/U combine to handle more cases. The current ABD combine doesn't quite work for SVE because only a single scalable vector per scalar integer type is legal (e.g. for i32, <vscale x 4 x i32> is the only legal scalable vector type). This patch extends the combine to also trigger for the cases when operand extension must be retained. Differential Revision: https://reviews.llvm.org/D115739	2022-02-17 13:32:20 +00:00
Bjorn Pettersson	1a8bdf95a3	[DAG] Fix in ReplaceAllUsesOfValuesWith When doing SelectionDAG::ReplaceAllUsesOfValuesWith a worklist is prepared containing all users that should be updated. Then we use the RemoveNodeFromCSEMaps/AddModifiedNodeToCSEMaps helpers to handle recursive CSE updates while doing the replacements. This patch aims at solving a problem that could arise if the recursive CSE updates would result in an SDNode present in the worklist is being removed as a side-effect of morphing a prio user in the worklist. To examplify such a scenario, imagine that we have these nodes in the DAG t12: i64 = add t8, t11 t13: i64 = add t12, t8 t14: i64 = add t11, t11 t15: i64 = add t14, t8 t16: i64 = sub t13, t15 and that the t8 uses should be replaced by t11. An initial worklist (listing the users that should be morphed) could be [t12, t13, t15]. When updating t12 we get t12: i64 = add t11, t11 which results in a CSE update that replaces t14 by t12, so we get t15: i64 = add t12, t8 which results in a CSE update that replaces t13 by t12, so we get t16: i64 = sub t12, t15 and then t13 is removed given that it was the last use of t13. So when being done with the updates triggered by rewriting the use of t8 in t12 the t13 node no longer exist. And we used to end up hitting an assertion when continuing with the worklist aiming at replacing the t8 uses in t13. The solution is based on using a DAGUpdateListener, making sure that we prune a user from the worklist if it is removed during the recursive CSE updates. The bug was found using an OOT target. I think the problem is quite old, even if the particular intree target reproducer added in this patch seem to pass when using LLVM 13.0.0. Differential Revision: https://reviews.llvm.org/D119088	2022-02-17 14:29:59 +01:00
Simon Pilgrim	1c502c63cb	[clang-doc] SerializeIndex - pass Index param by constant reference Silence coverity warnings about unnecessary copies	2022-02-17 13:28:02 +00:00
Shao-Ce SUN	f29f86b60b	[NFC] Fix comment	2022-02-17 21:19:22 +08:00
Simon Pilgrim	57fc9798d7	[clang] CGDebugInfo::getOrCreateMethodType - use castAs<> instead of getAs<> to avoid dereference of nullptr The pointer is always dereferenced, so assert the cast is correct instead of returning nullptr	2022-02-17 13:18:23 +00:00
Simon Pilgrim	2614de8202	[clang] CGCXXABI::EmitLoadOfMemberFunctionPointer - use castAs<> instead of getAs<> to avoid dereference of nullptr The pointer is always dereferenced by arrangeCXXMethodType, so assert the cast is correct instead of returning nullptr	2022-02-17 13:18:23 +00:00
Shao-Ce SUN	21ac474392	[NFC] Correct typo `interger` to `integer`	2022-02-17 21:17:47 +08:00
Benjamin Kramer	d955ca4937	[BufferDeallocation] Don't assume successor operands are unique This would create a double free when a memref is passed twice to the same op. This wasn't a problem at the time the pass was written but is common since the introduction of scf.while. There's a latent non-determinism that's triggered by the test, but this change is messy enough as-is so I'll leave that for later. Differential Revision: https://reviews.llvm.org/D120044	2022-02-17 14:16:32 +01:00
John Brawn	d916856bee	[AArch64] Allow strict opcodes in faddp patterns This also requires adjustment to code in AArch64ISelLowering so that vector_extract is distributed over strict_fadd. Differential Revision: https://reviews.llvm.org/D118489	2022-02-17 13:11:55 +00:00
John Brawn	b670da798d	[AArch64] Allow strict opcodes in indexed fmul and fma patterns Using an indexed version instead of a non-indexed version doesn't change anything with regards to exceptions or rounding. Differential Revision: https://reviews.llvm.org/D118487	2022-02-17 13:11:54 +00:00
John Brawn	9d68ed0817	[AArch64] Allow strict opcodes in fp->int->fp patterns These patterns don't change the fundamental instructions that are used, just the variants that are used in order to remove some extra MOVs. Differential Revision: https://reviews.llvm.org/D118485	2022-02-17 13:11:54 +00:00
John Brawn	d4342efb69	[AArch64] Add instruction selection for strict FP This consists of marking the various strict opcodes as legal, and adjusting instruction selection patterns so that 'op' is 'any_op'. FP16 and vector instructions additionally require some extra work in lowering and legalization, so we can't set IsStrictFPEnabled just yet. Also more work needs to be done for full strict fp support (marking instructions that can raise exceptions as such, and modelling FPCR use for controlling rounding). Differential Revision: https://reviews.llvm.org/D114946	2022-02-17 13:11:54 +00:00
Guillaume Chatelet	48e0e6cedc	[llvm][automemcpy] Allow distribution filtering in analysis Differential Revision: https://reviews.llvm.org/D120037	2022-02-17 13:10:46 +00:00
Adrian Kuegel	d74f15faff	[AArch64][NFC] Fix unused-lambda-capture warning. Differential Revision: https://reviews.llvm.org/D120041	2022-02-17 14:09:58 +01:00
Nikita Popov	dce3b403a7	[Docs] Use correct rst syntax	2022-02-17 14:08:46 +01:00
Momchil Velikov	030503e17c	Remove duplicated code for printing the `uwtable` attribute (NFC) Committed as obvious. Reviewed By: chill Differential Revision: https://reviews.llvm.org/D120030	2022-02-17 12:24:41 +00:00
Adrian Kuegel	e7d65fca7e	[Bazel] Fix build after ObjCopy move. Differential Revision: https://reviews.llvm.org/D120039	2022-02-17 13:22:26 +01:00
Andrzej Warzynski	e993b20c04	[flang][driver] Add support for `-emit-llvm` This patch adds support for the `-emit-llvm` option in the frontend driver (i.e. `flang-new -fc1`). Similarly to Clang, `flang-new -fc1 -emit-llvm file.f` will generate a textual LLVM IR file. Depends on D118985 Differential Revision: https://reviews.llvm.org/D119012	2022-02-17 12:13:03 +00:00
Guillaume Chatelet	b254a2a703	[libc][automemcpy] Add mean/variance and simplify implementation Differential Revision: https://reviews.llvm.org/D120031	2022-02-17 12:11:05 +00:00
Nikita Popov	4846568191	[Docs] Update opaque pointers docs Expand migration instructions.	2022-02-17 13:03:33 +01:00
Roman Lebedev	07cf95942f	[NFC][PhaseOrdering] Improve test coverage for D119975	2022-02-17 14:58:22 +03:00
Simon Pilgrim	5f4549c372	[SystemZ] lowerDYNAMIC_STACKALLOC_XPLINK - use cast<> instead of dyn_cast<> to avoid dereference of nullptr The pointer is always dereferenced, so assert the cast is correct instead of returning nullptr	2022-02-17 11:56:29 +00:00
Simon Pilgrim	ada6bcc13f	[X86] X86tcret_1reg - use cast<> instead of dyn_cast<> to avoid dereference of nullptr The pointer is always dereferenced, so assert the cast is correct instead of returning nullptr	2022-02-17 11:54:12 +00:00
Simon Pilgrim	f1877eb1bb	AArch64_MC::isQForm - Fix MSVC 'no default capture mode' lambda warning	2022-02-17 11:41:47 +00:00
Nikita Popov	36fdfaba19	[RelLookupTableConverter] Ensure that GV, GEP and load types match This code could be generalized to be type-independent, but for now just ensure that the same type constraints are enforced with opaque pointers as with typed pointers.	2022-02-17 12:05:05 +01:00
Max Kazantsev	fc539b0004	[SCEV] Infer ranges for SCC consisting of cycled Phis Our current strategy of computing ranges of SCEVUnknown Phis was to simply compute the union of ranges of all its inputs. In order to avoid infinite recursion, we mark Phis as pending and conservatively return full set for them. As result, even simplest patterns of cycled phis always have a range of full set. This patch makes this logic a bit smarter. We basically do the same, but instead of taking inputs of single Phi we find its strongly connected component (SCC) and compute the union of all inputs that come into this SCC from outside. Processing entire SCC together has one more advantage: we can set range for all of them at once, because the only thing that happens to them is the same value is being passed between those Phis. So, despite we spend more time analyzing a single Phi, overall we may save time by not processing other SCC members, so amortized compile time spent should be approximately the same. Differential Revision: https://reviews.llvm.org/D110620 Reviewed By: reames	2022-02-17 18:03:52 +07:00
Sven van Haastregt	9798b33d1d	[OpenCL] Guard 64-bit atomic types Until now, overloads with a 64-bit atomic type argument were always made available with `-fdeclare-opencl-builtins`. Ensure these overloads are only available when both the `cl_khr_int64_base_atomics` and `cl_khr_int64_extended_atomics` extensions have been enabled, as required by the OpenCL specification. Differential Revision: https://reviews.llvm.org/D119858	2022-02-17 10:58:52 +00:00
Alexey Lapshin	889317d47b	[objcopy][NFC] Add doc comments to the executeObjcopy* functions. Add doc comments to the executeObjcopy* functions. Depends on D88827	2022-02-17 13:48:48 +03:00
Pavel Kosov	37fa99eda0	[SchedModels][CortexA55] Add ASIMD integer instructions Depends on D114642 Original review https://reviews.llvm.org/D112201 OS Laboratory. Huawei Russian Research Institute. Saint-Petersburg Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D117003	2022-02-17 13:41:57 +03:00
Pavel Kosov	f3809b20f2	[AArch64][SchedModels] Handle virtual registers in FP/NEON predicates Current implementation of Check[HSDQ]Form predicates doesn’t handle virtual registers and therefore isn’t useful for pre-RA scheduling. Patch fixes this implementing two function predicates: CheckQForm for checking that instruction writes 128-bit NEON register and CheckFpOrNEON which checks that instruction writes FP register (any width). The latter supersedes Check[HSD]Form predicates which are not used individually. OS Laboratory. Huawei Russian Research Institute. Saint-Petersburg Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D114642	2022-02-17 13:41:05 +03:00
Nikita Popov	5065076698	[CodeGen] Rename deprecated Address constructor To make uses of the deprecated constructor easier to spot, and to ensure that no new uses are introduced, rename it to Address::deprecated(). While doing the rename, I've filled in element types in cases where it was relatively obvious, but we're still left with 135 calls to the deprecated constructor.	2022-02-17 11:26:42 +01:00
Zakk Chen	093ecccdab	[RISCV] Add the passthru operand for vadc/vsbc/vmerge/vfmerge IR intrinsics. The goal is support tail and mask policy in RVV builtins. We focus on IR part first. If the passthru operand is undef, we use tail agnostic, otherwise use tail undisturbed. Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D119686	2022-02-17 02:21:39 -08:00
Eli Friedman	0389f2edf7	Revert "[compiler-rt] Implement ARM atomic operations for architectures without SMP support" This reverts commit `910a642c0a`. There are serious correctness issues with the current approach: __sync_* routines which are not actually atomic should not be enabled by default. I'll continue discussion on the review.	2022-02-17 02:17:27 -08:00
Eli Friedman	d20e01bb06	Revert "[NFC][compiler-rt] Format file lib/builtins/arm/sync-ops.h" This reverts commit `f165c23bf3`. Part of revert sequence for `910a642c0a`.	2022-02-17 02:16:25 -08:00
Alexey Lapshin	f75da0c8e6	[llvm-objcopy][NFC] Move core implementation of llvm-objcopy into separate library. This patch moves core implementation of llvm-objcopy into Object library (http://lists.llvm.org/pipermail/llvm-dev/2020-September/145075.html). The functionality for parsing input options is left inside tools/llvm-objcopy. The interface of ObjCopy library: ObjCopy/ELF/ELFObjcopy.h ``` Error executeObjcopyOnIHex(const CopyConfig &Config, MemoryBuffer &In, Buffer &Out); Error executeObjcopyOnRawBinary(const CopyConfig &Config, MemoryBuffer &In, Buffer &Out); Error executeObjcopyOnBinary(const CopyConfig &Config, object::ELFObjectFileBase &In, Buffer &Out); ``` ObjCopy/COFF/COFFObjcopy.h ``` Error executeObjcopyOnBinary(const CopyConfig &Config, object::COFFObjectFile &In, Buffer &Out); ``` ObjCopy/MachO/MachOObjcopy.h ``` Error executeObjcopyOnBinary(const CopyConfig &Config, object::MachOObjectFile &In, Buffer &Out); ``` ObjCopy/wasm/WasmObjcopy.h ``` Error executeObjcopyOnBinary(const CopyConfig &Config, object::WasmObjectFile &In, Buffer &Out); ``` Differential Revision: https://reviews.llvm.org/D88827	2022-02-17 13:11:42 +03:00
Siddharth Bhat	24a37a396a	[MLIR] add entry block to MLIR grammar. The MLIR parser allows regions to have an unnamed entry block. Make this explicit in the language grammar. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D119950	2022-02-17 15:40:43 +05:30
David Green	f3bc7fd546	[AArch64] Cleanup for performCommonVectorExtendCombine. NFC This is some NFC (hopefully!) cleanup for performCommonVectorExtendCombine and related methods, removing conditions that cannot occur and otherwise cleaning up the code a little.	2022-02-17 10:03:28 +00:00
Jay Foad	c08896d292	[AMDGPU] Return better Changed status from SILowerI1Copies Differential Revision: https://reviews.llvm.org/D119946	2022-02-17 09:38:57 +00:00
Jay Foad	78ebb1dd24	[AMDGPU] Return better Changed status from SIAnnotateControlFlow Differential Revision: https://reviews.llvm.org/D119945	2022-02-17 09:38:57 +00:00
Stanislav Gatev	a480841566	Add missing break statement in switch.	2022-02-17 09:37:02 +00:00
Jay Foad	1822a5ecdd	[AMDGPU] Return better Changed status from AMDGPUPerfHintAnalysis Differential Revision: https://reviews.llvm.org/D119944	2022-02-17 09:31:42 +00:00

1 2 3 4 5 ...

415425 Commits