intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-18 16:50:51 +08:00

Author	SHA1	Message	Date
Stanislav Gatev	a480841566	Add missing break statement in switch.	2022-02-17 09:37:02 +00:00
Jay Foad	1822a5ecdd	[AMDGPU] Return better Changed status from AMDGPUPerfHintAnalysis Differential Revision: https://reviews.llvm.org/D119944	2022-02-17 09:31:42 +00:00
Jay Foad	77e793d025	[AMDGPU] Return better Changed status from AMDGPUAnnotateUniformValues Differential Revision: https://reviews.llvm.org/D119943	2022-02-17 09:31:42 +00:00
Jay Foad	50ddb5d2d1	[CodeGen] Return better Changed status from LocalStackSlotAllocation Differential Revision: https://reviews.llvm.org/D119942	2022-02-17 09:31:41 +00:00
Jay Foad	f0092f9ded	[CodeGen] Return false from LiveIntervals::runOnMachineFunction This is an analysis pass so it does not modify the MachineFunction. Differential Revision: https://reviews.llvm.org/D119941	2022-02-17 09:31:41 +00:00
Jay Foad	3c9229c663	[CodeGen] Return better Changed status from DetectDeadLanes Differential Revision: https://reviews.llvm.org/D119940	2022-02-17 09:31:41 +00:00
Ben Shi	0b93e90971	Revert "[RISCV] LUI used for address computation should not isAsCheapAsAMove" This reverts commit `23a5073600`. Although this patch achieved better codegen in most cases, it is really important to accurately describe the cost of instructions. So I revert it.	2022-02-17 17:27:37 +08:00
Roman Lebedev	371fcb720e	[SimplifyCFG][PhaseOrdering] Defer lowering switch into an integer range comparison and branch until after at least the IPSCCP That transformation is lossy, as discussed in https://github.com/llvm/llvm-project/issues/53853 and https://github.com/rust-lang/rust/issues/85133#issuecomment-904185574 This is an alternative to D119839, which would add a limited IPSCCP into SimplifyCFG. Unlike lowering switch to lookup, we still want this transformation to happen relatively early, but after giving a chance for the things like CVP to do their thing. It seems like deferring it just until the IPSCCP is enough for the tests at hand, but perhaps we need to be more aggressive and disable it until CVP. Fixes https://github.com/llvm/llvm-project/issues/53853 Refs. https://github.com/rust-lang/rust/issues/85133 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D119854	2022-02-17 12:13:55 +03:00
Alex Zinenko	d4a53f3bfa	[mlir] call target materialization more in dialect conversion During dialect conversion, target materialization is triggered to create cast-like operations when a type mismatch occurs between the value that replaces a rewritten operation and the type that another operation expects as operands processed by the type conversion. First, a dummy cast is inserted to make sure the pattern application can proceed. The decision to trigger the user-provided materialization hook is taken later based on the result of the dummy cast having uses. However, it only has uses if other patterns constructed new operations using the casted value as operand. If existing (legal) operations use the replaced value, they may have not been updated to use the casted value yet. The conversion infra would then delete the dummy cast first, and then would replace the uses with now-invalid (null in the best case) value. When deciding whether to trigger cast materialization, check the liveness of the uses not only of the casted value, but also of all the values that it replaces. This was discovered in the finalizing bufferize pass that cleans up mutually-cancelling casts without touching other operations. It is not impossible that there are other scenarios where the dialect conversion infra could produce invalid operand uses because of dummy casts erased too eagerly. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D119937	2022-02-17 10:13:23 +01:00
Stanislav Gatev	dd4dde8d39	[clang][dataflow] Add transfer functions for logical and, or, not. This is part of the implementation of the dataflow analysis framework. See "[RFC] A dataflow analysis framework for Clang AST" on cfe-dev. Reviewed-by: xazax.hun Differential Revision: https://reviews.llvm.org/D119953	2022-02-17 09:09:59 +00:00
Lorenzo Chelini	25f1d50ca5	[MLIR][PDL] Fix typo (NFC)	2022-02-17 10:06:16 +01:00
Nikita Popov	c3c5280b0e	[InstSimplify] Delay creation of constants for offsets (NFC) Return APInt from stripAndComputeConstantOffsets(), and only create corresponding Constants later, if we actually need them.	2022-02-17 09:56:32 +01:00
Martin Storsjö	83c2aa467e	[libcxx] [test] Fix locale.time.get.byname get_date and get_date_wide on Windows Also apply the same fix on glibc. This takes the test one step closer to passing on glibc, but it still fails on the zh_CN test (which requires a more involved fix in libc++ itself). Differential Revision: https://reviews.llvm.org/D119791	2022-02-17 10:54:39 +02:00
Martin Storsjö	f081cc5037	[libcxx] [test] Fix the locale get_one_wide test for windows and glibc Differential Revision: https://reviews.llvm.org/D119790	2022-02-17 10:54:18 +02:00
Martin Storsjö	77c7ce0384	[libcxx] [test] Make the put_long_double test pass on mingw, clarify quirks in put_double Expect the same NAN formatting on Windows as on Glibc. (Both MSVC and MinGW produce the same formatting there.) The hex float formatting tests pass on MinGW, so opt in to those tests. Document exactly what issues are remaining in Clang-cl/MSVC configurations. (It's easily possible to make the tests pass there too, but it requires a whole lot of small-scope ifndefs in the test file; around 60 ifdefs in total for those both test files. Those could be avoided if the CI environment could run with a newer version of UCRT, but that's nontrivial to fix right away.) Differential Revision: https://reviews.llvm.org/D119766	2022-02-17 10:53:28 +02:00
Amir Ayupov	32d2473a5d	[BOLT][NFC] Report errors from createBinaryContext and RewriteInstance ctor Refactor createBinaryContext and RewriteInstance/MachORewriteInstance constructors to report an error in a library and fuzzer-friendly way instead of returning a nullptr or exiting. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D119658	2022-02-17 00:50:52 -08:00
Marek Kurdej	0ae2464fcd	[clang-format] Fix wrong assertion with non-negative shift when aligning tokens. Fixes https://github.com/llvm/llvm-project/issues/53880.	2022-02-17 09:49:00 +01:00
Nikita Popov	859567725d	[IndVars] Don't run full optimization pipeline in test (NFC) This extracts the IR prior to IndVarSimplify and only runs the single pass.	2022-02-17 09:28:33 +01:00
Ivan Butygin	d271fc04d5	[mlir][gpu] Split ops sinking from gpu-kernel-outlining pass into separate pass Previously `gpu-kernel-outlining` pass was also doing index computation sinking into gpu.launch before actual outlining. Split ops sinking from `gpu-kernel-outlining` pass into separate pass, so users can use their own sinking pass before outlining. To achieve old behavior users will need to call both passes: `-gpu-launch-sink-index-computations -gpu-kernel-outlining`. Differential Revision: https://reviews.llvm.org/D119932	2022-02-17 10:34:20 +03:00
Pavel Kosov	f165c23bf3	[NFC][compiler-rt] Format file lib/builtins/arm/sync-ops.h	2022-02-17 10:21:22 +03:00
Pavel Kosov	910a642c0a	[compiler-rt] Implement ARM atomic operations for architectures without SMP support ARMv5 and older architectures don’t support SMP and do not have atomic instructions. Still they’re in use in IoT world, where one has to stick to libgcc. Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D116088	2022-02-17 10:11:25 +03:00
Damian Rouson	d4332a8842	[flang] add semantics test for sync images Test a range of acceptable forms of SYNC IMAGES statements, including combinations with and without the stat-variable and errmsg-variable present. Also test that several invalid forms of SYNC IMAGES call generate the correct error messages. Differential Revision: https://reviews.llvm.org/D118933	2022-02-16 22:30:58 -08:00
Tom Stellard	fee491a10a	issue-release-workflow: Add support for /cherry-pick command in issue body Reviewed By: kwk Differential Revision: https://reviews.llvm.org/D119312	2022-02-16 22:18:06 -08:00
Shraiysh Vaishay	5bec1ea7a7	[mlir] Added oilist primitive This patch attempts to add the `oilist` primitive proposed in the [[ https://llvm.discourse.group/t/rfc-extending-declarative-assembly-format-to-support-order-independent-variadic-segments/4388 | RFC: Extending Declarative Assembly Format to support order-independent variadic segments ]]. This element supports optional order-independent variadic segments for operations. This will allow OpenACC and OpenMP Dialects to have similar and relaxed requirements while encouraging the use of Declarative Assembly Format and avoiding code duplication. An oilist element parses grammar of the form: ``` clause-list := clause clause-list | empty clause := `keyword` clause1 | `otherKeyword` clause2 clause1 := <assembly-format element> clause2 := <assembly-format element> ``` AssemblyFormat specification: ``` let assemblyFormat = [{ oilist( `keyword` clause1 | `otherkeyword` clause2 ... ) }]; ``` Example: ``` oilist( `private` `(` $arg0 `:` type($arg0) `)` | `nowait` | `reduction` custom<ReductionClause>($arg1, type($arg1))) oilist( `private` `=` $arg0 `:` type($arg0) | `reduction` `=` $arg1 `:` type($arg1) | `firstprivate` `=` $arg3 `:` type($arg2)) ``` Reviewed By: Mogball, rriddle Differential Revision: https://reviews.llvm.org/D115215	2022-02-17 11:10:24 +05:30
Eugene Zhulenev	abe2dee5eb	[mlir] NFC Async: always use 'b' for the current builder Currently some of the nested IR building inconsistently uses `nb` and `b`, it's very easy to call wrong builder outside of the current scope, so for simplicity all builders are always called `b`, and in nested IR building regions they just shadow the "parent" builder. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D120003	2022-02-16 21:20:53 -08:00
Damian Rouson	99dd49cf97	[flang] add semantics test for sync all Test a range of acceptable forms of SYNC ALL statements, including combinations with and without the stat-variable and errmsg-variable present. Also test that several invalid forms of SYNC ALL call generate the correct error messages. Differential Revision: https://reviews.llvm.org/D114181	2022-02-16 20:43:01 -08:00
Serguei Katkov	194899caef	[MemoryDependency] Relax the re-ordering of atomic store and unordered load/store Atomic store with Release semantic allows re-ordering of unordered load/store before the store. Implement it. Reviewers: reames Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D119844	2022-02-17 10:53:25 +07:00
Jim Ingham	05f10ae0d8	On Windows, the function name contains the return parameter, so the test has to be "function name contains the name we used to specify the breakpoint" not IS the name...	2022-02-16 19:10:03 -08:00
Alex Brachet	64f5f6d759	[libc] Use '+' constraint on inline assembly As suggested by @mcgrathr in D118099 Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D119978	2022-02-17 03:00:17 +00:00
Jim Ingham	dd8490d207	Add a test for breaking on overloaded functions by name.	2022-02-16 18:12:00 -08:00
V Donaldson	d8364e3ea4	[flang] Allow tabs as white space in formats The Fortran standard views blanks in IO formats as white space in non-string contexts. Other compilers extend this to also view horizontal tabs as white space. Some compilers additionally add other white space characters to this group. Add recognition of horizontal and vertical tabs to runtime format validation code to match what the runtime code currently does.	2022-02-16 17:49:52 -08:00
Aart Bik	34381a76c1	[mlir][sparse] avoid some codeup in sparsification transformation A very small refactoring, but a big impact on tests that expect an exact order. This revision fixes the tests, but also makes them less brittle for similar minor changes in the future! Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D119992	2022-02-16 17:39:04 -08:00
Amara Emerson	c8b8c8e989	[AArch64][GlobalISel] Implement support for clang.arc.attachedcall call operand bundles. Differential Revision: https://reviews.llvm.org/D119983	2022-02-16 17:35:22 -08:00
Yonghong Song	3671bdbcd2	[BPF] Fix a BTF type pruning bug In BPF backend, BTF type generation may skip some debuginfo types if they are the pointee type of a struct member. For example, struct task_struct { ... struct mm_struct *mm; ... }; BPF backend may generate a forward decl for 'struct mm_struct' instead of full type if there is no other usage of 'struct mm_struct'. The reason is to avoid bringing too many unneeded types in BTF. Alexei found a pruning bug where we may miss some full type generation. The following is an illustrating example: struct t1 { ... } struct t2 { struct t1 *p; }; struct t2 g; void foo(struct t1 *arg) { ... } In the above case, we will have partial debuginfo chain like below: struct t2 -> member p \ -> ptr -> struct t1 / foo -> argument arg During traversing struct t2 -> member p -> ptr -> struct t1 The corresponding BTF types are generated except 'struct t1' which will be in FixUp stage. Later, when traversing foo -> argument arg -> ptr -> struct t1 The 'ptr' BTF type has been generated and the current implementation ignores 'pointer' type hence 'struct t1' is not generated. This patch fixed the issue not just for the above case, but for general case with multiple derived types, e.g., struct t2 -> member p \ -> const -> ptr -> volatile -> struct t1 / foo -> argument arg Differential Revision: https://reviews.llvm.org/D119986	2022-02-16 17:23:34 -08:00
Alina Sbirlea	21aaa1fb22	[bazel] Add libc dependency.	2022-02-16 17:15:45 -08:00
Jez Ng	69297cf639	[lld-macho] Don't include CommandFlags.h in CommonLinkerContext.h Main motivation: including `llvm/CodeGen/CommandFlags.h` in `CommonLinkerContext.h` means that the declaration of `llvm::Reloc` is visible in any file that includes `CommonLinkerContext.h`. Since our cpp files have both `using namespace llvm` and `using namespace lld::macho`, this results in conflicts with `lld::macho::Reloc`. I suppose we could put `llvm::Reloc` into a nested namespace, but in general, I think we should avoid transitively including too many header files in a very widely used header like `CommonLinkerContext.h`. RegisterCodeGenFlags' ctor initializes a bunch of function-`static` structures and does nothing else, so it should be fine to "initialize" it as a temporary stack variable rather than as a file static. Reviewed By: aganea Differential Revision: https://reviews.llvm.org/D119913	2022-02-16 20:05:07 -05:00
Daniil Suchkov	a99989529e	[RewriteStatepointsForGC] Add a test exposing an incorrect assertion	2022-02-17 00:22:46 +00:00
Heejin Ahn	c60d822965	[WebAssembly] Make __wasm_lpad_context thread-local This makes `__wasm_lpad_context`, a struct that is used as a communication channel between compiler-generated code and personality function in libunwind, thread local. The library code will be changed to thread local in the emscripten side. Reviewed By: sbc100, tlively Differential Revision: https://reviews.llvm.org/D119803	2022-02-16 15:56:38 -08:00
Matt Arsenault	3884cb9235	AMDGPU: Always reserve VGPR for AGPR copies on gfx908 Just because there aren't AGPRs in the original program doesn't mean the register allocator can't choose to use them (unless we were to forcibly reserve all AGPRs if there weren't any uses). This happens in high pressure situations and introduces copies to avoid spills. In this test, the allocator ends up introducing a copy from SGPR to AGPR which requires an intermediate VGPR. I don't believe it would introduce a copy from AGPR to AGPR in this situation, since it would be trying to use an intermediate with a different class. Theoretically this is also broken on gfx90a, but I have been unable to come up with a testcase.	2022-02-16 18:48:18 -05:00
Jacques Pienaar	6e2cf33b24	[mlir][doc] Add passes docs to Passes.md	2022-02-16 15:42:20 -08:00
Florian Mayer	c195addb60	[NFC] [MTE] [HWASan] Remove unnecessary member of AllocaInfo Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D119981	2022-02-16 15:19:30 -08:00
Jacob Lambert	7470244475	[AMDGPU] Add agpr_count to metadata and AsmParser gfx90a allows the number of ACC registers (AGPRs) to be set independently to the VGPR registers. For both HSA and PAL metadata, we now include an "agpr_count" key to report the number of AGPRs set for supported devices (gfx90a, gfx908, as determined by hasMAIInsts()). This is collected from SIProgramInfo.NumAccVGPR for both HSA and PAL. The AsmParser also now recognizes ".kernel.agpr_count" for supported devices. Differential Revision: https://reviews.llvm.org/D116140	2022-02-16 15:17:23 -08:00
Nikolas Klauser	318507edee	[libc++] Remove a few unneeded _LIBCPP_CXX03_LANG ifdefs Reviewed By: Quuxplusone, ldionne, #libc Spies: libcxx-commits Differential Revision: https://reviews.llvm.org/D119896	2022-02-17 00:14:42 +01:00
Arthur Eubanks	826fae51d2	[SLPVectorizer][OpaquePtrs] Check GEP source element type Fixes a miscompile with opaque pointers. Reviewed By: #opaque-pointers, nikic Differential Revision: https://reviews.llvm.org/D119980	2022-02-16 14:47:20 -08:00
Arthur Eubanks	4e24397805	[test][SLPVectorizer][OpaquePtr] Precommit test	2022-02-16 14:47:20 -08:00
Eugene Zhulenev	b171583ae7	[mlir] Async: create async.group inside the scf.if branch Reviewed By: cota Differential Revision: https://reviews.llvm.org/D119959	2022-02-16 14:47:04 -08:00
Sam Clegg	dabbab6861	[lld][WebAssembly] Apply global relocs before data relocs Since the code for applying data relocations can sometimes use the values stored in the globals, they need to be relocated before the data relocations can be run. Fixes: https://github.com/emscripten-core/emscripten/issues/13398 Differential Revision: https://reviews.llvm.org/D119666	2022-02-16 14:30:39 -08:00
Marek Kurdej	ef39235cb9	[clang-format] Make checking for a record more robust and avoid a loop.	2022-02-16 23:05:49 +01:00
Johannes Doerfert	8ad39fbaf2	[Attributor][FIX] Heap2Stack needs to use the alloca AS When we move an allocation from the heap to the stack we need to allocate it in the alloca AS and then cast the result. This also prevents us from inserting the alloca after the allocation call but rather right before. Fixes https://github.com/llvm/llvm-project/issues/53858	2022-02-16 15:58:32 -06:00
Johannes Doerfert	668c5c688b	[Attributor][FIX] Use liveness information of the right function When we use liveness for edges during the `genericValueTraversal` we need to make sure to use the AAIsDead of the correct function. This patch adds the proper logic and some simple caching scheme. We also add an assertion to the `isEdgeDead` call to make sure future misuse is detected earlier. Fixes https://github.com/llvm/llvm-project/issues/53872	2022-02-16 15:58:32 -06:00

415377 Commits