intel/llvm - llvm - Gitea: Git with a cup of tea

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-02-05 13:21:04 +08:00

Author	SHA1	Message	Date
Siva Chandra	53fcf6bb62	[libc][aarch64] Enable a bunch of math functions. Namely, these are the functions enabled: rint, lrint, llrint, lround, llround, nearbyint. They were previously not enabled because they required rounding mode and FP exception support. Now that rounding mode and FP exception support is available for Aarch64, they can be enabled.	2021-02-05 15:11:17 -08:00
Lei Zhang	7630520ae3	[mlir][vector] Add pattern to shuffle bitcast ops These patterns move vector.bitcast ops to be before insert ops or after extract ops where suitable. With them, bitcast will happen on smaller vectors and there are more chances to share extract/insert ops. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D96040	2021-02-05 17:52:49 -05:00
Fangrui Song	a4fa667dee	[libc++abi] Disable _Unwind_ForcedUnwind + exception tests for ARM EHABI libunwind ARM EHABI does not support _Unwind_ForcedUnwind yet. In addition, ARM EHABI makes `_Unwind_Exception` a typedef so `struct _Unwind_Exception*` cannot be used.	2021-02-05 14:12:27 -08:00
Sam Clegg	38a285885d	[clang][emscripten] Add builtin define for __EMSCRIPTEN_PTHREADS__ Currently the emscripten frontend driver injects this when building with thread support. Moving this into the clang driver itself makes the emscripten python driver less magical. Differential Revision: https://reviews.llvm.org/D96171	2021-02-05 13:53:05 -08:00
Wouter van Oortmerssen	5e5b2cb131	[WebAssembly] Prevent data inside text sections in assembly This is not supported in Wasm, unless the data was encoded instructions, but that wouldn't work with the assembler's other functionality (enforcing nesting etc.). Fixes: https://bugs.llvm.org/show_bug.cgi?id=48971 Differential Revision: https://reviews.llvm.org/D95838	2021-02-05 13:48:25 -08:00
Nico Weber	5f76044c25	[gn build] enable new pass manager more, follow-up to `39ceb5c9cf`	2021-02-05 16:15:14 -05:00
Aaron Ballman	ec04e2850a	Allow SmallPtrSet to be used with a std::insert_iterator Currently, the SmallPtrSet type allows inserting elements but it does not support inserting elements with a positional hint. The lack of this signature means that you cannot use SmallPtrSet with std::insert_iterator or std::inserter(), which makes some code constructs more awkward. This adds an overload of insert() that can be used in these scenarios. The positional hint is unused by SmallPtrSet and the call is equivalent to calling insert() without a hint.	2021-02-05 16:12:47 -05:00
Petr Hosek	9fd9b5a9c9	Don't emit coverage mapping for excluded functions When a function or a file is excluded using -fprofile-list= option, don't emit coverage mapping as doing so confuses users since those functions would always have zero count. This also reduces the binary size considerably in cases where only a few functions or files are being instrumented. Differential Revision: https://reviews.llvm.org/D96000	2021-02-05 13:03:57 -08:00
Sterling Augustine	a34b8b879e	Various minor fixes for python 3 Switch StdTuple printer from python 2-style "next" to python 3. Nested iteration changed enough to make the original bitset iteration code a bit trickier than it needs to be, so unnest. The end node of a map iterator is sometimes hard to detect in isolation, don't fail in that case. Differential Revision: https://reviews.llvm.org/D96167	2021-02-05 13:01:34 -08:00
Martin Storsjö	d4f4e723d0	[libcxx] Implement temp_directory_path using GetTempPath on windows This does roughly the same as the manual implementation, but checks a slightly different set of environment variables and has a more appropriate fallback if no environment variables are available (/tmp isn't a very useful fallback on windows). Differential Revision: https://reviews.llvm.org/D91175	2021-02-05 22:47:33 +02:00
Martin Storsjö	4d292d531b	[libcxx] Use the posix code for directory_entry::__do_refresh This works just fine for windows, as all the functions it calls are implemented and wrapped for windows. Differential Revision: https://reviews.llvm.org/D91173	2021-02-05 22:47:33 +02:00
Eric Schweitz	f6342806db	[flang][fir] Add FIR's vector type. This patch adds support for `!fir.vector`, a rank one, constant length data type. https://github.com/flang-compiler/f18-llvm-project/pull/413 Differential Revision: https://reviews.llvm.org/D96162	2021-02-05 12:44:19 -08:00
Arthur O'Dwyer	85167fb7c2	[libc++] Further improve the contiguous-iterator story, and fix some bugs. - Quality-of-implementation: Avoid calling __unwrap_iter in constexpr contexts. The user might conceivably write a contiguous iterator where normal iterator arithmetic is constexpr-friendly but `std::to_address(it)` isn't. - Bugfix: When you pass contiguous iterators to `std::copy`, you should get back your contiguous iterator type, not a raw pointer. That means that libc++ can't `__unwrap_iter` unless it also does `__rewrap_iter`. Fortunately, this is implementable. - Improve test coverage of the new `contiguous_iterator` test iterator. This catches the bug described above. - Tests: Stop testing that we can `std::copy` //into// an `input_iterator`. Our test iterators may currently support that, but it seems nonsensical to me. Differential Revision: https://reviews.llvm.org/D95983	2021-02-05 15:18:04 -05:00
Yaxun (Sam) Liu	b008ea304d	[CUDA][HIP] Fix device variable linkage For -fgpu-rdc, shadow variables should not be internalized, otherwise they cannot be accessed by other TUs. This is necessary because the shadow variable of external device variables are always emitted as undefined symbols, which need to resolve to a global symbols. Managed variables need to be emitted as undefined symbols in device compilations. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D95901	2021-02-05 15:11:12 -05:00
Sanjay Patel	c981f6f8e1	Revert "[Codegen][ReplaceWithVecLib] add pass to replace vector intrinsics with calls to vector library" This reverts commit `2303e93e66`. Investigating bot failures.	2021-02-05 15:10:11 -05:00
Craig Topper	3c767b96dc	[RISCV] Correct types in tablegen multiclasses found by D95874.	2021-02-05 11:55:58 -08:00
Arthur Eubanks	526c0955c0	[NVPTX][NewPM] Temporarily disable NVPTX passes in new PM pipeline These passes are causing numerical discrepancies after being added to the pipeline. Disable while investigating. Reviewed By: rupprecht Differential Revision: https://reviews.llvm.org/D96166	2021-02-05 11:31:07 -08:00
zoecarver	fab194898b	[lic++][docs] Explain noexcept policy for narrow contracts. Adds documentation around libc++'s policy to add noexcept to things that cannot throw but aren't marked as noexcept. Refs LWG 3518 and D95251. Differential Revision: https://reviews.llvm.org/D95821	2021-02-05 11:27:19 -08:00
Sanjay Patel	8d9527a0bf	[PhaseOrdering] add test to show combined result of reassociate+instcombine+vectorizers; NFC	2021-02-05 14:25:19 -05:00
Sanjay Patel	3d10a0bdab	[Reassociate] add test for shl+or; NFC	2021-02-05 14:25:19 -05:00
Lukas Sommer	2303e93e66	[Codegen][ReplaceWithVecLib] add pass to replace vector intrinsics with calls to vector library This patch adds a pass to replace calls to vector intrinsics (i.e., LLVM intrinsics operating on vector operands) with calls to a vector library. Currently, calls to LLVM intrinsics are only replaced with calls to vector libraries when scalar calls to intrinsics are vectorized by the Loop- or SLP-Vectorizer. With this pass, it is now possible to replace calls to LLVM intrinsics already operating on vector operands, e.g., if such code was generated by MLIR. For the replacement, information from the TargetLibraryInfo, e.g., as specified via -vector-library is used. Differential Revision: https://reviews.llvm.org/D95373	2021-02-05 14:25:19 -05:00
Wouter van Oortmerssen	e3c0b0fe09	[WebAssembly] locals can now be indirect in DWARF This for example to indicate that byval args are represented by a pointer to a struct. Followup to https://reviews.llvm.org/D94140 Differential Revision: https://reviews.llvm.org/D94347	2021-02-05 11:14:42 -08:00
Eric Schweitz	a1a1d338e9	[flang][NFC] Make KindTy consistent and consistently used. Differential Revision: https://reviews.llvm.org/D96154	2021-02-05 10:54:58 -08:00
Eric Schweitz	ea35745610	[flang][NFC] Update comments. Differential Revision: https://reviews.llvm.org/D96152	2021-02-05 10:54:58 -08:00
Siva Chandra	c90c8d38d3	[libc] Add aarch64 flavors of floor, round, sqrt and trunc. Only single and double precision flavors have been added. Reviewed By: lntue, sdesmalen Differential Revision: https://reviews.llvm.org/D95999	2021-02-05 10:41:32 -08:00
Thomas Preud'homme	00a62547da	Stop traping on sNaN in __builtin_isnan __builtin_isnan currently generates a floating-point compare operation which triggers a trap when faced with a signaling NaN in StrictFP mode. This commit uses integer operations instead to not generate any trap in such a case. Reviewed By: kpn Differential Revision: https://reviews.llvm.org/D95948	2021-02-05 18:28:48 +00:00
Amy Huang	34f3249abd	[DebugInfo] Fix error from D95893, where I accidentally used an unsigned int in a loop and it wraps around. Follow up to https://reviews.llvm.org/D95893	2021-02-05 10:25:21 -08:00
Huihui Zhang	1b81117f88	[DAGCombiner][SVE] Fix invalid use of getVectorNumElements() in visitSRA. Make sure scalable property is preserved by using getVectorElementCount(). Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D95967	2021-02-05 09:56:49 -08:00
Amy Huang	a740af4de9	[CodeView][DebugInfo] Update the code for removing template arguments from the display name of a codeview function id. Previously the code split the string at the first '<', which incorrectly truncated names like `operator<`. Differential Revision: https://reviews.llvm.org/D95893	2021-02-05 09:49:11 -08:00
Jianzhou Zhao	0f3fd3b281	[dfsan] Add thread registration This is a part of https://reviews.llvm.org/D95835. This change is to address two problems 1) When recording stacks in origin tracking, libunwind is not async signal safe. Inside signal callbacks, we need to use fast unwind. Fast unwind needs threads 2) StackDepot used by origin tracking is not async signal safe, we set a flag per thread inside a signal callback to prevent from using it. The thread registration is similar to ASan and MSan. Related MSan changes are * `98f5ea0dba` * `f653cda269` * `5a7c364343` Some changes in the diff are used in the next diffs 1) The test case pthread.c is not very interesting for now. It will be extended to test origin tracking later. 2) DFsanThread::InSignalHandler will be used by origin tracking later. Reviewed-by: morehouse Differential Revision: https://reviews.llvm.org/D95963	2021-02-05 17:38:59 +00:00
Fangrui Song	5f4d7b2f0a	[ELF] Improve --icf=safe diagnostic The current diagnostic has confused users. The new wording is adapted from one suggested by Ian Lance Taylor. Differential Revision: https://reviews.llvm.org/D95917	2021-02-05 09:37:37 -08:00
Wen-Heng (Jack) Chung	04766c401b	[AMDGPU] Add Fiji target in fptosi/fptoui instruction-select MIR tests. In response to review comments in D95964, add a target with f16 instructions. Differential Revision: https://reviews.llvm.org/D96061	2021-02-05 11:33:54 -06:00
Arthur Eubanks	39ceb5c9cf	[gn build] Turn on new pass manager by default Matches cmake build	2021-02-05 09:11:07 -08:00
Arthur O'Dwyer	5d9565634c	Revert "Revert "[libc++] [P0879] constexpr std::nth_element, and rewrite its tests."" This reverts commit `b6ffece320`. The bug is now fixed (it was a stupid cut-and-paste kind of error), and the regression test added. The new patch is also simpler than the old one! Differential Revision: https://reviews.llvm.org/D96084	2021-02-05 12:02:43 -05:00
Simon Pilgrim	bd1009224a	[MSP430] Fix multiclass template parameter types. NFC. Fixes TableGen parser errors reported by D95874.	2021-02-05 16:57:07 +00:00
Sam McCall	fed995ef90	[clangd] Trace queue state for each TUScheduler action. The new trace event includes what's already in the queue when adding. For tracers that follow contexts, the trace event will span the time that the action spends in the queue. For tracers that follow threads, the trace will be a tiny span on the enqueuing thread. Differential Revision: https://reviews.llvm.org/D96027	2021-02-05 17:56:51 +01:00
Jonas Devlieghere	e5228ef556	[lldb] Emit type annotation in SWIG generated Python code. The Python code generated by SWIG is compatible with both Python 2 and Python 3. The -py3 option enables Python 2 incompatible features such as function annotations and abstract base classes. Differential revision: https://reviews.llvm.org/D96096	2021-02-05 08:51:22 -08:00
Denys Shabalin	1d0b02368e	Drop LLVM_i1, LLVM_i8, LLVM_i32, LLVM_IntBase types Those types are not needed any longer since LLVM dialect has migrated to using MLIR's I1, I8, I32 types directly. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D96127	2021-02-05 17:33:16 +01:00
Michael Liao	01bf529db2	Recommit of `a2fdf9d4d7`. - The failures are all cc1-based tests due to the missing `-aux-triple` options, which is always prepared by the driver in CUDA/HIP compilation. - Add extra check on the missing aux-targetinfo to prevent crashing. [hip][cuda] Enable extended lambda support on Windows. - On Windows, extended lambda has extra issues due to the numbering schemes are different between the host compilation (Microsoft C++ ABI) and the device compilation (Itanium C++ ABI. Additional device side lambda number is required per lambda for the host compilation to correctly mangle the device-side lambda name. - A hybrid numbering context `MSHIPNumberingContext` is introduced to number a lambda for both host- and device-compilations. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D69322 This reverts commit `4874ff0241`.	2021-02-05 11:27:30 -05:00
Sidharth Baveja	22ebbc4765	LoopUnrollAndJam] Only allow loops with single exit(ing) blocks Summary: This resolves an issue posted on Bugzilla. https://bugs.llvm.org/show_bug.cgi?id=48764 In this issue, the loop had multiple exit blocks, which resulted in the function getExitBlock to return a nullptr, which resulted in hitting the assert. This patch ensures that loops which only have one exit block as allowed to be unrolled and jammed. Reviewed By: Whitney, Meinersbur, dmgreen Differential Revision: https://reviews.llvm.org/D95806	2021-02-05 16:10:53 +00:00
Anton Zabaznov	d88c55ab95	[OpenCL] Add macro definitions of OpenCL C 3.0 features This patch adds possibility to define OpenCL C 3.0 feature macros via command line option or target setting. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D95776	2021-02-05 18:42:25 +03:00
Stephen Kelly	4cbea09431	[ASTMatchers] Fix segfault when Action is nullptr It can be nullptr in unit tests.	2021-02-05 15:17:13 +00:00
Yvan Roux	38f28f4621	[Test] Fix test case demonstrating oddities in behavior of LSR. Add x86 targets requirement to fix bots which are not building these targets.	2021-02-05 16:03:24 +01:00
Paul Robinson	a0749f9bcc	[RGT][ProfileData] Correct a test assertion Found by the Rotten Green Tests project. Differential Revision: https://reviews.llvm.org/D95258	2021-02-05 06:55:08 -08:00
Sven van Haastregt	42e018997b	[OpenCL] Add cl_khr_subgroup_shuffle* to TableGen BIFs Add the builtin functions brought by the cl_khr_subgroup_shuffle and cl_khr_subgroup_shuffle_relative extensions to `-fdeclare-opencl-builtins`.	2021-02-05 14:16:13 +00:00
Lei Zhang	8dae90997a	[mlir][vector] Add constant folding for fp16 to fp32 bitcast Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D96041	2021-02-05 09:12:50 -05:00
Lei Zhang	9f622b3d5d	[mlir][spirv] Add more vector conversion patterns This patch introduces a few more straightforward patterns to convert vector ops operating on 1-4 element vectors to their corresponding SPIR-V counterparts. This patch also enables converting vector<1xT> to T. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D96042	2021-02-05 09:11:16 -05:00
Akira Hatanaka	4a64d8fe39	[ObjC][ARC] Use operand bundle 'clang.arc.rv' instead of explicitly emitting retainRV or claimRV calls in the IR This reapplies `3fe3946d9a` without the changes made to lib/IR/AutoUpgrade.cpp, which was violating layering. Original commit message: Background: This patch makes changes to the front-end and middle-end that are needed to fix a longstanding problem where llvm breaks ARC's autorelease optimization (see the link below) by separating calls from the marker instructions or retainRV/claimRV calls. The backend changes are in https://reviews.llvm.org/D92569. https://clang.llvm.org/docs/AutomaticReferenceCounting.html#arc-runtime-objc-autoreleasereturnvalue What this patch does to fix the problem: - The front-end adds operand bundle "clang.arc.rv" to calls, which indicates the call is implicitly followed by a marker instruction and an implicit retainRV/claimRV call that consumes the call result. In addition, it emits a call to @llvm.objc.clang.arc.noop.use, which consumes the call result, to prevent the middle-end passes from changing the return type of the called function. This is currently done only when the target is arm64 and the optimization level is higher than -O0. - ARC optimizer temporarily emits retainRV/claimRV calls after the calls with the operand bundle in the IR and removes the inserted calls after processing the function. - ARC contract pass emits retainRV/claimRV calls after the call with the operand bundle. It doesn't remove the operand bundle on the call since the backend needs it to emit the marker instruction. The retainRV and claimRV calls are emitted late in the pipeline to prevent optimization passes from transforming the IR in a way that makes it harder for the ARC middle-end passes to figure out the def-use relationship between the call and the retainRV/claimRV calls (which is the cause of PR31925). - The function inliner removes an autoreleaseRV call in the callee if nothing in the callee prevents it from being paired up with the retainRV/claimRV call in the caller. It then inserts a release call if the call is annotated with claimRV since autoreleaseRV+claimRV is equivalent to a release. If it cannot find an autoreleaseRV call, it tries to transfer the operand bundle to a function call in the callee. This is important since ARC optimizer can remove the autoreleaseRV returning the callee result, which makes it impossible to pair it up with the retainRV/claimRV call in the caller. If that fails, it simply emits a retain call in the IR if the implicit call is a call to retainRV and does nothing if it's a call to claimRV. Future work: - Use the operand bundle on x86-64. - Fix the auto upgrader to convert call+retainRV/claimRV pairs into calls annotated with the operand bundles. rdar://71443534 Differential Revision: https://reviews.llvm.org/D92808	2021-02-05 06:09:42 -08:00
Arnold Schwaighofer	8a7f5ad0fd	We can only move static allocas into the resume entry points Dynamic allocas that still exist have been verified to be only used 'locally' not accross a suspend point. rdar://73903220 Differential Revision: https://reviews.llvm.org/D96071	2021-02-05 06:06:10 -08:00
Stephen Kelly	2cba22c23a	[clang-tidy] Simplify implementation of container-size-empty Use IgnoreUnlessSpelledInSource to make the matcher code smaller and more visibly-related to the code. Differential Revision: https://reviews.llvm.org/D91303	2021-02-05 14:03:41 +00:00

1 2 3 4 5 ...

379193 Commits