intel/llvm - llvm - Gitea: Git with a cup of tea

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-25 19:44:38 +08:00

Author	SHA1	Message	Date
Sergei Lebedev	3737712dae	Slightly improved ir.pyi type annotations (#76728 ) * Replaced `Any` with static types where appropriate * Removed undocumented `__str__` and `__repr__` -- these are always defined via `object`	2024-01-04 09:49:57 +01:00
Andrzej Warzyński	f8c034140b	[mlir][docs] Update TD tutorial - Ch0 (#76858 ) Updates `generic` as `linalg.generic` (for consistency and to avoid ambiguity) and a few other fixes.	2024-01-04 09:48:44 +01:00
Aiden Grossman	232be5129b	[NFC][llvm-exegesis] Fix comment text I'm reasonably confident iff wasn't an abbreviation for if and only if here and it was starting to bug me quite a bit.	2024-01-04 00:40:30 -08:00
gmh	f3f4387e02	[lldb][NFC] Fix compilation issue on windows (#76453 )	2024-01-04 08:39:50 +00:00
bgra8	8c72ff716b	[NFC] Renames a template parameter to avoid clashes with userspace names. (#76829 ) Co-authored-by: Bogdan Graur <bgraur@google.com>	2024-01-04 09:25:57 +01:00
Ben Shi	18c0f59b3e	[clang][analyzer] Support 'fdopen' in the StreamChecker (#76776 )	2024-01-04 15:52:52 +08:00
Ben Shi	75365b2e18	[clang][AVR] Restrict range of assembly constraint 'G' (#76561 ) According to https://www.nongnu.org/avr-libc/user-manual/inline_asm.html, 'G' only represents floating point constant '0.0'. And avr-gcc also rejects other non-zero FP values.	2024-01-04 15:51:57 +08:00
Dominik Adamski	0cdaadf15a	[libomptarget][flang] Explicitly pass the OpenMP device libraries to tests (#76796 ) This pull request is a follow-up of patch: https://github.com/llvm/llvm-project/pull/68225 and it explicitly specifies OpenMP device libraries for Fortran OpenMP tests.	2024-01-04 08:45:34 +01:00
Phoebe Wang	176c341198	[X86][BF16] Add 32-bit tests to show ABI problem, NFC	2024-01-04 15:43:34 +08:00
David Green	5550e9c841	[GlobalISel][AArch64] Add libcall lowering for fpowi. (#67114 ) This adds legalization, notably libcall lowering for fpowi. It is a little different to other methods as the function takes both a float and integer register. Otherwise all vectors get scalarized and fp16 is promoted to fp32.	2024-01-04 07:26:23 +00:00
sstipanovic	55395f5c83	[AMDGPU] Remove `nosync` from image atomic intrinsics. (#76814 ) Remove `nosync` as discussed in https://github.com/llvm/llvm-project/pull/73613	2024-01-04 08:22:05 +01:00
Nilanjana Basu	cd28da390f	[LV] Change loops' interleave count computation (#73766 ) [LV] Change loops' interleave count computation A set of microbenchmarks in llvm-test-suite (https://github.com/llvm/llvm-test-suite/pull/56), when tested on a AArch64 platform, demonstrates that loop interleaving is beneficial when the vector loop runs at least twice or when the epilogue loop trip count (TC) is minimal. Therefore, we choose interleaving count (IC) between TC/VF & TC/2*VF (VF = vectorization factor), such that remainder TC for the epilogue loop is minimum while the IC is maximum in case the remainder TC is same for both. The initial tests for this change were submitted in PRs: https://github.com/llvm/llvm-project/pull/70272 and https://github.com/llvm/llvm-project/pull/74689.	2024-01-04 12:45:22 +05:30
Yingwei Zheng	0ce193708c	[InstCombine] Refactor folding of commutative binops over select/phi/minmax (#76692 ) This patch cleans up the duplicate code for folding commutative binops over `select/phi/minmax`. Related commits: + select support: `88cc35b27e` + phi support: `8674a023bc` + minmax support: `624973806c`	2024-01-04 15:11:28 +08:00
Craig Topper	80889ae029	[RISCV] Remove RISCVISD::VSELECT_VL. (#76866 ) We can use RISCVISD::VMERGE_VL with an undef passthru operand. I had to rewrite the FMA patterns to handle both undef and non-undef cases so we can get the tail policy.	2024-01-03 21:31:07 -08:00
Owen Pan	42ec976184	[clang-format] Optimize processing .clang-format-ignore files (#76733 ) Reuse the patterns governing the previous input file being formatted if the current input file is from the same directory.	2024-01-03 21:12:28 -08:00
Jacques Pienaar	6ae7f66ff5	[mlir] Add config for PDL (#69927 ) Make it so that PDL in pattern rewrites can be optionally disabled. PDL is still enabled by default and not optional bazel. So this should be a NOP for most folks, while enabling other to disable. This only works with tests disabled. With tests enabled this still compiles but tests fail as there is no lit config to disable tests that depend on PDL rewrites yet.	2024-01-03 20:37:20 -08:00
Mircea Trofin	cda388c440	[mlgo] Fix test post PR #76697 Opcode values changed, trivial fix.	2024-01-03 19:38:03 -08:00
Jerry Wu	53edf12e52	[mlir] Add `res()` method to `linalg::ContractionOpInterface` (#76539 ) In addition to `lhs()` and `rhs()` to return left and right operands, add `res()` to return the result value.	2024-01-03 22:34:19 -05:00
XinWang10	ddf0096a92	[NFC][X86] Reorg MC tests for APX promoted instrs (#76697 ) As suggested in https://github.com/llvm/llvm-project/pull/76210, this patch re-organize the mc tests for apx promoted instrs, instr tests within same cpuid would be listed in one test. Also add explicit prefix {evex} tests and 8 displacement memory test, promoted instrs need set No_CD8 to avoid AVX512 compress encoding.	2024-01-04 10:42:27 +08:00
Shengchen Kan	dd9681f839	[X86][MC] Support encoding/decoding for APX variant INC/DEC/ADCX/ADOX instructions (#76721 ) Four variants: promoted legacy, ND (new data destination), NF (no flags update) and NF_ND (NF + ND). The syntax of NF instructions is aligned with GNU binutils. https://sourceware.org/pipermail/binutils/2023-September/129545.html	2024-01-04 10:12:12 +08:00
Micah Weston	7df28fd61a	[SHT_LLVM_BB_ADDR_MAP][AsmPrinter] Implements PGOAnalysisMap emitting in AsmPrinter with tests. (#75202 ) Uses machine analyses to emit PGOAnalysisMap into the bb-addr-map ELF section. Implements filecheck tests to verify emitting new fields. This patch emits optional PGO related analyses into the bb-addr-map ELF section during AsmPrinter. This currently supports Function Entry Count, Machine Block Frequencies. and Machine Branch Probabilities. Each is independently enabled via the `feature` byte of `bb-addr-map` for the given function. A part of [RFC - PGO Accuracy Metrics: Emitting and Evaluating Branch and Block Analysis](https://discourse.llvm.org/t/rfc-pgo-accuracy-metrics-emitting-and-evaluating-branch-and-block-analysis/73902).	2024-01-03 19:17:44 -05:00
YunQiang Su	ddfbca8b08	Clang/MIPS: Use -mnan value for -mabs if not specified (#71157 ) On most hardware, FCSR.ABS2008 is set the value same with FCSR.NAN2008. Let's use this behaivor by default. With this commit, `clang -target mips -mnan=2008 -c fabs.c` will imply `-mabs=2008`. And of course, `clang -mnan=2008 -mabs=legacy` can continue workable like previous. Co-authored-by: YunQiang Su <yunqiang.su@cipunited.com>	2024-01-03 19:04:22 -05:00
Alex Langford	bdaedffc43	[lldb] Speculatively fix TestBreakpointSerialization on Windows See: https://lab.llvm.org/buildbot/#/builders/219/builds/7961/steps/6/logs/stdio	2024-01-03 15:50:25 -08:00
Nick Desaulniers	e68a0320a1	[libc] fix -Wcast-function-type via union rather than reinterpret_cast (#76875 ) The GCC build is producing the following diagnostic: llvm-project/libc/src/signal/linux/signal_utils.h: In member function ‘__llvm_libc_18_0_0_git::KernelSigaction& __llvm_libc_18_0_0_git::KernelSigaction::operator=(const sigaction&)’: llvm-project/libc/src/signal/linux/signal_utils.h:38:20: warning: cast between incompatible function types from ‘void ()(int, siginfo_t, void)’ to ‘void ()(int)’ [-Wcast-function-type] 38 \| sa_handler = reinterpret_cast<HandlerType >(sa.sa_sigaction); \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ llvm-project/libc/src/signal/linux/signal_utils.h: In member function ‘__llvm_libc_18_0_0_git::KernelSigaction::operator sigaction() const’: llvm-project/libc/src/signal/linux/signal_utils.h:51:25: warning: cast between incompatible function types from ‘void ()(int)’ to ‘void ()(int, siginfo_t, void)’ [-Wcast-function-type] 51 \| sa.sa_sigaction = reinterpret_cast<SiginfoHandlerType >(sa_handler); \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Two issues here: 1. Clang supports -Wcast-function-type, but not as part of the -Wextra group. 2. The existing implementation tried to work around the oddity that is the kernel's struct sigaction != POSIX via reinterpret_cast in a way that's not compatible with -Wcast-function-type. Just use a union which is well defined (and two function pointers are the same size.) Link: https://github.com/llvm/llvm-project/issues/76872 Fixes: https://github.com/llvm/llvm-project/issues/74617	2024-01-03 15:48:05 -08:00
Boian Petkantchin	7a4c49756d	[mlir][mesh] Use one type for mesh axis (#76830 ) Make all ops and attributes use the types MeshAxis and MeshAxesAttr instead of int16_t, int32_t, DenseI16ArrayAttr and DenseI32ArrayAttr.	2024-01-03 15:47:11 -08:00
Derek Schuff	428cf71ffa	Reland "[WebAssembly][Object]Use file offset as function symbol address for linked files (#76198 )" WebAssembly doesn't have a single virtual memory space the way other object formats or architectures do, so "addresses" mean different things depending on the context. Function symbol addresses in object files are offsets from the start of the code section. This is good for linking and relocation. However when dealing with linked binaries, offsets from the start of the file/module are more often used (e.g. for stack traces in browsers), and are more useful for use cases like binary size attribution. This PR changes Object to use the file offset instead of the section offset for function symbols, but only for linked (non-DSO) files. This is a reland of `fc5f51cf` with a fix for the MSan failure (it was not caused by this change, but it was revealed by the new tests).	2024-01-03 15:39:48 -08:00
Aiden Grossman	a25b66217f	[NFC][llvm-exegesis] Remove redundant register initial values argument This patch removes the redundant RegisterInitialValues parameter from assembleToStream and friends as it is included within the BenchmarkKey struct that is also passed to all the functions that need this information.	2024-01-03 15:25:21 -08:00
Erich Keane	3f2e670671	Revert "[Clang][Sema] Diagnose unexpanded packs in the template argument lists of function template specializations" (#76876 ) Reverts llvm/llvm-project#76677 See discussion here: https://github.com/llvm/llvm-project/pull/76677	2024-01-03 15:16:08 -08:00
Nicolai Hähnle	49b492048a	AMDGPU: Fix packed 16-bit inline constants (#76522 ) Consistently treat packed 16-bit operands as 32-bit values, because that's really what they are. The attempt to treat them differently was ultimately incorrect and lead to miscompiles, e.g. when using non-splat constants such as (1, 0) as operands. Recognize 32-bit float constants for i/u16 instructions. This is a bit odd conceptually, but it matches HW behavior and SP3. Remove isFoldableLiteralV216; there was too much magic in the dependency between it and its use in SIFoldOperands. Instead, we now simply rely on checking whether a constant is an inline constant, and trying a bunch of permutations of the low and high halves. This is more obviously correct and leads to some new cases where inline constants are used as shown by tests. Move the logic for switching packed add vs. sub into SIFoldOperands. This has two benefits: all logic that optimizes for inline constants in packed math is now in one place; and it applies to both SelectionDAG and GISel paths. Disable the use of opsel with v_dot* instructions on gfx11. They are documented to ignore opsel on src0 and src1. It may be interesting to re-enable to use of opsel on src2 as a future optimization. A similar "proper" fix of what inline constants mean could potentially be applied to unpacked 16-bit ops. However, it's less clear what the benefit would be, and there are surely places where we'd have to carefully audit whether values are properly sign- or zero-extended. It is best to keep such a change separate. Fixes: Corruption in FSR 2.0 (latent bug exposed by an LLPC change)	2024-01-04 00:10:15 +01:00
Alex Langford	49029f926d	[lldb] Fix breakpoint resolver serialization bug (#76766 ) BreakpointResolverAddress optionally can include the module name related to the address that gets resolved. Currently this will never work because it sets the name to itself (which is empty).	2024-01-03 15:02:37 -08:00
Yuxuan Chen	09de5e5c6d	[Clang] Fix failing CI with different test case attribute & host triple (#76863 ) As in title, `a8f4397426` broke CI due to the calling convention not available on certain targets. This patch uses a simpler calling convention and enables the test only when the attribute exists. It's verified that this test crashes the compiler before `a8f4397426` so it's the same effect as the previous test. Disabling the test on platforms that don't have the calling convention is fine because it's guarding against a frontend bug.	2024-01-03 14:43:10 -08:00
Ahmed Bougacha	155d5849da	[AArch64] Avoid jump tables in swiftasync clobber-live-reg test. NFC. The upstream test relies on jump-tables, which are lowered in dramatically different ways with later arm64e/ptrauth patches. Concretely, it's failing for at least two reasons: - ptrauth removes x16/x17 from tcGPR64 to prevent indirect tail-calls from using either register as the callee, conflicting with their usage as scratch for the tail-call LR auth checking sequence. In the 1/2_available_regs_left tests, this causes the MI scheduler to move the load up across some of the inlineasm register clobbers. - ptrauth adds an x16/x17-using pseudo for jump-table dispatch, which looks somewhat different from the regular jump-table dispatch codegen by itself, but also prevents compression currently. They seem like sensible changes. But they mean the tests aren't really testing what they're intented to, because there's always an implicit x16/x17 clobber when using jump-tables. This updates the test in a way that should work identically regardless of ptrauth support, with one exception, #1 above, which merely reorders the load/inlineasm w.r.t. eachother. I verified the tests still fail the live-reg assertions when applicable.	2024-01-03 13:51:46 -08:00
Andrzej Warzyński	39298b09ec	[mlir][docs] Capitalize "Transform" in "transform dialect" (#76840 ) A mix of "Transform dialect" and "transform dialect" is used ATM. This patch capitalizes the outstanding instances of "transform".	2024-01-03 21:33:11 +00:00
Craig Topper	bdcd7c0ba0	[DAGCombiner][RISCV] Preserve disjoint flag in folding (shl (or x, c1), c2) -> (or (shl x, c2), c1 << c2) (#76860 ) Since we are shifting both inputs to the original Or by the same amount and inserting zeros in the LSBs, the result should still be disjoint.	2024-01-03 13:14:13 -08:00
Muhammad Omair Javaid	a24c58140f	Revert "[mlir] Consider mlir-linalg-ods-gen as a tablegen tool in build (#75093 )" This reverts commit `9191ac0bdb`. Breaks build on following buildbot: https://lab.llvm.org/buildbot/#/builders/177/builds/27432	2024-01-04 02:01:16 +05:00
Florian Hahn	6dda74cc51	[VPlan] Use createSelect in adjustRecipesForReductions (NFCI). Simplify the code and rename Result->NewExitingVPV as suggested by @ayalz in https://github.com/llvm/llvm-project/pull/70253.	2024-01-03 20:54:10 +00:00
David Goldblatt	92e211ab33	[BasicAA] Enable separate storage hints by default (#76864 ) As requested in https://github.com/llvm/llvm-project/pull/76770#pullrequestreview-1801649466 A few months of experimentation in a large codebase did not reveal any significant build speed regressions, and `b07bf16` speeds up hint lookup even further. Co-authored-by: David Goldblatt <davidgoldblatt@meta.com>	2024-01-03 12:51:40 -08:00
Krzysztof Drewniak	ddd6acd7a8	[mlir][GPU] Expand LLVM function attribute copies (#76755 ) Expand the copying of attributes on GPU kernel arguments during LLVM lowering. Support copying attributes from values that are already LLVM pointers. Support copying attributes, like `noundef`, that aren't specific to (the pointer parts of) arguments.	2024-01-03 14:28:15 -06:00
Craig Topper	f64d1c810a	[RISCV] Add test cases for folding disjoint Or into a scalar load address. NFC After `47a1704ac9` we are able to reassociate a disjoint Or used as a GEP index to get the constant closer to a load to fold it. This is show by the first test. We are not able to do this if the GEP created a shift left to scale the index as the second test shows. To make this work, we need to preserve the disjoint flag when pulling the Or through the shift.	2024-01-03 12:17:57 -08:00
Craig Topper	47a1704ac9	[SelectionDAG][X86] Use disjoint flag in SelectionDAG::isADDLike. (#76847 ) Keep the haveNoCommonBitsSet check because we haven't started inferring the flag yet. I've added tests for two transforms, but these are not the only transforms that use isADDLike.	2024-01-03 11:54:29 -08:00
Krystian Stasiowski	7fbc1de989	[Clang][Sema] Diagnose unexpanded packs in the template argument lists of function template specializations (#76677 ) This diagnoses unexpanded packs in the _unqualified-id_ of a function template specialization's _declarator-id_. For example: ```cpp template<typename... Ts> struct A { template<typename U> void f(); template<> void f<Ts>(); // error: explicit specialization contains unexpanded parameter pack 'Ts' }; ``` I moved the handling of template-id's so it happens right after we determine whether we are declaring a function template/function template specialization so diagnostics are issued in lexical order.	2024-01-03 11:07:46 -08:00
Alexey Bataev	7c963fde16	[SLP]Use revectorized value for extracts from buildvector, beeing vectorized. If the insertelement instruction is vectorized, and the extractelement instruction from such insertelement also vectorized as part of the same tree, need to extract from the corresponding for insertelement vectorized value rather than original insertelement instruction.	2024-01-03 10:38:09 -08:00
Craig Hesling	ee94e548a1	[GitHub] Fix minor typos in .github/workflows (#76592 ) Fix one spelling typo and remove second newline from end of files.	2024-01-03 10:35:39 -08:00
Hassnaa Hamdi	e8df7e2c1a	[LLVM][AArch64][tblgen]: Make operand suffix case-insinsitive (#76808 ) Ex: `fdot v26.8H, v22.16B, v9.2B[0]` should be equivalent to `fdot v26.8h, v22.16b, v9.2b[0]`	2024-01-03 18:27:04 +00:00
max	b49e0ebedf	Revert "[mlir] Add config for PDL (#69927 )" This reverts commit `5930725c89`.	2024-01-03 12:16:19 -06:00
Jacques Pienaar	5930725c89	[mlir] Add config for PDL (#69927 ) Make it so that PDL in pattern rewrites can be optionally disabled. PDL is still enabled by default and not optional bazel. So this should be a NOP for most folks, while enabling other to disable. This is piped through mlir-tblgen invocation and that could be changed/avoided by splitting up the passes file instead. This only works with tests disabled. With tests enabled this still compiles but tests fail as there is no lit config to disable tests that depend on PDL rewrites yet.	2024-01-03 09:43:22 -08:00
Alex Zinenko	fc0fdd1ae2	[mlir] fix AsmPrinter after `c1eab57673` The change in `c1eab57673` fixed the behavior of `getDiscardableAttrDictionary` for ops that are not using properties to only return discardable attributes. AsmPrinter was relying on the wrong behavior when printing such ops in the generic form, assuming all attributes are discardable.	2024-01-03 17:37:41 +00:00
Puyan Lotfi	03e29a49d9	[mlir][Pass] Enable the option for reproducer generation without crashing (#75421 ) This PR adds API `makeReproducer` and cl::opt flag `--mlir-generate-reproducer=<filename>` in order to allow for mlir reproducer dumps even when the pipeline doesn't crash. This PR also decouples the code that handles generation of an MLIR reproducer from the crash recovery portion. The purpose is to allow for generating reproducers outside of the context of a compiler crash. This will be useful for frameworks and runtimes that use MLIR where it is needed to reproduce the pipeline behavior for reasons outside of diagnosing crashes. An example is for diagnosing performance issues using offline tools, where being able to dump the reproducer from a runtime compiler would be helpful.	2024-01-03 12:36:43 -05:00
Han-Chung Wang	76cb0bb7a4	[mlir][tensor] Add a pattern to simplify tensor.unpack to collpase shape (#76607 )	2024-01-03 09:34:52 -08:00
Hugo Melder	3f9f8efc54	ObjcRuntime.h: Add mips64, aarch64, and riscv64 to non-legacy dispatch (#76694 ) This PR updates the list of architectures for which libobjc2 has fast-path objc_msgSend implementations. Related to: https://github.com/gnustep/libobjc2/pull/261	2024-01-03 17:34:00 +00:00

1 2 3 4 5 ...

485163 Commits