intel/llvm - llvm - Gitea: Git with a cup of tea

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-22 15:41:35 +08:00

Author	SHA1	Message	Date
Schrodinger ZHU Yifan	6695976d16	Reland "[libc] build fix for sigsetjmp (#137047 )" (#137214 ) Reland `sigsetjmp` patches with build fixes. We wrap every target replying on the epilogue library into conditional checks. --------- Co-authored-by: Petr Hosek <phosek@google.com>	2025-04-29 09:28:42 -04:00
Steven Perron	cc0cf72539	[HLSL] Allow non `.hlsl` files as source files (#137378 ) Changes the driver to assume input file with an unknown extension are HLSL source files instead of object files. Fixes https://github.com/llvm/llvm-project/issues/137370	2025-04-29 09:28:29 -04:00
Feng Zou	f02c93d707	[X86] Remove LLD command in LIT test (#137794 ) The test introduced in #136660, may be failed as ld.lld command not found.	2025-04-29 21:27:36 +08:00
Michael Buch	4560ff8740	[lldb][test] Skip TestFrameFormatFunctionSuffix.test on Windows We skip/xfail the other FrameFormat tests on Windows already. The number of different buildbot configurations out there targetting Windows makes it hard to make this test portable. If we want to test this on Windows we should probably just make a dedicated test for it.	2025-04-29 14:25:31 +01:00
Fraser Cormack	837d5a740f	[libclc][NFC] Remove unary_builtin.inc (#137656 ) We had two ways of achieving the same thing. This commit removes unary_builtin.inc in favour of the approach combining gentype.inc with unary_def.inc. There is no change to the codegen for any target.	2025-04-29 14:17:17 +01:00
Fraser Cormack	1e31f4b5eb	[AMDGPU] Support the OpenCL generic addrspace feature by default (#137636 ) This feature should be supported on AMDGCN architectures with flat addressing.	2025-04-29 14:14:00 +01:00
Fraser Cormack	78d95cc544	[libclc] Move fract to the CLC library (#137785 ) The builtin was already vectorized so there's no difference to codegen for non-SPIR-V targets.	2025-04-29 13:58:13 +01:00
Ramkumar Ramachandra	faf87e1414	[LAA] Prefer set-contains over set-count (NFC) (#136749 ) Improve code by preferring {SmallSet,SmallPtrSet}::contains() over the count() function, when used in a boolean context.	2025-04-29 13:56:04 +01:00
Rohit Aggarwal	2466100128	[X86] Add new #134979 test cases for gather scalar (#137416 ) We are adding new test cases to see the transformation impact due to #134979. These are similar to previous commit #688c3ffb057a87b86c6c1e77040418adf511efbb. Handling struct with two member and different vector factor. --------- Co-authored-by: Rohit Aggarwal <Rohit.Aggarwal@amd.com>	2025-04-29 13:52:43 +01:00
Luke Lau	b0f2bfc7e4	[VPlan] Use correct non-FMF constructor in VPInstructionWithType createNaryOp (#137632 ) Currently if we try to create a VPInstructionWithType without a FMF via VPBuilder::createNaryOp we will use the constructor that asserts `assert(isFPMathOp() && "this op can't take fast-math flags");`. This fixes it by checking if FMFs have a value, similar to the other createNaryOp overloads. This is needed by #129508	2025-04-29 20:35:19 +08:00
Aaron Ballman	2f976956e5	[C] Diagnose declarations hidden in C++ (#137368 ) This introduces a new diagnostic, -Wc++-hidden-decl, which is grouped under -Wc++-compat, that diagnoses declarations which are valid in C but invalid in C++ due to the type being at the wrong scope. e.g., ``` struct S { struct T { int x; } t; }; struct T t; // Valid C, invalid C++ ``` This is implementing the other half of #21898	2025-04-29 07:58:00 -04:00
Aaron Ballman	5e4ec04063	Fix crash with -ast-dump=json (#137324 ) When given an invalid Objective-C extension, Clang would crash when trying to emit the mangled name of the method to the JSON dump output. Fixes #137320	2025-04-29 07:57:03 -04:00
Aaron Ballman	c5bf901b1b	Fix crash with -ast-dump=json (#137324 ) When given an invalid Objective-C extension, Clang would crash when trying to emit the mangled name of the method to the JSON dump output. Fixes #137320	2025-04-29 07:55:32 -04:00
Tobias Stadler	0b5daeb2e5	[GlobalISel] Fix miscompile when narrowing vector loads/stores to non-byte-sized types (#136739 ) LegalizerHelper::reduceLoadStoreWidth does not work for non-byte-sized types, because this would require (un)packing of bits across byte boundaries. Precommit tests: #134904	2025-04-29 12:36:34 +01:00
LLVM GN Syncbot	81870cbcc2	[gn build] Port `bd6addc032`	2025-04-29 11:20:24 +00:00
Florian Hahn	d68b446933	[IR] Add matchers for remaining FP min/max intrinsics (NFC). (#137612 ) Add dedicated matchers for minimum,maximum,minimumnum and maximumnum intrinsics, similar for the existing matchers for maxnum and minnum. As suggested in https://github.com/llvm/llvm-project/pull/137335. PR: https://github.com/llvm/llvm-project/pull/137612	2025-04-29 12:20:00 +01:00
Feng Zou	bd6addc032	[X86][APX] Suppress EGPR/NDD instructions for relocations (#136660 ) Suppress EGPR/NDD instructions for relocations to avoid APX relocation types emitted. This is to keep backward compatibility with old version of linkers without APX support. The use case is to try APX features with LLVM + old built-in linker on RHEL9 OS which is expected to be EOL in 2032. If there are APX relocation types, the old version of linkers would raise "unsupported relocation type" error. Example: ``` $ llvm-mc -filetype=obj -o got.o -triple=x86_64-unknown-linux got.s $ ld got.o -o got.exe ld: got.o: unsupported relocation type 0x2b ... $ cat got.s ... movq foo@GOTPCREL(%rip), %r16 $ llvm-objdump -dr got.o ... 1: d5 48 8b 05 00 00 00 00 movq (%rip), %r16 0000000000000005: R_X86_64_CODE_4_GOTPCRELX foo-0x4 ```	2025-04-29 19:12:59 +08:00
Aaron Ballman	df267d77f6	[C] Add new -Wimplicit-int-enum-cast to -Wc++-compat (#137658 ) This introduces a new diagnostic group to diagnose implicit casts from int to an enumeration type. In C, this is valid, but it is not compatible with C++. Additionally, this moves the "implicit conversion from enum type to different enum type" diagnostic from `-Wenum-conversion` to a new group `-Wimplicit-enum-enum-cast`, which is a more accurate home for it. `-Wimplicit-enum-enum-cast` is also under `-Wimplicit-int-enum-cast`, as it is the same incompatibility (the enumeration on the right-hand is promoted to `int`, so it's an int -> enum conversion). Fixes #37027	2025-04-29 07:06:08 -04:00
Michael Buch	ebeae6402d	Reland "[lldb][Format] Make function name frame-format variables work without debug-info" (#137757 ) This reverts commit `da7099290c`. Same as the original PR. The failing test-case was resolved in https://github.com/llvm/llvm-project/pull/137763	2025-04-29 11:49:21 +01:00
Paul Walker	50d3febf17	[NFC][InstCombine][AArch64] Refactor SVE intrinsics tests. Only test a single element type because the combines are not sensitive to them. Extend coverage to cover the majority of SVE merging intrinsics.	2025-04-29 10:44:08 +00:00
Christian Ulmann	89c5c3ba38	[MLIR][IR] Add assert to Operation::moveBefore (NFC) (#137772 ) This commit adds an assert to `Operation::moveBefore` which ensures that moving of operations without a parent block is detected early on. This case otherwise runs into a segfault, as it's assumed that there is an parent block.	2025-04-29 12:38:34 +02:00
Paul Walker	93321966d9	[NFC][InstCombine][AArch64] Add simplify tests for reversed SVE intrinsics. Add missing tests for fdivr, fsubr, sdivr, subr & udivr. Add test case to demonstrate incorrect poison propagation.	2025-04-29 10:33:47 +00:00
Jason Eckhardt	e33cf4b078	[TableGen][NFC] Refactor/deduplicate emitAction. (#137434 ) Currently there is a fair amount of code duplication in various parts of emitAction. This patch factors out commonality among CCAssignToReg, CCAssignToRegAndStack, CCAssignToRegWithShadow, and CCAssignToStack. `XXXGenCallingConv.inc` are identical before and after this patch for all targets.	2025-04-29 05:24:23 -05:00
Paul Walker	6edcb52f40	[NFC][InstCombine][AArch64] Remove SVE mul idempotency tests. They have been subsumed into the SVE binop simplification tests.	2025-04-29 10:03:04 +00:00
Victor Lomuller	082598a64d	[SPIRV] Add intrinsic for OpGenericCastToPtrExplicit (#137626 ) The patch adds an intrinsic to encode OpGenericCastToPtrExplicit and the associated lowering logic.	2025-04-29 11:58:25 +02:00
Fraser Cormack	4609b6a3e7	[libclc] Move fmin & fmax to CLC library (#134218 ) This is an alternative to #128506 which doesn't attempt to change the codegen for fmin and fmax on their way to the CLC library. The amdgcn and r600 custom definitions of fmin/fmax are now converted to custom definitions of __clc_fmin and __clc_fmax. For simplicity, the CLC library doesn't provide vector/scalar versions of these builtins. The OpenCL layer wraps those up to the vector/vector versions. The only codegen change is that non-standard vector/scalar overloads of fmin/fmax have been removed. We were currently (accidentally, presumably) providing overloads with mixed elment types such as fmin(double2, float), fmax(half4, double), etc. The only vector/scalar overloads in the OpenCL spec are those with scalars of the same element type as the vector in the first argument.	2025-04-29 10:51:24 +01:00
Ramkumar Ramachandra	f6b6fb89ec	[VectorUtils] Improve computeMinimumValueSizes (NFC) (#137692 ) The Worklist used by computeMinimumValueSizes is wastefully a Value-vector, when the top of the loop attempts to cast the popped value to an Instruction anyway. Improve the algorithm by cutting this wasteful work, refining the type of Worklist, Visited, and Roots from Value-vectors to Instruction-vectors.	2025-04-29 10:36:26 +01:00
Simon Pilgrim	8961f3ee75	[X86] shouldReduceLoadWidth - don't split loads if we can freely reuse full width legal binop (#129695 ) Currently shouldReduceLoadWidth is very relaxed about when loads can be split to avoid extractions from the original full width load - resulting in many cases where the number of memory operations notably increases, replacing the cost of a extract_subvector for additional loads. This patch adjusts the 256/512-bit vector load splitting metric to detect cases where ANY use of the full width load is be used directly - in which case we will now reuse that load for smaller types, unless we'd need to extract an upper subvector / integer element - i.e. we now correctly treat (extract_subvector cst, 0) as free. We retain the existing logic of never splitting loads if all uses are extract+stores but we improve this by peeking through bitcasts while looking for extract_subvector/store chains. This required a number of fixes - shouldReduceLoadWidth now needs to peek through bitcasts UP the use-chain to find final users (limited to hasOneUse cases to reduce complexity). It also exposed an issue in isTargetCanonicalConstantNode which assumed that a load of vector constant data would always extract, which is no longer the case.	2025-04-29 10:35:15 +01:00
Jack Styles	bb9fa77c84	[ARM][Driver] Fix i8mm feature string (#137771 ) #137595 changed the behaviour for SIMD on ARM to ensure it is enabled and disabled correctly depending on the options used by the user. In this, the functionality to disable all features that depend on SIMD was added. However, there was a spelling error for the i8mm feature, which caused the `+nosimd` command to fail. This fixes the error, and allows the command to work again.	2025-04-29 10:32:50 +01:00
Carlos Galvez	014ab736dc	[clang-tidy] Do not pass any file when listing checks in run_clang_ti… (#137286 ) …dy.py Currently, run_clang_tidy.py does not correctly display the list of checks picked up from the top-level .clang-tidy file. The reason for that is that we are passing an empty string as input file. However, that's not how we are supposed to use clang-tidy to list checks. Per `65eccb463d`, we simply should not pass any file at all - the internal code of clang-tidy will pass a "dummy" file if that's the case and get the .clang-tidy file from the current working directory. Fixes #136659 Co-authored-by: Carlos Gálvez <carlos.galvez@zenseact.com>	2025-04-29 11:13:30 +02:00
TatWai Chong	3d47bc9d25	[mlir][tosa] Add error if and level checks for COND_IF & WHILE_LOOP (#136194 ) Error if checks: verify whether the same length and type between input list, output list, and control-flow blocks. Level_checks: verify whether the nested depth exceeds MAX_NESTING.	2025-04-29 10:10:03 +01:00
Michael Buch	252b095b32	[lldb][test] Add more test-cases to MangledTest	2025-04-29 10:07:04 +01:00
Michael Buch	64b5bc876a	[lldb][Format] Add function.suffix frame-format variable (#137763 ) This patch adds another frame-format variable (currently only implemented in the CPlusPlus language plugin) that represents the "suffix" of a function. The name is derived from the `DotSuffix` node of LLVM's Itanium demangler. For a function name such as `int foo() (.cold)`, the suffix would be `(.cold)`.	2025-04-29 10:02:44 +01:00
Brad Smith	76aa471698	Revert "[MIPS] Add Scheduling model for MIPS i6400 and i6500 CPUs (#132704 )" (#137767 ) This reverts commit `ffcca5112b`.	2025-04-29 05:01:36 -04:00
Timm Baeder	4bf356bbd2	[clang][bytecode] Start implementing builtin_is_within_lifetime (#137765 )	2025-04-29 10:57:30 +02:00
Ramkumar Ramachandra	13b443f2ef	[LAA] Improve convergent tests (#136758 ) LoopAccessAnalysis has code for handling function calls where the function is marked with the 'convergent' attribute, but the test coverage is insufficient. Fix this by adding a test showing the case of no-runtime-checks adapted from LoopDistribute, and clean up the existing test with runtime-checks. Also regenerate the test file with UpdateTestChecks.	2025-04-29 09:50:33 +01:00
Ramkumar Ramachandra	49842426f3	[LV] Fix MinBWs in WidenIntrinsic case (#137005 ) There is a bug in the computation and handling of MinBWs in the case of VPWidenIntrinsicRecipe: a crash is observed in VPlanTransforms::truncateToMinimalBitwidths due to a mismatch between the number of recipes processed and the number of entries in MinBWs. Fix handling of calls in llvm::computeMinimumValueSizes, and handle VPWidenIntrinsicRecipe in truncateToMinimalBitwidths, fixing the bug. Fixes #87407.	2025-04-29 09:47:38 +01:00
Timm Baeder	980d027a3f	[clang][bytecode] Allow This pointers in CPCE mode (#137761 ) The outermost function doesn't have a This pointers, but inner calls can. This still doesn't match the diagnostic output of the current interpreter properly, but it's closer I think.	2025-04-29 10:14:15 +02:00
Vikram Hegde	86d8e8d9a6	[CodeGen][NewPM] Port "PrologEpilogInserter" to NPM (#130550 )	2025-04-29 13:13:45 +05:30
Matthias Springer	a9b6185718	[mlir][memref] Fix memory leaks in `print-memref.mlir` (#137590 ) This fixes the test when running with ASAN.	2025-04-29 09:42:59 +02:00
Mallikarjuna Gouda	ffcca5112b	[MIPS] Add Scheduling model for MIPS i6400 and i6500 CPUs (#132704 ) Add scheduling model for the MIPS i6400 and i6500, an in-order MIPS64R6 processor. i6400 and i6500 share same instruction latencies. CPU has following pipelines - Two ALUs - Multiply and Divide unit (MDU) - Branch Unit (CTU) - Load/Store Unit (LSU) - Short Floating-point Unit and - Long Floating-point Unit Latency information is available at: https://mips.com/wp-content/uploads/2025/03/MIPS_I6500-F_Data_Sheet_Rev1.10_3-17-2025.pdf	2025-04-29 03:36:27 -04:00
Kazu Hirata	ff66d34286	[Driver] Simplify string comparisons (NFC) (#137756 )	2025-04-29 00:30:59 -07:00
Pavel Labath	a9ece2dc68	[lldb] Skip some Host::GetProcessInfo tests for macos Some of the tests moved in #134354 do not currently pass on a mac. Disable them. The test for testing process priority also does not build on a mac. I'm not moving it (back) to the linux, since it's "almost" fine -- we'd just need to implement the RLIMIT_NICE logic differently. I'm not doing that now since (apart from linux) there are no systems which are able to retrieve the process priority.	2025-04-29 09:28:43 +02:00
Jack Styles	ace8ceab73	[ARM][Driver] Ensure NEON is enabled and disabled correctly (#137595 ) In #130623 support was added for `+nosimd` in the clang driver. Following this PR, it was discovered that, if NEON is disabled in the command line, it did not disable features that depend on this, such as Crypto or AES. To achieve this, This PR does the following: - Ensure that disabling NEON (e.g., via +nosimd) also disables dependent features like Crypto and AES. - Update the driver to automatically enable NEON when enabling features that require it (e.g., AES). This fixes inconsistent behavior where features relying on NEON could be enabled without NEON itself being active, or where disabling NEON left dependent features incorrectly enabled.	2025-04-29 08:28:10 +01:00
LLVM GN Syncbot	a618ae2c72	[gn build] Port `e43e8ec7af`	2025-04-29 06:46:28 +00:00
Nikolas Klauser	e43e8ec7af	[libc++] Remove dead implementation of is_nothrow_convertible and merge the remaining code into is_convertible.h (#137717 ) We can use the `__is_nothrow_convertible` builtin unconditionally now, which makes the implementation very simple, so there isn't much of a need to keep a separate header around.	2025-04-29 08:46:06 +02:00
Pavel Labath	15cd71afd2	[lldb] Make ValueObject::Dereference less aggressive (#137311 ) The function was always trying to dereference both the synthetic and non-synthetic view of the object. This is wrong as the caller should be able to determine which view of the object it wants to access, as is done e.g. for child member access. This patch removes the nonsynthetic->synthetic fallback, which is the more surprising path, and fixes the callers to try both versions of the object (when appropriate). I also snuck in simplification of the member access code path because it was possible to use the same helper function for that, and I wanted to be sure I understand the logic correctly. I've left the synthetic->nonsynthetic fallback in place. I think we may want to keep that one as we often have synthetic child providers for pointer types. They usually don't provide an explicit dereference operation but I think users would expect that a dereference operation on those objects would work. What we may want to do is to try the synthetic operation first in this case, so that the nonsynthetic case is really a fallback. --------- Co-authored-by: Ilia Kuklin <kuklin.iy@mail.ru>	2025-04-29 08:27:18 +02:00
Pavel Labath	679cc0a1ed	[lldb] Remove Function null checks in Block.cpp (#137611 ) As of #117683, Blocks are always enclosed in a function, so these checks never fail. We can't change the signature of `CalculateSymbolContextFunction` as it's an abstract function (and some of its implementations can return nullptr), but we can create a different function that we can call when we know we're dealing with a block.	2025-04-29 08:20:08 +02:00
Pavel Labath	ebaeecc429	[lldb] Use early returns in TypeSystemClang::IsPossibleDynamicType (#137630 ) Remove a couple of levels of indentation before I need to add another one to fix a bug.	2025-04-29 08:17:53 +02:00
Michael Buch	da7099290c	Revert "[lldb][Format] Make function name frame-format variables work without debug-info" (#137757 ) Reverts llvm/llvm-project#137408 This change broke `lldb/test/Shell/Unwind/split-machine-functions.test`. The test binary has a symbol named `_Z3foov.cold` and the test expects the backtrace to print the name of the cold part of the function like this: ``` # SPLIT: frame #1: {{.*}}`foo() (.cold) + ``` but now it gets ``` frame #1: 0x000055555555514f split-machine-functions.test.tmp`foo() + 12 ```	2025-04-29 07:13:04 +01:00

1 2 3 4 5 ...

535737 Commits