Commit Graph

555565 Commits

Author SHA1 Message Date
Craig Topper
c265d7aebf [SelectionDAG] Add SDTCisInt<1> to SDTVecReduce and SDTCisEltOfVec to SDTFPVecReduce. NFC (#162761) 2025-10-10 09:02:08 -07:00
Craig Topper
672672bed3 [SelectionDAG] Add SDTCisSameNumEltsAs to more operations. NFC (#162759) 2025-10-10 09:00:36 -07:00
Amr Hesham
c25a2c7284 [CIR] Upstream Exception with empty try block (#162737)
Upstream, the basic support for the C++ try catch statement with an
empty try block

Issue https://github.com/llvm/llvm-project/issues/154992
2025-10-10 17:57:47 +02:00
Akash Dutta
04d3965125 [AMDGPU]: Packed instructions unpacking to co-issue post-RA scheduling - fix flag handling (#160195)
This is a follow up to https://github.com/llvm/llvm-project/pull/157968.
This fixes flag handling in inserted instructions.
2025-10-10 08:56:38 -07:00
Shilei Tian
24a5d8a9ca Revert "[LifetimeSafety] Reorganize code into modular components (#162474)"
This reverts commit 2eb8c47b88 since it completely breaks dylib build.
2025-10-10 11:51:41 -04:00
Chinmay Deshpande
de67a78c30 [AMDGPU][GlobalISel] Add RegBankLegalize support for G_ASSERT_{S|Z}EXT ops (#162728) 2025-10-10 08:48:06 -07:00
Jordan Rupprecht
5391c6849f [AMDGPU][NFC] Fix clang frontend<->sema layering issue (#162865)
#140210 added `#include "clang/Frontend/FrontendDiagnostic.h"` to
clang/lib/Sema/SemaAMDGPU.cpp, but Frontend itself has a dependency on
Sema. This creates a layering issue as described in
https://llvm.org/docs/CodingStandards.html#library-layering.

Fortunately, d076608d58 made this easy to
fix, as clang/Frontend/FrontendDiagnostic.h just forwards to
clang/Basic/DiagnosticFrontend.h, so it's trivial to make this depend on
basic instead of frontend.
2025-10-10 15:44:51 +00:00
Jordan Rupprecht
85dd8f4bff [bazel][uArch][XeGPU] Port #153706: add uArch package (#162872) 2025-10-10 15:44:10 +00:00
Aaron Ballman
f6caec67fa Speculatively fix build bots
https://github.com/llvm/llvm-project/pull/162843 landed a test that is
broken on darwin:

https://lab.llvm.org/buildbot/#/builders/190/builds/28819

This is an attempted quick fix.
2025-10-10 11:42:16 -04:00
Yi-Chi Lee
a9c8e94b43 [DAGCombiner] Extend FP-to-Int cast without requiring nsz (#161093)
This patch updates the FP-to-Int conversion handling:
- For signed integers: use `ftrunc` followed by clamping to the target
integer range.
- For unsigned integers: apply `fabs` + `ftrunc`, then clamp.

This removes the previous dependence on `nsz` and ensures correct
lowering for both signed and unsigned cases.

I've tested the code generation of -mtriple=amdgcn. It seems that the
assembly code is expected, but I'm not sure how to write a general
testcase for every target.

Fixes #160623.
2025-10-11 00:34:33 +09:00
Martin Storsjö
93d3260389 [lldb] [cmake] Fix delayloading liblldb.dll in mingw builds (#162831)
ec28b95b74 made liblldb delayloaded, but
the supplied command line parameter is only valid for MSVC style builds.

For mingw builds using LLD, we can easily pass a similar option. For
mingw builds with ld.bfd, we can't quite as easily delayload it - for
these cases, just keep linking it like we usually do, and warn if the
user tried to set LLDB_PYTHON_DLL_RELATIVE_PATH in a build where it
won't have any effect.

Also change the setting for MSVC style builds, to use the simpler
`$<TARGET_FILE_NAME:liblldb>` instead of
`$<TARGET_FILE_BASE_NAME:liblldb>.dll`. The former pattern is what we
use for mingw targets, and it makes the code clearer to use that for
both, as that same expression should do the right thing for both.
2025-10-10 18:18:14 +03:00
Martin Storsjö
13784f7a4d [cmake] Unconditionally use -Wno-pass-failed with Clang (#162835)
Since 4feae05c6a, most of the handling of
warning options was rewritten to add such options based on hardcoded
knowledge about what compilers support which options, and since which
versions. This avoids a number of configure time checks, speeding up the
cmake configuration.

This avoids erroneously adding this option with GCC, which doesn't
really support it.

If testing for a warning option like -Wno-<foo> with GCC, GCC won't
print any diagnostic at all, leading to the options being accepted
incorrectly. However later, if compiling a file that actually prints
another warning, GCC will also print warnings about these -Wno-<foo>
options being unrecognized.

This avoids extra warning spam like this, for every source file that
does produce warnings with GCC:

    At global scope:
    cc1plus: note: unrecognized command-line option ‘-Wno-pass-failed’ may have been intended to silence earlier diagnostics
2025-10-10 18:17:31 +03:00
Martin Storsjö
31d260211a [lldb] Add a missing <atomic> include. NFC. (#162809)
This fixes building LLDB for mingw with libstdc++, after
8ae30a3fac.

Previously, building errored out with errors like these:

    In file included from llvm-project/lldb/include/lldb/Protocol/MCP/Transport.h:12,
                     from llvm-project/lldb/include/lldb/Protocol/MCP/Server.h:16,
                     from llvm-project/lldb/source/Plugins/Protocol/MCP/ProtocolServerMCP.h:15,
                     from llvm-project/lldb/source/Plugins/Protocol/MCP/ProtocolServerMCP.cpp:9:
    llvm-project/lldb/include/lldb/Host/JSONTransport.h:608:23: error: field ‘replied’ has incomplete type ‘std::atomic<bool>’
      608 |     std::atomic<bool> replied = {false};
          |                       ^~~~~~~
    In file included from gcc-mingw/x86_64-w64-mingw32/include/c++/12.1.0/bits/shared_ptr_atomic.h:33,
                     from gcc-mingw/x86_64-w64-mingw32/include/c++/12.1.0/memory:78,
                     from llvm-project/lldb/include/lldb/Host/Socket.h:12,
                     from llvm-project/lldb/include/lldb/Core/ProtocolServer.h:13,
                     from llvm-project/lldb/source/Plugins/Protocol/MCP/ProtocolServerMCP.h:12:
    gcc-mingw/x86_64-w64-mingw32/include/c++/12.1.0/bits/atomic_base.h:162:12: note: declaration of ‘struct std::atomic<bool>’
      162 |     struct atomic;
          |            ^~~~~~
2025-10-10 18:12:57 +03:00
David Spickett
fca40b3b6f [lldb][docs] State that the extension packets doc should be alphabetised
I did this when I converted it but never stated so. Adding a comment
stating so. We'll probably forget at some point but might as well.

Also moved the x packet to the last place, after wasm which was
added to the end recently.
2025-10-10 15:03:54 +00:00
Connector Switch
21b2bdd91c [libcxx][NFC] Added libunwind to LLVM_ENABLE_RUNTIMES for Windows… (#162682)
… builds using MinGW.

I tried running the original command, but the error told me I had to
include libunwind.
2025-10-10 22:58:41 +08:00
Hongyu Chen
3340b245af [DFAJumpThreading] Precompute value => successor mapping (#162824)
Address some previous comments. Note that the value => successor mapping
is invalid during DFA transformation unless we update it correctly.
2025-10-10 22:57:42 +08:00
Matt Arsenault
424fa83335 CodeGen: Remove unused IntrinsicLowering includes (#162844) 2025-10-10 14:34:16 +00:00
Aaron Ballman
2992d3dfef [C2y] Claim support for WG14 N3623 (#162843)
This requires invalid signatures of main to be diagnosed while still
allowing for common extensions. Clang has always supported this.
2025-10-10 10:28:10 -04:00
Md Abdullah Shahneous Bari
83d3a2efe4 [uArch][XeGPU] Add XeGPU uArch definition. (#153706)
The uArch infrastructure provides:
- A set data structures to represent, uArch and it's necessary
components (e.g., instructions, register-files, caches).
- A set of utility interfaces that are common to a family of ops (e.g.,
mma ops, 2DBlockIO ops). The implementation of these interfaces are
provided by the specific instructions. Each family of ops provides these
5 common APIs. However, some family of ops may have more utility APIs.
The common 5 APIs are:
	- getSupportedShapes
	- getSupportedTypes
	- checkSupportedShapesAndTypes
	- checkSupportedTypes
	- validate

Add support for PVC and BMG architectures.
Add support for DPAS instruction.
2025-10-10 09:21:27 -05:00
Tom Tromey
311d1138e7 Do not emit right shift by 0 in DWARF expressions (#162738)
DW_OP_LLVM_extract_bits_zext and DW_OP_LLVM_extract_bits_sext can end up
emitting "DW_OP_constu 0; DW_OP_shr" when given certain arguments.
However, a shift by zero is not useful, and so it can be omitted.
2025-10-10 15:15:17 +01:00
Matt Arsenault
9922d8dd79 AMDGPU: Stop using aligned VGPR classes for addRegisterClass (#158278)
This is unnecessary. At use emission time, InstrEmitter will
use the common subclass of the value type's register class and
the use instruction register classes. This removes one of the
obstacles to treating special case instructions that do not have
the alignment requirement overly conservatively.
2025-10-10 22:59:35 +09:00
Michael Buch
e18461a14e [lldb][test] Skip TestCppFloatingTypesSpecialization.py on older compilers
Fails with:
```
08:19:28  /Users/ec2-user/jenkins/workspace/llvm.org/lldb-cmake-matrix/clang_1501_build/bin/clang++  -std=c++11 -g -O0 -isysroot "/Applications/Xcode-beta.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX14.2.sdk" -arch arm64  -I/Users/ec2-user/jenkins/workspace/llvm.org/lldb-cmake-matrix/llvm-project/lldb/packages/Python/lldbsuite/test/make/../../../../..//include -I/Users/ec2-user/jenkins/workspace/llvm.org/lldb-cmake-matrix/lldb-build/tools/lldb/include -I/Users/ec2-user/jenkins/workspace/llvm.org/lldb-cmake-matrix/llvm-project/lldb/test/API/lang/cpp/floating-types-specialization -I/Users/ec2-user/jenkins/workspace/llvm.org/lldb-cmake-matrix/llvm-project/lldb/packages/Python/lldbsuite/test/make -include /Users/ec2-user/jenkins/workspace/llvm.org/lldb-cmake-matrix/llvm-project/lldb/packages/Python/lldbsuite/test/make/test_common.h  -fno-limit-debug-info    -nostdlib++ -nostdinc++ -cxx-isystem /Users/ec2-user/jenkins/workspace/llvm.org/lldb-cmake-matrix/clang_1501_build/include/c++/v1/  --driver-mode=g++ -MT main.o -MD -MP -MF main.d -c -o main.o /Users/ec2-user/jenkins/workspace/llvm.org/lldb-cmake-matrix/llvm-project/lldb/test/API/lang/cpp/floating-types-specialization/main.cpp
08:19:28  /Users/ec2-user/jenkins/workspace/llvm.org/lldb-cmake-matrix/llvm-project/lldb/test/API/lang/cpp/floating-types-specialization/main.cpp:3:24: error: __bf16 is not supported on this target
08:19:28  template <> struct Foo<__bf16> {};
08:19:28                         ^
08:19:28  /Users/ec2-user/jenkins/workspace/llvm.org/lldb-cmake-matrix/llvm-project/lldb/test/API/lang/cpp/floating-types-specialization/main.cpp:5:40: error: __bf16 is not supported on this target
08:19:28  template <> struct Foo<_Float16> : Foo<__bf16> {};
08:19:28                                         ^
08:19:28  /Users/ec2-user/jenkins/workspace/llvm.org/lldb-cmake-matrix/llvm-project/lldb/test/API/lang/cpp/floating-types-specialization/main.cpp:8:7: error: __bf16 is not supported on this target
08:19:28    Foo<__bf16> f0;
08:19:28        ^
08:19:28  3 errors generated.
08:19:28  make: *** [main.o] Error 1
```

On our Clang-15.0 CI.
2025-10-10 14:54:24 +01:00
aokblast
394e7ded8b [LLDB, x86, FreeBSD] Fix Architecture parsing by reading the ELF header. (#162811)
Currently, LLDB in FreeBSD host sets the Process Architecture used by
lldbserver as Default one. Which cause problem when trying to debug a
32bit binary on amd64 platform since the lldb itself will found mismatch
architecture with lldbserver's return.

Notice that this patch is only a partial fix for the debugging problem.
We are still unable to debug x86 on x86_64 so that we don't provide
testcase in this patch.

See: https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=289945
2025-10-10 14:51:20 +01:00
Ryotaro Kasuga
1e84cb5127 [DA] Address followup comments on #128782 (#162645)
This is a follow-up PR for post-commit comments in #128782 .

- Fix variable name to camel case.
- Change the output format to make it easier to handle with FileCheck.
- Regenerate assertions of regression tests.
2025-10-10 22:19:19 +09:00
Simon Pilgrim
027b647896 [X86] Add additional test coverage for #160111 (#162838) 2025-10-10 13:15:47 +00:00
flovent
1d3ad74667 [clang-tidy] Fix insertion location for certain function pointers in cppcoreguidelines-init-variables (#162218)
This patch starts to find terminator from `VarDecl`'s end location
rather than it's `getLocation()` to ignore terminator(`,`) in function
paramaters list.

Kind of follow up to #112091 

Closes #161978 .
2025-10-10 21:13:49 +08:00
Nico Weber
a833d902ae [gn build] Port 2eb8c47b88 2025-10-10 09:10:46 -04:00
Nikita Popov
f04ae1f4b1 [InstCombine] Perform common cast transforms for ptrtoaddr 2025-10-10 15:08:21 +02:00
Arseniy Zaostrovnykh
e297184ba1 [clang][analyzer] Remove boolean per-entry-point metrics (#162817)
The complexity of maintaining an extra kind of metrics have not
justified itself, as it is not used upstream, and we have only a single
use of boolean stats per entrypoint downstream.

As I will do downstream, you can use an unsigned statistic type with
values 0 and 1 to model a boolean flag.

--

CPP-7097
2025-10-10 14:53:16 +02:00
Syadus Sefat
167c00e4a4 [AMDGPU][GlobalISel] Add register bank legalization for G_SMIN/G_SMAX/G_UMIN/G_UMAX (#159821)
[AMDGPU][GlobalISel] Add register bank legalization for
G_SMIN/G_SMAX/G_UMIN/G_UMAX

This patch adds register bank legalization support for min/max
operations in the AMDGPU GlobalISel pipeline.

- Add support S16, S32, and V2S16 types
- For V2S16 uniform operations implements UnpackMinMax lowering
2025-10-10 14:34:15 +02:00
Rahul Joshi
97031a6e09 [NFC][LLVM] Code cleanup in MisExpect.cpp (#162695)
- Make instruction arguments to the `misexpect` functions const and drop
const for `ArrayRef` arguments.
- Remove namespace llvm/misexpect surrounding all the code in .cpp file
and instead use `using namespace`.
- Change static helper functions to also use const references or
pointers when possible.
2025-10-10 05:24:42 -07:00
Rahul Joshi
ea5648b1a8 [NFC][LLVM] Code cleanup in LowerInvoke.cpp (#162691)
Fix indentation for `LowerInvokeLegacyPass` and use namespace qualifiers
for defining variables and functions in `llvm` namespace.
2025-10-10 05:24:11 -07:00
Rahul Joshi
bea786d128 [NFC][LLVM] Use namespace qualifier to define DenseMapInfo specializations (#162683)
Use `llvm::DenseMapInfo` to define `DenseMapInfo` specializations
instead of surrounding it with `namespace llvm {}`.
2025-10-10 05:23:44 -07:00
Aaron Ballman
0a21b06e0b Remove WG14 N3544
This paper had only non-normative changes about the classification list
of storage class specifiers. There is nothing here to test for
conformance.
2025-10-10 08:18:54 -04:00
Ryan Cowan
b298421e0c [AArch64][GlobalISel] Add G_FPEXT(G_FCONSTANT) folding (#162829) 2025-10-10 12:17:06 +00:00
Aaron Ballman
3fdef23873 Remove WG14 N3563
This paper had only non-normative changes and was already covered by
existing nullptr_t test coverage.
2025-10-10 08:16:00 -04:00
Valery Pykhtin
5eef98ba31 [AMDGPU] Use correct SlotIndex to calculate live-out register set. (#161720)
Using SlotIndexes::getMBBEndIdx() isn't the correct choice here because
it returns the starting index of the next basic block, causing live-ins
of the next block to be calculated instead of the intended live-outs of
the current block.

Add SlotIndexes::getMBBLastIdx() method to return the last valid
SlotIndex within a basic block, enabling correct live-out calculations.
2025-10-10 14:03:27 +02:00
Utkarsh Saxena
2eb8c47b88 [LifetimeSafety] Reorganize code into modular components (#162474)
Restructure the C++ Lifetime Safety Analysis into modular components
with clear separation of concerns.

This PR reorganizes the C++ Lifetime Safety Analysis code by:

1. Breaking up the monolithic `LifetimeSafety.cpp` (1500+ lines) into
multiple smaller, focused files
2. Creating a dedicated `LifetimeSafety` directory with a clean
component structure
3. Introducing header files for each component with proper documentation
4. Moving existing code into the appropriate component files:
    - `Checker.h/cpp`: Core lifetime checking logic
    - `Dataflow.h`: Generic dataflow analysis framework
    - `Facts.h`: Lifetime-relevant events and fact management
    - `FactsGenerator.h/cpp`: AST traversal for fact generation
- `LiveOrigins.h/cpp`: Backward dataflow analysis for origin liveness
- `LoanPropagation.h/cpp`: Forward dataflow analysis for loan tracking
    - `Loans.h`: Loan and access path definitions
    - `Origins.h`: Origin management
    - `Reporter.h`: Interface for reporting lifetime violations
    - `Utils.h`: Common utilities for the analysis

The code functionality remains the same, but is now better organized
with clearer interfaces between components.
2025-10-10 13:43:39 +02:00
Paul Walker
1c5bba1e47 [LLVM][CodeGen][SVE] ASRD cannot represent sdiv-by-one. (#162708)
We lower signed divides by a power-of-two to ASRD. However, ASRD's
immediate is log2(shift_amount) in the range 1 to elt-bitwidth, which
means it cannot represent sdiv-by-one.

Fixes https://github.com/llvm/llvm-project/issues/162616
2025-10-10 12:37:43 +01:00
Aaron Ballman
80f48b8493 [C2y] Claim support for WG14 N3532; NFC (#162718)
This paper was ensuring that the left operand of . and -> must be a
complete object type. This has always been required in every version of
Clang.
2025-10-10 07:34:16 -04:00
Pierre van Houtryve
e26058cefc [AMDGPU] Use reverse iteration in CodeGenPrepare (#145484)
In order to make this easier, I also removed all "removeFromParent"
calls from the visitors, instead adding instructions
to a set of instructions to delete once the function has been visited.
This avoids crashes due to functions deleting their operands. In theory
we could allow functions to delete the
instruction they visited (and only that one) but I think having one
idiom for everything is less error-prone.

Fixes #140219
2025-10-10 13:00:40 +02:00
Vladimir Radosavljevic
be7f85168d [CGP] Fix missing sign extension for base offset in optimizeMemoryInst (#161377)
If we have integers larger than 64-bit we need to explicitly sign extend
them, otherwise we will get wrong zero extended values.
2025-10-10 10:52:52 +00:00
Paul Walker
50cda17e32 [Clang][AArch64] Relax SVE2 requirement for builtins available via +sme. (#162457)
Having both SVE and SME environments essentially means the majority of
SVE2 instructions are always available. This PR reflects this at the
builtins level whereby +sve+sme is enough to enable this subset.

It's a similar story for +sve2p1 whereby the feature means a subset of
SME2 instructions are always available.
2025-10-10 11:28:04 +01:00
JaydeepChauhan14
86ab51935b [X86][NFC] Moved/Updated FNEG testcases (#162269)
- Moved existing testcases from
```llvm/test/CodeGen/X86/fast-isel-fneg.ll``` to
```llvm/test/CodeGen/X86/isel-fneg.ll```.
- Added **x86_fp80** testcase and **GISEL** RUN lines as precommit
testcases for GISEL.
- Two i686 testcases are disabled because it is crashing in later passes
only for GISEL. It will resolve after adding **FNEG GISEL
implementation**, I will add that in next PR.
2025-10-10 10:14:10 +00:00
Paul Walker
b52efa5f05 [LLVM][CodeGen][SVE] Add lowering for 3-way VECTOR_(DE)INTERLEAVE operations. (#162502)
SVE has no in-register instructions to do this, but we can perform the
operation through memory by using ld3/st3.
2025-10-10 11:12:50 +01:00
David Spickett
6f1ce2b25b [llvm][ADT] Fix uint64_t array BitSet construction on 32-bit systems (#162814)
When the underlying storage element is 32-bit, we may only need half of
the last value in the uint64_t array. I've adjusted the constructor to
account for that.

For example, if you construct a 65 bit bitset you will need both 32-bit
halves of the first array value but only the bottom half of the second
value. The storage will only have 3 elements, so attempting to assign
the top 32-bits into the storage will fail.

This happened on our buildbot:
https://lab.llvm.org/buildbot/#/builders/154/builds/22555
(though the traceback is not useful)

Then added tests for < 32 bit sizes, and assertions for the number of
elements we decide to use. Given that the only member of BitSet is a
std::array, I think the size will be consistent across systems.

Tested on 32 and 64-bit Arm machines.

Follow up to #162703.
2025-10-10 10:08:50 +00:00
Nikolas Klauser
8ad5a21790 [libc++] Fix %{temp} replacements when running benchmarks 2025-10-10 12:08:26 +02:00
Rana Pratap Reddy
f2ffb4d20b [AMDGPU] Support for type inferring image load/store builtins for AMDGPU (#140210)
Introduces the builtins for amdgcn_image_load/store/sample.
2025-10-10 15:26:08 +05:30
Ryan Cowan
fab7bd48fc Revert "[AArch64][GlobalISel] Add G_FPEXT(G_FCONSTANT) folding" (#162805)
Reverts llvm/llvm-project#160902 as the tests need updating to prevent
breaking the build. More specifically, the `AMDGPU` tests.
2025-10-10 10:46:30 +01:00
Pragyansh Chaturvedi
97cf55b2ce [GlobalISel] add KnownBits tracking for G_UMULH and G_SMULH (#158445)
The code is taken from `SelectionDAG::computeKnownBits`.
This ticks off `MULHU` and `MULHS` from
https://github.com/llvm/llvm-project/issues/150515
2025-10-10 18:42:14 +09:00