intel/llvm - llvm - Gitea: Git with a cup of tea

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-02-02 10:08:59 +08:00

Author	SHA1	Message	Date
Aiden Grossman	ba3bce0779	[Github] Switch back to tj-actions/changed-files (#158335 ) We were using the step security fork after the tj-actions/changed-files supply chain attack given Github disabled the repo and all our actions were failing during that time. Switch away from the fork back to the main repository to avoid an extra level of indirection until we can probably just stop using this action/roll our own.	2025-09-12 15:25:28 -07:00
Ryosuke Niwa	8ee31ab00b	[WebKit checkers] Treat function pointers with "Singleton" suffix as singleton. (#158012 )	2025-09-12 15:08:23 -07:00
Mircea Trofin	0d4a615998	[InstCombine] Make test resilient to metadata presence (#157607 ) Modernized it to using `update_test_checks` which addresses an ambgiuty in the previous test formulation, where a profile metadaat of value `i32 1` would have (incorrectly matched.	2025-09-12 15:07:25 -07:00
Mircea Trofin	8f25ea2d73	[NFC] Leave a comment in `Local.cpp` about debug info & sample profiling (#155296 ) Issue #152767	2025-09-12 15:05:16 -07:00
Mircea Trofin	9e33997242	[IR] Add `MD_prof` to the `Keep` list of `dropUBImplyingAttrsAndMetadata` (#154635 ) `MD_prof` is safe to keep when e.g. hoisting instructions. Issue #147390	2025-09-12 15:04:38 -07:00
lntue	f019e2368b	[libc] Change __builtin_memcpy to inline_memcpy. (#158345 )	2025-09-12 17:57:08 -04:00
Nishant Patel	8e17f80908	[MLIR][XeGPU] Distribute vector.step & vector.shape_cast op from wg to sg (#155443 ) This PR adds patterns to distribute vector.step and vector.shape_cast op from wg to sg and it also enables constant, broadcast and elementwise ops to handle the slice attribute	2025-09-12 14:33:52 -07:00
Aiden Grossman	32620c58ac	[lit] Add missing split-file dependency There was a recent patch that added in some tests to the lit test suite that use split-file. An explicit dependency in CMake was not added, which led to check-lit not working if being run without doing a full build first. This patch explicitly adds the dependency inside the CMake file to fix this configuration.	2025-09-12 21:27:13 +00:00
Aiden Grossman	7f2e9b1709	Revert "[libc++] Mark __{emplace,push}_back_slow_path as noinline (#94379 )" This reverts commit `1bafd020c7`. This breaks the LLDB data formatters which means these failures show up on every premerge run. Reverting for now until fixing the LLDB formatters can be coordinated with a relanding.	2025-09-12 21:24:15 +00:00
Alan Li	b87f1b22a8	[MLIR] Add `InParallelOpInterface` for parallel combining operations (#157736 ) This commit: - Introduces a new `InParallelOpInterface`, along with the `ParallelCombiningOpInterface`, represent the parallel updating operations we have in a parallel loop of `scf.forall`. - Change the name of `ParallelCombiningOpInterface` to `InParallelOpInterface` as the naming was quite confusing. - `ParallelCombiningOpInterface` now is used to generalize operations that insert into shared tensors within parallel combining regions. Previously, only `tensor.parallel_insert_slice` was supported directly in `scf.InParallelOp` regions. - `tensor.parallel_insert_slice` now implements `ParallelCombiningOpInterface`. This change enables future extensions to support additional parallel combining operations beyond `tensor.parallel_insert_slice`, which have different update semantics, so the `in_parallel` region can correctly and safely represent these kinds of operation without potential mistakes such as races. Author credits: @qedawkins	2025-09-12 14:23:00 -07:00
Keith Smiley	f645d209d4	[bazel] Add rules_shell for sh_binary rule (#158365 ) This is required for the upcoming bazel 9.x release where this rule is no longer automatically available.	2025-09-12 14:07:44 -07:00
Jeff Niu	86bcd1c2b2	[mlir][Intrange] Fix materializing ShapedType constant values (#158359 ) When materializing integer ranges of splat tensors or vector as constants, they should use DenseElementsAttr of the shaped type, not IntegerAttrs of the element types, since this can violate the invariants of tensor/vector ops. Co-authored-by: Jeff Niu <jeffniu@openai.com>	2025-09-12 13:53:32 -07:00
jtstogel	b5516dad6e	[PGO][test] Ensure test input is writeable after copying. (#158356 ) This test errors when trying to append to the `%t` file when run in an environment where the source tree is mounted read-only, since `cp` preserves the read-only file permission.	2025-09-12 15:49:39 -05:00
Peter Collingbourne	e1efb51080	[gn build] Port `f3efbce4a7`	2025-09-12 13:42:00 -07:00
Peter Collingbourne	d161d37dd3	[gn build] Port `8c0f3b6e8f`	2025-09-12 13:42:00 -07:00
Peter Collingbourne	01d85e73d9	[gn build] Port `220d705d21`	2025-09-12 13:42:00 -07:00
Aiden Grossman	9566388cbd	[Github] Delete dependabot config (#158337 ) Dependabot cannot configure the branch prefix, which means it fails everytime it tries to run because we only allow user/ branches. This is in preparation for using Renovate which supports custom branch prefixes and has other advantages, like the ability to run/get setup without any assisstance from a repository admin unlike dependabot. This makes it significantly more hackable for the rest of the community.	2025-09-12 13:35:15 -07:00
David Blaikie	aabf18d718	Revert "[DebugLine] Correct debug line emittion" (#158343 ) Reverts llvm/llvm-project#157529 Sorry, I missed that the missed that the LLVM test was using clang - layering dictates thats not OK. Please readjust the test case to work like the existing test coverage (or perhaps the existing test coverage is sufficient?) and post a new PR.	2025-09-12 13:04:53 -07:00
Keith Smiley	1756b6e59c	[bazel] Fix buildifier in tblgen.bzl (#158351 )	2025-09-12 12:46:24 -07:00
choikwa	ef7de8d144	[AMDGPU] Remove scope check in SIInsertWaitcnts::generateWaitcntInstBefore (#157821 ) This change was motivated by CK where many VMCNT(0)'s were generated due to instructions lacking !alias.scope metadata. The two causes of this were: 1) LowerLDSModule not tacking on scope metadata on a single LDS variable 2) IPSCCP pass before inliner replacing noalias ptr derivative with a global value, which made inliner unable to track it back to the noalias ptr argument. However, it turns out that IPSCCP losing the scope information was largely ineffectual as ScopedNoAliasAA was able to handle asymmetric condition, where one MemLoc was missing scope, and still return NoAlias result. AMDGPU however was checking for existence of scope in SIInsertWaitcnts and conservatively treating it as aliasing all and inserted VMCNT(0) before DS_READs, forcing it to wait for all previous LDS DMA instructions. Since we know that ScopedNoAliasAA can handle asymmetry, we should also allow AA query to determine if two MIs may alias. Passed PSDB. Previous attempt to address the issue in IPSCCP, likely stalled: https://github.com/llvm/llvm-project/pull/154522 This solution may be preferrable over that as issue only affects AMDGPU.	2025-09-12 14:51:36 -04:00
Andrew Gontarek	4826039058	[LLDB][NVIDIA] Add NVPTX architecture support (#158334 ) - Introduced a new method `IsNVPTX()` in `ArchSpec` to check for NVPTX architecture. - Implemented the corresponding method in `ArchSpec.cpp` to utilize the existing triple architecture checks.	2025-09-12 14:46:51 -04:00
Elvin Wang	6af94c566e	[IntrinsicEmitter] Make AttributesMap bits adaptive (#157965 ) Make IntrinsicsToAttributesMap's func. and arg. fields be able to have adaptive sizes based on input other than hardcoded 8bits/8bits. This will ease the pressure for adding new intrinsics in private downstreams. func. attr bitsize will become 7(127/128) vs 8(255/256)	2025-09-12 20:42:08 +02:00
Maksim Levental	1a6b2b64b6	[MLIR] enable Standalone example test for Windows (#158183 ) This PR turns on all Standalone tests for Windows except for the plugins (which aren't enabled by default).	2025-09-12 11:34:44 -07:00
joaosaffran	5fd3aad54c	[DirectX] Updating Root Signature YAML representation to use Enums instead of uint (#154827 ) This PR is updating Root Signature YAML to use enums, this is a required change to remove the use of to_underlying from DirectXContainer binary file. Closes: [#150676](https://github.com/llvm/llvm-project/issues/150676)	2025-09-12 14:31:27 -04:00
Antonio Frighetto	370607065d	[llvm] Regenerate test checks including TBAA semantics (NFC) Tests exercizing TBAA metadata (both purposefully and not), and previously generated via UTC, have been regenerated and updated to version 6.	2025-09-12 20:01:17 +02:00
Han-Chung Wang	8eba28bc8c	[mlir][NFC] Correct pattern names to match the behaviors. (#158177 ) It is a follow-up for https://github.com/llvm/llvm-project/pull/131982#discussion_r2286014576 and https://github.com/llvm/llvm-project/pull/126898#discussion_r2286013250. The names do not match the behaviors, and the revision updates the names. Signed-off-by: hanhanW <hanhan0912@gmail.com>	2025-09-12 10:57:20 -07:00
Aiden Grossman	330068a74b	Revert "[lit] Implement ulimit builtin" This reverts commit `615d07ea55`. This was causing some MacOS buildbolt failures.	2025-09-12 17:53:17 +00:00
Peter Rong	84f431c35b	[DebugLine] Correct debug line emittion (#157529 ) ### Context #99710 introduced `.loc_label` so we can terminate a line sequence. However, it did not advance PC properly. This is problematic for 1-instruction functions as it will have zero-length sequence. The test checked in that PR shows the problem: ``` # CHECK-LINE-TABLE: Address Line Column File ISA Discriminator OpIndex Flags # CHECK-LINE-TABLE-NEXT: ------------------ ------ ------ ------ --- ------------- ------- ------------- # CHECK-LINE-TABLE-NEXT: 0x00000028: 05 DW_LNS_set_column (1) # CHECK-LINE-TABLE-NEXT: 0x0000002a: 00 DW_LNE_set_address (0x0000000000000000) # CHECK-LINE-TABLE-NEXT: 0x00000035: 01 DW_LNS_copy # CHECK-LINE-TABLE-NEXT: 0x0000000000000000 1 1 1 0 0 0 is_stmt # CHECK-LINE-TABLE-NEXT: 0x00000036: 00 DW_LNE_end_sequence # CHECK-LINE-TABLE-NEXT: 0x0000000000000000 1 1 1 0 0 0 is_stmt end_sequence ``` Both rows having PC 0x0 is incorrect, and parsers won't be able to parse them. See more explanation why this is wrong in #154851. ### Design This PR attempts to fix this by advancing the PC to the next available Label, and advance to the end of the section if no Label is available. ### Implementation - `emitDwarfLineEndEntry` will advance PC to the `CurrLabel` - If `CurrLabel` is null, its probably a fake LineEntry we introduced in #110192. In that case look for the next Label - If still not label can be found, use `null` and `emitDwarfLineEndEntry` is smart enough to advance PC to the end of the section - Rename `LastLabel` to `PrevLabel`, "last" can mean "previous" or "final", this is ambigous. - Updated the tests to emit a correct label. ### Note This fix should render #154986 and #154851 obsolete, they were temporary fixes and don't resolve the root cause. --------- Signed-off-by: Peter Rong <PeterRong@meta.com>	2025-09-12 10:33:53 -07:00
Aiden Grossman	e5db36b604	[Clang] Port ulimit tests to work with internal shell Now that ulimit is implemented for the internal shell, we can make sure that the clang tests utilizing ulimit actually work. One just needs the removal of its shell requirement while the other one needs some rework to avoid bash for loops. These are writtein in Python for about the same amount of complexity. Reviewers: ilovepi, cmtice, AaronBallman, Sirraide, petrhosek Reviewed By: ilovepi Pull Request: https://github.com/llvm/llvm-project/pull/157977	2025-09-12 10:21:40 -07:00
Reid Kleckner	fd58f235f8	Revert "[SCEV] Fold (C1 * A /u C2) -> A /u (C2 /u C1), if C2 > C1." (#158328 ) Reverts llvm/llvm-project#157656 There are multiple reports that this is causing miscompiles in the MSan test suite after bootstrapping and that this is causing miscompiles in rustc. Let's revert for now, and work to capture a reproducer next week.	2025-09-12 10:15:41 -07:00
CatherineMoore	ea24d62f10	Add table to track OpenMP 5.2 Support; Update status of task graph (#158322 ) implementation; Co-authored-by: Michael Klemm <michael.klemm@amd.com>	2025-09-12 19:14:11 +02:00
Philip Reames	d75b837ff4	[RISCV] Support umin/umax in tryFoldSelectIntoOp (#157548 ) The neutral values for these are -1U, and 0 respectively. We already have good arithmetic lowerings for selects with one arm equal to these values. smin/smax are a bit harder, and will be a separate change. Somewhat surprisingly, this looks to be a net code improvement in all of the configurations. With both zbb, it's a clear win. With only zicond, we still seem to come out ahead because we reduce the number of ziconds needed (since we lower min/max to them). Without either zbb or zicond, we're a bit more of wash, but the available arithmetic sequences are good enough that doing the select unconditionally before using branches for the min/max is probably still worthwhile?	2025-09-12 10:02:00 -07:00
Dmitry Vasilyev	a848008e19	[lldb] Fixed UB in CPlusPlusLanguage plug-in (#158304 ) C++11 allows the use of Universal Character Names (UCNs) in identifiers, including function names. According to the spec the behavior of std::isalpha(ch) and std::isalnum(ch) is undefined if the argument's value is neither representable as unsigned char nor equal to EOF. To use these functions safely with plain chars (or signed chars), the argument should first be converted to unsigned char.	2025-09-12 20:56:21 +04:00
Matheus Izvekov	ba9d1c41c4	[clang] AST: remove DependentTemplateSpecializationType (#158109 ) A DependentTemplateSpecializationType (DTST) is basically just a TemplateSpecializationType (TST) with a hardcoded DependentTemplateName (DTN) as its TemplateName. This removes the DTST and replaces all uses of it with a TST, removing a lot of duplication in the implementation. Technically the hardcoded DTN is an optimization for a most common case, but the TST implementation is in better shape overall and with other optimizations, so this patch ends up being an overall performance positive: <img width="1465" height="38" alt="image" src="https://github.com/user-attachments/assets/084b0694-2839-427a-b664-eff400f780b5" /> A DTST also didn't allow a template name representing a DTN that was substituted, such as from an alias template, while the TST does allow it by the simple fact it can hold an arbitrary TemplateName, so this patch also increases the amount of sugar retained, while still being faster overall. Example (from included test case): ```C++ template<template<class> class TT> using T1 = TT<int>; template<class T> using T2 = T1<T::template X>; ``` Here we can now represent in the AST that `TT` was substituted for the dependent template name `T::template X`.	2025-09-12 13:55:38 -03:00
Aiden Grossman	615d07ea55	[lit] Implement ulimit builtin This patch implements ulimit inside the lit internal shell. Implementation wise, this functions similar to umask. But instead of setting the limits within the lit test worker process, we set environment variables and add a wrapper around the command to be executed. The wrapper then sets the limits. This is because we cannot increase the limits after lowering them, so we would otherwise end up with a lit test worker stuck with a lower limit. There are several tests where the use of ulimit is essential to the semantics of the test (two in clang, ~7 in compiler-rt), so we need to implement this in order to switch on the internal shell by default without losing test coverage. Reviewers: cmtice, petrhosek, ilovepi Reviewed By: cmtice, ilovepi Pull Request: https://github.com/llvm/llvm-project/pull/157958	2025-09-12 09:51:17 -07:00
Kazu Hirata	bd7c2f15e8	[ADT] Simplify PointerBitMask in PointerIntPair.h (NFC) (#158210 ) A left shift of (uintptr_t)-1) is simpler.	2025-09-12 09:45:53 -07:00
Philip Reames	6885950931	[SCEV] Fix a hang introduced by collectForPHI (#158153 ) If we have a phi where one of it's source blocks is an unreachable block, we don't want to traverse back into the unreachable region. Doing so allows e.g. finding a trivial self loop when walking back the predecessor chain.	2025-09-12 09:39:57 -07:00
Antonio Frighetto	04d38bed70	[clang] Regenerate test checks including TBAA semantics (NFC) Tests exercizing TBAA metadata (both purposefully and not), and previously generated via UTC, have been regenerated and updated to version 6.	2025-09-12 18:37:59 +02:00
Mehdi Amini	f3b712f6e4	[MLIR] Add debug log to the pass manager (NFC) (#156205 )	2025-09-12 17:37:30 +01:00
Charitha Saumya	9b0d7ddb04	[mlir][xegpu] Add support for `vector.multi_reduction` and `vector.shape_cast` SIMT distribution. (#157560 ) Add support for distributing the `vector.multi_reduction` operation across lanes in a warp. Currently only 2D to 1D reductions are supported. Given layouts for the source and accumulator vectors, * If the reduction dimension is distributed across lanes, the reduction is non-lane-local and the reduction is done using warp shuffles. Here we simply rewrite the `MultiDimReductionOp` to a sequence of `ReductionOp`s inside the warp op body. Actual distribution will be done by `WarpOpReduction` pattern. * If the reduction dimension is not distributed across lanes, the reduction is lane-local. In this case, we yield the source and accumulator vectors from the warp op and perform the lane-local reduction outside the warp op using a sequence of `ReductionOp`s. PR also adds support for distributing `vector.shape_cast` based on layouts.	2025-09-12 09:37:04 -07:00
Felipe de Azevedo Piovezan	5d088ba304	[lldb] Track CFA pointer metadata in StackID (#157498 ) [lldb] Track CFA pointer metadata in StackID In this commit: `9c8e716442` [lldb] Make StackID call Fix{Code,Data} pointers (#152796) We made StackID keep track of the CFA without any pointer metadata in it. This is necessary when comparing two StackIDs to determine which one is "younger". However, the CFA inside StackIDs is also used in other contexts through the method StackID::GetCallFrameAddress. One notable case is DWARFExpression: the computation of `DW_OP_call_frame_address` is done using StackID. This feeds into many other places, e.g. expression evaluation may require the address of a variable that is computed from the CFA; to access the variable without faulting, we may need to preserve the pointer metadata. As such, StackID must be able to provide both versions of the CFA. In the spirit of allowing consumers of pointers to decide what to do with pointer metadata, this patch changes StackID to store both versions of the cfa pointer. Two getter methods are provided, and all call sites except DWARFExpression preserve their existing behavior (stripped pointer). Other alternatives were considered: * Just store the raw pointer. This would require changing the comparisong operator `<` to also receive a Process, as the comparison requires stripped pointers. It wasn't clear if all call-sites had a non-null process, whereas we know we have a process when creating a StackID. * Store a weak pointer to the process inside the class, and then strip metadata as needed. This would require a `weak_ptr::lock` in many operations of LLDB, and it felt wasteful. It also prevents stripping of the pointer if the process has gone away. This patch also changes RegisterContextUnwind::ReadFrameAddress, which is the method computing the CFA fed into StackID, to also preserve the signature pointers.	2025-09-12 09:17:48 -07:00
Vedant Paranjape	c45aa5c764	[InstCombine] Revert FSub optimization from #157757 (#158315 ) Since FSub X, 0 gets canoncialised to FAdd X, -0 the said optimization didn't make much sense for FSub. Remove it from IC and the adjoined testcase.	2025-09-12 12:16:31 -04:00
Kazu Hirata	2491dc3d6f	[Utils] Fix a warning This patch fixes: llvm/lib/Transforms/Utils/SimplifyCFG.cpp:338:6: error: unused function 'isSelectInRoleOfConjunctionOrDisjunction' [-Werror,-Wunused-function]	2025-09-12 09:13:16 -07:00
Florian Hahn	b8eaceb39b	[VPlan] Explicitly replicate VPInstructions by VF. (#155102 ) Extend replicateByVF added in #142433 (`aa24029319`) to also explicitly unroll replicating VPInstructions. Now the only remaining case where we replicate for all lanes is VPReplicateRecipes in replicate regions. PR: https://github.com/llvm/llvm-project/pull/155102	2025-09-12 17:06:26 +01:00
Alex Trotta	ed1f1b88e4	Revert "[bazel][mlir][python] Port #155741 : stub auto-generation (#157173 )" (#157995 ) This reverts commit `46d8fdd86e`. The whole set of commits got reverted in https://github.com/llvm/llvm-project/pull/157831, reverting this one too.	2025-09-12 10:53:34 -05:00
Matthew Devereau	ead4f3e271	[InstCombine] Canonicalize active lane mask params (#158065 ) Rewrite active lane mask intrinsics to begin their range from 0 when both parameters are constant integers.	2025-09-12 16:35:58 +01:00
Jeaye Wilkerson	7ebfcbd0ec	Allow for custom code model in clang::Interpreter (#156977 ) This is necessary when using ASan, since the larger code size will lead to errors such as: ``` JIT session error: In graph clojure_core-clojure.core$clojure_core_cpp_cast_24538-24543-jitted-objectbuffer, section .eh_frame: relocation target 0x7bffe374b000 (DW.ref.__gxx_personality_v0) is out of range of Delta32 fixup at address 0x7bffe374b000 (<anonymous block> @ 0x7fffebf48158 + 0x13) ``` Previously, `clang::Interpreter` would hard-code the usage of a small code model. With this change, we default to small, but allow for custom values. This related to #102858 and #135401. There is no change to default behavior here. @lhames for review.	2025-09-12 18:34:14 +03:00
Jakub Kuderski	be587941c2	[mlir] Self-nominate for arith dialect maintenance (#157355 ) Following https://llvm.org/docs/DeveloperPolicy.html#maintainers, I'd like to self-nominate for arith dialect maintenance. As per the policy: > Maintainers are volunteering to take on the following shared responsibilities within an area of a project: > ... I believe I've been already performing most of the maintenance duties over the past few years, including direct code contributions, code reviews, and both starting and participating in relevant RFCs on discourse. You can look those up with: * `git log --author=Jakub --oneline -- 'mlir/include/mlir/Dialect/Arith' 'mlir/lib/Dialect/Arith'` * https://github.com/llvm/llvm-project/pulls?q=is%3Apr+label%3Amlir%3Aarith+reviewed-by%3Akuhar * Some notable RFCs authored: https://discourse.llvm.org/t/rfc-define-precise-arith-semantics/65507, https://discourse.llvm.org/t/rfc-poison-semantics-for-mlir/66245, https://discourse.llvm.org/t/rfc-arith-add-extended-multiplication-ops/66869, https://discourse.llvm.org/t/rfc-add-integer-add-with-carry-op-to-arith/64573, https://discourse.llvm.org/t/rfc-arith-should-we-support-scalar-vector-arith-bitcast-s/65427. In addition to the `core` category maintainers, I can bring additional perspective as I care both about conversion to llvm (as a user) and to spirv (as a maintainer).	2025-09-12 11:32:10 -04:00
Sander de Smalen	149f91bad6	[compiler-rt][AArch64] Don't use x18 in __arm_sme_save (#157802 ) The AAPCS recommends avoiding the use of x18 as it may be used for other purposes such as a shadow call stack. In this particular case it could just as well use x16 instead.	2025-09-12 16:20:16 +01:00
Karlo Basioli	6c11130bcd	Fix bazel build issue - caused in #156825 (#158313 )	2025-09-12 16:10:15 +01:00

1 2 3 4 5 ...

552407 Commits