intel/llvm - llvm - Gitea: Git with a cup of tea

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-02-04 19:44:15 +08:00

Author	SHA1	Message	Date
Jason Molenda	2149455cdc	Update docs to note lzfse open source implementation	2022-07-19 01:40:40 -07:00
Alexey Lapshin	e2147c26bd	[Debuginfo][llvm-dwarfutil] llvm-dwarfutil dsymutil-like tool for ELF. This patch implements proposal https://lists.llvm.org/pipermail/llvm-dev/2020-August/144579.html llvm-dwarfutil - is a tool that is used for processing debug info(DWARF) located in built binary files to improve debug info quality, reduce debug info size. The patch currently implements smaller set of command-line options(comparing to the proposal): ``` ./llvm-dwarfutil [options] <input file> <output file> --garbage-collection Do garbage collection for debug info(default) -j <value> Alias for --num-threads --no-garbage-collection Don`t do garbage collection for debug info --no-odr-deduplication Don`t do ODR deduplication for debug types --no-odr Alias for --no-odr-deduplication --no-separate-debug-file Create single output file, containing debug tables(default) --num-threads <threads> Number of available threads for multi-threaded execution. Defaults to the number of cores on the current machine --odr-deduplication Do ODR deduplication for debug types(default) --odr Alias for --odr-deduplication --separate-debug-file Create two output files: file w/o debug tables and file with debug tables --tombstone [bfd,maxpc,exec,universal] Tombstone value used as a marker of invalid address(default: universal) =bfd - Zero for all addresses and [1,1] for DWARF v4 (or less) address ranges and exec =maxpc - Minus 1 for all addresses and minus 2 for DWARF v4 (or less) address ranges =exec - Match with address ranges of executable sections =universal - Both: bfd and maxpc ``` Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D86539	2022-07-19 11:18:36 +03:00
Cullen Rhodes	f7b2d4aac6	[AArch64] Add patterns to fold zext(cmpeq(x, splat(0))) Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D129626	2022-07-19 08:14:38 +00:00
Xiang1 Zhang	4bb19de4b6	[X86] Add 64 bit implement for __SSC_MARK Reviewed By: craig.topper, pengfei.wang, jinsong Differential Revision: https://reviews.llvm.org/D129826	2022-07-19 16:13:41 +08:00
Nikita Popov	534b9246a2	[LoopInfo] Allow cloning of callbr After D129288, callbr is safe to clone without special handling. This permits optimizations like loop unroll and loop unswitch on loops containing callbrs. Fixes https://github.com/llvm/llvm-project/issues/41834. Differential Revision: https://reviews.llvm.org/D129993	2022-07-19 09:57:28 +02:00
Haojian Wu	d489b3807f	[pseudo] Implement a guard to determine function declarator. This eliminates some simple-declaration/function-definition false parses. - implement a function to determine whether a declarator ForestNode is a function declarator; - extend the standard declarator to two guarded function-declarator and non-function-declarator nonterminals; Differential Revision: https://reviews.llvm.org/D129222	2022-07-19 09:44:45 +02:00
Rosie Sumpter	05d424d165	[AArch64][SVE] Fold fadda(ptrue, x, select(mask, y, -0.0)) into fadda(mask, x, y) This patch adds an SVE pattern to recognize the use of a select with an fadda in the form fadda(ptrue, x, select(mask, y, -0.0)). In this case the select can be folded away, with the select mask used as the predicate for fadda. This improves the codegen when vectorizing loops with ordered fp reductions. Differential Revision: https://reviews.llvm.org/D129623	2022-07-19 08:31:51 +01:00
Matthias Springer	106d695287	[mlir][sparse][NFC] Update remaining test cases No more to_memref, memref.alloc or memref.dealloc when possible. Differential Revision: https://reviews.llvm.org/D130023	2022-07-19 09:21:10 +02:00
Matthias Springer	27a431f5e9	[mlir][bufferization][NFC] Move sparse_tensor.release to bufferization dialect This op used to belong to the sparse dialect, but there are use cases for dense bufferization as well. (E.g., when a tensor alloc is returned from a function and should be deallocated at the call site.) This change moves the op to the bufferization dialect, which now has an `alloc_tensor` and a `dealloc_tensor` op. Differential Revision: https://reviews.llvm.org/D129985	2022-07-19 09:18:19 +02:00
Nicolai Hähnle	5fc6213551	Revert change to clang/test/CodeGen/arm_acle.c For some reason, update_cc_test_checks.py produced a failing test. Partial revert of `301011fa60`	2022-07-19 09:11:23 +02:00
serge-sans-paille	eb0e3319bf	[sanitizer] Don't call dlerror() after swift_demangle lookup through dlsym Because the call to `dlerror()` may actually want to print something, which turns into a deadlock as showcased in #49223. Instead rely on further call to dlsym to clear `dlerror` internal state if they need to check the return status. Differential Revision: https://reviews.llvm.org/D128992	2022-07-19 09:07:30 +02:00
serge-sans-paille	a2ac383b44	[llvm] Fix forward declaration in Support/JSON.h Some methods of json::Array require json::Value to be completely defined, so they can't be defined in-class. Fix that by defining them out of class. Fix #55780	2022-07-19 09:07:29 +02:00
Bing1 Yu	af09127c94	[X86][NFC] avx512-f16c-v16f16-fadd.ll avx512-skx-v32f16-fadd.ll - add nounwind to prevent cfi noise on tests	2022-07-19 15:00:47 +08:00
Nicolai Hähnle	301011fa60	Rerun ./utils/update_cc_test.py on a bunch of tests Due to update script changes; this reduces the size of a later "real" diff.	2022-07-19 08:53:05 +02:00
Max Kazantsev	51f837a680	[NFC] Introduce API to detect tokens penetrating LCSSA form Following discussion in PR56243, we need to somehow detect the situation when token values penetrate LCSSA form for transforms that require that it is maintained by all values (for example, to sustain use-def dominance invarians). This patch introduces a parameter to LCSSA checkers to control their ignorance about tokens. Differential Revision: https://reviews.llvm.org/D129983 Reviewed By: efriedma	2022-07-19 13:52:30 +07:00
LLVM GN Syncbot	5114e2c50a	[gn build] Port `8ed702b83f`	2022-07-19 06:42:58 +00:00
Max Kazantsev	69b284aaf6	Revert "[DAGCombiner] Teach scalarizeBinOpOfSplats handle scalable splat." This reverts commit `58dfaaaace`. Massive AARCH test failures in buildbot.	2022-07-19 13:41:52 +07:00
Bing1 Yu	e01bf5a3e2	[X86] Promote v32f16's fadd into v32f32's fadd when it is avx512 without avx512fp16 Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D130059	2022-07-19 14:37:50 +08:00
Shraiysh Vaishay	35fc666877	[OpenMP][IRBuilder] Add support for taskgroup This patch adds support for generating taskgroup construct. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D128203	2022-07-19 10:49:34 +05:30
Jacques Pienaar	c8598fa22f	[mlir] Add refineReturnTypes to InferTypeOpInterface refineReturnType method shares the same parameters as inferReturnTypes but gets passed in the return types of the op if known that can be used during refinement passes or for more op specific error reporting. Currently the error reporting on failure is generic and doesn't allow for specializing the returned result based on failure, with this change what would previously have been a separate trait with specialized verification can just be handled as part of inferrence rather than duplicated. refineReturnTypes behaves like inferReturnTypes if no result types are fed in, while the current verification is recast as the default implementation for refineReturnTypes with it calling inferReturnTypes (and so the default type verification now goes through refine and allows for more op specific inference mismatch errors). Differential Revision: https://reviews.llvm.org/D129955	2022-07-18 22:18:52 -07:00
Carlos Alberto Enciso	83e922562f	Update the Windows packaging script. As discussed on: https://discourse.llvm.org/t/build-llvm-release-bat-script-options/63146/6 - Refactor the build/test steps into functions. - Exit the script if the build directory already exists. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D129559	2022-07-19 05:55:14 +01:00
Nathan James	6357f1c1aa	[clang-tidy] Remove unnecessary code from ReadabilityModuleTest D56303 added testing code that was then made redundant by the changes in D125026. However this code wasn't completely removed in the latter patch. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D130026	2022-07-19 05:21:19 +01:00
Konstantin Varlamov	8ed702b83f	[libc++][ranges] Implement `ranges::{,stable_}partition`. Differential Revision: https://reviews.llvm.org/D129624	2022-07-18 21:06:17 -07:00
Lang Hames	67220c2ad7	[ORC] Fix serialization / deserialization of default-constructed ArrayRef<char>. Avoids a zero-length memcpy from a null src, which caused errors on some of the sanitizer bots. Also uses null when deserializing an empty ArrayRef (rather than pointing to a zero length range in the middle of the input buffer).	2022-07-18 20:39:01 -07:00
jacquesguan	58dfaaaace	[DAGCombiner] Teach scalarizeBinOpOfSplats handle scalable splat. This revision supports to scalarize a binary operation of two scalable splat vectors. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D122791	2022-07-19 11:20:51 +08:00
jacquesguan	3fcaea176c	[RISCV][test] Precommit test for D122791. Differential Revision: https://reviews.llvm.org/D123362	2022-07-19 10:56:02 +08:00
Kazushi (Jam) Marukawa	469044cfd3	[VE] Support load/store/spill of vector mask registers Support load/store/spill of vector mask registers and add regression tests. Reviewed By: efocht Differential Revision: https://reviews.llvm.org/D129415	2022-07-19 10:29:21 +09:00
zhongyunde	bddf20735e	[AArch64][NFC] Set true for default of subfeature is more readable Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D129960	2022-07-19 09:00:00 +08:00
Jim Ingham	83fab8cee9	Revert "Make hit point counts reliable for architectures that stop before evaluation." This reverts commit `5778ada8e5`. The watchpoint tests all stall on aarch64-ubuntu bots. Reverting till I can get my hands on an system to test this out.	2022-07-18 17:38:43 -07:00
Jim Ingham	4f5707e743	Revert "This is a followup to https://reviews.llvm.org/D129814 " This reverts commit `555ae5b8f5`. Apparently, there's something different about how Linux ARM handles watchpoints, as all the watchpoint tests seem to stall on the Ubuntu aarch64 bots. Reverting till I can get my hands on a linux system and see what is wrong.	2022-07-18 17:37:13 -07:00
ksyx	3198364e6e	[RISCV][Clang] Add support for Zmmul extension This patch implements recently ratified extension Zmmul, a subextension of M (Integer Multiplication and Division) consisting only multiplication part of it. Differential Revision: https://reviews.llvm.org/D103313 Reviewed By: craig.topper, jrtc27, asb	2022-07-18 20:26:08 -04:00
Argyrios Kyrtzidis	d1b58cada6	[unittests/Tooling/DependencyScannerTest] Add a target triple for `ScanDepsWithFS` test This should fix the `clang-ppc64-aix` builder.	2022-07-18 16:55:07 -07:00
Rahman Lavaee	ed93d157de	[llvm-objdump] Support --symbolize-operands when there is a single SHT_LLVM_BB_ADDR_MAP section for all text sections When linking, using `-Wl,-z,keep-text-section-prefix` results in multiple text sections while all `SHT_LLVM_BB_ADDR_MAP` sections are linked into a single one. In such case, we should not read the corresponding section for each text section, and instead read all `SHT_LLVM_BB_ADDR_MAP` sections before disassembly. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D129924	2022-07-18 16:51:22 -07:00
Jim Ingham	555ae5b8f5	This is a followup to https://reviews.llvm.org/D129814 That was causing hit counts to be double-counted on x86_64 Linux. It looks like StopInfoWatchpoint::ShouldStopSynchronous gets called twice for a give stop on Linux (not on Darwin). I had taken out the "have I been called already" check when I reworked this part of the code because it didn't seem necessary. Putting that back in because it looks like it is on some systems.	2022-07-18 16:24:31 -07:00
Ellis Hoag	3580daacf3	[InstrProf] Allow CSIRPGO function entry coverage The flag `-fcs-profile-generate` for enabling CSIRPGO moves the pass `pgo-instrumentation` after inlining. Function entry coverage works fine with this change, so remove the assert. I had originally left this assert in because I had not tested this at the time. Reviewed By: davidxl, MaskRay Differential Revision: https://reviews.llvm.org/D129407	2022-07-18 15:10:11 -07:00
Jim Ingham	e83d47f6b7	When the module path for `command script import` is invalid, echo the path. We were just emitting "invalid module" w/o saying which module. That's not particularly helpful. Differential Revision: https://reviews.llvm.org/D129338	2022-07-18 14:49:07 -07:00
Jim Ingham	5778ada8e5	Make hit point counts reliable for architectures that stop before evaluation. Since we want to present the "new & old" values for watchpoint hits, on architectures, including the ARM family, that stop before the triggering instruction is run, we need to single step over the instruction before stopping for realz. This was incorrectly done directly in the StopInfoWatchpoint::ShouldStop. That causes problems if more than one thread stops "for a reason" at the same time as the watchpoint, since the other actions didn't expect the process to make progress in this part of the execution control machinery. The correct way to do this is to schedule the step over using ThreadPlans, and then to restore the stop info after that plan stops, so that the rest of the stop info actions can happen when all the other threads have handled their immediate actions as well. Differential Revision: https://reviews.llvm.org/D129814	2022-07-18 14:36:32 -07:00
Matt Arsenault	8d0383eb69	CodeGen: Remove AliasAnalysis from regalloc This was stored in LiveIntervals, but not actually used for anything related to LiveIntervals. It was only used in one check for if a load instruction is rematerializable. I also don't think this was entirely correct, since it was implicitly assuming constant loads are also dereferenceable. Remove this and rely only on the invariant+dereferenceable flags in the memory operand. Set the flag based on the AA query upfront. This should have the same net benefit, but has the possible disadvantage of making this AA query nonlazy. Preserve the behavior of assuming pointsToConstantMemory implying dereferenceable for now, but maybe this should be changed.	2022-07-18 17:23:41 -04:00
Michael Jones	bf7f01d857	[libc] fix strtofloatingpoint on rare edge case Currently, there are two string parsers that can be used in a call to strtofloatingpoint. There is the main parser used by Clinger's fast path and Eisel-Lemire, and the backup parser used by Simple Decimal Conversion. There was a bug in the backup parser where if the number had more than 800 digits (the size of the SDC buffer) before the decimal point, it would just ignore the digits after the 800th and not count them into the exponent. This patch fixes that issue and adds regression tests. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D130032	2022-07-18 14:23:33 -07:00
zr33	1a1324a303	[BOLT][DWARF] Fix incorrect DW_AT_type offset for unittest Some unit tests has incorrect DW_AT_type offset since they are manual crafted, fix them to the correct offset. Reviewed By: Amir, ayermolo Differential Revision: https://reviews.llvm.org/D129828	2022-07-18 14:20:22 -07:00
zr33	66a41e0807	[BOLT][DWARF] Add Unit test for DW_AT_high_pc [DW_FORM_addr] Reviewed By: ayermolo Differential Revision: https://reviews.llvm.org/D127613	2022-07-18 14:03:53 -07:00
Sam McCall	fa0c7639e9	[pseudo] Add guards for module contextual keywords	2022-07-18 22:38:41 +02:00
Martin Storsjö	315072b450	[clang-tidy] Reduce the dependencies for the "make-confusable-table" tool When cross compiling llvm, a separate recursive native cmake build is generated, for building the tools that generate code (unless they're provided externally by the caller). This reduces the number of build steps for that native build from 1000+ steps to 162. This matches how the clang-pseudo-gen tool is set up in clang-tools-extra/pseudo/gen/CMakeLists.txt. Differential Revision: https://reviews.llvm.org/D129797	2022-07-18 22:50:29 +03:00
Björn Schäpers	d2eda49202	[clang-format] Mark constexpr lambdas as lambda Otherwise the brace was detected as a function brace, not wrong per se, but when directly calling the lambda the calling parens were put on the next line. Differential Revision: https://reviews.llvm.org/D129946	2022-07-18 21:42:34 +02:00
Björn Schäpers	3c18a8b3a3	[clang-format] Indent TT_CtorInitializerColon after requires clauses Fixes https://github.com/llvm/llvm-project/issues/56215 Differential Revision: https://reviews.llvm.org/D129942	2022-07-18 21:41:09 +02:00
Björn Schäpers	2b04c41b28	[clang-format] Fix misannotation of colon in presence of requires clause For clauses without parentheses it was annotated as TT_InheritanceColon. Relates to https://github.com/llvm/llvm-project/issues/56215 Differential Revision: https://reviews.llvm.org/D129940	2022-07-18 21:41:09 +02:00
Stanislav Mekhanoshin	523a99c0eb	[AMDGPU] Support for gfx940 fp8 smfmac Differential Revision: https://reviews.llvm.org/D129908	2022-07-18 12:12:41 -07:00
Stanislav Mekhanoshin	2695f0a688	[AMDGPU] Support for gfx940 fp8 mfma Differential Revision: https://reviews.llvm.org/D129906	2022-07-18 11:49:56 -07:00
Stanislav Mekhanoshin	9fa5a6b7e8	[AMDGPU] Support for gfx940 fp8 conversions Differential Revision: https://reviews.llvm.org/D129902	2022-07-18 11:48:43 -07:00
Florian Hahn	30e53b8c03	[LV] Sink module variable and use State to set it in widenCall. (NFC) Limits the lifetime of the variable and makes it independent of CallInst.	2022-07-18 19:41:48 +01:00

1 2 3 4 5 ...

430266 Commits