llvm/clang/lib/CodeGen at 96f6785bc9fe3219e9486ff09b22b31fa0c73b34 - llvm

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-02-02 02:00:03 +08:00

Files

Michael Kruse 650bbc5620 [OpenMP][OpenMPIRBuilder] Implement loop unrolling.

Recommit of 707ce34b06. Don't introduce a
dependency to the LLVMPasses component, instead register the required
passes individually.

Add methods for loop unrolling to the OpenMPIRBuilder class and use them in Clang if `-fopenmp-enable-irbuilder` is enabled. The unrolling methods are:

 * `unrollLoopFull`
 * `unrollLoopPartial`
 * `unrollLoopHeuristic`

`unrollLoopPartial` and `unrollLoopHeuristic` can use compiler heuristics to automatically determine the unroll factor. If possible, that is if no CanonicalLoopInfo is required to pass to another method, metadata for LLVM's LoopUnrollPass is added. Otherwise the unroll factor is determined using the same heurstics as user by LoopUnrollPass. Not requiring a CanonicalLoopInfo, especially with `unrollLoopHeuristic` allows greater flexibility.

With full unrolling and partial unrolling with known unroll factor, instead of duplicating instructions by the OpenMPIRBuilder, the full unroll is still delegated to the LoopUnrollPass. In case of partial unrolling the loop is first tiled using the existing `tileLoops` methods, then the inner loop fully unrolled using the same mechanism.

Reviewed By: jdoerfert, kiranchandramohan

Differential Revision: https://reviews.llvm.org/D107764

2021-09-04 19:18:58 -05:00

ABIInfo.h

…

Address.h

…

BackendUtil.cpp

[NewPM] Make some sanitizer passes parameterized in the PassRegistry

2021-08-19 12:43:37 +02:00

CGAtomic.cpp

[OpaquePtr] Remove uses of CreateConstGEP1_64() without element type

2021-07-17 16:43:20 +02:00

CGBlocks.cpp

[clang][NFC] GetOrCreateLLVMGlobal takes LangAS

2021-08-23 14:55:58 +02:00

CGBlocks.h

…

CGBuilder.h

[OpaquePtr] Remove uses of CreateGEP() without element type

2021-07-17 22:56:27 +02:00

CGBuiltin.cpp

Revert @llvm.isnan intrinsic patchset.

2021-09-02 13:53:56 +03:00

CGCall.cpp

[Clang] add support for error+warning fn attrs

2021-08-25 10:34:18 -07:00

CGCall.h

…

CGClass.cpp

[OpaquePtr] Remove uses of CreateGEP() without element type

2021-07-17 22:56:27 +02:00

CGCleanup.cpp

…

CGCleanup.h

…

CGCoroutine.cpp

…

CGCUDANV.cpp

[OpaquePtr] Remove uses of CreateConstGEP1_32() without element type

2021-07-17 18:32:36 +02:00

CGCUDARuntime.cpp

…

CGCUDARuntime.h

…

CGCXX.cpp

[OpaquePtr] Remove uses of CGF.Builder.CreateConstInBoundsGEP1_64() without type

2021-07-17 17:07:46 +02:00

CGCXXABI.cpp

…

CGCXXABI.h

…

CGDebugInfo.cpp

Fully qualify template template parameters when printing

2021-09-02 15:04:34 -07:00

CGDebugInfo.h

DebugInfo: Refactor/deduplicate various template argument list emission

2021-08-30 22:39:46 -07:00

CGDecl.cpp

[clang] NFC: change uses of Expr->getValueKind into is?Value

2021-07-28 03:09:31 +02:00

CGDeclCXX.cpp

PR48030: Fix COMDAT-related linking problem with C++ thread_local static data members.

2021-08-24 19:53:44 -07:00

CGException.cpp

…

CGExpr.cpp

[clang][CodeGen] GetDefaultAlignTempAlloca uses preferred alignment

2021-08-23 14:55:58 +02:00

CGExprAgg.cpp

[OpaquePtr] Remove uses of CreateInBoundsGEP() without element type

2021-07-17 21:27:16 +02:00

CGExprComplex.cpp

…

CGExprConstant.cpp

…

CGExprCXX.cpp

[NFC] More get/removeAttribute() cleanup

2021-08-17 21:05:41 -07:00

CGExprScalar.cpp

[clang] NFC: Fix trivial typo in comments and document

2021-09-04 12:59:42 +05:30

CGGPUBuiltin.cpp

…

CGLoopInfo.cpp

…

CGLoopInfo.h

…

CGNonTrivialStruct.cpp

[CodeGen] Stop creating fake FunctionDecls when generating IR for

2021-06-29 14:22:33 -07:00

CGObjC.cpp

[clang][patch][FPEnv] Make initialization of C++ globals strictfp aware

2021-07-29 12:02:37 -04:00

CGObjCGNU.cpp

[OpaquePtr] Remove uses of CreateStructGEP() without element type

2021-07-17 18:48:21 +02:00

CGObjCMac.cpp

…

CGObjCRuntime.cpp

…

CGObjCRuntime.h

…

CGOpenCLRuntime.cpp

…

CGOpenCLRuntime.h

…

CGOpenMPRuntime.cpp

[OpenMP][OpenACC] Implement ompx_hold map type modifier extension in Clang (1/2)

2021-08-31 16:13:49 -04:00

CGOpenMPRuntime.h

[OpenMP] Creating the omp_target_num_teams and omp_target_thread_limit attributes to outlined functions

2021-07-27 17:21:04 -04:00

CGOpenMPRuntimeAMDGCN.cpp

[openmp][nfc] Replace OMPGridValues array with struct

2021-08-19 13:25:42 +01:00

CGOpenMPRuntimeAMDGCN.h

…

CGOpenMPRuntimeGPU.cpp

[openmp][nfc] Refactor GridValues

2021-08-23 16:19:11 +01:00

CGOpenMPRuntimeGPU.h

[openmp][nfc] Replace OMPGridValues array with struct

2021-08-19 13:25:42 +01:00

CGOpenMPRuntimeNVPTX.cpp

…

CGOpenMPRuntimeNVPTX.h

…

CGRecordLayout.h

…

CGRecordLayoutBuilder.cpp

…

CGStmt.cpp

[NFC] More get/removeAttribute() cleanup

2021-08-17 21:05:41 -07:00

CGStmtOpenMP.cpp

[OpenMP][OpenMPIRBuilder] Implement loop unrolling.

2021-09-04 19:18:58 -05:00

CGValue.h

…

CGVTables.cpp

…

CGVTables.h

…

CGVTT.cpp

…

CMakeLists.txt

…

CodeGenABITypes.cpp

…

CodeGenAction.cpp

[Clang] add support for error+warning fn attrs

2021-08-25 10:34:18 -07:00

CodeGenFunction.cpp

Ensure field-annotations on pointers properly match the AS of the field.

2021-09-01 06:12:24 -07:00

CodeGenFunction.h

[OpenMP][OpenMPIRBuilder] Implement loop unrolling.

2021-09-04 19:18:58 -05:00

CodeGenModule.cpp

[OpenCL] Defines helper function for kernel language compatible OpenCL version

2021-08-31 10:08:38 +01:00

CodeGenModule.h

[clang][NFC] GetOrCreateLLVMGlobal takes LangAS

2021-08-23 14:55:58 +02:00

CodeGenPGO.cpp

…

CodeGenPGO.h

…

CodeGenTBAA.cpp

…

CodeGenTBAA.h

…

CodeGenTypeCache.h

Fix __attribute__((annotate("")) with non-zero globals AS

2021-08-26 10:09:40 +01:00

CodeGenTypes.cpp

…

CodeGenTypes.h

…

ConstantEmitter.h

…

ConstantInitBuilder.cpp

…

CoverageMappingGen.cpp

…

CoverageMappingGen.h

…

EHScopeStack.h

…

ItaniumCXXABI.cpp

PR48030: Fix COMDAT-related linking problem with C++ thread_local static data members.

2021-08-24 19:53:44 -07:00

MacroPPCallbacks.cpp

…

MacroPPCallbacks.h

…

MicrosoftCXXABI.cpp

TypeInfo records more information about align requirement

2021-08-28 19:47:48 -04:00

ModuleBuilder.cpp

…

ObjectFilePCHContainerOperations.cpp

…

PatternInit.cpp

…

PatternInit.h

…

README.txt

…

SanitizerMetadata.cpp

…

SanitizerMetadata.h

…

SwiftCallingConv.cpp

…

TargetInfo.cpp

TypeInfo records more information about align requirement

2021-08-28 19:47:48 -04:00

TargetInfo.h

[Clang][AArch64] Inline assembly support for the ACLE type 'data512_t'

2021-07-31 09:51:28 +01:00

VarBypassDetector.cpp

…

VarBypassDetector.h

…

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//