llvm/clang/lib/CodeGen at fa13d015a35b879c33cd5ab68e0e4eb7cae28b11 - llvm

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-02-05 13:21:04 +08:00

Yaxun Liu fa13d015a3 [OpenCL] Fix __enqueue_block for block with captures

The following test case causes an issue with the codegen of __enqueue_block:

void (^block)(void) = ^{ callee(id, out); };

enqueue_kernel(queue, 0, ndrange, block);

Clang first does codegen for the block expression in the first line and deletes its block info.
Clang then tries to do codegen for the same block expression again for the second line,
and fails because the block info is gone.

The fix is to do normal codegen for both lines. Introduce an API in the OpenCL runtime to
record the LLVM block invoke function and LLVM block literal emitted for each AST block
expression, and use the recorded information to generate the wrapper kernel.

The EmitBlockLiteral APIs are cleaned up to minimize changes to the normal codegen
of blocks.

Another minor issue is that some cleanup AST expressions are generated for blocks
with captures; these can be stripped by IgnoreImplicit.

Differential Revision: https://reviews.llvm.org/D43240

llvm-svn: 325264

2018-02-15 16:39:19 +00:00

ABIInfo.h

…

Address.h

…

BackendUtil.cpp

Update for llvm change. NFC.

2018-02-14 19:11:37 +00:00

CGAtomic.cpp

[CodeGen] Decorate aggregate accesses with TBAA tags

2018-01-25 14:21:55 +00:00

CGBlocks.cpp

[OpenCL] Fix __enqueue_block for block with captures

2018-02-15 16:39:19 +00:00

CGBlocks.h

…

CGBuilder.h

Change memcpy/memmove/memset to have dest and source alignment attributes.

2018-01-28 17:27:45 +00:00

CGBuiltin.cpp

[X86] Reverse the operand order of the implementation of the kunpack builtins.

2018-02-12 22:38:52 +00:00

CGCall.cpp

Make attribute-target on a Definition-after-use update the LLVM attributes

2018-02-12 17:01:41 +00:00

CGCall.h

Pass around function pointers as CGCallees, not bare llvm::Value*s.

2018-02-06 18:52:44 +00:00

CGClass.cpp

[CodeGen] Use the non-virtual alignment when emitting the base

2018-01-27 00:34:09 +00:00

CGCleanup.cpp

…

CGCleanup.h

…

CGCoroutine.cpp

…

CGCUDANV.cpp

…

CGCUDARuntime.cpp

…

CGCUDARuntime.h

…

CGCXX.cpp

…

CGCXXABI.cpp

…

CGCXXABI.h

Pass around function pointers as CGCallees, not bare llvm::Value*s.

2018-02-06 18:52:44 +00:00

CGDebugInfo.cpp

Implement function attribute artificial

2018-02-14 00:14:07 +00:00

CGDebugInfo.h

[DebugInfo] Update Checksum handling in CGDebugInfo

2018-02-12 19:47:05 +00:00

CGDecl.cpp

[DebugInfo] Avoid name conflict of generated VLA expression variable.

2018-02-13 07:49:34 +00:00

CGDeclCXX.cpp

…

CGException.cpp

…

CGExpr.cpp

Recommit rL323952: [DebugInfo] Enable debug information for C99 VLA types.

2018-02-03 13:55:59 +00:00

CGExprAgg.cpp

[CodeGen] Decorate aggregate accesses with TBAA tags

2018-01-25 14:21:55 +00:00

CGExprComplex.cpp

…

CGExprConstant.cpp

[CodeGen] Use the zero initializer instead of storing an all zero representation.

2018-02-09 22:10:09 +00:00

CGExprCXX.cpp

IRGen: Move vtable load after argument evaluation.

2018-02-05 23:09:13 +00:00

CGExprScalar.cpp

Recommit rL323952: [DebugInfo] Enable debug information for C99 VLA types.

2018-02-03 13:55:59 +00:00

CGGPUBuiltin.cpp

…

CGLoopInfo.cpp

…

CGLoopInfo.h

…

CGObjC.cpp

Revert "CodeGen: annotate ObjC ARC functions with ABI constraints"

2018-01-30 20:19:34 +00:00

CGObjCGNU.cpp

…

CGObjCMac.cpp

CodeGen: use llvm.used for ObjC protocols

2018-01-23 19:35:51 +00:00

CGObjCRuntime.cpp

…

CGObjCRuntime.h

…

CGOpenCLRuntime.cpp

[OpenCL] Fix __enqueue_block for block with captures

2018-02-15 16:39:19 +00:00

CGOpenCLRuntime.h

[OpenCL] Fix __enqueue_block for block with captures

2018-02-15 16:39:19 +00:00

CGOpenMPRuntime.cpp

Recommit rL323952: [DebugInfo] Enable debug information for C99 VLA types.

2018-02-03 13:55:59 +00:00

CGOpenMPRuntime.h

…

CGOpenMPRuntimeNVPTX.cpp

Recommit rL323952: [DebugInfo] Enable debug information for C99 VLA types.

2018-02-03 13:55:59 +00:00

CGOpenMPRuntimeNVPTX.h

…

CGRecordLayout.h

…

CGRecordLayoutBuilder.cpp

[CodeGen] Fix an assertion failure in CGRecordLowering.

2018-02-01 03:04:15 +00:00

CGStmt.cpp

[WinEH] Put funclet bundles on inline asm calls

2018-02-09 00:16:41 +00:00

CGStmtOpenMP.cpp

Recommit rL323952: [DebugInfo] Enable debug information for C99 VLA types.

2018-02-03 13:55:59 +00:00

CGValue.h

…

CGVTables.cpp

Recommit r324107 again.

2018-02-07 22:15:33 +00:00

CGVTables.h

…

CGVTT.cpp

Recommit r324107 again.

2018-02-07 22:15:33 +00:00

CMakeLists.txt

…

CodeGenABITypes.cpp

…

CodeGenAction.cpp

…

CodeGenFunction.cpp

Recommit rL323952: [DebugInfo] Enable debug information for C99 VLA types.

2018-02-03 13:55:59 +00:00

CodeGenFunction.h

[OpenCL] Fix __enqueue_block for block with captures

2018-02-15 16:39:19 +00:00

CodeGenModule.cpp

Make attribute-target on a Definition-after-use update the LLVM attributes

2018-02-12 17:01:41 +00:00

CodeGenModule.h

Make attribute-target on a Definition-after-use update the LLVM attributes

2018-02-12 17:01:41 +00:00

CodeGenPGO.cpp

…

CodeGenPGO.h

…

CodeGenTBAA.cpp

[CodeGen] Decorate aggregate accesses with TBAA tags

2018-01-25 14:21:55 +00:00

CodeGenTBAA.h

[CodeGen] Decorate aggregate accesses with TBAA tags

2018-01-25 14:21:55 +00:00

CodeGenTypeCache.h

…

CodeGenTypes.cpp

…

CodeGenTypes.h

…

ConstantEmitter.h

…

ConstantInitBuilder.cpp

…

CoverageMappingGen.cpp

…

CoverageMappingGen.h

…

EHScopeStack.h

…

ItaniumCXXABI.cpp

ASan+operator new[]: Add an option for more thorough operator new[] cookie poisoning

2018-02-12 11:49:02 +00:00

MacroPPCallbacks.cpp

…

MacroPPCallbacks.h

…

MicrosoftCXXABI.cpp

Pass around function pointers as CGCallees, not bare llvm::Value*s.

2018-02-06 18:52:44 +00:00

ModuleBuilder.cpp

…

ObjectFilePCHContainerOperations.cpp

…

README.txt

…

SanitizerMetadata.cpp

…

SanitizerMetadata.h

…

SwiftCallingConv.cpp

…

TargetInfo.cpp

Fix for #31362 - ms_abi is implemented incorrectly for values >=16 bytes.

2018-02-08 11:15:21 +00:00

TargetInfo.h

Don't pass ForDefinition_t in places it is redundant.

2018-02-07 19:04:41 +00:00

VarBypassDetector.cpp

…

VarBypassDetector.h

…

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates a zext/sext of x, which can easily be avoided.
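
As a concrete instance of the pattern above, here is a minimal C sketch
(the function name is_ten is made up for illustration). Under C's integer
promotions, x is converted to int before the comparison, which is where
the sext (or zext, for an unsigned narrow type) comes from. Because 10 is
representable in short, comparing at the narrow width would give the same
answer.

```c
/* Hypothetical helper, not taken from Clang's test suite.
   The comparison is performed on int, so x is promoted
   (sign-extended) first, even though the constant fits in a short. */
int is_ten(short x) {
    return x == 10;
}
```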

//===---------------------------------------------------------------------===//

Bitfield accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned, then it is a lot shorter to just load the char
directly.
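
The following C sketch illustrates the 8-bit, byte-aligned case. The
struct and function names are invented for this example, and the byte
offset used in read_b_byte is an assumption: bitfield layout is
implementation-defined in C, although offset 1 is what common ABIs
produce for this declaration.

```c
#include <string.h>

/* Illustrative layout: b is 8 bits wide and starts on a byte boundary. */
struct Packed {
    unsigned a : 8;
    unsigned b : 8;
    unsigned rest : 16;
};

/* Ordinary bitfield read; conceptually a load of the containing unit
   followed by a shift and a mask. */
unsigned read_b_bitfield(const struct Packed *p) {
    return p->b;
}

/* Direct byte load. ASSUMPTION: b occupies the byte at offset 1, which
   holds on common ABIs but is implementation-defined in C. */
unsigned read_b_byte(const struct Packed *p) {
    unsigned char byte;
    memcpy(&byte, (const unsigned char *)p + 1, sizeof byte);
    return byte;
}
```

When the assumption holds, both functions return the same value, and
the second form needs neither a shift nor a mask.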

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of allocas for formal arguments
for the common situation where the argument is never written to and
never has its address taken. The idea would be to begin generating code
by using the argument directly and, if its address is taken or it is
stored to, then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about here is -O0 -g compile-time
performance, and in that scenario we currently need to emit the alloca
anyway to emit proper debug info. So this is blocked on
being able to emit debug information which refers to an LLVM
temporary, not an alloca.
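
To make the three argument cases discussed above concrete, here is a
small C sketch. All function names are hypothetical and chosen only for
illustration; the comments describe what the proposed optimization would
have to do in each case, not what Clang does today.

```c
/* n is only read: the alloca for n could in principle be skipped and
   the incoming argument value used directly. */
int add_one(int n) {
    return n + 1;
}

/* n is stored to inside the loop: the alloca would have to be
   generated and the earlier uses patched up. */
int count_down(int n) {
    int steps = 0;
    while (n > 0) {
        n--;
        steps++;
    }
    return steps;
}

static void bump(int *p) {
    *p += 1;
}

/* The address of n is taken, so n must live in memory. */
int bump_arg(int n) {
    bump(&n);
    return n;
}
```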

//===---------------------------------------------------------------------===//

We should try to avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead) down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!
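
A small C sketch of source that tends to produce such blocks (the
function classify is invented for this example). At -O0, the join point
after the inner if/else typically becomes a basic block containing
nothing but a branch to the outer join point. The exact block names and
shapes depend on the Clang version, so treat this as an illustration
rather than a guaranteed IR shape.

```c
/* Hypothetical example: nested conditionals whose inner join point
   does no work of its own and only falls through to the outer one. */
int classify(int x) {
    int r = 0;
    if (x > 0) {
        if (x > 100)
            r = 2;
        else
            r = 1;
        /* inner join: nothing to do here except continue to the
           outer join below */
    }
    return r;
}
```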

//===---------------------------------------------------------------------===//