llvm/clang/lib/CodeGen at 0bc4b2d33731279e41c4ce581a26a9f4386e08bb - llvm

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-02-06 15:18:53 +08:00

Files

Yaxun Liu 0bc4b2d337 [OpenCL] Generate opaque type for sampler_t and function call for the initializer

Currently Clang use int32 to represent sampler_t, which have been a source of issue for some backends, because in some backends sampler_t cannot be represented by int32. They have to depend on kernel argument metadata and use IPA to find the sampler arguments and global variables and transform them to target specific sampler type.

This patch uses opaque pointer type opencl.sampler_t* for sampler_t. For each use of file-scope sampler variable, it generates a function call of __translate_sampler_initializer. For each initialization of function-scope sampler variable, it generates a function call of __translate_sampler_initializer.

Each builtin library can implement its own __translate_sampler_initializer(). Since the real sampler type tends to be architecture dependent, allowing it to be initialized by a library function simplifies backend design. A typical implementation of __translate_sampler_initializer could be a table lookup of real sampler literal values. Since its argument is always a literal, the returned pointer is known at compile time and easily optimized to finally become some literal values directly put into image read instructions.

This patch is partially based on Alexey Sotkin's work in Khronos Clang (3d4eec6162).

Differential Revision: https://reviews.llvm.org/D21567

llvm-svn: 277024

2016-07-28 19:26:30 +00:00

ABIInfo.h

…

Address.h

…

BackendUtil.cpp

Add flags to toggle preservation of assembly comments

2016-07-27 19:57:40 +00:00

CGAtomic.cpp

[NFC] Header cleanup

2016-07-18 19:02:11 +00:00

CGBlocks.cpp

…

CGBlocks.h

[NFC] Header cleanup

2016-07-18 19:02:11 +00:00

CGBuilder.h

…

CGBuiltin.cpp

[X86][AVX] Added support for lowering to VBROADCASTF128/VBROADCASTI128 with generic IR

2016-07-22 13:58:56 +00:00

CGCall.cpp

[OpenCL] Add missing -cl-no-signed-zeros option into driver

2016-07-08 20:28:29 +00:00

CGCall.h

[NFC] Header cleanup

2016-07-18 19:02:11 +00:00

CGClass.cpp

When copying an array into a lambda, destroy temporaries from

2016-07-20 21:02:43 +00:00

CGCleanup.cpp

…

CGCleanup.h

…

CGCUDABuiltin.cpp

[CUDA] Align kernel launch args correctly when the LLVM type's alignment is different from the clang type's alignment.

2016-07-27 22:36:21 +00:00

CGCUDANV.cpp

[CUDA] Align kernel launch args correctly when the LLVM type's alignment is different from the clang type's alignment.

2016-07-27 22:36:21 +00:00

CGCUDARuntime.cpp

…

CGCUDARuntime.h

…

CGCXX.cpp

…

CGCXXABI.cpp

…

CGCXXABI.h

…

CGDebugInfo.cpp

[OpenCL] Generate opaque type for sampler_t and function call for the initializer

2016-07-28 19:26:30 +00:00

CGDebugInfo.h

[OpenCL] Generate opaque type for sampler_t and function call for the initializer

2016-07-28 19:26:30 +00:00

CGDecl.cpp

P0217R3: Parsing support and framework for AST representation of C++1z

2016-07-22 23:36:59 +00:00

CGDeclCXX.cpp

Clang changes for overloading invariant.start and end intrinsics

2016-07-22 17:50:08 +00:00

CGException.cpp

…

CGExpr.cpp

[OpenCL] Generate opaque type for sampler_t and function call for the initializer

2016-07-28 19:26:30 +00:00

CGExprAgg.cpp

[OpenCL] Generate opaque type for sampler_t and function call for the initializer

2016-07-28 19:26:30 +00:00

CGExprComplex.cpp

[OpenCL] Generate opaque type for sampler_t and function call for the initializer

2016-07-28 19:26:30 +00:00

CGExprConstant.cpp

[OpenCL] Generate opaque type for sampler_t and function call for the initializer

2016-07-28 19:26:30 +00:00

CGExprCXX.cpp

…

CGExprScalar.cpp

[OpenCL] Generate opaque type for sampler_t and function call for the initializer

2016-07-28 19:26:30 +00:00

CGLoopInfo.cpp

…

CGLoopInfo.h

[NFC] Header cleanup

2016-07-18 19:02:11 +00:00

CGObjC.cpp

…

CGObjCGNU.cpp

CodeGen: honour dllstorage on ObjC types

2016-07-17 22:27:44 +00:00

CGObjCMac.cpp

CodeGen: honour dllstorage on ObjC types

2016-07-17 22:27:44 +00:00

CGObjCRuntime.cpp

…

CGObjCRuntime.h

…

CGOpenCLRuntime.cpp

[OpenCL] Generate opaque type for sampler_t and function call for the initializer

2016-07-28 19:26:30 +00:00

CGOpenCLRuntime.h

[OpenCL] Generate opaque type for sampler_t and function call for the initializer

2016-07-28 19:26:30 +00:00

CGOpenMPRuntime.cpp

[OpenMP] Change name of variable in mappble expression.

2016-07-28 15:31:29 +00:00

CGOpenMPRuntime.h

[OpenMP] Codegen for use_device_ptr clause.

2016-07-28 14:23:26 +00:00

CGOpenMPRuntimeNVPTX.cpp

…

CGOpenMPRuntimeNVPTX.h

…

CGRecordLayout.h

…

CGRecordLayoutBuilder.cpp

…

CGStmt.cpp

Reverting r275115 which caused PR28634.

2016-07-21 23:28:18 +00:00

CGStmtOpenMP.cpp

[OpenMP] Codegen for use_device_ptr clause.

2016-07-28 14:23:26 +00:00

CGValue.h

…

CGVTables.cpp

[NFC] Header cleanup

2016-07-18 19:02:11 +00:00

CGVTables.h

…

CGVTT.cpp

…

CMakeLists.txt

…

CodeGenABITypes.cpp

…

CodeGenAction.cpp

[CodeGen] Handle recursion in LLVMIRGeneration Timer.

2016-07-21 06:28:48 +00:00

CodeGenFunction.cpp

Add XRay flags to Clang. We implement two flags to control the XRay behaviour:

2016-07-13 22:32:15 +00:00

CodeGenFunction.h

[OpenMP] Codegen for use_device_ptr clause.

2016-07-28 14:23:26 +00:00

CodeGenModule.cpp

[OpenCL] Generate opaque type for sampler_t and function call for the initializer

2016-07-28 19:26:30 +00:00

CodeGenModule.h

[OpenCL] Generate opaque type for sampler_t and function call for the initializer

2016-07-28 19:26:30 +00:00

CodeGenPGO.cpp

[Coverage] Move logic to skip decl's into a helper (NFC)

2016-07-11 22:57:44 +00:00

CodeGenPGO.h

[NFC] Header cleanup

2016-07-18 19:02:11 +00:00

CodeGenTBAA.cpp

…

CodeGenTBAA.h

…

CodeGenTypeCache.h

…

CodeGenTypes.cpp

…

CodeGenTypes.h

[NFC] Header cleanup

2016-07-18 19:02:11 +00:00

CoverageMappingGen.cpp

[Coverage] Do not write out coverage mappings with zero entries

2016-07-26 00:24:59 +00:00

CoverageMappingGen.h

[NFC] Header cleanup

2016-07-18 19:02:11 +00:00

EHScopeStack.h

…

ItaniumCXXABI.cpp

Don't crash when generating code for __attribute__((naked)) member functions.

2016-07-27 22:04:24 +00:00

MicrosoftCXXABI.cpp

Don't crash when generating code for __attribute__((naked)) member functions.

2016-07-27 22:04:24 +00:00

ModuleBuilder.cpp

…

ObjectFilePCHContainerOperations.cpp

Frontend: Simplify ownership model for clang's output streams.

2016-07-15 00:55:40 +00:00

README.txt

…

SanitizerMetadata.cpp

…

SanitizerMetadata.h

…

SwiftCallingConv.cpp

…

TargetInfo.cpp

Adjust coercion of aggregates on RenderScript

2016-07-27 19:01:51 +00:00

TargetInfo.h

[OpenCL] AMDGCN target will generate images in constant address space

2016-07-20 19:21:11 +00:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//