intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-22 23:49:22 +08:00

Files

Guray Ozen 70c2e0618a [mlir][nvgpu] Add nvgpu.tma.async.load and nvgpu.tma.descriptor

This work adds the `nvgpu.tma.async.load` op, which requests a TMA load asynchronously using an mbarrier object.

It also creates the `nvgpu.tma.descriptor` type. The type is supposed to be created by the `cuTensorMapEncodeTiled` CUDA driver API.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D155453

2023-07-21 10:23:25 +02:00

..

benchmark/python

…

…

…

…

…

[mlir] Add opt-in default property bytecode read and write implementation

2023-07-21 08:03:26 +02:00

…

…

.clang-format

…

.clang-tidy

…

CMakeLists.txt

…

LICENSE.TXT

…

README.md

…

README.md

Multi-Level Intermediate Representation

See https://mlir.llvm.org/ for more information.