llvm/mlir
Valentin Clement (バレンタイン クレメン) b4d7d3f745 [mlir][NVVM] Add nvvm.membar operation (#166698)
Add the nvvm.membar operation, with levels as defined in
https://docs.nvidia.com/cuda/parallel-thread-execution/#parallel-synchronization-and-communication-instructions-membar

This will be used to replace the direct intrinsic calls in CUDA Fortran for
`threadfence()`, `threadfence_block()` and `threadfence_system()`, which are
currently lowered here:
e700f15702/flang/lib/Optimizer/Builder/CUDAIntrinsicCall.cpp (L1310)

The NVVM membar intrinsics are also used in CUDA C/C++
(49f55f4991/clang/lib/Headers/__clang_cuda_device_functions.h (L528))
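
As a rough illustration of the levels mentioned above, the sketch below maps each fence function to the PTX `membar` level it is conventionally associated with (`.cta`, `.gl`, `.sys`). The enum and helper names (`MembarLevel`, `ptxMembar`, `levelForFence`) are hypothetical and exist only for this example; they are not the types or functions introduced by this commit, and the op's actual attribute spelling may differ.

```cpp
#include <string>

// Hypothetical enum for illustration only; not the NVVM dialect's attribute type.
enum class MembarLevel { CTA, GL, SYS };

// PTX spelling of the membar instruction at each level.
inline std::string ptxMembar(MembarLevel level) {
  switch (level) {
  case MembarLevel::CTA:
    return "membar.cta"; // ordering visible to threads in the same CTA (block)
  case MembarLevel::GL:
    return "membar.gl";  // ordering visible to all threads on the device
  case MembarLevel::SYS:
    return "membar.sys"; // ordering visible system-wide (host and peer devices)
  }
  return "";
}

// Level conventionally associated with each fence function named in the
// commit message. Unknown names fall back to GL, an assumption of this sketch.
inline MembarLevel levelForFence(const std::string &fence) {
  if (fence == "threadfence_block")
    return MembarLevel::CTA;
  if (fence == "threadfence_system")
    return MembarLevel::SYS;
  return MembarLevel::GL; // "threadfence"
}
```

For example, `ptxMembar(levelForFence("threadfence_block"))` yields the string `membar.cta`.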
2025-11-07 10:39:01 -08:00

Multi-Level Intermediate Representation

See https://mlir.llvm.org/ for more information.