mirror of
https://github.com/intel/llvm.git
synced 2026-02-04 11:38:04 +08:00
This patch handles native `mma.sync` sizes and enables issuing `ldmatrix` on largest possible tiles for matrixB. It requires handling `vector.extract_strided_slice` from vector to ngpu lowering. Differential Revision: https://reviews.llvm.org/D135749
Multi-Level Intermediate Representation
See https://mlir.llvm.org/ for more information.