This patch handles native `mma.sync` sizes and enables issuing `ldmatrix` on the largest possible tiles for matrix B. Doing so requires handling `vector.extract_strided_slice` in the vector-to-nvgpu lowering.

Differential Revision: https://reviews.llvm.org/D135749
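
As a rough illustration of why strided slices come into play, the sketch below enumerates the offsets at which a larger loaded tile would be cut into native-sized operands. It is not taken from the patch: the helper `strided_slice_offsets` is hypothetical, and the shapes are assumptions (a 16x16 tile of 16-bit elements for matrix B, as a single `ldmatrix` with four 8x8 fragments could load, and a 16x8 B operand for an `m16n8k16` `mma.sync`).

```python
from typing import List, Tuple

Shape = Tuple[int, int]


def strided_slice_offsets(tile_shape: Shape, native_shape: Shape) -> List[Shape]:
    """Return the (row, col) offset of each native-sized slice inside a larger tile.

    Hypothetical helper for illustration only; it is not part of MLIR.
    """
    rows, cols = tile_shape
    native_rows, native_cols = native_shape
    if rows % native_rows or cols % native_cols:
        raise ValueError("tile shape must be a multiple of the native shape")
    return [
        (r, c)
        for r in range(0, rows, native_rows)
        for c in range(0, cols, native_cols)
    ]


# Assumed shapes (k x n) for matrix B with 16-bit elements.
LDMATRIX_TILE_B: Shape = (16, 16)   # largest tile loaded at once (assumption)
MMA_SYNC_NATIVE_B: Shape = (16, 8)  # B operand of one m16n8k16 mma.sync (assumption)

offsets = strided_slice_offsets(LDMATRIX_TILE_B, MMA_SYNC_NATIVE_B)
for row, col in offsets:
    print(f"slice at offsets [{row}, {col}] with sizes {list(MMA_SYNC_NATIVE_B)}")
```

Under these assumed shapes, one loaded tile feeds two `mma.sync` operations, each consuming one slice. That slicing corresponds to the `vector.extract_strided_slice` ops the lowering needs to handle.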