llvm/mlir/test/Dialect/GPU at 300750d4bea3fc2a17de13aa26f71aa10f2f5d2f - llvm

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-26 12:26:52 +08:00

Files

James Newling 0928f46c69 [MLIR][GPU] Ensure all lanes in cluster have final reduction value (#165764)

This is a fix for a cluster size of 32 when the subgroup size is 64.
Previously, only lanes [16, 32) ∪ [48, 64) contained the correct
clusterwise reduction value. This PR adds a swizzle instruction to
broadcast the correct value down to lanes [0, 16) ∪ [32, 48).
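
The lane ranges in the commit message can be checked with a small host-side model. The sketch below is illustrative only and is not the MLIR lowering: it assumes an add reduction, assumes the lower 16 lanes of each cluster held a partial sum over their own 16 lanes before the fix, and models the swizzle as each lower-half lane copying from the lane 16 positions above it. All function names are hypothetical.

```python
SUBGROUP_SIZE = 64
CLUSTER_SIZE = 32
HALF = CLUSTER_SIZE // 2  # 16 lanes


def cluster_sums(values):
    """Reference result: one full sum per cluster of 32 lanes."""
    return [sum(values[s:s + CLUSTER_SIZE])
            for s in range(0, SUBGROUP_SIZE, CLUSTER_SIZE)]


def state_before_fix(values):
    """Assumed pre-fix state (an illustration, not taken from the PR).

    Upper-half lanes of each cluster hold the full cluster sum;
    lower-half lanes hold only a partial sum over their own 16 lanes.
    """
    out = []
    for lane in range(SUBGROUP_SIZE):
        start = lane - lane % CLUSTER_SIZE
        if lane % CLUSTER_SIZE >= HALF:
            out.append(sum(values[start:start + CLUSTER_SIZE]))
        else:
            out.append(sum(values[start:start + HALF]))
    return out


def broadcast_down(state):
    """Model of the broadcast: lower-half lanes read from lane + 16."""
    return [state[lane + HALF] if lane % CLUSTER_SIZE < HALF else state[lane]
            for lane in range(SUBGROUP_SIZE)]


def correct_lanes(state, sums):
    """Lanes whose value equals the reduction of their own cluster."""
    return [lane for lane in range(SUBGROUP_SIZE)
            if state[lane] == sums[lane // CLUSTER_SIZE]]


values = list(range(SUBGROUP_SIZE))
sums = cluster_sums(values)
before = state_before_fix(values)
after = broadcast_down(before)
```

With these inputs, `correct_lanes(before, sums)` covers exactly lanes 16 to 31 and 48 to 63, and `correct_lanes(after, sums)` covers all 64 lanes, which matches the behaviour the commit message describes.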

2025-10-31 09:12:43 -07:00

all-reduce-add.mlir

…

all-reduce-maxf.mlir

…

async-region.mlir

…

barrier-elimination.mlir

…

broadcast-speculatability.mlir

…

bufferization-buffer-deallocation.mlir

…

canonicalize.mlir

…

decompose-memrefs.mlir

…

dynamic-shared-memory.mlir

…

globalId-rewrite.mlir

…

indirect-device-func-call.mlir

…

int-range-interface.mlir

…

invalid.mlir

…

mapping.mlir

…

memref-to-llvm.mlir

…

module-to-binary-invalid.mlir

…

module-to-binary-nvvm.mlir

…

module-to-binary-rocdl.mlir

…

module-to-binary-spirv.mlir

…

multiple-all-reduce.mlir

…

nvvm-attach-target.mlir

…

ops.mlir

…

outlining.mlir

…

promote-shuffle-amdgpu.mlir

…

promotion.mlir

…

shuffle-rewrite.mlir

…

sink-ops.mlir

…

sparse-roundtrip.mlir

…

spirv-attach-targets.mlir

…

subgroup-mma-vector-unroll.mlir

…

subgroup-reduce-lowering.mlir

…

subgroupId-rewrite.mlir

…

test-nvvm-pipeline.mlir

…

transform-gpu-failing.mlir

…

transform-gpu.mlir

…

value-bounds-op-interface-impl.mlir

…