intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-13 19:08:21 +08:00

Files

History

Giacomo Castiglioni d3edc94d11 [MLIR][GPU] subgroup_mma fp64 extension - take 2 (#169061 )

This PR re-lands #165873.

This PR extends the gpu.subgroup_mma_* ops to support fp64 type.
The extension requires special handling during the lowering to nvvm due
to the return type for load ops for fragment a and b (they return a
scalar instead of a struct).

The original PR did not guard the new test based on the required
architecture (sm80) which lead to a failure on the cuda runners with T4
GPUs.

2025-12-01 07:39:59 -05:00

..

benchmark/python

…

…

[mlir][linalg] Restrict fill initial value type to output element type (#169567 )

2025-11-30 09:51:37 -05:00

[MLIR][Python] make sure stubs get installed with LLVM_DISTRIBUTION_COMPONENTS (#168407 )

2025-11-19 07:07:28 -08:00

[MLIR][GPU] subgroup_mma fp64 extension - take 2 (#169061 )

2025-12-01 07:39:59 -05:00

[MLIR][GPU] subgroup_mma fp64 extension - take 2 (#169061 )

2025-12-01 07:39:59 -05:00

[mlir][linalg] Restrict fill initial value type to output element type (#169567 )

2025-11-30 09:51:37 -05:00

[MLIR][GPU] subgroup_mma fp64 extension - take 2 (#169061 )

2025-12-01 07:39:59 -05:00

[CodeGenTypes] Remove explicit VT numbers from ValueTypes.td (#169670 )

2025-11-27 13:11:45 +00:00

[acc][flang] Add getInitRegion() to GlobalVariableOpInterface (#169569 )

2025-11-25 13:44:11 -08:00

…

.clang-format

…

.clang-tidy

…

CMakeLists.txt

[MLIR][Python] make sure stubs get installed with LLVM_DISTRIBUTION_COMPONENTS (#168407 )

2025-11-19 07:07:28 -08:00

LICENSE.TXT

…

Maintainers.md

…

README.md

…

README.md

Multi-Level Intermediate Representation

See https://mlir.llvm.org/ for more information.