intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-27 06:06:34 +08:00


Andrzej Warzynski 447bb5bee4 [mlir][ArmSME] Introduce new lowering layer (Vector -> ArmSME)

At the moment, the lowering from the Vector dialect to SME looks like
this:

  * Vector --> SME LLVM IR intrinsics

This patch introduces a new lowering layer between the Vector dialect
and the Arm SME extension:

  * Vector --> ArmSME dialect (custom Ops) --> SME LLVM IR intrinsics.

This is motivated by two considerations:
1. Storing `ZA` to memory (e.g. `vector.transfer_write`) requires an
   `scf.for` loop over all rows of `ZA`. Similar logic will apply to
   "load to ZA from memory". This is a rather complex transformation and
   a custom Op seems justified.
2. As discussed in [1], we need to prevent the LLVM type converter from
   having to convert types unsupported in LLVM, e.g.
   `vector<[16]x[16]xi8>`. A dedicated abstraction layer with custom Ops
   opens a path to some fine tuning (e.g. custom type converters) that
   will allow us to avoid this.
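
The row-by-row store described in point 1 can be pictured with a small model. This is a plain Python sketch, not MLIR and not the code this patch generates. The names `tile_dim`, `store_tile`, and `vscale`, and the list-of-lists memory, are hypothetical and exist only for illustration. It assumes a square tile with `16 * vscale` rows, matching `vector<[16]x[16]xi8>`.

```python
def tile_dim(vscale, min_elems=16):
    """Rows (and columns) of a square scalable tile such as vector<[16]x[16]xi8>."""
    return min_elems * vscale


def store_tile(tile, memory, row_off, col_off):
    """Copy a 2-D tile into memory one row at a time.

    The Python loop stands in for the scf.for loop over the rows of ZA.
    """
    for r, row in enumerate(tile):  # one iteration per tile row
        memory[row_off + r][col_off:col_off + len(row)] = row
    return memory


vscale = 2  # assumed value for the example; on real hardware it is fixed at runtime
n = tile_dim(vscale)
zero_tile = [[0] * n for _ in range(n)]          # models a zeroed tile
memory = [[1] * (n + 4) for _ in range(n + 4)]   # models the destination memref
store_tile(zero_tile, memory, 2, 2)
```

Each loop iteration writes one full tile row, so the number of iterations scales with `vscale`. This is why the lowering needs a loop rather than a single fixed-size store.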

To facilitate this change, two new custom SME Ops are introduced:

  * `TileStoreOp`, and
  * `ZeroOp`.

Note that no new functionality is added - these Ops merely model what's
already supported. In particular, the following tile size is assumed
(dimension and element size are fixed):

  * `vector<[16]x[16]xi8>`

The new lowering layer is introduced via a conversion pass between the
Vector and the SME dialects. You can use the `-convert-vector-to-sme`
flag to run it. The following function:
```
func.func @example(%arg0 : memref<?x?xi8>) {
  // (...)
  %cst = arith.constant dense<0> : vector<[16]x[16]xi8>
  vector.transfer_write %cst, %arg0[%c0, %c0] : vector<[16]x[16]xi8>, memref<?x?xi8>
  return
}
```
would be lowered to:
```
  func.func @example(%arg0: memref<?x?xi8>) {
    // (...)
    %0 = arm_sme.zero : vector<[16]x[16]xi8>
    arm_sme.tile_store %arg0[%c0, %c0], %0 : memref<?x?xi8>, vector<[16]x[16]xi8>
    return
  }
```

Later, a mechanism will be introduced to guarantee that `arm_sme.zero`
and `arm_sme.tile_store` operate on the same virtual tile. For `i8`
elements this is not required as there is only one tile.
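
For context, the Arm SME architecture splits `ZA` into more tiles as the element width grows: one tile for 8-bit elements, two for 16-bit, four for 32-bit, and so on. The Python sketch below only illustrates that relationship. The helper `num_za_tiles` is hypothetical and not part of MLIR or of this patch.

```python
def num_za_tiles(element_bits):
    """Number of ZA tiles available for a given element width in bits."""
    if element_bits not in (8, 16, 32, 64, 128):
        raise ValueError(f"unsupported element width: {element_bits}")
    # ZA0.B (1 tile), ZA0-1.H (2), ZA0-3.S (4), ZA0-7.D (8), ZA0-15.Q (16)
    return element_bits // 8


tiles_i8 = num_za_tiles(8)
tiles_i32 = num_za_tiles(32)
```

With a single `i8` tile there is nothing to choose between, which is why the tile-matching mechanism only becomes necessary for wider element types.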

In order to lower the above output to LLVM, use
  * `-convert-vector-to-llvm="enable-arm-sme"`.

[1] https://github.com/openxla/iree/issues/14294

Reviewed By: WanderAway

Differential Revision: https://reviews.llvm.org/D154867

2023-07-18 08:04:59 +00:00

.ci

…

.github/workflows

…

bolt

[BOLT][Utils] Add dot2html module entry point

2023-07-17 10:08:57 -07:00

clang

[clang][analyzer] Add all success/failure messages to StdLibraryFunctionsChecker.

2023-07-18 09:29:15 +02:00

clang-tools-extra

[clang-tidy] Model noexcept more properly in bugprone-exception-escape

2023-07-17 15:59:34 +00:00

cmake

[CMake] Switch the CMP0091 policy (MSVC_RUNTIME_LIBRARY) to the new behaviour

2023-07-17 09:59:05 +03:00

compiler-rt

[sanitizer][asan][win] Intercept _strdup on Windows instead of strdup

2023-07-17 21:06:50 -07:00

cross-project-tests

…

flang

[flang][openacc][NFC] Add test for scalar allocatable and pointer reduction

2023-07-17 13:27:43 -07:00

libc

[AMDGPU] Add targets gfx1150 and gfx1151

2023-07-17 13:06:12 +01:00

libclc

…

libcxx

[libc++] Remove broken self test for the libc++ Lit format

2023-07-17 18:35:10 -04:00

libcxxabi

[libc++] Remove internal "build-with-external-thread-library" configuration

2023-07-17 09:32:36 -04:00

libunwind

…

lld

[lld][test] Remove unused features

2023-07-18 00:28:17 -07:00

lldb

[lldb][NFCI] Avoid construction of temporary std::strings in RegisterValue

2023-07-17 12:53:34 -07:00

llvm

[LoongArch][NFC] Consistently derive instruction mnemonics from TableGen record names

2023-07-18 15:50:20 +08:00

llvm-libgcc

…

mlir

[mlir][ArmSME] Introduce new lowering layer (Vector -> ArmSME)

2023-07-18 08:04:59 +00:00

openmp

[AMDGPU] Add targets gfx1150 and gfx1151

2023-07-17 13:06:12 +01:00

polly

…

pstl

…

runtimes

…

third-party

…

utils

[bazel] fix build of ArithUtils

2023-07-18 09:20:13 +02:00

.arcconfig

…

.arclint

…

.clang-format

…

.clang-tidy

…

.git-blame-ignore-revs

…

.gitignore

…

.mailmap

…

CONTRIBUTING.md

…

LICENSE.TXT

…

README.md

…

SECURITY.md

…

README.md

The LLVM Compiler Infrastructure

Welcome to the LLVM project!

This repository contains the source code for LLVM, a toolkit for the construction of highly optimized compilers, optimizers, and run-time environments.

The LLVM project has multiple components. The core of the project is itself called "LLVM". This contains all of the tools, libraries, and header files needed to process intermediate representations and convert them into object files. Tools include an assembler, disassembler, bitcode analyzer, and bitcode optimizer.

C-like languages use the Clang frontend. This component compiles C, C++, Objective-C, and Objective-C++ code into LLVM bitcode -- and from there into object files, using LLVM.

Other components include: the libc++ C++ standard library, the LLD linker, and more.

Getting the Source Code and Building LLVM

Consult the Getting Started with LLVM page for information on building and running LLVM.

For information on how to contribute to the LLVM project, please take a look at the Contributing to LLVM guide.

Getting in touch

Join the LLVM Discourse forums, Discord chat, or #llvm IRC channel on OFTC.

The LLVM project has adopted a code of conduct for participants to all modes of communication within the project.

Languages

LLVM 41.5%

C++ 31.7%

C 13%

Assembly 9.1%

MLIR 1.5%

Other 2.8%