llvm/mlir/include at eac3788b4e2dd305a84f09ebc6fffc050ee84a68 - llvm

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-13 11:02:04 +08:00

Files

Hanchenng Wu a6d1a52b8d [MLIR] Reuse AsmState to enable fast generate-runtime-verification pass; add location-only pass option (#160331 )

The pass generate-runtime-verification generates additional runtime op
verification checks.

Currently, the pass is extremely expensive. For example, with a
mobilenet v2 ssd network(converted to mlir), running this pass alone in
debug mode will take 30 minutes. The same observation has been made to
other networks as small as 5 Mb.

The culprit is this line "op->print(stream, flags);" in function
"RuntimeVerifiableOpInterface::generateErrorMessage" in File
mlir/lib/Interfaces/RuntimeVerifiableOpInterface.cpp.

As we are printing the op with all the names of the operands in the
middle end, we are constructing a new SSANameState for each
op->print(...) call. Thus, we are doing a new SSA analysis for each
error message printed.

Perf profiling shows that 98% percent of the time is spent in the
constructor of SSANameState.

This change refactored the message generator. We use a toplevel
AsmState, and reuse it with all the op-print(stream, asmState). With a
release build, this change reduces the pass exeuction time from ~160
seconds to 0.3 seconds on my machine.

This change also adds verbose options to generate-runtime-verification
pass.
verbose 0: print only source location with error message.
verbose 1: print the full op, including the name of the operands.

2025-10-08 11:48:34 +01:00

mlir

[MLIR] Reuse AsmState to enable fast generate-runtime-verification pass; add location-only pass option (#160331 )

2025-10-08 11:48:34 +01:00

mlir-c

[MLIR][Python] Expose the insertion point of pattern rewriter (#161001 )

2025-10-05 11:12:11 +08:00