
Integrating LLVM optimizations with wasm-opt #7634


Draft
wants to merge 1 commit into base: main

Conversation

xuruiyang2002
Contributor

This draft is about leveraging LLVM's optimizer (llvm opt) to benefit wasm-opt.

Languages like C/C++ and Rust compile through LLVM and already benefit from its optimizations. However, not all languages do (GC languages like Java, Kotlin, Dart, etc. do not come from LLVM). wasm-opt aims to take the role of the toolchain optimizer, but because it works at the AST level it cannot perform some optimizations. For example, wasm-opt cannot eliminate the redundant store (the first one) here:

    ;; Store 1 into memory at address 0:
    (i32.const 0)     
    (i32.const 1)     
    (i32.store)       
    
    ;; Store 0 into memory at address 0:
    (i32.const 0)     
    (i32.const 0)     
    (i32.store)       
  
    ;; Load the value from memory address 0 and return it:
    (i32.const 0)     
    (i32.load)

The general idea is: translate Binaryen IR (from LLVM-compatible code) into LLVM IR, let llvm-opt optimize it, and then get back the optimized result. The most closely related work is Speeding up SMT Solving via Compiler Optimization (FSE 2023), which uses a similar approach by translating SMT queries into LLVM IR to benefit from LLVM optimizations.
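
For concreteness, here is a minimal sketch (not code from this PR) of the "let llvm-opt optimize it" step using LLVM's new pass manager; the function name optimizeTranslatedModule and the choice of the O2 pipeline are my assumptions. O2 includes DSE and GVN, which would delete the dead first store in the example above and forward the remaining store to the load:

    #include "llvm/Analysis/CGSCCPassManager.h"
    #include "llvm/Analysis/LoopAnalysisManager.h"
    #include "llvm/IR/Module.h"
    #include "llvm/IR/PassManager.h"
    #include "llvm/Passes/PassBuilder.h"

    // Run LLVM's default O2 pipeline over a module produced by a
    // (hypothetical) Binaryen IR => LLVM IR translation.
    void optimizeTranslatedModule(llvm::Module& M) {
      llvm::LoopAnalysisManager LAM;
      llvm::FunctionAnalysisManager FAM;
      llvm::CGSCCAnalysisManager CGAM;
      llvm::ModuleAnalysisManager MAM;

      llvm::PassBuilder PB;
      PB.registerModuleAnalyses(MAM);
      PB.registerCGSCCAnalyses(CGAM);
      PB.registerFunctionAnalyses(FAM);
      PB.registerLoopAnalyses(LAM);
      PB.crossRegisterProxies(LAM, FAM, CGAM, MAM);

      // O2 contains DSE (removes the overwritten store) and GVN (forwards the
      // stored 0 to the following load).
      llvm::ModulePassManager MPM =
          PB.buildPerModuleDefaultPipeline(llvm::OptimizationLevel::O2);
      MPM.run(M, MAM);
    }

The remaining (and harder) parts are the two translation steps around this call, which are what this draft is about.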

An earlier prototype implementing this idea can be found in this PR: main...kripken:binaryen:llvm. That experiment used existing tools like wabt, emcc, and llvm-opt, but a direct 1-to-1 translation may be better.

(I'll continue this if time allows)

@kripken
Member

kripken commented Jun 4, 2025

I think there is a lot of potential here!

Btw, I remembered in #7637 (comment) that our dataflow IR may be useful here, which is SSA-like:

https://github.com/WebAssembly/binaryen/tree/main/src/dataflow

There is a simple pass that uses it:

https://github.com/WebAssembly/binaryen/blob/main/src/passes/DataFlowOpts.cpp

I'm not sure, but an option might be to use the existing Binaryen IR => DataFlow IR, and add DataFlow IR => LLVM IR (and the last part could be simpler since it would be SSA => SSA).


// Create global memory buffer
ArrayType* memType = ArrayType::get(llvmBuilder->getInt8Ty(), totalSize);
GlobalVariable* llvmMem = new GlobalVariable(
Contributor


How would this be rewritten back into, presumably, multi-memory WASM? Or should this pass only work on single-memory WASM?

Contributor Author


I'm still relatively new to compilers and WebAssembly (currently studying both), so please forgive any naivety in my code. This was just a small attempt...

Contributor Author


As a newbie, I’m trying to learn compilers and systems like LLVM and wasm. However, the docs are huge and not very beginner-friendly. Any advice on where and how to start? Thanks!

Value* visitStore(Store* store) {
  // 1. Get the @wasm_memory global variable.
  GlobalVariable* wasmMemory =
    llvmMod->getGlobalVariable("wasm_memory", true);
Contributor


If multiple memories are supported, this would be unsound.
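
One way this might be addressed (a sketch under my own assumptions, not part of this PR): emit one LLVM global per wasm memory and key each load/store lookup by the memory's name, e.g. "wasm_memory." + name, instead of the single hard-coded "wasm_memory". The helper declareMemories and its parameters are illustrative only:

    #include "llvm/IR/Constants.h"
    #include "llvm/IR/DerivedTypes.h"
    #include "llvm/IR/GlobalVariable.h"
    #include "llvm/IR/IRBuilder.h"
    #include "llvm/IR/Module.h"
    #include <cstdint>
    #include <string>
    #include <utility>
    #include <vector>

    // Declare one zero-initialized byte-array global per wasm memory. Loads
    // and stores would then call getGlobalVariable("wasm_memory." + name) for
    // the memory they actually target.
    void declareMemories(
        llvm::Module& M, llvm::IRBuilder<>& B,
        const std::vector<std::pair<std::string, uint64_t>>& memories) {
      for (const auto& [name, byteSize] : memories) {
        auto* memType = llvm::ArrayType::get(B.getInt8Ty(), byteSize);
        new llvm::GlobalVariable(M, memType, /*isConstant=*/false,
                                 llvm::GlobalValue::InternalLinkage,
                                 llvm::ConstantAggregateZero::get(memType),
                                 "wasm_memory." + name);
      }
    }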

Contributor Author


The initial thought would be:

  1. For the MVP, we just need to map each instruction while carefully handling semantics gaps such as UB in LLVM, inconsistencies in the FP spec, and other low-level differences between the source and target semantics (one such gap is sketched below).
  2. For non-MVP features (GC), we could perform code slicing to collect the non-GC parts (which are LLVM-optimizable), transpile and send them to the LLVM optimizer, then retrieve the optimized code and "stitch" it back in.

So, in my humble view, it's better to start with the MVP first.
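
To illustrate the kind of semantics gap item 1 refers to, here is a hedged sketch (my example, not PR code): wasm's i32.div_s traps on division by zero and on INT32_MIN / -1, while LLVM's sdiv is undefined behavior in both cases, so a direct 1-to-1 mapping needs explicit guards. The helper name visitDivS is hypothetical:

    #include "llvm/IR/BasicBlock.h"
    #include "llvm/IR/Function.h"
    #include "llvm/IR/IRBuilder.h"
    #include "llvm/IR/Intrinsics.h"
    #include "llvm/IR/Module.h"
    #include <cstdint>

    // Translate i32.div_s: branch to a trapping block when wasm semantics
    // require a trap, and only then emit the plain sdiv.
    llvm::Value* visitDivS(llvm::IRBuilder<>& B, llvm::Value* lhs, llvm::Value* rhs) {
      llvm::Function* F = B.GetInsertBlock()->getParent();
      llvm::LLVMContext& C = B.getContext();

      // mustTrap = (rhs == 0) || (lhs == INT32_MIN && rhs == -1)
      llvm::Value* isZero = B.CreateICmpEQ(rhs, B.getInt32(0));
      llvm::Value* overflows =
          B.CreateAnd(B.CreateICmpEQ(lhs, B.getInt32(INT32_MIN)),
                      B.CreateICmpEQ(rhs, B.getInt32(-1)));
      llvm::Value* mustTrap = B.CreateOr(isZero, overflows);

      llvm::BasicBlock* trapBB = llvm::BasicBlock::Create(C, "div.trap", F);
      llvm::BasicBlock* contBB = llvm::BasicBlock::Create(C, "div.cont", F);
      B.CreateCondBr(mustTrap, trapBB, contBB);

      // Model the wasm trap as llvm.trap followed by unreachable.
      B.SetInsertPoint(trapBB);
      B.CreateCall(
          llvm::Intrinsic::getDeclaration(F->getParent(), llvm::Intrinsic::trap));
      B.CreateUnreachable();

      B.SetInsertPoint(contBB);
      return B.CreateSDiv(lhs, rhs);
    }

Similar care is needed for i32.rem_s and the trapping float-to-int truncations.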


Value* visitConst(Const* c) {
  assert(c->type.isBasic());
  switch (c->type.getBasic()) {
Contributor


Note that LLVM supports Wasm's externref.

@kripken
Member

kripken commented Jun 12, 2025

There is now a proposal to add wasm input to upstream LLVM:

https://discourse.llvm.org/t/rfc-mlir-dialect-for-webassembly/86758

If accepted, that could be very useful here, as it would let some wasm modules be read by LLVM, optimized, and re-emitted as Wasm.

They will never support all of wasm (like GC, I assume), but we could do work on our side to "filter" out the parts they can't handle, let them optimize, and then re-apply the filtered parts, something like that. That might still be a lot of work for us, but a lot less than otherwise.

@xuruiyang2002
Contributor Author

Thanks for sharing, and I'll read it carefully.
