Toolchain support for multiple memories #45

titzer · 2023-05-03T22:13:16Z

Does anyone know the current status of multi-memory support in toolchains, e.g. LLVM? After a cursory search of LLVM commits, I didn't turn up anything.

dschuff · 2023-05-03T22:27:29Z

Support was recently added to Binaryen, but hasn't been added to LLVM yet.

chenzhuofu · 2023-05-07T07:37:40Z

I'm curious if there is any example of "high level language being compiled to WebAssembly with multi-memory support"?
I searched for a long time but still haven't found any. :(

penzn · 2023-05-08T16:34:37Z

(sorry, misread the above comment)

> Support was recently added to Binaryen, but hasn't been added to LLVM yet.

Is anybody working on LLVM support? LLVM IR has address spaces, that can be used to represent multiple memories.

tlively · 2023-05-08T16:52:38Z

No, nobody is currently working on multimemory in LLVM, although Igalia's work adding support for tables is very similar to what would need to happen to support multiple memories as well.

dschuff · 2023-05-08T17:06:15Z

> I'm curious if there is any example of "high level language being compiled to WebAssembly with multi-memory support"? I searched for a long time but still haven't found any. :(

I don't know of any either, probably because of the current lack of implementations. Hopefully we will soon break the chicken-and-egg problem. Did you have a particular thing you wanted to learn from an example?

chenzhuofu · 2023-05-08T17:17:55Z

> > I'm curious if there is any example of "high level language being compiled to WebAssembly with multi-memory support"? I searched for a long time but still haven't found any. :(
>
> I don't know of any either, probably because of the current lack of implementations. Hopefully we will soon break the chicken-and-egg problem. Did you have a particular thing you wanted to learn from an example?

I have come up with a technique that leverages WebAssembly's multi-memory support and now need some test cases.
So I took an application written in a high-level language and wanted to rewrite it to work with multiple memories, then compile it to Wasm.

That high-level coding approach is what I wanted to learn.

penzn · 2023-05-08T17:18:09Z

> Igalia's work adding support for tables is very similar to what would need to happen to support multiple memories as well.

What does this work look like? LLVM has the addrspace attribute, which in some cases has exactly the same meaning as multiple memories (think OpenCL before version 2), though I've also read about using it to support GC objects.

titzer · 2023-05-08T17:20:59Z

One of the more compelling use cases I've stumbled on is virtualizing interfaces that use memory. E.g. implementing a Wasm module that has an imported memory from the "user", which it may read and/or write, and then a private memory that is used to store additional internal state and possibly communicate with other modules.

AFAICT it would be possible to write such a module in C with address space annotations.
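
A minimal sketch of what such a module could look like in C, assuming Clang's address_space attribute is what tells the two memories apart. The macro names USER_MEM and PRIVATE_MEM, the address-space numbers 1 and 2, the opt-in switch SKETCH_USE_ADDRESS_SPACES, and all function names are hypothetical. No toolchain is known to lower address spaces to Wasm memories, so by default the macros expand to nothing and both "memories" are ordinary memory.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical mapping: address space 1 = memory imported from the user,
   address space 2 = the module's private memory. Opt-in only, because this
   mapping is an assumption and not an existing toolchain feature. */
#if defined(__clang__) && defined(SKETCH_USE_ADDRESS_SPACES)
#define USER_MEM    __attribute__((address_space(1)))
#define PRIVATE_MEM __attribute__((address_space(2)))
#else
#define USER_MEM
#define PRIVATE_MEM
#endif

/* Internal state that would live in the private memory. */
static PRIVATE_MEM unsigned long call_count;
static PRIVATE_MEM long running_total;

/* Reads n ints from the user's memory and records statistics privately. */
long sum_user_buffer(const USER_MEM int *buf, size_t n) {
  long sum = 0;
  for (size_t i = 0; i < n; i++) {
    sum += buf[i];
  }
  call_count += 1;
  running_total += sum;
  return sum;
}

unsigned long calls_so_far(void) { return call_count; }
long total_so_far(void) { return running_total; }

/* Usage: an ordinary array stands in for the user's memory. */
void example(void) {
  int data[3] = {1, 2, 3};
  unsigned long before = calls_so_far();
  assert(sum_user_buffer((const USER_MEM int *)data, 3) == 6);
  assert(calls_so_far() == before + 1);
}
```

The point of the annotations is that the pointer type alone says which memory a load or store targets, so the user-facing buffer and the private state cannot be mixed up silently.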

chenzhuofu · 2023-05-08T17:24:57Z

Yes, that's what I want to learn.
How do these address space annotations work?

> One of the more compelling use cases I've stumbled on is virtualizing interfaces that use memory. E.g. implementing a Wasm module that has an imported memory from the "user", which it may read and/or write, and then a private memory that is used to store additional internal state and possibly communicate with other modules.
>
> AFAICT it would be possible to write such a module in C with address space annotations.

penzn · 2023-05-08T17:38:29Z

> How do these address space annotations work?

With Clang and C/C++ it is __attribute__((address_space(N))) before the type, though the N for the purposes of multiple memories needs to be a constant.

Example:

int incr_from_mem3(__attribute__((address_space(3))) int * ptr) {
  return (*ptr) + 1;
}

(Edit) Even though this would lead to addrspace in the LLVM IR, the Wasm backend would quietly ignore it at the moment, though it should not be too hard to enable that.

chenzhuofu · 2023-05-08T19:13:37Z

> > How do these address space annotations work?
>
> With Clang and C/C++ it is __attribute__((address_space(N))) before the type, though the N for the purposes of multiple memories needs to be a constant.
>
> Example:
>
> int incr_from_mem3(__attribute__((address_space(3))) int * ptr) {
>   return (*ptr) + 1;
> }
>
> (Edit) Even though this would lead to addrspace in the LLVM IR, the Wasm backend would quietly ignore it at the moment, though it should not be too hard to enable that.

I see, thanks for explanation.

tlively · 2023-05-08T20:36:00Z

Since address spaces need to be statically allocated by the LLVM backend for WebAssembly, it would not be scalable to try to use them to support multiple memories directly. Tables are modeled in LLVM IR as global arrays in a special address space so that an arbitrary number of them may be created. The Wasm object file format used with LLVM was also extended with additional relocation types for tables. The same patterns would also work well for modeling multi-memory.

dschuff · 2023-05-08T22:21:55Z

I actually find that take somewhat surprising; given that address spaces also need to be statically allocated in the wasm module, requiring the same static allocation at the LLVM IR level seems like it should scale exactly as well in LLVM as it would in wasm itself? Tables are different in the sense that there's not really any obvious analog in the IR already (not just for tables, but also for the references they contain).

penzn · 2023-05-09T01:24:07Z

I am going to second what @dschuff said: aren't memories statically declared? Why would they need the same dynamic treatment that tables get?

tlively · 2023-05-09T15:32:49Z

By "statically allocated in the backend," I mean statically allocated when LLVM is compiled, not when the user program is compiled. So if you had a 1:1 mapping between address spaces and memories, then when you compile LLVM, you would have to determine at that point the maximum number of memories an LLVM IR module could reference. In contrast, the scheme used for tables allows user programs to use an arbitrary number of tables.

titzer · 2023-05-09T15:56:46Z

Is this discussion just about the LLVM internal representation? At the C or C++ level these would still be address space annotations on pointer types?

penzn · 2023-05-09T16:38:07Z

> So if you had a 1:1 mapping between address spaces and memories, then when you compile LLVM, you would have to determine at that point the maximum number of memories an LLVM IR module could reference.

There is a hard limit on the number of memories; the memory index is one byte, I think.

tlively · 2023-05-09T16:50:53Z

> Is this discussion just about the LLVM internal representation? At the C or C++ level these would still be address space annotations on pointer types?

At the C or C++ level these would most likely be new annotations like __attribute__((wasm_memory)), since clang would also have to check a bunch of semantic restrictions (such as ensuring that the arrays are not address-taken) just like it does for tables.

> There is a hard limit on the number of memories; the memory index is one byte, I think.

No, just like all other indices in Wasm, memory indices are LEB128 values.
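
To illustrate why a LEB128 index is not capped at one byte, here is a minimal sketch of an unsigned LEB128 encoder in C, assuming memory indices are 32-bit values like other Wasm indices. The function name encode_uleb128 is made up for this sketch and is not taken from any toolchain.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Encodes value as unsigned LEB128 into out and returns the number of
   bytes written. A 32-bit value needs at most 5 bytes. */
size_t encode_uleb128(uint32_t value, uint8_t *out) {
  size_t n = 0;
  do {
    uint8_t byte = (uint8_t)(value & 0x7f); /* low 7 bits of the value */
    value >>= 7;
    if (value != 0) {
      byte |= 0x80; /* continuation bit: more bytes follow */
    }
    out[n++] = byte;
  } while (value != 0);
  return n;
}

/* Usage: small indices take one byte, larger ones simply take more. */
void example(void) {
  uint8_t buf[5];
  assert(encode_uleb128(3, buf) == 1 && buf[0] == 0x03);
  assert(encode_uleb128(300, buf) == 2 && buf[0] == 0xAC && buf[1] == 0x02);
}
```

Indices below 128 fit in a single byte, which may be where the one-byte impression comes from.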

titzer · 2023-05-09T17:46:01Z

> At the C or C++ level these would most likely be new annotations like __attribute__((wasm_memory)), since clang would also have to check a bunch of semantic restrictions (such as ensuring that the arrays are not address-taken) just like it does for tables.

Oh, so you mean they would be globally-declared (non-address taken) arrays into which the program would index with integers?

tlively · 2023-05-09T21:00:39Z

Yes, exactly.
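
A rough sketch of that programming model in plain C, under the assumption that a memory is declared as a global byte array and only ever accessed by integer offset. The __attribute__((wasm_memory)) annotation mentioned above is a suggestion from this thread, not an existing Clang attribute, so it appears only in a comment; scratch_memory, SCRATCH_SIZE, and the accessor names are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* Stand-in for a second Wasm memory. Under the scheme discussed above this
   declaration might carry the hypothetical __attribute__((wasm_memory));
   here it is an ordinary global array so the sketch runs anywhere. */
#define SCRATCH_SIZE 65536u /* one 64 KiB Wasm page */
static uint8_t scratch_memory[SCRATCH_SIZE];

/* Stores a 32-bit value at an integer offset, little-endian as in Wasm.
   Returns 0 on success, -1 if the access would be out of bounds. */
int scratch_store_i32(uint32_t offset, int32_t value) {
  if (offset > SCRATCH_SIZE - 4u) {
    return -1;
  }
  uint32_t bits = (uint32_t)value;
  for (uint32_t i = 0; i < 4; i++) {
    scratch_memory[offset + i] = (uint8_t)(bits >> (8 * i));
  }
  return 0;
}

/* Loads a 32-bit value from an integer offset into *out.
   Returns 0 on success, -1 if the access would be out of bounds. */
int scratch_load_i32(uint32_t offset, int32_t *out) {
  if (offset > SCRATCH_SIZE - 4u) {
    return -1;
  }
  uint32_t bits = 0;
  for (uint32_t i = 0; i < 4; i++) {
    bits |= (uint32_t)scratch_memory[offset + i] << (8 * i);
  }
  *out = (int32_t)bits;
  return 0;
}

/* Usage: the array is only ever indexed; no pointer into it escapes. */
void example(void) {
  int32_t v = 0;
  assert(scratch_store_i32(16, 42) == 0);
  assert(scratch_load_i32(16, &v) == 0 && v == 42);
}
```

Because every access names the array and an integer offset, a compiler could in principle pick the target memory statically at each access site.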

yamt · 2024-06-20T03:11:37Z

> > Is this discussion just about the LLVM internal representation? At the C or C++ level these would still be address space annotations on pointer types?
>
> At the C or C++ level these would most likely be new annotations like __attribute__((wasm_memory)), since clang would also have to check a bunch of semantic restrictions (such as ensuring that the arrays are not address-taken) just like it does for tables.

"Not address-taken" sounds like a very severe restriction for memory, as C/C++ applications usually access memory via pointers.
I suspect it's worse than having a static limit on the number of memories.
Am I missing something?

tlively · 2024-06-20T22:12:21Z

It's definitely a severe restriction compared to what you can do with other constructs in C/C++, but that's ok because a program would only need to use this feature to do something very specific to WebAssembly, and in that case having the source language construct match the underlying construct as closely as possible is a good thing.

titzer mentioned this issue Jul 7, 2023: Multi-memory support in AssemblyScript? AssemblyScript/assemblyscript#2716 (Open)

MendyBerger mentioned this issue Apr 1, 2024: [WASM] Ship multi-memory mdn/content#32777 (Closed, 10 tasks)

TianlongLiang mentioned this issue May 7, 2024: [RFC] Basic support for Multi-Memory proposal bytecodealliance/wasm-micro-runtime#3381 (Open)
