Memory growth and JS #82

kripken · 2019-08-28T16:35:06Z

It looks like the current wasi libc implements sbrk using the clang builtin to grow, and there isn't a wasi API for growth. I think this may be a problem for a JS embedding (that is, running a wasi program with JS implementing the wasi APIs etc.), as any JS views on the buffer used in the wasm Memory will become invalid - they don't resize automatically, and must be manually recreated. In particular I think the current Web polyfill for wasi probably doesn't fully work with memory growth.
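A minimal sketch of the invalidation described above, assuming a non-shared memory and using `memory.grow()` from JS to stand in for a growth that wasm code would normally trigger (the variable names are illustrative only):

```javascript
// One 64 KiB page to start, growable up to 4 pages.
const memory = new WebAssembly.Memory({ initial: 1, maximum: 4 });

// A view created before the growth.
const staleView = new Uint8Array(memory.buffer);
staleView[0] = 42;
const lengthBefore = staleView.length;

// Growing detaches the old ArrayBuffer; memory.buffer now returns a new one.
memory.grow(1);

// The old view does not resize; it is detached and reports length 0.
const lengthAfter = staleView.length;

// A view has to be recreated by hand to see the (preserved) contents.
const freshView = new Uint8Array(memory.buffer);

console.log(lengthBefore, lengthAfter, freshView.length, freshView[0]);
```

The old view is not resized in place, so any JS glue that cached it before the growth has to notice and rebuild it.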

There is no event callback for when a Memory grows, but even if there were, it wouldn't be enough, just like with pthreads - the event would happen on a later JS event loop iteration, and not when we need it.

One possible solution here would be to add an API to wasi that either does the growth (__wasi_grow_memory?), or that notifies the runtime about the growth (__wasi_notify_memory_growth?).
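As a rough sketch of the notification variant: the module below is hand-assembled for illustration only and is an assumption, not something wasi-libc emits. It imports `env.notify` (a stand-in for the hypothetical `__wasi_notify_memory_growth`) and calls it right after executing `memory.grow`, so the JS side can rebuild its views synchronously.

```javascript
// Hand-assembled demo module (assumption): imports env.notify, exports
// "memory" (1 page, max 4) and "grow", which runs memory.grow 1 then notify.
const bytes = new Uint8Array([
  0x00, 0x61, 0x73, 0x6d, 0x01, 0x00, 0x00, 0x00, // magic, version
  0x01, 0x04, 0x01, 0x60, 0x00, 0x00,             // type 0: () -> ()
  0x02, 0x0e, 0x01,                               // 1 import
  0x03, 0x65, 0x6e, 0x76,                         // "env"
  0x06, 0x6e, 0x6f, 0x74, 0x69, 0x66, 0x79,       // "notify"
  0x00, 0x00,                                     // func, type 0
  0x03, 0x02, 0x01, 0x00,                         // function 1 has type 0
  0x05, 0x04, 0x01, 0x01, 0x01, 0x04,             // memory: min 1, max 4
  0x07, 0x11, 0x02,                               // 2 exports
  0x06, 0x6d, 0x65, 0x6d, 0x6f, 0x72, 0x79, 0x02, 0x00, // "memory" = mem 0
  0x04, 0x67, 0x72, 0x6f, 0x77, 0x00, 0x01,       // "grow" = func 1
  0x0a, 0x0b, 0x01, 0x09, 0x00,                   // code: 1 body, no locals
  0x41, 0x01, 0x40, 0x00, 0x1a,                   // i32.const 1; memory.grow; drop
  0x10, 0x00, 0x0b,                               // call notify; end
]);

let memory;
let heapU8;
let notifications = 0;

const imports = {
  env: {
    // Runs synchronously, while the wasm call is still on the stack.
    notify() {
      notifications++;
      heapU8 = new Uint8Array(memory.buffer);
    },
  },
};

const instance = new WebAssembly.Instance(new WebAssembly.Module(bytes), imports);
memory = instance.exports.memory;
heapU8 = new Uint8Array(memory.buffer);

instance.exports.grow();
console.log(notifications, heapU8.length);
```

With this shape the views are already fresh by the time `grow()` returns, but it only works if every `memory.grow` in the module is followed by the notification call.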


sbc100 · 2019-08-28T16:39:06Z

The solution I used in my tiny wasi.js, which we use on the waterfall, is to call checkHeap at the start of each syscall that needs memory: https://github.com/WebAssembly/waterfall/blob/fe3feca48ae596780282d9fc36f876b9a3131688/src/wasi.js#L76
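For readers who don't want to follow the link, here is a minimal sketch of that kind of check. It is not the linked code; `checkHeap`, `heapU8`, and `readBytes` here are illustrative names, and `memory.grow()` stands in for wasm growing its own memory.

```javascript
const memory = new WebAssembly.Memory({ initial: 1, maximum: 4 });

// Cached view that syscall implementations read and write through.
let heapU8 = new Uint8Array(memory.buffer);

// Recreate the cached view if memory.buffer is no longer the ArrayBuffer
// the view was created over (which is the case after any growth).
function checkHeap() {
  if (heapU8.buffer !== memory.buffer) {
    heapU8 = new Uint8Array(memory.buffer);
  }
  return heapU8;
}

// Hypothetical syscall-like import: copies `len` bytes starting at `ptr`.
function readBytes(ptr, len) {
  const heap = checkHeap(); // done at the start of every such call
  return Array.from(heap.subarray(ptr, ptr + len));
}

new Uint8Array(memory.buffer).set([1, 2, 3], 16);
memory.grow(1); // the cached heapU8 is now detached
const bytes = readBytes(16, 3);
console.log(bytes);
```

The check itself is a single identity comparison, but it has to be present in every function that touches memory.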

devsnek · 2019-08-28T16:39:32Z

I do the same thing ^ https://github.com/devsnek/node-wasi/blob/12a0985a46589587facd8d8e161911650ef15f3b/src/index.js#L1282

pchickey · 2019-08-28T17:17:58Z

We even do the equivalent in Lucet - this isn't restricted to just JS embeddings https://github.com/fastly/lucet/blob/master/lucet-runtime/lucet-runtime-internals/src/vmctx.rs#L142

kripken · 2019-08-28T18:15:51Z

It's possible to check at the beginning of every call (into the runtime and out to wasm), yeah - emscripten does that for pthreads + memory growth. But it's pretty slow... I think it would be nice to have a proper API for this.

Another option is for the VM to instrument the wasm before running it, replacing every memory growth instruction with a notification. But that has other downsides...

devsnek · 2019-08-28T18:28:58Z

I don't think this is an inherent limitation of wasi or wasm, but it might be a limitation of the js api. A native implementation can just have an api where native functions are provided the current memory. Perhaps something worth requesting on the JS api is a "memory update" callback.

kripken · 2019-08-28T18:39:51Z

@devsnek - I think a normal JS event callback would happen in a later frame, which is too late (see note earlier). Maybe a nonstandard callback that happens synchronously could work? But I suspect that would be controversial on the Web.

devsnek · 2019-08-28T18:44:13Z

@kripken yes i was imagining a function passed directly to instantiation, so it would be called synchronously.

WebAssembly.instantiate(module, imports, {
  onMemoryUpdate: () => {},
});

I think we could add the callback in https://webassembly.github.io/spec/js-api/index.html#reset-the-memory-buffer

alexcrichton · 2019-08-28T19:56:31Z

Memory growth doesn't necessarily always happen in the standard library (e.g. wasi-libc or something like that); since it's just an instruction, any function (or rather any particular library) can execute it. As a result, if we were to do something like this, then for a solution like __wasi_grow_memory there'd need to be a postprocessing pass to wire up memory.grow instructions to that intrinsic, and with __wasi_notify_memory_growth there'd still have to be a postprocessing pass to inject the call after memory.grow instructions. Either way though it seems weird to have to postprocess a module like this to me?

I'd personally be more in favor of saying "the JS side has to check" and do what it does today with comparing ArrayBuffer instances before handing out views.

kripken · 2019-08-28T22:06:09Z

@alexcrichton I'm skeptical of the JS side having to check because it means every single location that calls into wasm needs extra work, and if you forget one you get breakage later. That would make it much more clumsy to use wasi binaries from JS...

@devsnek I'm also skeptical of a synchronous callback - is that common on the Web? Offhand the only example I can think of is sorting, and it's not even a real callback there, since the entire thing happens immediately...

guybedford · 2019-08-28T22:08:40Z

@kripken script load callbacks on the web are synchronous.

kripken · 2019-08-28T22:20:37Z

@guybedford Interesting, thanks, I guess I'm not enough of a web dev to know this stuff :) Reading this page I don't see anything clear enough on it being synchronous, or maybe I don't understand what you mean by "synchronous" here? Or am I looking in the wrong place?

devsnek · 2019-08-28T22:34:44Z

@kripken splitting your question into two parts:

EventTarget (web) and EventEmitter (node) dispatch events synchronously, but the calls that dispatch those events are generally queued via the event loop. The event loop is generally used because it's reacting to things that happen while js is running (users can click buttons even when js is running), and both node and the web have a rule about state not changing observably in the js thread while js is running.

the wasm spec doesn't assume anything about the environment besides it being javascript, so we can't even use EventTarget or EventEmitter anyway, and so i think any precedent using them might bring is moot.

If the call stack is already in js land, i don't think it matters that much whether the callback is async or not.
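A small sketch of the distinction, assuming an environment with the global `EventTarget` and `Event` (browsers and recent Node versions). The event name "grow" is made up for the demo; no such event exists for Memory.

```javascript
const order = [];
const target = new EventTarget();
target.addEventListener("grow", () => order.push("listener"));

// Work queued via the event loop only runs after the current JS finishes.
setTimeout(() => order.push("queued task"), 0);

order.push("before dispatch");
target.dispatchEvent(new Event("grow")); // listener runs right here
order.push("after dispatch");

// Snapshot taken before the event loop gets a chance to run the timeout.
const syncOrder = order.slice();
console.log(syncOrder);
```

The listener runs inside `dispatchEvent`, while the timeout callback only runs once the current script has returned to the event loop.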

sbc100 · 2019-08-28T23:07:58Z

@kripken are you worried about the cost of the if (mem.buffer) at the start of each JS syscall? Or are you worried about all the other JS functions that emscripten users might want to expose to wasm?

If it's the latter then presumably we can wrap all JS functions that we export to wasm in a helper that does this? It would be useful to have that ability anyway for other things such as tracing that wasm boundary.

kripken · 2019-08-28T23:17:55Z

Thanks @devsnek! It does sound like that may be a good option then. I opened WebAssembly/design#1296 , please let me know if I wrote this up ok.

@sbc100 yeah, but not just syscalls - any time you enter JS, if wasm might have run, you need to do that, and any time right after you call into wasm and return from there.

So if someone loads a wasi wasm file in JS, and interleaves calling exports from there with JS looking at the memory, bugs can easily happen. Like imagine box2d.wasm,

jsGlue.addObject(); // JS looking at views
box2D.calculatePhysics(); // wasm runs
jsGlue.readWorldState(); // JS looking at views

For that to be correct, we'd need to check if memory grew right after that wasm call. In other words, users need to be very careful...

sbc100 · 2019-08-28T23:23:55Z

@kripken could we not do that automatically, since we control the list of functions we give to wasm and the functions we get back?

It would mean that the functions we expose to user JS are not the actual wasm functions but wrappers. And the JS functions we give to wasm would also be wrappers.

This is obviously overhead, I'm just not sure how much given that the crossing from JS to wasm presumably already has a fair amount.
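A rough sketch of the export half of that idea. The module is hand-assembled for the demo (an assumption, not output of any toolchain): it exports a 1-page memory and a `grow` function that executes `memory.grow 1`. `wrapExport` and `refreshViews` are hypothetical helper names.

```javascript
// Hand-assembled demo module: exports "memory" (min 1, max 4 pages) and
// "grow", a function whose body is: i32.const 1; memory.grow; drop.
const bytes = new Uint8Array([
  0x00, 0x61, 0x73, 0x6d, 0x01, 0x00, 0x00, 0x00, // magic, version
  0x01, 0x04, 0x01, 0x60, 0x00, 0x00,             // type 0: () -> ()
  0x03, 0x02, 0x01, 0x00,                         // function 0 has type 0
  0x05, 0x04, 0x01, 0x01, 0x01, 0x04,             // memory: min 1, max 4
  0x07, 0x11, 0x02,                               // 2 exports
  0x06, 0x6d, 0x65, 0x6d, 0x6f, 0x72, 0x79, 0x02, 0x00, // "memory" = mem 0
  0x04, 0x67, 0x72, 0x6f, 0x77, 0x00, 0x00,       // "grow" = func 0
  0x0a, 0x09, 0x01, 0x07, 0x00,                   // code: 1 body, no locals
  0x41, 0x01, 0x40, 0x00, 0x1a, 0x0b,             // i32.const 1; memory.grow; drop; end
]);
const instance = new WebAssembly.Instance(new WebAssembly.Module(bytes), {});
const { memory } = instance.exports;

let heapU8 = new Uint8Array(memory.buffer);
let refreshes = 0;

// Rebuild the cached view only if the underlying buffer was replaced.
function refreshViews() {
  if (heapU8.buffer !== memory.buffer) {
    heapU8 = new Uint8Array(memory.buffer);
    refreshes++;
  }
}

// Wrap an export so the views are checked whenever wasm returns to JS.
function wrapExport(fn) {
  return (...args) => {
    try {
      return fn(...args);
    } finally {
      refreshViews();
    }
  };
}

const grow = wrapExport(instance.exports.grow);
grow();
console.log(heapU8.length, refreshes);
```

Wrapped imports would be the mirror image, calling `refreshViews()` before running the JS body. User code would then only ever see the wrappers.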

kripken · 2019-08-28T23:30:50Z

@sbc100 Yes, some automatic instrumentation is possible here. But it means that every user that uses wasi in JS must remember to do that, and do it properly - this isn't just for emscripten users!

sbc100 · 2019-08-28T23:37:52Z

Emscripten is a little different in that its users tend to expose arbitrary functions to their wasm programs. In the WASI world the idea is that the program will only be exposed to the WASI syscall APIs, not arbitrary host functions.

I'm not saying the current situation is ideal. It's certainly an overhead for JS embedders that they have to handle this, but hopefully it's something that can be done at the embedder level, not something that an individual developer or app will need to care about.

kripken · 2019-08-28T23:42:19Z

> In the WASI world the idea is that the program will only be exposed to the WASI syscall APIs, not arbitrary host functions.

Will wasi programs not want to provide exports that can be called from JS?

sunfishcode · 2019-08-28T23:48:04Z

At the moment, all WASI programs are Commands, which means they just have a _start function that needs to get called. We're working on establishing Reactors and Libraries as two additional forms of WASI programs, which would support calls from the outside, but at the moment those concepts are still being developed.

devsnek · 2019-08-29T01:27:54Z

Another solution might be interface types. If wasi apis passed live memory views instead of pointers, this wouldn't be a problem anymore.

skoppe · 2019-10-23T18:45:27Z

> There is no event callback for when a Memory grows

I have had the same problem in skoppe/spasm and - like everyone else - ended up using a check at the beginning of each JS function.

In my case I was statically linking imgui (compiled to wasm with wasi-libc) with a webgl backend written in Dlang. Both the D program and wasi-libc were issuing calls to the memory grow intrinsic, at which point I realised I couldn't intercept calls to the grow intrinsic in the D program, since it would miss those issued from wasi-libc.

So I ended up injecting memory checks in each js function. It didn't feel right when I wrote it, and it still doesn't.

~~While I understand that a reallocation invalidates pointers, I am of the opinion that the wasm runtime ought to fix that pointer after a memory grow.~~

To check whether the typed js array is still valid I check the length property, which equals 0 when the memory is invalid. This suggests to me there is already some bounds checking and memory validation code whenever JS accesses the typed array. If that is the case, having the wasm runtime update the typed array when a memory grow happens should cost very little.

I stand corrected. That is not what is happening. I now favor kripken's proposal to add load/store functions on the WebAssembly.Memory object, as outlined in https://gist.github.com/kripken/949eab99b7bc34f67c12140814d2b595.
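To give a feel for what accessor-style access looks like, here is a userland approximation. It is not the API from the gist and not part of the JS API; `loadU32` and `storeU32` are made-up names, and each call simply builds a fresh DataView over the current `memory.buffer`, so there is no cached view that can go stale.

```javascript
const memory = new WebAssembly.Memory({ initial: 1, maximum: 4 });

// Hypothetical accessors: always go through the current memory.buffer.
const mem = {
  loadU32: (ptr) => new DataView(memory.buffer).getUint32(ptr, true),
  storeU32: (ptr, value) =>
    new DataView(memory.buffer).setUint32(ptr, value, true),
};

mem.storeU32(8, 0xdeadbeef);
memory.grow(1); // stands in for wasm growing its memory
const value = mem.loadU32(8);
console.log(value.toString(16));
```

Allocating a DataView per access is only for illustration; the appeal of having such functions on the Memory object itself would be avoiding both the stale-view problem and this kind of overhead.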

linclark · 2020-12-16T22:00:31Z

This has been filed as a core Wasm issue, and it makes more sense to handle it at the Wasm level than WASI, so I'll go ahead and close this one out.

kripken mentioned this issue Aug 28, 2019

Synchronous JS API callback for memory growth? WebAssembly/design#1296

Open

kripken mentioned this issue Sep 23, 2019

Emscripten & WASI & POSIX emscripten-core/emscripten#9479

Closed

sunfishcode added the discussion label Sep 26, 2019

kripken mentioned this issue Oct 7, 2019

Undo assertion on STANDALONE_WASM not working with memory growth emscripten-core/emscripten#9588

Merged

devsnek mentioned this issue Mar 5, 2020

Limitations of start function with exported memory WebAssembly/design#1160

Open

linclark closed this as completed Dec 16, 2020
