Unable to overwrite worker's onmessage when using MODULARIZE #20192

miloszmaki · 2023-09-06T07:59:40Z

Consider this example, which creates a new worker thread. The worker overwrites its onmessage to handle custom messages. Then the main thread sends a custom message to the worker.

#include <cstdio>
#include <emscripten.h>
#include <thread>

int main() {
  printf("hello from main\n");

  std::thread t{[](){
    EM_ASM({
        console.log("hello from thread");
        const def_onmsg = self.onmessage;
        self.onmessage = (e) => {
                console.log("handling message:", e, e.data, e.data.cmd);
                if (e["data"]["cmd"] != "custom1" && e["data"]["cmd"] != "custom2") {
                        def_onmsg(e);
                }
        };
        // self.onmessage({"data": {"cmd": "custom1", "id": 1}}); // this works always
    });
  }};

  EM_ASM({
  // setTimeout(()=>{ // doesn't help either
    const threads = Module["PThread"].pthreads;
    const id = Object.keys(threads)[0];
    const worker = threads[id];
    worker.postMessage({"cmd": "custom2", "id": 2}); // this fails with MODULARIZE
  // },1000);
  });

  return 0;
}

It works properly ("custom2" message gets handled by the worker) when I build with:
emcc main.cpp -o test.html --std=c++20 -pthread -s PTHREAD_POOL_SIZE_STRICT=0

However, it stops working (the console prints error "worker.js received unknown command custom2") when I build with:
emcc main.cpp -o test.html --std=c++20 -pthread -s PTHREAD_POOL_SIZE_STRICT=0 -s MODULARIZE=1 -s EXPORT_ES6=1

My output of emcc -v is:

emcc (Emscripten gcc/clang-like replacement + linker emulating GNU ld) 3.1.45 (ef3e4e3b044de98e1811546e0bc605c65d3412f4)
clang version 18.0.0 (https://github.com/llvm/llvm-project d1e685df45dc5944b43d2547d0138cd4a3ee4efe)
Target: wasm32-unknown-emscripten
Thread model: posix

Interestingly, it was working with some older versions (at least I checked 3.1.15).

The text was updated successfully, but these errors were encountered:

sbc100 · 2023-09-06T23:23:53Z

I'm not sure we have ever officially support doing this kind of thing. We have had others ask about it in the past though so there are at least few folks who want to do it. If we do want to support it we should add some tests that do this so it doesn't break again.

We should also make some kind of official API for going from a pthread ID to a worker handle in JS I suppose?

miloszmaki · 2023-09-07T06:58:47Z

I see. That would be great if you decide to support this. I think that being able to establish the communication between workers is quite important. Do you know of any workarounds available currently? One I can think of, is to call some C++ function from JS and then do the thread communication on the C++ side. Do you think it's a good way to go?

sbc100 · 2023-09-07T17:15:21Z

I see. That would be great if you decide to support this. I think that being able to establish the communication between workers is quite important. Do you know of any workarounds available currently? One I can think of, is to call some C++ function from JS and then do the thread communication on the C++ side. Do you think it's a good way to go?

Normally pthread communication happens through shared memory. is there some reason you can't do that in this case? The fact that pthreads are implemented as web workers is kind of an implementation detail that ideally you would not rely on. Is there some reason you need to use postMessage rather than shared memory?

sbc100 · 2023-09-07T17:16:16Z

do the thread communication on the C++ side.

Yes, I would recommend doing your thread communication using the pthread APIs (or C++ thread APIs) and shared memory.

miloszmaki · 2023-09-08T10:10:47Z

Is there some reason you need to use postMessage rather than shared memory?

I'm capturing camera frames on the main thread and need to transfer the ownership of a VideoFrame to the worker thread, which handles the frame by processing its data and eventually calling close() on it. Not sure if this is possible using shared memory only.

sbc100 · 2023-09-08T23:17:47Z

Yes, I can see that in that case you would need to use postMessage.

@juj doesn't this sounds like a reasonable use case for postMessage or is there some way this can be done using offscreen canvas perhaps?

sbc100 · 2023-09-08T23:18:36Z

Wait, if you are processing the data using native code then don't you have to copy the data out of the VideoFrame and into linear memory anyway? Why not copy the data out on the main thread?

miloszmaki · 2023-09-09T10:22:04Z

Right, I'm copying the data for further processing in C++. However, I'm concerned about the efficiency and would like to offload work from the main thread if possible. I think I will need to try and profile the approach you suggested to know if it works for me.

juj · 2023-09-12T11:18:34Z

I'm not sure we have ever officially support doing this kind of thing.

I think we should support this. It helps composability with other libraries, and helps users utilize the existing constructs that they might have and know about. Alternatively, we should clearly document that the .onmessage parameter is off limits to end users.

I have tried to do this for the recent libraries, e.g. Wasm Workers already utilize addEventListener instead of .onmessage:

emscripten/src/library_wasm_worker.js

Lines 90 to 99 in 453e83a

    
           // The Wasm Worker runtime is now up, so we can start processing 
        
           // any postMessage function calls that have been received. Drop the temp 
        
           // message handler that queued any pending incoming postMessage function calls ... 
        
           removeEventListener('message', _wasmWorkerAppendToQueue); 
        
           // ... then flush whatever messages we may have already gotten in the queue, 
        
           //     and clear _wasmWorkerDelayedMessageQueue to undefined ... 
        
           _wasmWorkerDelayedMessageQueue = _wasmWorkerDelayedMessageQueue.forEach(_wasmWorkerRunPostMessage); 
        
           // ... and finally register the proper postMessage handler that immediately 
        
           // dispatches incoming function calls without queueing them. 
        
           addEventListener('message', _wasmWorkerRunPostMessage);

even though it is larger code size.

We might code golf the size a bit for this in the future to use .onmessage with a hypothetical WORKERS_ARE_PRIVATE or something along those lines. There are too many tutorials and other articles that showcase modifying .onmessage that it's better to let users have it.

miloszmaki · 2023-09-13T15:31:58Z

Why not copy the data out on the main thread?

I tried this approach suggested by @sbc100 and it turns out to work quite well.

miloszmaki · 2023-09-18T09:24:26Z

However, I'm still a bit concerned with this new approach, since I need to do some kind of synchronization (e.g. mutex) which requires blocking on the main thread (not recommended, as pointed out in the documentation).

miloszmaki · 2024-10-02T07:51:31Z

@sbc100 @juj any updates on that? Is it possible to merge #20235 ?

sbc100 · 2024-10-02T17:34:58Z

I'd be OK with landing #20235 but I think we should add a test/example and some docs about how to do it in the blessed way.

juj · 2024-10-07T18:38:03Z

Sorry, I never quite had time to design the tests for that PR. I'll see if I can get a bit of time slotted for this soon.

miloszmaki mentioned this issue Sep 6, 2023

Unable to overwrite worker's onmessage when using MODULARIZE emscripten-core/emsdk#1276

Closed

juj mentioned this issue Sep 12, 2023

Use addEventListener in a few places instead of registering on to .onmessage #20235

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unable to overwrite worker's onmessage when using MODULARIZE #20192

Unable to overwrite worker's onmessage when using MODULARIZE #20192

miloszmaki commented Sep 6, 2023

sbc100 commented Sep 6, 2023

miloszmaki commented Sep 7, 2023

sbc100 commented Sep 7, 2023

sbc100 commented Sep 7, 2023

miloszmaki commented Sep 8, 2023 •

edited

Loading

sbc100 commented Sep 8, 2023

sbc100 commented Sep 8, 2023

miloszmaki commented Sep 9, 2023

juj commented Sep 12, 2023 •

edited

Loading

miloszmaki commented Sep 13, 2023

miloszmaki commented Sep 18, 2023

miloszmaki commented Oct 2, 2024

sbc100 commented Oct 2, 2024

juj commented Oct 7, 2024

Unable to overwrite worker's onmessage when using MODULARIZE #20192

Unable to overwrite worker's onmessage when using MODULARIZE #20192

Comments

miloszmaki commented Sep 6, 2023

sbc100 commented Sep 6, 2023

miloszmaki commented Sep 7, 2023

sbc100 commented Sep 7, 2023

sbc100 commented Sep 7, 2023

miloszmaki commented Sep 8, 2023 • edited Loading

sbc100 commented Sep 8, 2023

sbc100 commented Sep 8, 2023

miloszmaki commented Sep 9, 2023

juj commented Sep 12, 2023 • edited Loading

miloszmaki commented Sep 13, 2023

miloszmaki commented Sep 18, 2023

miloszmaki commented Oct 2, 2024

sbc100 commented Oct 2, 2024

juj commented Oct 7, 2024

miloszmaki commented Sep 8, 2023 •

edited

Loading

juj commented Sep 12, 2023 •

edited

Loading