Could this run in a worker? #100

oskbor · 2023-01-11T14:56:31Z

Greetings!

Do you know if it would be possible to run most of this library in a worker?
Perhaps by using an OffscreenCanvas.
Perhaps the number[]'s that are passed in could be replaced by TypedArrays since they can be backed by ArrayBuffers that are transferable objects

Just curious if this idea has crossed your mind. This library is great and a central part of an application that we are currently building, hence why I'm thinking of ways to squeeze out even more performance.

best regards Oskar

The text was updated successfully, but these errors were encountered:

flekschas · 2023-01-11T15:31:49Z

100% I was thinking about moving the library entirely into a worker at some point.

Unfortunately, there are few challenges:

My available free time to work on this
OffscreenCanvas is not yet supported in Safari, the Internet Explorer I suppose... However, dropping support for all iOS devices seems too drastic. Hence, all but the renderer would have to be moved to a web worker while the renderer remains in the main frame.

I think the second point can be addressed by moving all but the renderer into a web worker while keeping the renderer in the main browser thread.

Regarding the first point, I'd really love to workerize this library but I can't tell you when I'd have a chance to get around doing it. I am happy to collaborate if you have spare resources.

Also, I'd love to switch to TypedArrays entire and drop the internal use of standard arrays. However, the spatial index library we're using only works with row-based arrays. I.e., arrays of points instead of TypedArrays of x and y coordinates. We'd have to fix this as well. It's not that hard (I did it for another project) but I never gotten around implementing a proper PR for kdbush. Maybe I'll get started with it.

oskbor · 2023-01-12T14:53:25Z

My available free time to work on this

I can relate 😄 Nice that you have had the same thoughts. At this time I cannot justify doing this, but I would be happy to contribute once we encounter large enough datasets to motivate this work 👍

I think the second point can be addressed by moving all but the renderer into a web worker while keeping the renderer in the main browser thread.

Perhaps the renderer can be built so it runs in a worker when supported and otherwise it falls back to the main thread 🤔

fspoettel · 2023-09-08T12:49:08Z

@flekschas would you accept a contribution here? I would be open to do the required work to implement this, but might need some pointers in the process.

We are currently using regl-scatterplot to plot a large number of plots from parquet/arrow tables which we load/parse in a web worker. Even though we have to cast the TypedArrays to Arrays at some point, this already performs quite well (we can plot millions of points across ~30 plots on a page in an acceptable time), but looking at the profiling data, there is some jank that occurs whenever kdbush prepares v. large plots or a GC of intermediate values occurs. I think rendering in a web worker would solve most of this.

With OffscreenCanvas set to ship in Safari 17, it may soon be possible to also leverage that for drawing.

flekschas · 2023-09-11T13:55:22Z

I'm totally open to contributions! However, just be warned that the full workerization might be more involved due to the fact that multiple regl-scatterplot instances (can) share a renderer that holds the actual WebGL context.

I started prototyping a complete worker-based approach some time ago and eventually got the basic setup to work. However, the performance was somehow not great at all. My approach was the following:

Each regl-scatterplot instance creates a main thread plus worker pair. The purpose of the main thread instance is to forward all communication to the worker and provide access to a canvas element via the OffscreenCanvas. In addition, the first regl-scatterplot instance creates another main thread plus worker instance: the renderer. The only reason the renderer needs to know of the main thread is to get access to a hidden canvas element via the OffscreenCanvas. Only the renderer worker creates a WebGL context. Data that is being loaded with a plot instances is processed and transferred to the renderer who holds all WebGL programs. Additionally, the plot instances pass access to their visible canvas elements on to the renderer. Finally, the renderer renders out the pixels using the hidden canvas element and then copies them onto the visible canvas elements.

The setup is somewhat complex but having a single renderer makes it possible to create many plot instances without having to worry about the max number of WebGL contexts. This is important for libraries like jupyter-scatter which use regl-scatterplot.

I'm happy to share more details

fspoettel · 2023-09-11T15:47:28Z

Thank you for the write-up, appreciate it! Interesting that you did not see performance benefits with your approach.

I started prototyping a complete worker-based approach some time ago and eventually got the basic setup to work.

Is this available on a branch somewhere? Would be helpful to be able to look at the code / run it. I can also share more details about how we currently use regl-scatterplot in our (biomedical) react application if you're interested.

but having a single renderer makes it possible to create many plot instances without having to worry about the max number of WebGL contexts.

We ran into this issue as well and are using regl-scatterplot with a shared webgl context for that reason. I found that beyond the hard context limit, using isolated webgl contexts per plot also comes with considerable performance overhead (mostly memory) when drawing more than a few plots.

flekschas · 2023-09-12T01:00:03Z

It's not yet available on any branch as I was mostly just experimenting with the overall architecture and how to get the communication (between all the workers) working in general. I invited you to the repo. Note, the whole thing is extreme bare bone yet but it should give you an idea. Happy to jump or a call or so to see how we could move this forward but this week is a bit crazy due to some deadlines. Next week should be better.

fspoettel · 2023-09-14T16:37:32Z

Thank you! I'm on holiday until end of september, unlikely to start working on this before. I'll shoot you an email to the mail on your website (nice one btw) once I'm back to get the ball rolling.

flekschas · 2023-09-15T15:02:11Z

Sounds good and enjoy vacation! 🥳

oskbor · 2023-12-07T12:46:30Z

Hi guys,
has there been any developments on this?
Glad to see that @fspoettel is also interested in this feature.

fspoettel · 2023-12-07T12:52:26Z

Hey @oskbor, yes, there has been. @flekschas and I discussed this topic in a call, I still need to write the followups for that.

We identified that a good place to start would be the kdbush index generation which is one of the main blockers on the main thread. We decided that a good way to start here is doing two things:

move the kdbush index generation to a worker.
and benchmark how much faster it is. (here we want to add some general benchmark harnesses)

I started working on those. :)

oskbor · 2024-06-28T07:15:18Z

2. OffscreenCanvas is not yet supported in Safari,

Seems that OffscreenCanvas support has landed in Safari since a while back 🎉

flekschas added the improvement Feature improvement or enhancement label Jan 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Could this run in a worker? #100

Could this run in a worker? #100

oskbor commented Jan 11, 2023

flekschas commented Jan 11, 2023

oskbor commented Jan 12, 2023 •

edited

Loading

fspoettel commented Sep 8, 2023

flekschas commented Sep 11, 2023

fspoettel commented Sep 11, 2023 •

edited

Loading

flekschas commented Sep 12, 2023

fspoettel commented Sep 14, 2023

flekschas commented Sep 15, 2023

oskbor commented Dec 7, 2023

fspoettel commented Dec 7, 2023

oskbor commented Jun 28, 2024

Could this run in a worker? #100

Could this run in a worker? #100

Comments

oskbor commented Jan 11, 2023

flekschas commented Jan 11, 2023

oskbor commented Jan 12, 2023 • edited Loading

fspoettel commented Sep 8, 2023

flekschas commented Sep 11, 2023

fspoettel commented Sep 11, 2023 • edited Loading

flekschas commented Sep 12, 2023

fspoettel commented Sep 14, 2023

flekschas commented Sep 15, 2023

oskbor commented Dec 7, 2023

fspoettel commented Dec 7, 2023

oskbor commented Jun 28, 2024

oskbor commented Jan 12, 2023 •

edited

Loading

fspoettel commented Sep 11, 2023 •

edited

Loading