This repository has been archived by the owner on Sep 4, 2023. It is now read-only.

Implementing the API (WebONNX) for doing inference on ONNX-serialized models on platform-available accelerators #367

Closed
KOLANICH opened this issue Jun 3, 2022 · 4 comments

Comments


KOLANICH commented Jun 3, 2022

The idea is to implement a WebExtensions experiment that

  • discovers the native ML inference libraries available in the system, such as platform-provided frameworks like WinML, TF Lite, and Core ML, libraries installed from packages, like onnxruntime (contains M$ telemetry!) and libonnx, and maybe even ONNX-MLIR + LLVM 15 (see the discovery sketch below)
  • binds to the native libraries found in the system
  • provides WebExtensions with an API for running inference on models

More info: open-source-ideas/ideas#69
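To make the discovery step concrete, here is a minimal sketch of how a native helper process could probe for available runtimes. The backend names and library paths are assumptions for illustration only, not an inventory of what such an experiment would actually ship.

```ts
// Hypothetical discovery step, assuming a Node-based native host process.
// The candidate list and paths below are illustrative assumptions only.
import { existsSync } from "node:fs";

interface Backend {
  name: string;
  libraryPath: string; // shared library or framework binary to bind against
}

// Runtimes the host could look for on a typical Linux/macOS system.
const CANDIDATES: Backend[] = [
  { name: "onnxruntime", libraryPath: "/usr/lib/libonnxruntime.so" },
  { name: "libonnx", libraryPath: "/usr/lib/libonnx.so" },
  { name: "CoreML", libraryPath: "/System/Library/Frameworks/CoreML.framework/CoreML" },
];

// Return the runtimes whose binaries are actually present on this system.
export function discoverBackends(): Backend[] {
  return CANDIDATES.filter((backend) => existsSync(backend.libraryPath));
}
```

The binding and inference steps would then load one of the discovered libraries (for example via FFI) and expose it to extensions through an API along the lines sketched in the next comment.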


KOLANICH commented Jun 3, 2022

The API should be designed so that almost exactly the same API could also be exposed to web pages, perhaps under a different namespace (for example, browser.onnx in WebExtensions and navigator.onnx in web pages), so that web pages could reuse almost exactly the same code as WebExtensions.
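A minimal sketch of what such a shared surface could look like, assuming hypothetical browser.onnx / navigator.onnx namespaces and made-up method names (loadModel, run); nothing here exists in Firefox today:

```ts
// Hypothetical API shape; the namespaces and methods are assumptions.
interface OnnxSession {
  run(inputs: Record<string, Float32Array>): Promise<Record<string, Float32Array>>;
}

interface OnnxApi {
  loadModel(model: ArrayBuffer): Promise<OnnxSession>;
}

// The calling code is identical in both contexts; only the lookup differs.
function getOnnx(): OnnxApi | undefined {
  const g = globalThis as any;
  return g.browser?.onnx ?? g.navigator?.onnx; // webext vs. web page namespace
}

async function infer(modelBytes: ArrayBuffer, input: Float32Array) {
  const onnx = getOnnx();
  if (!onnx) throw new Error("no ONNX inference API available");
  const session = await onnx.loadModel(modelBytes);
  return session.run({ input });
}
```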


kpu commented Jun 3, 2022

You're absolutely right that native is faster. In fact, the current WASM implementation is 10x slower than a proper native implementation, which could still run in a sandbox. That speed could also have been used to deliver better translation quality.

[image: speed]

We're developing a native messaging extension here: https://github.com/jelmervdl/firefox-translations though it's still WIP.
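For reference, the extension side of such a setup is plain native messaging. A rough sketch, where the host name "onnx_inference_host" and the message format are made up for illustration and are not part of firefox-translations:

```ts
// Illustrative only: a WebExtension handing inference off to a native host
// over native messaging. The host name and message shape are assumptions.
declare const browser: any; // WebExtensions global (typed loosely here)

const port = browser.runtime.connectNative("onnx_inference_host");

port.onMessage.addListener((response: { outputs?: number[]; error?: string }) => {
  if (response.error) {
    console.error("Inference failed:", response.error);
  } else {
    console.log("Model outputs:", response.outputs);
  }
});

// Ask the native host to run a model on some input tensor.
port.postMessage({ model: "model.onnx", inputs: [0.1, 0.2, 0.3] });
```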

Have you spoken to the Web Machine Learning group at W3C? https://www.w3.org/blog/2021/04/w3c-launches-the-web-machine-learning-working-group/ Their timeline was too long for us to use it.


KOLANICH commented Jun 3, 2022

> Have you spoken to the Web Machine Learning group at W3C?

No, but I guess I should. Thank you for mentioning it.


marco-c commented Jul 11, 2023

We are not going to make any improvements or fixes to the addon since we are now focusing on the built-in version.
Once some API along these lines is available in Firefox, we'll try to make use of it to speed up model inference.

marco-c closed this as not planned on Jul 11, 2023