[WIP] Transformers.js V3 #545
Requires dynamic imports
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Hey! This is great. Is this already in alpha?
ORT modifies the object in-place, which means you can't pass different session objects to multiple sessions
@xenova There is a new version (1.17.3) of onnxruntime-web. I tested it with wav2vec and got a new error, so it looks like progress.
Segment Anything Encoder now works with WebGPU: up to 8x faster! (online demo)
* Support exporting variants
* Add support for box inputs
* Add box processor unit test
In preparation for Transformers.js v3, I'm compiling a list of issues/features which will be fixed/included in the release.
* […] `onnxruntime-web` to 1.17.0).
* […] `onnxruntime-web` → 1.17.0). Closes:
* […] `onnxruntime-web` → 1.17.0). Closes:
* `getModelJSON` fails with a bundler (see reproduction) #366
* […] `PretrainedModel`, `PretrainedTokenizer`, and `Processor` types. In a similar way to how the pipeline API has conditional types, we'll add the same for the other classes accessible by users.
* `topk` -> `top_k` parameter.
* `transpose` -> `permute`
* […] `AutoProcessor` class, which uses image processor and tokenizer

How to use WebGPU
First, install the development branch
Then specify the `device` parameter when loading the model. Here's example code to get started. Please note that this is still a WORK IN PROGRESS, so the following usage may change before release.
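A minimal sketch of what passing `device` might look like, assuming the development build is already installed. The package name `@xenova/transformers`, the model id `Xenova/all-MiniLM-L6-v2`, the `'wasm'` fallback value, and the helper names `pickDevice` and `createExtractor` are illustrative assumptions, not taken from this PR. The exact options may differ in the released version.

```javascript
// Hypothetical helper: pick 'webgpu' when the environment exposes the
// standard WebGPU entry point (navigator.gpu); otherwise fall back to
// 'wasm'. The fallback value is an assumption.
function pickDevice(nav = globalThis.navigator) {
  return nav && 'gpu' in nav ? 'webgpu' : 'wasm';
}

// Hypothetical helper: load a feature-extraction pipeline on the chosen device.
async function createExtractor(device = pickDevice()) {
  // Dynamic import, so the library is only loaded when it is needed.
  // The package name is assumed.
  const { pipeline } = await import('@xenova/transformers');

  // `device` is the parameter described above; the model id is only an example.
  return pipeline('feature-extraction', 'Xenova/all-MiniLM-L6-v2', { device });
}
```

Usage would be along the lines of `const extractor = await createExtractor();` followed by `await extractor(['That is a happy person'], { pooling: 'mean', normalize: true })`. Keeping device selection in its own function means the WebGPU check can be adjusted or overridden without touching the loading code.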