[WIP] 🚀🚀🚀 Transformers.js V3 🚀🚀🚀 #545

xenova · 2024-01-27T17:53:18Z

In preparation for Transformers.js v3, I'm compiling a list of issues/features which will be fixed/included in the release.

How to use WebGPU

First, install the development branch

npm install xenova/transformers.js#v3

Then specify the device parameter when loading the model. Here's example code to get started. Please note that this is still a WORK IN PROGRESS, so the following usage may change before release.

import { pipeline } from '@xenova/transformers';

// Create feature extraction pipeline
const extractor = await pipeline('feature-extraction', 'Xenova/all-MiniLM-L6-v2', {
    device: 'webgpu',
    dtype: 'fp32', // or 'fp16'
});

// Generate embeddings
const sentences = ['That is a happy person', 'That is a very happy person'];
const output = await extractor(sentences, { pooling: 'mean', normalize: true });
console.log(output.tolist());

Requires dynamic imports

HuggingFaceDocBuilderDev · 2024-01-27T17:56:51Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Huguet57 · 2024-01-31T00:34:02Z

Hey! This is great. Is this already in alpha?

ORT modifies the object in-place, which means you can't pass different session objects to multiple sessions

young-developer · 2024-04-11T12:31:16Z

@xenova There is new version 1.17.3 of onnxruntime-web. I tested with wav2vec and there is new error so looks like a progress 😄

xenova · 2024-04-21T00:17:41Z

Segment Anything Encoder now works with WebGPU: up to 8x faster! (online demo)

sam-webgpu.mp4

* Support exporting variants * Add support for box inputs * Add box processor unit test

xenova added 24 commits August 24, 2023 02:38

[version] Update to 3.0.0-alpha.0

129abcd

Fix SamImageProcessor

8c98446

Split SAM encoder and decoder

75ae57b

Merge branch 'main' into v3

4449caf

Only use necessary inputs for prompt_encoder_mask_decoder

2e08b74

Update to onnxruntime v1.16.1

08fef47

Binarize mask with Uint8Array data

ee24941

Allow for separate computation of reshaped input points

8913e9a

Update calculateDimensions typing

1acb635

Implement additional helper functions for RawImage

7e115dc

Implement tensor multiplication function

55066b1

Do padding after rescaling/normalizing

49e669a

Minor reformatting

1903673

Update allowed types for min and max functions

704d95d

Update compatibility for webgpu EP

66da130

Requires dynamic imports

Fix assignment issue

27fb0dc

Update dependency versions

32bcaf7

Merge branch 'main' into v3

4940e85

post-merge cleanup

aa57f00

Support webgpu build

f3509e5

Remove stream/web import

f8bc912

Merge branch 'main' into v3

97b48b3

Merge branch 'main' into v3

e772c41

Merge branch 'main' into v3

50feeb0

This was linked to issues Jan 27, 2024

[Feature request] WebGPU support #73

Open

[Feature request] Deno Support #78

Open

YOLOS model extremely slow #533

Open

xenova marked this pull request as draft January 27, 2024 18:02

xenova mentioned this pull request Apr 10, 2024

Contribution Question-What's next after run scripts.convert? #644

Closed

xenova added 2 commits April 11, 2024 11:18

Merge branch 'main' into v3

4e03073

Make a clone of session_options before passing to ORT

28fc75e

ORT modifies the object in-place, which means you can't pass different session objects to multiple sessions

xenova added 13 commits April 11, 2024 17:14

Update default quantization settings for musicgen models

9af3599

Refactor transformers.js env var

57da953

Store sessions for a model as a record

82680b8

Mark constructSessions as private

549cf6a

Update llava session name

8834263

Remove unused function

cf4bfdf

Add support for token streaming

667b2c0

Remove warning debug log

33ff336

Make BaseStreamer visible to users

7194d06

Create MusicGen Web demo

e205f18

Upgrade onnxruntime-web to 1.17.3

f922046

Remove unused import

d62e410

Allow user to update stream size

6301327

Fruup mentioned this pull request Apr 20, 2024

Compatibility with Bun chroma-core/chromadb-default-embed#2

Closed

5 tasks

xenova added 10 commits April 20, 2024 15:23

Early de-referencing

061af01

Early de-referencing

02c6b15

Update remaining model constructors

bf009d7

Implement interpolate_4d with ORT sessions

0637743

Only use proxying when not in web-worker env

bcbc2fe

Pass text inputs to text-generation pipeline

520fcfc

Update segment-anything web demo

aeb9d87

Update SAM demo package.json

a44b2c1

Indicate webgpu version

3e624cf

Fix vite build

f872ac4

SAM add support for box inputs (#563)

dda6a19

* Support exporting variants * Add support for box inputs * Add box processor unit test

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] 🚀🚀🚀 Transformers.js V3 🚀🚀🚀 #545

[WIP] 🚀🚀🚀 Transformers.js V3 🚀🚀🚀 #545

xenova commented Jan 27, 2024 •

edited

HuggingFaceDocBuilderDev commented Jan 27, 2024

Huguet57 commented Jan 31, 2024

young-developer commented Apr 11, 2024 •

edited

xenova commented Apr 21, 2024 •

edited

[WIP] 🚀🚀🚀 Transformers.js V3 🚀🚀🚀 #545

Are you sure you want to change the base?

[WIP] 🚀🚀🚀 Transformers.js V3 🚀🚀🚀 #545

Conversation

xenova commented Jan 27, 2024 • edited

How to use WebGPU

HuggingFaceDocBuilderDev commented Jan 27, 2024

Huguet57 commented Jan 31, 2024

young-developer commented Apr 11, 2024 • edited

xenova commented Apr 21, 2024 • edited

xenova commented Jan 27, 2024 •

edited

young-developer commented Apr 11, 2024 •

edited

xenova commented Apr 21, 2024 •

edited