Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WASM backend #146

Open
FL33TW00D opened this issue Mar 29, 2024 · 4 comments
Open

WASM backend #146

FL33TW00D opened this issue Mar 29, 2024 · 4 comments

Comments

@FL33TW00D
Copy link
Collaborator

Whilst WebGPU support is growing, perhaps it would be prudent to have a WASM backend to ensure 100% device coverage.

@akashicMarga
Copy link

@FL33TW00D have you checked this https://github.com/WasmEdge/WasmEdge ? they already have llaama.cpp, pytorch and tensorflow backend.

this project is for llm - https://github.com/LlamaEdge/LlamaEdge/tree/main

@FL33TW00D
Copy link
Collaborator Author

@FL33TW00D have you checked this https://github.com/WasmEdge/WasmEdge ? they already have llaama.cpp, pytorch and tensorflow backend.

this project is for llm - https://github.com/LlamaEdge/LlamaEdge/tree/main

Yeah I've seen this! I think we can't really use it, but good reference!

@flatsiedatsie
Copy link

Interesting project. How is it related to Transformers.js? Since both are a project of HuggingFace, correct?

On topic: I'm working on a project that uses multiple 'engines' to achieve this.

  • WebLLM when for WebGPU is available
  • Wllama when it's not.

I try to choose models that run on both, and then invisibly switch engines for the user, depending on their browser.

Transformers.js runs all the smaller but vital other models, like voice recognition, TTS, OCR, embedding, etc.

@FL33TW00D
Copy link
Collaborator Author

Interesting project. How is it related to Transformers.js? Since both are a project of HuggingFace, correct?

On topic: I'm working on a project that uses multiple 'engines' to achieve this.

  • WebLLM when for WebGPU is available
  • Wllama when it's not.

I try to choose models that run on both, and then invisibly switch engines for the user, depending on their browser.

Transformers.js runs all the smaller but vital other models, like voice recognition, TTS, OCR, embedding, etc.

Ratchet is mostly experimental at the moment! Transformers.JS is production ready :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants