Proposed use cases from Web LLM project #480

anssiko · 2023-11-16T16:15:17Z

Great to see this thread, as part of of WebLLM project https://github.com/mlc-ai/web-llm We are also building related compilation flows for WebGPU, with ability to run llama up to 70b(with latest M2 max) https://webllm.mlc.ai/

There are great synergies to webnn related projects that possibly enables future hybrid executions of models(e.g. webgpu for customized op and some through webnn)

Discussed: WebML WG Teleconference – 16 November 2023

Related WebNN issue: custom operations #6

tqchen · 2023-11-24T16:26:27Z

Sorry for getting to this late! Happy to participate in future discussions! I think one quite useful feature is to enable a json schema of the webnn declaration, so that the compiler projects can generate them and invoke executions without explicitly doing so in javascript.

anssiko · 2024-01-11T16:49:12Z

It was my pleasure to host a discussion with @tqchen on the WebML WG's teleconference today. The minutes are at: https://www.w3.org/2024/01/11-webmachinelearning-minutes.html#t01

(Any errors in the minutes are entirely mine.)

We recorded the following as the top thing to help the Web LLM and related projects adopt WebNN:

Experiment with building WebNN TVM backend

@tqchen feel free to use this issue to drop any pointers to relevant references or note additional things the group could experiment with.

Thanks for the great work and presentation! I look forward to a continued productive collaboration.

anssiko · 2024-01-16T08:37:40Z

@tqchen thanks again for joining our meeting and presenting to us the exciting work you and your team has been doing with Web LLM, MLC and other projects.

A bunch of group participants were interested in revisiting the material you presented. If you have that available online feel free to drop a link here.

@fdwr feel free to use this issue to ask questions from @tqchen related to the discussion we had. I propose we use this issue to document areas to explore in this space to be discussed on our future calls, TVM backend being one.

anssiko · 2024-01-31T14:33:53Z

Folks interested in Web LLM and related projects should check the public Machine Learning Compilation course. The video lectures and notes are high quality.

Included are also code labs for e.g. translating low-level NumPy to TensorIR, a tensor program abstraction in Apache TVM. This will help build mental models on how WebNN API could plug in as another backend.

anssiko added the use case label Nov 16, 2023

anssiko mentioned this issue Nov 16, 2023

Support for transformers #375

Open

anssiko changed the title ~~Hybrid execution use case from Web LLM project~~ Proposed use case from Web LLM project Jan 8, 2024

anssiko changed the title ~~Proposed use case from Web LLM project~~ Proposed use cases from Web LLM project Jan 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proposed use cases from Web LLM project #480

Proposed use cases from Web LLM project #480

anssiko commented Nov 16, 2023

tqchen commented Nov 24, 2023

anssiko commented Jan 11, 2024

anssiko commented Jan 16, 2024

anssiko commented Jan 31, 2024

Proposed use cases from Web LLM project #480

Proposed use cases from Web LLM project #480

Comments

anssiko commented Nov 16, 2023

tqchen commented Nov 24, 2023

anssiko commented Jan 11, 2024

anssiko commented Jan 16, 2024

anssiko commented Jan 31, 2024