Support a vision model on webcam live stream #291

flatsiedatsie · 2024-02-06T12:02:20Z

It would be very interesting to run a vision model like LLAVA, or the very impressive and tiny new Moondream LLM, in the browser. It would be fantastic if it could analyze images and (live) video.

That would, for example, allow us to create a tool to help blind people to get a sense of the world around them simply by opening a webpage on their phone.

CharlieFRuan · 2024-02-08T18:54:52Z

Yep, multimodal models like LLaVA are on our roadmap as brought up here #276

flatsiedatsie closed this as completed Feb 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support a vision model on webcam live stream #291

Support a vision model on webcam live stream #291

flatsiedatsie commented Feb 6, 2024

CharlieFRuan commented Feb 8, 2024

Support a vision model on webcam live stream #291

Support a vision model on webcam live stream #291

Comments

flatsiedatsie commented Feb 6, 2024

CharlieFRuan commented Feb 8, 2024