[Feature Request] Multimodality support (LLAVA) #678
Comments
Hey @JianbangZ, thanks for bringing this up! We're bringing in LLaVA support this coming week after some major announcements are finished. We'll also bring a simple Python API that handles multimodality.
Woah, why specifically LLaVA? Any plans for InstructBLIP or other options? Either way, VERY cool. @Kathryn-cat
I think mainly due to its simplicity: a straight matmul projector instead of cross-attention, etc.
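(For context on what "straight matmul projector" means here: below is a minimal sketch, assuming the original LLaVA design, which uses a single learned linear layer to map CLIP patch features into the LLM's embedding space rather than a cross-attention module. Dimensions are illustrative, not MLC's actual code; LLaVA-1.5 later replaced the single layer with a small MLP.)

```python
import torch
import torch.nn as nn

class LlavaProjector(nn.Module):
    """Sketch of the original LLaVA projector: one learned matmul that maps
    CLIP patch features into the LLM's token-embedding space."""
    def __init__(self, clip_dim=1024, llm_dim=4096):  # illustrative dims
        super().__init__()
        self.proj = nn.Linear(clip_dim, llm_dim)

    def forward(self, clip_features):
        # clip_features: (batch, num_patches, clip_dim)
        # returns visual "tokens" of shape (batch, num_patches, llm_dim)
        return self.proj(clip_features)

# e.g. 576 patch features, as from CLIP ViT-L/14 at 336x336 resolution
visual_tokens = LlavaProjector()(torch.randn(1, 576, 1024))
```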
@Kathryn-cat Great news! Will the entire pipeline be able to run on Vulkan or CUDA? Particularly the CLIP visual encoder part, as the LLM already has great Vulkan/CUDA support.
Any updates on adding LLaVA to MLC?
Really looking forward to this. Is it still happening?
Hello folks, LLaVA has been supported in #1974 and other follow-up PRs recently (many thanks to @anibohara2000!!). You are more than welcome to try it out, and to open new issues if there are errors or questions regarding the LLaVA support.
🚀 Feature
Multimodality model support (LLAVA)
There has been more and more community interest in multimodal models, such as LLaVA. LLaVA itself has a quite simple architecture: CLIP + projector + LLM.
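To make the three stages concrete, here is a minimal, self-contained sketch of that pipeline; the vision encoder and LLM are stand-in modules and all dimensions are illustrative assumptions, not the actual MLC or LLaVA implementation.

```python
import torch
import torch.nn as nn

class TinyLlavaPipeline(nn.Module):
    """Toy sketch of the CLIP + projector + LLM flow."""
    def __init__(self, clip_dim=1024, llm_dim=4096, vocab=32000):
        super().__init__()
        self.vision_encoder = nn.Identity()             # stand-in for CLIP: image -> patch features
        self.projector = nn.Linear(clip_dim, llm_dim)   # the "straight matmul" stage
        self.embed = nn.Embedding(vocab, llm_dim)       # LLM token embeddings
        self.llm = nn.Identity()                        # stand-in for the decoder-only LLM

    def forward(self, image_features, token_ids):
        # 1. CLIP: image -> (batch, num_patches, clip_dim) patch features
        vis = self.vision_encoder(image_features)
        # 2. Projector: map visual features into the LLM embedding space
        vis = self.projector(vis)
        # 3. LLM: splice the visual "tokens" in front of the text embeddings
        txt = self.embed(token_ids)
        return self.llm(torch.cat([vis, txt], dim=1))

# Usage: one image (576 CLIP patches) plus an 8-token prompt.
out = TinyLlavaPipeline()(torch.randn(1, 576, 1024),
                          torch.randint(0, 32000, (1, 8)))
print(out.shape)  # torch.Size([1, 584, 4096])
```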