[Feature Request] Multimodality support (LLAVA) #678

Closed
JianbangZ opened this issue Aug 7, 2023 · 7 comments
Labels
feature request New feature or request

Comments

@JianbangZ

🚀 Feature

Multimodality model support (LLAVA)
There has been growing community interest in multimodal models such as LLaVA. LLaVA itself has a quite simple architecture: CLIP + projector + LLM.
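
To make the "CLIP + projector + LLM" data flow concrete, here is a minimal sketch in plain Python. It is not LLaVA's or MLC's code: the helper names (`encode_image_stub`, `build_llm_inputs`), the toy dimensions, and the random weights are all hypothetical, the vision encoder is a stand-in, and the projector is assumed to be a single linear map. Prepending the image tokens to the text embeddings is also a simplifying assumption.

```python
import random


def matmul(a, b):
    """Multiply an (n x k) matrix by a (k x m) matrix, both given as lists of lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def encode_image_stub(num_patches, vision_dim, rng):
    # Hypothetical stand-in for the CLIP vision encoder:
    # returns one feature vector per image patch.
    return [[rng.uniform(-1, 1) for _ in range(vision_dim)] for _ in range(num_patches)]


def build_llm_inputs(image_features, projector, text_embeddings):
    # Assumption: the projector is one matrix multiply mapping vision_dim -> llm_dim.
    image_tokens = matmul(image_features, projector)
    # Assumption: projected image tokens are simply prepended to the text embeddings,
    # so the LLM sees one sequence of vectors that all have size llm_dim.
    return image_tokens + text_embeddings


rng = random.Random(0)
vision_dim, llm_dim = 8, 16          # toy sizes, not the real model's
num_patches, num_text_tokens = 4, 3

image_features = encode_image_stub(num_patches, vision_dim, rng)
projector = [[rng.uniform(-1, 1) for _ in range(llm_dim)] for _ in range(vision_dim)]
text_embeddings = [[rng.uniform(-1, 1) for _ in range(llm_dim)] for _ in range(num_text_tokens)]

llm_inputs = build_llm_inputs(image_features, projector, text_embeddings)
print(len(llm_inputs), len(llm_inputs[0]))  # sequence length, embedding width
```

The point of the sketch is only that the vision features are reshaped into the LLM's embedding width and then treated like ordinary tokens, so the LLM itself does not need to change.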

@JianbangZ JianbangZ added the feature request New feature or request label Aug 7, 2023
@JianbangZ JianbangZ changed the title [Feature Request] [Feature Request] Multimodality support (LLAVA) Aug 7, 2023
@Kathryn-cat
Contributor

Hey @JianbangZ, thanks for bringing this up! We're bringing in LLaVA support this coming week after some major announcements are finished. We'll also bring a simple Python API that handles multimodality.

@brianjking

Woah, why specifically LLaVa? Any plans for InstructBlip or other options?

Either way, VERY cool. @Kathryn-cat

@JianbangZ
Author

> Woah, why specifically LLaVa? Any plans for InstructBlip or other options?
>
> Either way, VERY cool. @Kathryn-cat

I think it's mainly due to its simplicity: a straight matmul projector instead of cross-attention, etc.

@JianbangZ
Author

@Kathryn-cat Great news! Will the entire pipeline be able to run on Vulkan or CUDA? I'm asking particularly about the CLIP visual encoder part, as the LLM already has great Vulkan/CUDA support.

@saad-palapa

Any updates on adding LLaVA to MLC?

@sam1am

sam1am commented Jan 16, 2024

Really looking forward to this. Is it still happening?

@MasterJH5574
Collaborator

Hello folks, LLaVA has been supported since #1974 and other recent follow-up PRs (many thanks to @anibohara2000!!). You are more than welcome to try it out, and to open new issues if you hit errors or have questions about the LLaVA support.


6 participants