Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[New Model]: Google's Paligemma family of models #4833

Open
nfplay opened this issue May 15, 2024 · 2 comments
Open

[New Model]: Google's Paligemma family of models #4833

nfplay opened this issue May 15, 2024 · 2 comments
Labels
new model Requests to new models

Comments

@nfplay
Copy link

nfplay commented May 15, 2024

The model to consider.

https://huggingface.co/google/paligemma-3b-pt-896

The closest model vllm already supports.

I think the only visual language model supported right now is LLava but I could be wrong.

What's your difficulty of supporting the model you want?

No response

@nfplay nfplay added the new model Requests to new models label May 15, 2024
@abrichr
Copy link

abrichr commented May 15, 2024

MiniCPM is also supported.

Excited to test out how PaliGemma compares, especially when analyzing GUI images: OpenAdaptAI/OpenAdapt#637

@ywang96
Copy link
Collaborator

ywang96 commented Jun 2, 2024

I'm working on a PR for this currently. See #5189

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new model Requests to new models
Projects
None yet
Development

No branches or pull requests

3 participants