Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to deploy this model via API? #7

Open
Iven2132 opened this issue May 11, 2024 · 2 comments
Open

How to deploy this model via API? #7

Iven2132 opened this issue May 11, 2024 · 2 comments

Comments

@Iven2132
Copy link

Iven2132 commented May 11, 2024

How do we deploy this model via API? Can I deploy it on vLLM or lmdeploy? I can't find any example to run this with HuggingFace transformers.

I want to deploy 72b and 110b model

@kcz358
Copy link
Collaborator

kcz358 commented May 11, 2024

Hi @Iven2132 , see that you have already noticed our PR in sglang.

For others that have similar problems and reading this issue, you can refer to here

@RonanKMcGovern
Copy link

Would be ideal to have TGI and vLLM support as well.

I tried TGI but it seems that won't work as the model isn't recognised.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants