Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

API support for multi-modal model inference #913

Open
babla9 opened this issue May 12, 2024 · 1 comment
Open

API support for multi-modal model inference #913

babla9 opened this issue May 12, 2024 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@babla9
Copy link

babla9 commented May 12, 2024

Current code only supports single or batch inference for multi-modal models (Llava1.6, cogvlm etc) due to lack of vllm support. Any plans to add feature support to enable API support for these models? Maybe with something like https://github.com/sgl-project/sglang?

@tastelikefeet
Copy link
Collaborator

Sure, we will record this request to improve the performance of inference.

@tastelikefeet tastelikefeet added the enhancement New feature or request label May 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants