API support for multi-modal model inference #913

babla9 · 2024-05-12T01:44:18Z

The current code supports only single or batch inference for multi-modal models (LLaVA 1.6, CogVLM, etc.) because vLLM does not support them. Are there plans to add API support for these models? Maybe with something like https://github.com/sgl-project/sglang?

tastelikefeet · 2024-05-14T06:37:19Z

Sure, we will record this request to improve the performance of inference.

tastelikefeet added the enhancement (New feature or request) label May 14, 2024

tastelikefeet assigned Jintao-Huang May 14, 2024
