fix: patch vllm/local endpoint model GET bug #1179
Merged
Please describe the purpose of this pull request.
Model listing in the CLI was broken for local endpoints: when we GET the local models (to generate the list), we assume the server supports the OpenAI-proxy style route `GET /v1/models`. However, we don't store local model endpoints with the `/v1` suffix, since many local LLM inference servers don't use it.

This fixes the issue by appending `/v1`
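
For reference, a minimal sketch of the approach (not the actual code from this PR; the function name, the `requests`-based client, and the response handling are illustrative assumptions):

```python
import requests

def list_local_models(base_url: str) -> list[dict]:
    """Fetch the model list from a local OpenAI-compatible endpoint."""
    # Local endpoints are stored without the /v1 suffix (many local
    # inference servers don't use it), but the OpenAI-proxy models
    # route lives at /v1/models -- so append /v1 only if it's missing.
    base_url = base_url.rstrip("/")
    if not base_url.endswith("/v1"):
        base_url = f"{base_url}/v1"

    response = requests.get(f"{base_url}/models")
    response.raise_for_status()
    # OpenAI-style responses wrap the models in a "data" field.
    return response.json().get("data", [])
```

With this, a stored endpoint like `http://localhost:8000` resolves to `http://localhost:8000/v1/models`, while an endpoint that already ends in `/v1` is left unchanged.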
in the model GET call if it isn't already present (since this call assumes an OpenAI proxy anyway).

Working as intended: