`HFClientVLLM` passes extra parameters to vLLM server causing `extra_forbidden` error #974
Comments
Do we remove only `port` and `url`, or all user-provided kwargs that are not fields of the pydantic model?
The way vLLM is structured, any field that is not a parameter of this base model will raise an error: https://github.com/vllm-project/vllm/blob/c7f2cf2b7f67bce5842fedfdba508440fe257375/vllm/entrypoints/openai/protocol.py#L69

Now, I believe the only extras you inject are `port` and `url`. If you remove those two parameters, it should be good. Please don't remove any parameters that the user gave. This maximizes the user's flexibility when interacting with vLLM. I'd be glad to test this once you have a branch up, but ideally it would be better to include a unit test for posterity. Thanks for looking into this.

Damian

On 11. May 2024, at 14:07, omkar-sh wrote:
> Do we remove only port and url or all the kwargs provided by the user that are not in pydantic variables?
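The suggestion above (drop only the two client-injected keys, keep everything the user passed, and cover it with a unit test) could be sketched roughly as follows. This is not the actual `HFClientVLLM` code: `build_payload`, `CLIENT_ONLY_KEYS`, and the test name are hypothetical, and the sketch assumes `port` and `url` are the only keys the client adds on its own.

```python
from typing import Any, Dict

# Assumption: these are the only keys the client injects itself.
# Everything else in kwargs is treated as a user-supplied sampling parameter.
CLIENT_ONLY_KEYS = frozenset({"port", "url"})


def build_payload(model: str, prompt: str, **kwargs: Any) -> Dict[str, Any]:
    """Build a request body, dropping only the client-side connection keys."""
    payload = {k: v for k, v in kwargs.items() if k not in CLIENT_ONLY_KEYS}
    payload["model"] = model
    payload["prompt"] = prompt
    return payload


def test_build_payload_drops_only_client_keys() -> None:
    payload = build_payload(
        "some-model",
        "Hello",
        port=8000,
        url="http://localhost",
        temperature=0.2,
        top_k=5,
    )
    # Connection details must not reach the server.
    assert "port" not in payload
    assert "url" not in payload
    # User-supplied parameters are passed through untouched.
    assert payload["temperature"] == 0.2
    assert payload["top_k"] == 5
```

Using a deny-list of two keys rather than an allow-list of known fields keeps any parameter the user supplies flowing through to the server, which is the flexibility asked for above.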
I opened a PR for this. Please check it out. Link
Hey. Thanks for your PR. I do not think that does what you think it does. The keys
I missed negating the `if` statement. It is fixed now.
Moving this conversation to the PR. I left a comment there.
Also
DSPy version: 2.4.9
vLLM version: 0.4.2
When `HFClientVLLM` performs an HTTP request to a vLLM server, it passes `kwargs`. The `kwargs` contain values such as `port` and `url` that are not recognized by the vLLM server. This is in this line

The issue is that in the latest vLLM release, extra (unrecognized) parameters throw an error. This is due to line .

In all, when using `HFClientVLLM`, this causes the following error:

I hope that this issue is insightful.
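To illustrate the failure mode described in this issue, here is a minimal standard-library sketch of a request schema that rejects unknown fields. It is not vLLM's model: `CompletionRequestSketch` and `validate_strict` are hypothetical stand-ins, and the field list is an assumption chosen only for the example. The error label mirrors pydantic's `extra_forbidden` error type.

```python
from dataclasses import dataclass, fields
from typing import Any, Dict


@dataclass
class CompletionRequestSketch:
    """Hypothetical strict request schema (illustration only, not vLLM's)."""

    model: str
    prompt: str
    max_tokens: int = 16
    temperature: float = 1.0


def validate_strict(payload: Dict[str, Any]) -> CompletionRequestSketch:
    """Reject any key that is not a declared field of the schema."""
    allowed = {f.name for f in fields(CompletionRequestSketch)}
    extra = sorted(set(payload) - allowed)
    if extra:
        raise ValueError(f"extra_forbidden: {extra}")
    return CompletionRequestSketch(**payload)


# A payload that still carries the client's connection details is rejected.
try:
    validate_strict(
        {"model": "m", "prompt": "hi", "port": 8000, "url": "http://localhost"}
    )
except ValueError as err:
    print(err)
```

With `port` and `url` removed from the payload, the same call validates and returns a populated request object.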