
[Frontend] Expose custom args in OpenAI APIs #16862

Merged

njhill merged 38 commits into vllm-project:main from afeldman-nm/extra_args on Jun 19, 2025

Conversation

afeldman-nm
Contributor

@afeldman-nm afeldman-nm commented Apr 18, 2025

Add a vllm_xargs: Optional[dict[str, Union[str, int, float]]] field to CompletionRequest, ChatCompletionRequest, and TranscriptionRequest (these are the only OpenAIBaseModel subclasses which had a logits_processors field in v0). This field is injected into SamplingParams.extra_args via SamplingParams.from_optional(); each dict key/value pair in extra_args becomes an assignment to an attribute of sampling_params.
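
For illustration only, a minimal, self-contained sketch of the data flow described above. The CompletionRequest and SamplingParams classes below are simplified stand-ins for the real vLLM classes, and to_sampling_params is a hypothetical helper; only the vllm_xargs and extra_args names come from this PR.

```python
from dataclasses import dataclass
from typing import Optional, Union

from pydantic import BaseModel


class CompletionRequest(BaseModel):
    """Toy subset of the OpenAI-compatible request, showing the new field."""
    model: str
    prompt: str
    temperature: float = 1.0
    # New in this PR: arbitrary engine-specific key/value args.
    vllm_xargs: Optional[dict[str, Union[str, int, float]]] = None


@dataclass
class SamplingParams:
    """Stand-in for vllm.SamplingParams; only the fields relevant here."""
    temperature: float = 1.0
    extra_args: Optional[dict] = None

    @classmethod
    def from_optional(cls, temperature=None, extra_args=None):
        return cls(
            temperature=1.0 if temperature is None else temperature,
            extra_args=extra_args,
        )


def to_sampling_params(req: CompletionRequest) -> SamplingParams:
    # vllm_xargs is forwarded into SamplingParams.extra_args, where downstream
    # consumers (logits processors, plugins, ...) can read the custom values.
    return SamplingParams.from_optional(
        temperature=req.temperature,
        extra_args=dict(req.vllm_xargs) if req.vllm_xargs else None,
    )


req = CompletionRequest(model="m", prompt="Hello", vllm_xargs={"my_plugin_arg": 3})
print(to_sampling_params(req).extra_args)  # -> {'my_plugin_arg': 3}
```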

Purpose

Enable extensible features such as logits processors and plugins to receive arbitrary custom arguments via the REST API. Mirror SamplingParams.extra_args in the REST API.

Test plan

Does not require additional unit tests; when logitsprocs extensibility is introduced later, that work will implicitly test custom args. Pre-existing unit tests must pass so we know existing features are not broken.

Test results

N/A

Documentation changes

  • The SamplingParams docstring clarifies that extra_args may plumb custom args to logitsprocs, plugins, etc. (previously it mentioned only logitsprocs).
  • In vllm/entrypoints/openai/protocol.py: for CompletionRequest, ChatCompletionRequest, and TranscriptionRequest, move the vllm_xargs definition inside the # --8<-- [start:completion-extra-params] section and add more detail to the description string.

Final note

The pre-existing behavior of protocol.py ChatCompletionRequest and CompletionRequest is that kv_transfer_params is passed into the engine via SamplingParams.extra_args; this PR simply merges vllm_xargs into SamplingParams.extra_args alongside kv_transfer_params. In the future it may be worth considering whether SamplingParams.extra_args is the best pathway for plumbing kv_transfer_params into the engine; it would seem to break the convention that SamplingParams.extra_args is not intended for "in-tree" functionality.
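
A hypothetical sketch of the merge described above; only the key names vllm_xargs and kv_transfer_params come from the PR text, and the helper itself is not the actual protocol.py code.

```python
from typing import Optional, Union

CustomArgs = dict[str, Union[str, int, float]]


def build_extra_args(
    kv_transfer_params: Optional[dict],
    vllm_xargs: Optional[CustomArgs],
) -> Optional[dict]:
    """Merge vllm_xargs into extra_args alongside the pre-existing kv_transfer_params."""
    extra_args: dict = {}
    if kv_transfer_params:
        extra_args["kv_transfer_params"] = kv_transfer_params
    if vllm_xargs:
        extra_args.update(vllm_xargs)
    return extra_args or None


print(build_extra_args({"remote_host": "kv-cache-node"}, {"my_custom_arg": 0.5}))
# -> {'kv_transfer_params': {'remote_host': 'kv-cache-node'}, 'my_custom_arg': 0.5}
```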

RFC: #17191

Fixes #16802

Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com>

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs will not trigger a full CI run by default. Instead, only the fastcheck CI will run, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run full CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

@mergify mergify bot added the frontend label Apr 18, 2025
@njhill
Member

njhill commented Apr 18, 2025

Thanks @afeldman-nm! It would be good to include a test that shows how these can be passed via the OpenAI client sdk using its extra_body option: https://github.com/openai/openai-python?tab=readme-ov-file#undocumented-request-params

I'm unsure whether we want these new custom args to be in a nested json object (as you've done here) or just extra top-level args.
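
For reference, a minimal sketch of what such a client-side call could look like using the SDK's extra_body option. The server URL, model name, and custom argument names below are placeholders, and this is not the unit test requested above.

```python
from openai import OpenAI

# Assumes a vLLM OpenAI-compatible server is running locally.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

completion = client.completions.create(
    model="my-model",
    prompt="Hello",
    # extra_body entries are sent as additional top-level JSON fields, so the
    # server receives vllm_xargs as the nested dict added in this PR.
    extra_body={"vllm_xargs": {"my_custom_arg": "value", "another_arg": 2}},
)
print(completion.choices[0].text)
```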

@afeldman-nm
Contributor Author

Thanks @njhill. Agreed regarding the unit test; I need to think a bit about the right way to do it.

Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com>
Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com>
Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com>
Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com>
@mergify mergify bot added the v1 label Apr 22, 2025
Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com>
Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com>
Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com>
@afeldman-nm afeldman-nm marked this pull request as ready for review April 23, 2025 15:03
@afeldman-nm
Contributor Author

Thanks for your review @comaniac . After chatting with Cody, I think this interface change is sufficiently impactful to merit an RFC which I will write and share shortly.

Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com>
Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com>
@afeldman-nm afeldman-nm requested a review from aarnphm as a code owner June 18, 2025 02:33
@mergify mergify bot removed the needs-rebase label Jun 18, 2025
Signed-off-by: Andrew Feldman <afeldman@redhat.com>
Signed-off-by: Andrew Feldman <afeldman@redhat.com>
Signed-off-by: Andrew Feldman <afeldman@redhat.com>
Signed-off-by: Andrew Feldman <afeldman@redhat.com>
@afeldman-nm
Contributor Author

> @afeldman-nm Glad to see it is still in progress. My use case is passing in the truncate_prompt_tokens sampling parameter. Can we just unit test what we can test now and add more comprehensive unit tests when the logits processor work is done?

Hi @helloworld1 - working on getting this PR landed as-is.

Signed-off-by: Andrew Feldman <afeldman@redhat.com>
@afeldman-nm afeldman-nm deleted the afeldman-nm/extra_args branch June 18, 2025 18:16
@afeldman-nm afeldman-nm restored the afeldman-nm/extra_args branch June 18, 2025 18:16
@afeldman-nm afeldman-nm reopened this Jun 18, 2025
Member

@njhill njhill left a comment

Thanks @afeldman-nm LGTM, just a couple of minor comments.

Signed-off-by: Andrew Feldman <afeldman@redhat.com>
Signed-off-by: Andrew Feldman <afeldman@redhat.com>
Signed-off-by: Andrew Feldman <afeldman@redhat.com>
Signed-off-by: Andrew Feldman <afeldman@redhat.com>
afeldman-nm and others added 2 commits June 18, 2025 15:18
Co-authored-by: Nick Hill <nhill@redhat.com>
Signed-off-by: Andrew Feldman <afeldman@redhat.com>
Member

@njhill njhill left a comment

Thanks @afeldman-nm

@njhill njhill added the ready ONLY add when PR is ready to merge/full CI is needed label Jun 18, 2025
@njhill njhill changed the title [V1] vLLM OpenAI API custom args [Frontend] Expose custom args in OpenAI APIs Jun 19, 2025
@njhill njhill merged commit dfada85 into vllm-project:main Jun 19, 2025
78 checks passed
@njhill njhill deleted the afeldman-nm/extra_args branch June 19, 2025 00:41
yeqcharlotte pushed a commit to yeqcharlotte/vllm that referenced this pull request Jun 22, 2025
Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com>
Signed-off-by: Andrew Feldman <afeldman@redhat.com>
Co-authored-by: Nick Hill <nhill@redhat.com>
minpeter pushed a commit to minpeter/vllm that referenced this pull request Jun 24, 2025
Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com>
Signed-off-by: Andrew Feldman <afeldman@redhat.com>
Co-authored-by: Nick Hill <nhill@redhat.com>
Signed-off-by: minpeter <kali2005611@gmail.com>
yangw-dev pushed a commit to yangw-dev/vllm that referenced this pull request Jun 24, 2025
Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com>
Signed-off-by: Andrew Feldman <afeldman@redhat.com>
Co-authored-by: Nick Hill <nhill@redhat.com>
Signed-off-by: Yang Wang <elainewy@meta.com>
Labels: frontend, ready (ONLY add when PR is ready to merge/full CI is needed), v1
Development

Successfully merging this pull request may close these issues:

  • [Feature]: Support custom args in OpenAI (chat) completion requests

4 participants