
[deps][llm] Upgrade to vLLM 0.16.0#61389

Merged
kouroshHakha merged 3 commits into master from vllm-0.16.0
Mar 4, 2026
Conversation

@jeffreywang-anyscale
Contributor

@jeffreywang-anyscale jeffreywang-anyscale commented Feb 27, 2026

Description

Upgrade to vLLM 0.16.0 and adapt to breaking API changes in init_app_state and truncate_prompt_tokens.

Breaking change 1 (Figured out by Claude)

vLLM 0.16.0's init_app_state no longer unconditionally initializes all serving endpoints. Instead, it expects a supported_tasks tuple to decide which serving objects to create. Without it, vLLM falls back to ('generate',) only.

Fix: query supported_tasks from engine_client.get_supported_tasks() (which inspects the model's actual capabilities) and pass it to init_app_state, matching what vLLM's own API server entrypoint does.
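The pattern can be sketched as follows. This is a minimal, self-contained illustration of the handshake described above, not Ray's or vLLM's actual code: StubEngineClient and this init_app_state are stand-ins for vLLM's engine client (whose real get_supported_tasks inspects the loaded model) and vLLM's real app-state initializer.

```python
# Illustrative sketch: query the engine for its supported tasks and forward
# them to the app-state initializer. Without the hint, only 'generate' comes
# up, mirroring vLLM 0.16.0's fallback. All names here are stand-ins.
import asyncio


class StubEngineClient:
    """Stand-in for vLLM's engine client; the real one inspects the model."""

    async def get_supported_tasks(self):
        # e.g. a model that supports both generation and embedding.
        return ("generate", "embed")


async def init_app_state(engine_client, supported_tasks=None):
    # vLLM 0.16.0 only creates serving objects for the tasks it is told
    # about; with no hint it assumes generation only.
    if supported_tasks is None:
        supported_tasks = ("generate",)
    return {task: f"serving_{task}" for task in supported_tasks}


async def main():
    client = StubEngineClient()
    # Old call pattern: only the 'generate' endpoint is initialized.
    default_state = await init_app_state(client)
    # Fixed pattern: ask the engine first, then pass the answer through.
    tasks = await client.get_supported_tasks()
    full_state = await init_app_state(client, supported_tasks=tasks)
    return default_state, full_state


default_state, full_state = asyncio.run(main())
print(sorted(default_state))  # ['generate']
print(sorted(full_state))     # ['embed', 'generate']
```

This mirrors what vLLM's own API server entrypoint does: query once at startup, then hand the tuple to init_app_state so embedding/pooling endpoints are created for models that support them.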

Breaking change 2

truncate_prompt_tokens in pooling_params is deprecated for pooling/embedding tasks.

Fix

  • Add tokenization_kwargs as a new optional input column for embedding/pooling tasks, passed through to engine.encode().
  • Add backward-compatible shim: if users still pass truncate_prompt_tokens in pooling_params, it is automatically converted to the equivalent tokenization_kwargs with a deprecation warning. truncate_prompt_tokens: -1 is resolved to max_model_len.
  • Update tests to cover both the new tokenization_kwargs path and the legacy truncate_prompt_tokens compatibility path.
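The shim described in the bullets above could look roughly like this. A hedged sketch only: the function name, the pooling_params dict shape, and the "truncation"/"max_length" keys inside tokenization_kwargs are assumptions for illustration, not Ray's actual internals.

```python
# Hypothetical sketch of the backward-compatible shim: a legacy
# truncate_prompt_tokens in pooling_params is converted to the equivalent
# tokenization_kwargs, with -1 resolved to max_model_len. Key names in the
# returned dict are assumed, not taken from vLLM's real API.
import warnings


def resolve_tokenization_kwargs(pooling_params: dict, max_model_len: int) -> dict:
    """Pop legacy truncate_prompt_tokens and return tokenization_kwargs."""
    tokenization_kwargs: dict = {}
    truncate = pooling_params.pop("truncate_prompt_tokens", None)
    if truncate is not None:
        warnings.warn(
            "truncate_prompt_tokens in pooling_params is deprecated; "
            "pass tokenization_kwargs instead.",
            DeprecationWarning,
            stacklevel=2,
        )
        # -1 historically meant "truncate to the model's maximum length".
        if truncate == -1:
            truncate = max_model_len
        tokenization_kwargs["truncation"] = True
        tokenization_kwargs["max_length"] = truncate
    return tokenization_kwargs


# Legacy callers keep working, new callers pass tokenization_kwargs directly:
print(resolve_tokenization_kwargs({"truncate_prompt_tokens": -1}, 4096))
# {'truncation': True, 'max_length': 4096}
print(resolve_tokenization_kwargs({}, 4096))
# {}
```

The resulting dict would then be passed through to engine.encode() alongside the cleaned pooling_params, so both the legacy and new input paths converge on one code path.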


Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>
@jeffreywang-anyscale added the go (add ONLY when ready to merge, run all tests) label Feb 27, 2026
@jeffreywang-anyscale
Contributor Author

jeffreywang-anyscale commented Feb 27, 2026

First pass with CC.

mkdir -p ~/.claude/skills/vllm-upgrade
touch ~/.claude/skills/vllm-upgrade/SKILL.md

Running through tests now to see if it misses anything.

Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request upgrades the vLLM dependency to version 0.16.0. The changes are primarily focused on updating dependency files, including Dockerfiles, requirements files, and numerous lock files. The code has also been adapted to the new vLLM version by updating import paths and removing workarounds for issues that have been resolved in the new release. All changes appear to be correct and consistent with the goal of upgrading vLLM.

Comment thread on python/ray/llm/_internal/serve/engines/vllm/vllm_engine.py
@ray-gardener ray-gardener bot added serve Ray Serve Related Issue llm labels Feb 28, 2026
# vLLM 0.12.0 ignores truncate_prompt_tokens in the pooling_params.
# TODO (jeffreywang): Remove the following line once
# https://github.com/vllm-project/vllm/issues/31012 is fixed.
truncate_prompt_tokens=request.params.truncate_prompt_tokens,
Contributor Author


AI @jeffreywang-anyscale validate the behavior of truncate_prompt_tokens.

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>
Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>
@kouroshHakha kouroshHakha merged commit 2aa11b9 into master Mar 4, 2026
6 checks passed
@kouroshHakha kouroshHakha deleted the vllm-0.16.0 branch March 4, 2026 03:13
bittoby pushed a commit to bittoby/ray that referenced this pull request Mar 6, 2026
Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>
Signed-off-by: bittoby <bittoby@users.noreply.github.com>
ryanaoleary pushed a commit to ryanaoleary/ray that referenced this pull request Mar 13, 2026
Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

Labels

go (add ONLY when ready to merge, run all tests), llm, serve (Ray Serve Related Issue)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Ray fails to serialize self-reference objects

3 participants