Remove V0 Encoder-Decoder Support #24907

WoosukKwon · 2025-09-15T19:50:25Z

Remove V0 encoder decoder model runner.
Also, this PR deletes deprecated models such as BART. After this PR, Whisper will be the only encoder-decoder model supported by vLLM.

mergify · 2025-09-15T19:52:21Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @WoosukKwon.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

gemini-code-assist

Code Review

This pull request restores support for Mllama4 and Skywork models. As part of this, it also removes a significant amount of code and tests related to a generic encoder-decoder model runner, effectively dropping support for models like BART, mBART, Donut, Florence2, and the original Mllama. While the changes are extensive, they appear to be mostly consistent with this goal. However, I've identified two critical bugs in an example script that will lead to NameError exceptions at runtime due to variables being used after their definitions were removed.

examples/offline_inference/vision_language.py
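For readers unfamiliar with this failure class, here is a minimal, hypothetical sketch of how a leftover reference to a deleted definition produces a NameError only at runtime. The names `run_whisper`, `run_removed_model`, `build_registry`, and `stale_reference_error` are illustrative assumptions and are not taken from vision_language.py.

```python
# Hypothetical example: none of these names come from the vLLM repository.

def run_whisper(question: str) -> str:
    """Stand-in for a model helper that still exists after a cleanup."""
    return f"whisper:{question}"


# Assume a helper named `run_removed_model` was deleted in a cleanup,
# but the registry below still refers to it.
def build_registry() -> dict:
    return {
        "whisper": run_whisper,
        # The name is looked up only when this function is called,
        # so importing the module succeeds and the error appears later.
        "removed_model": run_removed_model,
    }


def stale_reference_error() -> str:
    """Return the NameError message raised by the stale reference, if any."""
    try:
        build_registry()
    except NameError as exc:
        return str(exc)
    return ""


if __name__ == "__main__":
    print(stale_reference_error())
```

Because Python resolves names when the code runs rather than when it is defined, this kind of bug tends to pass import checks and is typically caught by a linter, a test that exercises the path, or a reviewer.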

Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>

mfournioux · 2025-09-23T06:37:55Z

@WoosukKwon your PR deletes the Donut model from the offline inference examples. Could you please let me know why the Donut model is no longer supported, given that #23229 was merged recently?

WoosukKwon · 2025-09-23T15:39:35Z

@mfournioux We were planning to discontinue encoder-decoder model support (except Whisper) in the course of V0 deprecation. I think that PR was merged without awareness of this plan.

mfournioux · 2025-09-25T11:32:37Z

> @mfournioux We were planning to discontinue encoder-decoder model support (except Whisper) in the course of V0 deprecation. I think that PR was merged without awareness of this plan.

Many thanks for your reply. Why is only the Whisper model kept, while others like Donut are no longer supported after this PR? Does this mean the Donut model will no longer be supported in the next vLLM version?

WoosukKwon added the codex label Sep 15, 2025 — with ChatGPT Codex Connector

WoosukKwon requested review from hmellor, DarkLight1337, ywang96, tlrmchlsmth, yewentao256, robertgshaw2-redhat, simon-mo, aarnphm, NickLucche, youkaichao, mgoin, houseroad, ProExpertProg, zhuohan123, alexm-redhat, comaniac and njhill as code owners September 15, 2025 19:50

mergify bot added the documentation, llama, multi-modality, new-model, and v1 labels Sep 15, 2025

WoosukKwon changed the base branch from main to codex/remove-v0-encoder-decoder-model-support September 15, 2025 19:51

mergify bot added the needs-rebase label Sep 15, 2025

gemini-code-assist bot reviewed Sep 15, 2025

examples/offline_inference/vision_language.py (outdated)

examples/offline_inference/vision_language.py (outdated)

WoosukKwon changed the base branch from codex/remove-v0-encoder-decoder-model-support to main September 15, 2025 19:53

mergify bot removed the needs-rebase label Sep 15, 2025

Remove encoder-decoder

c66902d

Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>

WoosukKwon force-pushed the codex/remove-v0-encoder-decoder-model-support-htgyni branch from 848b568 to c66902d September 15, 2025 20:04

WoosukKwon changed the title ~~Restore Mllama4 and Skywork models~~ Remove V0 Encoder-Decoder Support Sep 15, 2025

WoosukKwon added the ready label Sep 15, 2025

WoosukKwon added this to V0 Deprecation Sep 15, 2025

fix ci

f9e4fc1

Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>

mergify bot added the ci/build label Sep 15, 2025

WoosukKwon added 3 commits September 15, 2025 23:58

remove florence

4ce2c58

Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>

revert

83cd3f4

Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>

remove llama 3.2

c3d4215

Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>

WoosukKwon merged commit 759ef49 into main Sep 16, 2025
82 checks passed

WoosukKwon deleted the codex/remove-v0-encoder-decoder-model-support-htgyni branch September 16, 2025 04:17

github-project-automation bot moved this to Done in V0 Deprecation Sep 16, 2025

hmellor mentioned this pull request Sep 16, 2025

[V0 Deprecation] Drop V0 encoder-decoder runner #23300

Closed

Isotr0py mentioned this pull request Sep 16, 2025

[Misc] Add removed encoder-decoder models to previously supported models list #24961

Merged

5 tasks

FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025

Remove V0 Encoder-Decoder Support (vllm-project#24907)

1abcc0a

Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>

mfournioux mentioned this pull request Sep 25, 2025

[Bug]: Donut model inference, CUDA out of memory #24971

Open

1 task
