Deprecate AutoModelForVision2Seq #38900

Merged
merged 6 commits into from
Jul 14, 2025

Conversation

zucchini-nlp
Member

What does this PR do?

As the title says: we plan to remove AutoModelForVision2Seq eventually, so let's start raising deprecation warnings now. I already ask the authors of all new models not to use Vision2Seq.
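For readers unfamiliar with the pattern, here is a minimal sketch of what "start raising warnings" can look like. It is not the code from this PR: both classes below are hypothetical stand-ins that only share their names with the real auto classes, the `from_pretrained` stub loads nothing, and the choice of `FutureWarning` and the warning text are assumptions.

```python
import warnings


class AutoModelForImageTextToText:
    """Hypothetical stand-in for the recommended auto class."""

    @classmethod
    def from_pretrained(cls, checkpoint: str) -> str:
        # A real auto class would resolve and load a model here;
        # this stub only echoes the checkpoint name.
        return f"loaded {checkpoint}"


class AutoModelForVision2Seq(AutoModelForImageTextToText):
    """Hypothetical deprecated alias: still works, but warns on use."""

    @classmethod
    def from_pretrained(cls, checkpoint: str) -> str:
        # Assumed warning category and wording, for illustration only.
        warnings.warn(
            "AutoModelForVision2Seq is deprecated and will be removed in a "
            "future release; use AutoModelForImageTextToText instead.",
            FutureWarning,
            stacklevel=2,  # point the warning at the caller's line
        )
        return super().from_pretrained(checkpoint)
```

With this shape, existing callers keep working while seeing a warning, and switching to the recommended class silences it without any other change.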

@zucchini-nlp zucchini-nlp requested a review from Cyrilvallez July 1, 2025 12:54
@Cyrilvallez
Member

I don't remember all the discussions around this, but is ImageTextToText the option that will be fully future-proof? So we aren't going with ForCausalLM in the end?

@zucchini-nlp
Member Author

I don't think we'll remove ImageTextToText in the near future, though there were discussions about unifying everything under one umbrella mapping, "AutoForMultimodal". I haven't started working on the new auto class yet, but I think it will be a new auto-mapping with these properties:

  1. It unifies all generative models, either under one class or under two classes (one for text and the other for multimodality).
  2. Its main goal is to clean up garbage, repetitive code. We don't plan to delete the existing auto classes, since that might break a lot of external libraries that depend on us. Until the new auto class is added, we'll have a surge of new VLMs added to the mapping, which makes it harder to just remove it.

Personally, I'm in favor of keeping ImageTextToText as the recommended mapping for a while.
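To make the two points above concrete, here is one possible shape for such an umbrella mapping, purely as an illustration. Nothing in this thread specifies the design: the registry name, the `view` helper, and the two derived mapping names are all hypothetical, and the model class names appear only as plain strings.

```python
from typing import Dict, Tuple

# Hypothetical umbrella registry: model type -> (modality, class name).
# Each generative model is registered once instead of once per auto class.
GENERATIVE_MODELS: Dict[str, Tuple[str, str]] = {
    "llama": ("text", "LlamaForCausalLM"),
    "llava": ("multimodal", "LlavaForConditionalGeneration"),
}


def view(modality: str) -> Dict[str, str]:
    """Derive a per-modality mapping from the single registry."""
    return {
        model_type: class_name
        for model_type, (kind, class_name) in GENERATIVE_MODELS.items()
        if kind == modality
    }


# Existing mappings are kept as derived views rather than deleted, so
# external code that relies on them would keep working (assumed layout).
CAUSAL_LM_MAPPING = view("text")
IMAGE_TEXT_TO_TEXT_MAPPING = view("multimodal")
```

The point of the sketch is only that a single source of truth can remove the repetition while leaving the older entry points in place.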

Member

@Cyrilvallez Cyrilvallez left a comment

LGTM!

Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto

@zucchini-nlp zucchini-nlp merged commit 878d60a into huggingface:main Jul 14, 2025
25 checks passed
rjgleaton pushed a commit to rjgleaton/transformers that referenced this pull request Jul 17, 2025
3 participants