ALM base model class by eustlb · Pull Request #45534 · huggingface/transformers

eustlb · 2026-04-20T15:09:17Z

What does this PR do?

Fix a discrepancy in ALMs design compared to VLMs that for most of them don't have a base model class because the use the causal model for the text model directly, while Llava style (which is indeed more aligned with the lib philosophy) is rather:
XxModel: encoder + projector + language base model backbone
XxForConditionalGeneration: XxModel + lm_head

This is motivated to simplify changes required for vLLM compatibility as seen in #39330

how to ensure this is BC?

weight loading: thanks to the dynamic weight loader and conversion mapping
TODO further verifs

HuggingFaceDocBuilderDev · 2026-04-20T15:19:23Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

eustlb added 4 commits April 20, 2026 16:59

add a base model class

4cd44e4

ensure BC via conversion mapping

79b833b

auto classes

57436db

test updates

1bd269c

eustlb mentioned this pull request Apr 20, 2026

feat[vLLM × v5]: Add audio support for the Transformers backend vllm-project/vllm#39330

Draft

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ALM base model class#45534

ALM base model class#45534
eustlb wants to merge 4 commits intomainfrom
alm-base-model-class

eustlb commented Apr 20, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Apr 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

eustlb commented Apr 20, 2026

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Apr 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants