Adding support for GraniteDoclingHybrid by gabe-l-hart · Pull Request #44445 · huggingface/transformers

gabe-l-hart · 2026-03-04T20:54:17Z

What does this PR do?

This PR adds support for the forthcoming Granite Docling model based on the Granite 4 LLM architecture (GraniteMoeHybrid).

Draft Status

This PR is in draft pending the possibility of some additional changes:

Finalizing the vision projector
Finalizing the name as GraniteDoclingHybrid (versus eg GraniteMoeHybridDocling or similar)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Models:

vision models: @yonigozlan @molbap
multimodal models: @zucchini-nlp

zucchini-nlp

Nice work! Left a few comments so we are aligned with recent changes in v5, otherwise the PR is in good shape

src/transformers/models/auto/modeling_auto.py

src/transformers/models/granite_docling_hybrid/configuration_granite_docling_hybrid.py

src/transformers/models/granite_docling_hybrid/modular_granite_docling_hybrid.py

src/transformers/models/granite_docling_hybrid/processing_granite_docling_hybrid.py

zucchini-nlp · 2026-03-05T09:34:40Z

src/transformers/models/granite_docling_hybrid/processing_granite_docling_hybrid.py

+            **kwargs,
+        )
+
+        image_seq_len = image_seq_len if image_seq_len is not None else self.image_seq_len


am I right that this line is the only diff from Idefics3? If yes, I'd prefer to just copy completely from ideifcs because making image_seq_len an arg isn't very useful

The real diff is below where the GotOcr2 logic is swapped for the Idefics3 logic. I think this may be one of the TBD architectural decisions, so if the team decides to switch back to Idefics3, this ould all go away.

HuggingFaceDocBuilderDev · 2026-03-10T10:07:30Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

This is untested, but passes code generation correctly now Branch: GraniteDoclingHybrid AI-usage: draft (IBM Bob) Signed-off-by: Gabe Goodhart <ghart@us.ibm.com> Co-authored-by: omenetti.matteo@gmail.com Co-authored-by: nassarofficial@gmail.com

Branch: GraniteDoclingHybrid AI-usage: full (IBM Bob) Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

…s after v5 rebase Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

This was getting injected in Idefics3ProcessorKwargs and causing errors since it's not a valid kwarg in the base ImageProcessorKwargs and somehow that's what was being validated against (versus Idefics3ImageProcessorKwargs) Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

This all lives in `prepare_inputs_for_generation` Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

This was recommended in review. Also, there was already a bug where self.image_seq_len was being used for the real computation, so this kwarg was never fully supported. Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

github-actions · 2026-03-11T19:19:39Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto

zucchini-nlp reviewed Mar 5, 2026

View reviewed changes

gabe-l-hart force-pushed the GraniteDoclingHybrid branch 2 times, most recently from f2b4745 to a16c63c Compare March 9, 2026 21:11

gabe-l-hart added 23 commits March 11, 2026 12:57

feat: Register granite_docling_hybrid in all auto registries

20a6df0

Branch: GraniteDoclingHybrid AI-usage: full (IBM Bob) Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

fix: Add missing config forwarding for underlying text model

2c6094b

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

fix: Fix how image_hidden_states are extracted from get_image_feature…

11d8217

…s after v5 rebase Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

chore: regen modeling

f4b8f26

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

style: Linting fixes

f166660

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

fix: No inline imports

43b3c7b

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

fix: Remove unnecessary entry in MODEL_FOR_PRETRAINING_MAPPING_NAMES

9629b3c

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

style: Fix copyright headers year

4c596a6

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

fix: Remove dead/unused code/docstrings from review

d4d327a

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

feat: Remove return_dict plumbing

00ce088

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

feat: Consolidate config + processing into modular

1848d83

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

chore: regen from modular after consolidation

4b73a3e

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

fix: Remove cache initialization logic in forward

1e5f7bd

This all lives in `prepare_inputs_for_generation` Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

fix: Remove unnecessary class-attrs for GraniteDoclingHybridProcessor

1945128

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

fix: Use auto_docstring for GraniteDoclingHybridProcessor

c99c53c

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

feat: Use modern image fetching

5122f17

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

chore: redo codegen

2d72415

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

chore: Redo codegen

1ed97ab

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

fix: Correctly update for no more cache_position

bee7b27

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

chore: regen from modular

f82599a

Branch: GraniteDoclingHybrid AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

gabe-l-hart force-pushed the GraniteDoclingHybrid branch from 1bf67fb to f82599a Compare March 11, 2026 19:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding support for GraniteDoclingHybrid#44445

Adding support for GraniteDoclingHybrid#44445
gabe-l-hart wants to merge 23 commits intohuggingface:mainfrom
gabe-l-hart:GraniteDoclingHybrid

gabe-l-hart commented Mar 4, 2026

Uh oh!

zucchini-nlp left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

zucchini-nlp Mar 5, 2026

Uh oh!

gabe-l-hart Mar 5, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Mar 10, 2026

Uh oh!

github-actions bot commented Mar 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

gabe-l-hart commented Mar 4, 2026

What does this PR do?

Draft Status

Before submitting

Who can review?

Uh oh!

zucchini-nlp left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

zucchini-nlp Mar 5, 2026

Choose a reason for hiding this comment

Uh oh!

gabe-l-hart Mar 5, 2026

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Mar 10, 2026

Uh oh!

github-actions bot commented Mar 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants