
Conversation

@gante (Member) commented Sep 10, 2025

What does this PR do?

🎯 part of the effort to enforce better standardization

We have been migrating past_key_values from the old tuple of tuples of tensors format to the new Cache format. However, many type hints and docstrings were not updated accordingly -- our users are getting incorrect information from these annotations 😮
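
To make the change concrete, here is a minimal sketch of the two formats (a hedged example; the `gpt2` checkpoint is just an illustration, any decoder-only model with the new cache support behaves the same way):

```python
import torch

from transformers import AutoModelForCausalLM, AutoTokenizer, DynamicCache

model = AutoModelForCausalLM.from_pretrained("gpt2")
inputs = AutoTokenizer.from_pretrained("gpt2")("Hello", return_tensors="pt")

# Legacy format (what the stale annotations described): a tuple of
# `config.n_layers` tuples, each holding a key and a value tensor of shape
# (batch_size, num_heads, sequence_length, embed_size_per_head).

# New format (what this PR documents): a `Cache` instance, e.g. `DynamicCache`,
# which is both accepted as input and returned in the model output.
with torch.no_grad():
    out = model(**inputs, past_key_values=DynamicCache(), use_cache=True)
assert isinstance(out.past_key_values, DynamicCache)
```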

This PR aims to reduce incorrect information. A few notes:

  • I heavily relied on bulk changes and haven't double-checked all touched models to confirm they support `Cache`, the base class (as opposed to models like mamba). Nevertheless, even if there are a few inconsistencies, these models were previously annotated with the legacy format -- they are either models we didn't update due to low impact (and will likely deprecate soon), or their type hint was already incorrect to begin with 🤗
  • deprecated models also received bulk changes; I don't think it's worth manually reverting them 🙈
  • encoder-decoder models could have a more precise type hint and docs; I'll leave that for a future round. The updated docstring is correct for them as well.

Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: aria, autoformer, aya_vision, bark, bart, bert, bert_generation, big_bird, bigbird_pegasus, biogpt, blenderbot, blenderbot_small, blip, bridgetower, camembert, chameleon

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@zucchini-nlp (Member) left a comment

I didn't go over all files, but I guess they're all the same. Thanks a lot for updating these; hopefully it will decrease the number of cache-related issues we get from users

  past_key_values (`Cache`, *optional*, returned when `use_cache=True` is passed or when `config.use_cache=True`):
-     Tuple of `tuple(torch.FloatTensor)` of length `config.n_layers`, with each tuple having 2 tensors of shape
-     `(batch_size, num_heads, sequence_length, embed_size_per_head)`)
+     It is a [`~cache_utils.Cache`] instance. For more details, see our [kv cache guide](https://huggingface.co/docs/transformers/en/kv_cache).
Member

Thanks, my previous bulk-update logic was incorrect and apparently left out a lot 😆

  cross_attn_layer_head_mask (`torch.FloatTensor`): mask for cross-attention heads in a given layer of
      size `(decoder_attention_heads,)`.
- past_key_values (`Tuple(torch.FloatTensor)`): cached past key and value projection states
+ past_key_values (`Cache`): cached past key and value projection states
Member

I realized the description varies a lot from model to model. Would be nice to consolidate them all with the `auto_docstring` decorator. I like the cache docs in there; it has more details about correct usage

Member Author

Agreed!

I also noticed that we don't have perfect inheritance on some output classes (e.g. VLM output classes), which leads to redundant docstrings. I started changing them but got errors -- I decided not to do it in this PR, to keep the two different changes isolated. More auto-docstrings are something we can definitely work on.

  encoder_attention_mask: Optional[torch.Tensor] = None,
  labels: Optional[torch.Tensor] = None,
- past_key_values: Optional[list[torch.Tensor]] = None,
+ past_key_values: Optional[Cache] = None,
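
For context, a minimal sketch of what the updated annotation looks like on a forward signature (the class and the other arguments here are illustrative, not a specific model from the PR):

```python
from typing import Optional

import torch
from transformers.cache_utils import Cache


class IllustrativeDecoder(torch.nn.Module):
    def forward(
        self,
        input_ids: Optional[torch.LongTensor] = None,
        encoder_attention_mask: Optional[torch.Tensor] = None,
        labels: Optional[torch.Tensor] = None,
        # Previously annotated as Optional[list[torch.Tensor]]; callers now
        # pass a `Cache` instance (or None) instead of raw tensors.
        past_key_values: Optional[Cache] = None,
    ):
        ...
```
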
Member

Small note on some pretrained model classes: we technically support the old cache format on them until v4.58, and then convert it to the new cache format in the base model. Though it would be painful to revert only these classes, so I guess we can keep them as is
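
(For reference, the conversion this refers to can be done with the existing `DynamicCache` helpers; a small sketch, with made-up tensor shapes:)

```python
import torch

from transformers import DynamicCache

# Legacy format: per-layer (key, value) pairs, here 2 layers with
# (batch_size=1, num_heads=4, seq_len=3, head_dim=8) tensors.
legacy = tuple(
    (torch.zeros(1, 4, 3, 8), torch.zeros(1, 4, 3, 8)) for _ in range(2)
)

cache = DynamicCache.from_legacy_cache(legacy)  # accept the old format
roundtrip = cache.to_legacy_cache()             # convert back if needed
assert torch.equal(roundtrip[0][0], legacy[0][0])
```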

Member Author

yeah, I thought it would be best to only document the non-deprecated case, to save us work 🤗

@Cyrilvallez (Member) left a comment

Thanks a lot! Very soon, we won't see anything related to those tuples anymore 😍
Merging as we still have those flaky tests!

@Cyrilvallez merged commit 93f810e into huggingface:main Sep 15, 2025
21 of 23 checks passed
@gante deleted the kv_type_docstring branch September 15, 2025 13:22
ErfanBaghaei pushed a commit to ErfanBaghaei/transformers that referenced this pull request Sep 25, 2025
…alues` (huggingface#40803)

* some fixes

* nits

* indentation

* indentation

* a bunch of type hints

* bulk changes
vijayabhaskar-ev pushed a commit to vijayabhaskar-ev/transformers that referenced this pull request Oct 2, 2025
…alues` (huggingface#40803)

yuchenxie4645 pushed a commit to yuchenxie4645/transformers that referenced this pull request Oct 4, 2025
…alues` (huggingface#40803)