Conversation

Cyrilvallez (Member)

What does this PR do?

As per the title. We can rely on the scalar self.cumulative_length instead of the tensor cache_position[0], as introduced in #40893. This is much better for downstream masking and compilation support.
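To illustrate the distinction, here is a minimal, self-contained sketch (not the actual transformers implementation; `ToyCache` and `causal_mask` are hypothetical names) of how tracking the already-processed length as a host-side Python int, rather than slicing it out of a `cache_position` tensor, keeps mask construction free of device syncs and data-dependent values, which is what makes it friendlier to torch.compile:

```python
import torch


class ToyCache:
    """Toy KV-cache stand-in that tracks the total number of cached tokens
    as a plain Python int rather than reading it back from a tensor."""

    def __init__(self):
        self.cumulative_length = 0  # scalar, lives on the host

    def update(self, key_states: torch.Tensor) -> None:
        # key_states: (batch, num_heads, seq_len, head_dim)
        self.cumulative_length += key_states.shape[-2]


def causal_mask(query_len: int, past_len: int, device=None) -> torch.Tensor:
    """Build a (query_len, past_len + query_len) boolean causal mask.

    Because past_len is a Python int, all shapes here are static from the
    point of view of torch.compile -- no device sync, no data-dependent value."""
    q_idx = torch.arange(query_len, device=device).unsqueeze(-1)   # (query_len, 1)
    k_idx = torch.arange(past_len + query_len, device=device)      # (total_len,)
    return k_idx <= q_idx + past_len                               # broadcast to (query_len, total_len)


cache = ToyCache()
cache.update(torch.randn(1, 8, 4, 64))            # cache 4 tokens
mask = causal_mask(query_len=2, past_len=cache.cumulative_length)
print(mask.int())                                  # 2 x 6 causal mask over past + current tokens
```

The real cache and mask utilities are more involved; the point is only that a host-side int can be consumed directly by Python-level shape logic, whereas reading the same value out of a tensor element would need a `.item()` call or introduce a graph break under compilation.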

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@Cyrilvallez Cyrilvallez merged commit 0c1839d into main Sep 16, 2025
22 of 24 checks passed
@Cyrilvallez Cyrilvallez deleted the align-mask-primitives branch September 16, 2025 10:49
ErfanBaghaei pushed a commit to ErfanBaghaei/transformers that referenced this pull request Sep 25, 2025
vijayabhaskar-ev pushed a commit to vijayabhaskar-ev/transformers that referenced this pull request Oct 2, 2025
yuchenxie4645 pushed a commit to yuchenxie4645/transformers that referenced this pull request Oct 4, 2025