Fix StaticLayer.get_seq_length return type annotation (#45987) by Sanjays2402 · Pull Request #46173 · huggingface/transformers

Sanjays2402 · 2026-05-23T19:26:01Z

StaticLayer.cumulative_length is initialized as a shape-(1,) torch.Tensor (so that in-place .add_() updates stay torch.compile-friendly), and StaticLayer.get_seq_length() returns it directly. The current -> int annotation lies about that.

Per @Rocketknight1's comment on the issue:

I think this is mostly just a type-hint bug, right? There isn't really much difference between a tensor with shape (1,) and a 0-dim tensor, they'll both broadcast with any other tensor and work with item() etc. Also, we generally want to avoid item() where possible because it causes a CUDA sync. So maybe just make the return type int | Tensor or something but leave the rest of the code alone?

This PR does exactly that — minimal annotation-only change:

CacheLayerMixin.get_seq_length abstract: -> int → -> int | torch.Tensor
StaticLayer.get_seq_length: -> int → -> int | torch.Tensor, plus a docstring note explaining why a tensor is returned.

No runtime behavior change. No .item() calls added. Other concrete get_seq_length overrides (DynamicLayer, StaticSlidingWindowLayer, QuantizedLayer, ...) already return ints and don't need touching.

The four earlier PRs for this issue (#46005, #46010, #46081, #45997) all attempted heavier rewrites that added .item() or restructured cumulative_length, which is the opposite of the requested fix. This PR sticks to the maintainer-specified shape.

Before submitting

This PR fixes a typing issue and links to the existing issue ([Bug] StaticCache.get_seq_length() returns shape-(1,) Tensor despite -> int contract #45987).
Did you make sure to update the documentation with your changes? (Docstring updated to reflect actual return type.)
Did you write any new necessary tests? Not needed — annotation-only change with no runtime behavior delta.

Who can review?

@Rocketknight1 (you commented on #45987 with the requested approach)

) `StaticLayer.cumulative_length` is a shape-(1,) `torch.Tensor`, so `StaticLayer.get_seq_length()` actually returns a `Tensor`, not the `int` its annotation promised. Per maintainer guidance on huggingface#45987, calling `.item()` to coerce would force a CUDA sync and isn't worth it when the tensor broadcasts identically to an int at every call site. The right fix is just to relax the type annotation. - Update `CacheLayerMixin.get_seq_length` abstract signature to `int | torch.Tensor`. - Update `StaticLayer.get_seq_length` to match, with a docstring note explaining why a tensor is returned.

github-actions · 2026-05-23T19:38:56Z

View the CircleCI Test Summary for this PR:

https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=46173&sha=ece191

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix StaticLayer.get_seq_length return type annotation (#45987)#46173

Fix StaticLayer.get_seq_length return type annotation (#45987)#46173
Sanjays2402 wants to merge 1 commit into
huggingface:mainfrom
Sanjays2402:fix/get-seq-length-return-type-45987

Sanjays2402 commented May 23, 2026

Uh oh!

github-actions Bot commented May 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Sanjays2402 commented May 23, 2026

Before submitting

Who can review?

Uh oh!

github-actions Bot commented May 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant