
fix: Cache XLNet relative_positional_encoding to avoid CPU computation#44762

Closed
BillionClaw wants to merge 1 commit into huggingface:main from BillionClaw:clawoss/fix/xlnet-relative-positional-encoding-device

Conversation


@BillionClaw BillionClaw commented Mar 16, 2026

XLNet's relative_positional_encoding method creates its intermediate tensors on CPU on every forward pass because the torch.arange calls omitted the device parameter. This causes unnecessary CPU-to-GPU transfers when running on CUDA.

Added device=self.device to all four torch.arange calls in the method.

Fixes #44737
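A minimal sketch of the pattern this PR fixes, using a simplified stand-in for XLNet's relative_positional_encoding (function name and shapes here are illustrative, not the exact upstream code): without device=, torch.arange allocates on CPU and the result must be copied to the GPU each forward pass; passing device= creates the tensors in place.

```python
import torch

def relative_positional_encoding_sketch(qlen, klen, d_model, device):
    # Simplified stand-in for XLNet's relative_positional_encoding.
    # Before the fix: torch.arange(...) defaulted to CPU, so the resulting
    # positional embedding had to be transferred to the GPU every forward pass.
    # After the fix: device= is passed so tensors are created on the target
    # device from the start.
    freq_seq = torch.arange(0, d_model, 2.0, dtype=torch.float, device=device)
    inv_freq = 1.0 / (10000 ** (freq_seq / d_model))
    # Relative positions from klen down to -qlen (exclusive), on-device.
    pos_seq = torch.arange(klen, -qlen, -1.0, dtype=torch.float, device=device)
    sinusoid = torch.einsum("i,d->id", pos_seq, inv_freq)
    # Concatenate sin and cos halves to form the positional embedding.
    pos_emb = torch.cat([torch.sin(sinusoid), torch.cos(sinusoid)], dim=-1)
    return pos_emb

emb = relative_positional_encoding_sketch(4, 4, 8, torch.device("cpu"))
print(emb.shape)  # torch.Size([8, 8])
```

With device=torch.device("cuda"), every intermediate tensor lives on the GPU and no host-to-device copy is needed.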

…al_encoding

The relative_positional_encoding method was creating tensors on CPU every
forward pass because torch.arange was not using the device parameter.
This caused unnecessary CPU-GPU transfers when running on CUDA.

Fixes huggingface#44737
@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: xlnet



Development

Successfully merging this pull request may close these issues.

XLNet: relative_positional_encoding computes on CPU every forward pass (missing device= in torch.arange)
