
[Flash Attention] Disable packed sequences with pos ids only during torch compile #41827

Draft
vasqu wants to merge 2 commits into huggingface:main from vasqu:fix-fa-pos-ids-compile

Conversation

@vasqu
Contributor

@vasqu vasqu commented Oct 23, 2025

Draft, intended only as a reference for what could be done. It would allow a full graph compile when no attention mask is used.

Compile support (before vs. after this PR):

  • Bsz 1
    • No mask
      • Before: No full graph, recompilations
      • After: Full graph
    • Attn mask
      • Before: No full graph, recompilations
      • After: No full graph, recompilations
    • Pos ids, no mask
      • Before: No full graph, recompilations
      • After: Not supported; silently wrong computations if sequences are packed
    • Fa kwargs, no mask
      • Before: Full graph
      • After: Full graph
  • Bsz > 1
    • No mask
      • Before: Full graph
      • After: Full graph
    • Attn mask
      • Before: Same as bsz 1
      • After: Same as bsz 1

TL;DR: the core changes are

  • No attn mask: full graph support, vs. recompilations and no full graph before (bsz == 1)
  • Position ids but no attn mask: not supported under compile, vs. recompilations and no full graph before
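To illustrate why the position-ids-only packed path is incompatible with a full graph: inferring packing from position ids requires a data-dependent check on tensor values, which forces graph breaks and recompilations under torch.compile. The sketch below is a hypothetical simplification, not the actual transformers implementation; the function names (`is_packed_sequence`, `prepare_fa_kwargs`) and the explicit `is_compiling` flag are assumptions for illustration.

```python
import torch

def is_packed_sequence(position_ids: torch.Tensor) -> bool:
    """Heuristic: a position id that resets to 0 after index 0 signals that
    several sequences were packed into one row.

    Converting the tensor result to a Python bool is a data-dependent
    operation, which is what breaks full-graph torch.compile."""
    # position_ids: shape (batch, seq_len)
    return bool((position_ids[:, 1:] == 0).any())

def prepare_fa_kwargs(position_ids: torch.Tensor, is_compiling: bool):
    """Return cumulative sequence lengths for the varlen FA path, or None
    for the plain (non-packed) path.

    Under compile, the data-dependent packing check is skipped entirely so
    the graph stays full; packed inputs then silently take the non-packed
    path, matching the "not supported" behavior described above."""
    if is_compiling or not is_packed_sequence(position_ids):
        return None  # plain flash attention path
    # Eager-only: derive cumulative sequence boundaries from the reset points.
    flat = position_ids.flatten()
    starts = torch.nonzero(flat == 0).flatten()
    ends = torch.cat([starts[1:], torch.tensor([flat.numel()])])
    return torch.cat([starts[:1], ends]).to(torch.int32)
```

For example, `position_ids = [[0, 1, 2, 0, 1]]` (two packed sequences of lengths 3 and 2) yields boundaries `[0, 3, 5]` in eager mode, while under compile the same input returns `None` and is treated as a single unpacked sequence.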

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

