hooks/pyramid_attention_broadcast: fix redundant recompute at iteration 0 and free stale cache when outside timestep range #13467
Conversation
@sayakpaul @DN6, checking in on this! I've updated the branch to stay current with the base. Could you please take a look and also approve the workflows so the CI tests can run?
@claude could you do a first round of reviews?
Claude finished @sayakpaul's task in 1m 27s. Review of PR #13467
Thanks for the PR. I walked through the logic for both changes; here's my review. Change 1: Removing
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Any reason why this was closed?
@sayakpaul So sorry about that, I accidentally closed it myself while
What does this PR do?
Fixes two bugs in `PyramidAttentionBroadcastHook.new_forward`:

1. Redundant `iteration == 0` condition: `self.state.cache is None` already covers the first-call case after every `reset_state`, making the extra guard dead code that creates a misleading impression of two independent invariants.
2. Stale cache leaking GPU VRAM: when outside the active timestep range, the hook was still writing `self.state.cache = output`, holding a full hidden-state activation tensor on GPU until the next generation's `reset_state` call. For video transformers with dozens of PAB-hooked layers this accumulates hundreds of MBs of unreleased VRAM. The fix sets `self.state.cache = None` immediately when outside the range.

Fixes # (issue)
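To make the two fixes concrete, here is a minimal, self-contained sketch of the corrected branch logic. This is not the actual diffusers implementation; the `PABHookSketch` class, its `State` container, and the `compute_fn` callback are simplified stand-ins for illustration, assuming the control flow described above.

```python
class State:
    """Per-hook state, mirroring the cache/iteration fields described above."""
    def __init__(self):
        self.cache = None
        self.iteration = 0

class PABHookSketch:
    """Hypothetical stand-in for PyramidAttentionBroadcastHook.new_forward."""
    def __init__(self, timestep_range):
        self.state = State()
        self.timestep_range = timestep_range  # (low, high), inclusive

    def new_forward(self, timestep, compute_fn):
        low, high = self.timestep_range
        in_range = low <= timestep <= high
        if not in_range:
            # Bug 2 fix: drop any stale cached activation immediately so the
            # tensor can be freed, instead of pinning VRAM until reset_state.
            self.state.cache = None
            self.state.iteration += 1
            return compute_fn()
        # Bug 1 fix: `cache is None` alone covers the first call after
        # reset_state; no redundant `iteration == 0` guard is needed.
        if self.state.cache is None:
            output = compute_fn()
            self.state.cache = output
        else:
            output = self.state.cache  # broadcast the cached output
        self.state.iteration += 1
        return output
```

A short usage walk-through: outside the range the hook always recomputes and keeps no cache; inside the range it computes once, caches, and reuses the cached output on subsequent calls.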
Before submitting
Who can review?
@yiyixuxu @sayakpaul @DN6