Flux2: Tensor tuples can cause issues for checkpointing#12777
dg845 merged 11 commits into huggingface:main
Conversation
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed, please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored.
Hi @dxqb, thanks for opening this PR and thanks for your patience! This change looks good to me. As mentioned in #12776 (comment), it would be nice to have a small script to reproduce/test this behavior.
No repro code, but it's now clear why this happens; it's documented by PyTorch: #12776 (comment)
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
dg845 left a comment:
Thanks for the changes! Can you solve the merge conflicts with main? I think they may be a result of #12524, which switches over to using Python 3.9+ style type hints without explicit typing imports, including in transformer_flux2.py.
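For context, the type-hint style the merge conflicts stem from is the PEP 585 built-in generics syntax, which works on Python 3.9+ without importing from `typing`. A minimal illustration (the function name and signature here are hypothetical, not taken from `transformer_flux2.py`):

```python
# Old style, needing explicit typing imports:
#   from typing import Tuple
#   def split_lengths(total: int, first: int) -> Tuple[int, int]: ...

# Python 3.9+ style (PEP 585): built-in generics, no typing import needed.
def split_lengths(total: int, first: int) -> tuple[int, int]:
    """Split a sequence length into a (first, remainder) pair."""
    return first, total - first

print(split_lengths(6, 2))  # (2, 4)
```

Note that `X | None` unions (PEP 604) additionally require Python 3.10, or `from __future__ import annotations` on 3.9.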
Done and tested using Nerogar/OneTrainer#1279
@bot /style
Style bot fixed some files and pushed the changes.
Merging as the CI failure is unrelated.
* upgrade diffusers for huggingface/diffusers#12777
* add preset
addresses #12776
What does this PR do?
This PR keeps the tuples, but moves the splitting of tensors into tuples of tensors into the transformer blocks, to avoid issues with checkpointing. When a tensor is passed directly, torch.utils.checkpoint() identifies the tensor and saves it accordingly, without running a backward pass through it multiple times.
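A minimal sketch of the pattern described above (the block function and split sizes are hypothetical, not the actual Flux2 code): the checkpointed function receives a plain tensor and performs the split into parts *inside* the checkpointed region, so `torch.utils.checkpoint` only ever saves a single tensor input.

```python
import torch
from torch.utils.checkpoint import checkpoint


def block(hidden_states: torch.Tensor, text_len: int) -> torch.Tensor:
    # Split into (text, image) parts inside the checkpointed function,
    # so checkpoint() sees one tensor input rather than a tuple of tensors.
    image_len = hidden_states.shape[1] - text_len
    text, image = hidden_states.split([text_len, image_len], dim=1)
    # Placeholder computation standing in for the real transformer block.
    return torch.cat([text * 2.0, image + 1.0], dim=1)


x = torch.randn(1, 6, 4, requires_grad=True)
# use_reentrant=False allows mixing tensor and non-tensor arguments.
out = checkpoint(block, x, 2, use_reentrant=False)
out.sum().backward()
print(x.grad.shape)  # gradients flow through the checkpointed split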
This is a draft; if you agree with this change, I can clean it up. Among other things:
Who can review?
@yiyixuxu and @asomoza