Fix TypeError when loading float8 models by falling back to bfloat16 in local_torch_dtype by Desel72 · Pull Request #44596 · huggingface/transformers

Desel72 · 2026-03-11T13:03:19Z

Fix TypeError when loading float8 models by falling back to bfloat16 in local_torch_dtype

What does this PR do?

When loading FP8 models (e.g. Qwen/Qwen3.5-35B-A3B-FP8) with dtype="auto", the auto-detected dtype from checkpoint weights can be torch.float8_e4m3fn. This dtype flows to local_torch_dtype() which calls torch.set_default_dtype(), but PyTorch does not support float8 types as default dtype, causing:
TypeError: couldn't find storage object Float8_e4m3fnStorage

This happens when:

The top-level config has no dtype set (common with composite models where dtype is only in a sub-config)
_get_dtype() auto-detects torch.float8_e4m3fn from the checkpoint weights
FineGrainedFP8HfQuantizer doesn't override update_dtype(), so it can't intercept this

This PR adds a check in local_torch_dtype() to fall back to torch.bfloat16 when a float8 dtype is encountered. This only affects model skeleton initialization (set_default_dtype); actual float8 weights are still loaded correctly downstream via _load_pretrained_model.

Also adds a unit test to verify the fallback behavior for both float8_e4m3fn and float8_e5m2.

Fixes #44589

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case. TypeError: couldn't find storage object Float8_e4m3fnStorage #44589
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@Cyrilvallez (model loading / from_pretrained)
@SunMarc (quantization)

…in local_torch_dtype

Desel72 · 2026-03-11T13:42:03Z

Hi @Rocketknight1
Could you please share the reasons they were closed and what I should update to move toward merging?

Rocketknight1 · 2026-03-11T13:56:29Z

We're trying to avoid pure code agent PRs right now and working on formalizing a policy against them. The main reason is simply that we're able to run our own code agents if we need to - users running them on random issues just adds a useless middleman.

Desel72 · 2026-03-11T14:00:39Z

Thanks for your reply. Is there a any way to become a merged PR?

Fix TypeError when loading float8 models by falling back to bfloat16 …

4279fae

…in local_torch_dtype

Rocketknight1 closed this Mar 11, 2026

Rocketknight1 added the Code agent slop label Mar 11, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix TypeError when loading float8 models by falling back to bfloat16 in local_torch_dtype#44596

Fix TypeError when loading float8 models by falling back to bfloat16 in local_torch_dtype#44596
Desel72 wants to merge 1 commit into
huggingface:mainfrom
Desel72:fix/issue-#44589

Desel72 commented Mar 11, 2026

Uh oh!

Desel72 commented Mar 11, 2026

Uh oh!

Rocketknight1 commented Mar 11, 2026

Uh oh!

Desel72 commented Mar 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Desel72 commented Mar 11, 2026

What does this PR do?

Before submitting

Who can review?

Uh oh!

Desel72 commented Mar 11, 2026

Uh oh!

Rocketknight1 commented Mar 11, 2026

Uh oh!

Desel72 commented Mar 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants