
[Modeling] Load FP8 safetensors such as DeepSeek #36828

Merged 2 commits into huggingface:main on Mar 27, 2025

Conversation

kylesayrs
Contributor

@kylesayrs kylesayrs commented Mar 19, 2025

Purpose

  • Support loading safetensors in FP8

Changes

  • Add F8_E4M3 to the str_to_torch_dtype mapping used by the safetensors loading logic
  • Raise ValueError("Cannot load safetensors of unknown dtype {k_dtype}") if the string lookup fails
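The two changes above follow a simple lookup-plus-error pattern, which can be sketched as below. This is an illustrative sketch, not the actual transformers code: in transformers the mapping values are torch dtypes (e.g. torch.float8_e4m3fn), while plain strings are used here so the sketch runs without torch installed, and resolve_dtype is a hypothetical helper name.

```python
# Illustrative sketch of the str_to_torch_dtype lookup described above.
# In transformers the values are torch dtypes (e.g. torch.float8_e4m3fn);
# strings stand in for them here so the sketch is self-contained.
str_to_torch_dtype = {
    "F32": "float32",
    "F16": "float16",
    "BF16": "bfloat16",
    "F8_E4M3": "float8_e4m3fn",  # the entry this PR adds
}

def resolve_dtype(k_dtype: str) -> str:
    # Fail loudly on unknown dtype strings instead of silently mis-loading.
    if k_dtype not in str_to_torch_dtype:
        raise ValueError(f"Cannot load safetensors of unknown dtype {k_dtype}")
    return str_to_torch_dtype[k_dtype]
```

The explicit ValueError is the second change: before this PR, an unrecognized dtype string would fail further down the stack with a less actionable error.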

Testing

  • Load DeepSeek-V3, whose weights are stored as FP8 safetensors:
from transformers import AutoModelForCausalLM

model_id = "deepseek-ai/DeepSeek-V3"
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
    torch_dtype="auto",
    trust_remote_code=True,
)

Reviewers

@ArthurZucker

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
@github-actions github-actions bot marked this pull request as draft March 19, 2025 15:30

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. When it is ready for review, please click the Ready for review button (at the bottom of the PR page).

@kylesayrs kylesayrs changed the title support loading fp8 [Modeling] Load FP8 safetensors such as DeepSeek Mar 19, 2025
@kylesayrs kylesayrs closed this Mar 19, 2025
@kylesayrs kylesayrs reopened this Mar 19, 2025
@kylesayrs kylesayrs marked this pull request as ready for review March 19, 2025 20:07
Collaborator

@ArthurZucker ArthurZucker left a comment


LGTM, are there no requirements (like CUDA-specific requirements)?

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@kylesayrs
Contributor Author

@ArthurZucker Although fp8 operations have always been somewhat limited and hardware-dependent (GPU vs CPU), there is no hardware requirement for loading fp8 tensors beyond the required torch version.
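A minimal way to express that torch-version requirement as a runtime check is sketched below. The helper name is hypothetical (not part of transformers), and the check assumes the relevant constraint is simply whether the installed torch build exposes the FP8 dtype attribute, which was added in relatively recent torch releases.

```python
# Hypothetical helper: loading F8_E4M3 safetensors only requires a torch
# build that exposes the float8_e4m3fn dtype attribute; no specific GPU
# hardware is needed to merely load the tensors.
def supports_fp8_loading(torch_module) -> bool:
    return hasattr(torch_module, "float8_e4m3fn")
```

The helper takes the module as an argument rather than importing torch directly, so it can also be exercised against stand-in objects.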

@ArthurZucker
Collaborator

Will just update the branch to have the CI run!

@ArthurZucker ArthurZucker merged commit d6d930a into huggingface:main Mar 27, 2025
18 checks passed
@ArthurZucker
Collaborator

Thanks @kylesayrs 🤗
