
Safetensors offload #20321

Merged: 6 commits merged from safetensors_offload into main on Nov 28, 2022
Conversation

@sgugger (Collaborator) commented Nov 18, 2022

What does this PR do?

This PR makes offload to disk more efficient for models that have a checkpoint in safetensors format: instead of re-saving everything as NumPy memory-mapped arrays, it directly exploits the fact that we can access a tensor in the checkpoint without loading the rest.

Goes with huggingface/accelerate#873
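
For context, this is the kind of lazy, per-tensor access safetensors makes possible. A minimal sketch, with the file name and tensor name invented for illustration:

import torch
from safetensors import safe_open

# Opening a safetensors file only parses its header (tensor names,
# shapes, dtypes, byte offsets); no weight data is read yet.
with safe_open("model-00001-of-00002.safetensors", framework="pt", device="cpu") as f:
    # Read exactly one tensor's bytes from disk, leaving the rest untouched.
    weight = f.get_tensor("decoder.layers.0.self_attn.q_proj.weight")

Because each weight can be fetched individually like this, the original checkpoint file can serve as the on-disk offload store itself.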

@HuggingFaceDocBuilderDev commented Nov 18, 2022

The documentation is not available anymore as the PR was closed or merged.

@@ -28,7 +28,7 @@

import torch
from packaging import version
from torch import Tensor, device, nn
@sgugger (Collaborator, Author) commented:

This import was dangerous with so many local variables named device. It was only used in a type annotation.
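
To illustrate the hazard, a contrived sketch (the function and variable names are invented):

import torch
from torch import device  # risky: creates a module-level name `device`

def to_first_param_device(module, x):
    # If the local assignment below is ever removed or misspelled,
    # `x.to(device)` silently picks up the imported torch.device class
    # instead of raising a NameError.
    device = next(module.parameters()).device
    return x.to(device)

Dropping the bare import and writing the annotation as torch.device keeps the behavior while removing the ambiguity.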

Comment on lines +2539 to +2543
offload_index = {
p: {"safetensors_file": os.path.join(folder, f), "weight_name": p, "dtype": str_dtype}
for p, f in sharded_metadata["weight_map"].items()
if param_device_map[p] == "disk"
}
@sgugger (Collaborator, Author) commented:

We build the offload_index here when the checkpoint is in safetensors format; this way we can skip loading all the shards and save time.
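
For illustration, an entry built this way can later be resolved with a single partial read. A rough sketch (the helper name and example values are hypothetical; the real consumption happens on the accelerate side, see huggingface/accelerate#873):

from safetensors import safe_open

def load_offloaded_tensor(entry):
    # The entry points directly into the original checkpoint shard, so the
    # weight never needs to be re-saved as a NumPy memory-mapped array.
    with safe_open(entry["safetensors_file"], framework="pt", device="cpu") as f:
        return f.get_tensor(entry["weight_name"])

entry = {
    "safetensors_file": "offload_folder/model-00001-of-00002.safetensors",
    "weight_name": "decoder.layers.0.mlp.fc1.weight",
    "dtype": "float16",
}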

@LysandreJik (Member) left a comment:

Great, very cool addition! Should make a lot of things simpler

@sgugger sgugger merged commit 3016392 into main Nov 28, 2022
@sgugger sgugger deleted the safetensors_offload branch November 28, 2022 15:35
amyeroberts pushed a commit to amyeroberts/transformers that referenced this pull request Dec 7, 2022
* Integrate safetensors in weight offloading

* Use safetensors checkpoint for offload when available

* Make naming consistent

* Make load faster

* Quality

* Add default
mpierrau pushed a commit to mpierrau/transformers that referenced this pull request Dec 15, 2022, with the same commit list as above.