Make safetensors the default #2120

muellerzr · 2023-11-03T17:41:27Z

What does this PR do?

Similar to transformers, makes safetensors the default and a library requirement 🤗

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@LysandreJik @SunMarc @BenjaminBossan

HuggingFaceDocBuilderDev · 2023-11-03T17:46:54Z

The documentation is not available anymore as the PR was closed or merged.

LysandreJik

Looks great! Something that we're doing in transformers as well that you might want to do here, is to continue testing for safe_serialization=False in the common pathways.

By removing testing of the serialization of pytorch_model.bin (which we're doing as the default isn't explicitly defined in the tests and the default just changed), we're risking breaking that code without noticing.

You implemented another test, do you think maybe others would need implementing? That's what we did on transformers side (not merged yet): huggingface/transformers#27242

BenjaminBossan

In general, this LGTM. I have a few questions for my understanding, the implementation looks clean though.

The whole of fsdp_utils.py still relies on torch.save, should this also be adjusted?

src/accelerate/checkpointing.py

src/accelerate/accelerator.py

src/accelerate/checkpointing.py

BenjaminBossan · 2023-11-06T16:01:31Z

By removing testing of the serialization of pytorch_model.bin (which we're doing as the default isn't explicitly defined in the tests and the default just changed), we're risking breaking that code without noticing.

Good point.

You implemented another test, do you think maybe others would need implementing?

I think this is something where test coverage could be helpful, as it would reveal if we have code paths that are no longer taken but which used to be taken.

muellerzr · 2023-11-06T18:37:36Z

@LysandreJik @BenjaminBossan I've updated all of our tests that do anything with save/saving the model to use both .bin and .safetensors :)

SunMarc

LGTM ! I've left a few comments.

src/accelerate/checkpointing.py

BenjaminBossan

LGTM, thanks Zach. I have a few comments but none are blockers.

src/accelerate/checkpointing.py

src/accelerate/test_utils/testing.py

BenjaminBossan

Great, LGTM, thanks.

BenjaminBossan · 2023-11-08T11:25:04Z

src/accelerate/checkpointing.py

+    If `safe_serialization` is `True`, models will be saved with `safetensors` while the rest are saved using native
+    `pickle`.


BenjaminBossan · 2023-11-08T11:28:17Z

tests/test_state_checkpointing.py

+    return f"{func.__name__}_{param_based_name}"
+
+
+@parameterized_class(("use_safetensors",), [[True], [False]], class_name_func=parameterized_custom_name_func)


TIL about parameterized_class.

LysandreJik

Thanks for iterating @muellerzr! LGTM

SunMarc

Thanks for iterating !

muellerzr added 5 commits November 3, 2023 16:34

Make safetensors default

33a9172

Rm location

6c0f124

Actually flip flags

60c014e

Tests + update checkpointing

6f792c9

Add to setup

7c8e8c8

muellerzr requested review from BenjaminBossan, LysandreJik and SunMarc November 3, 2023 17:41

LysandreJik approved these changes Nov 6, 2023

View reviewed changes

BenjaminBossan reviewed Nov 6, 2023

View reviewed changes

src/accelerate/checkpointing.py Outdated Show resolved Hide resolved

src/accelerate/accelerator.py Show resolved Hide resolved

src/accelerate/checkpointing.py Show resolved Hide resolved

muellerzr added 3 commits November 6, 2023 17:42

Start of tests with both safetensors and without

2c069d1

Update tests to use both

acdb758

Remove from load state

bd377a2

SunMarc approved these changes Nov 6, 2023

View reviewed changes

src/accelerate/checkpointing.py Outdated Show resolved Hide resolved

src/accelerate/checkpointing.py Show resolved Hide resolved

src/accelerate/checkpointing.py Outdated Show resolved Hide resolved

BenjaminBossan approved these changes Nov 7, 2023

View reviewed changes

src/accelerate/checkpointing.py Show resolved Hide resolved

src/accelerate/test_utils/testing.py Outdated Show resolved Hide resolved

muellerzr added 5 commits November 7, 2023 17:48

Explicit tip

b718f5a

With suggestions

b7c5336

Simplify, don't abstract. Need to bring back to deepspeed however

28f263f

Refactor to use consts

4880e63

Keep how it was

7879980

muellerzr requested review from SunMarc and BenjaminBossan November 7, 2023 23:44

BenjaminBossan approved these changes Nov 8, 2023

View reviewed changes

LysandreJik approved these changes Nov 8, 2023

View reviewed changes

muellerzr added 2 commits November 8, 2023 08:37

Merge branch 'main' into safetensors-default-1

3a88675

Typo fix

0cb151a

SunMarc approved these changes Nov 8, 2023

View reviewed changes

muellerzr merged commit e638b1e into main Nov 8, 2023
26 checks passed

muellerzr deleted the safetensors-default-1 branch November 8, 2023 14:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make safetensors the default #2120

Make safetensors the default #2120

muellerzr commented Nov 3, 2023

HuggingFaceDocBuilderDev commented Nov 3, 2023 •

edited

Loading

LysandreJik left a comment

BenjaminBossan left a comment

BenjaminBossan commented Nov 6, 2023

muellerzr commented Nov 6, 2023

SunMarc left a comment

BenjaminBossan left a comment

BenjaminBossan left a comment

BenjaminBossan Nov 8, 2023

BenjaminBossan Nov 8, 2023

LysandreJik left a comment

SunMarc left a comment

		If `safe_serialization` is `True`, models will be saved with `safetensors` while the rest are saved using native
		`pickle`.

		return f"{func.__name__}_{param_based_name}"


		@parameterized_class(("use_safetensors",), [[True], [False]], class_name_func=parameterized_custom_name_func)

Make safetensors the default #2120

Make safetensors the default #2120

Conversation

muellerzr commented Nov 3, 2023

What does this PR do?

Before submitting

Who can review?

HuggingFaceDocBuilderDev commented Nov 3, 2023 • edited Loading

LysandreJik left a comment

Choose a reason for hiding this comment

BenjaminBossan left a comment

Choose a reason for hiding this comment

BenjaminBossan commented Nov 6, 2023

muellerzr commented Nov 6, 2023

SunMarc left a comment

Choose a reason for hiding this comment

BenjaminBossan left a comment

Choose a reason for hiding this comment

BenjaminBossan left a comment

Choose a reason for hiding this comment

BenjaminBossan Nov 8, 2023

Choose a reason for hiding this comment

BenjaminBossan Nov 8, 2023

Choose a reason for hiding this comment

LysandreJik left a comment

Choose a reason for hiding this comment

SunMarc left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Nov 3, 2023 •

edited

Loading