Add safetensors support #4659
@microsoft-github-policy-service agree company="Gigabyte Technology"
This is great, thank you @jihnenglin! Can you also create a PR on the MII side to update https://github.com/microsoft/DeepSpeed-MII/blob/main/requirements/requirements.txt to add
Sure, will do! Edit: DeepSpeed-MII PR #296
I've made it automatically fetch
Cleaning up this implementation a bit to avoid the deep if-else branching statements
Thanks @jihnenglin! I cleaned up your implementation a little so we could avoid some of the deeper nested if-else branching and make the code a bit easier to read. LGTM
The only a6000 test failing is a known failure, so we can bypass it if we need to.
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com> Co-authored-by: Jeff Rasley <jerasley@microsoft.com> Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
In response to DeepSpeed-MII issue #272
Introduce compatibility for HF repositories containing solely `.safetensors` checkpoints. In case of coexistence, prioritize `.safetensors` checkpoints over `.bin` ones.
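
The prioritization rule described above can be illustrated with a minimal sketch. This is not the code from this PR; `select_checkpoint_files` is a hypothetical helper, and it assumes the checkpoint format can be decided from file extensions alone:

```python
from typing import Iterable, List


def select_checkpoint_files(filenames: Iterable[str]) -> List[str]:
    """Pick the checkpoint files to load from a repository file listing.

    Hypothetical helper for illustration only: if any `.safetensors`
    files are present they are returned exclusively; otherwise the
    `.bin` files are returned.
    """
    names = sorted(filenames)
    safetensors = [n for n in names if n.endswith(".safetensors")]
    if safetensors:
        # Both formats may coexist; .safetensors takes priority.
        return safetensors
    # Fall back to .bin checkpoints when no .safetensors files exist.
    return [n for n in names if n.endswith(".bin")]
```

With a listing that contains both `model.safetensors` and `pytorch_model.bin`, the helper returns only the `.safetensors` file; with a `.bin`-only listing it returns the `.bin` file.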