feat: supports loading .safetensors params file #231
Conversation
Thanks for the PR, a couple minor comments :)
LGTM!
Co-authored-by: Jonatan Kłosko <jonatanklosko@gmail.com>
FWIW, I plan to explore supporting sharded safetensors params files in a follow-up PR.
Co-authored-by: Jonatan Kłosko <jonatanklosko@gmail.com>
Thanks a lot!
Mmm, correcting myself. I think with the changes in this PR one would already be able to load sharded safetensors params files. For example, https://huggingface.co/stabilityai/StableBeluga-7B/tree/main contains sharded .safetensors files together with an index file, so loading with the safetensors params filename (see the sketch below) should use the existing sharded params loading logic, look at the index file and "just work".
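A minimal sketch of what that could look like; the `:params_filename` option name and the fallback to the corresponding `.index.json` file for sharded checkpoints are assumptions based on this thread, not confirmed API:

```elixir
# Hedged sketch: assumes Bumblebee.load_model/2 accepts a :params_filename
# option and that, when the checkpoint is sharded, the loader consults the
# corresponding model.safetensors.index.json to resolve and load all shards.
{:ok, model_info} =
  Bumblebee.load_model({:hf, "stabilityai/StableBeluga-7B"},
    params_filename: "model.safetensors"
  )
```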
A good improvement might be adding some decent auto-selection of the preferred file format based on what's available in the model repo, without requiring the user to explicitly provide the file name.
Good call. So far most repos have had the pytorch file and optionally other formats, but as safetensors becomes more popular there may be cases where only safetensors is available. Currently we do fallbacks, that is, we request one file, and if it doesn't exist we request another, and so on. I checked and it looks like the HF API now allows listing files, so I will reevaluate later whether we can improve this :)
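For illustration only, a listing-based selection could look roughly like this (not the actual Bumblebee internals; the filenames and module are hypothetical):

```elixir
# Hedged sketch: given the list of filenames in a model repo, prefer the
# PyTorch params file and fall back to safetensors when that's all there is.
defmodule ParamsFormat do
  def preferred_filename(repo_files) do
    cond do
      "pytorch_model.bin" in repo_files -> "pytorch_model.bin"
      "model.safetensors" in repo_files -> "model.safetensors"
      true -> nil
    end
  end
end

ParamsFormat.preferred_filename(["config.json", "model.safetensors"])
#=> "model.safetensors"
```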
FTR, as of #256 we automatically detect the case where there are no parameters in the pytorch format but a safetensors file is available :)
closes #96
Opening this proof of concept as a draft while I continue working on some improvements and test coverage, and to gather any other feedback folks have :-)