Serialization

huggingface_hub contains helpers to help ML libraries serialize models weights in a standardized way. This part of the lib is still under development and will be improved in future releases. The goal is to harmonize how weights are serialized on the Hub, both to remove code duplication across libraries and to foster conventions on the Hub.

Save torch state dict

The main helper of the serialization module takes a state dictionary as input (e.g. a mapping between layer names and related tensors), splits it into several shards while creating a proper index in the process and save everything to disk. At the moment, only torch tensors are supported. Under the hood, it delegates the logic to split the state dictionary to [split_torch_state_dict_into_shards].

[[autodoc]] huggingface_hub.save_torch_state_dict

Split state dict into shards

The serialization module also contains low-level helpers to split a state dictionary into several shards, while creating a proper index in the process. These helpers are available for torch and tensorflow tensors and are designed to be easily extended to any other ML frameworks.

split_tf_state_dict_into_shards

[[autodoc]] huggingface_hub.split_tf_state_dict_into_shards

split_torch_state_dict_into_shards

[[autodoc]] huggingface_hub.split_torch_state_dict_into_shards

split_state_dict_into_shards_factory

This is the underlying factory from which each framework-specific helper is derived. In practice, you are not expected to use this factory directly except if you need to adapt it to a framework that is not yet supported. If that is the case, please let us know by opening a new issue on the huggingface_hub repo.

[[autodoc]] huggingface_hub.split_state_dict_into_shards_factory

Helpers

get_torch_storage_id

[[autodoc]] huggingface_hub.get_torch_storage_id

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

serialization.md

serialization.md

Serialization

Save torch state dict

Split state dict into shards

split_tf_state_dict_into_shards

split_torch_state_dict_into_shards

split_state_dict_into_shards_factory

Helpers

get_torch_storage_id

Files

serialization.md

Latest commit

History

serialization.md

File metadata and controls

Serialization

Save torch state dict

Split state dict into shards

split_tf_state_dict_into_shards

split_torch_state_dict_into_shards

split_state_dict_into_shards_factory

Helpers

get_torch_storage_id