Skip to content

Conversation

@alvarobartt
Copy link
Member

@alvarobartt alvarobartt commented Aug 9, 2024

Description

In order to support PyTorch inference over the recently released models the following dependencies have been bumped:

Additionally, huggingface_hub with the extra hf_transfer has also been included in order to support faster downloads using hf-transfer when setting the HF_HUB_ENABLE_HF_TRANSFER=1 environment variable, useful when downloading large models such as most of the text generation and text to image models.

Note

At the moment the dependencies are pinned within the huggingface-inference-toolkit Python package dependencies. Something to discuss is whether we want those to be set to >= with the minimum supported versions, so that those are tweaked and bumped within the Dockerfile images that the cloud providers are managing; or just keep the current approach and pin those to == and then release a new version of the Python package with the bumped dependencies.

@alvarobartt alvarobartt requested a review from oOraph August 9, 2024 08:20
@alvarobartt alvarobartt self-assigned this Aug 9, 2024
@alvarobartt alvarobartt merged commit c876b14 into main Aug 12, 2024
@alvarobartt alvarobartt deleted the bump-hf-dependencies branch August 12, 2024 06:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants