Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support deploy Inference Endpoint from model catalog #2892

Merged
merged 5 commits into from
Mar 10, 2025

Conversation

Wauplin
Copy link
Contributor

@Wauplin Wauplin commented Feb 27, 2025

solves #2880 cc @ErikKaum

This PR adds list_inference_catalog and create_inference_endpoint_from_catalog to list the HF Model Catalog (https://endpoints.huggingface.co/catalog) and create an Inference Endpoint from any of the listed models. Both methods are flagged as experimental as we might want to update the format in the future. This should be a low hanging fruit to allow people to quickly deploy a model.

@Wauplin Wauplin requested a review from ErikKaum February 27, 2025 15:43
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@Wauplin Wauplin marked this pull request as ready for review March 7, 2025 15:46
@Wauplin Wauplin changed the title [Draft] to add deploy from catalog Support deploying Inference Endpoint from model catalog Mar 7, 2025
@Wauplin Wauplin changed the title Support deploying Inference Endpoint from model catalog Support deploy Inference Endpoint from model catalog Mar 7, 2025
Copy link
Contributor

@hanouticelina hanouticelina left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thank you @Wauplin!

@Wauplin
Copy link
Contributor Author

Wauplin commented Mar 10, 2025

@ErikKaum can I get a final review from you to confirm that's how to envisioned this API?

Copy link
Member

@ErikKaum ErikKaum left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry for being slow. Tested it locally and it works like a charm, really nice 🙌 thank you!

@Wauplin
Copy link
Contributor Author

Wauplin commented Mar 10, 2025

No worries 🤗 Let's get this merge then! 🚀

@Wauplin Wauplin merged commit 5ab9e33 into main Mar 10, 2025
19 checks passed
@Wauplin Wauplin deleted the 2880-add-deploy-from-catalog branch March 10, 2025 15:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants