-
Notifications
You must be signed in to change notification settings - Fork 653
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support deploy Inference Endpoint from model catalog #2892
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thank you @Wauplin!
@ErikKaum can I get a final review from you to confirm that's how to envisioned this API? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry for being slow. Tested it locally and it works like a charm, really nice 🙌 thank you!
No worries 🤗 Let's get this merge then! 🚀 |
solves #2880 cc @ErikKaum
This PR adds
list_inference_catalog
andcreate_inference_endpoint_from_catalog
to list the HF Model Catalog (https://endpoints.huggingface.co/catalog) and create an Inference Endpoint from any of the listed models. Both methods are flagged as experimental as we might want to update the format in the future. This should be a low hanging fruit to allow people to quickly deploy a model.