-
Notifications
You must be signed in to change notification settings - Fork 382
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Configure Container Startup Timeout #2869
Comments
Yeah. Good catch. I don't think that's needed in this case so you can still run it without that arg. Updated the repro command |
For reference, I'm trying the repro again now... |
Also note, your |
I see. So to clarify |
Ok I got this working locally on latest (daily) bits. I had to change your code to the following to ensure the endpoint was registered correctly: var hf =
builder.AddContainer("hf-generation", "ghcr.io/huggingface/text-generation-inference", tag: "1.4")
.WithBindMount("./data", "/data")
.WithHttpEndpoint(containerPort: 80, hostPort: 8080)
.WithArgs("--model-id", "HuggingFaceH4/zephyr-7b-beta"); I watched the logs in Docker Desktop and on the Aspire Dashboard and it took about 5 minutes to fully startup (having previously downloaded the model into the ./data directory): At that point I opened a terminal and ran the following command:
And got the following reply: {"generated_text":"\n\nDeep Learning is a subset of Machine Learning that uses Artificial Neural Networks with multiple"} I could see the generation details in the log too: If there was a problem before it seems it's fixed now, perhaps by the recent change to streaming logs. |
Awesome! Is this the right guidance for setting up the latest daily version of Aspire? https://github.com/dotnet/aspire/blob/main/docs/using-latest-daily.md |
Yep! Let us know if you run into any issues. Getting the feeds and workload just right can be a little tricky. |
Containers like HuggingFace text-generation-inference take a long time to startup because they're pulling down the model which is usually multi-GB files.
This causes the app to time out.
Provide a way to enable containers that take a long time to start up.
To repro in Aspire, add this to AppHost project.
The text was updated successfully, but these errors were encountered: