-
Notifications
You must be signed in to change notification settings - Fork 266
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AWS EC2 instance never comes up - ASG reports 1 instance, but stuck in pending #572
Comments
Manually terminating the instance works fine as a way to unblock. |
Thanks for the details @dhalperi, seems like a really frustrating issue. If it keeps happening we'll add a check for it in the scaling lambda. |
This sounds like it might be related to something we've been seeing. We traced the issue back to The bootstrap failure doesn't appear to result in the instances getting flagged as unhealthy. On the other hand, it means the I'll raise a PR with a change that seems to have more or less solved this problem for us. It just adds an |
@dbaggerman in this instance the status was stuck at |
@lox, Yes that matches what we saw. The Instance State was |
This sure looks like an AWS bug not a BK bug.
We have new autoscaler-based deployment in us-west-1. Our ASG goes all the way down to 0 instances, up to 15. Usually things work great; in the last few days we've had VMs that don't come up for 20+ minutes.
When there's only 1 step queued for an instance, this can be frustrating - that blocked step hangs seemingly "forever". BK does not try to scale up, EC2 does not actually have the instance.
The group is big enough that new VMs are spun up in both AZs multiple times per day, which rules out basic config errors like one subnet not being networked correctly. It really seems like an AWS issue.
The text was updated successfully, but these errors were encountered: