New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error - Placement group is in use and may not be deleted. Starcluster 0.94 #277
Comments
It appears to happen very fast so I wonder if it's not waiting long enough for the instances using the security group to terminate? I should add the the worker node001 was a spot instance if that helps. |
After terminating the cluster only the placement group remains. Terminating again with the -force option is able to successfully terminate the placement group.
|
Try #218 |
@FinchPowers I'm not sure #218 will fix this. The exception is being raised on the first call to pg.delete() so the "waiting for placement group/security group to delete" stuff wouldn't even be involved in this case... I'm going to try to reproduce this and figure out what condition we need to wait for...unfortunately we might just have to do yet another try/except loop until it's successful. |
@FinchPowers Also IIRC placement groups and security groups are not linked so really just comes down to whether the instances are terminated. Perhaps this could be related to the spot request not completely closing before terminating the PG? Worth testing... |
@jtriley You are right this will not fix it. As you said, SC probably need to wait for complete instance termination before making the call to delete the placement group. |
Hi,
I'm able to successfully start a 2 node (master,node001) cluster in us-west-2 using m1.xlarge as the head and an HVM cc2.8xlarge as node001 without a problem. Trying to delete the cluster results in an error when it attempts to remove the placement group @sc-testcluster1. Ideas?
Thanks
John
The text was updated successfully, but these errors were encountered: