-
Notifications
You must be signed in to change notification settings - Fork 4.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CoreOS: Docker fail to restart with missing /etc/sysconfig/docker file #4747
Comments
See PR #4760 for a possible solution. |
I'd love to add unit tests too, when I have a minute. |
Sounds odd ... and admittedly we've not seen any behavior like this ... which version of CoreOS you running? .. strange the |
We've seen this on stable (1520.8.0) with some (unpredictable) regularity. I think |
I think the PR makes a ton of sense and would love to get it merged (just needs CLA). There might be more to it, but this feels right & the right fix. |
If we ssh into the node and start docker manually, all services will eventually restart and the node will join the cluster successfully.
We have tracked this down to this PR #3134
It looks like that sometimes the OnChangeExecute is too fast before the file /etc/sysconfig/docker is written and docker failed to restart.
Here's the entry in journalctl:
We would gladly fixes/test this, but we are unsure what is the best approach for kops at this point, see #9 for a couple solutions we put forward internally
Solutions that came up discussion this internally:
The text was updated successfully, but these errors were encountered: