-
Notifications
You must be signed in to change notification settings - Fork 24
Recreate kind cluster on every evg host reboot #32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am split on this change, it risks loosing data or setups... I understand the desire to fix this, but on the other hand I don't like when my environment gets cleaned up automatically when it was still in use. For example, if I had run patches against deployments or single cluster deployments my environment will be resetted.
Can you add an env var to opt-in for this re-creation? Than each engineer can decide whether they want re-creation or not.
# Conflicts: # scripts/dev/recreate_kind_clusters.sh
MCK 1.5.0 Release NotesNew Features
Bug Fixes
|
lsierant
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Trying it out immediately!
Summary
Evergreen hosts reboot every day or weekend (depending on your configuration). After every reboot, inter-cluster connectivity might be broken for some reason. Recreating the clusters is the only solution we have so far.
This is a systemd service that runs on every boot and recreates all clusters (including the kind-kind for single cluster tests).
Our tunnel command will now also get the kubeconfig from the host otherwise the tunnel won't open to the (new) ports of the recreated clusters.
The systemd service is not created by default. This needs to be explicitly done by running
Remove the architecture flag, and instead left it to be inferred from
uname.Proof of Work
Tested locally, but it would be nice if someone can checkout this branch and try it for themselves.
Checklist