New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Nomad server recover helper #77
Comments
ideally we could create a Bolt task to trigger the execution of the script |
@sebastianrakel @attachmentgenie do you have some thoughts here? maybe we sould have a bolt plan for this? |
I also feel a bolt task would be more appropriate, in a meltdown situation i dont see anyone changing and pushing hiera changes in an emergency. |
@attachmentgenie the idea is to create the file IMO the Bolt plan is eventually an addition to the puppet manifests. |
@attachmentgenie are you also good with the change, and is it clear how it works? |
Affected Puppet, Ruby, OS and module versions/distributions
n/a
How to reproduce
bring down the nomad daemon on all your nomad servers
What are you seeing
you won't be able to restart the daemon
What behaviour did you expect instead
have a procedure, a script, or a Bolt task
Output log
n/a
Proposed solution
Recovering from outage, is a time consuming operation, but it can be partially automated.
The manifest below creates a script which can be run from the servers to recovery the cluster.
If you have PuppetDB you can use
nomad_server_regex
otherwise you need to pre-fill a hash and usenomad_server_hash
.The text was updated successfully, but these errors were encountered: