Improve HA recovery #6379
mjs added some commits Oct 4, 2016
  if err != nil {
-     logger.Debugf("connection failed, will retry: %v", err)
+     logger.Errorf("mongodb connection failed, will retry: %v", err)
wallyworld (Owner) commented Oct 5, 2016
s/error/warning
Errors are supposed to be something a user acts on, IIRC. The fact that the message says "will retry" implies a warning is sufficient.
$$merge$$
Status: merge request accepted. Url: http://juju-ci.vapour.ws:8080/job/github-merge-juju
jujubot merged commit a73d91e into juju:master Oct 5, 2016
mjs deleted the mjs:1588224-minimal-fix branch Oct 5, 2016
mjs commented Oct 5, 2016
This contains 2 fixes which improve Juju's handling of an HA primary change.
Fixes https://bugs.launchpad.net/juju/+bug/1588224
QA
Bootstrap, then use enable-ha to create a 3-node controller. Once it is stable, stop the primary node. Watch the logs to see when the controllers recover. Before these changes it would take minutes for the controllers to recover and the apiserver to come back up. With these changes the time between the primary being stopped and the apiserver being back up is < 60s.
There are many more improvements that can be made, but they are riskier and more invasive and will be tackled in future PRs.