Riak start script #73
Should we also add some reachability-related checks?
The retry for the reachability check has been added.
iakkus
left a comment
Looks good, thanks!
manuelstein
left a comment
Quick question: when the join succeeds, the script immediately tests whether the node is listed in the cluster status as "joining". Could that go wrong? E.g., when the ring is busy with other work, could it take a second "cluster status" call before the node shows up as "joining"? On one deployment I saw the third out of 3 nodes was marked "joining".
I thought "riak-admin cluster join" succeeds only if it has
successfully marked the node as "joining". Still, I guess it does no harm
to add a sleep before that "joining" check. Pushed a simple change.
Thanks.
On Wed, Jul 15, 2020 at 7:42 PM Manuel Stein wrote:
> Quick question: when the join succeeds, it would directly test if the node
> is listed in the cluster status as "joining". Could that go wrong? E.g.
> when the ring is busy with other stuff, could it take a second "cluster
> status" to find the node is "joining"? On one deployment I saw the third
> out of 3 nodes was marked "joining".
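The concern above (a single "cluster status" call right after the join might not yet show the node as "joining") could also be handled by polling instead of one fixed sleep. The following is only a sketch, not the code from this PR: the function name `wait_for_member_status`, the `STATUS_CMD` variable, and the assumption that the node name and its status appear on the same line of the status output are all hypothetical.

```shell
#!/bin/sh
# Hypothetical helper, not taken from the PR: poll the cluster status until
# NODE is listed with STATUS, instead of checking exactly once.
# STATUS_CMD is an assumption; it defaults to "riak-admin cluster status"
# and can be overridden (for example with a stub when testing).
STATUS_CMD="${STATUS_CMD:-riak-admin cluster status}"

wait_for_member_status() {
    wfm_node="$1"            # node name to look for
    wfm_status="$2"          # expected status, e.g. "joining"
    wfm_attempts="${3:-5}"   # how many times to check
    wfm_delay="${4:-2}"      # seconds to wait between checks
    wfm_i=1
    while [ "$wfm_i" -le "$wfm_attempts" ]; do
        # Assumes the node name and its status share one output line.
        if $STATUS_CMD 2>/dev/null | grep -F "$wfm_node" | grep -q "$wfm_status"; then
            return 0
        fi
        echo "check $wfm_i/$wfm_attempts: $wfm_node not yet '$wfm_status'" >&2
        if [ "$wfm_i" -lt "$wfm_attempts" ]; then
            sleep "$wfm_delay"
        fi
        wfm_i=$((wfm_i + 1))
    done
    return 1
}

# Example call (the node name is a placeholder):
#   wait_for_member_status "riak@10.0.0.3" joining 5 2 || exit 1
```

Compared with a fixed sleep, polling returns as soon as the status appears and still gives up with a non-zero exit code after a bounded number of checks.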
This branch stabilizes the Riak start script by adding retries and by reporting an error when the cluster fails to start.
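As an illustration of the retry-and-report pattern described here, a minimal sketch follows. It is not the script from this branch: the `retry` function, its argument order, and the node name in the usage comment are assumptions made for the example.

```shell
#!/bin/sh
# Hypothetical helper, not taken from the PR: run a command up to MAX times,
# waiting DELAY seconds between attempts, and report every failure on stderr.
# Variables are prefixed because POSIX sh has no function-local variables.
retry() {
    retry_max="$1"
    retry_delay="$2"
    shift 2
    retry_n=1
    retry_rc=1
    while [ "$retry_n" -le "$retry_max" ]; do
        retry_rc=0
        "$@" || retry_rc=$?
        if [ "$retry_rc" -eq 0 ]; then
            return 0
        fi
        echo "ERROR: '$*' failed (exit $retry_rc), attempt $retry_n/$retry_max" >&2
        if [ "$retry_n" -lt "$retry_max" ]; then
            sleep "$retry_delay"
        fi
        retry_n=$((retry_n + 1))
    done
    # Propagate the last exit code so the caller can abort the start-up.
    return "$retry_rc"
}

# Example calls (the node name is a placeholder):
#   retry 5 3 riak-admin cluster join "riak@10.0.0.1" || exit 1
#   retry 5 3 riak-admin cluster plan || exit 1
#   retry 5 3 riak-admin cluster commit || exit 1
```

Returning the last exit code, rather than a generic failure, lets the calling script log why the cluster could not be started before it exits.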