[yugabyted] yugabyted node fails to restart when correct leader master is not known. #14440
Labels
area/db-usability
issue related to DB usability project. Including yugabyted cli and yugabyted ui issues.
area/ossexp
DB usability Project
priority/high
High Priority
Projects
Issue: Leader master changes need to be handled transparently
Scenario 1:
Rolling update/ rolling upgrade
T0: 5 node cluster with --join 127.0.0.1
T1: bring down first node, 127.0.0.1
Rolling update of the leader master
T2: make required changes and bring back 127.0.0.1
Solution:
Yugabyted can be updated to create a list of masters, which can be provided to
yb-admin
command. This may not require update toyb-admin
commandRolling update of other nodes (non-leader or non-master nodes)
T3: master leader has changed, so the ip-address given in the --join flag is no longer the leader master
T4: start the node back up
- ./bin/yugabyted start --join=
- is join flag persisted?
- we need to provide the new master leader ip-address to work
Scenario 2:
Description:
T0: 5 node cluster with --join 127.0.0.1
T1: master leader, 127.0.0.1 fails
- new lead master gets elected, say 12.0.0.3
T2: when I get to 127.0.0.2
yb_admin command is used to find list of all masters. However based on the below logs, we can create the list of ip-address from the list of master already available to yugabyted.
Solution:
Yugabyted can be updated to have a list of current masters, which can be provided to
yb-admin
command. This may not require update toyb-admin
commandThe text was updated successfully, but these errors were encountered: