[docdb] Master falling behind does not get automatically remote bootstrapped #4799

bmatican · 2020-06-17T02:45:14Z

Jira Link: DB-1842
Currently, if a master falls behind to the point that the leader no longer has the logs it needs, or its follower unavailability window passes (default 15m for both), manual intervention is required to remove it from the quorum and add it back.
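The two conditions above can be sketched as a small predicate. This is only an illustration, not YugabyteDB code: `FollowerState`, `needs_remote_bootstrap`, and the index-based check for "leader no longer has logs" are hypothetical names and assumptions. Only the 15m default comes from this issue.

```python
from dataclasses import dataclass

# Default from the issue text: 15 minutes.
DEFAULT_WINDOW_SECONDS = 15 * 60


@dataclass
class FollowerState:
    # Hypothetical fields, for illustration only.
    last_received_index: int      # highest log index the follower already holds
    seconds_since_contact: float  # time since the leader last reached the follower


def needs_remote_bootstrap(follower: FollowerState,
                           leader_min_retained_index: int,
                           window_seconds: float = DEFAULT_WINDOW_SECONDS) -> bool:
    """Return True if the follower can no longer catch up through normal replication."""
    # The follower needs entry last_received_index + 1 next; if the leader's
    # oldest retained entry is newer than that, the logs are gone.
    logs_gone = follower.last_received_index + 1 < leader_min_retained_index
    # The follower has been unreachable for longer than the allowed window.
    window_passed = follower.seconds_since_contact > window_seconds
    return logs_gone or window_passed
```

Either condition alone is enough to leave the master stuck, which is why the sketch combines them with `or`.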

At the Raft level, we have this flag to control whether or not to evict failed followers: evict_failed_followers. This is disabled for masters though, as there is no automated mechanism for them to be added back into the quorum!

Moreover, until #2007 is resolved, the current procedure of issuing a remove followed by an add requires that, in between, we actually wipe the master's data and restart the master in shell mode (empty master_addresses flag).
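The ordering of the manual steps can be modeled as follows. This is a toy sketch of the sequence described above, not a tool that talks to a real cluster: `Quorum`, `manually_recover_master`, and the step names are hypothetical, and the actual remove/add commands and wipe procedure are not shown.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class Quorum:
    # Hypothetical in-memory stand-in for the master Raft quorum membership.
    members: Set[str] = field(default_factory=set)


def manually_recover_master(quorum: Quorum, master_addr: str) -> List[str]:
    """Apply the manual steps in order and return the list of steps taken."""
    steps = []

    # 1. Remove the stale master from the quorum.
    quorum.members.discard(master_addr)
    steps.append("remove_from_quorum")

    # 2. Wipe the stale master's data (needed until #2007 is resolved).
    steps.append("wipe_master_data")

    # 3. Restart the master in shell mode (empty master_addresses flag).
    steps.append("start_in_shell_mode")

    # 4. Add the master back into the quorum.
    quorum.members.add(master_addr)
    steps.append("add_to_quorum")

    return steps
```

The wipe and shell-mode restart sit between the remove and the add; skipping them is exactly the case that #2007 tracks.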

Ideally, we'd have some mechanism through which we would automatically trigger a remote bootstrap for the master if it comes back after the default 15m window.

cc @iSignal @hectorgcr


bmatican added the area/docdb (YugabyteDB core features) label Jun 17, 2020

bmatican self-assigned this Jun 17, 2020

bmatican mentioned this issue Jun 17, 2020

[docs] Troubleshooting for a master fallen behind #4800

Closed

bmatican added this to To do in Usability Jun 23, 2020

bmatican added this to Short term in YBase features Jun 24, 2020

bmatican added this to To do in Master components Nov 13, 2020

yugabyte-ci added the kind/bug (This issue is a bug) and priority/medium (Medium priority issue) labels Jun 9, 2022

bmatican assigned lingamsandeep and unassigned bmatican Jul 7, 2022

yugabyte-ci added the kind/enhancement (This is an enhancement of an existing feature) label and removed the kind/bug (This issue is a bug) label Jul 30, 2022
