raft: implement leader steps down #3866

xiang90 · 2015-11-14T20:30:44Z

From CONSENSUS: BRIDGING THEORY AND PRACTICE by diego.

A server might be in the leader state, but if it isn’t the current leader, it could be needlessly delaying client requests. For example, suppose a leader is partitioned from the rest of the cluster, but it can still communicate with a particular client. Without additional mechanism, it could delay a request from that client forever, being unable to replicate a log entry to any other servers. Meanwhile, there might be another leader of a newer term that is able to communicate with a majority of the cluster and would be able to commit the client’s request.

Thus, a leader in Raft steps down if an election timeout elapses without a successful round of heartbeats to a majority of its cluster; this allows clients to retry their requests with another server.

This is important to etcd since the pervious leader needs to cancel all connected watchers to prevent forever waiting.

/cc @bdarnell @gyuho

gyuho · 2015-11-14T20:41:40Z

Just to clarify, is this issue about improving the watch(er) implementation? Or do we need change on client implementation as well? Thanks,

xiang90 · 2015-11-14T20:43:57Z

Just to clarify, is this issue about improving the watch(er) implementation?

First, we need to improve raft. So that the leader can actually step down itself. Second, we need to improve watch implementation: if a server is partitioned, it should cancel all its watchers to avoid forever watching. Third, the new lease feature also needs leader stepping down.

Or do we need change on client implementation as well? Thanks,

I do not have plan for client side implementation right now.

gyuho · 2015-11-14T20:54:07Z

Makes sense. Thanks for clarification!

xiang90 added area/performance area/raft labels Nov 14, 2015

xiang90 self-assigned this Nov 14, 2015

xiang90 added this to the v2.3.0 milestone Nov 14, 2015

xiang90 mentioned this issue Nov 24, 2015

raft: support quorum check when raft is leader #3917

Merged

xiang90 closed this as completed in #3917 Nov 24, 2015

aaronlehmann mentioned this issue Sep 22, 2016

raft: use CheckQuorum option moby/swarmkit#1564

Merged

PapaYofen mentioned this issue Jan 18, 2019

brain split happened when network interrupts between dc seaweedfs/seaweedfs#825

Closed

komuw mentioned this issue Dec 16, 2020

Support CheckQuourm optimization hashicorp/raft#436

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

raft: implement leader steps down #3866

raft: implement leader steps down #3866

xiang90 commented Nov 14, 2015

gyuho commented Nov 14, 2015

xiang90 commented Nov 14, 2015

gyuho commented Nov 14, 2015

raft: implement leader steps down #3866

raft: implement leader steps down #3866

Comments

xiang90 commented Nov 14, 2015

gyuho commented Nov 14, 2015

xiang90 commented Nov 14, 2015

gyuho commented Nov 14, 2015