How to evaluate learner progress #15

jingyih · 2019-04-07T03:07:22Z

Update (5/7/2019): final design #15 (comment)

Let's discuss how to evaluate learner progress before promoting a learner member.

Here is the relevant part from design doc as reference:

Only accept promote request if and only if: The learner node is in a healthy state. The learner is in sync with leader or the delta is within the threshold (e.g. the number of entries to replicate to learner is less than 1/10 of snapshot count, which means it is less likely that even after promotion leader would not need send snapshot to the learner).

So far I have gathered suggestions from @gyuho and @xiang90, but have not yet come up with a complete solution.

Locally, learner knows its own committed index. But it does not have information on leader's raft log(?). So learner needs to query leader in order to understand its 'progress'. Do we need a new raft message type for the query? Maybe use read index query to get leader's committed index and compare it with its own?

After learner finds its 'progress', we can expose it via Ready.SoftState so that etcdserver can use it.

Please let me know your comments / suggestion.

cc @gyuho @xiang90 @WIZARD-CXY @jpbetz

The text was updated successfully, but these errors were encountered:

jingyih · 2019-04-08T01:23:21Z

Regarding the idea of using read index query to get leader's committed index, it is not straightforward to implement. Every ReadState resulted from a read index query will end up been consumed by the linearizable read loop. So in order for this idea to work, we probably need to pass additional flag to read index request context. So that later at the output of raft ready channel, we can distinguish the ReadState, bypass the linearizable read loop, and handle it separately. Any thoughts?

Ref:
Current request context for read index query:

etcd/etcdserver/v3_server.go

Line 651 in a621d80

if err := s.r.ReadIndex(cctx, ctxToSend); err != nil {

etcd/etcdserver/v3_server.go

Lines 630 to 632 in a621d80

    
           ctxToSend := make([]byte, 8) 
        
           id1 := s.reqIDGen.Next() 
        
           binary.BigEndian.PutUint64(ctxToSend, id1)

We need something like this:

type ctxToSend struct {
  reqID        uint64
  someFlagName bool // when unset, send the corresponding ReadState to r.readStateC; otherwise send to a different channel to get leader's committed index
}

jingyih · 2019-04-08T01:26:43Z

If you have concerns or better ideas, please let me know. Otherwise I will try to come up with a prototype in the next couple days.

WIZARD-CXY · 2019-04-10T05:16:08Z

I'm not an expert on this, but what about this one?
see func newReady(r *raft, prevSoftSt *SoftState, prevHardSt pb.HardState)
we can get r.prs and r.learnPrs in newReady and set it to rd.SoftState it will then be sent to node.readyc then we will get infos from Node.Ready() channel in the case rd := <-r.Ready(): of func (r *raftNode) start(rh *raftReadyHandler) . we can define a raftReadyHandler function to update the learner progress or peer progress just like updateCommittedIndex to update the commit index.

WIZARD-CXY · 2019-04-10T05:18:40Z

@jingyih I don't know if I explain clearly above, or I can code a prototype. I think my approach is simple and more native than the read index query approach.

jingyih · 2019-04-10T19:46:01Z

@WIZARD-CXY
My understanding is that ONLY leader tracks the progress of other nodes and actively maintains its r.prs and r.learnerPrs. So a follower node's r.prs does not have information about the leader. On the other hand, the member promote request can be sent to any member.

jingyih · 2019-04-10T21:32:07Z

Now that I thought about it. The method I proposed will not work. Because a random node who receives the member promote call does not have info on learner node, nor does it have proper ways to retrieve it.

jingyih · 2019-04-10T21:44:28Z

So basically a node can either query the progress in raft layer, or in etcdserver layer. If we want the query to happen in etcdserver layer, we need: A) expose progress of leader and learner via rd.SoftState, as suggested by @WIZARD-CXY, and we need to define a way for nodes to retrieve these information from leader (maybe via HTTP?). I like this approach because it decouples etcdserver and raft more cleanly, as compared to query leader and learner progress in raft layer.

jingyih · 2019-04-11T02:22:42Z

Talked to @WIZARD-CXY offline, he is going to prototype this. Regarding retrieving information from leader via HTTP, here are some references:

As an example, this is the code for retrieving members information from peer using HTTP client. Note that in our case we need to identify which peer is leader and only retrieve information from leader.

etcd/etcdserver/cluster_util.go

Line 58 in 9d62477

func GetClusterFromRemotePeers(lg *zap.Logger, urls []string, rt http.RoundTripper) (*membership.RaftCluster, error) {
How the request is handled.

etcd/etcdserver/api/etcdhttp/peer.go

Line 34 in 9d62477

func NewPeerHandler(lg *zap.Logger, s etcdserver.ServerPeer) http.Handler {

WIZARD-CXY · 2019-04-11T02:43:24Z

Thanks for the info @jingyih. I will prototype this one and send a pr asap. Meanwhile @xiang90 can u take a look?

jingyih · 2019-04-11T22:31:18Z

Let's start with the first step, which is exposing progress via ready softstate.

We can probably avoid the second step which is retrieving info from leader. On client side we may write some helper function to determine the leader endpoint, and only send member promote request to leader.

WIZARD-CXY · 2019-04-12T01:36:36Z

good idea

WIZARD-CXY · 2019-04-13T08:24:34Z

@jingyih After a closer look, I find sth interesting, check this function raftStatus in package etcdserver. It already have the progress infos if it is a leader. Haha, 踏破铁鞋无觅处得来全不费工夫.

WIZARD-CXY · 2019-04-13T08:35:18Z

@jingyih https://github.com/jingyih/etcd/pull/8/files see place where I mentioned you, we can use raftStatus there.

jingyih · 2019-04-13T09:16:47Z

@jingyih After a closer look, I find sth interesting, check this function raftStatus in package etcdserver. It already have the progress infos if it is a leader. Haha, 踏破铁鞋无觅处得来全不费工夫.

Nice!

jingyih · 2019-05-07T20:47:36Z

The final design is:

Leader has necessary information to evaluate learner progress. The information is exposed via raftStatus() function:

etcd/etcdserver/raft.go

Line 57 in e8e668c

raftStatus func() raft.Status
etcd server will forward member promote request to leader.

Implemented in #19 and #31.

jingyih mentioned this issue Apr 7, 2019

Task list: support raft learner in etcd. etcd-io/etcd#10537

Closed

33 tasks

jingyih mentioned this issue Apr 26, 2019

etcdctl: find leader when promoting learner #28

Closed

jingyih closed this as completed May 7, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to evaluate learner progress #15

How to evaluate learner progress #15

jingyih commented Apr 7, 2019 •

edited

Loading

jingyih commented Apr 8, 2019

jingyih commented Apr 8, 2019

WIZARD-CXY commented Apr 10, 2019

WIZARD-CXY commented Apr 10, 2019 •

edited

Loading

jingyih commented Apr 10, 2019 •

edited

Loading

jingyih commented Apr 10, 2019

jingyih commented Apr 10, 2019

jingyih commented Apr 11, 2019

WIZARD-CXY commented Apr 11, 2019

jingyih commented Apr 11, 2019 •

edited

Loading

WIZARD-CXY commented Apr 12, 2019 •

edited

Loading

WIZARD-CXY commented Apr 13, 2019

WIZARD-CXY commented Apr 13, 2019

jingyih commented Apr 13, 2019

jingyih commented May 7, 2019

How to evaluate learner progress #15

How to evaluate learner progress #15

Comments

jingyih commented Apr 7, 2019 • edited Loading

jingyih commented Apr 8, 2019

jingyih commented Apr 8, 2019

WIZARD-CXY commented Apr 10, 2019

WIZARD-CXY commented Apr 10, 2019 • edited Loading

jingyih commented Apr 10, 2019 • edited Loading

jingyih commented Apr 10, 2019

jingyih commented Apr 10, 2019

jingyih commented Apr 11, 2019

WIZARD-CXY commented Apr 11, 2019

jingyih commented Apr 11, 2019 • edited Loading

WIZARD-CXY commented Apr 12, 2019 • edited Loading

WIZARD-CXY commented Apr 13, 2019

WIZARD-CXY commented Apr 13, 2019

jingyih commented Apr 13, 2019

jingyih commented May 7, 2019

jingyih commented Apr 7, 2019 •

edited

Loading

WIZARD-CXY commented Apr 10, 2019 •

edited

Loading

jingyih commented Apr 10, 2019 •

edited

Loading

jingyih commented Apr 11, 2019 •

edited

Loading

WIZARD-CXY commented Apr 12, 2019 •

edited

Loading