etcdserver: add ability to auto-promote learners to voters #10887

maxenglander · 2019-07-11T13:14:06Z

Auto-promote learners

See #10537 for original mention of auto promoting learners.
See etcd-io/website#107 for changes to learner design doc.

Design is motivated by comments in original design PR:

There are three cases for auto-promote:

promote a learner to voter regardless of the cluster size.

promote a learner to voter if the cluster size is smaller than the expected size (for example, a voter is removed from the cluster)

promote a learner to voter if some other voter is unhealthy for some user defined duration (remove the unhealthy node and replace it with a learner)

...and also motivated by other planned 3.5 learner features:

A learner can serve as a read-only node that never gets promoted. In a weak consistency mode, learner only receives data from leader and never process writes. Serving reads locally without consensus overhead would greatly decrease the workloads to leader but may serve stale data. In a strong consistency mode, learner requests read index from leader to serve latest data, but still rejects writes.

Enables newly added learners to be automatically promoted to voting members.

With this PR, operators can supply "auto" and "promote rules" to member add API. When --learner and --auto are supplied to member add, the learner is automatically promoted to a voting member when any of its promotion rules are satisfied. If no promotion rules are supplied, the "default" promotion rule is used (90% caught up to leader progress).

If --learner is supplied but not --auto, the learner is not automatically promoted. Instead, the promotion rules determine whether an operator request to promote the learner is accepted. This implementation supplants and maintains compatibility with the current --learner behavior.

Operators can also supply --delay, which determines how long a promotion rule must be satisfied before voter is promoted. The default value is 0, meaning learners may be promoted immediately when a promotion rule is satisfied.

With this PR, learners may only be promoted to voters. However, the API changes are flexible enough to accommodate other promotion targets (e.g. "strong reader" or "weak reader"). Likewise, with this PR the only promotion criteria is "--min-progress". The API can easily accommodate additional promotion criteria such as --min-healthy-voters (or --min-cluster-size), which would promote member as soon as the number of healthy voters (or number of members) in cluster drops below a threshold.

jingyih · 2019-07-11T21:37:03Z

Thanks for implementing this feature. The original plan was to not include this in 3.4 release (will likely be in 3.5), but I am open to discussion on this. FYI, the feature cut date for 3.4 release is July 31, accordingly to the latest community meeting.

cc @xiang90 @gyuho @jpbetz @WIZARD-CXY

jingyih · 2019-07-11T21:37:28Z

Is this still WIP? Is it ready to be reviewed?

maxenglander · 2019-07-12T01:06:09Z

Hey @jingyih it's WIP. I still need to write tests, update documentation and some APIs (e.g. member list to include info on auto-promotion). I was hoping to get some initial feedback on the approach while I'm working on those things.

WIZARD-CXY · 2019-07-12T01:39:54Z

Thanks @maxenglander. I will take a closer look asap.

WIZARD-CXY · 2019-07-14T03:21:42Z

@maxenglander I went throught the code change, the overall looks good. Can we keep the autoPromote related code in the etcdserver?

maxenglander · 2019-07-14T18:17:55Z

@WIZARD-CXY are you asking if I can undo changes to all packages except etcdserver? If I do that, it won't be possible to provide users with an --auto-promote flag.

jingyih

We need to decide whether autoPromote is an etcd feature or Raft feature. I don't think changes to Raft package is necessary for implementing this feature. I prefer to keep Raft package having core functionalities only - implement autoPromote only in etcd server.

jingyih · 2019-07-16T01:08:42Z

clientv3/cluster.go

+	MemberAddAsAutoPromotingNode(ctx context.Context, peerAddrs []string) (*MemberAddResponse, error)
+
+	// MemberAddAsNode adds a new member as a node into the cluster.
+	MemberAddAsNode(ctx context.Context, peerAddrs []string) (*MemberAddResponse, error)


I think we should keep MemberAdd() unchanged for better backward compatibility.

Good point, done in 29fe958

jingyih · 2019-07-16T01:15:51Z

clientv3/integration/cluster_test.go

+			for _, member := range memberList.Members {
+				if member.ID == autoPromotingLearnerID {
+					if member.IsLearner == false {
+						break success


For better readability, maybe use a flag to record if this member is promoted, so that you can just do 'break' after this for loop.

Good idea, done in 29fe958

maxenglander · 2019-07-16T04:48:11Z

Hey @jingyih I can make that change, but could benefit from some input on how. I can see two ways of implementing this without changes to Raft package.

Change the behavior of member add --learner so that all learners are auto-promoted
Keep track of whether a learner is designated for auto-promotion only on the leader (in memory and/or on disk)

The drawback of 1. is that it is a breaking API change. Users might expect learners to remain in that state until manually promoted.

The drawback of 2. is that if the leader dies and leadership transfers to another member, a learner that is marked for auto-promotion will lose its auto-promotion designation, and thus be "demoted" to a regular learner, unless there is some (non-Raft) mechanism for having leaders share auto-promote designations with peers.

Can you provide input on whether 1. or 2. is preferable? Or perhaps some 3rd option I'm not thinking of?

I'd also be interested in understanding what criteria you use to decide that the ability to have add a learner qualifies as a Raft feature, whereas the ability to designate a learner for auto-promotion does not.

jingyih · 2019-07-17T02:07:47Z

The first step we should do is probably decide the behavior of member add API in v3.5. My own opinion is, in the very long run, 'member add' by default may simply add any new member as learner, which will be auto promoted once caught up. If user wants to add a learner member that will not be auto promoted, they need to specially set a flag. But in the near future (v3.5), the API is not clearly defined yet. (We might also need to implement a proper feature gate and put auto promote behind it) I am happy to start a draft on this so we could start the discussion.

I can make that change, but could benefit from some input on how. I can see two ways of implementing this without changes to Raft package.

I'd also be interested in understanding what criteria you use to decide that the ability to have add a learner qualifies as a Raft feature

Sorry my previous comment is inaccurate. What I meant was, auto promote's implementation might not need changes to Raft package and current API. etcdserver has cluster member information in s.cluster. So one possible option is to add autoPromote field only in etcdserver. From Raft's perspective, it may not need to distinguish between the learners. - I am happy to provide more details and suggestion. But since I feel the actual implementation is going to be depend on member add API in v3.5, we should agree on the API first.

maxenglander · 2019-07-17T03:13:36Z

@jingyih thanks for the comments. I'm happy to wait for you and other maintainers to firm up API plans after which I'm happy to make adjustments to code based on that. FWIW, here was what I was envisioning for API changes from v3.4 and onwards.

My own opinion is, in the very long run, 'member add' by default may simply add any new member as learner, which will be auto promoted once caught up.

I agree, I think this makes a lot of sense. My thinking was that, in v3.4, users would have the option of specifying member add --learner --auto-promote as a way of previewing that "long run" behavior.

In >= v3.5, I was thinking member add would end up behaving the same way as member add --learner --auto-promote, which would mostly be a matter of changing member add to trigger a pb.ConfChangeAddAutoPromotingNode. At that time, pb.ConfChangeAddNode would become an internal-only configuration change that can not be triggered via user API call, and is meant to be triggered by etcdserver once an auto-promoting node has caught up to leader.

Eventually (e.g. in v3.6 or later), member add --learner --auto-promote could optionally be deprecated since it would be redundant to member add.

If user wants to add a learner member that will not be auto promoted, they need to specially set a flag.

I was thinking that --learner would be that special flag. In other words, the behavior of member add would change starting at v3.5, but member add --learner would remain the same.

So one possible option is to add autoPromote field only in etcdserver. From Raft's perspective, it may not need to distinguish between the learners.

Let me know if I'm understanding you correctly. I think you're saying that autoPromote could be a global value set by a feature-gate, rather than (as in current PR) something that is specifiable per-learner. In other words, a given etcd cluster could support either auto-promoting learners or non-auto-promoting learners, but not both. If it were implemented that way, then pb.ConfChangeAddLearner would create an auto-promoting learner or a non-auto-promoting learner, depending on the value of the feature gate known in etcdserver, and therefore there would be no need for pb.ConfChangeAddAutoPromotingNode.

Thanks for all the discussion so far. As I said above, will wait to see what you and other members decide on in terms of v3.5 API, and make code adjustments accordingly.

jingyih · 2019-07-19T07:07:50Z

@maxenglander I agree with most part of your envisioning. Let's first finalize the API. Maybe just open a PR against the design doc [1] to add a new section 'Learner Implementation in v3.5'?

At the same time, I would like to explore the opportunity of possibly adding proper feature gate in etcd. AFAIK, currently in etcd the experimental features are enabled by special server side flags [2]. If not being careful, servers in the same cluster could end up having different settings on these. If we can improve this situation, we can choose to (or not to) put auto promote behind a feature gate, in additional to the '--auto-promote' flag in member add.

Another nice thing about having feature gate, is that we can always disable a new feature when it is added, and enable it in the next minor release. This is one step further towards officially support 1 minor version downgrade / rollback.

[1] https://github.com/etcd-io/etcd/blob/master/docs/server-learner.rst

[2] https://github.com/etcd-io/etcd/blob/master/Documentation/op-guide/configuration.md#experimental-flags

maxenglander · 2019-07-20T00:23:06Z

That sounds good @jingyih I will do that. Thank you for the guidance.

codecov-io · 2019-07-25T03:25:09Z

Codecov Report

Merging #10887 (fdb1712) into master (b67ed4e) will decrease coverage by 22.61%.
The diff coverage is 31.57%.

@@             Coverage Diff             @@
##           master   #10887       +/-   ##
===========================================
- Coverage   69.74%   47.12%   -22.62%     
===========================================
  Files         407      398        -9     
  Lines       32585    32614       +29     
===========================================
- Hits        22726    15370     -7356     
- Misses       7895    15220     +7325     
- Partials     1964     2024       +60

Flag	Coverage Δ
all	`47.12% <31.57%> (-22.62%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
client/v3/cluster.go	`18.18% <0.00%> (-74.32%)`	⬇️
server/etcdserver/api/etcdhttp/peer.go	`40.27% <0.00%> (-46.87%)`	⬇️
server/etcdserver/api/membership/cluster.go	`56.92% <0.00%> (-23.25%)`	⬇️
server/etcdserver/api/membership/errors.go	`100.00% <ø> (ø)`
server/etcdserver/api/v2v3/server.go	`0.00% <0.00%> (ø)`
server/etcdserver/cluster_util.go	`40.14% <0.00%> (-28.24%)`	⬇️
server/etcdserver/errors.go	`0.00% <ø> (ø)`
server/etcdserver/util.go	`65.42% <0.00%> (-33.21%)`	⬇️
server/proxy/grpcproxy/cluster.go	`30.92% <10.00%> (-39.53%)`	⬇️
etcdctl/ctlv3/command/member_command.go	`61.83% <25.00%> (-5.41%)`	⬇️
... and 253 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update b67ed4e...fdb1712. Read the comment docs.

stale · 2020-04-06T18:26:02Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale · 2020-09-28T18:53:38Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 21 days if no further activity occurs. Thank you for your contributions.

maxenglander · 2020-09-29T12:41:37Z

/remove stale

maxenglander · 2021-01-14T02:38:55Z

/remove-stale

maxenglander · 2021-02-19T01:22:23Z

Hi @ptabor thanks for reviewing etcd-io/website#107. I thought since you reviewed that I would ping you here to see if you can help move this PR forward. There are probably some additional changes I need to make in this PR, but I think it's a point where it can be reviewed.

stale · 2021-08-11T16:00:54Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 21 days if no further activity occurs. Thank you for your contributions.

maxenglander · 2021-09-02T04:19:41Z

Hi @ptabor @jingyih can we re-open this? I didn't mean to let it close.

maxenglander · 2021-09-02T04:19:50Z

/remove-stale

lilic · 2021-09-02T06:43:27Z

@maxenglander I reopened, do you mind rebasing with the main branch, thanks!

Enable new learners to be marked for automatic promotion to voters upon catching up with the leader. Adds the ability to supply one or more promotion rules when adding a new learnere. Promotion rules govern if and when a learner may be promoted to a new role. Currently only promotion to voter is supported, but the APIs added in this commit are flexible and allow for backwards-compatible introduction of additional promotion targets (e.g. reader).

maxenglander · 2021-09-11T17:05:59Z

Thank you for re-opening @lilic! I've rebased.

Ping @jingyih @ptabor let me know if there's a good way to move this PR forward. I'm still interested in getting this merged.

stale · 2021-12-11T14:20:06Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 21 days if no further activity occurs. Thank you for your contributions.

stale · 2022-06-11T02:59:39Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 21 days if no further activity occurs. Thank you for your contributions.

gyuho added this to the etcd-v3.5 milestone Jul 12, 2019

jingyih reviewed Jul 16, 2019

View reviewed changes

maxenglander mentioned this pull request Jul 20, 2019

docs: auto-promoting nodes #10913

Closed

gyuho mentioned this pull request Jul 26, 2019

Release etcd 3.4 #10943

Closed

19 tasks

JackyChiu mentioned this pull request Mar 23, 2020

Add Member During Network Partition edegens/ptype#63

Merged

stale bot added the stale label Apr 6, 2020

stale bot removed the stale label Apr 26, 2020

maxenglander force-pushed the auto-promote-learners branch 4 times, most recently from df82df7 to d5bc5de Compare May 13, 2020 04:48

maxenglander force-pushed the auto-promote-learners branch 2 times, most recently from ed38329 to b16b74b Compare June 30, 2020 05:08

maxenglander requested a review from jingyih June 30, 2020 16:57

stale bot added the stale label Sep 28, 2020

stale bot removed the stale label Jan 14, 2021

This was referenced Feb 17, 2021

content/docs/next/learning: propose v3.5 auto-promotion in design-learner.md etcd-io/website#107

Closed

What's the plan for version 3.5? When will it be released? #12330

Closed

maxenglander force-pushed the auto-promote-learners branch from baf146a to fdb1712 Compare February 19, 2021 01:06

ptabor added the rel/3.5-nice-to-have label Mar 30, 2021

hexfusion mentioned this pull request May 17, 2021

3.5 release next steps: Code freeze Monday 5/17 #12970

Closed

4 tasks

stale bot added the stale label Aug 11, 2021

stale bot closed this Sep 2, 2021

lilic reopened this Sep 2, 2021

stale bot removed the stale label Sep 2, 2021

maxenglander force-pushed the auto-promote-learners branch 2 times, most recently from 1e29cb3 to 065a53e Compare September 11, 2021 16:05

maxenglander force-pushed the auto-promote-learners branch from 065a53e to bd0cfdb Compare September 11, 2021 16:56

stale bot added the stale label Dec 11, 2021

pacoxu mentioned this pull request Dec 14, 2021

RFE: use learner mode for joining etcd members kubernetes/kubeadm#1793

Open

serathius removed this from the etcd-v3.5 milestone Dec 15, 2021

stale bot removed the stale label Dec 15, 2021

serathius removed the rel/3.5-nice-to-have label Mar 8, 2022

stale bot added the stale label Jun 11, 2022

stale bot closed this Jul 10, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

etcdserver: add ability to auto-promote learners to voters #10887

etcdserver: add ability to auto-promote learners to voters #10887

maxenglander commented Jul 11, 2019 •

edited

Loading

jingyih commented Jul 11, 2019

jingyih commented Jul 11, 2019

maxenglander commented Jul 12, 2019

WIZARD-CXY commented Jul 12, 2019

WIZARD-CXY commented Jul 14, 2019

maxenglander commented Jul 14, 2019

jingyih left a comment

jingyih Jul 16, 2019

maxenglander Jul 17, 2019

jingyih Jul 16, 2019

maxenglander Jul 17, 2019

maxenglander commented Jul 16, 2019

jingyih commented Jul 17, 2019

maxenglander commented Jul 17, 2019

jingyih commented Jul 19, 2019

maxenglander commented Jul 20, 2019

codecov-io commented Jul 25, 2019 •

edited

Loading

stale bot commented Apr 6, 2020

stale bot commented Sep 28, 2020

maxenglander commented Sep 29, 2020

maxenglander commented Jan 14, 2021

maxenglander commented Feb 19, 2021

stale bot commented Aug 11, 2021

maxenglander commented Sep 2, 2021

maxenglander commented Sep 2, 2021

lilic commented Sep 2, 2021

maxenglander commented Sep 11, 2021

stale bot commented Dec 11, 2021

stale bot commented Jun 11, 2022

etcdserver: add ability to auto-promote learners to voters #10887

etcdserver: add ability to auto-promote learners to voters #10887

Conversation

maxenglander commented Jul 11, 2019 • edited Loading

Auto-promote learners

jingyih commented Jul 11, 2019

jingyih commented Jul 11, 2019

maxenglander commented Jul 12, 2019

WIZARD-CXY commented Jul 12, 2019

WIZARD-CXY commented Jul 14, 2019

maxenglander commented Jul 14, 2019

jingyih left a comment

Choose a reason for hiding this comment

jingyih Jul 16, 2019

Choose a reason for hiding this comment

maxenglander Jul 17, 2019

Choose a reason for hiding this comment

jingyih Jul 16, 2019

Choose a reason for hiding this comment

maxenglander Jul 17, 2019

Choose a reason for hiding this comment

maxenglander commented Jul 16, 2019

jingyih commented Jul 17, 2019

maxenglander commented Jul 17, 2019

jingyih commented Jul 19, 2019

maxenglander commented Jul 20, 2019

codecov-io commented Jul 25, 2019 • edited Loading

Codecov Report

stale bot commented Apr 6, 2020

stale bot commented Sep 28, 2020

maxenglander commented Sep 29, 2020

maxenglander commented Jan 14, 2021

maxenglander commented Feb 19, 2021

stale bot commented Aug 11, 2021

maxenglander commented Sep 2, 2021

maxenglander commented Sep 2, 2021

lilic commented Sep 2, 2021

maxenglander commented Sep 11, 2021

stale bot commented Dec 11, 2021

stale bot commented Jun 11, 2022

maxenglander commented Jul 11, 2019 •

edited

Loading

codecov-io commented Jul 25, 2019 •

edited

Loading