Add etcd WithRequireLeader option to API watches #89881

embano1 · 2020-04-06T07:54:37Z

What type of PR is this?
/kind feature

What this PR does / why we need it:
Watches against etcd in the API server can hang forever if the etcd
cluster loses quorum, e.g. when a majority of members crash. This change
improves the responsiveness (detection and reaction time) of API server
watches against etcd in these rare (but still possible) edge cases:
affected watches are terminated with "etcdserver: no leader" (ErrNoLeader).

Implementation behavior described by @jingyih:

The etcd server waits until it has been unable to find a leader for 3
election timeouts before it cancels existing streams. The value 3 is
currently a hard-coded constant. The election timeout defaults to 1000ms.

If the cluster is healthy and the leader is stopped, the leadership
transfer should be smooth (the leader transfers its leadership before
stopping). If the leader is hard-killed, the other servers take one
election timeout to realize the leader is lost and start campaigning.

For further details, discussion and validation see
#89488 (comment)
and etcd-io/etcd#8980.

Signed-off-by: Michael Gasch <mgasch@vmware.com>

Which issue(s) this PR fixes:
Fixes #89488

Does this PR introduce a user-facing change?:

"NONE"

/sig api-machinery

cc/ @dims @jingyih

jingyih

/lgtm

jingyih · 2020-04-06T14:20:10Z

cc @wojtek-t @jpbetz

wojtek-t

/hold

Until I better understand the consequences of this change.

staging/src/k8s.io/apiserver/pkg/storage/etcd3/watcher.go

dims · 2020-04-07T11:25:47Z

/priority important-soon

dims · 2020-04-07T11:26:28Z

/assign @wojtek-t
/approve
/lgtm

wojtek-t · 2020-04-08T11:38:54Z

/hold cancel
/lgtm
/approve

k8s-ci-robot · 2020-04-08T11:39:41Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dims, embano1, wojtek-t

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~staging/src/k8s.io/apiserver/pkg/storage/OWNERS~~ [wojtek-t]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot · 2020-04-08T12:32:22Z

@embano1: The following test failed, say /retest to rerun all failed tests:

| Test name | Commit | Details | Rerun command |
| --- | --- | --- | --- |
| pull-kubernetes-node-e2e-containerd | 45f4e1a137e378962f44ccd8620417f9e948f873 | link | `/test pull-kubernetes-node-e2e-containerd` |

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

fejta-bot · 2020-04-08T15:08:25Z

/retest
This bot automatically retries jobs that failed/flaked on approved PRs (send feedback to fejta).

Review the full test history for this PR.

Silence the bot with an /lgtm cancel or /hold comment for consistent failures.

k8s-ci-robot requested review from timothysc and wojtek-t April 6, 2020 07:55

jingyih reviewed Apr 6, 2020

k8s-ci-robot assigned jingyih Apr 6, 2020

k8s-ci-robot added the lgtm label Apr 6, 2020

wojtek-t reviewed Apr 6, 2020

staging/src/k8s.io/apiserver/pkg/storage/etcd3/watcher.go

k8s-ci-robot added priority/important-soon and removed needs-priority labels Apr 7, 2020

k8s-ci-robot assigned wojtek-t and dims Apr 7, 2020

k8s-ci-robot added the lgtm label Apr 7, 2020

k8s-ci-robot added size/S and removed lgtm, size/XS labels Apr 7, 2020

k8s-ci-robot added lgtm and removed do-not-merge/hold labels Apr 8, 2020

k8s-ci-robot added the approved label Apr 8, 2020

k8s-ci-robot merged commit 3c3fc80 into kubernetes:master Apr 8, 2020

k8s-ci-robot added this to the v1.19 milestone Apr 8, 2020

aojea mentioned this pull request Jul 18, 2022

Prolonged etcd reelection causes api-server to close all watchers. #111116

Closed
