Client should wait before retrying PATCH/PUT/POSTs in case of http 429 from server #222

Closed
shyamjvs opened this issue Jun 27, 2017 · 27 comments
Assignees
Labels
kind/bug Categorizes issue or PR as related to a bug.
lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed.

Comments

@shyamjvs
Member

We found recently, while running 4000-node cluster tests, that the apiserver was saturated with requests and returning 429s, yet clients kept sending requests.
In particular, the kubelets and NPDs on the nodes were continually retrying PATCH/PUT requests on failure (leading to thousands of QPS of 429s), even though they are designed to send updates only once per minute. So this is most likely an issue with client-go.

Following from discussion in kubernetes/node-problem-detector#124

cc @Random-Liu @gmarek @kubernetes/sig-scalability-misc

@shyamjvs
Member Author

cc @yujuhong

@shyamjvs shyamjvs added the bug label Jun 27, 2017
@shyamjvs
Member Author

We don't want to eventually end up stuck in the risky state of getting nothing but 429s.

@lavalamp
Member

lavalamp commented Jun 27, 2017 via email

@yujuhong

Do the clients not have a configured rate limit?

The client had a rate limit, and I believe the QPS was still within that limit. Maybe the problem is that the limit needs to be even lower for a cluster of this scale?

BTW, I was wrong in the initial issue. The rest client does not retry non-GET requests. I'm not sure what's causing the high number of retries on 429 (both NPD's and the kubelet's retry loops have large intervals).
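For reference, the client-side rate limit in question is the one set on the REST client config. A minimal sketch, with illustrative QPS/Burst values (not recommendations); note that this limiter only caps the request rate and adds no backoff of its own on 429s:

```go
package main

import (
	"fmt"

	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

func main() {
	// In-cluster config, as used by node agents such as the kubelet or NPD.
	cfg, err := rest.InClusterConfig()
	if err != nil {
		panic(err)
	}

	// Client-side rate limit: at most 5 requests/second with bursts up to 10.
	// These values are illustrative. The limiter bounds how fast this client
	// sends requests, but it adds no extra backoff when the server answers 429.
	cfg.QPS = 5
	cfg.Burst = 10

	clientset, err := kubernetes.NewForConfig(cfg)
	if err != nil {
		panic(err)
	}
	fmt.Printf("clientset %T created with QPS=%v Burst=%v\n", clientset, cfg.QPS, cfg.Burst)
}
```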

@caesarxuchao
Member

As I understand it, this is not a release blocker, but we need to fix it.

@caesarxuchao caesarxuchao self-assigned this Jun 27, 2017
@caesarxuchao
Member

caesarxuchao commented Jun 27, 2017

@gmarek I vaguely remember you adding the backoffMgr. The history was lost when we moved the code to staging.

do you know if this comment is still true?
https://github.com/kubernetes/kubernetes/blob/master/staging/src/k8s.io/client-go/rest/client.go#L138-L139

```go
// readExpBackoffConfig handles the internal logic of determining what the
// backoff policy is.  By default if no information is available, NoBackoff.
// TODO Generalize this see #17727 .
func readExpBackoffConfig() BackoffManager {
```

The env vars were empty on my 1.6.4 GKE node.
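For context, a minimal sketch of how that env-var path could be exercised. The KUBE_CLIENT_BACKOFF_BASE / KUBE_CLIENT_BACKOFF_DURATION names are quoted from memory of rest/client.go, so treat them as an assumption and check the linked source:

```go
package main

import (
	"os"

	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

func main() {
	// The REST client builds an exponential, per-URL backoff manager only when
	// these variables parse as integers; otherwise it silently falls back to
	// NoBackoff, which matches what was observed on the GKE node above.
	// Variable names are from memory of rest/client.go; verify before relying on them.
	os.Setenv("KUBE_CLIENT_BACKOFF_BASE", "1")       // base backoff, in seconds
	os.Setenv("KUBE_CLIENT_BACKOFF_DURATION", "120") // cap on the backoff, in seconds

	cfg, err := rest.InClusterConfig()
	if err != nil {
		panic(err)
	}
	if _, err := kubernetes.NewForConfig(cfg); err != nil {
		panic(err)
	}
}
```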

@caesarxuchao
Member

caesarxuchao commented Jun 28, 2017

As Yuju mentioned, the client is only going to retry if net.IsConnectionReset(err) && r.verb == "GET".

The code @yujuhong mentioned is wrapped in if err != nil, and when the server returns 429, err is nil (see https://github.com/golang/go/blob/master/src/net/http/client.go#L461-L464), so that check doesn't apply and many operations, non-GETs included, do get retried.

If the server returns 429, before retrying the client sleeps, in total, for:

  1. the amount of time demanded by the server (its Retry-After), plus
  2. the exponential backoff recorded by the backoff manager; though by default the backoff manager is a no-op, so this term is 0.

Anyway, I think the client is following the apiserver's demand, so there is no bug on the client side.
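A minimal, self-contained sketch of the sleep computation described above (illustrative only; the real logic lives in client-go's rest/request.go):

```go
package main

import (
	"fmt"
	"net/http"
	"strconv"
	"time"
)

// retryDelay illustrates the two components described above: the wait demanded
// by the server via Retry-After, plus whatever the backoff manager adds
// (zero with the default NoBackoff manager).
func retryDelay(resp *http.Response, managerBackoff time.Duration) time.Duration {
	serverWait := time.Duration(0)
	if resp.StatusCode == http.StatusTooManyRequests {
		if s, err := strconv.Atoi(resp.Header.Get("Retry-After")); err == nil {
			serverWait = time.Duration(s) * time.Second
		}
	}
	return serverWait + managerBackoff
}

func main() {
	resp := &http.Response{
		StatusCode: http.StatusTooManyRequests,
		Header:     http.Header{"Retry-After": []string{"1"}},
	}
	// With NoBackoff the second term is 0, so the client sleeps exactly 1s here.
	fmt.Println(retryDelay(resp, 0))
}
```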

@Random-Liu
Member

Anyway, I think the client is following the apiserver's demand, so there is no bug on the client side.

Shouldn't we configure the backoff manager?

@caesarxuchao
Member

IMO the backoff manager is a safety net. The apiserver should specify a longer Retry-After duration if its load is heavy.

Maybe we should configure the backoff manager. @gmarek, do you know what values we should use to initialize it?

@yujuhong

IMO the backoff manager is a safety net. The apiserver should specify a longer Retry-After duration if its load is heavy.

I remember I also checked this yesterday. Of course there is always a TODO in the code: https://github.com/kubernetes/kubernetes/blob/v1.8.0-alpha.1/staging/src/k8s.io/apiserver/pkg/server/filters/maxinflight.go#L33
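For illustration, here is what that contract looks like on the server side in plain Go: a handler that sheds load with a 429 and a Retry-After header. This is a generic sketch, not the apiserver's max-in-flight filter, and the 10-second value is made up:

```go
package main

import "net/http"

// withLoadShedding rejects requests with 429 plus a Retry-After header whenever
// the overloaded() predicate says so; well-behaved clients are expected to wait
// at least that long before retrying.
func withLoadShedding(next http.Handler, overloaded func() bool) http.Handler {
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		if overloaded() {
			w.Header().Set("Retry-After", "10") // illustrative value, not a recommendation
			http.Error(w, "too many requests, please retry later", http.StatusTooManyRequests)
			return
		}
		next.ServeHTTP(w, r)
	})
}

func main() {
	busy := func() bool { return false } // plug in a real load signal here
	http.ListenAndServe(":8080", withLoadShedding(http.DefaultServeMux, busy))
}
```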

@gmarek
Contributor

gmarek commented Jun 28, 2017

@caesarxuchao - that's a very good question, and it probably has a very complex answer: it depends. It's fine to back off for quite some time for Status updates (with the exception of NodeStatus, which serves as a heartbeat), but we probably don't want to back off too much for Spec updates. This should be figured out by @kubernetes/sig-api-machinery-bugs (@smarterclayton?). For now, can we put something non-zero there, e.g. 5ms?

(Disclaimer: I don't think I did anything around the backoff manager. :)

@lavalamp
Member

lavalamp commented Jun 28, 2017 via email

@caesarxuchao
Member

caesarxuchao commented Jun 28, 2017

We have a really common bug where people fail to read the entire contents of the response body,

It looks like it's taken care of: https://github.com/kubernetes/kubernetes/blob/v1.8.0-alpha.1/staging/src/k8s.io/client-go/rest/request.go#L846-L848
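For anyone hitting that bug in their own clients, the usual Go-side fix is to drain and close the response body so the underlying connection can be reused; a minimal sketch:

```go
package main

import (
	"io"
	"io/ioutil"
	"net/http"
)

// drainAndClose reads whatever is left in the body and closes it, so the
// underlying connection can be returned to the pool and reused instead of
// being torn down.
func drainAndClose(resp *http.Response) {
	if resp == nil || resp.Body == nil {
		return
	}
	io.Copy(ioutil.Discard, resp.Body)
	resp.Body.Close()
}

func main() {
	resp, err := http.Get("https://example.com")
	if err != nil {
		return
	}
	defer drainAndClose(resp)
	// ... use resp.StatusCode / headers without necessarily reading the body ...
}
```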

@yujuhong

IMO folks making requests should understand what their requests are doing
and whether it makes sense to use the backoff manager. I am not comfortable
enabling it by default without thinking a lot more about it.

The backoff manager settings are not exposed through the REST client config; it gets them from environment variables. They are also not configurable based on the type of request. If we can make this more useful, perhaps we can remove the kubelet's own retry loop.

Given that the client retries up to 10 times (not configurable), I think we should lower the number of retries for node status updates in the kubelet. There is no point in sending the same status for a prolonged period of time when the kubelet could instead send a newer update (it generates one every 10s).

@caesarxuchao
Member

The exponential backoff manager was introduced in kubernetes/kubernetes#17529. Sorry @gmarek, I confused it with the throttler, which I believe you introduced ;)

@jayunit100, do you know why we used env vars rather than a config option to initialize the backoff manager?

I suggest that we do the following (sketched after this list):

  1. add a backoffManager to RESTClient and rest#Config
  2. deprecate readExpBackoffConfig after 2 releases
  3. move maxRetries to the backoffManager
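To make item 1 concrete, a hypothetical shape it could take; the field and type names below are invented for illustration and do not exist in client-go:

```go
// Hypothetical sketch of item 1 above; none of these fields or names exist in
// client-go today.
package rest

import "time"

// BackoffConfig is an illustrative replacement for the env-var driven
// readExpBackoffConfig: callers would set it on the Config instead of
// exporting environment variables on every node.
type BackoffConfig struct {
	Base       time.Duration // initial backoff step
	MaxBackoff time.Duration // upper bound on a single sleep
	MaxRetries int           // item 3: the retry budget moves here from the request
}

// Config shows only the proposed new field; the existing rest.Config fields are elided.
type Config struct {
	// Backoff, if non-nil, replaces the environment-variable lookup.
	Backoff *BackoffConfig
}
```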

@shyamjvs
Member Author

Linking issue kubernetes/kubernetes#47344 for tracking.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with an /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 31, 2017
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten
/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jan 30, 2018
@shyamjvs
Member Author

/remove-lifecycle rotten

@k8s-ci-robot k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Jan 30, 2018
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Apr 30, 2018
@shyamjvs
Member Author

shyamjvs commented Apr 30, 2018 via email

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Apr 30, 2018
@k8s-ci-robot k8s-ci-robot added kind/bug Categorizes issue or PR as related to a bug. and removed bug labels Jun 5, 2018
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 3, 2018
@nikhita
Member

nikhita commented Sep 3, 2018

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 3, 2018
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 2, 2018
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jan 1, 2019
@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

@k8s-ci-robot
Contributor

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
