Start autoscaler in panic mode. #4795

vagababov · 2019-07-18T06:11:18Z

When the autoscaler restarts, it has no memory of the previous stats and immediately scales the deployment down. Under my load test conditions,
the autoscaler goes from 7 pods to 1, then to 2, then to 6, before finally reaching 7 again after a few cycles.
This is not desirable behavior, so this change fixes it by
starting the autoscaler in panic mode, with the panic pod count equal to the current pod count,
if there is more than one serving pod right now.

If the deployment is scaled to 0, there's no reason to change the logic,
and if it is at 1, we won't scale below 1 for the next stable window anyway, so there's no need to panic either.
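
As a rough illustration of the rule described above (not the code in this PR), a minimal Go sketch might look like the following. The type `autoscalerState`, its fields, and the functions `newAutoscalerState` and `desiredPods` are hypothetical names chosen for this example. The sketch also assumes that, while panicking, the autoscaler never scales below the panic pod count.

```go
package main

import (
	"fmt"
	"time"
)

// autoscalerState is a hypothetical stand-in for the autoscaler's in-memory
// state; the field names are illustrative and not taken from the PR.
type autoscalerState struct {
	panicTime    *time.Time // nil means "not in panic mode"
	maxPanicPods int32      // scale floor while in panic mode
}

// newAutoscalerState seeds the state from the number of pods serving right now.
// With more than one pod it starts in panic mode, pinned at the current count.
// With 0 or 1 pods it starts in stable mode, since there is nothing to protect.
func newAutoscalerState(currentPods int32) autoscalerState {
	if currentPods > 1 {
		now := time.Now()
		return autoscalerState{panicTime: &now, maxPanicPods: currentPods}
	}
	return autoscalerState{}
}

// desiredPods applies the assumed panic-mode rule: while panicking, the
// computed pod count is never allowed to drop below maxPanicPods.
func (s autoscalerState) desiredPods(computed int32) int32 {
	if s.panicTime != nil && computed < s.maxPanicPods {
		return s.maxPanicPods
	}
	return computed
}

func main() {
	// Right after a restart there are no stats, so the computed count is low.
	const computedWithoutStats = 1
	for _, pods := range []int32{0, 1, 7} {
		s := newAutoscalerState(pods)
		fmt.Printf("current=%d panicking=%t desired=%d\n",
			pods, s.panicTime != nil, s.desiredPods(computedWithoutStats))
	}
}
```

With 7 current pods the sketch holds the deployment at 7 instead of dropping to 1, which is the behavior this change is after.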

/assign @mattmoor @markusthoemmes

This is the GA scope of #2930

/lint

The autoscaler now starts in panic mode and does not scale the deployment down after an autoscaler restart.

knative-prow-robot

@vagababov: 0 warnings.

markusthoemmes

Simple but very effective. Love it ❤️

mattmoor · 2019-07-18T13:47:31Z

pkg/autoscaler/autoscaler.go

@@ -24,6 +24,7 @@ import (
 	"sync"
 	"time"

+	"github.com/wacul/ptr"


🤦‍♂ Can we just add this to knative/pkg rather than juggling packages for this?

I don't think we need it at all. A local variable will do as well.

That would work too :)

https://github.com/knative/pkg/pull/519/files
Well, as soon as matt does this!

should be done
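
For readers unfamiliar with the point about not needing a pointer package, here is a small Go sketch of the two options discussed above. `int32Ptr` is a hypothetical helper written for this example; it is not the API of any package named in the thread.

```go
package main

import "fmt"

// int32Ptr is a hypothetical helper of the kind a pointer-utility package
// provides: it returns a pointer to a copy of its argument.
func int32Ptr(v int32) *int32 { return &v }

func main() {
	// Option 1: go through a helper function.
	viaHelper := int32Ptr(7)

	// Option 2: no helper at all. Assign to a local variable and take its
	// address; Go keeps the variable alive for as long as the pointer is used.
	pods := int32(7)
	viaLocal := &pods

	fmt.Println(*viaHelper == *viaLocal)
}
```

A helper is mostly a convenience for literals and call results, which cannot be addressed directly; when a named variable already exists, `&variable` is enough.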

vagababov · 2019-07-19T03:57:32Z

This is ready

mattmoor

/lgtm
/approve

knative-prow-robot · 2019-07-19T04:02:49Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: mattmoor, vagababov

Needs approval from an approver in each of these files:

~~OWNERS~~ [mattmoor]

vagababov · 2019-07-19T04:47:08Z

/test pull-knative-serving-integration-tests

hmmm.. 🚀

knative-prow-robot assigned markusthoemmes and mattmoor Jul 18, 2019

knative-prow-robot added the do-not-merge/work-in-progress label Jul 18, 2019

googlebot added the cla: yes label Jul 18, 2019

knative-prow-robot requested review from josephburnett and mdemirhan July 18, 2019 06:11

knative-prow-robot added the size/S and approved labels Jul 18, 2019

knative-prow-robot reviewed Jul 18, 2019

knative-prow-robot added the area/autoscale label Jul 18, 2019

markusthoemmes reviewed Jul 18, 2019

mattmoor reviewed Jul 18, 2019

vagababov added 2 commits July 18, 2019 11:10

Merge branch 'master' into 2930-panic

7ddbe12

Merge branch 'master' into 2930-panic

ae417f7

knative-prow-robot added the size/L label and removed the size/S label Jul 18, 2019

vagababov marked this pull request as ready for review July 18, 2019 22:44

knative-prow-robot removed the do-not-merge/work-in-progress label Jul 18, 2019

unit tests

fb3522f

vagababov force-pushed the 2930-panic branch from b9b5e88 to fb3522f Compare July 18, 2019 22:59

updates to the imports

8aa5206

knative-prow-robot added the size/XL label and removed the approved and size/L labels Jul 19, 2019

merge

e934cee

vagababov force-pushed the 2930-panic branch from a349247 to e934cee Compare July 19, 2019 03:56

knative-prow-robot added the size/L label and removed the size/XL label Jul 19, 2019

mattmoor reviewed Jul 19, 2019

knative-prow-robot added the lgtm label Jul 19, 2019

knative-prow-robot added the approved label Jul 19, 2019

knative-prow-robot merged commit b52b020 into knative:master Jul 19, 2019

vagababov deleted the 2930-panic branch September 20, 2019 18:22
