[HPA e2e] Reduce possible number of scale steps to minimize stabilization test flakiness #116040

pbeschetnov · 2023-02-24T13:32:01Z

What type of PR is this?

/kind bug
/kind flake

What this PR does / why we need it:

This PR reduces HPA stabilization test flakiness.

Which issue(s) this PR fixes:

In this test we scale up 2 -> 4 and back down 4 -> 2, with scaleUp/scaleDown stabilization windows of 3m.

Consider the scale-up: the resource consumer requests CPU usage worth 4 replicas, but it is sometimes imprecise and appears to consume only 3 replicas' worth. So, after the stabilization window passes, we scale up from 2 to 3. Then we have to wait another 3m to scale up to 4 replicas. The test expects to scale up in one step, 2 -> 4, and to spend 3m on that. In reality it is 2 -> 3 -> 4 (6m), so the test times out.

I propose eliminating possible intermediate steps by always scaling between 2 and 3 replicas. This doesn't sacrifice precision coverage, because precision is verified in other HPA tests.

Does this PR introduce a user-facing change?

NONE

k8s-ci-robot · 2023-02-24T13:32:10Z

Hi @pbeschetnov. Thanks for your PR.

I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

k8s-ci-robot · 2023-02-24T13:32:11Z

This issue is currently awaiting triage.

If a SIG or subproject determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

sanposhiho

/lgtm
/ok-to-test

k8s-ci-robot · 2023-02-25T10:33:05Z

LGTM label has been added.

Git tree hash: d19aaccd9abe3c760bc038e073a97cdc593d34c7

sanposhiho · 2023-02-25T23:24:13Z

/retest

flaky #116061

pbeschetnov · 2023-02-27T08:21:24Z

/retest

pbeschetnov · 2023-02-27T09:30:34Z

/assign @mwielgus

mwielgus

/lgtm
/approve

k8s-ci-robot · 2023-03-07T10:01:53Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: mwielgus, pbeschetnov

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~test/e2e/autoscaling/OWNERS~~ [mwielgus]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

Reduce possible number of scale steps to improve test stability

e25badc

k8s-ci-robot added the needs-ok-to-test label Feb 24, 2023

k8s-ci-robot requested review from bskiba and mwielgus February 24, 2023 13:33

pbeschetnov changed the title ~~Reduce possible number of scale steps to reduce HPA stabilization test flakiness~~ Reduce possible number of scale steps to minimize HPA stabilization test flakiness Feb 24, 2023

pbeschetnov changed the title ~~Reduce possible number of scale steps to minimize HPA stabilization test flakiness~~ [HPA e2e] Reduce possible number of scale steps to minimize stabilization test flakiness Feb 24, 2023

sanposhiho reviewed Feb 25, 2023

k8s-ci-robot added the ok-to-test label and removed the needs-ok-to-test label Feb 25, 2023

k8s-ci-robot assigned sanposhiho Feb 25, 2023

k8s-ci-robot added the lgtm label Feb 25, 2023

k8s-ci-robot assigned mwielgus Feb 27, 2023

This was referenced Mar 6, 2023

fix(HPA): make a difference in SuccessfulRescale events between the resource metric and the container resource metric #116045

Merged

flaky tests: pull-kubernetes-e2e-autoscaling-hpa-cm and pull-kubernetes-e2e-autoscaling-hpa-cpu #116315

Closed

mwielgus approved these changes Mar 7, 2023

k8s-ci-robot added the approved label Mar 7, 2023

k8s-ci-robot merged commit 4eb29bc into kubernetes:master Mar 7, 2023

k8s-ci-robot added this to the v1.27 milestone Mar 7, 2023
