In autoscaling tests, add PDBs for more kube-system pods #52796

Conversation

aleksandra-malinowska (Contributor)

This adds PDBs for more kube-system pods in scale-down tests. It should reduce flakes caused by evenly distributed system components blocking scale-down of all nodes.
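
For context, creating one of these PDBs boils down to a single client-go call. The sketch below is illustrative only: the helper name addTestPdb, the test-pdb-for- naming scheme, and the use of the modern policy/v1 API are assumptions, not necessarily what the e2e code does.

    import (
        "context"

        policyv1 "k8s.io/api/policy/v1"
        metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
        "k8s.io/apimachinery/pkg/util/intstr"
        "k8s.io/client-go/kubernetes"
    )

    // addTestPdb (hypothetical) creates a PodDisruptionBudget in kube-system that
    // selects pods by their k8s-app label. With minAvailable set to 0, every
    // replica may be evicted, so the pod can never block scale-down of its node.
    func addTestPdb(ctx context.Context, c kubernetes.Interface, label string, minAvailable int) error {
        minAvail := intstr.FromInt(minAvailable)
        pdb := &policyv1.PodDisruptionBudget{
            ObjectMeta: metav1.ObjectMeta{
                Name:      "test-pdb-for-" + label, // illustrative naming scheme
                Namespace: metav1.NamespaceSystem,  // "kube-system"
            },
            Spec: policyv1.PodDisruptionBudgetSpec{
                Selector:     &metav1.LabelSelector{MatchLabels: map[string]string{"k8s-app": label}},
                MinAvailable: &minAvail,
            },
        }
        _, err := c.PolicyV1().PodDisruptionBudgets(metav1.NamespaceSystem).Create(ctx, pdb, metav1.CreateOptions{})
        return err
    }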

@aleksandra-malinowska added area/test, kind/flake, release-note-none, retest-not-required, and sig/autoscaling labels Sep 20, 2017
@aleksandra-malinowska added this to the v1.8 milestone Sep 20, 2017
@aleksandra-malinowska self-assigned this Sep 20, 2017
@k8s-ci-robot added size/XS and cncf-cla: yes labels Sep 20, 2017
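
The change extends the list of kube-system PDBs created by the test with these entries (min_available: 0 means no replica has to stay up, so these pods never block node eviction):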
{label: "kubernetes-dashboard", min_available: 0},
{label: "l7-default-backend", min_available: 0},
{label: "heapster", min_available: 0},
Contributor

Are you sure all of those can be safely restarted? I am rather worried about restarting heapster (or any other critical system pod) in e2e. We had enough pain with the rescheduler tainting our nodes already.

Contributor Author

It's hard to be sure, but:

  1. in tests with broken nodes, we don't pick the node to break based on any of those pods, so we sometimes accidentally make them unavailable and force a reschedule anyway,
  2. we have timeouts and retry logic that should be enough to cover these scenarios if everything works as expected (see the sketch after this list),
  3. if it doesn't work as expected, I think we should fail, even if it's not CA that is responsible.
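
A minimal sketch of the timeout-and-retry polling that point 2 refers to, built on wait.Poll from k8s.io/apimachinery; the durations and both helper names are hypothetical, not the actual e2e helpers:

    import (
        "context"
        "time"

        corev1 "k8s.io/api/core/v1"
        metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
        "k8s.io/apimachinery/pkg/util/wait"
        "k8s.io/client-go/kubernetes"
    )

    // waitForClusterSize (hypothetical) polls until the cluster has the expected
    // number of Ready nodes or the timeout expires. Transient errors, e.g. while
    // a system pod is being rescheduled, simply trigger another retry.
    func waitForClusterSize(c kubernetes.Interface, expected int, timeout time.Duration) error {
        return wait.Poll(20*time.Second, timeout, func() (bool, error) {
            ready, err := countReadyNodes(c)
            if err != nil {
                return false, nil // tolerate transient API errors and retry
            }
            return ready == expected, nil
        })
    }

    // countReadyNodes (hypothetical) counts nodes whose NodeReady condition is True.
    func countReadyNodes(c kubernetes.Interface) (int, error) {
        nodes, err := c.CoreV1().Nodes().List(context.TODO(), metav1.ListOptions{})
        if err != nil {
            return 0, err
        }
        ready := 0
        for _, n := range nodes.Items {
            for _, cond := range n.Status.Conditions {
                if cond.Type == corev1.NodeReady && cond.Status == corev1.ConditionTrue {
                    ready++
                }
            }
        }
        return ready, nil
    }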

Contributor

I don't necessarily agree with point 3 above. However, we discussed offline with @aleksandra-malinowska and it looks like restarting heapster should no longer break tests.

@MaciekPytel (Contributor)

/lgtm

@k8s-ci-robot added the lgtm label Sep 20, 2017
@MaciekPytel (Contributor)

/approve no-issue

@mwielgus (Contributor)

/approve no-issue

@mwielgus added the approved label Sep 20, 2017
@k8s-github-robot

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: MaciekPytel, aleksandra-malinowska, mwielgus

Associated issue requirement bypassed by: mwielgus

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these OWNERS Files:

You can indicate your approval by writing /approve in a comment
You can cancel your approval by writing /approve cancel in a comment

@k8s-github-robot

Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here.

@k8s-github-robot merged commit 3f447dd into kubernetes:master Sep 20, 2017
cblecker pushed a commit to cblecker/kubernetes that referenced this pull request Sep 21, 2017
…aling-test-fix-4

Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md

Improve cluster autoscaling tests logging and error checking during cleanup

This adds extra logs and error checks to autoscaling tests during PodDisruptionBudgets cleanup. It should help with identifying flake causes. Follow up to kubernetes#52796
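
A hedged sketch of what such logging and error checking during PDB cleanup might look like; the function name, log messages, and error-handling strategy are illustrative assumptions, not the actual follow-up change:

    import (
        "context"
        "log"

        metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
        "k8s.io/client-go/kubernetes"
    )

    // cleanupPdbs (hypothetical) deletes the named PDBs, logging each attempt and
    // recording failures instead of silently dropping them, so a flaky cleanup
    // shows up both in the test logs and in the returned error.
    func cleanupPdbs(ctx context.Context, c kubernetes.Interface, names []string) error {
        var lastErr error
        for _, name := range names {
            log.Printf("deleting PDB %q in kube-system", name)
            if err := c.PolicyV1().PodDisruptionBudgets("kube-system").Delete(ctx, name, metav1.DeleteOptions{}); err != nil {
                log.Printf("failed to delete PDB %q: %v", name, err)
                lastErr = err // keep going; report the last failure at the end
            }
        }
        return lastErr
    }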