-
Notifications
You must be signed in to change notification settings - Fork 38.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Do not ignore unscheduled pods when NodeName not in set of worker nodes #92545
Conversation
When node scheduling tests were updated to use worker instead of master nodes the GetPodsScheduled function, which is tasked with getting all scheduled and not scheduled pods inadvertently was changed to ignore all pods that have an empty NodeName before checking whether pods had been scheduled or not. This updates the function to include pods without a NodeName in the check for unscheduled pods. Signed-off-by: hasheddan <georgedanielmangum@gmail.com>
/priority critical-urgent |
The following tests are currently being affected across a number of jobs: |
/assign @ahg-g |
This is a good one to look at for more context: https://prow.k8s.io/view/gcs/kubernetes-jenkins/logs/ci-kubernetes-gce-conformance-latest/1276467007495081987 |
@@ -1045,23 +1045,20 @@ func translateIPv4ToIPv6(ip string) string { | |||
// GetPodsScheduled returns a number of currently scheduled and not scheduled Pods on worker nodes. | |||
func GetPodsScheduled(workerNodes sets.String, pods *v1.PodList) (scheduledPods, notScheduledPods []v1.Pod) { | |||
for _, pod := range pods.Items { | |||
if workerNodes.Has(pod.Spec.NodeName) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Previously this was if !masterNodes.Has(pod.Spec.NodeName)
so this isn't fully going back to the same functionality. The check for unscheduled nodes could consider master nodes in this case, but from my understanding of this check that would be fine.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Very reasonable change for me.
Why we checked the NodeName is in names of workerNodes before checking the NodeName is empty (^^;)
/retest |
1 similar comment
/retest |
/lgtm Thanks @hasheddan ! /hold |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: hasheddan, Huang-Wei The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@hasheddan Thanks for doing this again.
/lgtm
/hold cancel
@@ -1045,23 +1045,20 @@ func translateIPv4ToIPv6(ip string) string { | |||
// GetPodsScheduled returns a number of currently scheduled and not scheduled Pods on worker nodes. | |||
func GetPodsScheduled(workerNodes sets.String, pods *v1.PodList) (scheduledPods, notScheduledPods []v1.Pod) { | |||
for _, pod := range pods.Items { | |||
if workerNodes.Has(pod.Spec.NodeName) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Very reasonable change for me.
Why we checked the NodeName is in names of workerNodes before checking the NodeName is empty (^^;)
/retest |
2 similar comments
/retest |
/retest |
#92563 |
/retest
…On Fri, Jun 26, 2020, 20:24 Kubernetes Prow Robot ***@***.***> wrote:
@hasheddan <https://github.com/hasheddan>: The following tests *failed*,
say /retest to rerun all failed tests:
Test name Commit Details Rerun command
pull-kubernetes-conformance-kind-ga-only-parallel 9ab31af
<9ab31af>
link
<https://prow.k8s.io/view/gcs/kubernetes-jenkins/pr-logs/pull/92545/pull-kubernetes-conformance-kind-ga-only-parallel/1276699460168060930> /test
pull-kubernetes-conformance-kind-ga-only-parallel
pull-kubernetes-e2e-gce 9ab31af
<9ab31af>
link
<https://prow.k8s.io/view/gcs/kubernetes-jenkins/pr-logs/pull/92545/pull-kubernetes-e2e-gce/1276699460247752704/> /test
pull-kubernetes-e2e-gce
pull-kubernetes-e2e-gce-100-performance 9ab31af
<9ab31af>
link
<https://prow.k8s.io/view/gcs/kubernetes-jenkins/pr-logs/pull/92545/pull-kubernetes-e2e-gce-100-performance/1276699460230975488/> /test
pull-kubernetes-e2e-gce-100-performance
Full PR test history
<https://prow.k8s.io/pr-history?org=kubernetes&repo=kubernetes&pr=92545>. Your
PR dashboard. Please help us cut down on flakes by linking to
<https://git.k8s.io/community/contributors/devel/sig-testing/flaky-tests.md#filing-issues-for-flaky-tests>
an open issue
<https://github.com/kubernetes/kubernetes/issues?q=is:issue+is:open> when
you hit one in your PR.
Instructions for interacting with me using PR comments are available here
<https://git.k8s.io/community/contributors/guide/pull-requests.md>. If
you have questions or suggestions related to my behavior, please file an
issue against the kubernetes/test-infra
<https://github.com/kubernetes/test-infra/issues/new?title=Prow%20issue:>
repository. I understand the commands that are listed here
<https://go.k8s.io/bot-commands>.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#92545 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAHADK3WQI7QGRRRPK2QUK3RYVQ5BANCNFSM4OJMZB5A>
.
|
/retest |
1 similar comment
/retest |
/retest Review the full test history for this PR. Silence the bot with an |
@hasheddan: The following tests failed, say
Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
/retest |
What type of PR is this?
/kind bug
What this PR does / why we need it:
When node scheduling tests were updated to use worker instead of master
nodes the GetPodsScheduled function, which is tasked with getting all
scheduled and not scheduled pods inadvertently was changed to ignore all
pods that have an empty NodeName before checking whether pods had been
scheduled or not. This updates the function to include pods without a
NodeName in the check for unscheduled pods.
Signed-off-by: hasheddan georgedanielmangum@gmail.com
Which issue(s) this PR fixes:
Relevant issues for failing tests: #92511 #92502
Follow up to #92509
Special notes for your reviewer:
Original PR: #92450
/cc @oomichi @BenTheElder
Does this PR introduce a user-facing change?:
Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.: