dind: only wait for Ready non-sdn nodes #8099

marun · 2016-03-17T17:14:13Z

The 'wait-for-cluster' command of hack/dind-cluster.sh was previously
evaluating all nodes when determining whether the cluster's nodes were
seen to be 'Ready' and not excluding 'NotReady'. The command now
excludes the sdn node, whose state is not relevant for determining
cluster readiness, and ensures that NotReady nodes are properly
excluded.

This should fix test flakes when the first networking test(s) lack for nodes.

marun · 2016-03-17T17:14:26Z

[testonlyextended][extended:networking]

marun · 2016-04-13T07:15:14Z

Filtering for a string (Ready) without explicitly excluding an unwanted token that embeds said string (NotReady) is not the recipe for success one might imagine.

The 'wait-for-cluster' command of hack/dind-cluster.sh was previously evaluating all nodes when determining whether the cluster's nodes were seen to be 'Ready' and not excluding 'NotReady'. The command now excludes the sdn node, whose state is not relevant for determining cluster readiness, and ensures that NotReady nodes are properly excluded.

marun · 2016-04-13T07:22:44Z

cc: @openshift/networking

openshift-bot · 2016-04-13T07:25:19Z

Evaluated for origin testonlyextended up to 7d93bad

openshift-bot · 2016-04-13T08:23:45Z

continuous-integration/openshift-jenkins/testonlyextended SUCCESS (https://ci.openshift.redhat.com/jenkins/job/test_pr_origin_extended/10/) (Extended Tests: networking)

danwinship · 2016-04-13T14:00:24Z

hack/dind-cluster.sh

-oc get nodes | grep Ready | wc -l")
-    node_count=$(echo "${node_count}" | tr -d '\r')
-    test "${node_count}" -ge "${NODE_COUNT}"
+oc get nodes | grep -v ${SDN_NODE_NAME} | grep -v NotReady | grep Ready | wc -l")


Maybe grep -v SchedulingDisabled instead of grep -v ${SDN_NODE_NAME}?

(Also, we really shouldn't be calling the master's node "the SDN node"... that suggests it's somehow important to the overall functioning of the SDN, which it isn't.)

Maybe grep -v SchedulingDisabled instead of grep -v ${SDN_NODE_NAME}?

I guess actually the math won't work with ${NODE_COUNT} if there was some other unschedulable node. So, ok. LGTM as is

What would you suggest calling the 'sdn node' instead?

"the master" ? or "the node process on the master"

I'm not stuck on 'sdn node', but I think a good name should reflect in some way that the node is required to ensure that the master has connectivity to the pods. I don't think either of those suggestions are sufficiently descriptive in that regard.

I do think you're right about filtering on SchedulingDisabled, though, since that is what the e2e tests check for.

eparis · 2016-04-13T17:49:47Z

only touches dind, no regression risk. approved [merge]

knobunc · 2016-04-13T17:49:53Z

LGTM.

openshift-bot · 2016-04-13T17:55:21Z

Evaluated for origin merge up to 7d93bad

openshift-bot · 2016-04-13T17:55:21Z

[Test]ing while waiting on the merge queue

marun · 2016-04-13T20:16:18Z

eparis: will the failure block the merge? despite what the bot says, the networking job did not fail.

marun · 2016-04-13T21:41:18Z

re-[test]

marun · 2016-04-13T21:44:43Z

@danmcp @eparis option to force a merge even with failing tests? The only job that this PR impacts is passing, so having to jump through hoops to get unrelated flakes to pass seems like the very definition of 'waste of time'.

openshift-bot · 2016-04-13T21:45:20Z

Evaluated for origin test up to 7d93bad

openshift-bot · 2016-04-13T22:00:22Z

continuous-integration/openshift-jenkins/merge SUCCESS (https://ci.openshift.redhat.com/jenkins/job/merge_pull_requests_origin/5589/) (Image: devenv-rhel7_3968)

openshift-bot · 2016-04-13T23:44:42Z

continuous-integration/openshift-jenkins/test SUCCESS (https://ci.openshift.redhat.com/jenkins/job/test_pr_origin/2983/) (Extended Tests: networking)

marun changed the title ~~dind: only wait for non-sdn nodes~~ WIP dind: only wait for non-sdn nodes Mar 17, 2016

danwinship mentioned this pull request Mar 21, 2016

Add kube component config tests, disable /logs on master, update kube-proxy init #8131

Merged

marun force-pushed the dind-ignore-sdn-node branch from f934eb8 to d7df840 Compare April 13, 2016 07:11

marun changed the title ~~WIP dind: only wait for non-sdn nodes~~ dind: only wait for Ready non-sdn nodes Apr 13, 2016

marun force-pushed the dind-ignore-sdn-node branch from d7df840 to 7d93bad Compare April 13, 2016 07:15

danwinship reviewed Apr 13, 2016
View reviewed changes

marun mentioned this pull request Apr 13, 2016

Improve networking extended test reliability #8506

Merged

openshift-bot merged commit b910941 into openshift:master Apr 14, 2016

marun mentioned this pull request Apr 14, 2016

Added the oadm command to the items automatically provioned by vagrant #8389

Closed

marun deleted the dind-ignore-sdn-node branch April 15, 2016 23:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dind: only wait for Ready non-sdn nodes #8099

dind: only wait for Ready non-sdn nodes #8099

marun commented Mar 17, 2016

marun commented Mar 17, 2016

marun commented Apr 13, 2016

marun commented Apr 13, 2016

openshift-bot commented Apr 13, 2016

openshift-bot commented Apr 13, 2016

danwinship Apr 13, 2016

danwinship Apr 13, 2016

marun Apr 13, 2016

danwinship Apr 13, 2016

marun Apr 13, 2016

eparis commented Apr 13, 2016

knobunc commented Apr 13, 2016

openshift-bot commented Apr 13, 2016

openshift-bot commented Apr 13, 2016

marun commented Apr 13, 2016

marun commented Apr 13, 2016

marun commented Apr 13, 2016

openshift-bot commented Apr 13, 2016

openshift-bot commented Apr 13, 2016

openshift-bot commented Apr 13, 2016

dind: only wait for Ready non-sdn nodes #8099

dind: only wait for Ready non-sdn nodes #8099

Conversation

marun commented Mar 17, 2016

marun commented Mar 17, 2016

marun commented Apr 13, 2016

marun commented Apr 13, 2016

openshift-bot commented Apr 13, 2016

openshift-bot commented Apr 13, 2016

danwinship Apr 13, 2016

Choose a reason for hiding this comment

danwinship Apr 13, 2016

Choose a reason for hiding this comment

marun Apr 13, 2016

Choose a reason for hiding this comment

danwinship Apr 13, 2016

Choose a reason for hiding this comment

marun Apr 13, 2016

Choose a reason for hiding this comment

eparis commented Apr 13, 2016

knobunc commented Apr 13, 2016

openshift-bot commented Apr 13, 2016

openshift-bot commented Apr 13, 2016

marun commented Apr 13, 2016

marun commented Apr 13, 2016

marun commented Apr 13, 2016

openshift-bot commented Apr 13, 2016

openshift-bot commented Apr 13, 2016

openshift-bot commented Apr 13, 2016