
Fix endless loops in test.sh #692

Closed

Conversation

ifireball
Contributor

test.sh could go into an endless loop waiting for containers to start
up. This patch:

  • Puts a hard limit on the number of cycles the script can spend in the
    wait loops (see the sketch below)
  • Detects the scenario where the cluster has apparently died, so things
    would never come up.
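
Roughly, the wait loops now take the following shape (an illustrative sketch only; MAX_WAIT_CYCLES, the messages, and the exact structure are placeholders rather than the literal patch):

# Sketch: bounded wait loop that also bails out if the cluster stops answering
MAX_WAIT_CYCLES=60
cycles=0
while true; do
    # If kubectl itself fails, the cluster has probably died; give up right away
    if ! pods=$(kubectl get pods -n kube-system --no-headers); then
        echo "Cluster seems to have died, aborting" >&2
        exit 1
    fi
    # Done once every pod reports Running
    if ! echo "$pods" | grep -qv Running; then
        break
    fi
    cycles=$((cycles + 1))
    if [ "$cycles" -ge "$MAX_WAIT_CYCLES" ]; then
        echo "Timed out waiting for pods to enter the Running state" >&2
        exit 1
    fi
    sleep 10
done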

@rmohr
Member

rmohr commented Feb 1, 2018

can't you just set a hard limit on the job? That is at least what we do on our CI.
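
For example, something along these lines at the job level (just an illustration; the 30-minute limit and the script path are made-up values, not what our CI actually uses):

# Abort the whole run if it exceeds the job's hard time limit (example value)
timeout 30m ./test.sh
rc=$?
if [ "$rc" -eq 124 ]; then
    echo "test.sh hit the 30 minute job limit" >&2
fi
exit "$rc"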

echo "Waiting for kubevirt pods to enter the Running state ..."
kubectl get pods -n kube-system --no-headers | >&2 grep -v Running || true
sleep 10
if ! kctl_out=$(kubectl get pods -n kube-system --no-headers); then
Member

It is good to have limited retries; however, could you do that in a slightly more sophisticated and reusable way? Maybe with something like a backoff-retry, as is done here: https://coderwall.com/p/--eiqg/exponential-backoff-in-bash ?
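
Something in this direction (a sketch of the pattern from that link; the function name and defaults are illustrative):

# Retry a command with exponential backoff, giving up after a fixed number of attempts
with_backoff() {
    local max_attempts=${MAX_ATTEMPTS:-5}
    local delay=${INITIAL_DELAY:-10}
    local attempt=1

    until "$@"; do
        if [ "$attempt" -ge "$max_attempts" ]; then
            echo "Giving up on '$*' after $attempt attempts" >&2
            return 1
        fi
        echo "Attempt $attempt failed, retrying in ${delay}s ..." >&2
        sleep "$delay"
        delay=$((delay * 2))
        attempt=$((attempt + 1))
    done
}

# e.g.: with_backoff kubectl get pods -n kube-system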

Contributor Author

I don't think exponential backoff would be useful here; backing off wouldn't effectively free up any usable resources, since the slave running the job would still be held for the whole run.

Anyway, I refactored the code to make it look a little nicer.

@ifireball
Contributor Author

We're going to make the job impose a hard timeout externally too, but it's better to also fix this here so that more accurate output can be shown.

kubectl get pods -n kube-system -o'custom-columns=status:status.containerStatuses[*].ready,metadata:metadata.name' --no-headers | awk '/virt-controller/ && /true/' | wc -l
sleep 10
done
retry_check containers_ready 'KubeVirt virt-controller container' 'virt-controller'
Member

I am not sure this will work. Wouldn't that require all virt-controller instances to be ready? We only need one (and actually only one can be ready at a time).

Contributor Author

Hmm, I want to find a way to do that without having to write another function just to get a slightly different condition.

Can I assume the line showing the ready virt-controller will not contain the string 'false'?
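
If that assumption holds, the existing check could stay almost the same and just look for the absence of 'false' (a sketch of the idea, not the final code):

# Count virt-controller pods whose container ready column never says 'false'
kubectl get pods -n kube-system \
    -o'custom-columns=status:status.containerStatuses[*].ready,metadata:metadata.name' \
    --no-headers | awk '/virt-controller/ && !/false/' | wc -l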

@ifireball
Contributor Author

ci test please

@ifireball force-pushed the fix-tester-infinite-loop branch 3 times, most recently from aef3dce to ac4a63b on February 4, 2018 at 09:17
`test.sh` could go into an endless loop waiting for containers to start
up. This patch:
- Puts a hard limit for the amount of cycles the script can spend in the
  wait loops
- Detects the scenario where the cluster apparently died so things would
  never come up.
@ifireball
Contributor Author

@rmohr now the test is finally timing out on starting one of the KubeVirt containers. Is 10m not enough? How long do you think this should take?

The container that is failing to start is:

kube-dns-6f4fd4bdf-p75cb

The system seems to be trying to start 3 copies of it and managing to start only two.

@rmohr
Member

rmohr commented Feb 5, 2018

@ifireball that sounds like a problem with setting up k8s itself. It is very likely a problem in kubeadm, k8s or weave ...

I hope that we can soon start pre-creating k8s clusters and only boot preinstalled clusters in our VMs, to avoid such inconveniences ...

@rmohr
Member

rmohr commented Feb 5, 2018

@cynepco3hahue what @ifireball says regarding kube-dns seems to be related to the issue you told me about on IRC.

@rmohr
Member

rmohr commented Feb 6, 2018

Somewhat depends on the outcome of #710

@eedri

eedri commented Feb 14, 2018

#710 is merged, is this patch still blocked?

@ifireball
Contributor Author

ci test please

1 similar comment
@ifireball
Contributor Author

ci test please

@rmohr
Member

rmohr commented Feb 14, 2018

ok to test

@ifireball
Contributor Author

ci test please

1 similar comment
@eedri

eedri commented Feb 28, 2018

ci test please

@cynepco3hahue

CI still has a lot of trouble with Vagrant machines:

15:47:16 [check-patch.el7.x86_64] An action 'halt' was attempted on the machine 'node0',
15:47:16 [check-patch.el7.x86_64] but another process is already executing an action on the machine.
15:47:16 [check-patch.el7.x86_64] Vagrant locks each machine for access by only one process at a time.
15:47:16 [check-patch.el7.x86_64] Please wait until the other Vagrant process finishes modifying this
15:47:16 [check-patch.el7.x86_64] machine, then try again.

@rmohr
Member

rmohr commented Feb 28, 2018

retest this please

2 similar comments
@fabiand
Member

fabiand commented Mar 13, 2018

retest this please

@eedri

eedri commented Apr 1, 2018

retest this please

@eedri

eedri commented Apr 1, 2018

ci test please

@eedri

eedri commented Apr 30, 2018

Any update?

@eedri

eedri commented Apr 30, 2018

@fabiand should we abandon this patch? It doesn't seem like it's going anywhere.

@fabiand closed this on Jun 11, 2018
kubevirt-bot pushed a commit to kubevirt-bot/kubevirt that referenced this pull request Nov 6, 2020
kubevirt-bot pushed a commit to kubevirt-bot/kubevirt that referenced this pull request Dec 7, 2021
Signed-off-by: Federico Gimenez <fgimenez@redhat.com>