kubectl wait timeout argument is poorly documented and ill-suited to waiting on multiple resources #1219

mgabeler-lee-6rs · 2022-05-25T15:48:02Z

This is just a re-submit of #754 which, despite being confirmed & assigned, was closed as stale without any fix.

What happened:

Run kubectl wait with a selector matching more than one resource and a timeout

What you expected to happen:

The timeout should apply to the wait command, not to the individual resources.

With the timeout applying to resources sequentially, it makes waiting on more than one resource with any kind of timeout basically unusable.

How to reproduce it (as minimally and precisely as possible):

Create a deployment scaled to 2 or more replicas, and a label that can be used to match it
Run: kubectl wait pod --selector=... --timeout=30s
Observe that this runs for N*30s, where N is the number of pods

Anything else we need to know?:

cc @eranreshef the original reporter and @JabusKotze who assigned the prior issue to themselves

The text was updated successfully, but these errors were encountered:

ardaguclu · 2022-05-26T12:21:50Z

/sig cli

brianpursley · 2022-06-14T17:37:26Z

Here is a way to reproduce:

kubectl apply -f - << EOF
apiVersion: apps/v1
kind: Deployment
metadata:
  labels:
    app: dtest
  name: dtest
spec:
  replicas: 2 
  selector:
    matchLabels:
      app: dtest
  template:
    metadata:
      labels:
        app: dtest
    spec:
      containers:
      - name: bb
        image: busybox
        command: ["/bin/sh", "-c", "sleep infinity"]
EOF

time kubectl wait pod --selector=app=dtest --for=condition=ItWillNeverBeThis --timeout=5s

Output:

timed out waiting for the condition on pods/dtest-56c46b55dd-7tq8r
timed out waiting for the condition on pods/dtest-56c46b55dd-hg7x9

real	0m10.083s
user	0m0.112s
sys	0m0.011s

^ shows the command took 10s (because Replicas=2) when the timeout itself was only supposed to be 5s.

mpuckett159 · 2022-06-22T21:43:32Z

/triage accept

This was discussed on the bug scrub today and we agree that this is not good behavior. To solve this we will need to implement either contexts or goroutines to run these waiters in parallel to more appropriately match the user expectation here.

k8s-ci-robot · 2022-06-22T21:43:33Z

@mpuckett159: The label(s) triage/accept cannot be applied, because the repository doesn't have them.

In response to this:

/triage accept

This was discussed on the bug scrub today and we agree that this is not good behavior. To solve this we will need to implement either contexts or goroutines to run these waiters in parallel to more appropriately match the user expectation here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

mpuckett159 · 2022-06-22T21:44:17Z

/triage accepted
whoops

vitalyrychkov · 2022-09-09T14:02:38Z

Hello,

We have a similar problem, expecting kubectl wait to wait for X seconds in total with "--timeout=Xs", e.g.:

kubectl  wait --for=condition=available --timeout=10m deployment --all

However it waits for X seconds * Number of deployments with not-ready pods. Could you please consider also our scenario in the fix?

Kind Regards,

Vitaly

k8s-triage-robot · 2022-12-08T14:49:17Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle stale
Mark this issue or PR as rotten with /lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

ardaguclu · 2022-12-08T15:04:27Z

/remove-lifecycle stale

If noone is willing to take it, I can work on that.

ardaguclu · 2022-12-16T10:18:22Z

/assign

R-Studio · 2023-07-20T07:46:34Z

Workaround for those using kubectl or oc before v1.27
You can use timeout before the kubectl wait or oc wait. For example with a timeout of max. 305s (the timeout of the timeout command should be a little larger then the timeout of the kubectl command):

timeout $((300+5)) kubectl wait --for=condition=Ready --all pod --timeout=300s

mgabeler-lee-6rs added the kind/bug Categorizes issue or PR as related to a bug. label May 25, 2022

k8s-ci-robot added the needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. label May 25, 2022

mgabeler-lee-6rs mentioned this issue May 25, 2022

The documentation of kubectl wait is a bit miss-leading #754

Closed

k8s-ci-robot added the sig/cli Categorizes an issue or PR as relevant to SIG CLI. label May 26, 2022

k8s-ci-robot added triage/accepted Indicates an issue or PR is ready to be actively worked on. and removed needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Jun 22, 2022

minherz mentioned this issue Oct 6, 2022

Terraform's kubectl wait --for=condition=ready pods ... hangs GoogleCloudPlatform/microservices-demo#1079

Closed

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 8, 2022

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 8, 2022

k8s-ci-robot assigned ardaguclu Dec 16, 2022

ardaguclu mentioned this issue Dec 19, 2022

kubectl wait: wire generic context kubernetes/kubernetes#114574

Merged

k8s-ci-robot closed this as completed in kubernetes/kubernetes#114574 Dec 19, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kubectl wait timeout argument is poorly documented and ill-suited to waiting on multiple resources #1219

kubectl wait timeout argument is poorly documented and ill-suited to waiting on multiple resources #1219

mgabeler-lee-6rs commented May 25, 2022

ardaguclu commented May 26, 2022

brianpursley commented Jun 14, 2022

mpuckett159 commented Jun 22, 2022

k8s-ci-robot commented Jun 22, 2022

mpuckett159 commented Jun 22, 2022

vitalyrychkov commented Sep 9, 2022

k8s-triage-robot commented Dec 8, 2022

ardaguclu commented Dec 8, 2022 •

edited

Loading

ardaguclu commented Dec 16, 2022

R-Studio commented Jul 20, 2023

kubectl wait timeout argument is poorly documented and ill-suited to waiting on multiple resources #1219

kubectl wait timeout argument is poorly documented and ill-suited to waiting on multiple resources #1219

Comments

mgabeler-lee-6rs commented May 25, 2022

ardaguclu commented May 26, 2022

brianpursley commented Jun 14, 2022

mpuckett159 commented Jun 22, 2022

k8s-ci-robot commented Jun 22, 2022

mpuckett159 commented Jun 22, 2022

vitalyrychkov commented Sep 9, 2022

k8s-triage-robot commented Dec 8, 2022

ardaguclu commented Dec 8, 2022 • edited Loading

ardaguclu commented Dec 16, 2022

R-Studio commented Jul 20, 2023

ardaguclu commented Dec 8, 2022 •

edited

Loading