docker service update with --detach=false hangs on services with 0 tasks #627

wsong · 2017-10-19T16:49:59Z

Description

If you have a global service that cannot be scheduled anywhere (e.g. its placement constraints are incorrect), its task list will be empty. If you try to update that service with --detach=false, the CLI will hang while "waiting for new tasks."

Steps to reproduce the issue:

docker service create --name testservice --mode global --constraint node.platform.os==notarealos busybox sleep 24h
docker service ps testservice (note that the task list is empty)
docker service update --detach=false --label-add foo=bar testservice

Describe the results you received:
The CLI hangs forever with overall progress: waiting for new tasks

Describe the results you expected:
The update would go through

The client I tested this on was 17.09.0-ce, but this is likely an issue on any CLI with the --detach=false option.

The text was updated successfully, but these errors were encountered:

dnephin · 2017-10-19T17:08:32Z

What happens when you run with --detach=true ? Does it update as you expect?

Since the update doesn't fix the constraint problem then the service is never fixed, right? So I think I would expect the CLI to just hang and wait for it to be fixed. We could maybe add a timeout.

wsong · 2017-10-19T17:14:16Z

With --detach=true, the update goes through immediately and everything works fine.

If the expectation is that docker service update only works on schedulable services, then this is working as expected, but that's quite unintuitive. You can imagine a case where a service has two invalid placement constraints (e.g. node.platform.os==exampleos and node.hostname=examplehost); its quite reasonable that a user might remove the first constraint, then want to check the service, then realize that they need to remove the second constraint, then issue a second service update call.

Additionally, this means that it's not really possible to update services that are supposed to run on nodes that are not yet part of the cluster (e.g. you deploy a Windows service, but you haven't added any Windows nodes yet) unless you remember to specify --detach=true.

dnephin · 2017-10-19T17:19:11Z

Are you sure the update isn't applied? The entire purpose of --detach=false is to wait for the service tasks to stabalize, but the update should be applied first, before it starts waiting.

What happens if you inspect the service while the cli is stuck in "overall progress: waiting for new tasks" ? I would expect that the update has been applied, it's just the CLI is waiting for the tasks to be in the running state. I think this is the correct behaviour.

If you want to update services that you know will not be in a running state then use --detach=true.

wsong · 2017-10-19T17:20:54Z

Oh sorry, yes, the update does get applied. It's just that the CLI hangs and for a user it's not clear why.

dnephin · 2017-10-19T17:23:37Z

That seems like the right behaviour. How do you think this can be improved?

Could we add more details to the message "overall progress: waiting for new tasks" to make it clear?

Maybe we could add a timeout and print a more helpful message when the timeout is hit?

wsong · 2017-10-19T17:26:28Z

Well, if a service has zero tasks before the update is applied, should we really wait for the tasks to get updated? Maybe in that case the service update should just return right away, even if --detach=false is specified.

I don't know what the original intention of that flag was, but the way I interpreted it was "block until the update has gone through on all existing tasks." What is the use case for blocking a service update forever on a service with zero tasks?

dnephin · 2017-10-19T17:32:53Z

if a service has zero tasks before the update is applied, should we really wait for the tasks to get updated?

Yes. For example when going from 0 to 1+ replicas I would expect there to be no tasks before the update, but I would still want to wait on the new tasks to be running.

on all existing tasks

It's not just existing tasks, because the update could change the number of tasks.

What is the use case for blocking a service update forever on a service with zero tasks?

Expecting a service to have 0 tasks doesn't really seem like a common use case. There's already a way of handling this rare case, which is to use --detach=true. Since there are no tasks, there is nothing to wait on. The update is already "done" when service update --detach=true returns.

wsong · 2017-10-19T17:39:31Z

If this is working as intended, then that's fine; my point is just that this is somewhat surprising for global services. Replicated services always have at least one Pending task, even if they are not schedulable. If global services are not schedulable, however, then their task list is empty. This means that a docker service update --detach=false call might hang forever or it might work, depending on the current state of your node list. That's fairly unintuitive for users (or at least, it was for me).

dnephin · 2017-11-02T20:47:59Z

Opened #665 to add a timeout. Closing this issue as I believe it's working as intended, and #663 is an issue for adding a timeout.

#104) * fix: docker service update with `--detach=false` hangs on services with 0 tasks read more here: docker/cli#627 * refactor: use $(...) notation instead of legacy backticks `...` * refactor: `tasks_num` variable renamed to `num_tasks` * refactor: use `-f` for filtering service by name

GordonTheTurtle added the area/swarm label Oct 19, 2017

dnephin added the kind/enhancement label Oct 19, 2017

dnephin closed this as completed Nov 2, 2017

fooflington mentioned this issue Feb 24, 2023

Service gets stuck when calling "docker service update" and won't progress containrrr/shepherd#97

Closed

AliRezaBeitari mentioned this issue Jun 23, 2023

fix: docker service update with --detach=false hangs on services wi… containrrr/shepherd#104

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docker service update with --detach=false hangs on services with 0 tasks #627

docker service update with --detach=false hangs on services with 0 tasks #627

wsong commented Oct 19, 2017

dnephin commented Oct 19, 2017

wsong commented Oct 19, 2017

dnephin commented Oct 19, 2017 •

edited

Loading

wsong commented Oct 19, 2017

dnephin commented Oct 19, 2017

wsong commented Oct 19, 2017

dnephin commented Oct 19, 2017

wsong commented Oct 19, 2017

dnephin commented Nov 2, 2017

docker service update with --detach=false hangs on services with 0 tasks #627

docker service update with --detach=false hangs on services with 0 tasks #627

Comments

wsong commented Oct 19, 2017

dnephin commented Oct 19, 2017

wsong commented Oct 19, 2017

dnephin commented Oct 19, 2017 • edited Loading

wsong commented Oct 19, 2017

dnephin commented Oct 19, 2017

wsong commented Oct 19, 2017

dnephin commented Oct 19, 2017

wsong commented Oct 19, 2017

dnephin commented Nov 2, 2017

dnephin commented Oct 19, 2017 •

edited

Loading