Rolling update causes "out of sequence" errors for the client. #1379

tonistiigi · 2016-08-16T17:50:22Z

Rolling update internally changes properties of the service, causing the version to increment. This means that a user can randomly get "out of sequence" errors. The solution should be to have a separate version for the spec part only, so that it covers only the properties of the service that a user can change.
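A toy model may make the failure mode easier to see. This is only a sketch, assuming a single version counter shared by the whole service object; the `Store`, `Service`, and `OutOfSequenceError` names are hypothetical and are not SwarmKit code.

```python
from dataclasses import dataclass


class OutOfSequenceError(Exception):
    """Raised when an update carries a stale version index."""


@dataclass
class Service:
    spec: dict
    update_status: str = ""
    version: int = 0  # one counter for the whole object (assumption)


class Store:
    def __init__(self, service: Service) -> None:
        self.service = service

    def set_update_status(self, status: str) -> None:
        # Internal write, as an orchestrator might make during a rolling update.
        self.service.update_status = status
        self.service.version += 1

    def update_spec(self, new_spec: dict, seen_version: int) -> None:
        # User write: accepted only if the caller saw the latest version.
        if seen_version != self.service.version:
            raise OutOfSequenceError("update out of sequence")
        self.service.spec = new_spec
        self.service.version += 1


store = Store(Service(spec={"image": "web:1"}))
seen = store.service.version         # client reads the service
store.set_update_status("updating")  # internal write lands in between
try:
    store.update_spec({"image": "web:2"}, seen)
    result = "accepted"
except OutOfSequenceError as err:
    result = str(err)
print(result)
```

In this model the client's spec change is rejected even though nothing the client can set was modified in between.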

This issue is causing TestSwarmPublishAdd to fail in Docker CI.

cc @aaronlehmann


aaronlehmann · 2016-08-16T17:51:04Z

ping @aluzzardi

stevvooe · 2016-08-16T19:02:46Z

This is a feature. Clients that use zk or etcd have to deal with this, as well.

This allows the user to ensure that decisions are made on up to date data.

We can probably add retries if we don't like this behavior.
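A client-side retry could look roughly like the sketch below. It assumes the caller supplies three hypothetical callables (`fetch`, `apply_change`, `submit`) and that a stale version surfaces as a distinguishable error; none of these names come from Docker or SwarmKit.

```python
import random
import time
from typing import Callable, Tuple


class OutOfSequenceError(Exception):
    """Stand-in for the daemon's "update out of sequence" error."""


def update_with_retry(
    fetch: Callable[[], Tuple[dict, int]],
    apply_change: Callable[[dict], dict],
    submit: Callable[[dict, int], None],
    attempts: int = 5,
    base_delay: float = 0.1,
) -> int:
    """Re-read, re-apply, and resubmit until the version check passes.

    Returns the number of attempts used.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        spec, version = fetch()  # always start from fresh data
        try:
            submit(apply_change(spec), version)
            return attempt
        except OutOfSequenceError:
            if attempt == attempts:
                raise
            # Jittered exponential backoff before the next attempt.
            time.sleep(base_delay * (2 ** (attempt - 1)) * random.random())
    raise AssertionError("unreachable")


# Fake backend: two internal writes race with the first two submissions.
state = {"spec": {"image": "web:1"}, "version": 7, "bumps_left": 2}


def fetch() -> Tuple[dict, int]:
    return dict(state["spec"]), state["version"]


def submit(spec: dict, version: int) -> None:
    if state["bumps_left"] > 0:
        state["bumps_left"] -= 1
        state["version"] += 1  # simulated internal change
    if version != state["version"]:
        raise OutOfSequenceError("update out of sequence")
    state["spec"] = spec
    state["version"] += 1


used = update_with_retry(
    fetch, lambda s: {**s, "image": "web:2"}, submit, base_delay=0.0
)
print(used, state["spec"], state["version"])
```

The change is re-applied to freshly fetched data on each attempt, so the version check still guards against overwriting someone else's edit.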

tonistiigi · 2016-08-16T19:15:04Z

@stevvooe "up to date data" is an arbitrary concept. Some users may want to make a decision based on the number of tasks running, the network used by the service changing, etc. I don't see why we should pick some random fields (UpdateStatus in this case) and give the user a way to depend on them, when these fields are in no way even visible to our client during an update.

The versioning is very useful for avoiding collisions between user requests and making sure that the update matches the user's intention. I'm just suggesting that we limit that to the fields that the user can actually control.
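The suggestion can be sketched as a second counter that only spec writes advance. This is an illustration under assumptions, not an existing SwarmKit design: the `spec_version` field and the helper functions are hypothetical.

```python
from dataclasses import dataclass


class OutOfSequenceError(Exception):
    """Raised when a spec update carries a stale spec version."""


@dataclass
class Service:
    spec: dict
    update_status: str = ""
    version: int = 0       # bumped by every write
    spec_version: int = 0  # hypothetical: bumped only when the spec changes


def set_update_status(svc: Service, status: str) -> None:
    # Internal write: touches the object version but not the spec version.
    svc.update_status = status
    svc.version += 1


def update_spec(svc: Service, new_spec: dict, seen_spec_version: int) -> None:
    # User write: checked only against other spec writes.
    if seen_spec_version != svc.spec_version:
        raise OutOfSequenceError("update out of sequence")
    svc.spec = new_spec
    svc.version += 1
    svc.spec_version += 1


svc = Service(spec={"image": "web:1"})
seen = svc.spec_version
set_update_status(svc, "updating")           # no longer invalidates `seen`
update_spec(svc, {"image": "web:2"}, seen)   # accepted
try:
    update_spec(svc, {"image": "web:3"}, seen)  # stale: the spec has changed
    second = "accepted"
except OutOfSequenceError:
    second = "rejected"
print(svc.spec, second)
```

Two user requests based on the same spec still collide, while internal status writes no longer reject a user's update.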

stevvooe · 2016-08-16T19:38:10Z

@tonistiigi Anything that is observable by the user is a possible change in the data dependency, meaning that the user may have made a decision on it.

Breaking this guarantee will have untold effects on the system. If we want there to be different versioning semantics, we need to have separate objects. That is the data model. When we break the data model, we break the abstraction and make the system harder to reason about.

If we were building a consistent key value store, this wouldn't even be a question. It is wrong to apply updates to an object that are made with out of date data, and usually APIs require something explicit to get around that style of guarantee.

If we bypass this, we may as well get rid of raft, as it is a waste.

aaronlehmann · 2016-08-16T20:04:42Z

@stevvooe: I really don't understand your perspective. The user can only change the spec. Why would changes to the spec be informed by real-time progress of a rolling update? How would that even be possible with the current CLI design, since there's no check/set primitive? (docker service update doesn't give you a way to only update if the rolling update is in a certain state, and I don't see us adding that).

You're saying it's okay for updates to separate objects to have different versioning semantics, but why do you draw the line there instead of drawing it between the spec and the observed quantities? The ServiceUpdate control API only takes a service spec, not a whole service. It feels strange to me that the versioning for the broader service object comes into play for this.

stevvooe · 2016-08-16T21:30:42Z

We basically already have a check/set primitive with versioning.

The user modified fields are in the Spec and the result of that is in the surrounding object, which is versioned as a unit. This is the data model. We don't version sub-components of an object. Introducing this to services will just create an inconsistency in the API that we'll never return from.

And let's not think about the "current design". Let's actually think about the future and avoid introducing more and more micro-complexities. Just because we're not doing something today, doesn't mean we should sacrifice the properties of our data model. This solution just introduces more technical debt that others have to code around in the future.

If we want to version a separate component, let's make a separate component. I don't see what is so hard about that.

sylvainmouquet · 2017-09-26T09:45:43Z

The error still occurs today with Docker version 17.07.0-ce

$ docker version

Client:
 Version:      17.07.0-ce
 API version:  1.31
 Go version:   go1.8.3
 Git commit:   8784753
 Built:        Tue Aug 29 17:42:53 2017
 OS/Arch:      linux/amd64

Server:
 Version:      17.07.0-ce
 API version:  1.31 (minimum version 1.12)
 Go version:   go1.8.3
 Git commit:   8784753
 Built:        Tue Aug 29 17:41:43 2017
 OS/Arch:      linux/amd64
 Experimental: false

vce-xx · 2018-04-01T17:02:29Z

@stevvooe What do you mean? Shouldn't this be fixed?

"This is a feature."

bbotte · 2020-03-11T07:53:15Z

I use swarm with Docker Server Version 19.03.6. When I update a service with
docker service update --with-registry-auth service_name --image images_http_address
I get: Error response from daemon: rpc error: code = Unknown desc = update out of sequence

Will this be fixed?

matodrobec · 2020-04-15T17:02:10Z

Hello,

I use swarm and Docker version 19.03.8. When I update a stack with

docker stack deploy --with-registry-auth -c ./web.yml api-dev

I am getting:

 Updating service api-dev_webserver (id: zz5veiexnvsgleroomr2snecy)
 Updating service api-dev_data (id: v4hoc42ajz5jv17rsf7uwlcgj)
 failed to update service api-dev_data: Error response from daemon: rpc error: code = Unknown desc = update out of sequence

mhemrg · 2020-05-08T10:07:38Z

I'm using Docker Swarm's API to update services and sometimes I'm facing this issue. Is it safe to retry the operation with a new index key when I get an update out of sequence error?

trajano · 2020-05-11T19:20:36Z

I am wondering if K8S has the same issue https://stackoverflow.com/questions/61737609/does-kubernetes-suffer-from-the-update-out-of-sequence-errors-that-docker-swar

trajano · 2020-05-11T19:24:23Z

I think these sorts of synchronization issues would have been solved by traditional database systems or even Kafka. I wonder if a traditional database, or Kafka for scale, could be used for the state backend. That would at least move the responsibility out of SwarmKit.

tonistiigi mentioned this issue Aug 16, 2016

Add retry checks to TestSwarmPublishAdd moby/moby#25765

Merged

aaronlehmann mentioned this issue Aug 16, 2016

Rolling update failure thresholds and rollback #1380

Merged

aaronlehmann mentioned this issue Aug 18, 2016

Versioned specs #1392

Closed

aaronlehmann mentioned this issue Feb 7, 2017

Service update may fail with "update out of sequence" error moby/moby#30794

Closed
