
The utility of creating a volume per node #1529

Closed

etoews opened this issue Dec 10, 2015 · 11 comments

Comments

@etoews
Contributor

etoews commented Dec 10, 2015

While researching issue #1528, I started to wonder about the practical utility of Swarm creating a volume per node when doing a docker volume create. I realize that a volume per node is the expected behavior, but what use case does that cover?

When I'm using Swarm, I want a place for my stuff. Doing a docker volume create seems ideal: it gives me a named volume where I can store persistent data (e.g. #1528). I can reference that volume later from other containers without worrying about orphaning it if any associated container gets removed. This seems to me to be the primary use case for volumes.

But when Swarm creates a volume per node using the default driver, it seems to eliminate that use case. Now I have X volumes (depending on how many nodes I have) and I have no idea where my stuff is. It could be in any one of those volumes, but I have no way of knowing which one, much less a way to reference that one directly even if I did.
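For concreteness, a minimal sketch of the behavior, assuming a three-node cluster (all addresses here are hypothetical):

```
# the create goes through the Swarm manager...
$ docker -H tcp://manager:3375 volume create --name mydata

# ...but every engine ends up with its own independent local volume
$ docker -H tcp://node-1:2375 volume ls    # shows local "mydata"
$ docker -H tcp://node-2:2375 volume ls    # shows local "mydata"
$ docker -H tcp://node-3:2375 volume ls    # shows local "mydata"
```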

It seems to me we'll need more control over docker volume create in order to be able to effectively utilize volumes in Swarm.

Thoughts?

@kacole2

kacole2 commented Dec 10, 2015

👍 for bringing this to light again. I don't see a use case for this either.

If you use a volume driver to talk to a persistent datastore "outside" of the host itself, the volume-per-node behavior means you end up creating multiple volumes on the storage endpoint.

We've found the only way around this is to run docker volume create --driver xyz against a single host in the Swarm cluster, with the CLI pointed at that engine rather than the Swarm master. After the volume is created, we can use another host that is pointed at the Swarm master to run the container and mount the volume. Here's a sample of the workflow: Lab IV: Persist a container's state and data
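A rough sketch of that workaround (the xyz driver name and the addresses are hypothetical):

```
# step 1: talk to one engine directly, bypassing the Swarm master
$ docker -H tcp://node-1:2375 volume create --driver xyz --name mydata

# step 2: schedule the container through the Swarm master and mount it
$ docker -H tcp://manager:3375 run -d -v mydata:/data myapp
```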

If this is the expected behavior from the machine's perspective, that's understandable. But I wouldn't consider it expected behavior from a user's perspective.

@clintkitson

In theory the create operation is supposed to be idempotent: no matter how many times the same volume name is requested, the backend storage platform is not affected. The reality is that if you have 30 hosts all requesting to create the same volume at once, you get a race to see which one actually creates it.

Different storage drivers may need multi-step workflows to create a volume. In the case of EC2, the workflow involves creating a volume and then assigning metadata. Idempotency is violated here: the create may proceed on all hosts because no volume with that name exists yet, but when it comes time to set the metadata, every host after the first one fails.
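To make the EC2 case concrete, a rough sketch of the two-step workflow using the AWS CLI for illustration (the size, zone, and volume ID are made up):

```
# every host checks for a volume carrying the requested name...
$ aws ec2 describe-volumes --filters Name=tag:Name,Values=mydata
# ...finds nothing, so every host proceeds to create one
$ aws ec2 create-volume --size 8 --availability-zone us-east-1a
# the name only exists once the metadata (a tag) is attached afterwards
$ aws ec2 create-tags --resources vol-0abc1234 --tags Key=Name,Value=mydata
# all N hosts can pass the existence check before any of them has tagged
# a volume, so the "idempotent" create actually runs N times
```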

cc @cpuguy83

@cpuguy83
Contributor

The reasoning for this is to make sure a container started with docker run specifying a named volume can be scheduled to any node without weird side effects (like expecting a rex-ray volume but actually getting a local volume due to implicit creation).

btw, I'm inclined to disable implicit volume creation on docker run when a volume name is specified. For instance, if a user does docker run -v somename:/foo and somename doesn't exist, error out instead of implicitly creating a volume named somename.
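For reference, the implicit behavior in question looks roughly like this today (output abbreviated):

```
# "somename" does not exist, yet the run succeeds and quietly creates it
$ docker run -v somename:/foo busybox true
$ docker volume ls
DRIVER              VOLUME NAME
local               somename
```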

@kacole2

kacole2 commented Dec 12, 2015

Personally, I would prefer that a volume is NOT created unless specified through the docker volume create command. One spelling mistake and you end up with a container and a volume that need to be killed and re-created.

@etoews
Contributor Author

etoews commented Dec 14, 2015

@cpuguy83 But let's look just a bit beyond that initial docker run.

It's a very common use case to separate your data volume from your service container. That way you can kill service version X, upgrade the service to version X+1, and still connect to the exact same data volume.
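The pattern looks something like this (image and volume names are hypothetical; the :1 and :2 tags stand in for versions X and X+1):

```
$ docker volume create --name appdata
$ docker run -d --name app -v appdata:/var/lib/app myservice:1   # version X
$ docker rm -f app                                               # retire X
$ docker run -d --name app -v appdata:/var/lib/app myservice:2   # X+1, same data
```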

However, if there are N volumes and you cannot uniquely identify them, it's impossible to connect service version X+1 to the exact same data volume that version X was connected to.

Also, I just confirmed that adding a new node to a Swarm cluster does not create existing volumes on it: a volume created earlier with docker volume create --name test does not appear on the new node. So even the initial docker run can still fail if the container gets scheduled to the node without that volume.
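This is easy to verify by querying the new node's engine directly (addresses are hypothetical):

```
# the volume was created via the Swarm manager before node-new joined
$ docker volume create --name test

# the freshly joined node has no trace of it
$ docker -H tcp://node-new:2375 volume ls
DRIVER              VOLUME NAME
# (empty: a container scheduled here with -v test:/data would get a
#  brand-new implicit local volume instead of the original data)
```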

@cpuguy83
Contributor

@everett-toews This is up to the volume driver being used.
Using the local driver you are of course going to be limited to local uses.
Working on enhancing the engine to actually query all registered drivers for volumes instead of assuming the volumes were created locally.
Here's the PR: moby/moby#16534

@glyph

glyph commented Dec 17, 2015

👍 This behavior was hugely confusing to me. I'm just coming up to speed on Swarm, but I still don't quite understand all the implications of --volumes-from, and I was hoping that docker volume create would make this simpler. Apparently not :).

@BrianAdams

Agree that this is confusing. Having the local driver create a volume local to the requesting node makes sense. It seems a reasonable limitation that you cannot schedule containers with local volumes on other nodes unless a volume with the same name has been created there by some other workflow.

Also agree that I would prefer an explicit volume create and a volume affinity flag instead of implicitly creating the volume during docker run.

@itzg

itzg commented Jan 19, 2016

As a long-time Docker user who tended not to use data containers because they felt clunky, I was pleased to see docker volume and to see that it "worked" on Swarm. Before @everett-toews pointed me here, I assumed that:

  • An explicit docker volume create was needed for every referenced named volume
  • When using the local volume driver, that would imply a dependency affinity like with --volumes-from=dependency, but in this case a node affinity rather than a container affinity

and those two assumed constraints seemed fine to me. What I gather from the discussion above is that neither assumption is correct, but would enforcing them help pin down the behavior?
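For what it's worth, classic Swarm already expresses scheduling hints as filter expressions, so the assumed node affinity might look something like this (container, node, and volume names are hypothetical):

```
# existing container affinity, analogous to --volumes-from
$ docker run -d -e affinity:container==db --volumes-from db myapp

# the assumed node affinity for a local named volume
$ docker run -d -e constraint:node==node-1 -v mydata:/data myapp
```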

@gittycat

@itzg Yes, data containers are clunky, but the alternative (duplicated volumes) is a can of worms: you can't tell where your data is or what state of sync it's in.

@nishanttotla
Contributor

Closing due to lack of activity. Please reopen if you wish to continue discussing it.
