Docker Clustering: Design proposal. #8859

aluzzardi · 2014-10-30T15:51:02Z

The goal and scope of this change is to allow docker to manage a cluster
of docker hosts.

The target audience for clustering is modeled after Docker itself:
developer and devops first then enterprise later. Enterprise requires
additional features such as authentication, ACLs, auditing and tooling
which will come at a later time.

The system is designed to handle workloads in the same way as Docker: it
can run both long-running and one-off tasks. Batch processing can be
built on top of the API using the one-off primitive.

The architecture is based on an evented model where the master queries
the registered slaves within the system and checks the current state of
the slaves against the requested state by the user. It reconciles any
differences and updates the cluster state with the required changes.

Signed-off-by: Andrea Luzzardi aluzzardi@gmail.com
Signed-off-by: Victor Vieux vieux@docker.com

@aluzzardi

Authors: @aluzzardi and @vieux. The goal and scope of this change is to allow docker to manage a cluster of docker hosts. It is designed to scale to ~100 machines in a master/slave architecture. When you have more than this amount of machines are have different requirements and are probably implementing your own infrastructure based software to deal with your specific challenges. The target audience for clustering is modeled after Docker itself: developer and devops first then enterprise later. Enterprise requires additional features such as authentication, ACLs, auditing and tooling which will come at a later time. The system is designed to handle workloads in the same way as Docker: it can run both long-running and one-off tasks. Batch processing can be built on top of the API using the one-off primitive. The architecture is based on an evented model where the master queries the registered slaves within the system and checks the current state of the slaves against the requested state by the user. It reconciles any differences and updates the cluster state with the required changes. Signed-off-by: Andrea Luzzardi <aluzzardi@gmail.com> Signed-off-by: Victor Vieux <vieux@docker.com>

LK4D4 · 2014-10-30T18:17:32Z

Probably here should be described some technical details. Like how master <-> slave relationship is organized, how master knows about new slaves etc.

jessfraz · 2014-10-30T19:13:57Z

docs/sources/userguide/cluster.md

+
+    $ docker -d --master --discovery https://discovery.hub.docker.com/u/demo-user/cluster-1
+
+Eventually, clustering will provide a built-in leader election algorithm making the `--master` flag obsolete.


Would this use a gossip protocol to assign a new master, probably out of scope for the initial proposal just wondering

I assume the leader election would be through an implementation of Raft as mentioned earlier today? I thin a gossip protocol like Serf might be useful for failure detection and state distribution to route important messages around failed nodes or network partitions.

-g

On Oct 30, 2014, at 3:14 PM, Jessie Frazelle notifications@github.com wrote:

In docs/sources/userguide/cluster.md:

$ docker -d --discovery https://discovery.hub.docker.com/u/demo-user/cluster-1

+In the future, other discovery mechanism will be provided that will not depend on the Hub.
+
+## Master
+
+In order to ensure consistency within the cluster one of your Docker Engines
+will need to be promoted to master within the cluster. If you are using
+the Hub's discovery service you will be able to promote any of your registered
+nodes to a master from the web interface. You can also statically assign one
+of the Docker Engines with the --master flag.
+

$ docker -d --master --discovery https://discovery.hub.docker.com/u/demo-user/cluster-1

+Eventually, clustering will provide a built-in leader election algorithm making the --master flag obsolete.
Would this use a gossip protocol to assign a new master, probably out of scope for the initial proposal just wondering

—
Reply to this email directly or view it on GitHub https://github.com/docker/docker/pull/8859/files#r19629151.

Please don't require a gossip protocol. Given that we have a discovery service, there is also no need. Once all nodes connected there, it knows about all nodes. I would try to avoid gossip protocols if possible because it's complex to reason about. A single master (or single discovery endpoint) is much simpler.

What are the available options for allowing fault tolerance in cluster ?(If master goes down.. one of the nodes has to be picked and given privileges). I want to use Docker swarm orchestration technology and docker swarm doesn't provide any fault tolerance.

thaJeztah · 2014-10-30T21:48:15Z

Interesting proposal. I must admit I have no experience with cluster management, but decided to give my thoughts anyway (hope I haven't made a complete fool of myself 😄)

Some additional thoughts;

how does this play together with the container groups proposal (#8637)? Will it be able to distribute groups across a cluster?

My second thought is harder to answer; is this something that should be implemented by Docker itself? The Docker Eco-system / community already has invested a lot in building solutions like Flocker and CoreOS Fleet (among others). Although no one solution will "fit all", I wonder if the docker community will appreciate it if docker itself is going to compete with them.

Just my thoughts, wdyt?

mkb · 2014-10-30T21:52:39Z

Does this need to be part of the main Docker project? Orchestration is a whole other problem beyond container management. I'd rather see these implemented in a parallel project so Docker remains simple and Docker users can choose between Docker orchestration and the other available options.

jessfraz · 2014-10-30T22:14:43Z

@mkb do you have reasons why not? I think this is so seamlessly integrated into the current docker cli commands/functionality, that I can't think of a reason why it shouldn't, but I would be curious to know.

mkb · 2014-10-30T22:26:39Z

@jfrazelle It comes down to style, really. I usually favor the Unix approach of small, composable tools that do one thing well. The less a tool does, the easier it is for new users (or new maintainers) to ramp up.

From purely personal standpoint, I love the idea of having the option of using Docker's cluster management but wouldn't want to be obligated to use those tools over some other.

To be clear, I'm not saying any of this is a bad idea at all. I'm just suggesting it might be better in a separate project.

cpuguy83 · 2014-10-30T22:32:35Z

@mkb did you see the demo?

thaJeztah · 2014-10-30T22:34:46Z

@jfrazelle I think it depends on how you look at Docker; is Docker the "end product" or a "framework" to build an end-product on? Currently, it seems to be both; it's usable out of the box to get working with containers (end product), but acts as a "framework/library" backing other end-products (as mentioned in my previous comment).

Having cluster management out of the box certainly is convenient, the question is; will that make other solutions obsolete? Will cluster management offered by docker itself be just "bare bones" and will other solutions still be required for more advanced needs?

And (to summarise my previous comment); is the goal of Docker to build an Eco-system "around it", or to be the Eco-system (for better words).

I don't know the answer to that, and don't know if these are "mutually exclusive".

No bad feelings, just thinking out loud here.

thaJeztah · 2014-10-30T22:36:15Z

@cpuguy83 where? Is it viewable online?

vieux · 2014-10-30T22:55:57Z

@thaJeztah the demo will be posted soon on the proposal.

thaJeztah · 2014-10-30T22:59:32Z

@vieux thanks! I'll watch this ticket, quite interested to see it.

asbjornenge · 2014-10-30T23:19:42Z

@vieux @aluzzardi really top notch stuff!

You mentioned in your demo that the cluster will diff the current running state with the new state and, I assume, create some sort of execution plan. Would it be out of scope for this system to handle transitioning between more complex states? So that I can describe (or dump?) some desired configuration and easily return to it?

@thaJeztah you can watch it here https://docker.com/community/globalhackday about 1 hour in.

vieux · 2014-10-31T00:25:22Z

Hi everyone, thank you for your feedback,

I added the video from the Docker Global Hack Day in the proposal.

lukemarsden · 2014-10-31T02:39:03Z

Awesome work @vieux & @aluzzardi. I'm interested to see the code for the demo, is that available on GitHub somewhere? In particular, did you modify the Docker API at all to support multi-host?

SamSaffron · 2014-10-31T04:26:29Z

how can you discuss this stuff in a pull request, this is madness

SamSaffron · 2014-10-31T04:32:55Z

The big problem with clustering is that getting something basic up is trivial, getting something correct up is fiendishly complex required leader election, raft like protocols, dealing with partitions and so on.

I feel this is biting off way more than what the building blocks of docker should provide, if this is a direction Docker wants to take then the first step should be adding the building blocks into the infrastructure (eg. bundling etcd or something), but I am not convinced this is Dockers role.

andreaturli · 2014-10-31T14:47:35Z

what kind of inter-containers connectivity your solution offers? If the scheduler decides to place my container1 in host1 and container2 in host2, are they part of the same subnet?

bfirsh · 2014-10-31T15:07:19Z

docs/sources/userguide/cluster.md

+
+To add a new or existing Docker
+Engine to your newly created cluster use the provided URL from the Hub
+with the `--discovery` flag when you start Docker in daemon mode.


It's not immediately clear that --discovery has anything to do with clustering. How about --cluster? It then makes it really clear that you're enabling clustering mode. The URL seems to be the unique identifier for a cluster, so this makes sense semantically.

+1 for --cluster

bfirsh · 2014-10-31T15:19:48Z

How are images managed across multiple nodes? E.g.:

What does docker images output? Does it say what nodes the images are on, or is the list of images global to the cluster?
If I do docker build -t myimage . can I then do docker run myimage without having to think about what node the build ran on?
How can I make sure that the redis running on one node is the same as the redis running on another node?

... and stuff like that.

discordianfish · 2014-11-11T19:59:14Z

@brendandburns Got it, right. I agree that we need some atomic scheduling unit for such things and, given that we advise people to run a container per service, this can't be containers. So whether they are called pods but I'm all up for that.

shykes · 2014-11-12T01:57:51Z

@brendandburns we discussed the topic of "declarative vs imperative" in today's live review. (aka "Docker tuesday" :).

Here is my conclusion:

Yes, any sound clustering solution must 1) distinguish "wanted state" from "actual state", 2) expose both to the user, 3) allow users to change wanted state, and 4) implement a mechanism for resolving differences.
This proposal is no exception, and clearly implements all 4 points above. As for how well this is designed and implemented, I trust the instincts of @aluzzardi and @vieux since they have the combined operational experience of Google SRE, Dotcloud platform engineering, and Microsoft Bing. All 3 operated at large scale (although different axis of scale for each), and each with different set of requirements. We are talking about people who know how the real world works, so let's give them the benefit of the doubt on implementation.
How to implement point 3 ("allow users to change wanted state") is a discussion of UI and design philosophy. It has nothing to do with "right" or "wrong" way to do clustering. One way is to edit a yaml file describing the entire wanted state, upload it in bulk, and let the tool infer what changed (this is how Kubernetes and Dotcloud do it). Another way is to specify discrete changes to the wanted state - "add this", "remove that", "change this value", etc. (this is how Docker does it, and this proposal preserves that behavior). Again: this is a UI decision, and does not impact the underlying architecture: in both cases, "the operational responsibility for keeping my application running is a part of the scheduling system, rather than part of a layer I have to build on top of the scheduling system". So your requirement is met.

TLDR: we are in violent agreement on underlying architecture and separation of concerns. What is left is a matter of bikeshedding^W UI preference.

jbeda · 2014-11-12T04:27:41Z

@shykes Thanks for writing this up.

Totally agree on your 1-4 points above. Based on experiences and mistakes in the past we explicitly call out desired vs. actual state in the API. I would say that if you API does handle desired vs. actual state you are building a declarative API.

Pure RESTful vs. custom verbs

However, I do think that the way that you modify the desired state is more than just a UI difference. There are pros and cons to each method. When thinking about this in REST terms, the question is if you support modifications via replacing the resource or via custom verbs. In the GCE API, I went with the custom verb route and I'm coming to think that it was a mistake due the sheer number of verbs.

It is important, I think, to differentiate the UI to the user vs. how this is modeled in the API. Just because you model the API as more RESTful with "full update" over custom verbs doesn't mean that you can't have affordances in your UX for incremental updates. The full update model doesn't require or imply YAML/JSON files. You can have a config file based system calling an imperative API (and doing reconciliation client side) and a imperative experience on top of a declarative system.

Error Handling

One thing that is worth calling out (and does relate to this proposal) is how you handle errors. In a truely declarative system failure isn't as clear cut as it might be on a single node. Specifically, if I try an schedule a container and there isn't enough capacity in the cluster, what is the appropriate action -- you could either fail immediately or you could accept the request and fail to converge desired state and actual state. The proposal above suggests that this would error immediately. But what if you can add capacity to the cluster? Shouldn't the desire to run the container be recorded immediately?

This situation is equivalent in many ways to having a container running and then having the cluster shrink due to hardware failure (or admin action). In that case you may now no longer have enough capacity to run all of the work that is desired. Some containers will go into a 'pending' state. Why is it possible to go pending due to cluster shrinkage but not when the cluster isn't large enough?

I think the desire to be able to return success or failure for an operation drives the need, in many ways, to have custom verbs as the update method. If this is really desired, one solution that we've considered is that the user would submit a new version of the resource and the API would return a set of operations that can succeed or fail (with some time bound, etc). That way the user can slam a new version in and the system breaks it down to the things that can succeed or fail.

Serial configuration vs. Settling

One thing that comes out of a multi-resource declarative system is that order of object doesn't matter nearly as much. This is something that I think we got wrong in places in the GCE API. If you do this right, you can reference an object before it is ready -- or even created. You can specify resources out of order and let the system settle as things become configured. This doesn't matter as much on a single node as most operations there are relatively time bounded. But with larger and larger systems (some of which may include a ticket to the network ops team) latencies increase and it is easier to use if you don't have to run an outside workflow/state machine to achieve a result. I can give more examples here if you like.

Hopefully this gives you some food for thought. We are building Kubernetes with this stuff in mind.

inthecloud247 · 2014-11-12T07:05:09Z

Thought it would be useful to add info about current relevant and really interesting work with container groups and stack composition by @aanand @crosbymichael and @nathanleclaire . Saw this on the docker-dev list:

"Hi all. I've been working on two new features: container grouping (docker groups)
and stack composition (docker up). Together, they will eventually form a complete
replacement for Fig in Docker itself."

https://github.com/aanand/docker/compare/composition
https://gist.github.com/aanand/9e7ac7185ffd64c1a91a

https://groups.google.com/forum/#!topic/docker-dev/th3yKNKbCWM

I think it's incredibly exciting to have so many different options for orchestration and service discovery. Before Docker, how much innovation and excitement was there in either field? And how many options have popped up out of the woodwork over the last few months. It's amazing to be able to choose among so many solid and creative projects. DNS-based service discovery ftw! And omg (!!!) even now some new alerting/logging systems after decades of stagnation.

But I think it's simply too early to start choosing the winners here and integrating these features into core. IMO, until the plugin system is ready, these features either need to be implemented as separate utilities or wrappers, and even then should tread carefully to not upset the developer ecosystem that has sprung up around Docker. There was a certain chilling affect I noticed after Fig was re-released as the 'official' docker way to run multiple containers on a host. Fig is cool, but some of the other competing systems were also great.

timothysc · 2014-11-12T21:22:23Z

@jbeda I chatted with @vieux and @aluzzardi yesterday, and I wrote the following notes. I'll let them add color, or greater clarity, where they think it's needed.

Clustering will be done via a plugin whose API is TBD but due out later...
The Docker CLI is meant to be the primary interface to support Docker clustering, and the plugin will extend the CLI via options (TBD). Ideally we'll rally around both the interface and options to provide consistency across different backends. e.g. --constraints=TBD-fubar
Kubernetes, Mesos, or possibly Marathon, could be added as a back end via the plugin API.
Any concepts, such as 1st classing PODs is a docker-core constraint. But imho it should be a added otherwise I believe clustering will become a mess.

I would hope to see as much common ground and re-usability "where possible", such that we can leverage linus's law, "given enough eyeballs, all bugs are shallow". For example:

Do we need to have two separate master worker implementations?
Do we need to have two label/tag/attribute systems?
Do we need to have two separate monitoring schemes?
Don't get me started on constraints (╯°□°）╯︵ ┻━┻
The list goes on...

The models atop could support different use cases, whether they are imperative or declarative depends on the use cases.

I trust @vieux and @aluzzardi, as we travel in similar circles, and I look forward to working together to make sure we can support multiple models with a common core.

kelseyhightower · 2014-11-12T23:32:25Z

I'm loving this open discussion around Docker clustering. After reading the proposal and watching the design review on YouTube, I really think etcd can offer a lot of value in the following ways:

Cluster Bootstrapping

The etcd discovery protocol can be used for cluster bootstrapping. Based on the desired user experience I think the discovery protocol is a perfect fit. Docker, Inc can host their own public discovery service dedicated to Docker clustering. Since the discovery service has an HTTP interface, auth can easily be layered on top at anytime.

Master election and Cluster state management

This is something we have been doing for a long time and heavily used in etcd, fleet, and flannel. Ideally this proposal can leverage the same stuff and focus on the Docker UI.

Open to collaboration

We are happy to discuss the usage/bundling of etcd or help reusing our raft implementation. We moved away from goraft, but we have made our new raft implementation available as a standalone package.

import "github.com/coreos/etcd/raft"

See the docs for more details.

LK4D4 · 2014-11-12T23:43:40Z

Using raft in docker would be nice I think. Also this is interesting thing to do.

brendandburns · 2014-11-13T05:24:58Z

@shykes I wanted to clarify my concerns with this proposal into a concise comment, since as you say, the declarative vs. imperative thing is somewhat of a distraction (although an important one) from the larger concerns around the proposal.

Concern 1: Atomicity

As it is currently stated, the atomic unit for clustering is a single Docker container. This is insufficient, as many real world applications consist of multiple groups of containers that must be co-scheduled to operate correctly. The easiest example of this from the Docker world is a data container and the app container. In the current approach it is easy to image examples where an attempt to serially schedule the data container, and then the app container in two different API calls will result in a failure to schedule.

For example, consider a data container that takes up 1G of disk, and an app container that takes up 0.5G of disk. I have two machines:

M1: 1.25G of disk free
M2: 10G of disk free

Given the current proposal, you would issue two separate commands. First
docker run -d my/data-container

Let's say that this schedules the data container onto M1

Now you issue a second command:
docker run -d --volumes-from data-container my/app-container

This container is unschedulable, since it is required to schedule onto the same machine as the data container (M1) but there is insufficient space to schedule there.

The correct answer of course is to schedule both containers onto M2, but unless you treat the two containers as an atomic unit, you will never be able to achieve that.

Concern 2: A single API that mixes concepts between node and cluster

The existing Docker API is focused on a single machine (aka a node). In trying to keep the same api between node and cluster, you are going to introduce extra fields and concepts into the API (e.g. the "NODE" field in the UX suggested in the proposal, or the scheduling constraints) that are irrelevant or unused in either the clustering side of the API or the node side of the API. You are effectively mashing two APIs together, because there is some degree of overlap. A better approach is to extract the type that is common (basically the container [or hopefully pod] definition) and introduce two different APIs, one that is focused on the node, and one that is focused on the cluster.

Concern 3: A lack of support for Labels or tags

If we take containers as the lowest primitive, people are going to want to build higher level primitives on top of those individual containers. Examples of this include replica sets, or load balancing groups.

In Kubernetes we have used labels and label selectors (queries) extensively to concisely express sets of containers/pods that make up a replica set or a load balancing group.

If you don't have something like a label for your containers people are either forced to

encode labels into the name of the container (super hacky)
maintain a separate parallel set of data structures for each container (ugly, hard to sync)
keep explicit lists of containers by name (hard to self-heal via introspecting the API)

Concern 4: An insufficient health model

The extent to which Docker currently monitors the health of a container is by ensuring that the process is running, but there are lots of applications failures that are due to server deadlock that are only visible if you do application level (for example HTTP) health checking. If these kind of health checks aren't part of the low level API, then users are going to again, be forced to create their own set of api objects to drive these kind of concepts down to the node.

Concern 5: An inability to update existing containers

The Docker API as it currently stands lacks the ability to update any aspect of the container (#6323). This is a large part about what I mean when I say that it is an imperative API, if I want to (for example) adjust the Memory that is available to the process, I have to kill the existing container, and restart a new container with the higher memory limit. This extends to other meta-information, like labels (if they existed) where it is extremely useful to be able to update a container without killing and restarting it. While dynamic updates are useful on the node, they become increasingly important in a scheduling system as you want to be able to dynamically adjust container limits to match actual usage, so that you can pack more containers onto any individual machine.

Those are my highest level concerns about the existing proposal. There are a few other smaller scale things...

I think that adopting Pods for scheduling (#8781) and adding labels would go a long way towards making this proposal more usable, but I would second @jbeda 's request for a clean delineation in the API objects between the desired state and the current state as well, and I truly believe that it's better to extract out the common objects and build a separate clustering API, rather than trying to smash it into the existing Docker API.

bfirsh · 2014-11-13T14:26:25Z

In case anyone missed it, here's the design review mentioned in a few comments above: https://www.youtube.com/watch?v=4etqZ4ghZus

johngossman · 2014-11-14T14:31:59Z

@brendandburns +1 to your comment about atomicity. And as you all know, these scheduling constraints get really hard...think putting up hundreds or thousands of containers (vms) in some sort of HA configuration with all the affinity and disaffinity rules. You need to know the final state of the dependency graph in order to be able to solve the constraint problem. Or at the minimum, you need to know when the end is coming because you can't resolve the system every time it changes, so you need transactions (and the simplest transaction is to provide the whole model in one gulp). You can still have the "incremental" edits to the model, which is what I think people mean when they say imperative, but you also need a "bulk" edit mode.

The above is all theoretical for a simple scheduler that can't solve complex constraints anyway...but I believe part of the proposal is to allow plugin schedulers, placement algorithsm etc. Some of these will not work without transactions.

johngossman · 2014-11-14T15:12:15Z

@kelseyhightower As much as I like etcd, I hope discovery, leader election and state management are pluggable in Docker. As much as I like raft, I hope the team doesn't try to reimplement these features starting at that level. Batteries and all that...

Though I should add, as much I like plugins, I like getting something running and then refactoring it.

glynd · 2014-11-17T15:53:41Z

The main point I'd liked to add is that having a master/standby solution assumed is not necessarily the best approach - for scale, geographic reach or reliability. I can understand the desire to do this in an initial v1 version, but it would be a good plan for any APIs or similar to be designed to cater for multiple coordinators being configured.

In a similar vein, it would be useful if a Docker node could be asked "please do this if your state is currently as I think it is" - ie conditional launches/shutdowns. This would also aid towards a distributed approach.

inthecloud247 · 2014-11-17T17:00:59Z

@bfirsh thanks for the youtube link. Didn't know about the recorded design review sessions... interesting.

inthecloud247 · 2014-11-17T17:22:17Z

If there's a built-in link to docker-hub, I hope it's possible to specify a new default url using a command-line option / config file variable / environment variable. Inevitably, Enterprise developers using Docker will start storing cluster metadata on dockerhub, and it'd be important for devop/IT orgs to manage enterprise-wide docker usage through management of default configuration values. It'd be simple to use boxen/chef/ansible/salt to push out and manage safe default values.

I know there was discussion about allowing federation/mirroring of the docker index to help increase availability. Are there similar plans for ensuring availability of this centralized docker cluster service? One simple solution would be to add this this functionality to the docker registry project https://github.com/docker/docker-registry to allow self-hosting.

discordianfish · 2014-11-17T18:10:19Z

@glynd I don't think this should be scope of Docker clustering: As soon as your cluster is so large you need to scale the master/coordinator, you should create multiple clusters per (availability) zone with individual masters. This seems to be in practice the more robust approach. At that scale, you would probably built something multi-cluster deployment on top of Docker.

glynd · 2014-11-18T11:20:12Z

@discordianfish If you go down the approach of a cluster per AZ you then have to have something to manage the different AZs. What happens if an AZ (or DC) goes down and you need to increase your capacity at other sites. If you don't have a global view on such things you can't act.

I'm saying the APIs and command lines should allow for this from the start, and take it into account in how the tooling is configured / used. Not that it is actually implemented like that from the start.

Then if someone does build a cluster approach they can easily layer it on, and use the same docker tooling and APIs - as well as any other 3rd party tools which have been written with the docker APIs in mind.

discordianfish · 2014-11-18T13:02:14Z

@glynd Right, you need some management on top of your clusters. I argue this is so site specific (and complex), that it's not reasonable to address this by Docker itself. At some point we might revisit that, but I think accounting for those use cases right now just drives up complexity for a sane first implementation.

glynd · 2014-11-18T13:21:08Z

@discordianfish Does it? On the API / tooling side it can be as simple as allowing more than one server to be configured instead of just taking a single hostname. That and how your server responds under partial success (i.e. the comments around Atomicity above when stretched to multiple servers.)

sthulb · 2014-11-26T11:47:59Z

@aluzzardi @vieux, how would one run non-deamonised containers through the cluster? I noticed that all the examples show daemonised containers. Would a non-daemonised container proxy stdout to the master or which ever host started the container?

dbason · 2014-12-01T21:05:28Z

+1 to @titanous ' issues. To me this seems to break separation of concerns. I don't have an issue with Docker developing a cross server solution, I do however have a problem with this being integrated into the containerization engine. I like docker because it is a simple building block and I feel this takes away from that. If we want cross host options it should be a choice what we use and how we implement that.

dbason · 2014-12-02T21:49:56Z

To expand on this here's my use case:
I'm currently using docker with one of the schedulers out there in community. I don't want to use Docker Hub for host registration/discovery inside of docker. I don't want to plug something else in because this is already implemented in the scheduler I'm running over the top. Will I be able to turn all of that off and just run docker in a standalone fashion as it is now?

shykes · 2014-12-02T21:50:54Z

Yes.

On Tue, Dec 2, 2014 at 10:49 PM, dbason notifications@github.com wrote:

To expand on this here's my use case:
I'm currently using docker with one of the schedulers out there in
community. I don't want to use Docker Hub for host registration/discovery
inside of docker. I don't want to plug something else in because this is
already implemented in the scheduler I'm running over the top. Will I be
able to turn all of that off and just run docker in a standalone fashion as
it is now?

—
Reply to this email directly or view it on GitHub
#8859 (comment).

boonkerz · 2015-01-04T00:22:12Z

When container are rebalanced as example an typical webapp
docker run -d --name mysql mysql
docker run -d --name elasticsearch elasticsearch
docker run -d --name web --link mysql:mysql --link elasticsearch:elasticsearch webserver
and the elasticsearch goes down
do the cluster also reconfigure the link on the webserver container?
and when the mysql is started on host 1 and the webserver on host2
do the cluster connect the right hosts together?

vieux · 2015-01-13T19:04:49Z

@boonkerz right now swarm don't support links.
There is a huge effort to improving the networking model directly into the docker engine.
With this improvements, the links will work between 2 hosts.
We are waiting for this to use it.

vieux · 2015-01-13T19:05:00Z

Hi everyone,

As you probably figured out, this design proposal was an early version of Swarm

Please redirect all your concerns to the Swarm issue tracker

Thanks!

jessfraz reviewed Oct 30, 2014
View reviewed changes

bfirsh reviewed Oct 31, 2014
View reviewed changes

thaJeztah mentioned this pull request Nov 13, 2014

Proposal: Add volume ls/inspect/rm/create commands #8484

Closed

docteurklein mentioned this pull request Nov 25, 2014

Proposal: Enable services to use different Docker Hosts docker/compose#608

Closed

SvenDowideit added the Proposal label Nov 26, 2014

vieux closed this Jan 13, 2015

aluzzardi deleted the clustering-proposal branch May 7, 2015 06:42


		$ docker -d --master --discovery https://discovery.hub.docker.com/u/demo-user/cluster-1

		Eventually, clustering will provide a built-in leader election algorithm making the `--master` flag obsolete.

Docker Clustering: Design proposal. #8859

Docker Clustering: Design proposal. #8859

Conversation

aluzzardi commented Oct 30, 2014

LK4D4 commented Oct 30, 2014

jessfraz Oct 30, 2014

Choose a reason for hiding this comment

gaberger Oct 30, 2014

Choose a reason for hiding this comment

discordianfish Oct 31, 2014

Choose a reason for hiding this comment

BhargavaRamM Apr 3, 2015

Choose a reason for hiding this comment

thaJeztah commented Oct 30, 2014

mkb commented Oct 30, 2014

jessfraz commented Oct 30, 2014

mkb commented Oct 30, 2014

cpuguy83 commented Oct 30, 2014

thaJeztah commented Oct 30, 2014

thaJeztah commented Oct 30, 2014

vieux commented Oct 30, 2014

thaJeztah commented Oct 30, 2014

asbjornenge commented Oct 30, 2014

vieux commented Oct 31, 2014

lukemarsden commented Oct 31, 2014

SamSaffron commented Oct 31, 2014

SamSaffron commented Oct 31, 2014

andreaturli commented Oct 31, 2014

bfirsh Oct 31, 2014

Choose a reason for hiding this comment

discordianfish Oct 31, 2014

Choose a reason for hiding this comment

bfirsh commented Oct 31, 2014

discordianfish commented Nov 11, 2014

shykes commented Nov 12, 2014

jbeda commented Nov 12, 2014

Pure RESTful vs. custom verbs

Error Handling

Serial configuration vs. Settling

inthecloud247 commented Nov 12, 2014

timothysc commented Nov 12, 2014

kelseyhightower commented Nov 12, 2014

Cluster Bootstrapping

Master election and Cluster state management

Open to collaboration

LK4D4 commented Nov 12, 2014

brendandburns commented Nov 13, 2014

Concern 1: Atomicity

Concern 2: A single API that mixes concepts between node and cluster

Concern 3: A lack of support for Labels or tags

Concern 4: An insufficient health model

Concern 5: An inability to update existing containers

bfirsh commented Nov 13, 2014

johngossman commented Nov 14, 2014

johngossman commented Nov 14, 2014

glynd commented Nov 17, 2014

inthecloud247 commented Nov 17, 2014

inthecloud247 commented Nov 17, 2014

discordianfish commented Nov 17, 2014

glynd commented Nov 18, 2014

discordianfish commented Nov 18, 2014

glynd commented Nov 18, 2014

sthulb commented Nov 26, 2014

dbason commented Dec 1, 2014

dbason commented Dec 2, 2014

shykes commented Dec 2, 2014

boonkerz commented Jan 4, 2015

vieux commented Jan 13, 2015

vieux commented Jan 13, 2015