[WIP] OrleansDockerUtils -> Docker and Docker Swarm Clusters for Orleans #2569

galvesribeiro · 2017-01-05T04:59:42Z

This PR brings support to Docker and Docker Swarp cluster for Orleans.

It is based on @ReubenBond's work on SF Membership Oracle but using the Docker APIs.

The design of this Membership Oracle is to not have a MBR just as SF doesn't. It works by communicating with Docker Daemon or Swarm endpoints using https://github.com/Microsoft/Docker.DotNet which is a wrapper over Docker APIs, and leveraging the Labels feature of docker.

The idea is to group a set of containers by deploymentId on the same host (Docker Daemon) or across multiple hosts (Swarm cluster). Whenever a client start, the DockerGatewayProvider connects to Docker and query for the current list of containers which are a silo, and are in the same deploymentId so it can build the gateway list. After that first list, it set a timer to refresh that list from docker. The same happens at the silo side which basically find the other silos by the deploymentId.

Basically there is no persistence at all neither MBR tables since all the containers can be found by the labels and when a container dies, it is automatically removed from docker list so it will not return anymore.

The hosting model is basically any container engine which implement Docker APIs (i.e. Docker Daemon, Docker Swarm, etc.). There is no change on how things are deployed or a new OrleansHost for that. All people need to do is to set the appropriate labels on the container which has the Orleans application code and everything will just works as long as the container is able to reach over network the Docker API host (i.e. the container host or Swarm manager).

There are two things that I'll implement after this PR:

Listen to docker stream of events and add/remove silos live -> the streams are very unstable yet and not available on production docker engines.
At runtime apply a label to the container mentioning that it is dead -> due to a blocker on docker for windows, commit changes to a running container isn't available yet.

This is a WIP. I was able to run manual tests here and made 2 containers and a client talk perfectly. The current code is totally usable. I've simulated adding and removing silos/containers to the cluster and they join and leave just fine.

I'm now working on get a unit test project with some DockerFiles so all people will need to run the tests is have Docker installed.

I appreciate any feedback while I'm working in the tests so I can address them quickly.

ReubenBond · 2017-01-05T05:43:38Z

src/OrleansDockerUtils/OrleansDockerExtensions.cs

+
+            if (parameters.TryGetValue("Certificate".ToUpper(), out certificate))
+            {
+                if (!File.Exists(certificate)) throw new FileNotFoundException("Unable to find certificate file");


Windows users might be confused about this, since they're used to having a cert store and referring to certs using thumbprints rather than file names.

Yeah I agree. But this is the way Linux users deal with certs. The cert stores are not implemented for linux yet so I rather stick with this filename for now.

I'm fine with that, I was just pointing out that the name might be a cause for confusion with some users. A name like CertificatePath or CertFile might be clearer.

ReubenBond · 2017-01-05T05:43:39Z

src/OrleansDockerUtils/OrleansDockerExtensions.cs

+
+            Credentials credentials = null;
+
+            if (parameters.TryGetValue("DaemonEndpoint".ToUpper(), out username))


DaemonEndpoint is the username?

The logic in this method isn't straightforward because if the user specifies DaemonEndpoint as well as Certificate, the credentials will first be set to BasicAuthCredentials and then to CertificateCredentials.

No, it is the Docker Daemon or Docker Swarm endpoint uri like http://10.0.0.1:2375 or tcp://localhost:2375

The variable is called username

Oh! now I saw the problem. It was a change before the push which led to this bug... the string on the key is wrong... It should be "Username". Fixed.

ReubenBond · 2017-01-05T05:44:44Z

src/OrleansDockerUtils/Utilities/ErrorCode.cs

+        Docker_GatewayProvider_ExceptionNotifyingSubscribers = DockerBase + 1,
+        Docker_GatewayProvider_ExceptionRefreshingGateways = DockerBase + 2,
+        Docker_MembershipOracle_ExceptionNotifyingSubscribers = DockerBase + 3,
+        Docker_MembershipOracle_ExceptionRefreshingPartitions = DockerBase + 4


Docker uses partitions?

Ops... Bad copy and paste... Removing

ReubenBond · 2017-01-05T05:48:04Z

src/OrleansDockerUtils/DockerSiloResolver.cs

+
+        public async Task Refresh()
+        {
+            var result = await Task.WhenAll((await dockerClient.Containers.ListContainersAsync(


await Task.WhenAll(await ...)?

I'd recommend moving the inner await outside so that it's more obvious what's happening here

ReubenBond · 2017-01-05T05:51:55Z

src/OrleansDockerUtils/DockerSiloResolver.cs

+                    return new DockerSiloInfo(_.Config.Hostname,
+                        new IPEndPoint(IPAddress.Parse(_.NetworkSettings.Networks.First().Value.IPAddress), 
+                            int.Parse(_.Config.Labels[DockerLabels.SILO_PORT])),
+                        new IPEndPoint(IPAddress.Parse(_.NetworkSettings.Networks.First().Value.IPAddress), 


Some variables could be extracted for clarity, eg the IPAddress, maybe _.Config. We tend not to use _ as a parameter name anymore (I used to do it a lot, but it doesn't fit with the codebase well).

Additionally, the body of this select could be extracted into its own method, like GetSiloFromContainer

ReubenBond · 2017-01-05T05:56:14Z

src/OrleansDockerUtils/DockerMembershipOracle.cs

+        private readonly ILocalSiloDetails localSiloDetails;
+        private readonly Logger log;
+        private readonly GlobalConfiguration globalConfig;
+        private readonly IDockerSiloResolver resolver;


Just to be sure, which parts of this file differ from the Service Fabric oracle other than this type?

Nothing is different actually. Just this resolver interface... Like I said yesterday, too much boilerplate code... We should later refactory this as a common abstract class or something in another PR.

gabikliot · 2017-01-05T07:02:03Z

Can you pleasevprovide a bit more details, describing how exactly you integrate with docker, where is the MBR table or who and how membership Oracle is provided, what is the hosting model, ... So people can understand without reading the code.

galvesribeiro · 2017-01-05T12:29:28Z

@gabikliot sorry, I was actually hoping just @ReubenBond jump in since he just implemented the Service Fabric Membership Oracle and this implementation is almost the same. But yes, I'll edit the PR and include more details.

Regarding MBR, there is no MBR table at all, just as in SF. I'll detail more on the PR text. Thanks

ReubenBond · 2017-01-05T12:33:05Z

Documentation on this would be great - I need to document "Hosting Orleans on Service Fabric", too.

galvesribeiro · 2017-01-05T12:42:10Z

Yes, the docs is the PR right after this one @ReubenBond

Unfortunately we sill need to write docs in a separated PR to use the gh-pages branch (unless GH find a way to multibranch PR!)

sergeybykov · 2017-01-05T17:08:37Z

This is a great effort!

Let's open an issue to discuss the design instead of mixing it here with code review feedback.

I'm trying to comprehend it, and have some basic initial questions.

How does it handle generations (epochs) of silos? I see the code that parses it from DockerLabels.GENERATION : var generation = int.Parse(container.Config.Labels[DockerLabels.GENERATION]);, but not where it is getting set. If the silo process restarts within a container, it will have a new generation. How does Docker know that?
due to a blocker on docker for windows, commit changes to a running container isn't available yet.

Does this mean that there is no way to mark a silo as dead?

In general, Is there an implied equivalency here of "container is up" and "silo is running"? Can't we get into a situation that a silo is unresponsive, e.g. it's messaging queues are or threads are blocked, but Docker will consider the container healthy and keep it in the list of container/silos on the cluster?

gabikliot · 2017-01-05T17:54:52Z

@galvesribeiro, I know how SF works and also what does it mean from Orleans perspective. I was asking about Docker Swarm. I am looking into more high level description of how it is used, both hosting and MBR, like here: http://dotnet.github.io/orleans/Documentation/Runtime-Implementation-Details/Consul-Deployment.html

For example, you can see here the long thread we went through when designing Consul based solution: #1267
You can see all the questions that were asked, all the options we considred.

So can we please open an issue and discuss the design, like @sergeybykov suggested. Ideally, I think we prefer doing that BEFORE code is submitted for review, so that people can actually comprehend the code, after the design is ready. Without seeing the design, its hard to give any valuable feedback.

gabikliot · 2017-01-05T17:59:12Z

Also would be interesting to ask why Swarm and not other Docker clustering technologies. Is there a specific reason, or customer who asked for that, or just a first one we picked without any particular reason?

galvesribeiro · 2017-01-05T18:52:47Z

@sergeybykov

Let's open an issue to discuss the design instead of mixing it here with code review feedback.

Ok will do in a minute.

How does it handle generations (epochs) of silos? I see the code that parses it from but not where it is getting set. If the silo process restarts within a container, it will have a new generation. How does Docker know that?

Docker containers are immutable. In no way a same container or its main process can restart. It is discarded and start a fresh one so the generation is pointless in this case. I'll describe in the issue how I do that.

Does this mean that there is no way to mark a silo as dead

If the silo process is frozen or crash, the container is killed and a new one must be spawned. 1 container has only 1 main process which in this case is the silo process. So if it dies, the whole container die. More on that in the issue.

@gabikliot

I agree, let me open an issue and link here in a minute.

Also would be interesting to ask why Swarm and not other Docker clustering technologies. Is there a specific reason, or customer who asked for that, or just a first one we picked without any particular reason?

You can use any docker clustering platform as long as they respect Docker APIs and they do (almost). Swarm is by far Today's most used technology for Docker clustering and the one that comes from Docker Inc. If you look at the core, there is no explicit mention tie to Swarm. The same code that talk with a single Docker Daemon talk to Swarm. Its orthogonal. Just a matter of change endpoints. I'll dive into details in the issue.

Anyway, linking the issue here soon. Thanks for the comments.

galvesribeiro · 2017-01-05T20:45:58Z

Btw, sidenote... I didn't added the .Net core projects yet, will do in the end since the code dont change. The tests I'm building some easy way to run from xUnit with a DockerFile so we can spin up containers on any dev machine.

gabikliot · 2017-01-06T07:55:59Z

Thank you for the detailed explanation @galvesribeiro .
Can you please provide a bit more details (not too much, but still a bit) about Swarm membership. Just a bit background, to make sure the models are compatible.

gabikliot · 2017-01-06T07:25:10Z

src/OrleansDockerUtils/DockerSiloResolver.cs

+
+        public async Task Refresh()
+        {
+            var containerList = await dockerClient.Containers.ListContainersAsync(


can you please replace var containerList with the actual type, so its more readable.

gabikliot · 2017-01-06T07:25:38Z

src/OrleansDockerUtils/DockerSiloResolver.cs

+                                },
+                            }
+                });
+            var inspectionResult = await Task.WhenAll(containerList.Select(c => dockerClient.Containers.InspectContainerAsync(c.ID)));


same comment about var here.

gabikliot · 2017-01-06T07:33:47Z

src/OrleansDockerUtils/DockerMembershipOracle.cs

+        /// Updates the status of this silo.
+        /// </summary>
+        /// <param name="status">The updated status.</param>
+        private void UpdateStatus(SiloStatus status)


UpdateStatus does not really do anything, it does not propagate the new status of this silos remotely to other silo. We need that in Orleans. There is a difference between Starting/Active, or Terminating/Stopping/Dead.

I don't understand @gabikliot. What you mean by "it does not do anything"?

foreach (var subscriber in subscribers.Values) { subscriber.SiloStatusChangeNotification(SiloAddress, status); }

It is notifying all the registered subscribers about the new status on this method. I don't see your point here, sorry. Can you elaborate?

It does not propagate the new status of this silo to other silos (yes, it does update local subscribers).

When a silo goes lets say to terminating state (just an example), we need to notify all other silos about it. Its not enough to notify all local components. If we can't do distributed silo states, that's a major gap that we are opening compared to the main MBR Oracle and we need to think through the implications.

Ok, then we have the same problem in SF implementation because it does the same thing there. o.O

The SF solution is not merged yet, so not really a good source of example to follow, at least until merged.
We need to solve that, right?

The propagation of silo status changes happens in FabricMembershipOracle.cs:L348. The only states which are supported are Active and Dead, but they satisfy the distinction for SiloStatus.IsTerminating().

Do you think this is an issue, @gabikliot?

This is a somewhat critical question I think so it may deserve its own issue.
If we only need 2 states for SF livenes, then maybe we need only 2 for the regular case as well? Maybe we can simplify the silo state machine and get rid of all other states and stay with just 2?
One need to look at what we are getting out if the other states. I wonder what @sergetbykov thinks. Would like to hear his opinion.

One thing is sure in my eyes: we should try to have similar silo state machines in all livenes cases, and not just ditch states in some cases cause we can't implementat them. If u go that route, it may work, or more likely you will be supporting diverging implementaion and at the end paying higher cost.

Alternatively: when I wrote the first integration with SF(WF at a time) 3-4 years ago, it was similar to your code now, and in addition my plan was to use their build in key value store to store silo states. I didn't implemenent it, but maybe it can work.

If u go that route, them yet another alternative is to ditch SF livenes and just use the key value store to implement MBR table. That is exactly what we did for Consul. It works, so here is an advantage. And Consul is comparable to SF, so why not? More choices.:-)

gabikliot · 2017-01-06T07:37:09Z

src/OrleansDockerUtils/DockerMembershipOracle.cs

+        {
+            try
+            {
+                await resolver.Refresh();


This is a bit weird. Oracle refreshes the docker resolver which notifies its subscribers (this Oracle) back about the MBR. Cycle of dependences? Or did I misunderstand it.

Yeah you are right. I copy that over from the SF implementation. I just refactored the timer out of both gateway and oracle. Now it is inside the resolver and this dependency don't happen anymore. Thanks for notice that.

galvesribeiro · 2017-01-06T13:13:34Z

Can you please provide a bit more details (not too much, but still a bit) about Swarm membership. Just a bit background, to make sure the models are compatible.

@gabikliot Sure, but sorry, I din't understand. You mean the Docker Swarm membership implementation from Docker Inc. or the the Docker Oracle Membership I implemented using Swarm for Orleans?

gabikliot · 2017-01-07T02:12:21Z

I meant the former: "Docker Swarm membership implementation from Docker Inc."

Mostly not how it is implemented, but what it provides to the users of that protocol. What are the semantics. Like does it notify or do you have to pool it, how fast it detects, does it have an option to specify health check ....

That knowledge will help review the latter (" Docker Oracle Membership I implemented using Swarm for Orleans").

galvesribeiro · 2017-01-07T02:15:48Z

@gabikliot I'm updating the issue related to this PR and will ping you there in a minute.

Fixed a small bug detected on a test

galvesribeiro · 2017-01-09T23:34:25Z

@gabikliot I've updated #2571 with more info related of health and membership.

About the Swarm membership implementation (or any other orchestration engine for containers like Mesos, DC/OS and Kubernetes), there nothing that can help us in this case. Swarm has its own Raft implementation for its manager, but it is restricted to Swarm usage. The other orchestration engines have their own and suffer of the same problem. They only care about their cluster, and not the containers. What they care is only if the container is up or down. Nothing more.

gabikliot · 2017-01-10T16:19:18Z

Does swarm have key value store? To store silo states or implement MBR table?

galvesribeiro · 2017-01-10T17:07:21Z

@gabikliot no, it doesn't. In fact, none of the current orchestrators have it and that is why we have having this whole discussion here. If we have it, we would be implementing the IMembershipTable like in Consul and Zookeeper. :(

galvesribeiro · 2017-01-30T23:51:18Z

Guys I'm closing this PR since it is not required anymore. See here: #2571 (comment)

Thanks for all the feedback.

galvesribeiro added 2 commits January 4, 2017 02:57

DockerUtils project structure and nuspec

65dd4ac

Initial DockerUtils

071d8e5

dnfclas added the cla-already-signed label Jan 5, 2017

galvesribeiro changed the title ~~[WIP] Dockerutils -> Docker and Docker Swarm Clusters~~ [WIP] OrleansDockerUtils -> Docker and Docker Swarm Clusters for Orleans Jan 5, 2017

ReubenBond reviewed Jan 5, 2017

View reviewed changes

galvesribeiro added 2 commits January 5, 2017 11:10

Addressed 1st round of feedback

b8b90da

Missing commit.

22d21c8

sergeybykov mentioned this pull request Jan 5, 2017

Service Fabric cluster membership providers #2542

Merged

galvesribeiro mentioned this pull request Jan 5, 2017

OrleansDockerUtils -> Docker and Docker Swarm Clusters for Orleans #2571

Closed

Consider if a silo has a Gateway installed or not based on docker label

dc277c2

gabikliot reviewed Jan 6, 2017

View reviewed changes

Addressed Gabi's feedback

b8bd7f9

Fixed a small bug detected on a test

sergeybykov mentioned this pull request Jan 10, 2017

Separate distributed silo health checks from cluster membership #2580

Open

sergeybykov added this to the 1.5.0 milestone Jan 14, 2017

galvesribeiro closed this Jan 30, 2017

github-actions bot locked and limited conversation to collaborators Dec 9, 2023


		Credentials credentials = null;

		if (parameters.TryGetValue("DaemonEndpoint".ToUpper(), out username))

[WIP] OrleansDockerUtils -> Docker and Docker Swarm Clusters for Orleans #2569

[WIP] OrleansDockerUtils -> Docker and Docker Swarm Clusters for Orleans #2569

Conversation

galvesribeiro commented Jan 5, 2017 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ReubenBond Jan 5, 2017 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

gabikliot commented Jan 5, 2017 • edited Loading

galvesribeiro commented Jan 5, 2017

ReubenBond commented Jan 5, 2017

galvesribeiro commented Jan 5, 2017

sergeybykov commented Jan 5, 2017

gabikliot commented Jan 5, 2017

gabikliot commented Jan 5, 2017

galvesribeiro commented Jan 5, 2017

galvesribeiro commented Jan 5, 2017

gabikliot commented Jan 6, 2017

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

galvesribeiro commented Jan 6, 2017

gabikliot commented Jan 7, 2017

galvesribeiro commented Jan 7, 2017

galvesribeiro commented Jan 9, 2017

gabikliot commented Jan 10, 2017

galvesribeiro commented Jan 10, 2017 • edited Loading

galvesribeiro commented Jan 30, 2017 • edited Loading

galvesribeiro commented Jan 5, 2017 •

edited

Loading

ReubenBond Jan 5, 2017 •

edited

Loading

gabikliot commented Jan 5, 2017 •

edited

Loading

galvesribeiro commented Jan 10, 2017 •

edited

Loading

galvesribeiro commented Jan 30, 2017 •

edited

Loading