
Documentation needed for cloud-cluster case #5418

Closed
justinsb opened this issue May 20, 2016 · 7 comments

@justinsb
Contributor

It isn't entirely clear how team-etcd would recommend running a cluster on AWS, GCE, or other clouds, where we have things like easily programmable DNS & persistent volumes. See for example kubernetes/kubernetes#19443

@philips suggested that it would be possible to run N instances with N persistent volumes, and to repoint DNS instead of performing cluster replacement (for normal operation).

It would be great to get some documentation for the "recommended" way of doing things; in particular, I had these 4 questions on the DNS approach: kubernetes/kubernetes#19443 (comment)

Copying those 4 questions here:

  1. What the correct value for ETCD_INITIAL_CLUSTER_STATE should be. I don't see how we can know this. The only thing we can know is whether the local node has been initialized (assuming we have persistent storage).
  2. How we specify the cluster size (i.e. how does etcd know that the cluster is of size 3, and the quorum is 2?). Does this rely on us always having all the SRV records present? And if so, does this mean that we might as well use static configuration (ETCD_INITIAL_CLUSTER), given that the SRV records must always contain these values?
  3. That etcd won't store the IP addresses that the DNS names resolve to (i.e. that we can repoint the DNS names).
  4. Whether this is in fact the best option, or whether CoreOS would recommend something else.

In any case, the docs are great for the bare-metal case, where operator intervention is required to replace cluster members, but it would be great if they also covered the programmable-infrastructure case where we can hopefully auto-recover from most/many failure scenarios.

@xiang90
Contributor

xiang90 commented May 21, 2016

What the correct value for ETCD_INITIAL_CLUSTER_STATE should be.

After you bootstrap an etcd cluster, any new etcd member (one without etcd data) that wants to join the bootstrapped cluster should always set this to existing. If a previous machine somehow loses its data, it should be viewed as a new member; if that machine wants to rejoin the cluster, it should also set this to existing.
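A minimal sketch of what that looks like as environment configuration, assuming a hypothetical member name etcd3 and example.com DNS names (neither appears in this thread):

```sh
# Hypothetical new member joining an already-bootstrapped cluster.
# It must first be announced via the reconfiguration API, e.g.
# etcdctl member add etcd3 http://etcd3.example.com:2380
ETCD_NAME=etcd3
ETCD_INITIAL_CLUSTER_STATE=existing
ETCD_INITIAL_CLUSTER=etcd0=http://etcd0.example.com:2380,etcd1=http://etcd1.example.com:2380,etcd2=http://etcd2.example.com:2380,etcd3=http://etcd3.example.com:2380
ETCD_INITIAL_ADVERTISE_PEER_URLS=http://etcd3.example.com:2380
```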

How we specify the cluster size

It only matters for the bootstrap case, so both ways you mentioned work. After bootstrap, the cluster size is controlled explicitly by external tools, or by a human, via the etcd reconfiguration API.
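For illustration, post-bootstrap membership changes through that API look roughly like this with etcdctl (the endpoint and member name are placeholders):

```sh
# Inspect current membership; quorum is derived from this list,
# not from any bootstrap-time setting.
etcdctl --endpoints http://etcd0.example.com:2379 member list

# Grow the cluster from 3 to 4 members via the reconfiguration API.
etcdctl --endpoints http://etcd0.example.com:2379 member add etcd3 http://etcd3.example.com:2380
```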

That etcd won't store the IP addresses that the DNS names resolve to

I do not follow this question. Can you explain more?

Whether this is in fact the best option, or whether CoreOS would recommend something else.

If you can use DNS and can easily manage it, then great! Use DNS names for the peer URLs. Then, as long as the data still exists, machine replacement will not involve any reconfiguration; you only need to update the DNS record.
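Concretely, that means advertising DNS names rather than IPs in the peer URLs, for example (the example.com names are illustrative):

```sh
# Advertise a stable DNS name, not the machine's IP. Replacing the
# machine behind etcd0.example.com then only requires repointing the
# record and reattaching the data volume, with no etcd reconfiguration.
ETCD_NAME=etcd0
ETCD_LISTEN_PEER_URLS=http://0.0.0.0:2380
ETCD_INITIAL_ADVERTISE_PEER_URLS=http://etcd0.example.com:2380
ETCD_LISTEN_CLIENT_URLS=http://0.0.0.0:2379
ETCD_ADVERTISE_CLIENT_URLS=http://etcd0.example.com:2379
```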

programmable-infrastructure

In most cases (when you do not lose data), human intervention is not required. If you lose your data, then the etcd member is lost and additional work is required. The etcd reconfiguration API is programmable, so you can write a program against it.
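As a sketch of that additional work for a member whose data is gone (the member ID and name below are hypothetical):

```sh
# A member that lost its data cannot simply restart; the cluster still
# remembers its old identity, so remove it and add it back.
etcdctl member list                      # find the dead member's ID
etcdctl member remove 91bc3c398fb3c146   # hypothetical member ID
etcdctl member add etcd1 http://etcd1.example.com:2380
# Then start etcd on the replacement machine with an empty data
# directory and ETCD_INITIAL_CLUSTER_STATE=existing.
```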

@xiang90
Contributor

xiang90 commented May 27, 2016

@justinsb Does this answer your questions?

@justinsb
Contributor Author

Sorry for the delay in replying, and thanks for confirming that repointing DNS works when we replace nodes.

On the first two questions, I'm not entirely clear on how to bootstrap a cluster then. What I will do is have 3 EBS volumes (in 3 AZs) and 3 DNS names (etcd0, etcd1, etcd2). I can statically configure each etcd with ETCD_INITIAL_CLUSTER for the 3 nodes (ETCD_INITIAL_CLUSTER=etcd0,etcd1,etcd2). Then I will arrange to attach the EBS volumes to the nodes and repoint the DNS names (I likely won't be able to use k8s PetSets, but you can imagine that we are using them).

So how do I set ETCD_INITIAL_CLUSTER_STATE in this scenario?

And does setting ETCD_INITIAL_CLUSTER=etcd0,etcd1,etcd2 mean that the cluster will only initialize once a quorum of members (2 nodes) comes online? (I'm pretty sure that's what I want.)

@xiang90
Contributor

xiang90 commented Jun 1, 2016

So how do I set ETCD_INITIAL_CLUSTER_STATE in this scenario?

For static bootstrapping, always set this to new (the default is new, so you do not need to set it).

cluster will only initialize once a quorum of members (2 nodes) comes online?

Yes. No writes will go in until the quorum is up.
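Putting the answers together for the 3-volume/3-name scenario above, a plausible per-node configuration (the example.com names and the data directory are assumptions, not from this thread) might be:

```sh
# Identical on all three nodes except ETCD_NAME and the advertise URLs.
# ETCD_INITIAL_CLUSTER takes name=peer-url pairs; listing three members
# implies a size-3 cluster with a quorum of 2.
ETCD_NAME=etcd0
ETCD_DATA_DIR=/var/lib/etcd   # on the EBS volume attached to this node
ETCD_INITIAL_CLUSTER_STATE=new
ETCD_INITIAL_CLUSTER=etcd0=http://etcd0.example.com:2380,etcd1=http://etcd1.example.com:2380,etcd2=http://etcd2.example.com:2380
ETCD_INITIAL_ADVERTISE_PEER_URLS=http://etcd0.example.com:2380
ETCD_ADVERTISE_CLIENT_URLS=http://etcd0.example.com:2379
```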

@xiang90
Contributor

xiang90 commented Jun 9, 2016

@justinsb Does my reply answer your question?

@xiang90
Contributor

xiang90 commented Jun 17, 2016

@justinsb I am closing this due to low activity. Reopen if you have a follow-up.

@xiang90 closed this as completed Jun 17, 2016
@justinsb
Contributor Author

Sorry for the delay & thanks! I can confirm that this setup does work perfectly (so far). I'm working on more testing and documentation, but thanks for the pointers.
