
Cluster config fixes and removal of meta.peers config field #3638

Merged 4 commits into master on Aug 12, 2015
Conversation

@jwilder (Contributor) commented Aug 12, 2015

This PR fixes a panic and a regression with the -hostname flag as well as removes the [meta].peers config option to avoid confusion. The supported method for joining a node to a cluster is using the -join flag.

The env var overrides panic if peers was set in the config.
The type of that value is a []string and NumField is not valid
for that type. We don't currently support setting slice fields via
env variables. This particular value can already be set with the
-join flag, so we just skip these fields for now.
Adding a new peer must happen via the -join flag.
@jwilder jwilder changed the title Clustering fixes Clustering config fixes Aug 12, 2015
@@ -31,7 +31,7 @@ type Config struct {
 	Dir         string   `toml:"dir"`
 	Hostname    string   `toml:"hostname"`
 	BindAddress string   `toml:"bind-address"`
-	Peers       []string `toml:"peers"`
+	Peers       []string `toml:"-"`
Contributor


What does this mean? Ignore in older config files?

Contributor Author


Yes. It's ignored in old config files and not settable via TOML configs.

@jwilder jwilder changed the title Clustering config fixes Cluster config fixes and removal of meta.peers config field Aug 12, 2015
@otoolep (Contributor) commented Aug 12, 2015

+1

jwilder added a commit that referenced this pull request Aug 12, 2015
Cluster config fixes and removal of meta.peers config field
@jwilder jwilder merged commit 2e15449 into master Aug 12, 2015
@jwilder jwilder deleted the jw-fixes branch August 12, 2015 19:43
@sebito91 (Contributor)

Honestly, not sure this makes much sense. How are we supposed to control having a leader + follower set for when clustering is open to 100s or 1000s of nodes? Does this mean we'll always need to start one machine as the effective 'leader', then just have a -join for all other machines in the cluster? What if the leader fails and we need to restart that node, should they start back up with -join or without?

If we look to something like hdfs, this is handled via zookeeper and leader election is seamless to the process (ie. all nodes start up the same way with leader election handled behind the scenes). This change puts a lot of that work onto the designer of the cluster, and effectively creates a leader + follower structure that doesn't really work when you get out to scale.

@jwilder (Contributor, Author) commented Aug 21, 2015

When you start a cluster you can have them all start with the same -join flags. (e.g. set -join host1:8088,host2:8088,host3:8088 on all nodes). The first 3 nodes to join will take part in the raft cluster and will elect a leader amongst themselves. If the leader fails, a new leader will be elected from the remaining raft peers if a quorum is available.

Adding additional nodes just requires joining another node in an existing cluster. It does not need to join a member of the raft cluster; any node is fine. Once a node has joined a cluster, the -join flag is not used anymore and it will re-use its existing cluster state, so it's fine to set it once and leave it.
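Concretely, the startup described above might look like this (hypothetical hostnames and invocation; `-join` and `-hostname` are the flags discussed in this PR):

```shell
# Same -join value on every node; the first 3 to join form the raft group.
influxd -hostname host1 -join host1:8088,host2:8088,host3:8088
influxd -hostname host2 -join host1:8088,host2:8088,host3:8088
influxd -hostname host3 -join host1:8088,host2:8088,host3:8088

# Node 4 and beyond join through any existing member; it need not be a
# raft peer, and the flag is ignored on restart once state exists.
influxd -hostname host4 -join host1:8088,host2:8088,host3:8088
```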

@jwilder (Contributor, Author) commented Aug 21, 2015

You can also create a cluster by incrementally joining new nodes to existing ones but you don't have to do it this way. For example, start one, then start new nodes passing -join node1:8088. Once you have more than 1 member in the cluster, new nodes can join any of those nodes to expand the cluster.

@beckettsean (Contributor)

Deleted prior comment, my understanding is actually incorrect.

@sebito91 (Contributor)

@beckettsean, @jwilder: thanks for the feedback. Obviously the documentation is still under review and there are a few kinks to work out...completely understood. It makes a little more sense now that we can effectively daisy-chain members into the cluster as necessary (according to @jwilder).

The concern we had internally was that the -join logic was unclear...do we just specify the three nodes for raft consensus across all machines that would join the cluster? How would node 1000 work when specifying the same raft consensus set? What happens to the 1000 nodes in the cluster if the three in the raft consensus died?

We'll keep testing these features out and document any issues that come up.

@jwilder (Contributor, Author) commented Aug 24, 2015

@sebito91 You can specify any nodes to join. Using the same 3 nodes every time is fine even with larger clusters.

If a raft node dies and will never come back, it needs to be replaced. If it just goes offline, it'll rejoin the raft group when it restarts. With 3 raft nodes in the cluster, the cluster can tolerate one raft node going offline before availability is affected; with 5, it can tolerate 2 failures. Replacing a failed raft node is currently a manual process, which still needs to be documented.
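The failure tolerance quoted above follows from raft's majority-quorum rule: a group of n voting members needs a quorum of floor(n/2)+1, so it tolerates floor((n-1)/2) failures. A quick check:

```go
package main

import "fmt"

// tolerated returns how many raft members can fail while a quorum
// (a strict majority) of n voting members remains available.
func tolerated(n int) int { return (n - 1) / 2 }

func main() {
	for _, n := range []int{3, 5, 7} {
		fmt.Printf("%d raft nodes -> quorum %d, tolerates %d failure(s)\n",
			n, n/2+1, tolerated(n))
	}
}
```

This matches the numbers in the comment: 3 nodes tolerate 1 failure, 5 tolerate 2.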
