Cluster Setup #5706

jwilder · 2016-02-16T21:16:46Z

This PR addresses most of the issues from #5673. Specifically, it changes the following:

Ensures node IDs are the same when a node is running both meta and data services
Allows bind addresses where a hostname or IP is not specified to work correctly and bind to all interfaces by default. e.g. bind-address = ":8088" will now bind to all interfaces instead of just localhost.
Fixes the top-level hostname config option to allow overriding all bind address hostnames. This allows a node to advertise a different hostname than what is defined in the bind address setting. For example, if the config is bind-address = ":8088" and hostname = "influx1", the node will bind to all interfaces on port 8088 and remote nodes will reach this node using the address influx1:8088. If a hostname is not specified, we default to localhost for backwards compatibility. This may change to os.Hostname() in the future if/when Add gossip protocol for determining node addresses #5672 is implemented.
Adds the -hostname command-line option back to allow specifying both -join and -hostname as command-line flags if desired.
Enforces a configuration precedence and overriding ability defined as config file is overridden by env vars which are overriden by command-line flags. These options apply in order and update the Config used by the services and code.
Adds the join config file option back to meta config. This allows join servers to be specified in a config files, via env vars, or command-line flags and ordering precedence is the same as -hostname.

joelegasse · 2016-02-18T01:05:57Z

services/meta/client.go

@@ -1251,6 +1258,14 @@ func (e errRedirect) Error() string {
 	return fmt.Sprintf("redirect to %s", e.host)
 }

+type errCommand struct {


Is there a reason to make thing a struct instead of just an aliased type? type errCommand string

No particular reason other than the error above did it this way: #5706 (diff)

e-dard · 2016-02-18T15:09:31Z

Small nit but LGTM 👍

-hostname is back \o/

Fixes #5669

This fixes several issues related to the bind address and hostname: * Allows bind addresses where a hostname or IP is not specified to work correct and bind to all interfaces by default. * Fixes the top-level "hostname" config option to allow overridding all bind address hostnames. This allows a node to advertise a different hostname than what is defined in the bind address setting. * Adds the -hostname command-line option back to allow specifing both -join and -hostname as command-line flags. * Enforces a configuration precedence and overriding ability defined as config file is overridden by env vars which are overriden by command-line flags. Fixes #5670 #5671

Dropping a meta node that had already been removed from the config would fail because the raft.RemovePeers call would return an error that the address was unknown. This change skips calling RemovePeer if it doesn't exist. Dropping a non-existing ID would hang for 10 seconds becuase the meta.Client retryUntilExec didn't differentiate before command errors and redirect errors. In this case, the command would return an error but we'd try 10 more times and ultimately give up and return the error. We now return immediately if the command returned and error because retrying it will not succeed. Finally, the join loop had no delay and would immediately try to join the other nodes hundreds of times a second. We now pause a second if we've tried every node at least once.

Cluster Setup

jwilder added this to the 0.11.0 milestone Feb 16, 2016

jwilder force-pushed the jw-cluster branch from 61811c4 to e8ffd82 Compare February 16, 2016 21:18

jwilder changed the title ~~Use same node ID for meta and data nodes~~ Cluster Setup Feb 17, 2016

jwilder force-pushed the jw-cluster branch from fcb322b to f6164a7 Compare February 17, 2016 04:37

joelegasse reviewed Feb 18, 2016
View reviewed changes

jwilder force-pushed the jw-cluster branch from f6164a7 to 1d01ef4 Compare February 18, 2016 20:51

jwilder added 4 commits February 18, 2016 14:45

Use same node ID for meta and data nodes

a90114a

Fixes #5669

Add join config option back

04ba794

jwilder force-pushed the jw-cluster branch from 1d01ef4 to 04ba794 Compare February 18, 2016 21:45

jwilder added a commit that referenced this pull request Feb 18, 2016

Merge pull request #5706 from influxdata/jw-cluster

26163cb

Cluster Setup

jwilder merged commit 26163cb into master Feb 18, 2016

jwilder deleted the jw-cluster branch February 18, 2016 22:08

rossmcdonald mentioned this pull request Feb 19, 2016

bind IP and IP presented to cluster should be separately configured #5752

Closed

mvadu mentioned this pull request Feb 20, 2016

Error - testing Cluster setup on Windows #5715

Closed

PierreF mentioned this pull request Mar 16, 2016

[0.11.0-rc1] Cluster with -join need ALL node to restart #6027

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cluster Setup #5706

Cluster Setup #5706

jwilder commented Feb 16, 2016

joelegasse Feb 18, 2016

jwilder Feb 18, 2016

e-dard commented Feb 18, 2016

Cluster Setup #5706

Cluster Setup #5706

Conversation

jwilder commented Feb 16, 2016

joelegasse Feb 18, 2016

Choose a reason for hiding this comment

jwilder Feb 18, 2016

Choose a reason for hiding this comment

e-dard commented Feb 18, 2016