RPC Advertise Address not Advertisable if -bind 0.0.0.0 #186

ghost · 2015-10-01T05:12:40Z

Getting the following:

[root@nomad ~]# nomad agent -server -bootstrap-expect 1 -data-dir /tmp/nomad -bind 0.0.0.0
==> WARNING: Bootstrap mode enabled! Potentially unsafe operation.
==> Starting Nomad agent...
==> Error starting agent: server setup failed: Failed to start RPC layer: RPC advertise address is not advertisable: [::]:4647

This happens only when -bind 0.0.0.0 is set; setting an interface IP clears the error. It occurs whether or not IPv6 is enabled. Tested on CentOS 7 and CoreOS Stable (in client-only mode).
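
For readers wondering why [::]:4647 is rejected: a wildcard address is fine to listen on, but it gives other nodes nothing to dial. The following is a minimal Go sketch of that kind of check, not Nomad's actual code; the helper name isAdvertisable is made up for illustration.

```go
package main

import (
	"fmt"
	"net"
)

// isAdvertisable reports whether hostPort names a concrete IP address that
// other nodes could dial. Hypothetical helper, not taken from Nomad.
func isAdvertisable(hostPort string) bool {
	host, _, err := net.SplitHostPort(hostPort)
	if err != nil {
		return false
	}
	ip := net.ParseIP(host)
	// 0.0.0.0 and :: are "unspecified": valid for listening, useless for dialing.
	return ip != nil && !ip.IsUnspecified()
}

func main() {
	for _, addr := range []string{"[::]:4647", "0.0.0.0:4647", "10.0.0.5:4647"} {
		fmt.Printf("%-14s advertisable=%v\n", addr, isAdvertisable(addr))
	}
}
```

Under this check both wildcard forms fail and only the concrete address passes, which matches the behaviour described above.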


LordFPL · 2015-10-01T09:43:08Z

I'm hitting the same problem: binding to 0.0.0.0 works, but there is a problem with the advertise address... only one can be used.

In Consul I never encountered this problem... maybe it's implicit there, but in Nomad you have to specify it.

Here is part of my config:

bind_addr = "0.0.0.0"
advertise {
    rpc = "my.ip:4647"
    serf = "my.ip:4648"
}

I have asked in the Google group about a way to specify a dynamic variable like $(hostname -i).

ryanuber · 2015-10-01T18:49:18Z

Ah, so in Consul I think this worked because we had a function that would scan for a private IP address, and automatically use that. There were a number of different opinions surrounding that option, and although in most cases it made things easier to start, it was intentionally left out so that it would always be clear which address we were binding and advertising. It just gets ambiguous if there are multiple interfaces at play.

I'm marking this as a thinking ticket, because the UX can probably be improved. Thanks for reporting.

apognu · 2015-10-02T14:46:41Z

Maybe, in case the specific IP address of the server isn't known in advance, its advertise addresses could be derived from a given subnet. Something along the lines of:

advertise {
  rpc = "10.0.0.0/8:4647"
  serf = "10.0.0.0/8:4648"
}

I already have a patch somewhere that would look for the first interface to have an IP on the given subnet and use it for advertising, if it is deemed interesting.

HenryTheHamster · 2015-10-06T08:28:41Z

+1 to the subnet idea, or something similar to avoid having to know the IP up front

cbednarski · 2015-10-06T08:32:19Z

> I already have a patch somewhere that would look for the first interface to have an IP on the given subnet and use it for advertising, if it is deemed interesting.

@apognu That would be welcome!

apognu · 2015-10-06T09:21:26Z

Give me a few hours, I'll submit a PR.

cetex · 2015-10-18T21:47:00Z

I'd like to add another thing here: this should behave like Consul does, so -bind=:: should work (bind to any available IPv4 or IPv6 address), as should the command-line option -advertise=

In our Consul environment I set all nodes to -bind=:: and then -advertise=<the "real" IP of the node>

On all masters I set up a secondary IP address of 10.255.255.255 on loopback (this is anycast through our network), so any client anywhere within our network can just use "-retry-join=10.255.255.255" and find the closest running master available.

Serf/Raft seems to take care of the rest: the client finds a master, gets the full list of all other masters, and then seems to ignore the -retry-join IP.

c4milo · 2015-12-12T20:26:41Z

I'm also running into this. Consul's behavior seems to be what most people would like to see, myself included. Also, instead of giving an IP address, I would prefer to specify the network interface.

cbednarski · 2015-12-14T21:17:02Z

Based on the feedback here and feedback that we received in Consul we're considering the following:

- Support bind based on named interface (eth1).
- Support bind based on CIDR range (10.1.0.0/16). This allows you to get pretty granular if you have multiple interfaces or multiple IPs per interface.

Specifying the IP (as is currently supported) is pretty straightforward. If we do interface or CIDR we end up with a lot of messy edge cases.

- Some interfaces may have aliases, like eth1:0, on Linux.
- Some interfaces may have multiple IPs associated with them (e.g. IPv4 and IPv6).
  - Should we use the "first" IP that matches a specific interface or CIDR block instead of using all of them?
  - If so, how do we define "first"?
- IPv6 CIDR is getting into weird territory.
- Any other cross-platform considerations?

We end up using similar logic in at least 3 places so this is a good opportunity to factor it out:

- Binding Nomad APIs when the agent starts
- Detecting available networks during fingerprinting
- Placing tasks into specific networks

> I'd like to add another thing here, this should behave like consul does, so -bind=:: should work. (Bind to any ipv4 or ipv6 ip address available) as well as the commandline option -advertise=
> In our consul environment i set all nodes to -bind=:: and then -advertise=<the "real" ip of the node>

I'm not sure this is still supported in Consul 0.6.0. Also, since Nomad does not use serf across the entire cluster (only amongst the server nodes) we may not be able to do things exactly the same way that Consul does them.

jhartman86 · 2016-01-21T17:55:22Z

Bit by this issue as well, +1 for binding to named interface.

dadgar · 2016-03-22T04:22:23Z

Closing this as #941 lets you bind by interface name

sjwl · 2016-10-04T17:19:40Z

It appears #941 was pulled out in this commit? 079e55e

Is there a corresponding issue or explanation of why the feature was removed?


github-actions · 2022-12-19T02:12:04Z

I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues.
If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

ryanuber added the stage/thinking label Oct 1, 2015

apognu mentioned this issue Oct 6, 2015

Added subnet IP address parsing for advertise addresses (eg. 10.1.0.0… #223

Closed

cbednarski added the theme/networking label Oct 15, 2015

cbednarski added the type/enhancement label Dec 14, 2015

slackpad mentioned this issue Dec 14, 2015

Consul 0.6: dynamic bind address when running in a docker container? hashicorp/consul#1478

Closed

dadgar closed this as completed Mar 22, 2016

benbuzbee pushed a commit to benbuzbee/nomad that referenced this issue Jul 21, 2022

Merge pull request hashicorp#186 from hashicorp/restore-ifc

e1d3deb

Changes user restore API to blocking (since it was blocking internally).

github-actions bot locked as resolved and limited conversation to collaborators Dec 19, 2022
