
[WIP] add support for keepalived #68

Merged (24 commits) on Aug 12, 2015

Conversation

Kosta-Github
Contributor

This is work-in-progress (but actually seems to work so far for me).

I would like to use keepalived for this functionality (from the docs):

... high-availability is achieved by VRRP protocol.
VRRP is a fundamental brick for router failover.

This allows you to specify a virtual IP and bind it to one of the nodes running HAProxy in the cluster. If that node becomes unreachable, the virtual IP automatically fails over to another node in the cluster. This is probably similar to AWS's Elastic IP, but I am not familiar with that, since I cannot use AWS for various reasons.
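For illustration, a minimal keepalived sketch of that idea; the interface name, router id, priority and virtual IP below are made-up placeholders, not values from this PR:

# /etc/keepalived/keepalived.conf (sketch with placeholder values)
vrrp_instance VI_1 {
    state BACKUP              # let VRRP elect the master based on priority
    interface eth0            # interface that will carry the virtual IP
    virtual_router_id 51      # must be identical on all peers
    priority 100              # highest reachable priority wins the election
    advert_int 1              # VRRP advertisement interval in seconds
    virtual_ipaddress {
        10.0.0.100            # the virtual IP that fails over between nodes
    }
}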

The question is: would you be interested in integrating this functionality into your technology stack? If so, I would add something to the README.md as well and do some more testing.

This functionality paired with my last PR #59 provides you with a nice highly available load balancer mechanism.

@sielaq
Contributor

sielaq commented Jul 2, 2015

Hi @Kosta-Github ,

Why do you need a VIP and keepalived?
What is the problem you are trying to solve?
Is <your_service>.service.consul not HA enough?

@Kosta-Github
Contributor Author

From within the cluster the consul service discovery mechanism works fine. But once we start directing traffic from outside the cluster into it, we cannot use the consul mechanism anymore.

Since our infrastructure doesn't support elastic load balancing or elastic IPs, I set up a virtual IP, and all external DNS queries such as service_1.my_product.my_company.com, service_2.my_product.my_company.com, ... resolve to this virtual IP. The virtual IP in turn is tied to one of the cluster nodes via keepalived, so that node receives the outside traffic and the HAProxy running on it does the load balancing. As soon as that node goes down or becomes unreachable, keepalived automatically fails over to one of the other cluster nodes.

This way my PR #59 still allows the different services to be accessed from the outside by their corresponding service names (service_1, service_2, ...).

@sielaq
Contributor

sielaq commented Jul 7, 2015

We use hardware for SSL termination, so we can plug HAProxy etc. in behind it.
You can simulate this by setting up a front-end Apache / nginx
and then using the ProxyPass mechanism:

ProxyPreserveHost Off
ProxyPass /your_endpoint  http://your_service.service.consul connectiontimeout=3 timeout=10 retry=2
ProxyPassReverse /your_endpoint  http://your_service.service.consul

Moreover, you can run a consul agent on your Apache boxes and generate the Apache configuration more dynamically, and use mod_proxy_balancer.
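For example, a consul-template sketch along those lines; the service name your_service and the endpoint path are placeholders, and mod_proxy, mod_proxy_http, mod_proxy_balancer plus an lbmethod module are assumed to be loaded:

# apache-proxy.conf.ctmpl (sketch): one BalancerMember per healthy instance of the service
<Proxy balancer://your_service>
{{range service "your_service"}}    BalancerMember http://{{.Address}}:{{.Port}} retry=2
{{end}}</Proxy>

ProxyPreserveHost Off
ProxyPass        /your_endpoint balancer://your_service/
ProxyPassReverse /your_endpoint balancer://your_service/

A consul-template watch on that service would rewrite this file and reload Apache whenever instances come and go.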

@Kosta-Github
Contributor Author

I went for this kind of setup: https://thejimmahknows.com/high-availability-using-haproxy-and-keepalived/

This has been working pretty nicely for us for the past week.

I don't want to add another load balancer in front of that, since that would become a single point of failure again.
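The core of the setup described in that guide is roughly the following sketch (placeholder values again): keepalived tracks a local HAProxy check and lowers the node's priority when HAProxy dies, so the virtual IP moves to a node with a working HAProxy:

vrrp_script check_haproxy {
    script "pidof haproxy"    # non-zero exit code marks HAProxy as down on this node
    interval 2                # run the check every 2 seconds
    weight -20                # drop this node's priority so the VIP moves elsewhere
}

vrrp_instance VI_1 {
    # ... same interface / router id / virtual_ipaddress block as in the sketch above ...
    track_script {
        check_haproxy
    }
}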

@bogus-py
Contributor

bogus-py commented Jul 8, 2015

I would not recommend using the same HAProxy for both internal and public-facing services. Once you make your HAProxy accessible from the outside, anybody can access any of your services by manipulating the Host header.
For this to succeed the attacker needs some knowledge of how your internal services are named, but that is nothing more than security by obscurity.

I would do as @sielaq suggested: run separate HAProxys (or nginx, varnish, Apache httpd, whatever) that only give access to your public-facing services, and use keepalived for HA.

@Kosta-Github
Contributor Author

Ok, sorry, I wasn't clear enough: by "outside of the cluster" I still mean from inside the company's intranet. For traffic from the internet there are additional systems in place, doing auth, SSL termination, ... But those systems should not be tightly coupled to the cluster implementation...

@bogus-py
Contributor

bogus-py commented Jul 8, 2015

Got it.

Here's another idea on this that I've been playing with:
The consul domain (consul.) is configurable. What if we set it to something like consul.intern.mycompany.com. and configure our DNS servers for intern.mycompany.com to forward the consul queries to consul accordingly? This way we have proper DNS inside our intranet, and with the right dns-search setting for Docker within the cluster everything is still resolvable as usual.
Any thoughts?
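For illustration, the consul side of that idea is just the agent's domain setting; the value below is the hypothetical domain from the comment:

# consul agent config fragment (sketch)
{
  "domain": "consul.intern.mycompany.com."
}

Services would then resolve as e.g. your_service.service.consul.intern.mycompany.com instead of your_service.service.consul.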

@Kosta-Github
Contributor Author

The problem for me is that I cannot change the company-wide DNS settings in that way.

I already had a fight with DevOps to allow mapping all DNS queries *.my_product.my_company.com to my_product.my_company.com. They came up with a solution that allows me to define at most 20 names <name_01>.my_product.my_company.com ... <name_20>.my_product.my_company.com that will be mapped to my_product.my_company.com; no wildcards possible. And adding/changing those names needs to propagate through the DNS settings, which right now takes >15 minutes...

And again, this is for the company intranet, not for internet accessibility.

@bogus-py
Contributor

bogus-py commented Jul 8, 2015

Got it. In my case I'm the DevOps dude :-)

What I had in mind wasn't wildcard A-records but DNS delegation (configure the DNS servers to use the consul DNS interface to resolve all *.consul.intern.mycompany.com). But this of course requires cooperation from your DNS admin.
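As a sketch of that delegation with dnsmasq (assuming the resolvers can forward a subdomain to consul's DNS interface on its default port 8600):

# dnsmasq.conf (sketch): forward the consul subdomain to the local consul agent's DNS port
server=/consul.intern.mycompany.com/127.0.0.1#8600

With BIND the equivalent would be a forward zone for consul.intern.mycompany.com pointing at the consul agents.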

{{env "KEEPALIVED_VIP"}} # the virtual IP
}
unicast_peer { # IP addresses of all other peer nodes
{{range nodes}}{{$n := .}}{{if ne $n.Address $node.Address}}{{$n.Address}}
Contributor


I would do:

{{range service "consul"}}{{$n := .}}{{if ne $n.Address $node.Address}}{{$n.Address}}

{{nodes}} also contains the slaves; are you running keepalived on every consul host?
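For reference, a sketch of how the suggested change might look in the full unicast_peer block, assuming $node is assigned to the local node (with an Address field) earlier in the PR's template:

unicast_peer { # peers limited to the nodes running the consul server service
    {{range service "consul"}}{{$n := .}}{{if ne $n.Address $node.Address}}{{$n.Address}}
    {{end}}{{end}}
}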

Contributor Author


good catch; I will change that...

in order to limit peer list to the nodes running a `consul agent`
sielaq added a commit that referenced this pull request Aug 12, 2015
@sielaq sielaq merged commit 0e08d7c into eBayClassifiedsGroup:master Aug 12, 2015
@Kosta-Github
Contributor Author

cool; thanks for merging!

@Kosta-Github Kosta-Github deleted the Kosta/keepalived branch August 13, 2015 15:22