Skip to content
This repository has been archived by the owner on Aug 2, 2023. It is now read-only.

no resilience on network issues #7

Closed
FaKod opened this issue May 31, 2017 · 5 comments · Fixed by #27
Closed

no resilience on network issues #7

FaKod opened this issue May 31, 2017 · 5 comments · Fixed by #27

Comments

@FaKod
Copy link

FaKod commented May 31, 2017

UseCase:
Running an Exoip Pod in a Kubernetes Cluster. All exoip Container default priority.

exoip removes peer Nics if it fails to connect the node. If the node that could not be connected (in case of a network issue or ACL misconfiguration) is still alive, no one re-adds the Nic again.

@FaKod
Copy link
Author

FaKod commented Jun 1, 2017

We are currently testing this simple solution: https://github.com/FaKod/exoip/blob/cs_wip_make_it_work/src/exoip/state.go#L20

It calles ObtainNic every 30s (configurable)

@FaKod
Copy link
Author

FaKod commented Jun 20, 2017

OK, since this runs every 30 seconds on 7 Nodes and 4 Clusters, this leads to some traffic on the Exoscale API.

Thats why I'm testing this now: https://github.com/FaKod/exoip-k8s

It uses Kubernetes Simple Leader Election... this may be a solution.

@marcaurele
Copy link

That's still not acceptable for the API calls. You should get the VM's nic and only add the EIP if it's missing.

@greut
Copy link
Contributor

greut commented Apr 9, 2018

digging old issues here.

Having a master node unsubscribing other ones seems okay. If they cannot communicate directly, you've got another issue. The addIpToNic in a while loop feels wrong.

@greut
Copy link
Contributor

greut commented Apr 11, 2018

We've been experiencing this kind of behaviour that looks alike:

  • two nodes are connected
  • they somehow disconnect each other from the eip
  • two nodes are present, ping each other, but are disconnected from the eip
  • 😿

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants