Weave randomly picks ipv6 and everything breaks #45858

Closed
hollowimage opened this issue May 15, 2017 · 10 comments

@hollowimage
hollowimage commented May 15, 2017

Is this a request for help? (If yes, you should use our troubleshooting guide and community support channels, see http://kubernetes.io/docs/troubleshooting/.): no

What keywords did you search in Kubernetes issues before filing this one? (If you have found any duplicates, you should instead reply there.):


Is this a BUG REPORT or FEATURE REQUEST? (choose one): bug report

Kubernetes version (use kubectl version): 1.6.2 and 1.6.3

Environment:

  • Cloud provider or hardware configuration: AWS
  • OS (e.g. from /etc/os-release): debian jessie
  • Kernel (e.g. uname -a): n/a
  • Install tools: kops
  • Others: weave for CNI

What happened:
This morning, after our cluster scaled its nodes back up, kube-dns failed to start. After endless troubleshooting the issue went away on its own, but I did notice one thing: I believe it is related to the weave interface inside the pod picking up only an IPv6 address, or really to the lack of an IPv4 address:

weave     Link encap:Ethernet  HWaddr ca:aa:32:95:7d:4b
          inet6 addr: fe80::c8aa:32ff:fe95:7d4b/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1376  Metric:1
          RX packets:7 errors:0 dropped:0 overruns:0 frame:0
          TX packets:8 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:460 (460.0 B)  TX bytes:648 (648.0 B)

kube-dns also showed up in the output of kubectl get pods with the IP of the node instead of an address in the in-cluster 10.x range.

After it "fixed itself", the weave interface looked like this:

weave     Link encap:Ethernet  HWaddr 76:e6:c0:5a:2e:c5
          inet addr:10.36.0.0  Bcast:0.0.0.0  Mask:255.240.0.0
          inet6 addr: fe80::74e6:c0ff:fe5a:2ec5/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1376  Metric:1
          RX packets:775 errors:0 dropped:0 overruns:0 frame:0
          TX packets:9 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:22188 (21.6 KiB)  TX bytes:690 (690.0 B)
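
The difference between the two listings is the presence of an `inet addr` line. A minimal sketch of how that could be checked automatically, assuming the legacy ifconfig text format shown above (the function names `addresses` and `missing_ipv4` are hypothetical, and the samples are trimmed copies of the two listings):

```python
import ipaddress
import re
from typing import Dict, List


def addresses(ifconfig_text: str) -> Dict[str, List[str]]:
    """Extract IPv4 and IPv6 addresses from legacy ifconfig-style output."""
    return {
        # "inet addr:10.36.0.0" (IPv4) versus "inet6 addr: fe80::.../64" (IPv6)
        "ipv4": re.findall(r"\binet addr:\s*(\S+)", ifconfig_text),
        "ipv6": re.findall(r"\binet6 addr:\s*(\S+)", ifconfig_text),
    }


def missing_ipv4(ifconfig_text: str) -> bool:
    """True when the interface has no IPv4 address and only link-local IPv6."""
    found = addresses(ifconfig_text)
    only_link_local = all(
        ipaddress.ip_interface(addr).ip.is_link_local for addr in found["ipv6"]
    )
    return not found["ipv4"] and only_link_local


# Trimmed copies of the two listings from this report.
BROKEN = """weave     Link encap:Ethernet  HWaddr ca:aa:32:95:7d:4b
          inet6 addr: fe80::c8aa:32ff:fe95:7d4b/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1376  Metric:1
"""

FIXED = """weave     Link encap:Ethernet  HWaddr 76:e6:c0:5a:2e:c5
          inet addr:10.36.0.0  Bcast:0.0.0.0  Mask:255.240.0.0
          inet6 addr: fe80::74e6:c0ff:fe5a:2ec5/64 Scope:Link
"""

if __name__ == "__main__":
    print("broken listing lacks IPv4:", missing_ipv4(BROKEN))
    print("fixed listing lacks IPv4:", missing_ipv4(FIXED))
```

This only inspects text that has already been captured; it says nothing about why the IPv4 address was not assigned.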

What you expected to happen: IPv6 not being used, or a check being added to the CNI networking? I am not sure how best to handle it.
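
One possible shape for such a check, as a rough sketch only: poll the interface until it has an IPv4 address or a timeout expires. This is not how Weave Net, the kubelet, or any CNI plugin actually behaves; the names `ipv4_of` and `wait_for_ipv4` are hypothetical, and the probe assumes the iproute2 `ip` command is installed on the host.

```python
import re
import subprocess
import time
from typing import Callable, Optional


def ipv4_of(interface: str) -> Optional[str]:
    """Return the first IPv4 address of `interface` in CIDR form, or None.

    Assumes the iproute2 `ip` binary is on PATH; raises FileNotFoundError otherwise.
    """
    result = subprocess.run(
        ["ip", "-4", "-o", "addr", "show", "dev", interface],
        capture_output=True,
        text=True,
        check=False,  # a missing interface just yields empty stdout
    )
    match = re.search(r"\binet (\S+)", result.stdout)
    return match.group(1) if match else None


def wait_for_ipv4(
    interface: str,
    timeout: float = 60.0,
    interval: float = 1.0,
    probe: Callable[[str], Optional[str]] = ipv4_of,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> Optional[str]:
    """Poll until `interface` has an IPv4 address; return it, or None on timeout."""
    deadline = clock() + timeout
    while True:
        address = probe(interface)
        if address is not None:
            return address
        if clock() >= deadline:
            return None
        sleep(interval)


if __name__ == "__main__":
    # "weave" is the interface name from this report.
    print(wait_for_ipv4("weave", timeout=10.0))
```

The `probe`, `sleep`, and `clock` parameters exist only so the loop can be exercised without a real interface.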

How to reproduce it (as minimally and precisely as possible):
N/A. I do not know; it may be a race condition. It went away as mysteriously as it appeared, and nothing I did over the course of the whole day changed it.

Anything else we need to know:
During the course of the day, the cluster was "reset" (i.e. I terminated all instances) more than 10 times.

The issue manifests as kube-dns being unable to start and nodes bouncing up/down due to the PLEG events throwing a negative.

@cmluciano

@hollowimage Is there a bug open on the Weave repository as well?

cc @bboreham

@cmluciano

/area network

@k8s-ci-robot
Contributor

@cmluciano: These labels do not exist in this repository: area/network.

@cmluciano

/label sig/network

cmluciano added the sig/network label (Categorizes an issue or PR as relevant to SIG Network.) on May 16, 2017
@hollowimage
Author

@cmluciano I don't know, but I pinged the guys in their Slack about this issue. Honestly, I just wanted to make a record of it somewhere before it got lost.

@cmluciano

I think we should open the issue on the Weave repository unless this affects more than one CNI plugin.

@bboreham
Contributor

Agreed, you should open this issue on the Weave repo.

Looking at the symptoms, that ipv6 address is just a link-local one automagically generated; the issue is that the Weave Net startup has not yet assigned an IPv4 address.
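
To illustrate why that address carries no information about Weave's allocation: both `inet6` addresses in the report are consistent with the modified EUI-64 scheme, where the link-local address is derived purely from the interface's MAC. A small sketch, assuming the kernel used that scheme (some systems are configured to generate link-local addresses differently); `link_local_from_mac` is a hypothetical helper name:

```python
import ipaddress


def link_local_from_mac(mac: str) -> ipaddress.IPv6Address:
    """Derive the fe80::/64 link-local address from a MAC via modified EUI-64."""
    octets = bytearray(int(part, 16) for part in mac.split(":"))
    if len(octets) != 6:
        raise ValueError(f"expected a 6-octet MAC address, got {mac!r}")
    octets[0] ^= 0x02  # flip the universal/local bit of the first octet
    # Insert ff:fe in the middle of the MAC to form the 64-bit interface identifier.
    interface_id = bytes(octets[:3]) + b"\xff\xfe" + bytes(octets[3:])
    # fe80::/64 prefix: fe80 followed by 48 zero bits.
    return ipaddress.IPv6Address(b"\xfe\x80" + b"\x00" * 6 + interface_id)


if __name__ == "__main__":
    # MAC addresses taken from the two ifconfig listings in this issue.
    print(link_local_from_mac("ca:aa:32:95:7d:4b"))  # fe80::c8aa:32ff:fe95:7d4b
    print(link_local_from_mac("76:e6:c0:5a:2e:c5"))  # fe80::74e6:c0ff:fe5a:2ec5
```

Both results match the `inet6 addr` values in the listings, which fits the reading that the address appears as soon as the interface comes up, independent of whether an IPv4 address has been assigned yet.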

The reason for that may be in the (Docker) logs of the weave container.

@cmluciano

Thanks @bboreham. @hollowimage, please open this issue on the Weave repository.

@cmluciano

/assign

@cmluciano

/close
