
NodePort on new node doesn't work without reboot #2737

Closed
cchanley2003 opened this issue Jul 20, 2019 · 7 comments
cchanley2003 commented Jul 20, 2019

Running Kubernetes 1.12.3 and Calico 3.8.0 (Typha manifest). NodePorts don't seem to work when a new node joins the cluster. When I curl an exposed node port that fronts a simple HTTP server, the call hangs. If I reboot that node, the node port works as expected. This is running on Red Hat 7.6. The same steps on a cluster using Weave's CNI don't show this behavior.

Expected Behavior

Expected behavior is that when a new node is marked Ready then NodePort services would work without having to reboot the machine.

Current Behavior

When a new node joins the cluster, the node port doesn't seem to forward the request. The port is listening, but calls to it hang indefinitely. Rebooting the node fixes the erroneous behavior. Internal (pod-to-pod) cluster communication appears to work fine. Because everything works if I stand up an otherwise identical cluster with a different CNI, I don't believe kube-proxy is at fault. I confirmed that iptables is empty (other than the standard Docker entries) before a node joins.
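The pre-join check is just the stock iptables tooling, something along these lines (output omitted):

  iptables -S FORWARD    # shows the chain's rules and default policy
  iptables-save -c       # full dump with counters; only the standard DOCKER chains are present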

Possible Solution

None at this time

Steps to Reproduce (for bugs)

Any additional debugging steps would be appreciated.

Right now my steps are:

  1. Stand up a Kubernetes HA cluster with version 1.12.3 using kubeadm
  2. Install https://docs.projectcalico.org/v3.8/manifests/calico-typha.yaml
  3. Deploy a simple web server (httpd for instance) -- deployment + node port service (see the sketch after this list)
  4. Add new nodes to the cluster
  5. Node port calls hang when directed at any node that hasn't been rebooted since joining (including master nodes)
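A minimal imperative version of step 3 would be something like the following (image and names are placeholders):

  kubectl create deployment httpd --image=httpd:2.4
  kubectl expose deployment httpd --type=NodePort --port=80
  kubectl get svc httpd    # note the allocated node port, then curl <node-ip>:<node-port>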

Your Environment

  • Calico version: 3.8.0 (calico-typha manifest, Kubernetes API datastore)
  • Orchestrator version (e.g. kubernetes, mesos, rkt): Kubernetes 1.12.3
  • Operating System and version: Red Hat 7.6
tmjd commented Jul 22, 2019

I've never heard of a problem like this before, so I don't know of anything to check out immediately.
If you could collect the output of iptables-save -c, that may be helpful; please do that before you've rebooted a new node but after you've generated some traffic that reproduces the problem. (The -c includes packet counts, so we can see if iptables is dropping traffic.)

Is NetworkManager running on your nodes? I know there have been problems before with it but none that I remember with this type of behavior.

Please check the kubelet (for CNI errors) and calico-node logs before rebooting a new node to see if there are any problems setting up the networking for pods (or anything odd).

It may also be useful to check calicoctl node status (this must be run on the node with the problem) to see that all BGP sessions are able to be set up, though I would expect problems there to show up in the readiness of calico-node.
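Concretely, something like the following on the newly joined (not yet rebooted) node; the output file name and the calico-node pod name are placeholders:

  iptables-save -c > iptables-save-c.txt                  # rules with packet/byte counters
  calicoctl node status                                   # BGP session state (run as root on the node)
  journalctl -u kubelet --no-pager | grep -i cni          # kubelet CNI-related messages
  kubectl -n kube-system logs <calico-node-pod-on-node>   # calico-node logs for that node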

cchanley2003 commented Jul 23, 2019

I'll work on getting the iptables output. Unfortunately this cluster is behind an air gap. The behavior seems similar to #875. We are running Docker 18.06.

The suggested workaround from that issue works here as well: if I run iptables -P FORWARD ACCEPT, the problem disappears on the affected nodes, so this fixes the issue without a reboot.
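For reference, the check and workaround on an affected node (a sketch; the policy flip is the only step that matters):

  iptables -S FORWARD | head -n 1   # likely shows "-P FORWARD DROP" (Docker sets that by default) while the hang occurs
  iptables -P FORWARD ACCEPT        # temporary workaround; node port traffic starts flowing without a reboot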

It looks like the issue was believed to be fixed by kubernetes/kubernetes#40182, which should be in k8s 1.12.3.

To answer your questions:

  1. Is NetworkManager running -- No
  2. Kubelet and calico-node logs look clean
  3. Node status indicates mesh connectivity to all other nodes in the cluster

I'll work on providing iptables output and diffs.

@cchanley2003

So I was able to reproduce the problem with k8s 1.15.0. Here is the iptables output from when the node port was being blocked. In this case we have a Nexus server behind a node port of 30100, and when this new node joined, curl hung until iptables -P FORWARD ACCEPT was run; after that the node works as expected. The output was collected with -c.
iptables_np.txt

@caseydavenport

@cchanley2003 are you passing --cluster-cidr to your kube-proxy? I believe that should trigger a rule which allows pod traffic and might be related (based on only a quick skim).
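One quick way to check (a sketch; this is the configmap kubeadm creates by default):

  kubectl -n kube-system get configmap kube-proxy -o yaml | grep -i clusterCIDR
  # an empty value (clusterCIDR: "") means kube-proxy was not given the pod CIDR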


cchanley2003 commented Jul 31, 2019

I am not passing cluster-cidr to my kube-proxy. I'll look at the kubeadm setup and see what I need to do to have it added to the default kubeadm installation. Just to be clear, cluster-cidr should match the Calico CIDR range, correct?


tmjd commented Aug 1, 2019

Yes, that is correct.
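With kubeadm that value comes from the pod network CIDR given at init time, for example (a sketch, assuming the default Calico pod CIDR of 192.168.0.0/16):

  # fresh cluster: kubeadm propagates this into kube-proxy's clusterCIDR
  kubeadm init --pod-network-cidr=192.168.0.0/16

  # existing cluster (sketch, not an official procedure): set it in the kube-proxy
  # configmap and recreate the kube-proxy pods so they pick it up
  kubectl -n kube-system edit configmap kube-proxy          # set clusterCIDR: "192.168.0.0/16"
  kubectl -n kube-system delete pod -l k8s-app=kube-proxy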

@cchanley2003

Going to close this issue. I believe the problem was related to not handing kubeadm the cluster-cidr range. Thanks for the help.
