Kubernetes 1.11 cluster node lost its InternalIP #3503
Comments
This is from a v1.11.0 cluster with 3 masters and 2 agent pools, each of which has 3 nodes. |
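For anyone trying to confirm the same symptom, here is a minimal sketch that lists nodes with no `InternalIP` entry under `status.addresses`. It assumes the input is the parsed JSON from `kubectl get nodes -o json`; the function name `nodes_missing_internal_ip` is made up for illustration and is not part of any tool.

```python
def nodes_missing_internal_ip(node_list):
    """Return the names of nodes that report no InternalIP address.

    `node_list` is assumed to be the dict parsed from
    `kubectl get nodes -o json` (a NodeList with an "items" array).
    """
    missing = []
    for node in node_list.get("items", []):
        # status.addresses may be absent or null on an unhealthy node
        addresses = node.get("status", {}).get("addresses") or []
        has_internal = any(
            addr.get("type") == "InternalIP" and addr.get("address")
            for addr in addresses
        )
        if not has_internal:
            missing.append(node["metadata"]["name"])
    return missing
```

One way to use it is to save the `kubectl` output to a file, load it with `json.load`, and pass the result to the function. Running it periodically might help show when an address disappears, though it says nothing about why.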
This seems similar to #2388. I don't think it's related to Azure-CNI; I'm seeing the same with cilium as the CNI plugin. Another thing I'm seeing is that the kubelet of the node that does have its
It normally repeats every ~10s or so. Meanwhile cilium is giving me a log message
From their code this happens when the It looks to me like the cloud provider is, at some point, failing to find and correctly report the host InternalIP. An oddity is that the cilium messages about receiving bad updates start about 80s earlier than the last update. Maybe both are reacting to something else? |
We probably experience a similar issue with missing
It worked a week ago, so the addresses have been lost on the way. |
@feiskyer @andyzhangx Just curious: is this a symptom we're tracking in some fashion? Have you heard about it? Thanks! |
Also @weinong notes that these symptoms have been periodically observed in AKS. We will look for patterns. |
No, I hadn't noticed this issue before. Let me check what might be wrong. |
@stieler-it After messing around with OpenStack I think I found a workaround. If you have the configdrive feature enabled in your OpenStack cluster, you can provision the nodes with it enabled and set up the cloud-config to search using configdrive only. This bypasses the metadata API lookup, which is what is failing. I am still letting my cluster run for a few days, but so far the internal IP hasn't gone missing. |
The fix is being cherry-picked to the 1.11 release in kubernetes/kubernetes#70400. It is still pending. |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contribution. Note that acs-engine is deprecated--see https://github.com/Azure/aks-engine instead. |
See: