-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bug: kubeproxy was trying to acces K8s apiserver through LB URL after removing the LB #1226
Comments
@JKBGIT1 could you pls check this and we can talk about this again during the next grooming. |
The problem isn't completely mirrored without steps 2. and 5. (stop and start of
As you can see kube-proxy still uses the LoadBalancer's URL even though the LB was removed.
This problem described above in this comment happens in both. |
My findings on this issue.
To fix this manually, I had to update the apiEndpoint in the config map, and rollout restart kube-proxy. |
Current Behaviour
The InputManifest contained some control-plane nodes and two static nodes. One static node was used as a worker node in the K8s cluster, and another one was used as a LoadBalancer. The worker node couldn't start any pod because the
systemd-resolved
service wasn't running there.We removed the LoadBalancer from the InputManifest. Then we started the
systemd-resolved
service on the worker node. After the start, the first pods on the worker node were created (kube-proxy and node-local), however, the Cilium agent on that worker node kept on restarting, because it couldn't connect to the Kubernetes apiserver. These logs were in the kube-proxy pod of that worker node.To me, it looks like the kube-proxy still tried to connect to the Kubernetes apiserver through the LoadBalancer URL, even though the LoadBalancer was removed.
Expected Behaviour
A healed worker node should recognize that the LoadBalancer isn't in use anymore, and most importantly it should be able to connect to the Kubernetes apiserver.
Steps To Reproduce
systemd-resolved
service isn't running on the static worker node.systemd-resolved
service on the static worker node.The text was updated successfully, but these errors were encountered: