
Restarting iptables kube-proxier causes connections to fail #75360

Open
PaulFurtado opened this issue Mar 14, 2019 · 11 comments · 5 participants

PaulFurtado commented Mar 14, 2019

What happened:
When restarting kube-proxy in iptables mode, there will be several seconds of timeouts because it flushes the KUBE-SERVICES nat chain and several others.

What you expected to happen:
Restarting kube-proxy should not impact traffic in any way.

How to reproduce it (as minimally and precisely as possible):

  1. Run an HTTP server behind a ClusterIP service
  2. Run an http client in a loop making requests to the HTTP server
  3. kill -15 the kube-proxy process on the client node
  4. Connections timeout for a few seconds until kube-proxy re-syncs the iptables rules
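The client loop in step 2 can be sketched as below (the service URL is an assumption; substitute your own ClusterIP address). It prints one ok/fail line per request, so the timeout window becomes visible when kube-proxy is killed on the node:

```shell
#!/usr/bin/env bash
# Probe a service URL a fixed number of times, printing ok/fail per request.
# A "fail" burst appears while kube-proxy re-syncs the flushed iptables rules.
probe_loop() {
  local url="$1" count="$2" i=0
  while [ "$i" -lt "$count" ]; do
    if curl -sS --max-time 2 -o /dev/null "$url" 2>/dev/null; then
      echo "$(date +%T) ok"
    else
      echo "$(date +%T) fail"
    fi
    i=$((i + 1))
  done
}

# Example (hypothetical ClusterIP): probe_loop "http://10.96.0.10:80/" 1000
```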

Anything else we need to know?:
Running with -v=6 makes it pretty clear what's happening:

I0314 05:35:28.808656       7 server_others.go:174] Tearing down inactive rules.
I0314 05:35:28.808673       7 iptables.go:419] running iptables -C [OUTPUT -t nat -m comment --comment handle ClusterIPs; NOTE: this must be before the NodePort rules -j KUBE-PORTALS-HOST]
I0314 05:35:28.812934       7 iptables.go:419] running iptables -C [PREROUTING -t nat -m comment --comment handle ClusterIPs; NOTE: this must be before the NodePort rules -j KUBE-PORTALS-CONTAINER]
I0314 05:35:28.816683       7 iptables.go:419] running iptables -C [OUTPUT -t nat -m addrtype --dst-type LOCAL -m comment --comment handle service NodePorts; NOTE: this must be the last rule in the chain -j KUBE-NODEPORT-HOST]
I0314 05:35:28.821715       7 iptables.go:419] running iptables -C [PREROUTING -t nat -m addrtype --dst-type LOCAL -m comment --comment handle service NodePorts; NOTE: this must be the last rule in the chain -j KUBE-NODEPORT-CONTAINER]
I0314 05:35:28.825080       7 iptables.go:419] running iptables -C [INPUT -t filter -m comment --comment Ensure that non-local NodePort traffic can flow -j KUBE-NODEPORT-NON-LOCAL]
I0314 05:35:28.826834       7 iptables.go:419] running iptables -F [KUBE-PORTALS-CONTAINER -t nat]
I0314 05:35:28.830456       7 iptables.go:419] running iptables -F [KUBE-PORTALS-HOST -t nat]
I0314 05:35:28.833272       7 iptables.go:419] running iptables -F [KUBE-NODEPORT-HOST -t nat]
I0314 05:35:28.836361       7 iptables.go:419] running iptables -F [KUBE-NODEPORT-CONTAINER -t nat]
I0314 05:35:28.839093       7 iptables.go:419] running iptables -F [KUBE-NODEPORT-NON-LOCAL -t filter]
I0314 05:35:28.840088       7 iptables.go:419] running iptables -C [OUTPUT -t nat -m comment --comment kubernetes service portals -j KUBE-SERVICES]
I0314 05:35:28.843330       7 iptables.go:419] running iptables -D [OUTPUT -t nat -m comment --comment kubernetes service portals -j KUBE-SERVICES]
I0314 05:35:28.847852       7 iptables.go:419] running iptables -C [PREROUTING -t nat -m comment --comment kubernetes service portals -j KUBE-SERVICES]
I0314 05:35:28.850976       7 iptables.go:419] running iptables -D [PREROUTING -t nat -m comment --comment kubernetes service portals -j KUBE-SERVICES]
I0314 05:35:28.855081       7 iptables.go:419] running iptables -C [POSTROUTING -t nat -m comment --comment kubernetes postrouting rules -j KUBE-POSTROUTING]
I0314 05:35:28.857992       7 iptables.go:419] running iptables -D [POSTROUTING -t nat -m comment --comment kubernetes postrouting rules -j KUBE-POSTROUTING]
I0314 05:35:28.862013       7 iptables.go:419] running iptables -F [KUBE-SERVICES -t nat]
I0314 05:35:28.865648       7 iptables.go:419] running iptables -X [KUBE-SERVICES -t nat]
I0314 05:35:28.869143       7 iptables.go:419] running iptables -F [KUBE-POSTROUTING -t nat]
I0314 05:35:28.872606       7 iptables.go:419] running iptables -X [KUBE-POSTROUTING -t nat]
I0314 05:35:29.159612       7 server.go:444] Version: v1.10.11

The log says it's tearing down inactive rules, but these chains are crucial to the iptables proxier. You can trivially reproduce the outage by just running:

iptables -F KUBE-SERVICES -t nat

Environment:

  • Kubernetes version (use kubectl version): 1.10.11
  • Cloud provider or hardware configuration: amazon
  • OS (e.g: cat /etc/os-release): custom
  • Kernel (e.g. uname -a): 4.14.77-hs623.el6.x86_64
  • Install tools: custom
  • Others: iptables 1.6.2

We're running kube-proxy 1.10.11, but I've confirmed this issue with kube-proxy 1.13.4 too.

PaulFurtado (Author) commented Mar 14, 2019

@kubernetes/sig-network-bugs

@k8s-ci-robot k8s-ci-robot added sig/network and removed needs-sig labels Mar 14, 2019

k8s-ci-robot (Contributor) commented Mar 14, 2019

@PaulFurtado: Reiterating the mentions to trigger a notification:
@kubernetes/sig-network-bugs

In response to this:

@kubernetes/sig-network-bugs

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

andrewsykim (Member) commented Mar 14, 2019

Possibly a bug in the proxy: it flushes iptables when the proxier is able to use IPVS, which seems like the wrong behavior to me given that the IPVS and iptables proxiers share some tables/chains. One workaround I can think of right now is to unload the ip_vs kernel module so that check is skipped.

andrewsykim (Member) commented Mar 14, 2019

/assign

PaulFurtado (Author) commented Mar 14, 2019

@andrewsykim oh, that's an interesting workaround, I'll give that a shot, thanks!

andrewsykim (Member) commented Mar 14, 2019

np, I'll try to work on the bug fix in the meantime!

andrewsykim (Member) commented Mar 14, 2019

/triage unresolved

PaulFurtado (Author) commented Mar 15, 2019

@andrewsykim note that unloading the ip_vs module is not actually enough of a workaround because kube-proxy will load the kernel module if it sees that it is available. If I unload the ip_vs module and then rename ip_vs.ko to something else in /lib/modules then it does the right thing.

Dug slightly further: it probes for modules by actually running modprobe. So the simplest hack that lets us keep ip_vs loaded for other things on the system is to put a modprobe script on kube-proxy's PATH that always exits 1 for the ip_vs modules. (We can stop mounting the modules dir into the kube-proxy container in our Kubernetes clusters, but we also run kube-proxy on non-Kubernetes nodes via the init system, so the PATH hack works well enough there.) This should hold us over fine until we get to a version with your fix in it.
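A minimal sketch of that PATH shim (an assumption, not the author's exact script; the install path and the real modprobe location /sbin/modprobe are placeholders): a fake modprobe that refuses the ip_vs modules and delegates everything else to the real binary, placed ahead of /sbin on kube-proxy's PATH so its IPVS availability probe fails and the iptables proxier is kept.

```shell
#!/usr/bin/env bash
# Write the shim to a temp file for illustration; deploy it as e.g.
# /usr/local/sbin/modprobe (hypothetical path) for real use.
shim="$(mktemp)"
cat > "$shim" <<'EOF'
#!/bin/sh
# Refuse any ip_vs* module so kube-proxy believes IPVS is unavailable;
# pass every other module request through to the real modprobe.
for arg in "$@"; do
  case "$arg" in
    ip_vs*) exit 1 ;;
  esac
done
exec /sbin/modprobe "$@"
EOF
chmod +x "$shim"
```

With the shim first on PATH, kube-proxy's `modprobe ip_vs` probe exits 1 and it falls back to the iptables proxier, while other users of modprobe are unaffected.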

andrewsykim (Member) commented Mar 15, 2019

Good to know, thanks for sharing!

andrewsykim (Member) commented Mar 21, 2019

/assign @vllry

andrewsykim (Member) commented Mar 21, 2019

Quick update on this issue from today's SIG Network call: we're going to try to get rid of the automatic proxy clean up altogether for v1.14.1 since this is considered a bug. @vllry is working on the KEP & implementation.
