
In kubeproxy ipvs mode UDP traffic to Loadbalancer IP fails after node reboot #105192

Closed
VivekThrivikraman-est opened this issue Sep 22, 2021 · 7 comments · Fixed by #105249
Labels: area/ipvs, area/kube-proxy, kind/bug, sig/network, triage/accepted

Comments

@VivekThrivikraman-est

VivekThrivikraman-est commented Sep 22, 2021

What happened:

In IPVS mode, client UDP traffic to a pod behind a LoadBalancer IP is blackholed after the node hosting the pod is rebooted.
The issue seems similar to the iptables issue below: the client keeps trying to reach the LoadBalancer IP (even before kube-proxy has applied all its rules), and we see that stale conntrack entries remain even after the rules are applied. Once the stale conntrack entries are cleaned up manually, the traffic starts flowing:
https://github.com/kubernetes/kubernetes/pull/104151/files

From the code below, it looks like the IPVS proxier currently clears stale conntrack entries for ExternalIPs but not for the LoadBalancer IP(?):
https://github.com/kubernetes/kubernetes/blob/master/pkg/proxy/ipvs/proxier.go#:~:text=svcInfo.ClusterIP().String())-,for%20_%2C%20extIP%20%3A%3D%20range%20svcInfo.ExternalIPStrings()%20%7B,-staleServices.Insert(extIP
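
For illustration, here is a minimal sketch of the kind of change that seems to be missing, modeled on the linked proxier.go snippet. It assumes a LoadBalancerIPStrings() accessor on svcInfo mirroring ExternalIPStrings(); this is a sketch of the idea, not necessarily the actual patch:

    // pkg/proxy/ipvs/proxier.go (sketch): collect the IPs of stale UDP
    // services so their conntrack entries get flushed after a sync.
    if svcInfo, ok := proxier.serviceMap[svcPortName]; ok && svcInfo != nil &&
        conntrack.IsClearConntrackNeeded(svcInfo.Protocol()) {
        staleServices.Insert(svcInfo.ClusterIP().String())
        for _, extIP := range svcInfo.ExternalIPStrings() {
            staleServices.Insert(extIP)
        }
        // Missing piece: also treat the LoadBalancer IPs as stale, assuming
        // a LoadBalancerIPStrings() helper alongside ExternalIPStrings():
        for _, lbIP := range svcInfo.LoadBalancerIPStrings() {
            staleServices.Insert(lbIP)
        }
    }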

What you expected to happen:

UDP traffic between the client and the pod (behind the LoadBalancer IP) should resume after the node reboot.

How to reproduce it (as minimally and precisely as possible):

  1. Start a pod acting as a UDP server and expose it through a LoadBalancer service.
  2. Have a UDP client continuously send traffic to the LoadBalancer IP.
  3. Restart the node hosting the pod.
  4. Even after the new UDP server pod is up, the client's UDP traffic is blackholed (until the stale conntrack entries are cleared). A minimal setup is sketched below.
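
For reference, a hypothetical minimal setup along these lines (names, port, and image are illustrative; assumes kube-proxy in IPVS mode and a working LoadBalancer implementation):

    # UDP echo server pod and a LoadBalancer service in front of it
    kubectl run udp-server --image=alpine/socat --port=5003 -- UDP-RECVFROM:5003,fork EXEC:cat
    kubectl expose pod udp-server --type=LoadBalancer --protocol=UDP --port=5003

    # From a client outside the cluster, send roughly one datagram per
    # second to the assigned LoadBalancer IP (LB_IP is a placeholder):
    while true; do echo ping | socat - UDP-SENDTO:"$LB_IP":5003; sleep 1; done

    # Reboot the node hosting udp-server; after it returns, the client's
    # packets are blackholed until the stale conntrack entries are removed.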

Anything else we need to know?:

Environment:

  • Kubernetes version (use kubectl version): 1.21.1
  • Cloud provider or hardware configuration:
    Manufacturer: Dell Inc.
    Product Name: PowerEdge R640
    Version: Not Specified
  • OS (e.g: cat /etc/os-release): SUSE Linux Enterprise Server 15 SP2
  • Kernel (e.g. uname -a): Linux pool16-n108-wk16-n080 5.3.18-24.75.3.22886.0.PTF.1187468-default #1 SMP Thu Sep 9 23:24:48 UTC 2021 (37ce29d) x86_64 x86_64 x86_64 GNU/Linux
  • Install tools:
  • Network plugin and version (if this is a network-related bug): Calico v3.19.1-12-baa55cf9
  • Others:
@VivekThrivikraman-est added the kind/bug label Sep 22, 2021
@k8s-ci-robot added the needs-sig and needs-triage labels Sep 22, 2021
@VivekThrivikraman-est
Author

/sig network

@k8s-ci-robot added the sig/network label and removed the needs-sig label Sep 22, 2021
@VivekThrivikraman-est
Author

@uablrek @aojea

@uablrek
Contributor

uablrek commented Sep 22, 2021

/assign

I'll try to reproduce this using ctraffic, but it may take some days.

@aojea
Member

aojea commented Sep 22, 2021

Yeah, the IPVS kube-proxy doesn't have some of the latest "stale conntrack" fixes like #104151, mainly because I'm not very familiar with IPVS and I don't know whether all of them are needed or only some of them.

@khenidak
Contributor

/triage accepted

@aojea is it a matter of inserting externalIPs in conntrack?

@k8s-ci-robot added the triage/accepted label and removed the needs-triage label Sep 24, 2021
@uablrek
Contributor

uablrek commented Sep 25, 2021

@khenidak No. When a node is rebooted and UDP messages to a loadBalancerIP arrive before kube-proxy has started, invalid (UNREPLIED) conntrack entries are created. They black-hole traffic, and since traffic keeps coming, they never time out.

This is not easy to reproduce. I found it best to direct incoming traffic to one node (2) and have one endpoint pod on another node (4), then reboot the node where traffic enters (leaving the pod alive) while traffic is sent continuously. After the reboot, check the conntrack table on the rebooted node.

# conntrack -p udp -L
udp      17 29 src=1000::1:c0a8:1c9 dst=1000:: sport=52245 dport=5003 [UNREPLIED] src=1000:: dst=1000::1:c0a8:1c9 sport=5003 dport=52245 mark=0 use=1
udp      17 29 src=1000::1:c0a8:1c9 dst=1000:: sport=60276 dport=5003 [UNREPLIED] src=1000:: dst=1000::1:c0a8:1c9 sport=5003 dport=60276 mark=0 use=1
udp      17 119 src=1000::1:c0a8:1c9 dst=1000:: sport=60719 dport=5003 src=1100::402 dst=1000::1:c0a8:102 sport=5003 dport=62141 [ASSURED] mark=0 use=1
udp      17 29 src=1000::1:c0a8:1c9 dst=1000:: sport=43534 dport=5003 [UNREPLIED] src=1000:: dst=1000::1:c0a8:1c9 sport=5003 dport=43534 mark=0 use=1
udp      17 29 src=1000::1:c0a8:1c9 dst=1000:: sport=50938 dport=5003 [UNREPLIED] src=1000:: dst=1000::1:c0a8:1c9 sport=5003 dport=50938 mark=0 use=1
udp      17 119 src=1000::1:c0a8:1c9 dst=1000:: sport=49618 dport=5003 src=1100::402 dst=1000::1:c0a8:102 sport=5003 dport=19547 [ASSURED] mark=0 use=1
udp      17 29 src=1000::1:c0a8:1c9 dst=1000:: sport=56135 dport=5003 [UNREPLIED] src=1000:: dst=1000::1:c0a8:1c9 sport=5003 dport=56135 mark=0 use=1
udp      17 29 src=1000::1:c0a8:1c9 dst=1000:: sport=52045 dport=5003 [UNREPLIED] src=1000:: dst=1000::1:c0a8:1c9 sport=5003 dport=52045 mark=0 use=1
udp      17 119 src=1000::1:c0a8:1c9 dst=1000:: sport=39766 dport=5003 src=1100::402 dst=1000::1:c0a8:102 sport=5003 dport=22024 [ASSURED] mark=0 use=1
udp      17 29 src=1000::1:c0a8:1c9 dst=1000:: sport=44823 dport=5003 [UNREPLIED] src=1000:: dst=1000::1:c0a8:1c9 sport=5003 dport=44823 mark=0 use=1
conntrack v1.4.5 (conntrack-tools): 10 flow entries have been shown.

In this example, 3 out of 10 connections are OK. Those are the fortunate ones that didn't happen to send a packet in the small time span between node start and kube-proxy start (about 1 pkt/sec is sent on each of the 10 connections).
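
Until a fix lands, a manual workaround is to flush the stale entries on the rebooted node; a sketch for the service port used above (note this deletes healthy flows too, but they are re-created correctly on the next packet):

    # delete all UDP conntrack entries whose destination port is 5003
    conntrack -D -p udp --dport 5003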

@uablrek
Contributor

uablrek commented Sep 25, 2021

/area kube-proxy
/area ipvs
