kube-proxy iptables random routing is inefficient #64580

bfleming-ciena · 2018-05-31T22:16:06Z

I've learned recently that userspace routing was replaced with iptables routing in k8s, and iptables uses random pod selection for load balancing.

This means most of the time we have pods that are idle. If it were round-robin then we would guarantee to distribute evenly.

Is this random selection the only option we have available right now?

Thanks

/sig network

islinwb · 2018-06-01T01:13:13Z

@stonefury Maybe you should try ipvs.

bfleming-ciena · 2018-06-01T23:27:12Z

Oh wow, didn't know that was in the pipeline. It's beta I see, but still, is this to replace iptables mode, or just an alternate mode? Either way, thanks for that @islinwb

dims · 2018-06-02T00:29:02Z

@stonefury it will be GA in 1.11 - #58442

islinwb · 2018-06-02T02:05:55Z

@stonefury Currently IPVS is an alternate mode but I guess someday most people will use it. It takes round-robin as the default load balancing algorithm and you can choose others. (See the ipvs docs 1, 2)

thockin · 2018-06-06T23:23:42Z

A) Random means "equal probability of hitting any backend". If you have idle pods it suggests you either don't have a statistically significant number of connections or you're doing client affinity or something to defeat the randomizer.

B) Round-robin becomes a distributed decision - each node chooses independently of each other node. So to your backend service it's basically random anyway.

Anyway ipvs mode is going GA in 1.11, so please feel free to try it out :)

bfleming-ciena · 2018-06-07T14:58:35Z

@thockin - Yes, I later learned the the dev team uses persistent connections to the microservice, and so they only do a small number of initial connections, so statistically it was not evenly distributed. I had them increase the front end so it would make more connections. So this ultimately was an implementation issue on the dev side, though having round-robin would've made this a non issue for them.

Thanks all

Jeffwan · 2018-07-14T18:31:53Z

@thockin

I use socket to connect to a cluster IP which has many backend pods listening on same port. I notice random is totally not guaranteed. I disable sticky session on Service level (by default) but I am wondering if iptable proxy mode has something in-built to reuse address of backend pod.

I send ~1000 request and they all go to same pod.

zghnr1993 · 2019-11-09T02:51:30Z

i'm wondering too . why many request go to same pod .

k8s-ci-robot added needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. sig/network Categorizes an issue or PR as relevant to SIG Network. and removed needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels May 31, 2018

thockin closed this as completed Jun 6, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kube-proxy iptables random routing is inefficient #64580

kube-proxy iptables random routing is inefficient #64580

bfleming-ciena commented May 31, 2018 •

edited

islinwb commented Jun 1, 2018

bfleming-ciena commented Jun 1, 2018

dims commented Jun 2, 2018

islinwb commented Jun 2, 2018 •

edited

thockin commented Jun 6, 2018

bfleming-ciena commented Jun 7, 2018

Jeffwan commented Jul 14, 2018

zghnr1993 commented Nov 9, 2019

kube-proxy iptables random routing is inefficient #64580

kube-proxy iptables random routing is inefficient #64580

Comments

bfleming-ciena commented May 31, 2018 • edited

islinwb commented Jun 1, 2018

bfleming-ciena commented Jun 1, 2018

dims commented Jun 2, 2018

islinwb commented Jun 2, 2018 • edited

thockin commented Jun 6, 2018

bfleming-ciena commented Jun 7, 2018

Jeffwan commented Jul 14, 2018

zghnr1993 commented Nov 9, 2019

bfleming-ciena commented May 31, 2018 •

edited

islinwb commented Jun 2, 2018 •

edited