Some Kernels have broken SO_REUSEPORT handling #126

kaechele · 2020-03-06T02:58:36Z

This bug is meant for future reference.

Newer versions of the Tunneldigger broker use SO_REUSEPORT to process multiple tunnels on a single port. In the past, Tunneldigger used a NAT-based workaround to make this work. To simplify the code and remove unnecessary dependencies, this workaround was removed.
Unfortunately, there are several kernel bugs that prevent SO_REUSEPORT from working properly for UDP sockets, and they are only fixed in fairly recent kernels.
Combined with these bugs, the change restricts which kernel versions can be used for brokers. (Tunneldigger clients are unaffected by all of this.)
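As a rough illustration of the socket pattern the affected kernel code handles, here is a minimal Python sketch. It is not Tunneldigger's actual code: the helper names `reuseport_socket` and `tunnel_socket` are hypothetical, and it assumes a scheme in which a listening socket and one connected socket per tunnel all share the same local UDP port.

```python
import socket


def reuseport_socket(addr, port):
    """Create a UDP socket with SO_REUSEPORT set before bind()."""
    s = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    # SO_REUSEPORT must be set before bind(), on every socket sharing the port.
    s.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEPORT, 1)
    s.bind((addr, port))
    return s


def tunnel_socket(addr, port, client):
    """Create a per-tunnel socket on the shared port, connected to one client.

    Assumption: on a kernel with the fixes, datagrams from `client` are
    delivered to this connected socket rather than to the listener.
    """
    s = reuseport_socket(addr, port)
    s.connect(client)
    return s
```

On a kernel without the fixes, the kernel may hand a client's datagrams to the wrong socket in the group, which matches the misdispatch described in #129.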

Kernel versions 5.10.152 and newer exhibit the correct behaviour and should work.

You have probably landed here because you still use an older Linux distribution or haven't updated to a working Kernel version. If you are experiencing this issue you have two options:

1. Update your kernel to a supported version or upgrade your distribution to one that has a supported kernel version. In particular, Fedora 35 and newer as well as Debian 11 (Bullseye) and newer with the latest updates applied should work.
2. If you cannot upgrade the kernel, switch to the legacy branch that still carries the NAT hacks.

Kernel fixes

For the curious among you, the fixes that are needed are:

- net: udp: prefer listeners bound to an address (landed in 5.0)
- udp: correct reuseport selection with connected sockets (landed in 5.4, backported to: 5.3.1, 5.2.17, 4.19.75, introduced a bug fixed by commit 69421bf98482d089e50799f45e48b25ce4a8d154 below)
- udp: Update reuse->has_conns under reuseport_lock. (landed in 6.1, backported to 6.0.6, 5.15.76, 5.10.152, not backported to 5.3.1, 5.2.17, 4.19.75)
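The version list above can be turned into a small check. The following Python sketch is only an illustration: the function name `has_reuseport_fixes` is hypothetical, and it treats any series not named in the backport list (for example 5.11 to 5.14) as unfixed, which is a reading of the list rather than something stated in this issue.

```python
import re

# Minimum patch level per stable series, taken from the backport list above.
FIXED_STABLE = {(5, 10): 152, (5, 15): 76, (6, 0): 6}
# First mainline release that contains all of the fixes.
FIRST_FIXED_MAINLINE = (6, 1)


def has_reuseport_fixes(version: str) -> bool:
    """Return True if an upstream kernel version should carry all fixes."""
    m = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", version)
    if m is None:
        raise ValueError(f"unrecognised kernel version: {version!r}")
    major, minor = int(m.group(1)), int(m.group(2))
    patch = int(m.group(3) or 0)
    if (major, minor) >= FIRST_FIXED_MAINLINE:
        return True
    needed = FIXED_STABLE.get((major, minor))
    return needed is not None and patch >= needed
```

Note that the function expects the upstream version. Some distributions report a package ABI string from `uname -r` (for example `5.10.0-23-amd64`) that does not reflect the upstream patch level, and distributions may also backport fixes on their own, so a negative result is a hint rather than proof.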

PolynomialDivision · 2020-09-13T08:49:09Z

Could you say something about what happens with old kernels? I updated to version 19.07.4 with a 4.14.180 or 4.19.X kernel.
I don't see any obvious issues with that kernel and version while running tunneldigger. I will investigate further.

RalfJung · 2020-09-13T09:22:11Z

Clients were unable to reliably connect with broken kernels, so we saw many connection timeouts or other disconnects in the logs. Also see #129. You should also see warnings specifically pointing out that the kernel is likely buggy.

Maybe Ubuntu backported the problematic patches, who knows. (I assume you are using Ubuntu? You didn't state the distro you are using.)

PolynomialDivision · 2020-09-13T09:56:09Z

> Maybe Ubuntu backported the problematic patches, who knows. (I assume you are using Ubuntu? You didn't state the distro you are using.)

OpenWrt. Thanks I will have a look at the log.

RalfJung · 2020-09-13T10:14:33Z

This issue is about the tunneldigger broker. Are you really running that server-side component on OpenWrt?

PolynomialDivision · 2020-09-13T10:30:14Z

> This issue is about the tunneldigger broker. Are you really running that server-side component on OpenWrt?

No. :O Sorry, then everything is fine. :D

neocturne · 2021-03-03T23:29:35Z

Would using SO_REUSEADDR instead of SO_REUSEPORT be an option? At least using a short test program, kernel 4.19 doesn't seem to show this bug with SO_REUSEADDR (I have not checked older kernels).

While implementing L2TP support for fastd (still work in progress), I noticed another advantage of SO_REUSEADDR: It can be set after bind() of the first socket, while SO_REUSEPORT needs to be set before bind(), which may accidentally allow two processes of the same user to bind to the same port.

With SO_REUSEADDR this can be prevented: Let a process bind its first socket without SO_REUSEADDR; this will fail if the port is already bound by another process. Then set SO_REUSEADDR on the first socket. On subsequent sockets, set SO_REUSEADDR before bind(), so they are allowed to use the same port as the first socket.
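The scheme described above could be sketched in Python roughly as follows. This is an illustration only, not code from fastd or Tunneldigger; the helper names `bind_first` and `bind_additional` are hypothetical, and the behaviour is assumed to be that of Linux UDP sockets.

```python
import socket


def bind_first(addr, port):
    """Bind the process's first socket without SO_REUSEADDR.

    The bind fails with OSError if another socket already holds the port.
    SO_REUSEADDR is only enabled afterwards, so later sockets can join.
    """
    s = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    try:
        s.bind((addr, port))
    except OSError:
        s.close()
        raise
    s.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    return s


def bind_additional(addr, port):
    """Bind a further socket to the same port; SO_REUSEADDR is set before bind()."""
    s = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    s.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    s.bind((addr, port))
    return s
```

A second process that starts with `bind_first` on the same port gets an error, because its own socket does not have SO_REUSEADDR set at bind time.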

RalfJung · 2021-03-04T09:52:27Z

> Would using SO_REUSEADDR instead of SO_REUSEPORT be an option?

I have to admit I am out of my league here; the differences between these flags are beyond my experience in this space. @kaechele did the implementation with SO_REUSEPORT, he might be able to comment. Other than that, if someone writes a PR that switches to SO_REUSEADDR, I'd be willing to test that on our servers and merge it if it works.

kaechele · 2021-03-06T00:05:00Z

I initially implemented this using SO_REUSEPORT as my research suggested this to be best practice from a security standpoint.
Your way of utilizing SO_REUSEADDR looks like a smart way to avoid double-binding a port already in use by the same user but for a different application.
Correct me if I'm wrong here but it looks like you trade the same-user bind protection for protection of user error in this case.
It seems like an edge case scenario that some other (malicious) user on the same machine would try to abuse a reused port to intercept or alter traffic, given that L2TPv3 is not encrypted or authenticated anyway. So this is a sensible trade-off in my eyes.

I don't know if I have an immediate need to switch the current implementation over to SO_REUSEADDR but I'm sure it would be a quick thing to do anyway.
In any case I'm looking forward to playing with fastd's implementation as I love the idea of flexibility in selecting L2TP as an option if I require speed over security.

neocturne · 2021-03-06T10:27:59Z

> Correct me if I'm wrong here but it looks like you trade the same-user bind protection for protection of user error in this case.
> It seems like an edge case scenario that some other (malicious) user on the same machine would try to abuse a reused port to intercept or alter traffic.

This is correct. If running on the same machine as untrusted users, only using low ports for L2TP would mitigate the issue.

pmelange · 2023-03-13T21:03:16Z

So, we (Freifunk Berlin) have been trying to use the NAT-removed version and have run into some strange issues. It seems that if there is a router in between that is also doing NAT (perhaps with an older kernel), the post-NAT-removal version doesn't work and the tunnels time out. It's strange because it works for some people and not for others, and the only difference we can find is the router in between. For example, it works just fine with a recent OpenWrt image, but with a Fritz 7590 with firmware 7.50 it doesn't.

We have reverted to 7c467e6

kaechele · 2023-03-13T21:20:15Z

Sounds like an issue unrelated to this Kernel bug, possibly in the NAT implementation of the faulty routers.
In any case, it would probably be best to open a separate issue and attach some debugging information so the issue can be looked into. Good debugging info would be excerpts of the conntrack table from affected routers or maybe even packet captures.

kaechele closed this as completed Mar 6, 2020

RalfJung mentioned this issue Mar 6, 2020

Reintroduce: Remove NAT logic (v2) #127

Merged

RalfJung changed the title ~~Some Kernels have broken SO_REUSEPORT handling, only one Tunnel connection possible~~ Some Kernels have broken SO_REUSEPORT handling Mar 16, 2020

RalfJung mentioned this issue Mar 16, 2020

Incoming messages get incorrectly dispatched to broker #129

Closed

citronalco added a commit to citronalco/Ansible-Freifunk-Gateway that referenced this issue Dec 12, 2020

gateways_l2tp_slovenija: Install "legacy" branch of Tunneldigger…

e2d3c7a

… for old kernel versions, see wlanslovenija/tunneldigger#126

citronalco mentioned this issue Dec 12, 2020

gateways_l2tp_slovenija: Install "legacy" branch of Tunneldigger for old kernels FreiFunkMuenster/Ansible-Freifunk-Gateway#146

Merged

citronalco added a commit to citronalco/Ansible-Freifunk-Gateway that referenced this issue Dec 12, 2020

gateways_l2tp_slovenija: Install "legacy" branch of Tunneldigger…

5ef3aae

… for old kernels, see wlanslovenija/tunneldigger#126

pmelange mentioned this issue Aug 10, 2023

Frequent reconnection of clients #171

Closed
