Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Network-wide OLSR error on the Berlin Freifunk Network #66

Closed
pmelange opened this issue Nov 26, 2018 · 12 comments
Closed

Network-wide OLSR error on the Berlin Freifunk Network #66

pmelange opened this issue Nov 26, 2018 · 12 comments

Comments

@pmelange
Copy link
Contributor

This is a copy of issue freifunk-berlin/firmware#628

On 22.11.2019 all of the routers on the entire Berlin Backbone started printing the following error messages, repeating every second.

Thu Nov 22 13:53:08 2018 daemon.info olsrd[7015]: Received netlink error
code Invalid argument (-22)
Thu Nov 22 13:53:08 2018 daemon.err olsrd[7015]: . error: del route to
171.159.48.121/254.0.0.0 via 0.0.0.0 dev void onlink (Resource
temporarily unavailable 11)
Thu Nov 22 13:53:08 2018 daemon.err olsrd[7015]: Delete route
171.159.48.121/7 via 0.0.0.0: Resource temporarily unavailable

The only known methods to stop the error messages was to restart the OLSR4 service or to reboot the router.

This also effected every router attach to the BBB-VPN.

Every version of the OLSR daemon was hit by this problem. From 0.6.x to the latest 0.9.6.2

The cause of this message is unknown.

@HRogge
Copy link
Contributor

HRogge commented Dec 5, 2018 via email

@pmelange
Copy link
Contributor Author

pmelange commented Dec 5, 2018

Yes, very strange. No, I don't think that any node on the mesh network would announce 0.0.0.0. Also the 171.159.48.121 address (with a huge netmask) is strange. All the nodes on our mesh network are in the 10.0.0.0/8 address space. And no nodes should state that they have a /8 netmask either.

Take a look at the email thread on the freifunk-berlin mailing list.
https://lists.berlin.freifunk.net/pipermail/berlin/2018-November/038406.html

@HRogge
Copy link
Contributor

HRogge commented Dec 6, 2018 via email

@pmelange
Copy link
Contributor Author

pmelange commented Dec 6, 2018

I suppose it could be user error on some node. But the result is the same. Unfortunately I didn't check the routing tables to see if there is also a "via 0.0.0.0" entry.

If it is not possible for OLSR to delete a route, shouldn't OLSR handle it differently than repeatedly retrying to delete it?

@HRogge
Copy link
Contributor

HRogge commented Dec 6, 2018 via email

@pmelange
Copy link
Contributor Author

pmelange commented Dec 6, 2018

I guess it was some kind of mass hysteria memory corruption. I don't know how to repeat it.

Well, if it happens again (it might take a few years) shall I open another ticket?

@HRogge
Copy link
Contributor

HRogge commented Dec 7, 2018 via email

@pmelange
Copy link
Contributor Author

After one week, there was no answer on the freifunk-berlin mailing list. Closing

@pktpls
Copy link

pktpls commented Apr 19, 2022

This happened again today with a /4 HNA (errors above for a /7). When that HNA was withdrawn/expires, its removal from the kernel routing table started looping with the errors mentioned above.

@PolynomialDivision could you reopen?

@mathiashro
Copy link
Contributor

Hi @pktpls / @pmelange , anything we can do here (as the case is open for quite some time)?

@pmelange
Copy link
Contributor Author

pmelange commented Nov 1, 2024

I haven't seen this happen again and i don't know how to reproduce it.

@mathiashro
Copy link
Contributor

Thank you. I‘ll close it here the moment, we can reopen it once someone catches the error again.

Hope this is fine for you as well.

@mathiashro mathiashro closed this as not planned Won't fix, can't repro, duplicate, stale Nov 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants