New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No client network with "mesh on lan/wan" but no cable plugged in. #905
Comments
Please provide the output of |
Output of
Output of |
Hmm, probably another netifd race condition... I'll see if I can reproduce the issue and test the lastest netifd patches. |
@neoraider did a netifd backport, could you @jannic test if your issue persists when building gluon using the update-netifd branch? https://github.com/freifunk-gluon/gluon/tree/update-netifd |
Sorry that it took so long to test the updated netifd. As the mentioned tree doesn't exist any more, I tried to find the commit manually. The one I found was fb2e14d, "netifd: update to LEDE 9a5801e7f6e8bc6641ca320e4497d298080f1b24" Unfortunately, the result was that I found exactly the same behavior as before: Without a cable plugged in, br-client is down, completely messing up networking. |
Hmm. Could you test the current master, now that we've moved to LEDE? |
Yes, and already did. First I thought the issue was solved, but on a second try, I observed the very same behavior as before. So it seems like it's still the same. |
probably related to this: Here you can find some collected Infos from two different nodes |
While was not able to reproduce the exact same issues described here or in #1079, I did get my node into a somewhat similar broken situation (in which bat0 would suddenly have br-mesh_lan as primary interface instead of primary0), caused by setup and teardown scripts running concurrently. I'll try to find a way to prevent such race conditions by adding some locking in the appropriate places. Another issue is that something (probably the internal switch) is reset late in boot, after the network is already partially set up, causing some interfaces to be torn down again after being set up. I think these two issues could cause the behaviour described here when the timing is very unfortunate. |
Please test with the current master (e45c303 or later), I've refactored the batman-adv interface management to make it much more robust. |
As #1079 is fixed, and I assume that this is basically the same issue, I'll close this as well. |
This may be the same or related to #635, at least the descriptions are similar. As #635 is closed, and I'm not sure it's the same issue, I created a new one.
In short, I also observe that with Mesh on LAN + Mesh on WAN activated, but no cable plugged into any LAN ports, client networking is completely broken. This time on gluon 2016.2 (to be exact, experimental build 2016.2-1~exp20161004 of Freifunk Aachen, which is, to my knowledge, just 2016.2 with Aachen site.conf). TL-WR841v9 with serial console attached to see what's happening.
To be sure it's not some unrelated configuration issue, I started by resetting the whole config with 'firstboot'. Then, I only configured the following fields in config mode:
After the following reboot, everything was fine. (LAN cable still connected.)
Then I removed the LAN cable and powercycled. Now, the client network (interface br-client) didn't come up, resulting in missing network connectivity, even though the batman network seems to be perfectly fine.
One obvious difference is in the output of 'ip link list'. (See attached files.):
During the whole process, there was no cable plugged in into the WAN port, and Wifi-Mesh was available. Meshing is still on IBSS, as 802.11s is not activated in the Aachen firmware, yet.
with_lan.txt
no_lan.txt
The text was updated successfully, but these errors were encountered: