Replies: 1 comment
-
|
Failing to service the message queue (often observed as the message queue filling up until it hits its limit) typically indicates an issue within the platform-specific radio driver. In OpenThread, the If the radio driver does not call
Troubleshooting Suggestions:
|
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
ot-br-posix on OpenWrt-based product + RCP, BR role during issue is Router/Leader.
GSDK_4.4.6 https://github.com/SiliconLabs/gecko_sdk/tree/gsdk_4.4
The same issue with openthread/ot-br-posix#2473
In some partition events, the Border Router enters singleton mode and continues to receive Thread traffic, but local MAC data TX stops making forward progress. In this state, partition recovery does not complete without restarting otbr-agent.
Failure signature observed repeatedly:
singleton remains true
MAC RxTotal continues to increase
MAC TxData (and often TxTotal) does not increase
BR remains stuck until otbr-agent restart
1 OTBR Gateway + 15 TME(Thread Mesh Extender) + 85 End devices.
Keep data transmission stream about 2 weeks.
A representative timeline from one reproduced run:
(Ref log full_log.log and full_log_2.log)
18:34:10 -> singleton detected (watchdog log: partition-guard singleton begin)
18:39:11 -> condition persisted for about 300 seconds with RX alive and no TxData progress
18:39:11 -> restart triggered as mitigation, after which topology rebuild events resumed
The core issue is the non-self-recovering state before restart.
full_log.log
full_log_2.log
Beta Was this translation helpful? Give feedback.
All reactions