New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
During Router promotion, there is a time-window which Leader and Router have different router RLOC #7664
Comments
I reproduced this issue with: Two devices:
Context: BR is provisioning the TR with Matter & Thread nwk credentials.
|
@AlanLCollins , thanks for raising this issue. One point of clarification, what needs updating here is the IPv6-to-RLOC mapping, not the SRP entry itself. The IPv6 address in the SRP entry should be the same for both child and router roles. Is it possible for you to provide debug-level logs on |
Please refer to the attachment which includes CHIP & otbr-agent log events. |
One thing I'm noticing is that there is no response to the Address Query:
The Address Query multicast message is sent twice:
But no response is received:
I'm also noticing a fair number of HDLC frame errors, an one happens right after the first Address Query is transmitted:
On a well-implemented serial port, we typically do not see frequent HDLC errors. Are these HDLC errors expected on your platform? |
Hello @jwhui , my apologies for the late response.
|
Great. Thanks.
Yes.
We could consider adding something like this as an optimization, but it would only work for 1-hop neighbors. If the device upgrading to router is multiple hops away from the BR, then this optimization would not apply. |
I will close this ticket, as there are recent improvements in the OT code which triggers the new Router to send MLE Advertisement right after the Router promotion, so this triggers EID-to-RLOC mapping to start faster. |
We still have the following: But this requires some Thread Group Spec / Test Spec updates. |
Ok. I will reopen the ticket to keep track of this. |
Resolved by #8307 |
Describe the bug
During Router promotion, there is a time-window which Leader and Router have different Router RLOC.
The Router starts using the new RLOC right after getting the "Delay Link response"
But the Leader does not update the neighbor table until receives advertisement from the router. Then "delayed message" is sent. So, during that time-window if the Router attempts sending higher layer protocol/application message to the Leader. The Leader rejects as "UnknownNeighbor".
Expected behavior
Either the Leader needs to update its neighbor table sooner, or the Router needs to wait to start using the new RLOC.
Console/log output
Fail_duringRouterPromotion.txt
Additional context
I reproduced this issue with SRP protocol during matter provisioning. The Router goes into promotion from CHILD to ROUTER right in the before triggering a DNS resolution.
The text was updated successfully, but these errors were encountered: