Hi, @Excpt0r
I think I have found the reason: a pre-vote or vote request first sends a small packet to the other end to check that the connection is available, and during an election this check gets blocked by a dead node.
The first solution:
Bolt has a small problem: it blocks for at least 1 second while creating a connection, and this value cannot be changed. I have discussed this with a member of the Bolt PMC, who will fix it by the end of June. Until then, I recommend increasing election_timeout to 2 seconds (decreasing rpc_timeout will not help here).
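To make the reasoning behind the 2-second recommendation concrete, here is a rough back-of-the-envelope sketch. It is not JRaft code: the class and method names are hypothetical, the 50 ms round trip is an assumed value, and the model is deliberately simplified (it assumes a connect attempt to a dead peer stalls for the full block before the round can continue). The heartbeat line assumes the heartbeat interval is derived as the election timeout divided by a heartbeat factor of 10, which should be checked against the JRaft version in use.

```java
/**
 * Simplified timing model, NOT JRaft code. All names are hypothetical.
 */
final class ElectionTimingSketch {

    // Time left in one election round after a connect attempt to a dead peer has blocked.
    static long headroomMs(long electionTimeoutMs, long connectBlockMs) {
        return electionTimeoutMs - connectBlockMs;
    }

    // True if a vote round trip to the healthy peers still fits in the remaining time.
    static boolean roundCanFinish(long electionTimeoutMs, long connectBlockMs, long voteRoundTripMs) {
        return headroomMs(electionTimeoutMs, connectBlockMs) >= voteRoundTripMs;
    }

    // Assumed derivation: heartbeat interval = election timeout / heartbeat factor.
    static long heartbeatIntervalMs(long electionTimeoutMs, int heartbeatFactor) {
        return electionTimeoutMs / heartbeatFactor;
    }

    public static void main(String[] args) {
        long connectBlockMs = 1000; // the minimum connect block described above
        long voteRoundTripMs = 50;  // assumed round trip to a healthy peer
        int heartbeatFactor = 10;   // assumed factor, verify for your version

        for (long electionTimeoutMs : new long[] {1000, 2000}) {
            System.out.printf(
                "electionTimeout=%dms headroom=%dms roundCanFinish=%b heartbeat=%dms%n",
                electionTimeoutMs,
                headroomMs(electionTimeoutMs, connectBlockMs),
                roundCanFinish(electionTimeoutMs, connectBlockMs, voteRoundTripMs),
                heartbeatIntervalMs(electionTimeoutMs, heartbeatFactor));
        }
    }
}
```

Under these assumptions, a 1-second election timeout leaves no headroom once a connect attempt has blocked for 1 second, while a 2-second timeout leaves a full second for the vote round. If the heartbeat interval is indeed derived from the election timeout, it scales along with it, so the factor would not need to change just because the timeout was raised.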
The second solution:
You can use gRPC as the network framework: add jraft-extension/rpc-grpc-impl to your POM and decrease rpc_timeout.
Hi,
This is a follow-up to #583.
Big thanks for your quick response and the new release!
I have now tested with the new version 1.3.7, which includes fix #586 from @fengjiachun.
The logs of the test are attached; the election took 22s:
van-device-sub-0_jraft1.3.7.log
van-device-sub-2_jraft1.3.7.log
After that I tried a different timeout, as suggested by @killme2008 here.
I reduced rpcTimeout from 1000ms to 500ms and re-ran the tests. The logs are attached; the election took 25s:
van-device-sub-0_jraft1.3.7_rpcTimeout500ms.log
van-device-sub-2_jraft1.3.7_rpcTimeout500ms.log
Since the problem does not occur every time, it is not easy to debug. Any further hints would be appreciated.
As a next step I will experiment with the electionTimeout parameter (does that need a different heartbeat factor too?).
Could it help to use gRPC instead of Bolt, and how to configure that?
Or maybe changing RaftOptions#maxElectionDelayMs or stepDownWhenVoteTimedout=false?
Have you seen similar behaviour on your side, or could there be a general network problem on my side?
Thanks for your help