New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Forced join when network split #13584
Comments
|
Returning to backlog due to lack of interest/priority. My prototype can be revived from #17625 |
|
Taking note of that we got a user request for something similar as this issue. Discussed in #18067 |
|
Hello @patriknw, I see this got moved to Backlog due to lack of interest/priority. I would like to see this implemented as part of 2.4.0 release (if possible). I am happy to help wrapping up whatever is remaining and create a pull request, only thing is that I would just need a bit of guidance. Is that something you would be interested in? |
|
Thanks for wanting to help out. We need that! Have you looked at my prototype in #17625 ? The big question is what we want? The immediate answer is of course "we want it all, let members join and leave however they want, also during network partitions". That is a rather big change to how the cluster membership is currently designed and would be major undertaking. However, if the answer is more specific we might be able to support some scenarios, as I tried in the prototype. WDYT? |
|
I have been looking into the code and I believe that's what I had in mind. Another Member status where a node could join the cluster even if there is no convergence. Once convergence is reached, leader will move the Member from Weakly Up to Up. I have created this transition schema: I believe that is what we are trying to achieve. Can you double check? I believe everything is pretty much done except for the config parameter that will be required to enable or disable this feature to don't break Gossips coming from 2.3.X that don't know about WeaklyUp. Also, I believe there are some tests failing and maybe we need to merge the base 2.4-M3 (or the latest) because I believe there was some changes from where you implemented this. Thanks and let me know your thoughts on this :) |
|
Sounds good to me. I'll try to talk Roland and team next week to see if we can come to some conclusion. The Distributed Data module is another thing that could benefit from this. It could start using weakly up nodes in gossip replication, but not count them in consistency concerns such as |
|
Cool. I am already working on it. Please let me know if there is any update. |
|
Great, thanks for picking it up |
|
Hi Patrik, should we move this ticket to 2.4.X to add it in 2.4.1? |
* experimental feature, disabled by default * Adding documentation to mention weakly up members. plus adding new diagram.
* experimental feature, disabled by default * Adding documentation to mention weakly up members. plus adding new diagram.
+clu #13584 Accept joining to be WeaklyUp during network split

imported from https://www.assembla.com/spaces/akka/tickets/3584
The leader cannot move joining nodes to Up when there are unreachable nodes (no convergence). Some users might want to add more nodes also when there is a network split.
Probably those nodes should have a special Up status, but be taken in use by for example routers. They are then moved to real Up by the leader on convergence.
The text was updated successfully, but these errors were encountered: