Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

My cluster separated into one cluster per node #20

Open
theikell opened this issue Jul 16, 2014 · 2 comments
Open

My cluster separated into one cluster per node #20

theikell opened this issue Jul 16, 2014 · 2 comments
Labels

Comments

@theikell
Copy link

I expect the cause of this was related to an Azure network issue, but how can I monitor or better yet, prevent this from happening again.

Here's my story. I'm running development cluster of 3 VM's on Azure, they are all members of the same VLAN. All are running ES 1.2.2. I've setup discovery as per the docs. Everything works great and has been for about a month. Yesterday I started experiencing problems. While investigating I remoted into each node and logged onto Marvel using localhost. Each node reported that they were the only node in the cluster and that they were the master.

I restarted the ES service on two of the nodes and upon restart they discovered node that I did not restart, and the cluster reformed as expected.

I suspicion is that a network problem caused a communication failure between the nodes but when they network problem was resolved the nodes did not re-discover each other. I assume the discovery process only happens when a node starts, true?

I expect there is nothing I can do to prevent this from happening since it seems to have been initiated by an issue in the Azure datacenter, but is there a way to monitor this, or have the nodes attempt to rediscover each other with out me having to manually restart them?

@dadoonet dadoonet added the bug label Jul 16, 2014
@dadoonet dadoonet self-assigned this Jul 16, 2014
@dadoonet
Copy link
Member

I agree. I think something is not working as expected in that case.
I need to investigate and try to reproduce.

BTW, I think you should set minimum_master_nodes:2 in your case to avoid the split brain issue.

@theikell
Copy link
Author

Thank you for your advice regarding minimum_master_nodes, I'm making that change now.

@dadoonet dadoonet removed their assignment Oct 24, 2014
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants