master node was forced to rejoin #12415
Comments
@chenryn can you share your cluster state on a gist? you can get it via Also, can you post the complete logs? You redacted some things for brevity (...(many nodes here)...) but it is important to get a complete picture...
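For reference, the cluster state is exposed over the REST API at `/_cluster/state`. The following is a minimal sketch of dumping it to a file for a gist, not the command from the original comment. The host, port, and output filename are assumptions and do not come from this thread.

```python
import json
import urllib.request


def cluster_state_url(host="localhost", port=9200):
    """Build the URL of the cluster state endpoint (host and port are assumptions)."""
    return "http://{}:{}/_cluster/state".format(host, port)


def dump_cluster_state(path, host="localhost", port=9200, timeout=10):
    """Fetch the cluster state as JSON and write it, pretty-printed, to `path`."""
    with urllib.request.urlopen(cluster_state_url(host, port), timeout=timeout) as resp:
        state = json.load(resp)
    with open(path, "w") as out:
        json.dump(state, out, indent=2, sort_keys=True)
    return state
```

Calling `dump_cluster_state("cluster_state.json")` against a reachable node writes the full state to disk; the `master_node` key of the returned dictionary holds the ID of the node currently elected master.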
@bleskes I uploaded the cluster state and the logs from one rejoin cycle to https://gist.github.com/chenryn/0aa3ba4742b3741d1f01
@chenryn thx. The cluster state in ES is the same on all nodes except for a little flag indicating which of the nodes is the local node. Your cluster state is missing that flag, which causes the master to publish a new cluster state to itself (which we shouldn't do). This causes it to think there is another master active, and it responds by telling the other master to step down. The other master (i.e., the same node) receives the command and steps down, only to re-elect itself. The biggest question here is how the node ended up without the local flag set. Do you have any custom plugins installed? Was anything else out of order before this started happening?
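The loop described above can be illustrated with a toy model. This is not Elasticsearch code: `NodeEntry`, its `is_local` field, and the two functions are hypothetical simplifications that only show why a missing local flag makes the master publish to itself and then step down.

```python
from dataclasses import dataclass


@dataclass
class NodeEntry:
    """One node as listed in a cluster state (hypothetical, simplified)."""
    node_id: str
    is_local: bool = False  # stands in for the internal "local node" flag


def publish_targets(entries):
    """Return the node IDs the master would publish a new state to.

    The master is meant to skip itself, and in this toy model it recognises
    itself only through the local flag.
    """
    return [e.node_id for e in entries if not e.is_local]


def forces_self_rejoin(master_id, entries):
    """Return True if the master ends up telling itself to step down."""
    for target in publish_targets(entries):
        if target == master_id:
            # The master receives its own publication, concludes that another
            # master is active, and tells "that" master (itself) to step down.
            return True
    return False


# With the flag set, the master skips itself; without it, the loop starts.
healthy = [NodeEntry("master", is_local=True), NodeEntry("data-1")]
broken = [NodeEntry("master", is_local=False), NodeEntry("data-1")]
```

In this model `forces_self_rejoin("master", healthy)` is false and `forces_self_rejoin("master", broken)` is true, which mirrors the step-down and re-election cycle seen in the logs.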
No plugins installed. One client node died and was rebooted before the first rejoin happened: the "10.19.0.96" in the log above.
btw: what does the local flag look like? I checked the state of another cluster, and it seems no different from this cluster.
I got the same problem again:
@chenryn sorry for not getting back to you - I was out for two weeks. The flag is something internal and is not serialized to the REST API. You can see it if you connect via the Java API. You say you don't use any plugins. Is there anything else in your deployment that may be unusual? Do you embed ES? Can you reproduce this by any chance with a small setup you can share?
No more feedback - closing
Yes, I couldn't reproduce this either.
Elasticsearch 1.6.0
The master node is 10.19.0.100, and es.log records the following. It discovers itself as "also master but with an older cluster_state", then forces itself to rejoin. This happened over and over.
I tried restarting 10.19.0.100, but it had no effect. I then had to stop that master, restart all the other nodes so that they would elect another master, and then start this node again. Now the cluster health is green.
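After a recovery like this, one way to confirm that the nodes agree on a single master is to ask each node for its own local view of the cluster state. The sketch below assumes every node serves HTTP on port 9200 and uses the `master_node` metric with `local=true`; the host list passed in is up to the caller and is not taken from this thread.

```python
import json
import urllib.request


def local_master_url(host, port=9200):
    """URL for one node's own view of the elected master (port is an assumption)."""
    return "http://{}:{}/_cluster/state/master_node?local=true".format(host, port)


def fetch_local_master(host, port=9200, timeout=10):
    """Return the master node ID as seen by `host`, or None if it reports none."""
    with urllib.request.urlopen(local_master_url(host, port), timeout=timeout) as resp:
        return json.load(resp).get("master_node")


def masters_agree(views):
    """True if every node reports the same, non-empty master ID.

    `views` maps a host to the master node ID that host reported.
    """
    reported = set(views.values())
    return len(reported) == 1 and None not in reported
```

Building `views` with `{host: fetch_local_master(host) for host in hosts}` and passing it to `masters_agree` gives a quick check that no node still holds a stale view of the master.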