Reroute after node join is processed #11960

Closed

@bleskes bleskes commented Jun 30, 2015

#11776 simplified our rerouting logic by removing a scheduled background reroute in favor of an explicit reroute during the cluster state processing of a node join (the only place where we didn't do it explicitly). While that change is conceptually good, it changes the semantics a bit in two ways:

  • shard listing actions underpinning shard allocation do not have access to the new node yet (causing errors during shard allocation; see NoSuchNodeException during startup #11923)

  • the very first cluster state published to a node already has shard assignments to it. This surfaced other issues that we are working to fix separately.

This commit changes the reroute to run after the initial join cluster state has been processed, to sidestep these issues while we work on a longer-term solution.
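
To make the ordering concrete, here is a minimal toy sketch of the idea, not the actual change. Every name in it (`RerouteAfterJoinSketch`, `ToyMaster`, `ToyClusterState`, `handleJoin`, `reroute`) is hypothetical and does not correspond to Elasticsearch classes; the round-robin assignment is just a stand-in for real allocation. It only illustrates that the join is published as its own cluster state, and the reroute follows as a separate update.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical toy model; none of these types are Elasticsearch classes.
public class RerouteAfterJoinSketch {

    /** Toy cluster state: the known nodes plus a shard -> node assignment map. */
    static final class ToyClusterState {
        final Set<String> nodes = new LinkedHashSet<>();
        final Map<String, String> shardToNode = new LinkedHashMap<>();

        ToyClusterState copy() {
            ToyClusterState c = new ToyClusterState();
            c.nodes.addAll(nodes);
            c.shardToNode.putAll(shardToNode);
            return c;
        }
    }

    /** Toy master that records every state it publishes, in order. */
    static final class ToyMaster {
        final List<String> unassignedShards = new ArrayList<>();
        final List<ToyClusterState> published = new ArrayList<>();
        ToyClusterState current = new ToyClusterState();

        void handleJoin(String nodeId) {
            // Step 1: publish a state that only adds the node, with no new assignments.
            ToyClusterState joined = current.copy();
            joined.nodes.add(nodeId);
            publish(joined);
            // Step 2: reroute only after the join state has been processed,
            // so the new node is already known when shards are allocated.
            reroute();
        }

        void reroute() {
            if (unassignedShards.isEmpty() || current.nodes.isEmpty()) {
                return;
            }
            ToyClusterState next = current.copy();
            List<String> nodeList = new ArrayList<>(next.nodes);
            int i = 0;
            // Round-robin is an illustrative placeholder for real allocation logic.
            for (String shard : unassignedShards) {
                next.shardToNode.put(shard, nodeList.get(i++ % nodeList.size()));
            }
            unassignedShards.clear();
            publish(next);
        }

        private void publish(ToyClusterState state) {
            current = state;
            published.add(state.copy());
        }
    }

    public static void main(String[] args) {
        ToyMaster master = new ToyMaster();
        master.unassignedShards.add("shard-0");
        master.handleJoin("node-1");
        System.out.println("first state assignments:  " + master.published.get(0).shardToNode);
        System.out.println("second state assignments: " + master.published.get(1).shardToNode);
    }
}
```

In this toy model the first state a joining node sees contains the node but no shard assignments; the assignment only shows up in the second published state.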

@bleskes added the v2.0.0-beta1, review, :Distributed/Discovery-Plugins, and v1.7.0 labels Jun 30, 2015
kimchy commented Jun 30, 2015

LGTM, thanks @bleskes!

@bleskes bleskes closed this in 182c59f Jun 30, 2015
@kevinkluge kevinkluge removed the review label Jun 30, 2015
bleskes added a commit that referenced this pull request Jun 30, 2015
Closes #11960
szroland pushed a commit to szroland/elasticsearch that referenced this pull request Jun 30, 2015
Closes elastic#11960
@clintongormley added the >bug label and removed the :Distributed/Discovery-Plugins label Jul 1, 2015
bleskes added a commit to bleskes/elasticsearch that referenced this pull request Jul 3, 2015
…join

This commit fixes it and adds an assertion that an initial cluster state from the master never has shards assigned to this node.
bleskes added a commit that referenced this pull request Jul 3, 2015
bleskes added a commit that referenced this pull request Jul 3, 2015
@lcawl added the :Distributed/Distributed label and removed the :Allocation label Feb 13, 2018
Labels
>bug, :Distributed/Distributed, v1.7.0, v2.0.0-beta1
5 participants