Check index uuid when merging incoming cluster state into the local one #9541

bleskes · 2015-02-03T08:38:56Z

In big deployment ClusterState can be large. To make sure we keep reusing objects that were promoted to the Old Gen, ZenDiscovery has an optimization where it tries to reuse existing IndexMetaData object (containing among other things the mappings) from the current cluster state if they didn't change. The comparison currently uses the index name and the metadata version. This is however not enough and we should also check the index uuid. In extreme cases, where cluster state processing is slow and the index in question is deleted and recreated and these operations are batch processed together, we can use the wrong meta data if the version is also identical. This can happen if people create the index with all meta data predefined and no settings were changed.

Closes #9489

Note that this is done against the 1.4 branch as i want it to go there too.

…to local In big deployment ClusterState can be large. To make sure we keep reusing objects that were promoted to the Old Gen, ZenDiscovery has an optimization where it tries to reuse existing IndexMetaData object (containing among other things the mappings) from the current cluster state if they didn't change. The comparison currently uses the index name and the metadata version. This is however not enough and we should also check the index uuid. In extreme cases, where cluster state processing is slow and the index in question is deleted and recreated and these operations are batch processed together, we can use the wrong meta data if the version is also identical. This can happen if people create the index with all meta data predefined and no settings were changed. Closes elastic#9489

martijnvg · 2015-02-03T09:51:50Z

src/test/java/org/elasticsearch/indices/state/RareClusterStateTests.java

+                .put(DiscoveryModule.DISCOVERY_TYPE_KEY, "zen")
+                .build()).get();
+        assertFalse(client().admin().cluster().prepareHealth().setWaitForNodes("2").get().isTimedOut());
+        prepareCreate("test").setSettings(IndexMetaData.SETTING_NUMBER_OF_REPLICAS, cluster().numDataNodes() - 1).addMapping("type").get();


maybe use index.auto_expand_replicas instead?

yeah can do. Slightly fancier :)

martijnvg · 2015-02-03T10:00:05Z

@bleskes Great catch! How did you figured this out? (test failure?) I left a couple of comments/questions, but this looks good to me.

bleskes · 2015-02-03T11:35:12Z

@martijnvg thx. pushed another commit.

I was diagnosing a customer cluster and saw these kind of messages:

[2015-02-03 11:50:30,078][DEBUG][cluster.action.shard     ] [node_t1] [test][3] ignoring shard started, different index uuid, current IKfsCb3sT7WnfUcfpAcbxg, got [test][3], node[RBwFZ5K6TpuocqmI3dgHdw], [P], s[INITIALIZING], indexUUID [hABpEGPPQOuE_9UhwIa8Rg], reason [master [node_t1][8nttXu8AQb2r0wvtGO95VA][Boazs-Air.local][local[2]]{mode=local} marked shard as initializing, but shard state is [STARTED], mark shard as started]

martijnvg · 2015-02-03T12:21:35Z

@bleskes LGTM!

kimchy · 2015-02-03T12:29:40Z

LGTM

…local In big deployment ClusterState can be large. To make sure we keep reusing objects that were promoted to the Old Gen, ZenDiscovery has an optimization where it tries to reuse existing IndexMetaData object (containing among other things the mappings) from the current cluster state if they didn't change. The comparison currently uses the index name and the metadata version. This is however not enough and we should also check the index uuid. In extreme cases, where cluster state processing is slow and the index in question is deleted and recreated and these operations are batch processed together, we can use the wrong meta data if the version is also identical. This can happen if people create the index with all meta data predefined and no settings were changed. Closes #9489 Closes #9541

…local In big deployment ClusterState can be large. To make sure we keep reusing objects that were promoted to the Old Gen, ZenDiscovery has an optimization where it tries to reuse existing IndexMetaData object (containing among other things the mappings) from the current cluster state if they didn't change. The comparison currently uses the index name and the metadata version. This is however not enough and we should also check the index uuid. In extreme cases, where cluster state processing is slow and the index in question is deleted and recreated and these operations are batch processed together, we can use the wrong meta data if the version is also identical. This can happen if people create the index with all meta data predefined and no settings were changed. Closes elastic#9489 Closes elastic#9541

bleskes added the review label Feb 3, 2015

bleskes changed the title ~~Discovery: check in index uuid when merging incoming cluster state into the local one~~ Discovery: check index uuid when merging incoming cluster state into the local one Feb 3, 2015

martijnvg self-assigned this Feb 3, 2015

martijnvg reviewed Feb 3, 2015
View reviewed changes

feedback

29a3958

bleskes closed this in 896e865 Feb 3, 2015

bleskes deleted the index_create_uuid branch February 3, 2015 20:36

clintongormley added v1.4.3 v1.5.0 v2.0.0-beta1 >bug :Distributed/Discovery-Plugins Anything related to our integration plugins with EC2, GCP and Azure resiliency and removed review labels Feb 10, 2015

clintongormley changed the title ~~Discovery: check index uuid when merging incoming cluster state into the local one~~ Check index uuid when merging incoming cluster state into the local one Jun 7, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Check index uuid when merging incoming cluster state into the local one #9541

Check index uuid when merging incoming cluster state into the local one #9541

bleskes commented Feb 3, 2015

martijnvg Feb 3, 2015

bleskes Feb 3, 2015

martijnvg commented Feb 3, 2015

bleskes commented Feb 3, 2015

martijnvg commented Feb 3, 2015

kimchy commented Feb 3, 2015

Check index uuid when merging incoming cluster state into the local one #9541

Check index uuid when merging incoming cluster state into the local one #9541

Conversation

bleskes commented Feb 3, 2015

martijnvg Feb 3, 2015

Choose a reason for hiding this comment

bleskes Feb 3, 2015

Choose a reason for hiding this comment

martijnvg commented Feb 3, 2015

bleskes commented Feb 3, 2015

martijnvg commented Feb 3, 2015

kimchy commented Feb 3, 2015