Don't recover from buggy version #9925

Merged: 1 commit merged into elastic:1.x on Mar 2, 2015

Conversation

s1monw (Contributor) commented Feb 27, 2015

This commit forces a full recovery if the source node is < 1.4.0 and
prevents any recoveries from pre 1.3.2 nodes to
work around #7210

Closes #9922

Note: this is just a start; I need to fix some BWC tests first before this can be pulled in, but I wanted to get the discussion going.
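
For readers skimming the thread, the two gates described above amount to roughly the following checks on the source node's version. This is only a sketch under assumed names (checkSourceVersion and forceFullRecovery are illustrative, not methods from the actual patch); the real change is in the recovery code quoted further down.

// Sketch only: illustrates the two version gates, not the actual diff in this PR.
void checkSourceVersion(DiscoveryNode sourceNode, RecoverySettings recoverySettings) {
    final Version sourceVersion = sourceNode.version();
    if (sourceVersion.before(Version.V_1_3_2) && recoverySettings.compress()) {
        // pre-1.3.2 sources combined with recovery compression hit #7210, so refuse outright
        throw new ElasticsearchIllegalStateException("can't recover from node [" + sourceNode
                + "] with compression enabled, please upgrade the source node first");
    }
    if (sourceVersion.before(Version.V_1_4_0)) {
        // don't reuse existing files on the target; copy everything from the source instead
        forceFullRecovery(); // hypothetical hook standing in for "report no reusable files"
    }
}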

rmuir (Contributor) commented Feb 27, 2015

+1 this looks great.

rjernst (Member) commented Feb 27, 2015

+1

return;
final Version sourceNodeVersion = recoveryStatus.sourceNode().version();
if (sourceNodeVersion.before(Version.V_1_3_2) && recoverySettings.compress()) { // don't recover from pre 1.3.2 if compression is on?
throw new ElasticsearchIllegalStateException("Can't recovery from node "
Review comment on the diff hunk quoted above (Contributor):

Can't recovery -> Can't recover

mikemccand (Contributor) commented:

+1, just left a couple minor comments.

s1monw (Contributor, Author) commented Mar 2, 2015

@mikemccand @rmuir @rjernst pushed a new commit with unit tests...

recoveryTarget.existingFiles(discoNode, store, withCompression);
assertTrue(discoNode.version() + " " + withCompression, version.onOrAfter(Version.V_1_3_2) || withCompression == false);
} catch (ElasticsearchIllegalStateException ex) {
// all is good
Review comment on the test code quoted above (Contributor):

can we check that we expect this exception? i.e., when version is before 1.3.2 and compression is on?
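
One way to make that expectation explicit, reusing the names from the quoted test (a sketch only, not necessarily what was pushed later in the thread):

try {
    recoveryTarget.existingFiles(discoNode, store, withCompression);
    // no exception: only legal when the source is on or after 1.3.2, or compression is off
    assertTrue(discoNode.version() + " " + withCompression,
            version.onOrAfter(Version.V_1_3_2) || withCompression == false);
} catch (ElasticsearchIllegalStateException ex) {
    // the exception should only be thrown for a pre-1.3.2 source with compression enabled
    assertTrue(discoNode.version() + " " + withCompression,
            version.before(Version.V_1_3_2) && withCompression);
}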

bleskes (Contributor) commented Mar 2, 2015

+1 on the change. I think the 1.3.2 part is better implemented as an allocation decider, to prevent the master from repeatedly trying to allocate the shard and failing. I checked, and it's fairly easy to integrate this into NodeVersionAllocationDecider by adding RecoverySettings to the constructor:

private Decision isVersionCompatible(final RoutingNodes routingNodes, final String sourceNodeId, final RoutingNode target, RoutingAllocation allocation) {
    final RoutingNode source = routingNodes.node(sourceNodeId);
    if (recoverySettings.compress() && source.node().getVersion().before(Version.V_1_3_2)) {
        return allocation.decision(Decision.NO, NAME, "source node version [%s] has a known compression bug preventing allocation",
                source.node().version());
    } else if (target.node().version().onOrAfter(source.node().version())) {
        /* we can allocate if we can recover from a node that is younger or on the same version;
         * if the primary is already running on a newer version that won't work due to possible
         * differences in the lucene index format etc. */
        return allocation.decision(Decision.YES, NAME, "target node version [%s] is same or newer than source node version [%s]",
                target.node().version(), source.node().version());
    } else {
        return allocation.decision(Decision.NO, NAME, "target node version [%s] is older than source node version [%s]",
                target.node().version(), source.node().version());
    }
}
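
For completeness, the constructor wiring bleskes mentions would look roughly like this (a sketch; the exact constructor shape of NodeVersionAllocationDecider on the 1.x branch may differ):

private final RecoverySettings recoverySettings;

@Inject
public NodeVersionAllocationDecider(Settings settings, RecoverySettings recoverySettings) {
    super(settings);
    this.recoverySettings = recoverySettings; // consumed by isVersionCompatible() above
}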

s1monw (Contributor, Author) commented Mar 2, 2015

@bleskes here is a new commit

// clusterState = stabilize(clusterState, service);
// routingTable = clusterState.routingTable();
// for (int i = 0; i < routingTable.index("test").shards().size(); i++) {
// assertThat(routingTable.index("test").shard(i).shards().size(), equalTo(3));
Review comment on the test code quoted above (Contributor):

leftovers?

s1monw (Contributor, Author) replied:

hmm yeah :D

bleskes (Contributor) commented Mar 2, 2015

LGTM. Thx @s1monw

The merged commit's message:

This commit forces a full recovery if the source node is < 1.4.0 and
prevents any recoveries from pre 1.3.2 nodes if compression is enabled to
work around elastic#7210

Closes elastic#9922
s1monw merged commit dd78370 into elastic:1.x on Mar 2, 2015
clintongormley commented:

Closes #9922

clintongormley added the >bug, v2.0.0-beta1, release highlight, v1.5.0, :Distributed/Recovery (anything around constructing a new shard, either from a local or a remote source), and resiliency labels on Mar 19, 2015
clintongormley changed the title from "[RECOVERY] Don't recover from buggy version" to "Don't recover from buggy version" on Jun 8, 2015