Allow rebalancing primary shards on shared filesystems #10585

dakrone · 2015-04-14T05:21:06Z

Instead of failing the Engine for a shared filesystem, this change
allows a "soft close" of the Engine, where only the IndexWriter is
closed so that the replica can open an IndexWriter using the same
filesystem directory/mount.

Fixes #10469
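
For illustration, the "soft close" idea can be sketched with a toy model. This is not the code from this PR: `SharedDirectory`, `ToyWriter`, and `ToyEngine` are hypothetical stand-ins, and the single write lock is a simplifying assumption about how two writers contend for one directory.

```java
import java.util.concurrent.atomic.AtomicBoolean;

// Toy model only: hypothetical classes, not Lucene or Elasticsearch types.
class SharedDirectory {
    // Assumption: at most one writer may hold the directory at a time.
    private final AtomicBoolean writeLock = new AtomicBoolean(false);

    boolean tryLock() { return writeLock.compareAndSet(false, true); }
    void unlock() { writeLock.set(false); }
}

class ToyWriter {
    private final SharedDirectory dir;
    private boolean open = true;

    ToyWriter(SharedDirectory dir) {
        if (!dir.tryLock()) {
            throw new IllegalStateException("directory is already locked by another writer");
        }
        this.dir = dir;
    }

    // Closing the writer releases the directory for another writer.
    void close() {
        if (open) {
            open = false;
            dir.unlock();
        }
    }

    boolean isOpen() { return open; }
}

class ToyEngine {
    private final ToyWriter writer;
    private boolean failed = false;

    ToyEngine(SharedDirectory dir) { this.writer = new ToyWriter(dir); }

    // Hard path: mark the whole engine as failed and release the writer.
    void fail() {
        failed = true;
        writer.close();
    }

    // Soft path: release only the writer; the engine is not marked as failed.
    void softClose() { writer.close(); }

    boolean isFailed() { return failed; }
    boolean isWriterOpen() { return writer.isOpen(); }
}
```

In this sketch a second `ToyEngine` on the same `SharedDirectory` cannot be constructed until the first one calls `softClose()` (or `fail()`); after a soft close the first engine still reports `isFailed() == false`.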

dakrone · 2015-04-14T05:22:56Z

@s1monw this PR is still missing a test where replication failure is simulated (like you asked for), but I wanted to vet the idea of a "soft-close" of the Engine first. I had to make some changes after #10452 was merged to allow an Engine to be closed without closing the Translog, because closing the translog caused all future recovery to hang when translog.snapshot() is called.

Let me know what you think and I will work on adding more tests tomorrow.

s1monw · 2015-04-14T08:50:35Z

@dakrone I don't think we should do the separate close methods and all that kind of stuff. I'd rather just call sync instead of close on the translog, especially given that @bleskes is working on larger refactorings on that end, so all the changes here are likely only needed on 1.x.

dakrone · 2015-04-14T12:36:02Z

@s1monw wouldn't that leave the translog in a never-closed state then? Or is the translog closed somewhere else? Does that just rely on the injector being closed to close it?

s1monw · 2015-04-14T13:45:35Z

@dakrone the translog is closed later; the sync is the important part.

Relates to elastic#10585

s1monw · 2015-04-21T11:06:49Z

@dakrone I think this is good functionality-wise, but I think the implementation needs to be less intrusive, i.e. we should implement this as a subclass of the engine instead of adding all these settings, changing how the recovery handler works, and calling back into it. I took a quick step, copying your test and adding this quick and dirty to the engine factory, and I think it's cleaner. What do you think about this? s1monw@5fd56da
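
A rough sketch of the "subclass chosen by the engine factory" shape described above. All names here (`PlainEngine`, `SoftCloseEngine`, `EngineChooser`, `releaseForRelocation`, the `sharedFilesystem` flag) are hypothetical and only illustrate the structure; they are not taken from the referenced commit.

```java
// Hypothetical names throughout; none of these are the actual Elasticsearch classes.
class PlainEngine {
    protected boolean writerOpen = true;
    protected boolean failed = false;

    // Default behaviour when the engine has to give up its shard copy:
    // fail the whole engine.
    void releaseForRelocation() {
        failed = true;
        writerOpen = false;
    }

    boolean isFailed() { return failed; }
    boolean isWriterOpen() { return writerOpen; }
}

// The shared-filesystem behaviour lives in one subclass instead of being
// spread across settings and recovery-handler callbacks.
class SoftCloseEngine extends PlainEngine {
    @Override
    void releaseForRelocation() {
        // Only the writer is released; the engine is not marked as failed.
        writerOpen = false;
    }
}

class EngineChooser {
    // Assumption: a single boolean says whether the index lives on a shared filesystem.
    static PlainEngine newEngine(boolean sharedFilesystem) {
        return sharedFilesystem ? new SoftCloseEngine() : new PlainEngine();
    }
}
```

Callers only see `PlainEngine` and never branch on the filesystem type themselves; the factory is the single place where that decision is made.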

dakrone · 2015-04-21T16:27:41Z

@s1monw I updated this to use the method that you came up with

s1monw · 2015-04-21T18:11:42Z

src/test/java/org/elasticsearch/index/IndexWithShadowReplicasTests.java

@@ -310,6 +322,176 @@ public void testPrimaryRelocation() throws Exception {
    }

    @Test
+    @TestLogging("_root:DEBUG,index:TRACE")


do we still need that?

s1monw · 2015-04-21T18:13:34Z

Left minor comments, LGTM otherwise.

bleskes · 2015-04-22T07:56:38Z

@dakrone this is the one that only goes to 1.x, right? Asking because it still has the 2.0.0 label on it...

s1monw · 2015-04-22T09:09:13Z

@bleskes the plan is to push this impl to 1.x and another impl to master based on your refactoring in #10624. Makes sense? I removed the 2.0 label for now.

bleskes · 2015-04-22T09:09:55Z

Yeah, makes total sense, just checking because of the label. Thx.

s1monw · 2015-04-22T09:11:11Z

src/test/java/org/elasticsearch/test/engine/MockSharedFSEngine.java

+/**
+ * TODO: document me!
+ */
+public class MockSharedFSEngine extends MockInternalEngine {


I think this should be a subclass of SharedFSEngine and use the new MockEngineSupport from #10700

s1monw · 2015-04-22T16:19:49Z

sweet! LGTM

dakrone · 2015-04-22T17:21:05Z

This has been merged to 1.x only. I will rewrite and open a new PR once #10624 is merged into master, since it refactors much of the recovery process.

dakrone added v2.0.0-beta1 review v1.6.0 v1.5.2 labels Apr 14, 2015

kevinkluge added the in progress label Apr 14, 2015

dakrone force-pushed the allow-shadow-primary-relocation branch 3 times, most recently from a76d62c to 3fc625a Compare April 16, 2015 15:25

s1monw self-assigned this Apr 16, 2015

s1monw added a commit to s1monw/elasticsearch that referenced this pull request Apr 21, 2015

first cut all tests pass

5fd56da

Relates to elastic#10585

clintongormley added blocker and removed v1.5.2 labels Apr 21, 2015

s1monw reviewed Apr 21, 2015

s1monw removed the v2.0.0-beta1 label Apr 22, 2015

s1monw reviewed Apr 22, 2015

dakrone force-pushed the allow-shadow-primary-relocation branch from 5fc9fd3 to 24bf3de Compare April 22, 2015 15:15

dakrone force-pushed the allow-shadow-primary-relocation branch from 84e8a70 to cd57ed7 Compare April 22, 2015 16:21

dakrone closed this Apr 22, 2015

kevinkluge removed the in progress label Apr 22, 2015

clintongormley added v1.5.2 >enhancement labels Apr 26, 2015

clintongormley changed the title ~~[CORE] Allow rebalancing primary shards on shared filesystems~~ Allow rebalancing primary shards on shared filesystems May 29, 2015

dakrone deleted the allow-shadow-primary-relocation branch June 1, 2015 22:34
