Make modifying operations durable by default. #11011

s1monw · 2015-05-06T12:29:13Z

This commit makes create, update and delete operations on an index durable
by default. The user has the option to opt out to use async translog flushes
on a per-index basis by settings index.translog.durability=request.

Initial benchmarks running on SSDs have show that indexing is about 7% - 10% slower
with bulk indexing compared to async translog flushes. This change is orthogonal to
the transaction log sync interval and will only sync the transaction log if the operation
has not yet been concurrently synced. Ie. if multiple indexing requests are submitted and
one operations sync call already persists the operations of others only one sync call is executed.

Relates to #10933

javanna · 2015-05-06T12:33:13Z

src/main/java/org/elasticsearch/action/bulk/BulkRequest.java

+     */
+    public void setDurable(boolean durable) {
+        this.durable = durable;
+    }


existing getters in this class use the non getter getters convention and settters return this. Don't want to start the whole discussion here but maybe at least in the same class we should follow one single convention.

setters are the way to go I will fix all the others but not in this PR

you mean you will fix every getter and setter in the java api? I didnt know we finally made a decision around this... would love to have a single way to do things throughout the codebase though

yeah we should just do what every java project does and use setters and getters

s1monw · 2015-05-06T19:14:38Z

@rmuir I pushed a new commit and replied to your comment

kimchy · 2015-05-06T19:22:28Z

@s1monw +1, looks great. As a follow up to this change, we need to think about an index level setting for it (or maybe adapt the "sync" config to do this automatically when set to 0), and have more benchmarks on more adventurous environments so we can help explain the tradeoffs

s1monw · 2015-05-06T19:38:09Z

@kimchy after thinking about this I think we should go only with an index level setting and drop the entire per request thing. The reason is that with per-request settings you need to change an application to change the value while with index level settings we can do this via the API which is more flexible and safer, opinions?

kimchy · 2015-05-06T19:39:04Z

@s1monw that was my reasoning towards having an index level setting for it, I tend to prefer it, we can always add per request later if needed

s1monw · 2015-05-06T19:43:06Z

ok moving towards index level setting

rmuir · 2015-05-06T20:43:28Z

thanks for improving the loop with asserts/comments!

s1monw · 2015-05-06T20:49:10Z

@kimchy I think it's ready now with an index level setting

kimchy · 2015-05-06T23:25:32Z

@s1monw this looks great

This commit makes create, update and delete operations on an index durable by default. The user has the option to opt out to use async translog flushes on a per-index basis by settings `index.translog.durability=request`. Initial benchmarks running on SSDs have show that indexing is about 7% - 10% slower with bulk indexing compared to async translog flushes. This change is orthogonal to the transaction log sync interval and will only sync the transaction log if the operation has not yet been concurrently synced. Ie. if multiple indexing requests are submitted and one operations sync call already persists the operations of others only one sync call is executed. Relates to elastic#10933

jpountz · 2015-05-07T08:57:31Z

Should we document this new setting and add a note to the resiliency status page?

s1monw · 2015-05-07T09:04:10Z

@jpountz I don't think we are done yet. I want to open a documentation issue once we are setted with all thte things - the more important issue where we fix the translog corruption is still coming

jpountz · 2015-05-07T09:05:20Z

Sounds great. Just wanted to make sure it doesn't get forgotten. :)

Today we are almost intentionally corrupt the translog if we loose a node due to powerloss or similary disasters. In the translog reading code we simply read until we hit an EOF exception ignoring the rest of the translog file once hit. There is no information stored how many records we are expecting or what the last written offset was. This commit restructures the translog to add checkpoints that are written with every sync operation recording the number of synced operations as well as the last synced offset. These checkpoints are also used to identify the actual transaction log file to open instead of relying on directory traversal. This change adds a significant amount of additional checks and pickyness to the translog code. For instance is the translog now associated with a specific engine via a UUID that is written to each translog file as part of it's header. If an engine opens a translog file it was not associated with the operation will fail. Closes to elastic#10933 Relates to elastic#11011

Add translog checkpoints to prevent translog corruption Closes to #10933 Relates to #11011

s1monw added v2.0.0-beta1 review discuss PITA labels May 6, 2015

javanna reviewed May 6, 2015
View reviewed changes

s1monw force-pushed the durable_by_default branch from 2d9f70f to aa18402 Compare May 7, 2015 08:16

s1monw merged commit aa18402 into elastic:master May 7, 2015

s1monw removed discuss review labels May 7, 2015

jpountz mentioned this pull request May 7, 2015

new write option to enable datastore-like durability #9151

Closed

s1monw mentioned this pull request May 13, 2015

Add translog checkpoints to prevent translog corruption #11143

Merged

mycrEEpy mentioned this pull request May 17, 2015

Node crashes can cause data loss #10933

Closed

s1monw added a commit that referenced this pull request May 18, 2015

Merge pull request #11143 from elastic/feature/translog_checkpoints

f7696ec

Add translog checkpoints to prevent translog corruption Closes to #10933 Relates to #11011

s1monw mentioned this pull request May 21, 2015

Clarify Translog Settings and Semantics in Docs #11287

Closed

clintongormley added >enhancement resiliency :Translog labels May 25, 2015

clintongormley mentioned this pull request Jun 5, 2015

[1.3.7] shard fails to start IllegalArgumentException[No type mapped for [0]] #11502

Closed

$@polyfractal$ polyfractal mentioned this pull request Jan 5, 2016

Default translog durability is "request", but page about translog means "async" elastic/elasticsearch-definitive-guide#462

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make modifying operations durable by default. #11011

Make modifying operations durable by default. #11011

s1monw commented May 6, 2015

javanna May 6, 2015

s1monw May 6, 2015

javanna May 6, 2015

s1monw May 6, 2015

s1monw commented May 6, 2015

kimchy commented May 6, 2015

s1monw commented May 6, 2015

kimchy commented May 6, 2015

s1monw commented May 6, 2015

rmuir commented May 6, 2015

s1monw commented May 6, 2015

kimchy commented May 6, 2015

jpountz commented May 7, 2015

s1monw commented May 7, 2015

jpountz commented May 7, 2015

Make modifying operations durable by default. #11011

Make modifying operations durable by default. #11011

Conversation

s1monw commented May 6, 2015

javanna May 6, 2015

Choose a reason for hiding this comment

s1monw May 6, 2015

Choose a reason for hiding this comment

javanna May 6, 2015

Choose a reason for hiding this comment

s1monw May 6, 2015

Choose a reason for hiding this comment

s1monw commented May 6, 2015

kimchy commented May 6, 2015

s1monw commented May 6, 2015

kimchy commented May 6, 2015

s1monw commented May 6, 2015

rmuir commented May 6, 2015

s1monw commented May 6, 2015

kimchy commented May 6, 2015

jpountz commented May 7, 2015

s1monw commented May 7, 2015

jpountz commented May 7, 2015