
Optimize indexing for the autogenerated ID append-only case #20211

Merged
merged 23 commits into from Sep 1, 2016

Conversation

s1monw
Contributor

@s1monw s1monw commented Aug 29, 2016

If Elasticsearch controls the ID values as well as the document
versions, we can optimize the code that adds / appends the documents
to the index. Essentially we can skip the version lookup for all
documents unless the same document is delivered more than once.

On the Lucene level we can simply call IndexWriter#addDocument instead
of #updateDocument, but on the engine level we need to ensure that we de-optimize
the case once we see the same document more than once.

This is done as follows:

  1. Mark every request with a timestamp. This is done once on the first node that
    receives a request and is fixed for this request. This can even be the
    machine's local time (see why later). The important part is that retry
    requests will have the same value as the original one.
  2. In the engine we make sure we keep the highest seen timestamp of "retry" requests.
    This is updated while the retry request holds its doc ID lock. Call this maxUnsafeAutoIdTimestamp.
  3. When the engine runs an "optimized" request, it compares the request's timestamp with the
    current maxUnsafeAutoIdTimestamp (but doesn't update it). If the request's
    timestamp is higher, it is safe to execute it as optimized (no retry request with the same
    timestamp has run before). If not, we fall back to "non-optimized" mode, run the request as a retry,
    and update maxUnsafeAutoIdTimestamp unless it has already been updated to a higher value.

Relates to #19813
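The three steps above can be sketched as a small self-contained class. This is a sketch only: the class and method names are hypothetical, while `maxUnsafeAutoIdTimestamp` follows the description.

```java
import java.util.concurrent.atomic.AtomicLong;

// Rough sketch of the append-only optimization described above.
// Names other than maxUnsafeAutoIdTimestamp are hypothetical.
class AppendOnlyPlanner {
    private final AtomicLong maxUnsafeAutoIdTimestamp = new AtomicLong(-1);

    /** Decide whether an auto-ID index request can skip the version lookup. */
    boolean canOptimizeAddDocument(boolean isRetry, long autoIdTimestamp) {
        if (isRetry) {
            // A retry may have been delivered before; raise the watermark so
            // later "optimized" requests with the same timestamp de-optimize.
            long current;
            do {
                current = maxUnsafeAutoIdTimestamp.get();
                if (current >= autoIdTimestamp) {
                    break; // watermark already high enough
                }
            } while (!maxUnsafeAutoIdTimestamp.compareAndSet(current, autoIdTimestamp));
            return false; // retries always take the safe updateDocument path
        }
        // Safe to addDocument only if no retry with this timestamp (or a later
        // one) has been seen; note we read but do not update the watermark.
        return autoIdTimestamp > maxUnsafeAutoIdTimestamp.get();
    }
}
```

Note how a fresh request with a timestamp at or below the watermark falls back to the non-optimized path, which matches step 3.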

if (highestDeOptimzeAddDocumentTimestamp.get() >= index.timestamp()) {
break;
}
} while(highestDeOptimzeAddDocumentTimestamp.compareAndSet(deOptimizeTimestamp, index.timestamp()) == false);
Contributor

we need to recapture deOptimizeTimestamp in the loop no?
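A variant that recaptures the current value on every iteration might look like the following. This is a sketch, not the PR's actual code; the field name is taken from the diff above.

```java
import java.util.concurrent.atomic.AtomicLong;

class DeOptimizeWatermark {
    final AtomicLong highestDeOptimzeAddDocumentTimestamp = new AtomicLong(-1);

    // Recapture the current value on every iteration so that a concurrent
    // update between get() and compareAndSet() simply triggers another
    // attempt with fresh state instead of spinning on a stale snapshot.
    void updateTo(long requestTimestamp) {
        long current;
        do {
            current = highestDeOptimzeAddDocumentTimestamp.get();
            if (current >= requestTimestamp) {
                return; // already at or above this timestamp; nothing to do
            }
        } while (!highestDeOptimzeAddDocumentTimestamp.compareAndSet(current, requestTimestamp));
    }
}
```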

@mikemccand
Contributor

I tested performance on indexing geonames (8.6 M docs) on a fast 8 core box on a fast SSD with plenty of RAM (64 GB, vs 4 GB for JVM heap):

before:
 Total docs/sec: 44526.0
 Total docs/sec: 45460.2

after:
 Total docs/sec: 53650.4
 Total docs/sec: 53639.6

~20% speedup, nice :)

@s1monw s1monw removed the WIP label Aug 29, 2016
return isAutogeneratedID;
}

public boolean isCanHaveDuplicates() {
Member

public boolean canHaveDuplicates() ?

* add assertions that canHaveDuplicates is only true if isAutoGeneratedID is true
* add more documentation
}
if (allowIdGeneration && id == null) {
assert autoGeneratedID == false;
autoGeneratedID = true;
Contributor

can we explicitly set canHaveDuplicates to false here (and default it to true)? or at least assert it is false? (I prefer the first)

Contributor Author

I don't think we should do this, to be honest. I added assertions along the way to ensure it's only true if autoGeneratedID is true too, and it's documented. We could switch to something like:

enum IDSource {
  External, AutoGenerated, RetryAutoGenerated
}

to have the different states clear, WDYT?
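A minimal sketch of how the suggested enum could subsume the two boolean flags; all names besides `IDSource` and its constants are hypothetical:

```java
// The enum makes the invariant (canHaveDuplicates implies auto-generated ID)
// hold by construction instead of by assertion. Hypothetical helper class.
class IndexRequestIdState {
    enum IDSource { External, AutoGenerated, RetryAutoGenerated }

    static boolean isAutoGeneratedID(IDSource source) {
        return source != IDSource.External;
    }

    static boolean canHaveDuplicates(IDSource source) {
        // Only a retried auto-ID request can have been delivered twice.
        return source == IDSource.RetryAutoGenerated;
    }
}
```

With three explicit states, the invalid combination (canHaveDuplicates true while isAutoGeneratedID false) simply cannot be represented.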

@s1monw
Contributor Author

s1monw commented Aug 31, 2016

@bleskes thx so much for the in-depth reviews, this would have been impossible without that! I will wait for CI to run and push tomorrow morning

@mikemccand
Contributor

I re-tested performance gain on latest PR, still ~20% speedup! Nice:

before:

  Total docs/sec: 44299.1
  Total docs/sec: 43192.2  

after:

  Total docs/sec: 53321.2
  Total docs/sec: 52980.7

@bleskes
Contributor

bleskes commented Aug 31, 2016

@mikemccand w00t. I presume this was without replicas?

@mikemccand
Contributor

@bleskes yes: single node with defaults, except 4 GB heap.

@s1monw s1monw merged commit a0becd2 into elastic:master Sep 1, 2016
jpountz added a commit to jpountz/elasticsearch that referenced this pull request Sep 2, 2016
 - use auto-generated ids for indexing elastic#20211
 - use rounded dates in queries elastic#20115
s1monw added a commit that referenced this pull request Sep 2, 2016
To ensure we don't add documents more than once. This is mostly paranoia,
except for one case: a shard is relocated away from and back to the same node
while an initial request is in flight but has not yet finished AND is retried.

Yet, this is a possible case, and for that reason we ensure we pass on the
maxUnsafeAutoIdTimestamp when we prepare for translog recovery.

Relates to #20211
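A rough sketch of what passing on the watermark during translog-recovery preparation could look like; this is a hypothetical simplified shape, not the actual engine code:

```java
// Sketch only: seeding the engine's watermark before replaying translog ops,
// so in-flight retried auto-ID requests cannot be re-optimized after a
// relocation back to the same node. All names here are hypothetical.
class EngineConfigSketch {
    long maxUnsafeAutoIdTimestamp = -1;

    void prepareForTranslogRecovery(long inheritedMaxUnsafeAutoIdTimestamp) {
        // Never lower the watermark; only inherit a higher one.
        maxUnsafeAutoIdTimestamp =
            Math.max(maxUnsafeAutoIdTimestamp, inheritedMaxUnsafeAutoIdTimestamp);
    }
}
```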
bleskes added a commit that referenced this pull request Oct 31, 2016
A replication request may arrive at a replica before the replica's node has processed a required mapping update. In these cases the TransportReplicationAction will retry the request once a new cluster state arrives. Sadly that retry logic failed to call `ReplicationRequest#onRetry`, causing duplicates in the append-only use case.

This commit fixes this and also the test which missed the check. I also added an assertion which would have helped finding the source of the duplicates.

This was discovered by https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-unix-compatibility/os=opensuse/174/

Relates #20211
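The shape of the fix can be sketched as follows; the types here are hypothetical simplifications, while `onRetry` is the real `ReplicationRequest` hook the commit message mentions:

```java
// Sketch: before resending a request on a new cluster state, flag it as a
// retry so the engine treats it as a possible duplicate and de-optimizes.
class RetryOnNewClusterState {
    interface Request { void onRetry(); }

    void retry(Request request, Runnable resend) {
        request.onRetry(); // the previously missing call
        resend.run();      // resend the request under the new cluster state
    }
}
```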
bleskes added a commit that referenced this pull request Nov 1, 2016

bleskes added a commit that referenced this pull request Nov 1, 2016
@clintongormley clintongormley added :Distributed/Distributed A catch all label for anything in the Distributed Area. If you aren't sure, use this one. :Distributed/Engine Anything around managing Lucene and the Translog in an open shard. and removed :Engine :Distributed/Distributed A catch all label for anything in the Distributed Area. If you aren't sure, use this one. labels Feb 13, 2018
6 participants