Reduce contention in DocumentsWriterPerThreadPool. #12199
Conversation
Obtaining a DWPT and putting it back into the pool is subject to contention. This change reduces contention by using 8 sub pools that are tried sequentially. When applied on top of apache#12198, this reduces the time to index geonames with 20 threads from ~19s to ~16-17s.
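The sub-pool idea described above can be sketched roughly as follows. This is a hypothetical, simplified `StripedPool` class, not the actual Lucene implementation: N independent sub pools, each guarded by its own lock, where a thread starts at a stripe derived from its identity and tries the stripes sequentially, so concurrent threads mostly touch different locks.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.concurrent.locks.ReentrantLock;

// Rough sketch of the striping idea, not the actual Lucene code.
final class StripedPool<T> {
  private static final int CONCURRENCY = 8;
  private final ReentrantLock[] locks = new ReentrantLock[CONCURRENCY];
  @SuppressWarnings("unchecked")
  private final Deque<T>[] pools = new Deque[CONCURRENCY];

  StripedPool() {
    for (int i = 0; i < CONCURRENCY; ++i) {
      locks[i] = new ReentrantLock();
      pools[i] = new ArrayDeque<>();
    }
  }

  // Pick a starting stripe from the current thread's identity so that
  // different threads tend to hit different locks first.
  private int startStripe() {
    return (int) (Thread.currentThread().getId() % CONCURRENCY);
  }

  T poll() {
    int start = startStripe();
    for (int i = 0; i < CONCURRENCY; ++i) {
      int stripe = (start + i) % CONCURRENCY;
      ReentrantLock lock = locks[stripe];
      lock.lock();
      try {
        T t = pools[stripe].poll();
        if (t != null) {
          return t;
        }
      } finally {
        lock.unlock();
      }
    }
    return null; // all stripes empty: caller creates a new writer
  }

  void offer(T t) {
    int stripe = startStripe();
    ReentrantLock lock = locks[stripe];
    lock.lock();
    try {
      pools[stripe].push(t);
    } finally {
      lock.unlock();
    }
  }
}
```

Because a thread returns a DWPT to the same stripe it polls from first, this also gives the thread-to-DWPT affinity discussed later in the thread.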
dwpt = newWriter();
ensureOpen();
DocumentsWriterPerThread dwpt = freeList.poll(DocumentsWriterPerThread::tryLock);
if (dwpt == null) {
For some reason, double locking always makes me cringe. There are lengthy discussions about this idiom all around the web (visibility of partially constructed objects).
I don't think that this is a case of double-checked locking. In general, double-checked locking tries to reduce the overhead of acquiring a lock by adding a quick check before taking it. Here the logic is different: it would be legal to remove the call to `poll` under the lock. I only added it because a thread may have had to wait on the lock, so we can check whether another DWPT was added to the queue in the meantime and save creating a new DWPT. I don't think it's actually important; I could remove the second call to `poll` to make it look less like double-checked locking.
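The pattern being described can be sketched in a self-contained form like this (hypothetical names; `Writer` stands in for `DocumentsWriterPerThread`, and a concurrent deque stands in for the free list). The second poll under the lock is an optimization, not a correctness requirement:

```java
import java.util.concurrent.ConcurrentLinkedDeque;

// Simplified sketch of the checkout logic under discussion, not the actual
// Lucene code.
final class WriterPool {
  static final class Writer {}

  private final ConcurrentLinkedDeque<Writer> freeList = new ConcurrentLinkedDeque<>();

  Writer obtain() {
    Writer w = freeList.poll(); // fast path: no pool-wide lock
    if (w == null) {
      synchronized (this) {
        // A writer may have been returned while we waited for the lock,
        // so poll again before paying for a new writer. Unlike
        // double-checked locking, no half-initialized object is ever
        // published to another thread here: w stays local until returned.
        w = freeList.poll();
        if (w == null) {
          w = new Writer();
        }
      }
    }
    return w;
  }

  void release(Writer w) {
    freeList.push(w);
  }
}
```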
It's not about the `poll`. It's about whether `dwpt = newWriter();` can be expanded and reordered by the compiler so that `dwpt` gets assigned a value before the constructor (or whatever initialization happens inside `newWriter`) takes place. Any thread checking `dwpt == null` outside `synchronized` could, at least in theory, see such a "partially constructed" object.
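For context, the hazard being described is the classic double-checked locking bug. A minimal illustration, in its standard corrected form: without `volatile` on the shared field, the write of the reference may become visible to another thread before the constructor's writes, so a racing reader can observe a partially constructed object. Declaring the field `volatile` establishes the happens-before edge that fixes the idiom:

```java
// Standard (corrected) double-checked lazy initialization. Removing
// 'volatile' below reintroduces the unsafe-publication race described above.
final class LazyInit {
  static final class Helper {
    int value;
    Helper() { value = 42; }
  }

  private static volatile Helper instance;

  static Helper get() {
    Helper h = instance;           // first, unsynchronized check
    if (h == null) {
      synchronized (LazyInit.class) {
        h = instance;              // second check, under the lock
        if (h == null) {
          h = new Helper();
          instance = h;            // safe publication thanks to volatile
        }
      }
    }
    return h;
  }
}
```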
Aleksey Shipilëv had a nice writeup about it - found it here: https://shipilev.net/blog/2014/safe-public-construction/
Thanks for sharing this blog post; I remember reading it in the past, and it was a good re-read. I mentioned the `poll` calls because I thought they were what made you think this code is a case of double-checked locking: there is a first call before the lock and another one under the lock, like the null checks in double-checked locking with singletons. I need to head into the weekend; I'll try to post a convincing explanation next week of why this is not a case of double-checked locking and why it is safe. By the way, thanks for looking!
No need to convince me, @jpountz - I just expressed my reluctance at it because, well, it requires convincing. :) Unless there's really a huge gain, I typically just go with what I understand works (making the field `volatile`, for example).
I still have the feeling that you are making incorrect assumptions about what this piece of code is doing (this method itself doesn't publish the object to other threads), but I took this thread as a call to keep things simple, so I removed the retry and added a few more comments about the logic of this pool.
I certainly am! But anyone looking at this code without jumping deep will have the same doubts, I think. Unless there is a convincing (performance) argument that it's worth it, I like the updated version a lot better - it's clear and simple.
Thanks, the code looks much better. I was also bothered by the long chain of "tries" with and without `synchronized`. To me the current code is easier to read.
lucene/core/src/java/org/apache/lucene/index/DocumentsWriterPerThreadPool.java (outdated review threads, resolved)
Thanks for the updates. Sorry for nitpicking; I just prefer code that goes sequentially and uses the pattern "exit the method as soon as a condition is met". This makes it easier to understand. Nice that there's no double locking anymore.
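The style preference mentioned here can be illustrated with a small hypothetical example (not the actual Lucene method): return as soon as the condition is met, rather than tracking state through flags or nested else branches.

```java
final class EarlyReturn {
  // Hypothetical example of the "exit as soon as the condition is met" style.
  static int indexOfFirstNegative(int[] a) {
    for (int i = 0; i < a.length; ++i) {
      if (a[i] < 0) {
        return i; // early return: no flags, no else branches
      }
    }
    return -1; // fall through: condition never met
  }
}
```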
LGTM, just a bunch of nits.
// Only used for assertions
boolean contains(Object o) {
  for (int i = 0; i < CONCURRENCY; ++i) {
nitpick: can you add a check that assertions are enabled?
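A common way to implement such a check is the side-effect-in-assert trick: the assignment inside the `assert` only runs when assertions are enabled, so the flag reflects whether the JVM was started with `-ea`. This is a sketch of the idea, not necessarily what the PR ended up doing:

```java
final class AssertCheck {
  // Returns true iff assertions are enabled: the assert's expression (and
  // thus the assignment) is only evaluated when -ea is active.
  static boolean assertionsEnabled() {
    boolean enabled = false;
    assert enabled = true; // intentional assignment, evaluates to true
    return enabled;
  }

  // Hypothetical guard in the spirit of the nit: fail loudly if an
  // assertions-only helper is ever invoked from a normal code path.
  static boolean onlyCallFromAsserts() {
    if (!assertionsEnabled()) {
      throw new AssertionError("only call this method from an assert");
    }
    return true;
  }
}
```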
// Only used for assertions
boolean contains(Object o) {
  for (int i = 0; i < CONCURRENCY; ++i) {
    locks[i].lock();
This is really a nitpick, but for methods that use a lock I'd prefer to assign the lock to a local variable instead of dereferencing the array again in the finally block.
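The suggested style, sketched on a simplified stand-in for the method under review: read `locks[i]` into a local once, so the `lock()` and the `unlock()` in the finally block visibly refer to the same object.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.concurrent.locks.ReentrantLock;

// Simplified, hypothetical stand-in for the striped pool's contains().
final class Striped {
  private static final int CONCURRENCY = 8;
  final ReentrantLock[] locks = new ReentrantLock[CONCURRENCY];
  final Deque<Object>[] pools;

  @SuppressWarnings("unchecked")
  Striped() {
    pools = new Deque[CONCURRENCY];
    for (int i = 0; i < CONCURRENCY; ++i) {
      locks[i] = new ReentrantLock();
      pools[i] = new ArrayDeque<>();
    }
  }

  boolean contains(Object o) {
    for (int i = 0; i < CONCURRENCY; ++i) {
      ReentrantLock lock = locks[i]; // one array read; clear lock/unlock pairing
      lock.lock();
      try {
        if (pools[i].contains(o)) {
          return true;
        }
      } finally {
        lock.unlock();
      }
    }
    return false;
  }
}
```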
boolean remove(Object o) {
  for (int i = 0; i < CONCURRENCY; ++i) {
    locks[i].lock();
Same here: maybe use a local variable for the lock. It really looks cleaner.
Obtaining a DWPT and putting it back into the pool is subject to contention. This change reduces contention by using 8 sub pools that are tried sequentially. When applied on top of #12198, this reduces the time to index geonames with 20 threads from ~19s to ~16-17s.
I suspect this change is the source of the speedup when indexing vectors on https://home.apache.org/~mikemccand/lucenebench/indexing.html. But maybe it is more because of the introduced affinity between indexing threads and DWPTs than because of reduced contention, since contention generally shows up when indexing is fast, which isn't the case with vectors?
After upgrading Elasticsearch to a recent Lucene snapshot, we observed a few indexing slowdowns when indexing with low numbers of cores. This appears to be due to the fact that we lost too much of the bias towards larger DWPTs in apache#12199. This change tries to add back more ordering by adjusting the concurrency of `DWPTPool` to the number of cores that are available on the local node.
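The follow-up change described above amounts to sizing the pool's concurrency from the core count instead of a fixed constant. A minimal sketch of such a sizing rule (hypothetical; the actual bound and cap chosen in the follow-up PR may differ):

```java
final class PoolSizing {
  // Hypothetical rule: never more stripes than available cores, capped at 8,
  // and at least 1. Fewer stripes on small machines restores more of the
  // bias towards reusing (and thus growing) the same DWPTs.
  static int concurrency() {
    int cores = Runtime.getRuntime().availableProcessors();
    return Math.max(1, Math.min(8, cores));
  }
}
```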
@jpountz Will this change make more small segments than the synchronized version?