Performance improvements to use read lock to access LRUQueryCache #13306

Open · wants to merge 9 commits into base: main

Conversation

@boicehuang commented Apr 15, 2024

Elasticsearch (which is based on Lucene) can automatically infer field types for users with its dynamic mapping feature. When users index low-cardinality fields such as gender / age / status, they often use numbers to represent the values, so ES infers these fields as long, and ES indexes long fields with a BKD tree.

As #541 points out, when the data volume grows, building the result set for low-cardinality fields drives CPU usage and load very high, even when the low-cardinality fields appear only in the filter clauses of a boolean query.

One reason is that access to LRUQueryCache is serialized through a single ReentrantLock. Under high QPS with expensive queries, attempts to acquire that lock when reading the cache frequently fail, resulting in low concurrency on the cache.

So I replaced the ReentrantLock with a ReentrantReadWriteLock, taking only the read lock when fetching the cached result for a query.
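
A minimal sketch of the locking scheme (hypothetical names; the real LRUQueryCache is more involved): lookups share the read lock, while anything that mutates the cache takes the exclusive write lock.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical sketch only; not the actual LRUQueryCache code.
class ReadMostlyCache<K, V> {
    private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
    private final Map<K, V> cache = new HashMap<>();

    V get(K key) {
        lock.readLock().lock(); // shared: many readers may hold this at once
        try {
            return cache.get(key);
        } finally {
            lock.readLock().unlock();
        }
    }

    void put(K key, V value) {
        lock.writeLock().lock(); // exclusive: blocks both readers and writers
        try {
            cache.put(key, value);
        } finally {
            lock.writeLock().unlock();
        }
    }
}
```

As the review discussion below points out, sharing the read lock is only safe if the get path does not itself mutate any shared structure.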

I benchmarked this optimization by generating random LongPoint values and querying them with a single PointInSetQuery inside a boolean filter clause.

| doc count | field cardinality | query terms count | baseline QPS | candidate QPS | diff percentage |
|---|---|---|---|---|---|
| 30000000 | 10 | 1 | 2481 | 5102 | 105.6% |
| 30000000 | 1000000 | 1 | 6396 | 6596.48 | 3.1% |

I think this change can help filter queries over low-cardinality fields.

@boicehuang (Author):

This optimization also benefits high-cost queries, such as a terms query with 10000 terms, by answering from the cache more often instead of searching the inverted index.

| doc count | field cardinality | query terms count | baseline QPS | candidate QPS | diff percentage |
|---|---|---|---|---|---|
| 30000000 | 1000000 | 10000 | 160 | 473 | 191.9% |

```diff
@@ -265,7 +269,6 @@ boolean requiresEviction() {
   }

   CacheAndCount get(Query key, IndexReader.CacheHelper cacheHelper) {
-    assert lock.isHeldByCurrentThread();
     assert key instanceof BoostQuery == false;
     assert key instanceof ConstantScoreQuery == false;
     final IndexReader.CacheKey readerKey = cacheHelper.getKey();
```
Member:

As with anything multi-threaded, this is tricky. The LRU behavior is actually implemented via uniqueQueries, which is a LinkedHashMap constructed with new LinkedHashMap<>(16, 0.75f, true);

This means a get against it actually does an UPDATE to the LinkedHashMap's order. If multiple readers attempt this at the same time, I would expect the ordering to become indeterminate.

From the Java docs for LinkedHashMap:

> Note that this implementation is not synchronized. If multiple threads access a linked hash map concurrently, and at least one of the threads modifies the map structurally, it *must* be synchronized externally. This is typically accomplished by synchronizing on some object that naturally encapsulates the map.

Since ordering is dictated at access time, not at insertion time, we modify the map structurally on read.
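
A standalone illustration of that structural mutation (a hypothetical example, not Lucene code):

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class AccessOrderDemo {
    public static void main(String[] args) {
        // accessOrder = true: iteration order is least-recently-accessed first
        Map<String, Integer> lru = new LinkedHashMap<>(16, 0.75f, true);
        lru.put("a", 1);
        lru.put("b", 2);
        lru.put("c", 3);

        lru.get("a"); // a plain read moves "a" to the most-recently-used slot

        // Prints [b, c, a]: the get() relinked the internal list, i.e. it
        // modified the map structurally even though no entry was added.
        System.out.println(lru.keySet());
    }
}
```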

Member:

So Collections.synchronizedMap should be used, or we should lock manually around this particular get.

And that means we are locking on read again, which might mean all the performance gains go away.

@boicehuang (Author):

Can we change the LinkedHashMap to a ConcurrentHashMap?

@boicehuang (Author):

I changed both cache and uniqueQueries to ConcurrentHashMap objects and got the following benchmark results.

| doc count | field cardinality | query points count | baseline QPS | candidate QPS | diff percentage |
|---|---|---|---|---|---|
| 30000000 | 10 | 1 | 2481 | 5102 | 105.6% |
| 30000000 | 10 | 1 | 2481 | 4843 | 95.2% (using LongAdder) |
| 30000000 | 10 | 1 | 2481 | 3851 | 55% (using LongAdder and ConcurrentHashMap) |

@boicehuang (Author):

With ConcurrentHashMap, the order of insertion operations becomes nondeterministic, which leads to test failures, but insertions are thread-safe and the change still shows a 55% improvement.

@boicehuang (Author):

I agree, but IdentityHashMap is not thread-safe.

Member:

> I agree, but IdentityHashMap is not thread-safe.

I may be wrong, but I thought that if writing to the IdentityHashMap is protected (which it is, via the write lock, which blocks the readers), then the map is never mutated during a read and writes are fully synchronized. Consequently, it's perfectly fine to read from multiple threads.

@boicehuang (Author):

In my optimization, uniqueQueries is a Collections.synchronizedMap(LinkedHashMap) to preserve LRU eviction. Writes to the cache object are protected by the write lock, but reads of the cache object can run concurrently because the cache object is a ConcurrentHashMap. This is the key point of my optimization. I am also working through all the concurrency paths.
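
A sketch of the layout being described (illustrative names and types, not the PR's actual fields):

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Illustrative sketch of the described data layout.
class CacheLayoutSketch<Q, K, V> {
    // LRU ordering relies on the access-ordered LinkedHashMap, whose get()
    // mutates the map, so it keeps its own synchronization wrapper.
    final Map<Q, Q> uniqueQueries =
        Collections.synchronizedMap(new LinkedHashMap<>(16, 0.75f, true));

    // Reads of cached entries can proceed concurrently because the backing
    // map is a ConcurrentHashMap; writes still take the write lock.
    final Map<K, V> cache = new ConcurrentHashMap<>();
}
```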

@boicehuang (Author):

I think we still need IdentityHashMap to clear the cache of expired queries.

Member:

The write lock will block the read lock (and the read lock will block the write lock), meaning structural changes to the IdentityHashMap are protected. It doesn't need to be wrapped in Collections.synchronizedMap. In fact, the Java docs for this lock contain an example of exactly this, but for a TreeMap: https://docs.oracle.com/javase/8/docs/api/java/util/concurrent/locks/ReentrantReadWriteLock.html
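
Condensed from that javadoc example (the original uses a Data value type; simplified to String here): a plain, unsynchronized TreeMap is safe to read from many threads as long as every mutation holds the write lock and every read holds the read lock.

```java
import java.util.Map;
import java.util.TreeMap;
import java.util.concurrent.locks.Lock;
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Condensed from the ReentrantReadWriteLock javadoc example.
class RWDictionary {
    private final Map<String, String> m = new TreeMap<>();
    private final ReentrantReadWriteLock rwl = new ReentrantReadWriteLock();
    private final Lock r = rwl.readLock();
    private final Lock w = rwl.writeLock();

    public String get(String key) {
        r.lock();
        try {
            return m.get(key); // safe: writers are excluded while we read
        } finally {
            r.unlock();
        }
    }

    public String put(String key, String value) {
        w.lock();
        try {
            return m.put(key, value); // exclusive access for mutation
        } finally {
            w.unlock();
        }
    }
}
```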

```diff
@@ -628,7 +632,7 @@ private class LeafCache implements Accountable {

     LeafCache(Object key) {
       this.key = key;
-      cache = new IdentityHashMap<>();
+      cache = Collections.synchronizedMap(new IdentityHashMap<>());
```
Member:

I don't understand this. Aren't all accesses to LeafCache protected by the LRUQueryCache read/write locks?

Since LeafCache isn't a static class, it has access to the enclosing class's lock.

For testing safety, putIfAbsent, remove, onDocIdSetCache, and onDocIdSetEviction should all do an assert writeLock.isHeldByCurrentThread();
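
A sketch of what that guard could look like (hypothetical names, not the actual LeafCache): ReentrantReadWriteLock.WriteLock exposes isHeldByCurrentThread(), so mutating methods can assert that the caller already holds the write lock.

```java
import java.util.IdentityHashMap;
import java.util.Map;
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical sketch of the suggested guard; not the actual LeafCache.
class LeafCacheSketch {
    private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
    private final Map<Object, Object> cache = new IdentityHashMap<>();

    void putIfAbsent(Object key, Object value) {
        // Fails fast in tests if a caller mutates without the write lock.
        assert lock.writeLock().isHeldByCurrentThread();
        cache.putIfAbsent(key, value);
    }
}
```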

@boicehuang (Author):

Sorry, I misunderstood this before. Since there is a read lock, concurrent access to the IdentityHashMap does not require an additional synchronized wrapper. I've updated the code per your suggestion.

@boicehuang (Author):

[Image: LRUCache optimization structure 4]

@benwtrent (Member):

@boicehuang what are the new benchmark numbers for the current iteration? Indeed we will be synchronizing more, so I wonder whether we will still see an improvement.
