Faster bucket search in ByteBufferHashTable #18952

puzpuzpuz · 2026-01-25T21:17:26Z

Description

Adds hash code comparison for large enough keys to ByteBufferHashTable#findBucket(). Also, changes key comparison to use long/int/byte instead of byte-only comparison (thus, the comparison is now closer to HashTableUtils#memoryEquals() used in MemoryOpenHashTable). These changes are aimed to speed-up bucket search in ByteBufferHashTable, especially in high-collision cases.

Microbenchmarks

Environment: Ryzen 7900x, Ubuntu 24.04, OpenJDK 64-bit 17.0.17

Before:

Benchmark                                (keySize)  Mode  Cnt   Score   Error  Units
ByteBufferHashTableBenchmark.findBucket          8  avgt    5  19.965 ± 0.212  ns/op
ByteBufferHashTableBenchmark.findBucket         16  avgt    5  26.816 ± 1.282  ns/op
ByteBufferHashTableBenchmark.findBucket         32  avgt    5  36.174 ± 0.337  ns/op
ByteBufferHashTableBenchmark.findBucket         64  avgt    5  49.581 ± 0.482  ns/op
ByteBufferHashTableBenchmark.findBucket        128  avgt    5  72.990 ± 1.429  ns/op

After:

Benchmark                                (keySize)  Mode  Cnt   Score   Error  Units
ByteBufferHashTableBenchmark.findBucket          8  avgt    5   5.502 ± 0.338  ns/op
ByteBufferHashTableBenchmark.findBucket         16  avgt    5  11.830 ± 0.046  ns/op
ByteBufferHashTableBenchmark.findBucket         32  avgt    5  15.965 ± 0.135  ns/op
ByteBufferHashTableBenchmark.findBucket         64  avgt    5  20.522 ± 0.069  ns/op
ByteBufferHashTableBenchmark.findBucket        128  avgt    5  29.035 ± 1.806  ns/op

Release note

Speed-up bucket search in hash table used by GROUP BY

Key changed/added classes in this PR

ByteBufferHashTable

This PR has:

jtuglu1 · 2026-01-25T21:32:35Z

Thanks! Can we include a performance test (see benchmarks folder)?

puzpuzpuz · 2026-01-27T19:16:45Z

Thanks! Can we include a performance test (see benchmarks folder)?

@jtuglu1 done in 85869a8 - the measurements on my box are in the benchmark description. I've also made hash code check mandatory (previously it was disabled for keys <= 8 bytes).

puzpuzpuz · 2026-01-27T20:24:37Z

Sorry, I accidentally broke the empty bucket check with the earlier commit - fixed it in 2d9520f. Also updated the benchmark results with the measurements obtained on the latest commit.

puzpuzpuz · 2026-01-27T20:49:18Z

More updates. I've noticed that the change introduced implicit endianess dependency since it now checks an int for the empty bit instead of the previous single byte check. This should be fixed in 952fab3 and 7ea3339. No more changes from my side.

gianm

The changes look good to me. Thank you for including a benchmark as well.

jtuglu1

Thank you! Left 2 small, non-blocking comments.

jtuglu1 · 2026-01-28T06:48:11Z

processing/src/main/java/org/apache/druid/query/groupby/epinephelinae/ByteBufferHashTable.java

+      final int storedHashWithUsedFlag = targetTableBuffer.getInt(bucketOffset);

-      if ((targetTableBuffer.get(bucketOffset) & 0x80) == 0) {
+      if ((storedHashWithUsedFlag & 0x80000000) == 0) {


nit: while we're here, can we name this mask? It's used in other places below (byte-level mask) and makes it easier to read potentially.

Addressed in 5fb014f

jtuglu1 · 2026-01-28T06:53:13Z

processing/src/main/java/org/apache/druid/query/groupby/epinephelinae/ByteBufferHashTable.java

+  )
+  {
+    // Compare 8 bytes at a time
+    while (length >= Long.BYTES) {


Maybe we can save a comparison by switching to a do/while loop since I believe length will always be ≥ 8. This likely will not show up in the benchmark, however. Unfortunately we cannot do something like [[likely]] in Java I don't think.

I'd rather keep the method correct in the face of smaller keys, if this ever changes in the future. Also, if the keys are always >=8, the first branch will be always taken, so CPU's branch predictor should make it very cheap.

puzpuzpuz · 2026-01-28T20:50:37Z

@jtuglu1 @gianm thanks for the reviews!

FrankChen021 · 2026-01-29T02:02:16Z

@puzpuzpuz Thanks for the change. I'm wondering how much does this change improve(like the CPU usage) in a real cluster?

puzpuzpuz · 2026-01-29T10:44:11Z

@puzpuzpuz Thanks for the change. I'm wondering how much does this change improve(like the CPU usage) in a real cluster?

This is a small change and unlikely it's a significant bottleneck in typical workloads, but I'm guessing here. BTW are there any public benchmarks in which Druid actively participates? If so, it's a good idea to check those to be able to make more educated optimizations (if any required/possible).

Faster bucket search in ByteBufferHashTable

7cc4eab

jtuglu1 self-requested a review January 26, 2026 06:16

Add benchmark

85869a8

Fix wrong check

2d9520f

puzpuzpuz added 2 commits January 27, 2026 22:33

Fix endianess dependency when clearing used bits

952fab3

Apply the same fix to LimitedBufferHashGrouper's method overrides

7ea3339

gianm approved these changes Jan 28, 2026

View reviewed changes

jtuglu1 approved these changes Jan 28, 2026

View reviewed changes

Address nit comment

5fb014f

jtuglu1 merged commit 1b5a85d into apache:master Jan 28, 2026
40 checks passed

puzpuzpuz deleted the puzpuzpuz_faster_bucket_find branch January 28, 2026 20:50

Faster bucket search in ByteBufferHashTable #18952

Faster bucket search in ByteBufferHashTable #18952

Uh oh!

Conversation

puzpuzpuz commented Jan 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Microbenchmarks

Release note

Key changed/added classes in this PR

Uh oh!

jtuglu1 commented Jan 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

puzpuzpuz commented Jan 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

puzpuzpuz commented Jan 27, 2026

Uh oh!

puzpuzpuz commented Jan 27, 2026

Uh oh!

gianm left a comment

Choose a reason for hiding this comment

Uh oh!

jtuglu1 left a comment

Choose a reason for hiding this comment

Uh oh!

jtuglu1 Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

puzpuzpuz Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

jtuglu1 Jan 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

puzpuzpuz Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

puzpuzpuz commented Jan 28, 2026

Uh oh!

FrankChen021 commented Jan 29, 2026

Uh oh!

puzpuzpuz commented Jan 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

puzpuzpuz commented Jan 25, 2026 •

edited

Loading

jtuglu1 commented Jan 25, 2026 •

edited

Loading

puzpuzpuz commented Jan 27, 2026 •

edited

Loading

jtuglu1 Jan 28, 2026 •

edited

Loading