Add new heauristic 'num_collapsible_entry_reads_sampled' by joshkang97 · Pull Request #14434 · facebook/rocksdb

joshkang97 · 2026-03-06T21:29:06Z

Summary

Add per-file sampling of "collapsible" entry reads (single deletions, merges, and kNotFound results) that may later be used to help inform read-triggered compactions. This is a better metric than num_reads_sampled as it is more targeted towards reads that could be avoided via compaction.

The existing behavior of num_reads_sampled is that reads only gets sampled on iterator creation for a file. It is problematic because next/prev() calls are not sampled, nor are additional seeks().

This PR moves sampling to per-seek/next granularity within LevelIterator and adds a new num_collapsible_entry_reads_sampled counter that tracks how often a file serves entries that could be eliminated by compaction.

Note only L1+ files have iterator seeks/nexts/prevs sampled. Introducing this at L0 would require wrapping table reader iterators, introducing a performance cost.

Key changes

New counter num_collapsible_entry_reads_sampled in FileSampledStats tracks sampled reads that encounter deletions, single deletions, merges, or kNotFound results in both Get and Iterator paths.
Moved sampling from file-open to per-operation in LevelIterator: sampling now happens in SampleRead() called from Seek(), SeekForPrev(), SeekToFirst(), SeekToLast(), Next(), NextAndGetResult(), and Prev(). The should_sample parameter was removed from LevelIterator's constructor.
Differentiated sampling rate for Next() vs Seek(): should_sample_file_read_next() uses a 64x lower sampling rate (kFileReadSampleRate * 64) since Next() is cheaper than Seek() and called more frequently.
Collapsible tracking in Get path: Version::Get() now increments the collapsible counter when GetContext::State() is kNotFound, kMerge, or kDeleted.
Collapsible tracking in MultiGet path: MultiGetFromSST also increments the collapsible counter for the same states.

Test Plan

Added new DB tests for both num_reads_sampled and num_collapsible_entry_reads_sampled

Benchmark results (readrandom, readseq)

Setup: 1M keys, 16-byte keys, 100-byte values, no compression, fillrandom+compact

Benchmark	Params	ops/s (main)	ops/s (feature)	% change
readrandom	seed=1, threads=1	387,194	389,449	+0.6%
readseq	seed=1, threads=1	5,598,371	5,572,975	-0.5%

No meaningful performance regression observed — differences are within run-to-run noise.

github-actions · 2026-03-06T21:36:05Z

✅ clang-tidy: No findings on changed lines

Completed in 291.3s.

meta-codesync · 2026-03-06T22:24:02Z

@joshkang97 has imported this pull request. If you are a Meta employee, you can view this in D95613793.

meta-codesync · 2026-03-06T23:12:30Z

@joshkang97 has imported this pull request. If you are a Meta employee, you can view this in D95613793.

xingbowang

Looks good overall.
I don't see num_collapsible_entry_reads_sampled is used. I guess there is a follow up change to leverage it in the compaction logic.

xingbowang · 2026-03-09T19:16:15Z

+  // Decrease probability of sampling next() to discount it as it is cheaper
+  // than seek()


To be more accurate, it is more like next() is called a lot more frequent than seek. Therefore, we want to lower the sampling rate to avoid introducing performance penalty. I guess this is why we use a counter instead of random to make sampling decision as well.

The actual idea is to decrease probability but keep the amount the same. The reasoning is that the read cost of a seek() is ~64x (very loose estimation) as expensive as a next().

And yep the lowered probability lowers the cost of sample_collapsible_entry_file_read_inc(), and using a counter is more performant than division

xingbowang · 2026-03-09T19:24:14Z

+inline void sample_file_read_inc(const FileMetaData* meta) {
  meta->stats.num_reads_sampled.fetch_add(kFileReadSampleRate,
                                          std::memory_order_relaxed);
 }
+
+inline void sample_collapsible_entry_file_read_inc(const FileMetaData* meta) {
+  meta->stats.num_collapsible_entry_reads_sampled.fetch_add(
+      kFileReadSampleRate, std::memory_order_relaxed);
+}


Should we increase different amount based on whether it is seek or next, as their sampling rate is different.

Replied in my other comment

meta-codesync · 2026-03-09T23:46:21Z

@joshkang97 merged this pull request in 42eff8b.

) Summary: Add per-file sampling of "collapsible" entry reads (single deletions, merges, and kNotFound results) that may later be used to help inform read-triggered compactions. This is a better metric than `num_reads_sampled` as it is more targeted towards reads that could be avoided via compaction. The existing behavior of `num_reads_sampled` is that reads only gets sampled on iterator creation for a file. It is problematic because next/prev() calls are not sampled, nor are additional seeks(). This PR moves sampling to per-seek/next granularity within `LevelIterator` and adds a new `num_collapsible_entry_reads_sampled` counter that tracks how often a file serves entries that could be eliminated by compaction. Note only L1+ files have iterator seeks/nexts/prevs sampled. Introducing this at L0 would require wrapping table reader iterators, introducing a performance cost. ## Key changes - **New counter `num_collapsible_entry_reads_sampled`** in `FileSampledStats` tracks sampled reads that encounter deletions, single deletions, merges, or kNotFound results in both Get and Iterator paths. - **Moved sampling from file-open to per-operation** in `LevelIterator`: sampling now happens in `SampleRead()` called from `Seek()`, `SeekForPrev()`, `SeekToFirst()`, `SeekToLast()`, `Next()`, `NextAndGetResult()`, and `Prev()`. The `should_sample` parameter was removed from `LevelIterator`'s constructor. - **Differentiated sampling rate for Next() vs Seek()**: `should_sample_file_read_next()` uses a 64x lower sampling rate (`kFileReadSampleRate * 64`) since Next() is cheaper than Seek() and called more frequently. - **Collapsible tracking in Get path**: `Version::Get()` now increments the collapsible counter when `GetContext::State()` is `kNotFound`, `kMerge`, or `kDeleted`. - **Collapsible tracking in MultiGet path**: `MultiGetFromSST` also increments the collapsible counter for the same states. Pull Request resolved: facebook#14434 Test Plan: - Added new DB tests for both num_reads_sampled and num_collapsible_entry_reads_sampled ### Benchmark results (readrandom, readseq) Setup: 1M keys, 16-byte keys, 100-byte values, no compression, fillrandom+compact | Benchmark | Params | ops/s (main) | ops/s (feature) | % change | |------------|--------------------|-------------|--------------------------|----------| | readrandom | seed=1, threads=1 | 387,194 | 389,449 | +0.6% | | readseq | seed=1, threads=1 | 5,598,371 | 5,572,975 | -0.5% | No meaningful performance regression observed — differences are within run-to-run noise. Reviewed By: xingbowang Differential Revision: D95613793 Pulled By: joshkang97 fbshipit-source-id: 9dd09c9b7527b148424bde5686f4157c7a9e1214

joshkang97 added 3 commits March 6, 2026 11:52

init

1e27972

Merge remote-tracking branch 'upstream/main' into unwanted_io_stats

411f2bf

fix

a535fda

meta-cla Bot added the CLA Signed label Mar 6, 2026

joshkang97 requested review from hx235, pdillinger and xingbowang March 6, 2026 21:29

joshkang97 marked this pull request as ready for review March 6, 2026 22:23

format

e67df3f

xingbowang approved these changes Mar 9, 2026

View reviewed changes

meta-codesync Bot closed this in 42eff8b Mar 9, 2026

facebook-github-bot added the Merged label Mar 9, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add new heauristic 'num_collapsible_entry_reads_sampled' #14434

Add new heauristic 'num_collapsible_entry_reads_sampled' #14434
joshkang97 wants to merge 4 commits into
facebook:mainfrom
joshkang97:unwanted_io_stats

joshkang97 commented Mar 6, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Mar 6, 2026 •

edited

Loading

Uh oh!

meta-codesync Bot commented Mar 6, 2026

Uh oh!

meta-codesync Bot commented Mar 6, 2026

Uh oh!

xingbowang left a comment

Uh oh!

xingbowang Mar 9, 2026

Uh oh!

joshkang97 Mar 9, 2026 •

edited

Loading

Uh oh!

xingbowang Mar 9, 2026

Uh oh!

joshkang97 Mar 9, 2026

Uh oh!

meta-codesync Bot commented Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		// Decrease probability of sampling next() to discount it as it is cheaper
		// than seek()

Conversation

joshkang97 commented Mar 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Key changes

Test Plan

Benchmark results (readrandom, readseq)

Uh oh!

github-actions Bot commented Mar 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ clang-tidy: No findings on changed lines

Uh oh!

meta-codesync Bot commented Mar 6, 2026

Uh oh!

meta-codesync Bot commented Mar 6, 2026

Uh oh!

xingbowang left a comment

Choose a reason for hiding this comment

Uh oh!

xingbowang Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

joshkang97 Mar 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

xingbowang Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

joshkang97 Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

meta-codesync Bot commented Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

joshkang97 commented Mar 6, 2026 •

edited

Loading

github-actions Bot commented Mar 6, 2026 •

edited

Loading

joshkang97 Mar 9, 2026 •

edited

Loading