
Reduce overhead of SortFileByOverlappingRatio() #10161

Closed
wants to merge 3 commits

Conversation

siying
Contributor

@siying siying commented Jun 14, 2022

Summary: Currently SortFileByOverlappingRatio() is O(n log n). That is usually fine, but when there are a lot of files in an LSM-tree, SortFileByOverlappingRatio() can take a non-trivial amount of time. The problem is severe when the user is loading keys in sorted order, where compaction consists only of trivial moves and this operation becomes the bottleneck that limits total throughput. This commit makes SortFileByOverlappingRatio() find only the top 50 files based on score. 50 files are usually enough for the parallel compactions needed at the level, and if they are not enough, we fall back to random order, which should be acceptable.

Test Plan:
Ran a fillseq workload that generates a lot of files and observed improved throughput (although stalls are not yet eliminated). The command run:

TEST_TMPDIR=/dev/shm/ ./db_bench_sort --benchmarks=fillseq --compression_type=lz4 --write_buffer_size=5000000 --num=100000000 --value_size=1000

The throughput improved by 11%.
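
To make the idea concrete, here is a minimal sketch of replacing the full sort with a top-k selection, assuming a simple per-file score (FileAndScore, OrderTopCandidates, and the comparator are illustrative names, not the actual RocksDB code):

#include <algorithm>
#include <cstddef>
#include <vector>

// Illustrative only: FileAndScore and OrderTopCandidates are hypothetical
// names, not RocksDB identifiers.
struct FileAndScore {
  std::size_t file_index;
  double score;  // e.g. overlapping ratio; lower means a better candidate
};

// Order only the best top_n candidates instead of sorting all n entries.
// Average cost drops from O(n log n) to roughly O(n + k log k); files beyond
// the first top_n stay in arbitrary order, which is the "fall back to random"
// case described above.
void OrderTopCandidates(std::vector<FileAndScore>* temp, std::size_t top_n) {
  const std::size_t num_to_sort = std::min(top_n, temp->size());
  auto comp_func = [](const FileAndScore& a, const FileAndScore& b) {
    return a.score < b.score;
  };
  std::nth_element(temp->begin(), temp->begin() + num_to_sort, temp->end(),
                   comp_func);
  std::sort(temp->begin(), temp->begin() + num_to_sort, comp_func);
}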

@facebook-github-bot
Contributor

@siying has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

Comment on lines 3234 to 3236
// We don't pick last level files based on compaction priority,
// so we don't need to do the sorting.
if (level != num_levels() - 1) {
Contributor

Contributor Author

Ahh. Got it.

// need to pick many files, so we limit files for this partial order.
// In case we use up all the sorted files, we essentially pick files
// in random order, but it should be rare.
const size_t kTotalSortedElements = 8;
Contributor

For kByCompensatedSize, it sorts the top n (50) files:
https://github.com/facebook/rocksdb/blob/48ce44240c398850a752d13d1a27c9849ae6fad2/db/version_set.cc#L3229-L3233

Can we reuse that and apply it to both kByCompensatedSize and kMinOverlappingRatio, or even to all the other strategies?
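
One possible shape for that reuse, sketched under the assumption of a generic element type and a strategy-specific comparator (SortTopNFiles is a hypothetical helper, not existing RocksDB code):

#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical shared helper: each compaction_pri strategy passes its own
// comparator, so the "order only the top n files" logic lives in one place.
template <typename T, typename Comparator>
void SortTopNFiles(std::vector<T>* temp, std::size_t top_n, Comparator cmp) {
  const std::size_t num_to_sort = std::min(top_n, temp->size());
  // Orders the best num_to_sort entries; the tail is left unordered.
  std::partial_sort(temp->begin(), temp->begin() + num_to_sort, temp->end(),
                    cmp);
}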

Comment on lines 3202 to 3204
std::nth_element(temp->begin(), temp->begin() + num_to_sort, temp->end(),
comp_func);
std::sort(temp->begin(), temp->begin() + num_to_sort, comp_func);
Contributor

I think std::partial_sort() would be better.
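
For comparison, a small sketch of the two forms with a generic element type and comparator (not the RocksDB code itself); both leave the best num_to_sort elements sorted at the front and the remaining elements unordered:

#include <algorithm>
#include <cstddef>
#include <vector>

// Two-step form from the diff: nth_element places the cutoff element, then
// std::sort fully orders the prefix in front of it.
template <typename T, typename Comp>
void TwoStepForm(std::vector<T>* temp, std::size_t num_to_sort,
                 Comp comp_func) {
  std::nth_element(temp->begin(), temp->begin() + num_to_sort, temp->end(),
                   comp_func);
  std::sort(temp->begin(), temp->begin() + num_to_sort, comp_func);
}

// Single-call form suggested above: std::partial_sort reaches the same end
// state for the prefix in one call.
template <typename T, typename Comp>
void PartialSortForm(std::vector<T>* temp, std::size_t num_to_sort,
                     Comp comp_func) {
  std::partial_sort(temp->begin(), temp->begin() + num_to_sort, temp->end(),
                    comp_func);
}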

Summary: Currently SortFileByOverlappingRatio() is O(n log n). That is usually fine, but when there are a lot of files in an LSM-tree, SortFileByOverlappingRatio() can take a non-trivial amount of time. The problem is severe when the user is loading keys in sorted order, where compaction consists only of trivial moves and this operation becomes the bottleneck that limits total throughput. This commit does two things:
(1) SortFileByOverlappingRatio() now finds only the top 8 files based on score. 8 files are usually enough for the parallel compactions needed at the level, and if they are not enough, we fall back to random order, which should be acceptable.
(2) Don't sort files in the last level.

Test Plan:
Ran a fillseq workload that generates a lot of files and observed improved throughput (although stalls are not yet eliminated). The command run:

TEST_TMPDIR=/dev/shm/ ./db_bench_sort --benchmarks=fillseq --compression_type=lz4 --write_buffer_size=5000000 --num=100000000 --value_size=1000

The throughput improved by 11%.
@facebook-github-bot
Contributor

@siying has updated the pull request. You must reimport the pull request before landing.

@facebook-github-bot
Contributor

@siying has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot
Contributor

@siying has updated the pull request. You must reimport the pull request before landing.

@facebook-github-bot
Contributor

@siying has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

Contributor

@jay-zhuang jay-zhuang left a comment

LGTM
