Reduce scope of compression dictionary to single SST #4952
Conversation
This does not include charging memory usage to block cache. I need to catch up on testing before adding another feature. Hopefully we can add that feature in a separate PR.
@ajkr has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
I am not sure about an automated test. We can try something like the following, though I find it hard to imagine it'll be worth its complexity/maintenance.
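(The snippet this comment refers to is not preserved in this page. As an illustration only, and not the author's actual proposal, the idea such a test would check -- that a dictionary trained on an SST's own blocks shrinks similar blocks -- can be sketched self-contained, with a toy prefix-matching "compressor" standing in for zstd:)

```cpp
// Toy stand-in for dictionary compression: bytes matching the dictionary's
// prefix collapse to a one-byte back-reference marker. Real zstd dictionary
// compression is far richer; this only models "shared context helps".
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

size_t CompressedSize(const std::string& data, const std::string& dict) {
  size_t match = 0;
  while (match < data.size() && match < dict.size() &&
         data[match] == dict[match]) {
    ++match;
  }
  size_t literals = data.size() - match;
  return (match > 0 ? 1 : 0) + literals;  // marker byte + remaining literals
}

// The property a real test would assert via SST file sizes: with a per-SST
// dictionary "trained" on the SST's own blocks, similar blocks compress
// smaller than with no dictionary at all.
bool DictionaryHelps(const std::vector<std::string>& blocks) {
  const std::string& dict = blocks.front();  // toy "training": first block
  size_t with_dict = 0, without_dict = 0;
  for (const auto& b : blocks) {
    with_dict += CompressedSize(b, dict);
    without_dict += CompressedSize(b, "");
  }
  return with_dict < without_dict;
}
```

A real RocksDB test would instead set `CompressionOptions::max_dict_bytes`, write similar keys into one SST, and compare on-disk file sizes with and without the dictionary.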
LGTM except for a few minor comments. Thanks @ajkr for the improvement.
@ajkr has updated the pull request (from 67c5b15 to 17ce894).
Our previous approach was to train one compression dictionary per compaction, using the first output SST to train a dictionary, and then applying it on subsequent SSTs in the same compaction. While this was great for minimizing CPU/memory/I/O overhead, it did not achieve good compression ratios in practice. In our most promising potential use case, moderate reductions in a dictionary's scope make a major difference on compression ratio.
So, this PR changes the compression dictionary to be scoped per-SST, accepting the tradeoff of higher memory and CPU usage during table building. Important changes include:

- `BlockBasedTableBuilder` has a new state when dictionary compression is in use: `kBuffered`. In that state it accumulates uncompressed data in memory whenever `Add` is called.
- Upon `BlockBasedTableBuilder::Finish`, a `BlockBasedTableBuilder` moves to the `kUnbuffered` state. The transition (`EnterUnbuffered()`) involves sampling the buffered data, training a dictionary, and compressing/writing out all buffered data. In the `kUnbuffered` state, a `BlockBasedTableBuilder` behaves the same as before -- blocks are compressed/written out as soon as they fill up.
- Training inputs are bounded by `max_dict_bytes` or `zstd_max_train_bytes`. The dictionary trainer is supposed to work better when we pass it real units of compression. Previously we were passing 64-byte KV samples, which was not realistic.

Test Plan: with `max_dict_bytes` and `zstd_max_train_bytes` set:
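(The `kBuffered`/`kUnbuffered` flow described above can be sketched as a self-contained toy model. The state and method names mirror the PR; the buffering, block sizing, "training", and "compression" here are simplified stand-ins, not RocksDB's actual `BlockBasedTableBuilder`:)

```cpp
// Toy model of the two-phase table builder: buffer uncompressed blocks,
// then train a dictionary and flush everything on Finish().
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

class ToyTableBuilder {
 public:
  enum class State { kBuffered, kUnbuffered };

  explicit ToyTableBuilder(size_t block_size) : block_size_(block_size) {}

  // In kBuffered, Add() only accumulates uncompressed data in memory.
  void Add(const std::string& kv) {
    current_block_ += kv;
    if (current_block_.size() >= block_size_) FlushBlock();
  }

  // Finish() drives the kBuffered -> kUnbuffered transition.
  void Finish() {
    if (!current_block_.empty()) FlushBlock();
    EnterUnbuffered();
  }

  State state() const { return state_; }
  const std::string& dictionary() const { return dictionary_; }
  size_t written_blocks() const { return written_.size(); }

 private:
  void FlushBlock() {
    buffered_blocks_.push_back(current_block_);
    current_block_.clear();
  }

  void EnterUnbuffered() {
    // Sample whole data blocks -- the PR's point about passing the trainer
    // real units of compression rather than 64-byte KV fragments. Toy
    // "training": just take the first block as the dictionary.
    if (!buffered_blocks_.empty()) dictionary_ = buffered_blocks_.front();
    // Compress/write out all buffered data (stand-in for real compression).
    for (const auto& block : buffered_blocks_) {
      written_.push_back("compressed(" + block + ")");
    }
    buffered_blocks_.clear();
    state_ = State::kUnbuffered;
    // From here on, blocks would be compressed/written as they fill up.
  }

  State state_ = State::kBuffered;
  size_t block_size_;
  std::string current_block_;
  std::vector<std::string> buffered_blocks_;
  std::vector<std::string> written_;
  std::string dictionary_;
};
```

The memory/CPU tradeoff the description mentions is visible here: nothing is written until `Finish()`, so the whole SST's uncompressed data is held in `buffered_blocks_` at once.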