c-deps: backport RocksDB range deletion performance fix #26877

benesch · 2018-06-21T05:41:58Z

The current implementation of range deletion tombstones in RocksDB
suffers from a performance bug that causes excessive CPU usage on every
read operation in a database with many range tombstones. Dropping a
large table can easily result in several thousand range deletion
tombstones in one store, resulting in an unusable cluster as documented
in #24029.

Backport a refactoring of range deletion tombstone that fixes the
performance problem. This refactoring has also been proposed upstream as
facebook/rocksdb#4014.

A more minimal change was also proposed in facebook/rocksdb#3992--and
that patch better highlights the exact nature of the bug than the patch
backported here, for those looking to understand the problem. But this
refactoring, though more invasive, gets us one step closer to solving a
related problem where range deletions can cause excessively large
compactions (#26693). These large compactions do not appear to brick the
cluster but undoubtedly have some impact on performance.

Fix #24029.

Release note: None

cockroach-teamcity · 2018-06-21T05:42:05Z

This change is

The current implementation of range deletion tombstones in RocksDB suffers from a performance bug that causes excessive CPU usage on every read operation in a database with many range tombstones. Dropping a large table can easily result in several thousand range deletion tombstones in one store, resulting in an unusable cluster as documented in cockroachdb#24029. Backport a refactoring of range deletion tombstone that fixes the performance problem. This refactoring has also been proposed upstream as facebook/rocksdb#4014. A more minimal change was also proposed in facebook/rocksdb#3992--and that patch better highlights the exact nature of the bug than the patch backported here, for those looking to understand the problem. But this refactoring, though more invasive, gets us one step closer to solving a related problem where range deletions can cause excessively large compactions (cockroachdb#26693). These large compactions do not appear to brick the cluster but undoubtedly have some impact on performance. Fix cockroachdb#24029. Release note: None

benesch · 2018-06-21T20:53:48Z

CI passed so... ?

petermattis · 2018-06-21T21:03:33Z

I'm taking a look.

Review status: complete! 0 of 0 LGTMs obtained

Comments from Reviewable

petermattis · 2018-06-21T21:59:32Z

I left some comments on the upstream PR.

Review status: complete! 0 of 0 LGTMs obtained

Comments from Reviewable

benesch · 2018-07-16T13:46:25Z

This was taken care of by #27520. h/t @petermattis.

benesch requested review from bdarnell, tbg, petermattis and a team June 21, 2018 05:41

benesch force-pushed the rdperf branch from 7990a9d to 7a071e1 Compare June 21, 2018 17:07

benesch closed this Jul 16, 2018

benesch deleted the rdperf branch July 16, 2018 13:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

c-deps: backport RocksDB range deletion performance fix #26877

c-deps: backport RocksDB range deletion performance fix #26877

benesch commented Jun 21, 2018

cockroach-teamcity commented Jun 21, 2018

benesch commented Jun 21, 2018

petermattis commented Jun 21, 2018

petermattis commented Jun 21, 2018

benesch commented Jul 16, 2018

c-deps: backport RocksDB range deletion performance fix #26877

c-deps: backport RocksDB range deletion performance fix #26877

Conversation

benesch commented Jun 21, 2018

cockroach-teamcity commented Jun 21, 2018

benesch commented Jun 21, 2018

petermattis commented Jun 21, 2018

petermattis commented Jun 21, 2018

benesch commented Jul 16, 2018