
Fix bug causing incorrect data returned by snapshot read #9648

Closed

Conversation

riversand963
Contributor

This bug affects use cases that meet both of the following conditions:

  • the DB has only the default column family, or disables the WAL, and
  • the DB has at least one event listener.

Atomic flush is NOT affected.

When both conditions are met, RocksDB can release the db mutex before picking all the
existing memtables to flush. In that window, a snapshot can be created and the db's sequence
number can still be incremented. The upcoming flush then ignores this snapshot, so a later
read using it can return incorrect results.

To fix this issue, we call the listeners' callbacks after picking the memtables, so that
no snapshot can be created in between.
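
In other words, the fix moves memtable picking ahead of the user callbacks so that picking happens while the db mutex is still held. Below is a minimal sketch of that ordering; all names (`FlushSketch`, `PickMemtables`, `NotifyListenersOnFlushBegin`, `DbState`) are invented stand-ins, not RocksDB's actual internals:

```cpp
// Minimal sketch of the reordering; these types and functions are
// illustrative stand-ins, not RocksDB's actual internals.
#include <mutex>
#include <vector>

struct Memtable {};

struct DbState {
  std::mutex db_mutex;  // stands in for the db mutex
  std::vector<Memtable*> immutable_memtables;
};

std::vector<Memtable*> PickMemtables(DbState& db) {
  // In the real code, the set of memtables to flush (and hence the range of
  // sequence numbers the flush covers) is decided here.
  return db.immutable_memtables;
}

void NotifyListenersOnFlushBegin(DbState&) {
  // User-supplied event listener callbacks; they may be slow, so the db
  // mutex must be released while they run.
}

void FlushSketch(DbState& db) {
  // Buggy ordering: notify listeners first (mutex released), then pick
  // memtables. A snapshot created inside that window is invisible to the
  // flush. Fixed ordering, as in this PR: pick first, then notify.
  db.db_mutex.lock();
  std::vector<Memtable*> mems = PickMemtables(db);
  db.db_mutex.unlock();
  NotifyListenersOnFlushBegin(db);  // snapshots created now are accounted for
  // ... write `mems` out to an SST file ...
  (void)mems;
}
```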

Test plan
make check
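
For illustration, the race can be sketched against public RocksDB APIs (`EventListener::OnFlushBegin`, `GetSnapshot`, snapshot reads). This is a hypothetical repro, not the PR's actual test: the `PausingListener` scaffolding, the scratch path, and the exact interleaving are assumptions.

```cpp
// Hypothetical repro sketch (NOT the PR's actual test). Only public RocksDB
// APIs are used; the listener scaffolding and names are invented.
#include <rocksdb/db.h>
#include <rocksdb/listener.h>

#include <cassert>
#include <condition_variable>
#include <memory>
#include <mutex>
#include <string>
#include <thread>

// Pauses inside OnFlushBegin, which the buggy code invoked with the db mutex
// released but before the flush had picked its memtables.
class PausingListener : public rocksdb::EventListener {
 public:
  void OnFlushBegin(rocksdb::DB*, const rocksdb::FlushJobInfo&) override {
    std::unique_lock<std::mutex> lk(mu_);
    if (!armed_) {
      return;  // let earlier flushes pass through untouched
    }
    paused_ = true;
    cv_.notify_all();
    cv_.wait(lk, [&] { return resumed_; });
  }
  void Arm() {
    std::lock_guard<std::mutex> lk(mu_);
    armed_ = true;
  }
  void WaitUntilPaused() {
    std::unique_lock<std::mutex> lk(mu_);
    cv_.wait(lk, [&] { return paused_; });
  }
  void Resume() {
    std::lock_guard<std::mutex> lk(mu_);
    resumed_ = true;
    cv_.notify_all();
  }

 private:
  std::mutex mu_;
  std::condition_variable cv_;
  bool armed_ = false;
  bool paused_ = false;
  bool resumed_ = false;
};

int main() {
  rocksdb::Options opts;
  opts.create_if_missing = true;
  auto listener = std::make_shared<PausingListener>();
  opts.listeners.push_back(listener);

  rocksdb::DB* db = nullptr;
  rocksdb::Status open_s = rocksdb::DB::Open(opts, "/tmp/pr9648_repro", &db);
  assert(open_s.ok());

  db->Put(rocksdb::WriteOptions(), "key", "v0");
  db->Flush(rocksdb::FlushOptions());          // "v0" now lives in an SST file
  db->Delete(rocksdb::WriteOptions(), "key");  // tombstone in the memtable

  listener->Arm();
  std::thread flusher([&] { db->Flush(rocksdb::FlushOptions()); });
  listener->WaitUntilPaused();

  // Race window: the flush has released the db mutex and, in the buggy
  // version, has not yet picked its memtables.
  const rocksdb::Snapshot* snap = db->GetSnapshot();  // should see the delete
  db->Put(rocksdb::WriteOptions(), "key", "v2");      // seqno moves past snap

  listener->Resume();
  flusher.join();

  rocksdb::ReadOptions ropts;
  ropts.snapshot = snap;
  std::string value;
  rocksdb::Status s = db->Get(ropts, "key", &value);
  // Correct behavior: NotFound, because the snapshot observed the delete.
  // With the bug, the flush ignores `snap`, the tombstone can be collapsed
  // into "v2", and the deleted "v0" can resurface from the older SST.
  assert(s.IsNotFound());

  db->ReleaseSnapshot(snap);
  delete db;
  return 0;
}
```

Under the buggy ordering, the flush picks memtables that already contain writes sequenced after `snap` while treating `snap` as nonexistent, which is how a tombstone can be dropped and deleted data resurfaced.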

@facebook-github-bot
Contributor

@riversand963 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot
Contributor

@riversand963 has updated the pull request. You must reimport the pull request before landing.

@facebook-github-bot
Contributor

@riversand963 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

ajkr (Contributor) left a comment


LGTM! Nice test case

HISTORY.md (outdated)
```diff
@@ -5,7 +5,8 @@
 * Added BlobDB options to `ldb`
 
 ### Bug Fixes
-* * Fixed a data race on `versions_` between `DBImpl::ResumeImpl()` and threads waiting for recovery to complete (#9496)
+* Fixed a data race on `versions_` between `DBImpl::ResumeImpl()` and threads waiting for recovery to complete (#9496)
+* Fixed a bug due to db mutex release causing incorrect result returned for snapshot read.
```

@ajkr ajkr Mar 3, 2022


"db mutex release" - this isn't user API level or very concrete. Should we say an unlikely race condition and/or list the conditions? Maybe we say the symptom too: queries to snapshots created with this race condition may return old values, including resurfacing deleted data.

riversand963 (Contributor, Author)


Thanks @ajkr for the review. Updated HISTORY

@facebook-github-bot
Contributor

@riversand963 has updated the pull request. You must reimport the pull request before landing.

@facebook-github-bot
Contributor

@riversand963 has updated the pull request. You must reimport the pull request before landing.

@facebook-github-bot
Contributor

@riversand963 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@riversand963 riversand963 deleted the fix-tombstone-snapshot branch March 3, 2022 05:27
ajkr pushed a commit that referenced this pull request Mar 3, 2022
Summary:
This bug affects use cases that meet both of the following conditions:
- the DB has only the default column family, or disables the WAL, and
- the DB has at least one event listener.

Atomic flush is NOT affected.

When both conditions are met, RocksDB can release the db mutex before picking all the
existing memtables to flush. In that window, a snapshot can be created and the db's sequence
number can still be incremented. The upcoming flush then ignores this snapshot, so a later
read using it can return incorrect results.

To fix this issue, we call the listeners' callbacks after picking the memtables, so that
no snapshot can be created in between.

Pull Request resolved: #9648

Test Plan: make check

Reviewed By: ajkr

Differential Revision: D34555456

Pulled By: riversand963

fbshipit-source-id: 1438981e9f069a5916686b1a0ad7627f734cf0ee
ajkr pushed a commit that referenced this pull request Mar 7, 2022
Summary:
This bug affects use cases that meet both of the following conditions:
- the DB has only the default column family, or disables the WAL, and
- the DB has at least one event listener.

Atomic flush is NOT affected.

When both conditions are met, RocksDB can release the db mutex before picking all the
existing memtables to flush. In that window, a snapshot can be created and the db's sequence
number can still be incremented. The upcoming flush then ignores this snapshot, so a later
read using it can return incorrect results.

To fix this issue, we call the listeners' callbacks after picking the memtables, so that
no snapshot can be created in between.

Pull Request resolved: #9648

Test Plan: make check

Reviewed By: ajkr

Differential Revision: D34555456

Pulled By: riversand963

fbshipit-source-id: 1438981e9f069a5916686b1a0ad7627f734cf0ee
facebook-github-bot pushed a commit that referenced this pull request Mar 17, 2022
Summary:
Test only, no change to functionality.
Extremely low risk of library regression.

Update test key generation by maintaining existing and non-existing keys.
Update db_crashtest.py to drive multiops_txn stress test for both write-committed and write-prepared.
Add a make target 'blackbox_crash_test_with_multiops_txn'.

Running the following commands caught the bug exposed in #9571.
```
$ rm -rf /tmp/rocksdbtest/*
$ ./db_stress -progress_reports=0 -test_multi_ops_txns -use_txn -clear_column_family_one_in=0 \
    -column_families=1 -writepercent=0 -delpercent=0 -delrangepercent=0 -customopspercent=60 \
    -readpercent=20 -prefixpercent=0 -iterpercent=20 -reopen=0 -ops_per_thread=1000 -ub_a=10000 \
    -ub_c=100 -destroy_db_initially=0 -key_spaces_path=/dev/shm/key_spaces_desc -threads=32 -read_fault_one_in=0
$ ./db_stress -progress_reports=0 -test_multi_ops_txns -use_txn -clear_column_family_one_in=0 \
    -column_families=1 -writepercent=0 -delpercent=0 -delrangepercent=0 -customopspercent=60 -readpercent=20 \
    -prefixpercent=0 -iterpercent=20 -reopen=0 -ops_per_thread=1000 -ub_a=10000 -ub_c=100 -destroy_db_initially=0 \
    -key_spaces_path=/dev/shm/key_spaces_desc -threads=32 -read_fault_one_in=0
```

Running the following command caught a bug that will be fixed in #9648.
```
$ TEST_TMPDIR=/dev/shm make blackbox_crash_test_with_multiops_wc_txn
```

Pull Request resolved: #9568

Reviewed By: jay-zhuang

Differential Revision: D34308154

Pulled By: riversand963

fbshipit-source-id: 99ff1b65c19b46c471d2f2d3b47adcd342a1b9e7