Update TestGet() to verify against expected state #10553

Closed · wants to merge 3 commits

Conversation

@cbi42 (Member) commented on Aug 22, 2022

Summary: Updated `TestGet()` in `no_batched_op_stress` to check the result of Get() operations against expected state (`expected_state_manager_`). More specifically, if Get() finds a key, the expected state should not have `DELETION_SENTINEL` for that key, and if Get() returns NotFound for a key, the expected state should not have the key. One intention of this change is to verify the correctness of code path changes regarding range tombstones.
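
For illustration, here is a self-contained toy version of that cross-check. The map and sentinel below are stand-ins for db_stress's `expected_state_manager_` and `SharedState::DELETION_SENTINEL`; this is a sketch of the invariant, not the actual implementation.

```
#include <cstdint>
#include <string>
#include <unordered_map>

// Stand-in for SharedState::DELETION_SENTINEL: marks a key whose latest
// expected operation is a delete.
constexpr uint32_t kDeletionSentinel = UINT32_MAX;

// Returns true iff the DB's Get() result is consistent with the expected
// state: a key Get() found must not be marked deleted, and a key Get()
// reported NotFound for must not have a live expected value.
bool GetResultMatchesExpected(
    const std::unordered_map<std::string, uint32_t>& expected_state,
    const std::string& key, bool db_found_key) {
  auto it = expected_state.find(key);
  const bool expected_found =
      it != expected_state.end() && it->second != kDeletionSentinel;
  return db_found_key == expected_found;
}
```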

Test plan: run db_stress with nonzero readpercent: `./db_stress_branch --readpercent=57 --prefixpercent=4 --writepercent=25 --delpercent=5 --iterpercent=5 --delrangepercent=4`. When I initially used the wrong column family in `thread->shared->Get()`, the test reported inconsistencies.

  • internally ran various flavors of the crash test

@facebook-github-bot

@cbi42 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@cbi42 requested a review from ajkr on August 23, 2022 03:00
@ajkr (Contributor) left a comment:

Will comment test instructions on internal diff.

```
@@ -360,9 +360,27 @@ class NonBatchedOpsStressTest : public StressTest {
        }
        // found case
        thread->stats.AddGets(1, 1);
        if (thread->shared->Get(rand_column_families[0], rand_keys[0]) ==
```
A Contributor commented on this line:

What if `read_opts_copy.timestamp` is set by `MaybeUseOlderTimestampForPointLookup()`?

@cbi42 (Member, Author) replied:

Since we only have the latest expected state, I think we can only do the verification when timestamp is not set. I'll update accordingly.
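
In sketch form, the updated guard might look like this (variable names taken from the diff context above; everything else is assumed):

```
// The expected state mirrors only the latest value of each key, so skip
// the cross-check whenever this read uses an older timestamp.
if (read_opts_copy.timestamp == nullptr) {
  // ... verify the Get() result against expected state ...
}
```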

@facebook-github-bot

@cbi42 has updated the pull request. You must reimport the pull request before landing.

@facebook-github-bot

@cbi42 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@cbi42 force-pushed the stress-get-against-expected branch from 43f3b10 to 6a2a5f3 on August 23, 2022 20:52
@facebook-github-bot

@cbi42 has updated the pull request. You must reimport the pull request before landing.

@facebook-github-bot

@cbi42 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@ajkr (Contributor) left a comment:

LGTM

```
@@ -1,5 +1,7 @@
 # Rocksdb Change Log
 ## Unreleased
 ### Behavior Change
 * Updated `TestGet()` in `no_batched_op_stress` (default stress test) to check the result of Get() operations against expected state.
```
A Contributor commented on this change log entry:

This seems like information that is absolutely no concern to users. How are they supposed to even interpret this without digging into internal source code? We already have problems with important users not reading release notes, and adding noise to the signal does not help the situation.

Another Contributor replied:
Removing it is fine with me.

> absolutely no concern to users

#9385 pleasantly surprised me that our crash test appears to be run heavily in at least one place outside of Meta. Still, removing it is fine with me.

@cbi42 (Member, Author) replied:

Thanks for pointing this out. I wasn't aware of the problem of important users not reading release notes, but I realize and agree that stress test details are not of interest to most users. I'll remove the change log entry in another PR.

facebook-github-bot pushed a commit that referenced this pull request Aug 24, 2022
Summary:
As mentioned in #5506 (comment),
`db_stress` does not have much verification for iterator correctness.
It has a `TestIterate()` function, but that is mainly for comparing results
between two iterators: one with `total_order_seek`, and the other optionally
setting `auto_prefix` and upper/lower bounds. Commit 49a0581
added a new `TestIterateAgainstExpected()` function that compares an iterator
against expected state. It locks a range of keys, creates an iterator, does
a random sequence of `Next()`/`Prev()`, and compares against expected state.
This PR is based on that commit; the main changes include some logging
(for easier debugging if a test fails), a forward and a backward scan to
cover the entire locked key range, and a flag for optionally turning on
this version of iterator testing.

Added a constraint that the checks against expected state in
`TestIterateAgainstExpected()` and in `TestGet()` are only turned on
when the `--skip_verifydb` flag is not set.
Removed the change log entry introduced in #10553.
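
For illustration, a toy sketch of the scan-against-expected pattern described above; `Iter` is a stand-in for `rocksdb::Iterator` with string keys/values, and the real `TestIterateAgainstExpected()` additionally runs backward scans and random `Next()`/`Prev()` sequences over a locked key range:

```
#include <map>
#include <string>

// Walk the iterator forward across [lower, upper) and check every
// (key, value) pair against an in-memory expected map in lockstep.
template <typename Iter>
bool ForwardScanMatchesExpected(
    Iter& iter, const std::map<std::string, std::string>& expected,
    const std::string& lower, const std::string& upper) {
  auto exp = expected.lower_bound(lower);
  for (iter.Seek(lower); iter.Valid() && iter.key() < upper; iter.Next()) {
    // Each key the iterator yields must match the next expected entry.
    if (exp == expected.end() || exp->first >= upper ||
        iter.key() != exp->first || iter.value() != exp->second) {
      return false;
    }
    ++exp;
  }
  // The iterator must not have skipped any expected key in the range.
  return exp == expected.end() || exp->first >= upper;
}
```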

Pull Request resolved: #10538

Test Plan:
Run `db_stress` with `--verify_iterator_with_expected_state_one_in=1`
and a large `--iterpercent` and `--num_iterations`. Checked `op_logs`
manually to ensure the expected coverage. Tweaked part of the code in
#10449 and the stress test was able to catch it.
- internally ran various flavors of the crash test

Reviewed By: ajkr

Differential Revision: D38847269

Pulled By: cbi42

fbshipit-source-id: 8b4402a9bba9f6cfa08051943cd672579d489599
facebook-github-bot pushed a commit that referenced this pull request Sep 14, 2022
Summary:
Each read from a memtable used to read and fragment all the range tombstones into a `FragmentedRangeTombstoneList`. #10380 improved this inefficiency by caching a `FragmentedRangeTombstoneList` with each immutable memtable. This PR extends the caching to mutable memtables. The fragmented range tombstone list can be constructed in either the read path (this PR) or the write path (#10584). With both implementations, each `DeleteRange()` invalidates the cache; the difference is where the cache is re-constructed. `CoreLocalArray` is used to store the cache with each memtable so that multi-threaded reads can be efficient. More specifically, each core has a shared_ptr to a shared_ptr pointing to the current cache. Each read thread only updates the reference count in its core-local shared_ptr, and this is only needed when reading from mutable memtables.

The choice between write-path and read-path caching is not an easy one: both are improvements over the current implementation's lack of caching, but each favors different operations and could cause a regression in the other (read vs. write). The write-path caching in #10584 leads to a cleaner implementation, but I chose the read-path caching here to avoid a significant regression in write performance when there is a considerable number of range tombstones in a single memtable (the benchmark below suggests >1000 with concurrent writers). Note that even though the fragmented range tombstone list is only constructed in `DeleteRange()` operations, it could block other writes from proceeding, and hence affects overall write performance.
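
As a rough sketch of the core-local scheme described above (simplified: one mutex-guarded slot per core instead of the double-shared_ptr trick, and each core rebuilds its own copy rather than sharing one list; `FragmentedRangeTombstoneList` is reduced to an empty stand-in):

```
#include <memory>
#include <mutex>
#include <vector>

struct FragmentedRangeTombstoneList {};  // stand-in for the real type

// One cache slot per core; readers touch only their own slot, so
// shared_ptr refcount traffic stays core-local instead of contending
// on a single atomic shared by every reading thread.
class CoreLocalTombstoneCache {
 public:
  explicit CoreLocalTombstoneCache(size_t num_cores) : slots_(num_cores) {}

  // Read path: return this core's cached list, rebuilding it lazily if
  // a DeleteRange() invalidated it (the "read path caching" choice).
  std::shared_ptr<const FragmentedRangeTombstoneList> Get(size_t core_id) {
    Slot& slot = slots_[core_id % slots_.size()];
    std::lock_guard<std::mutex> lock(slot.mu);
    if (!slot.list) {
      // Placeholder: the real code re-fragments the memtable's range
      // tombstones here on the first read after an invalidation.
      slot.list = std::make_shared<const FragmentedRangeTombstoneList>();
    }
    return slot.list;
  }

  // Write path: DeleteRange() invalidates every slot; the next reader
  // on each core pays the re-fragmentation cost.
  void Invalidate() {
    for (Slot& slot : slots_) {
      std::lock_guard<std::mutex> lock(slot.mu);
      slot.list.reset();
    }
  }

 private:
  struct Slot {
    std::mutex mu;
    std::shared_ptr<const FragmentedRangeTombstoneList> list;
  };
  std::vector<Slot> slots_;
};
```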

Pull Request resolved: #10547

Test Plan:
- `TestGet()` in the stress test was updated in #10553 to compare Get() results against expected state: `./db_stress_branch --readpercent=57 --prefixpercent=4 --writepercent=25 --delpercent=5 --iterpercent=5 --delrangepercent=4`
- Perf benchmark: tested read and write performance where a memtable has 0, 1, 10, 100 and 1000 range tombstones.
```
./db_bench --benchmarks=fillrandom,readrandom --writes_per_range_tombstone=200 --max_write_buffer_number=100 --min_write_buffer_number_to_merge=100 --writes=200000 --reads=100000 --disable_auto_compactions --max_num_range_tombstones=1000
```
Write perf regressed since the cost of constructing the fragmented range tombstone list is shifted from every read to a single write. 6cbe5d8 is included in the last column as a reference to show the performance impact on multi-threaded reads when `CoreLocalArray` is not used.

micros/op averaged over 5 runs: first 4 columns are for fillrandom, last 4 columns are for readrandom.
| # range tombstones | fillrandom main | write path caching | read path caching | memtable V3 (#10308) | readrandom main | write path caching | read path caching | memtable V3 |
|---|---|---|---|---|---|---|---|---|
| 0 | 6.35 | 6.15 | 5.82 | 6.12 | 2.24 | 2.26 | 2.03 | 2.07 |
| 1 | 5.99 | 5.88 | 5.77 | 6.28 | 2.65 | 2.27 | 2.24 | 2.5 |
| 10 | 6.15 | 6.02 | 5.92 | 5.95 | 5.15 | 2.61 | 2.31 | 2.53 |
| 100 | 5.95 | 5.78 | 5.88 | 6.23 | 28.31 | 2.34 | 2.45 | 2.94 |
| 100 (25 threads) | 52.01 | 45.85 | 46.18 | 47.52 | 35.97 | 3.34 | 3.34 | 3.56 |
| 1000 | 6.0 | 7.07 | 5.98 | 6.08 | 333.18 | 2.86 | 2.7 | 3.6 |
| 1000 (25 threads) | 52.6 | 148.86 | 79.06 | 45.52 | 473.49 | 3.66 | 3.48 | 4.38 |

  - Benchmarked performance of `readwhilewriting` from #10552; 100 range tombstones are written: `./db_bench --benchmarks=readwhilewriting --writes_per_range_tombstone=500 --max_write_buffer_number=100 --min_write_buffer_number_to_merge=100 --writes=100000 --reads=500000 --disable_auto_compactions --max_num_range_tombstones=10000 --finish_after_writes`

readrandom micros/op:
| threads | main | write path caching | read path caching | memtable V3 |
|---|---|---|---|---|
| single thread | 48.28 | 1.55 | 1.52 | 1.96 |
| 25 threads | 64.3 | 2.55 | 2.67 | 2.64 |

Reviewed By: ajkr

Differential Revision: D38895410

Pulled By: cbi42

fbshipit-source-id: 930bfc309dd1b2f4e8e9042f5126785bba577559