cache_flat_mutation_reader: use the correct schema in prepare_hash #14305

michoecho · 2023-06-19T16:28:27Z

Since mvcc: make schema upgrades gentle (51e3b93),
rows pointed to by the cursor can have different (older) schema
than the schema of the cursor's snapshot.

However, one place in the code wasn't updated accordingly,
causing a row to be processed with the wrong schema in the right
circumstances.

This slipped through unit testing because triggering it requires
a digest-computing cache read after a schema change,
and no test exercised that path.

This series fixes the bug and adds a unit test which reproduces the issue.

Fixes #14110
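
To illustrate the class of bug described above, here is a toy model, not the actual `cache_flat_mutation_reader` code. The types `toy_schema` and `toy_row` and the functions `digest` and `digest_row` are hypothetical names invented for this sketch. The assumption is only that cells are positional and therefore must be interpreted with the schema they were written under, not with the (possibly newer) schema of the snapshot.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <memory>
#include <string>
#include <vector>

// Toy schema: an ordered list of column names. Purely illustrative.
struct toy_schema {
    std::vector<std::string> columns;
};

// Toy row: cells are positional, so they only make sense together with
// the schema they were written under.
struct toy_row {
    std::shared_ptr<const toy_schema> schema;
    std::vector<std::string> cells;
};

// Digest a row by pairing each cell with its column name taken from `s`.
// Requires s.columns.size() == cells.size(); a newer schema with extra
// columns would violate this precondition.
inline std::size_t digest(const toy_schema& s, const std::vector<std::string>& cells) {
    assert(s.columns.size() == cells.size());
    std::size_t h = 0;
    for (std::size_t i = 0; i < cells.size(); ++i) {
        h = h * 31 + std::hash<std::string>{}(s.columns[i] + "=" + cells[i]);
    }
    return h;
}

// The safe call site: take the schema from the row itself,
// not from the snapshot the cursor is iterating over.
inline std::size_t digest_row(const toy_row& r) {
    return digest(*r.schema, r.cells);
}
```

In this model, calling `digest(*snapshot_schema, row.cells)` on a row that has not been upgraded yet is the mistake; `digest_row` avoids it by construction.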

avikivity · 2023-06-19T16:48:53Z

mutation/mutation_cleaner.hh

@@ -24,6 +25,7 @@ class mutation_cleaner_impl final {
        snapshot_list snapshots;
        logalloc::allocating_section alloc_section;
        bool done = false; // true means the worker was abandoned and cannot access the mutation_cleaner_impl instance.
+        rwlock merging_enabled; // Allows for pausing the background merging. Used only for testing purposes.


Is a rwlock really needed? Will you have multiple "reader"s?

In fact I'm not sure a lock is needed at all, since the non-test side only uses trylock. Isn't it enough to have a bool?

> In fact I'm not sure a lock is needed at all, since the non-test side only uses trylock. Isn't it enough to have a bool?

On the non-test side, the worker blocks on the lock.

It's possible to implement this with a bool too. But then I would have to make the returned guard be a separate type, which signals the condition variable when it's destroyed. I figured rwlock was cleaner.

> Is a rwlock really needed? Will you have multiple "reader"s?

No, probably not. In fact, I first used a mutex for this. But then I decided that a reentrant/nestable API for pausing would be nicer.

Changed to an int (like a bool, but reentrant) in v2.

tgrabiec · 2023-06-19T17:08:40Z

mutation/mutation_partition.cc

            }
-            merge_some();
-            return stop_iteration::no;
+            return with_lock(w->merging_enabled.for_write(), [this] () noexcept {


This adds a deferring point after w->done is checked. We must not attempt merge_some() when done is true because region() may no longer be valid after ~mutation_cleaner_impl().

True. The w->done check has to be inside the lock for this to be correct.

That's what I get for playing with locks...

I'm deeply ashamed of having authored a concurrency bug. Thank you for saving me from the disgrace of getting it into master.

I hope v2 is correct.
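
The ordering issue raised in this thread can be sketched as follows. This is a model built on `std::thread` and `std::condition_variable`, not Seastar code, and `worker_state`, `merge_once` and `merges_after_abandon_while_paused` are hypothetical names. The only point it tries to show is that `done` has to be re-checked after the wait (the deferring point), before anything owned by the cleaner is touched.

```cpp
#include <cassert>
#include <condition_variable>
#include <mutex>
#include <thread>

// Hypothetical model of the state shared between owner and worker.
struct worker_state {
    std::mutex mutex;
    std::condition_variable cv;
    int active_pauses = 0;  // merging is allowed only when this is zero
    bool done = false;      // set when the owner goes away
    int merges = 0;         // counts calls that stand in for merge_some()
};

// One worker iteration: wait (the "deferring point"), then re-check done
// before touching anything that belongs to the owner.
inline bool merge_once(worker_state& s) {
    std::unique_lock<std::mutex> lock(s.mutex);
    s.cv.wait(lock, [&s] { return s.done || s.active_pauses == 0; });
    if (s.done) {
        return false;       // owner is gone: must not merge
    }
    ++s.merges;             // stand-in for merge_some()
    return true;
}

// Scenario from the review: the owner is abandoned while the worker is
// (or is about to be) blocked on the pause.
inline int merges_after_abandon_while_paused() {
    worker_state s;
    s.active_pauses = 1;
    std::thread worker([&s] { merge_once(s); });
    {
        std::lock_guard<std::mutex> lock(s.mutex);
        s.done = true;
        s.active_pauses = 0;
    }
    s.cv.notify_all();
    worker.join();
    assert(s.merges == 0);  // the worker saw done and skipped the merge
    return s.merges;
}
```

Checking `done` only before the wait would let the worker resume and merge after the owner is gone, which is the problem pointed out above.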

In unit tests, we would want to delay the merging of some MVCC versions to test the transient scenarios with multiple versions present. In many cases this can be done by holding snapshots to all versions. But sometimes (e.g. during schema upgrades) versions are added and scheduled for merge immediately, without a window for the test to grab a snapshot to the new version. This patch adds a pause() method to mutation_cleaner, which ensures that no asynchronous/implicit MVCC version merges happen within the scope of the call. This functionality will be used by a test added in an upcoming patch.

michoecho · 2023-06-19T20:59:28Z

v2:

Got rid of rwlock to avoid having 2 concurrency primitives in one place. Replaced it with a counter of active pauses. pause() increments the counter and the guard returned from pause() decrements it and signals the condition variable.
The above also fixes (I hope I didn't make any mistakes this time) the incorrect algorithm of v1 which could access the dead mutation_cleaner_impl.
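
For readers who want a picture of the v2 scheme described above, here is a minimal sketch of a reentrant pause counter with an RAII guard. It uses standard-library primitives instead of Seastar's, and `pausable_cleaner`, `pause_guard`, `merging_allowed` and `wait_until_unpaused` are hypothetical names, so treat it as an approximation of the idea rather than the code in this series.

```cpp
#include <cassert>
#include <condition_variable>
#include <mutex>

// Hypothetical stand-in for the cleaner's pause state; not ScyllaDB's code.
class pausable_cleaner {
    std::mutex _mutex;
    std::condition_variable _cv;
    int _active_pauses = 0;  // reentrant: merging is allowed only at zero

public:
    // RAII guard: destroying it ends one pause and wakes any waiter.
    class pause_guard {
        pausable_cleaner* _c;
    public:
        explicit pause_guard(pausable_cleaner& c) : _c(&c) {}
        pause_guard(pause_guard&& o) noexcept : _c(o._c) { o._c = nullptr; }
        pause_guard(const pause_guard&) = delete;
        pause_guard& operator=(const pause_guard&) = delete;
        pause_guard& operator=(pause_guard&&) = delete;
        ~pause_guard() {
            if (_c) {
                {
                    std::lock_guard<std::mutex> lock(_c->_mutex);
                    --_c->_active_pauses;
                }
                _c->_cv.notify_all();
            }
        }
    };

    // Increments the counter; the returned guard decrements it again.
    pause_guard pause() {
        std::lock_guard<std::mutex> lock(_mutex);
        ++_active_pauses;
        return pause_guard(*this);
    }

    bool merging_allowed() {
        std::lock_guard<std::mutex> lock(_mutex);
        return _active_pauses == 0;
    }

    // Worker side: block until no pause is active.
    void wait_until_unpaused() {
        std::unique_lock<std::mutex> lock(_mutex);
        _cv.wait(lock, [this] { return _active_pauses == 0; });
    }
};

// Usage: nested pauses keep merging disabled until the last guard is gone.
inline void nested_pause_example() {
    pausable_cleaner c;
    auto outer = c.pause();
    {
        auto inner = c.pause();
        assert(!c.merging_allowed());
    }
    assert(!c.merging_allowed());  // outer pause is still active
}
```

A counter gives the nestable behaviour mentioned earlier in the review with a single synchronization primitive, at the cost of a dedicated guard type.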

scylladb-promoter · 2023-06-19T23:10:21Z

CI state SUCCESS - https://jenkins.scylladb.com/job/scylla-master/job/scylla-ci/1929/

michoecho requested a review from tgrabiec as a code owner June 19, 2023 16:28

michoecho force-pushed the fix_14410 branch from cca8eca to 8b6e91f Compare June 19, 2023 16:32

avikivity reviewed Jun 19, 2023

tgrabiec requested changes Jun 19, 2023

michoecho added 3 commits June 19, 2023 22:50

test: boost/row_cache_test: add a reproducer for scylladb#14110

02bcb5d

michoecho force-pushed the fix_14410 branch from 8b6e91f to 02bcb5d Compare June 19, 2023 20:52

tgrabiec approved these changes Jun 19, 2023

scylladb-promoter merged commit 5fa08ad into scylladb:master Jun 20, 2023
3 checks passed
