[WIP] fix db checkpoint async bug #2982

yontyon · 2023-04-03T10:45:06Z

Problem Overview
The current implementation of the db checkpoint feature has a synchronization bug:
While we take the db checkpoint in the background, we don't align any other state with the checkpoint sequence number: not the block number, the bft metadata, the pending reserved pages, or anything else.
Once client requests are sent in a resolution other than 150, we start to see a wide range of issues.
For example:
1. The recovered replica doesn't start from a stable checkpoint; instead, it starts from the point where the db checkpoint was taken.
2. If the checkpoint was taken in the middle of another execution phase, we don't have the pending reserved pages needed to recover correctly.
3. We trim the block at the point where the db checkpoint was taken, but we don't update the bft metadata accordingly.
Below is an example of some of these issues.
On replica 0, block 302 was created on sequence number 304:

0|03-04-2023 10:48:24.125|INFO |skvbctest.replica|post-execution-thread|||304||executeWriteCommand|L:482|ConditionalWrite message handled; writesCounter=302 currBlock=302 | [SQ:1215]

However, on the recovered replica, the same block was created on sequence number 305:

5|03-04-2023 10:48:34.293|INFO |skvbctest.replica|post-execution-thread|||305||executeWriteCommand|L:482|ConditionalWrite message handled; writesCounter=1 currBlock=302 | [SQ:3]
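
A mismatch like the one above can be spotted mechanically. The following is a minimal sketch, assuming the first pipe-separated field is the replica id and the eighth is the sequence number; those positions are inferred from the two log lines quoted here, not from a documented log format, and the helper names are hypothetical.

```python
import re

# The two log lines quoted in this description.
LINES = [
    "0|03-04-2023 10:48:24.125|INFO |skvbctest.replica|post-execution-thread|||304||executeWriteCommand|L:482|ConditionalWrite message handled; writesCounter=302 currBlock=302 | [SQ:1215]",
    "5|03-04-2023 10:48:34.293|INFO |skvbctest.replica|post-execution-thread|||305||executeWriteCommand|L:482|ConditionalWrite message handled; writesCounter=1 currBlock=302 | [SQ:3]",
]


def parse_line(line):
    """Return (replica_id, seq_num, block_id); field positions are assumed."""
    fields = line.split("|")
    replica_id = int(fields[0])
    seq_num = int(fields[7])
    block_id = int(re.search(r"currBlock=(\d+)", line).group(1))
    return replica_id, seq_num, block_id


def mismatched_blocks(lines):
    """Map block id -> {replica: seq_num} for blocks whose seq_num differs."""
    per_block = {}
    for line in lines:
        replica_id, seq_num, block_id = parse_line(line)
        per_block.setdefault(block_id, {})[replica_id] = seq_num
    return {
        block_id: seqs
        for block_id, seqs in per_block.items()
        if len(set(seqs.values())) > 1
    }
```

Calling `mismatched_blocks(LINES)` reports block 302 with sequence number 304 on replica 0 and 305 on replica 5.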

This PR proposes a fix in which we pin the bft sequence number before starting the async part and align everything else to it.
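
The idea of pinning can be sketched in a few lines. This is only an illustration of the pattern, not the code in this PR: `Replica`, `PinnedState`, `create_checkpoint` and `take_checkpoint_async` are hypothetical names, and the real change is in the C++ replica code.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass(frozen=True)
class PinnedState:
    """Values captured synchronously, before any background work starts."""
    seq_num: int
    last_block_id: int


class Replica:
    """Toy replica: every executed request bumps the seq num and adds a block."""

    def __init__(self):
        self.seq_num = 0
        self.last_block_id = 0

    def execute(self):
        self.seq_num += 1
        self.last_block_id += 1

    def pin(self):
        return PinnedState(self.seq_num, self.last_block_id)


def create_checkpoint(pinned):
    # Background part: reads only the pinned values, never the live replica.
    return {"seq_num": pinned.seq_num, "trim_to_block": pinned.last_block_id}


def take_checkpoint_async(replica, executor):
    pinned = replica.pin()  # synchronous, on the caller's thread
    return executor.submit(create_checkpoint, pinned)


def demo():
    replica = Replica()
    for _ in range(150):
        replica.execute()
    with ThreadPoolExecutor(max_workers=1) as executor:
        future = take_checkpoint_async(replica, executor)
        for _ in range(4):  # execution continues while the checkpoint runs
            replica.execute()
        checkpoint = future.result()
    return checkpoint, replica.seq_num
```

In `demo()` the checkpoint stays aligned to sequence number 150 even though the replica has moved on to 154 by the time the background task finishes.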

This PR doesn't handle the case where the operator explicitly creates a db checkpoint, as that flow is assumed to be used for clients only (which care only about the blockchain).

Testing Done
CI + Changing an existing test to verify the changes

WildFireFlum · 2023-04-03T11:09:49Z

kvbc/include/kvbc_adapter/replica_adapter.hpp

@@ -109,7 +109,19 @@ class ReplicaBlockchain : public IBlocksDeleter,
  std::optional<categorization::Updates> getBlockUpdates(BlockId block_id) const override final {
    return reader_->getBlockUpdates(block_id);
  }
-
+  // find the first block which has the given sequence number in its metadata


Can multiple blocks have the same sequence number in their metadata?

yes, but they will all be placed sequentially one after another, so as long as we stop on the first one it's ok
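
A minimal sketch of the "stop on the first one" behaviour described in this reply. The function name and the list-based storage are hypothetical; how the actual lookup in `replica_adapter.hpp` iterates over blocks is not shown in this excerpt.

```python
def find_first_block_with_seq_num(block_seq_nums, seq_num):
    """Return the id of the first block whose metadata holds seq_num.

    block_seq_nums[i] is the sequence number stored in block id i + 1.
    Blocks sharing a sequence number are assumed to be adjacent, so the
    first match is also the start of that run.
    """
    for block_id, block_seq in enumerate(block_seq_nums, start=1):
        if block_seq == seq_num:
            return block_id
    return None  # no block carries this sequence number
```

For example, with `block_seq_nums = [1, 2, 2, 2, 3]`, blocks 2 to 4 share sequence number 2 and the function returns block id 2.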

WildFireFlum · 2023-04-03T11:12:19Z

tests/apollo/test_skvbc_dbsnapshot.py

@@ -934,7 +934,7 @@ async def test_restore_from_snapshot_of_other(self, bft_network, tracker):

        crashed_replica = list(bft_network.random_set_of_replicas(1, {initial_prim}))[0]
        bft_network.stop_replica(crashed_replica)
-        await skvbc.send_n_kvs_sequentially(DB_CHECKPOINT_WIN_SIZE)  # run till the next checkpoint
+        await skvbc.send_n_kvs_sequentially(DB_CHECKPOINT_WIN_SIZE + 100)  # run till the next checkpoint


maybe int(DB_CHECKPOINT_WIN_SIZE * 1.5) so that if we change this constant the test remains valid?
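
To illustrate why the scaled form is more robust, here is a small sketch. It assumes `DB_CHECKPOINT_WIN_SIZE` is 150 (the value is not shown in this diff) and that each request advances the window count by one, which is a simplification; the helper names are hypothetical.

```python
DB_CHECKPOINT_WIN_SIZE = 150  # assumed value, not taken from the diff


def windows_completed(n_requests, win_size):
    """Number of full checkpoint windows covered by n_requests."""
    return n_requests // win_size


def fixed_offset(win_size):
    return win_size + 100  # form used in the current diff


def scaled(win_size):
    return int(win_size * 1.5)  # form suggested in this comment


def compare(win_size):
    """Completed windows for (fixed offset, scaled) at a given window size."""
    return (
        windows_completed(fixed_offset(win_size), win_size),
        windows_completed(scaled(win_size), win_size),
    )
```

With a window of 150 both forms cross exactly one window (`compare(150)` gives `(1, 1)`), but if the constant were lowered to 50, the fixed offset would cross three windows while the scaled form would still cross one (`compare(50)` gives `(3, 1)`).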

WildFireFlum · 2023-04-03T11:13:47Z

"This PR proposes a fix, in which, we pin the bft sequence number before starting the async part, and align everything accordingly."
Can you detail what data you persist differently from before and why?

cloudnoize · 2023-04-16T11:27:29Z

kvbc/src/v4blockchain/v4_blockchain.cpp

    auto new_snap_shot = RecoverySnapshot{&native_client_->rawDB()};
-    chkpnt_snap_shots_[last_reachable_id] = new_snap_shot.get();
+    chkpnt_snap_shots_[block_id_at_chkpnt] = new_snap_shot.get();


This is incorrect; the newly created snapshot represents the current state of the blockchain i.e. the state of the latest block id, while the change assumes that it represents a block id at the checkpoint, which is a block in the past.
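
The concern can be shown with a toy model. This is only an illustration of the argument, not the behaviour of `RecoverySnapshot` or the v4 blockchain; `ToyChain` and its methods are hypothetical.

```python
class ToyChain:
    """Hypothetical stand-in for the blockchain; block ids start at 1."""

    def __init__(self):
        self.blocks = []

    def add_block(self, data):
        self.blocks.append(data)
        return len(self.blocks)  # id of the block just added

    def snapshot(self):
        # A snapshot captures the state at the moment it is taken.
        return tuple(self.blocks)


def demo():
    chain = ToyChain()
    chain.add_block("a")
    block_id_at_chkpnt = chain.add_block("b")  # checkpoint pinned here
    last_reachable_id = chain.add_block("c")   # chain keeps growing
    snap = chain.snapshot()                    # taken only now
    return block_id_at_chkpnt, last_reachable_id, len(snap)
```

`demo()` returns `(2, 3, 3)`: the snapshot already contains block 3, so storing it under key 2 would label it with a block id it does not correspond to.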

yontyon requested a review from a team as a code owner April 3, 2023 10:45

vmwclabot added the cla-not-required label Apr 3, 2023

WildFireFlum reviewed Apr 3, 2023

fix db checkpoint async bug

b7e4591

yontyon force-pushed the trim-db-checkpoint-to-latest-checkpoint branch from c1719ad to b7e4591 Compare April 13, 2023 11:40

cloudnoize reviewed Apr 16, 2023

yontyon closed this Apr 17, 2023
