feat(storage): concurrent checkpoint #2184
Conversation
Codecov Report
@@ Coverage Diff @@
## main #2184 +/- ##
==========================================
+ Coverage 73.60% 73.86% +0.26%
==========================================
Files 765 765
Lines 104939 105248 +309
==========================================
+ Hits 77236 77744 +508
+ Misses 27703 27504 -199
src/meta/src/barrier/mod.rs
let heap = self.epoch_heap.clone();
let hummock_manager = self.hummock_manager.clone();
tokio::spawn(async move {
How about using multiple coroutines instead of multiple threads here? 🤔
We can collect the futures and await them without over-provisioning threads, and the RPC latencies are also overlapped.
futures.push(async move { ... });
...
let res = select_all(futures).await;
If coroutines work, there would be no data race on env and epoch_heap, so the RwLock could be avoided.
env: Arc<MetaSrvEnv<S>>,
epoch_heap: Arc<BinaryHeap<u64>>,
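For reference, std's BinaryHeap is a max-heap, while committing epochs in order requires popping the smallest pending epoch first, which takes a Reverse wrapper. A minimal std-only sketch (the function name is illustrative, not the PR's actual code):

```rust
use std::cmp::Reverse;
use std::collections::BinaryHeap;

// BinaryHeap pops the *largest* element first; wrapping epochs in Reverse
// turns it into a min-heap so epochs come out in commit order.
fn pop_epochs_in_order(epochs: &[u64]) -> Vec<u64> {
    let mut heap: BinaryHeap<Reverse<u64>> =
        epochs.iter().copied().map(Reverse).collect();
    let mut ordered = Vec::with_capacity(epochs.len());
    while let Some(Reverse(epoch)) = heap.pop() {
        ordered.push(epoch);
    }
    ordered
}

fn main() {
    assert_eq!(pop_epochs_in_order(&[30, 10, 20]), vec![10, 20, 30]);
    println!("epochs pop in commit order");
}
```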
OK, I will do it after I finish the basic functions.
Force-pushed from 2ded88b to faf343c
Could you please add some docs for newly introduced structs and fields?
I've updated the branch so that we can fairly compare with 944b998 :) You may want to pull this branch before committing new changes.
Just tried to bench this on AWS. However, all create table and create materialized source statements will get stuck. Guess there's a deadlock; maybe you'll need to debug more 🤣 The bug might be reproduced more easily with 3 compute nodes in release mode. Use ./risedev configure to enable release build, and update the default section of
Need to fix conflicts. We have a new Grafana generator, and we should adapt to that.
With #2184 (comment) and #2184 (comment),

async fn barrier_complete_and_commit(....) {
    let (failed_nodes, res) = match result {
        Ok(resp) => {
            // Try to complete barriers
            let succeeded_nodes = checkpoint_control.succeed(prev_epoch);
            // complete_barriers does the following things:
            // - commit epochs in hummock_manager
            // - trigger post-completion work on barriers
            // It returns:
            // - Ok(()) if all succeed
            // - Err((failed_nodes, err)) with the failed nodes and the error.
            match self.complete_barriers(succeeded_nodes) {
                Ok(()) => (Vec::new(), Ok(())),
                Err((mut failed_nodes, err)) => {
                    // Fail all pending barriers as well
                    failed_nodes.extend(checkpoint_control.fail());
                    (failed_nodes, Err(err))
                }
            }
        }
        Err(err) => {
            // Fail all barriers
            (checkpoint_control.fail(), Err(err))
        }
    };
    if !failed_nodes.is_empty() {
        if self.enable_recovery {
            // Post collection failure
            for node in failed_nodes {
                ...
            }
            // Trigger recovery
            ...
        } else {
            panic!(...)
        }
    }
}
Generally LGTM, good work!
// there's a barrier scheduled.
- _ = self.scheduled_barriers.wait_one() => {}
+ _ = self.scheduled_barriers.wait_one(), if checkpoint_control.can_inject_barrier(self.in_flight_barrier_nums) => {}
Possible performance regression, need to think twice. select! will evaluate all the branch conditions before actually doing the select. Is it possible that:
- can_inject_barrier = false at first
- no in-flight barrier (barrier_complete_rx will be stuck)
At this time, only the shutdown branch and barrier_complete_rx will be activated. Even if can_inject_barrier becomes true sometime later, the thread will still block on polling barrier_complete_rx.
Seems that checkpoint_control is local state and will not be modified by another thread. So while the current thread is blocked, it's not likely to be changed.
can_inject_barrier will return false in two cases:
- the number of in-flight barriers > max, so there are in-flight barriers
- is_build_actor = true, so there is at least one in-flight barrier
So an in-flight barrier returning ==> can_inject_barrier becomes true ==> continue to the next loop iteration.
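The two cases can be condensed into a single predicate. A hedged sketch, with a simplified free-function signature rather than the PR's actual method:

```rust
// Both conditions above imply at least one in-flight barrier whenever the
// predicate is false, which is the key to the no-deadlock argument.
fn can_inject_barrier(in_flight_nums: usize, max_in_flight: usize, is_build_actor: bool) -> bool {
    in_flight_nums < max_in_flight && !is_build_actor
}

fn main() {
    assert!(!can_inject_barrier(3, 2, false)); // case 1: too many in-flight barriers
    assert!(!can_inject_barrier(1, 2, true));  // case 2: an actor is being built
    assert!(can_inject_barrier(1, 2, false));  // otherwise injection is allowed
    println!("ok");
}
```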
Looks like there won't be any deadlock if all the related logic is left untouched.
Anyway, you can think of this code like:
loop {
    if can_inject_barrier {
        select! { shutdown, complete_rx, scheduled_barrier }
    } else {
        select! { shutdown, complete_rx }
    }
}
If it is believed that this will always work without deadlock, then everything would be fine :)
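The no-deadlock argument can be checked with a plain-Rust simulation of that loop: injection is skipped only while barriers are in flight, and every in-flight barrier eventually completes, re-enabling injection on the next iteration. All names and numbers here are illustrative, not taken from the PR:

```rust
// Simulate the barrier loop: inject while allowed, otherwise drain a
// completion. The assertion inside the loop would fire if it ever stalled.
fn run_loop(total: usize, max_in_flight: usize) -> usize {
    let (mut scheduled, mut in_flight, mut completed) = (total, 0usize, 0usize);
    let mut iterations = 0usize;
    while completed < total {
        iterations += 1;
        assert!(iterations < 10 * total + 10, "loop stalled");
        if scheduled > 0 && in_flight < max_in_flight {
            // "scheduled_barrier" branch: inject one barrier.
            scheduled -= 1;
            in_flight += 1;
        } else {
            // "complete_rx" branch: whenever injection is blocked, at least
            // one barrier is in flight, so a completion always arrives.
            in_flight -= 1;
            completed += 1;
        }
    }
    completed
}

fn main() {
    assert_eq!(run_loop(5, 2), 5);
    println!("all barriers completed without stalling");
}
```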
... who's
Generally LGTM. Good work!!
/// remove all collect rx less than `prev_epoch`
pub fn drain_collect_rx(&mut self, prev_epoch: u64) {
    self.collect_complete_receiver
        .drain_filter(|x, _| x < &prev_epoch);
}
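As an aside, drain_filter on map types was nightly-only at the time; on stable Rust the same "drop everything below prev_epoch" drain can be expressed with BTreeMap::split_off. A self-contained sketch with placeholder types, not the PR's actual receiver map:

```rust
use std::collections::BTreeMap;

// split_off(&k) returns the entries with keys >= k; assigning it back keeps
// those and drops everything strictly below, matching the `<` filter above.
fn drain_collect_rx(receivers: &mut BTreeMap<u64, String>, prev_epoch: u64) {
    *receivers = receivers.split_off(&prev_epoch);
}

fn main() {
    let mut rx: BTreeMap<u64, String> =
        (1u64..=3).map(|e| (e, format!("rx-{e}"))).collect();
    drain_collect_rx(&mut rx, 3);
    assert_eq!(rx.keys().copied().collect::<Vec<_>>(), vec![3]);
    println!("{rx:?}");
}
```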
Should this be <= ?
It will be used before send_barrier, so prev_epoch is not yet in collect_complete_receiver.
So no changes to the shared buffer are necessary? That's really cool. 🥵
I committed the code on another computer with root.
Force-pushed from 82d42d3 to e3a7e34
The benchmark looks cool, that's so fast. By the way, would you please wait for all metrics to be fully loaded before taking a screenshot?
LGTM. Good work!
What's changed and what's your intention?

concurrent checkpoint

- We do the initial version of concurrent barriers.
- commit_epoch in order.
- managed_barrier_state in map<epoch, state>.
- Metrics: barrier_nums and barrier_send_latency.
- checkpoint_interval_ms (default: 10).

newly introduced structs and fields

- We use CheckpointControl to control the injection and save the state of barriers.
- is_recovery and is_build_actor can pause barrier injection. The state of barriers is saved in a queue, which keeps commits in order.
- In managed_state we use a map to save the collected actors for different epochs.

Checklist
Refer to a related PR or issue link (optional)
#1156
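The queue-based, in-order commit described in the summary can be sketched with a VecDeque: barriers may complete out of order, but only the completed prefix at the head of the queue is committed. Types and names below are illustrative, not the PR's actual CheckpointControl:

```rust
use std::collections::VecDeque;

enum BarrierState {
    InFlight,
    Completed,
}

struct CheckpointQueue {
    // (epoch, state), oldest barrier first.
    queue: VecDeque<(u64, BarrierState)>,
}

impl CheckpointQueue {
    fn inject(&mut self, epoch: u64) {
        self.queue.push_back((epoch, BarrierState::InFlight));
    }

    /// Mark `epoch` completed, then commit the longest completed prefix,
    /// so epochs are always committed in injection order.
    fn complete(&mut self, epoch: u64) -> Vec<u64> {
        if let Some(entry) = self.queue.iter_mut().find(|(e, _)| *e == epoch) {
            entry.1 = BarrierState::Completed;
        }
        let mut committed = Vec::new();
        while matches!(self.queue.front(), Some((_, BarrierState::Completed))) {
            committed.push(self.queue.pop_front().unwrap().0);
        }
        committed
    }
}

fn main() {
    let mut cc = CheckpointQueue { queue: VecDeque::new() };
    for e in [1, 2, 3] {
        cc.inject(e);
    }
    // Epoch 2 finishes first, but nothing commits until epoch 1 does.
    assert!(cc.complete(2).is_empty());
    assert_eq!(cc.complete(1), vec![1, 2]);
    assert_eq!(cc.complete(3), vec![3]);
    println!("ordered commit ok");
}
```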