feat(batch): split plan into fragments for local execution mode #3032

Merged
lmatz merged 3 commits into main from lz/local on Jun 13, 2022

Conversation

lmatz
Contributor

@lmatz lmatz commented Jun 7, 2022

What's changed and what's your intention?

Split plans into fragments that can be executed in local execution mode.

Move some test cases into .part so that both distributed and local modes can use them.

More test cases and the remaining support will be added in future PRs.
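
For context, here is a minimal, self-contained sketch of what "splitting a plan into fragments" means here. It is illustrative only, not the actual code in src/frontend/src/scheduler/plan_fragmenter.rs: the plan tree is cut at every Exchange node, each subtree below a cut becomes a separately schedulable fragment, and the part above it stays on the frontend for local execution.

// Illustrative sketch only; the real fragmenter uses its own plan and fragment types.
#[derive(Debug)]
enum PlanNode {
    Filter(Box<PlanNode>),
    Exchange(Box<PlanNode>),
    TableScan(String),
}

#[derive(Debug)]
struct Fragment {
    root: PlanNode,
}

// Cut the plan at every Exchange: the subtree below it becomes a schedulable
// fragment, and a placeholder marks where the fragment's results flow back in.
fn split(node: PlanNode, fragments: &mut Vec<Fragment>) -> PlanNode {
    match node {
        PlanNode::Exchange(child) => {
            let child = split(*child, fragments);
            fragments.push(Fragment { root: child });
            PlanNode::TableScan(format!("exchange#{}", fragments.len() - 1))
        }
        PlanNode::Filter(child) => PlanNode::Filter(Box::new(split(*child, fragments))),
        leaf @ PlanNode::TableScan(_) => leaf,
    }
}

fn main() {
    // A simple point-query-like plan: Filter -> Exchange -> TableScan.
    let plan = PlanNode::Filter(Box::new(PlanNode::Exchange(Box::new(
        PlanNode::TableScan("t1".into()),
    ))));
    let mut fragments = Vec::new();
    let local_root = split(plan, &mut fragments);
    assert_eq!(fragments.len(), 1);
    println!("runs on the frontend: {local_root:?}");
    println!("scheduled fragments: {fragments:?}");
}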

Checklist

  • I have written necessary docs and comments
  • I have added necessary unit tests and integration tests
  • All checks passed in ./risedev check (or alias, ./risedev c)

Refer to a related PR or issue link (optional)

#2978

@codecov

codecov bot commented Jun 7, 2022

Codecov Report

Merging #3032 (80bb184) into main (2aab325) will decrease coverage by 0.05%.
The diff coverage is 4.00%.

@@            Coverage Diff             @@
##             main    #3032      +/-   ##
==========================================
- Coverage   73.72%   73.66%   -0.06%     
==========================================
  Files         739      739              
  Lines      101713   101792      +79     
==========================================
+ Hits        74985    74986       +1     
- Misses      26728    26806      +78     
Flag | Coverage Δ
rust | 73.66% <4.00%> (-0.06%) ⬇️

Flags with carried forward coverage won't be shown.

Impacted Files | Coverage Δ
src/batch/src/rpc/service/task_service.rs | 0.00% <0.00%> (ø)
src/frontend/src/handler/query.rs | 0.00% <0.00%> (ø)
src/frontend/src/scheduler/local.rs | 0.00% <0.00%> (ø)
src/frontend/src/scheduler/plan_fragmenter.rs | 93.92% <0.00%> (-0.72%) ⬇️
src/frontend/src/scheduler/task_context.rs | 0.00% <0.00%> (ø)
src/frontend/src/session.rs | 45.47% <33.33%> (-0.21%) ⬇️
src/frontend/src/optimizer/mod.rs | 94.17% <66.66%> (-0.38%) ⬇️
src/meta/src/hummock/mock_hummock_meta_client.rs | 42.39% <0.00%> (-1.09%) ⬇️
src/meta/src/barrier/mod.rs | 69.13% <0.00%> (-0.33%) ⬇️
src/storage/src/hummock/local_version_manager.rs | 84.12% <0.00%> (-0.16%) ⬇️


} else {
    // We should only have one child stage of the root stage for now.
    assert_eq!(second_stage_id.len(), 1);
    let second_stage_id = second_stage_id.iter().next().unwrap();
Contributor

The root stage may have more than one child stage. Consider the following SQL:
select * from t1, t2 where t1.a = t2.a. In this case the plan looks like

              HashJoin
             /        \
      Exchange        Exchange
          |               |
     TableScan        TableScan

You can refer to https://github.com/singularity-data/risingwave/blob/408e9fb5249b12b1b457287adc4deba13c301f18/src/frontend/src/scheduler/distributed/stage.rs#L354 for the mapping between child plan fragments and exchange ids.
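
A minimal sketch of the generalization being suggested, using made-up names (StageId, ExchangeId, PlanFragment here are illustrative, not the scheduler's real types): instead of asserting a single child stage, iterate over all child stages of the root and build an exchange-to-fragment mapping.

use std::collections::HashMap;

// Illustrative type aliases; the real scheduler defines its own identifiers.
type StageId = u32;
type ExchangeId = u64;

#[derive(Debug)]
struct PlanFragment {
    stage_id: StageId,
}

// Map each exchange under the root to the fragment produced by the
// corresponding child stage, instead of assuming there is exactly one.
fn map_exchanges(child_stages: &[(ExchangeId, StageId)]) -> HashMap<ExchangeId, PlanFragment> {
    child_stages
        .iter()
        .map(|&(exchange_id, stage_id)| (exchange_id, PlanFragment { stage_id }))
        .collect()
}

fn main() {
    // A hash join under the root yields two exchanges, hence two child stages.
    let sources = map_exchanges(&[(1, 10), (2, 11)]);
    assert_eq!(sources.len(), 2);
    println!("{sources:?}");
}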

Contributor Author

@lmatz lmatz Jun 8, 2022

I thought select * from t1, t2 where t1.a = t2.a was not considered a point query and would therefore be executed in the normal distributed mode instead of local execution mode. 🤔

The example from the quip doc:
SELECT pk, t1.a, t1.fk, t2.b FROM t1, t2 WHERE t1.fk = t2.pk AND t1.pk = 114514
is a point query because t1.pk = 114514 is specified, so it uses a lookup join instead of a hash join.

Contributor

Choosing the appropriate plan should be the optimizer's responsibility, and the scheduler should not make such strong assumptions about the plan. The local execution mode is optimized for point queries, but that doesn't mean the user can't execute a hash join or sort-merge join.
The example SELECT pk, t1.a, t1.fk, t2.b FROM t1, t2 WHERE t1.fk = t2.pk AND t1.pk = 114514 picks a lookup join only when t2.pk is a primary key or has an index on it, and chooses a hash join when there is no index.
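
A tiny sketch of the rule described above, with made-up names rather than the optimizer's actual code: a lookup join is chosen only when the inner side's join key is a primary key or indexed; otherwise the plan falls back to a hash join.

#[derive(Debug, PartialEq)]
enum JoinStrategy {
    LookupJoin,
    HashJoin,
}

// Inner join key indexed or a primary key => lookup join; otherwise hash join.
fn choose_join(inner_key_indexed_or_pk: bool) -> JoinStrategy {
    if inner_key_indexed_or_pk {
        JoinStrategy::LookupJoin
    } else {
        JoinStrategy::HashJoin
    }
}

fn main() {
    assert_eq!(choose_join(true), JoinStrategy::LookupJoin);
    assert_eq!(choose_join(false), JoinStrategy::HashJoin);
}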

@lmatz lmatz force-pushed the lz/local branch 2 times, most recently from 5e7bfa4 to a355dfd on June 11, 2022 at 11:52
@lmatz
Contributor Author

lmatz commented Jun 11, 2022

Added two more tests for local mode, i.e. join and range_scan.
join has two stages on the same level below the root stage.

@lmatz
Contributor Author

lmatz commented Jun 11, 2022

Will support MergeExchange in a separate PR.

@lmatz lmatz requested a review from liurenjie1024 June 13, 2022 04:12
@lmatz lmatz enabled auto-merge (squash) June 13, 2022 09:59
@lmatz lmatz merged commit 3bca26e into main Jun 13, 2022
@lmatz lmatz deleted the lz/local branch June 13, 2022 10:05
@chinawch007

Excuse me, what's the meaning of local execution mode?

@lmatz
Contributor Author

lmatz commented Jun 25, 2022

Excuse me, what's the meaning of local execution mode?

Some operators are executed directly on the frontend node instead of on the compute nodes.
The downstream stages of an exchange operator are scheduled by the exchange operator itself instead of by the scheduler on the frontend node.

Only queries with certain (relatively simple) execution plans are classified as local-execution-mode queries. These queries typically run in a few dozen milliseconds, so reducing the number of RPCs becomes profitable.
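
A rough sketch of the classification idea described above. The fields and threshold here are made up for illustration and are not RisingWave's actual logic: simple plans whose scans are point lookups run in local mode, everything else goes through the distributed scheduler.

#[derive(Debug, PartialEq)]
enum QueryMode {
    Local,
    Distributed,
}

struct PlanSummary {
    // Number of stages produced by the fragmenter.
    num_stages: usize,
    // Whether every scan is a point lookup on a primary key or index.
    all_point_lookups: bool,
}

// Simple, short-running plans are worth running from the frontend directly,
// because the scheduler RPCs saved dominate their total latency.
fn choose_mode(plan: &PlanSummary) -> QueryMode {
    if plan.all_point_lookups && plan.num_stages <= 2 {
        QueryMode::Local
    } else {
        QueryMode::Distributed
    }
}

fn main() {
    let point_query = PlanSummary { num_stages: 2, all_point_lookups: true };
    assert_eq!(choose_mode(&point_query), QueryMode::Local);

    let big_join = PlanSummary { num_stages: 5, all_point_lookups: false };
    assert_eq!(choose_mode(&big_join), QueryMode::Distributed);
}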
