Stop copying LogicalPlan and Exprs in `TypeCoercion` (10% faster planning) #10356

alamb · 2024-05-02T19:18:07Z

~~Note it has code from #10410 so that might good to review first~~

Which issue does this PR close?

Part of #9637 -- let's make DataFusion planning faster by not copying so much

Rationale for this change

Now that we have the nice TreeNode API thanks to #8913 and @peter-toth let's use it to both simplify the code and avoid copies

What changes are included in this PR?

rewrite TypeCoercion via TreeNode API
Introduce LogicalPlan::recompute_schema to recompute the schema after expressions in a plan are changed

Are these changes tested?

Existing CI

Are there any user-facing changes?

Faster planning:

12% faster TPCH planning
10% faster TPCDS planning

Details

group                                         main                                   type_coercion
-----                                         ----                                   -------------
logical_aggregate_with_join                   1.00  1221.8±12.51µs        ? ?/sec    1.00  1220.4±11.13µs        ? ?/sec
logical_plan_tpcds_all                        1.00    160.3±1.91ms        ? ?/sec    1.00    159.5±1.81ms        ? ?/sec
logical_plan_tpch_all                         1.02     17.4±0.22ms        ? ?/sec    1.00     17.0±0.20ms        ? ?/sec
logical_select_all_from_1000                  1.00     18.7±0.10ms        ? ?/sec    1.01     18.9±0.17ms        ? ?/sec
logical_select_one_from_700                   1.00   808.2±23.16µs        ? ?/sec    1.01   817.7±10.21µs        ? ?/sec
logical_trivial_join_high_numbered_columns    1.00    763.6±7.80µs        ? ?/sec    1.00   763.1±13.86µs        ? ?/sec
logical_trivial_join_low_numbered_columns     1.00   748.3±10.69µs        ? ?/sec    1.00   747.1±15.11µs        ? ?/sec
physical_plan_tpcds_all                       1.12  1509.7±10.59ms        ? ?/sec    1.00   1350.0±8.12ms        ? ?/sec
physical_plan_tpch_all                        1.10    102.2±1.80ms        ? ?/sec    1.00     93.1±1.49ms        ? ?/sec
physical_plan_tpch_q1                         1.09      5.7±0.07ms        ? ?/sec    1.00      5.2±0.06ms        ? ?/sec
physical_plan_tpch_q10                        1.06      4.8±0.10ms        ? ?/sec    1.00      4.5±0.05ms        ? ?/sec
physical_plan_tpch_q11                        1.05      4.2±0.10ms        ? ?/sec    1.00      4.0±0.08ms        ? ?/sec
physical_plan_tpch_q12                        1.07      3.4±0.06ms        ? ?/sec    1.00      3.2±0.06ms        ? ?/sec
physical_plan_tpch_q13                        1.04      2.3±0.06ms        ? ?/sec    1.00      2.2±0.05ms        ? ?/sec
physical_plan_tpch_q14                        1.03      3.0±0.05ms        ? ?/sec    1.00      2.9±0.06ms        ? ?/sec
physical_plan_tpch_q16                        1.06      4.1±0.06ms        ? ?/sec    1.00      3.9±0.07ms        ? ?/sec
physical_plan_tpch_q17                        1.04      3.8±0.05ms        ? ?/sec    1.00      3.7±0.05ms        ? ?/sec
physical_plan_tpch_q18                        1.07      4.3±0.06ms        ? ?/sec    1.00      4.0±0.06ms        ? ?/sec
physical_plan_tpch_q19                        1.28      8.1±0.11ms        ? ?/sec    1.00      6.3±0.08ms        ? ?/sec
physical_plan_tpch_q2                         1.12      8.8±0.08ms        ? ?/sec    1.00      7.9±0.08ms        ? ?/sec
physical_plan_tpch_q20                        1.10      5.1±0.09ms        ? ?/sec    1.00      4.6±0.09ms        ? ?/sec
physical_plan_tpch_q21                        1.10      6.9±0.10ms        ? ?/sec    1.00      6.3±0.07ms        ? ?/sec
physical_plan_tpch_q22                        1.08      3.8±0.09ms        ? ?/sec    1.00      3.5±0.09ms        ? ?/sec
physical_plan_tpch_q3                         1.04      3.4±0.06ms        ? ?/sec    1.00      3.3±0.06ms        ? ?/sec
physical_plan_tpch_q4                         1.06      2.5±0.05ms        ? ?/sec    1.00      2.3±0.04ms        ? ?/sec
physical_plan_tpch_q5                         1.12      5.1±0.09ms        ? ?/sec    1.00      4.6±0.09ms        ? ?/sec
physical_plan_tpch_q6                         1.13  1863.3±56.21µs        ? ?/sec    1.00  1643.3±80.88µs        ? ?/sec
physical_plan_tpch_q7                         1.13      6.5±0.11ms        ? ?/sec    1.00      5.8±0.11ms        ? ?/sec
physical_plan_tpch_q8                         1.11      8.4±0.10ms        ? ?/sec    1.00      7.6±0.06ms        ? ?/sec
physical_plan_tpch_q9                         1.09      6.3±0.09ms        ? ?/sec    1.00      5.8±0.06ms        ? ?/sec
physical_select_all_from_1000                 1.45     88.7±0.40ms        ? ?/sec    1.00     61.3±0.45ms        ? ?/sec
physical_select_one_from_700                  1.05      3.9±0.05ms        ? ?/sec    1.00      3.7±0.05ms        ? ?/sec

alamb · 2024-05-02T19:19:14Z

datafusion/expr/src/logical_plan/plan.rs

@@ -467,6 +468,200 @@ impl LogicalPlan {
        self.with_new_exprs(self.expressions(), inputs.to_vec())
    }

+    /// Recomputes schema and type information for this LogicalPlan if needed.


I believe this is a new API for using TreeNode to rewrite plans in ways that change the schema.

This effectively factors out the recalculation part of LogicalPlan::new_with_exprs

I tried to find a way to use reuse this logic in LogicalPlan::new_with_exprs but was not able to without forcing (another) clone

FYI @peter-toth I suspect you may need something like this for common subexpression elimination / #9873

datafusion/expr/src/logical_plan/plan.rs

peter-toth · 2024-05-03T08:15:11Z

datafusion/optimizer/src/analyzer/type_coercion.rs

+            .map_data(|expr| original_name.restore(expr))
+    })?
+    // coerce join expressions specially
+    .map_data(|plan| expr_rewrite.coerce_joins(plan))?


Since expr_rewrite.coerce_joins(plan) can change the plan, shouldn't its result be Result<Transformed<LogicalPlan>>? And then here we should probably use map_transformed() instead of the current map_data().

for anyone following along, the response is https://github.com/apache/datafusion/pull/10356/files#r1588998665 (tldr should do as a follow on PR)

peter-toth · 2024-05-03T08:17:57Z

datafusion/optimizer/src/analyzer/type_coercion.rs

+    // coerce join expressions specially
+    .map_data(|plan| expr_rewrite.coerce_joins(plan))?
+    // recompute the schema after the expressions have been rewritten as the types may have changed
+    .map_data(|plan| plan.recompute_schema())


Do we always need to run plan.recompute_schema()? If the Transformed<LogicalPlan>'s .transformed is false then probably we don't need to.

This is an excellent point. At the moment, I think we do need to always run recompute_schema because the TypeCoercionRewriter doesn't return Transformed (and thus we don't know if any actual expression coercion was done, so we have to assume it was).

I filed #10365 to track improving this

Hmm, I think you use TypeCoercionRewriter in expr.rewrite(&mut expr_rewrite)? and that rewrite() returns Transformed<Expr> and then that Transformed<Expr> is propagated up into plan.map_expressions(), that returns Transformed<LogicalPlan>. So you have the necessary Transformed to decide if recompute_schema() is needed. Or not? 🙂

You are correct (of course!) thank you for pointing it out. Now that analyze_internal returns Transformed would work. However, there is still code like this:

let new_plan = analyze_internal(self.schema, unwrap_arc(subquery.subquery))?.data; Ok(Transformed::yes(Expr::Exists(Exists { subquery: Subquery { subquery: Arc::new(new_plan), outer_ref_columns: subquery.outer_ref_columns, }, negated, }))) }

Which discards the transformed information (and in this case always returns Transformed::true).

In order to keep the PRs small and easier to review I would like to not change this PR (it is no worse than main in regards to recomputing schema) and I will make a follow on PR to avoid recomputing schema when unecessary

Ah ok, it seems there are many unnecessary Transformed::yess in the current code. But false positive transformeds doesn't cause any issue...

Sure, a follow-up PR sounds good, I agree that this PR already looks really nice!

Here is my draft followup: #10369

It is quite large (it requires updating the entire expression rewriter) so I am glad we left it in a separate PR

comphead · 2024-05-12T00:27:28Z

datafusion/optimizer/src/analyzer/type_coercion.rs

    // get schema representing all available input fields. This is used for data type
    // resolution only, so order does not matter here
-    let mut schema = merge_schema(new_inputs.iter().collect());
+    let mut schema = merge_schema(plan.inputs());


datafusion/optimizer/src/analyzer/type_coercion.rs

comphead · 2024-05-12T00:33:47Z

datafusion/optimizer/src/analyzer/type_coercion.rs

+            .map(|(lhs, rhs)| {
+                // coerce the arguments as though they were a single binary equality
+                // expression
+                let (lhs, rhs) = self.coerce_binary_op(lhs, Operator::Eq, rhs)?;


I'm not sure if this method needed, as it looks like we just cast lhs, rhs? it feels it can be simplified?

I think coerce_binary_op is different than just casting lhs and rhs as it first calls get_input_types:

let (left_type, right_type) = get_input_types( &left.get_type(self.schema)?, &op, &right.get_type(self.schema)?, )?;

And get_input_types usese the comparison coercion rules to figure out a common set if types to coerce lhs and rhs to.

Co-authored-by: Oleks V <comphead@users.noreply.github.com>

…o alamb/type_coercion

alamb · 2024-05-15T18:55:04Z

@comphead I think this PR is ready to go. Would you be willing to approve it? Or are there other comments you would like to see addressed?

comphead · 2024-05-15T18:59:51Z

datafusion/optimizer/src/analyzer/type_coercion.rs

+    /// For example, on_exprs like `t1.a = t2.b AND t1.x = t2.y` will be stored
+    /// as a list of `(t1.a, t2.b), (t1.x, t2.y)`
+    fn coerce_joins(&mut self, plan: LogicalPlan) -> Result<LogicalPlan> {
+        let LogicalPlan::Join(mut join) = plan else {


thats an interesting syntax

it checks the plan can be deconstructed into LogicalPlan::Join(...) and if its not the else branch is triggered?

that is exactly right. It is one of my favorite Rust syntax's as it often can avoid a level of indenting

https://doc.rust-lang.org/rust-by-example/flow_control/let_else.html

comphead

lgtm thanks @alamb!

alamb

Thank you @comphead 🙏

alamb · 2024-05-15T19:57:58Z

datafusion/optimizer/src/analyzer/type_coercion.rs

+    /// For example, on_exprs like `t1.a = t2.b AND t1.x = t2.y` will be stored
+    /// as a list of `(t1.a, t2.b), (t1.x, t2.y)`
+    fn coerce_joins(&mut self, plan: LogicalPlan) -> Result<LogicalPlan> {
+        let LogicalPlan::Join(mut join) = plan else {


that is exactly right. It is one of my favorite Rust syntax's as it often can avoid a level of indenting

https://doc.rust-lang.org/rust-by-example/flow_control/let_else.html

github-actions bot added logical-expr Logical plan and expressions optimizer Optimizer rules labels May 2, 2024

alamb force-pushed the alamb/type_coercion branch from 66ff8d5 to 5c1f2c4 Compare May 2, 2024 19:26

alamb mentioned this pull request May 2, 2024

Avoid copies in TypeCoercion via TreeNode API #10039

Closed

alamb commented May 2, 2024

View reviewed changes

alamb marked this pull request as ready for review May 2, 2024 20:26

alamb changed the title ~~Stop copying LogicalPlan and Exprs in TypeCoercion~~ Stop copying LogicalPlan and Exprs in TypeCoercion (10% faster planning) May 2, 2024

alamb commented May 2, 2024

View reviewed changes

datafusion/expr/src/logical_plan/plan.rs Outdated Show resolved Hide resolved

alamb mentioned this pull request May 3, 2024

[Epic] A collection of issues to improve planning performance / speed / efficiency #5637

Open

15 tasks

peter-toth reviewed May 3, 2024

View reviewed changes

This was referenced May 3, 2024

Onyl recompute schema in TypeCoercion when necessary #10365

Open

Only recompute schema in TypeCoercion when necessary #10369

Draft

DataFusion weekly project plan (Andrew Lamb) - May 6, 2024 #10395

Closed

Add LogicalPlan::recompute_schema for handling rewrite passes

826d51f

alamb mentioned this pull request May 7, 2024

Add LogicalPlan::recompute_schema for handling rewrite passes #10410

Closed

Stop copying LogicalPlan and Exprs in TypeCoercion

5ed976b

alamb force-pushed the alamb/type_coercion branch from 3105658 to 5ed976b Compare May 7, 2024 15:15

Merge remote-tracking branch 'apache/main' into alamb/type_coercion

602b90f

github-actions bot removed the logical-expr Logical plan and expressions label May 10, 2024

comphead reviewed May 12, 2024

View reviewed changes

datafusion/optimizer/src/analyzer/type_coercion.rs Outdated Show resolved Hide resolved

comphead reviewed May 12, 2024

View reviewed changes

alamb and others added 3 commits May 13, 2024 11:38

Merge remote-tracking branch 'apache/main' into alamb/type_coercion

0e87fb3

Apply suggestions from code review

41ecf4b

Co-authored-by: Oleks V <comphead@users.noreply.github.com>

Merge branch 'alamb/type_coercion' of github.com:alamb/datafusion int…

b43a345

…o alamb/type_coercion

comphead reviewed May 15, 2024

View reviewed changes

comphead approved these changes May 15, 2024

View reviewed changes

alamb commented May 15, 2024

View reviewed changes

alamb merged commit c312ffe into apache:main May 15, 2024
23 checks passed

alamb deleted the alamb/type_coercion branch May 15, 2024 19:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stop copying LogicalPlan and Exprs in `TypeCoercion` (10% faster planning) #10356

Stop copying LogicalPlan and Exprs in `TypeCoercion` (10% faster planning) #10356

alamb commented May 2, 2024 •

edited

alamb May 2, 2024 •

edited

alamb May 2, 2024

peter-toth May 3, 2024

alamb May 13, 2024 •

edited

peter-toth May 3, 2024 •

edited

alamb May 3, 2024 •

edited

peter-toth May 3, 2024 •

edited

alamb May 3, 2024

peter-toth May 3, 2024

alamb May 3, 2024

comphead May 12, 2024

comphead May 12, 2024

alamb May 13, 2024

alamb commented May 15, 2024

comphead May 15, 2024

comphead May 15, 2024

alamb May 15, 2024

comphead left a comment

alamb left a comment

alamb May 15, 2024

Stop copying LogicalPlan and Exprs in TypeCoercion (10% faster planning) #10356

Stop copying LogicalPlan and Exprs in TypeCoercion (10% faster planning) #10356

Conversation

alamb commented May 2, 2024 • edited

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

alamb May 2, 2024 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

alamb May 13, 2024 • edited

Choose a reason for hiding this comment

peter-toth May 3, 2024 • edited

Choose a reason for hiding this comment

alamb May 3, 2024 • edited

Choose a reason for hiding this comment

peter-toth May 3, 2024 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

alamb commented May 15, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

comphead left a comment

Choose a reason for hiding this comment

alamb left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Stop copying LogicalPlan and Exprs in `TypeCoercion` (10% faster planning) #10356

Stop copying LogicalPlan and Exprs in `TypeCoercion` (10% faster planning) #10356

alamb commented May 2, 2024 •

edited

alamb May 2, 2024 •

edited

alamb May 13, 2024 •

edited

peter-toth May 3, 2024 •

edited

alamb May 3, 2024 •

edited

peter-toth May 3, 2024 •

edited