[CALCITE-7266] Optimize the "well-known count bug" correction by rubenada · Pull Request #4614 · apache/calcite

rubenada · 2025-11-04T10:03:47Z

core/src/test/java/org/apache/calcite/sql2rel/RelDecorrelatorTest.java

suibianwanwank · 2025-11-04T16:31:03Z

I believe there's way for optimization here, but merely considering LEFT and COUNT doesn't seem sufficient in my view.

Test in sub-query.iq:

SELECT deptno, (SELECT CASE WHEN SUM(sal) > 10 then 'VIP' else 'Regular' END expr
                   FROM emp e
                   WHERE d.deptno = e.deptno) a
FROM dept d;

!ok

Currently, this query will be decorrelated by hepPlanner. Suppose we remove HepPlanner.

--- a/core/src/main/java/org/apache/calcite/sql2rel/RelDecorrelator.java
+++ b/core/src/main/java/org/apache/calcite/sql2rel/RelDecorrelator.java
@@ -247,9 +247,7 @@ public static RelNode decorrelateQuery(RelNode rootRel,
         new RelDecorrelator(corelMap,
             cluster.getPlanner().getContext(), relBuilder);

-    RelNode newRootRel = decorrelationRules == null
-        ? decorrelator.removeCorrelationViaRule(rootRel)
-        : decorrelator.removeCorrelationViaRule(rootRel, decorrelationRules);
+    RelNode newRootRel = rootRel;

     if (SQL2REL_LOGGER.isDebugEnabled()) {
       SQL2REL_LOGGER.debug(
@@ -324,7 +322,7 @@ protected RelNode decorrelate(RelNode root) {
     HepPlanner planner = createPlanner(program);

     planner.setRoot(root);
-    root = planner.findBestExp();
+//    root = planner.findBestExp();
     if (SQL2REL_LOGGER.isDebugEnabled()) {
       SQL2REL_LOGGER.debug("Plan before extracting correlated computations:\n"
           + RelOptUtil.toString(root));

After this PR, the decorrelator will return incorrect results.

DEPTNO | A
--------+---------
      10 | VIP
      20 | VIP
      30 | VIP
      40 | NULL

rubenada · 2025-11-04T18:17:12Z

Thanks for taking a look @suibianwanwank . Maybe I'm doing something wrong, but I'm getting the same results for your sample query with: a) RelDecorrelator disabled, b) RelDecorrelator enabled with current code, c) RelDecorrelator with this PR code:

+--------+---------+
| DEPTNO | A       |
+--------+---------+
|     10 | VIP     |
|     20 | VIP     |
|     30 | VIP     |
|     40 | Regular |
+--------+---------+

PS: I have pushed the test, just to double-check

suibianwanwank · 2025-11-05T02:15:27Z

As mentioned above, this happens because the RBO decorrelates such patterns in advance. However, I believe the RelDecorrelator framework itself should ensure correctness, rather than relying on rules to pre-handle certain bad cases. After all, pattern-based approaches in decorrelation are inherently limited in what they can cover.

core/src/main/java/org/apache/calcite/sql2rel/RelDecorrelator.java

rubenada · 2025-11-05T08:44:56Z

Ok, I understand now what you mean @suibianwanwank , there's indeed a regression with the proposed patch (observable only if we deactivate the decorrelation via rules step).
Thanks for the feedback @iwanttobepowerful .
It's clear that this idea need some rework.... WIP

BTW It seems there was a similar PR on Spark apache/spark#43341 👀 UPDATE: after a closer look, it seems that PR on Spark was to avoid calling the extra join in case of several Aggregates on the same subtree, whereas the current PR idea is looking at the possibility of avoiding it in case of LeftCorrelate.

rubenada · 2025-11-05T15:07:03Z

@iwanttobepowerful I haven't looked in detail, but it seems that Spark uses a slightly different approach to deal with the count bug (at least in some cases), with the usage of this "alwaysTrue" value. Notice that some of the manipulations done by Spark might be done (more or less) in Calcite not by the RelDecorrelator itself, but by certain auxiliary rules called via HepPlanner inside the RelDecorrelator, so in Calcite this process is intermingled among rule transformations + the pure decorrelate algorithm itself (which might be not ideal, as stated by @suibianwanwank above).

I'm not entirely sure, but I have the impression that the "LEFT" approach might be valid if the Aggregate result is not further manipulated (as in the counter-example proposed by @suibianwanwank ), i.e. we could avoid the rewrite if the Correlate is LEFT and the Aggregate is directly its right child (this seems to fix the counter-example).
I've just pushed a new commit with this idea.

suibianwanwank · 2025-11-06T02:51:00Z

core/src/main/java/org/apache/calcite/sql2rel/RelDecorrelator.java

+          // Otherwise call except if this is a LEFT Correlate with the Aggregate being its RHS,
+          // in that case NULL is effectively the same as empty (which promotes NULL on the RHS)
          (!parentPropagatesNullValues
-              && requireNonNull(frameStack.peek()).left.getJoinType() != JoinRelType.LEFT))) {


It seems like it could be optimized this way🤔 in decorrelateRel(Join):

final Frame rightFrame = getInvoke(oldRight, true, rel, parentPropagatesNullValues); //to: final Frame rightFrame = getInvoke(oldRight, true, rel, true);

You mean on decorrelateRel(Correlate) ?
That actually seems to do the trick in a much cleaner way. It maintains the plans adjusted in the PR, does not fail on the counter-example that you proposed on my initial commit, and it also works as expected on my downstream project's tests if I apply it.
I've pushed this change, cleaning up the previous modifications.

I could also add the "counter-example" as a unit test in RelDecorrelatorTest, but it would require some minor adjustments in RelDecorrelator to allow running the decorrelation algorithm without any type of rule prior. It's manageable, adds more flexibility (and can be done in a way to keep things backward-compatible)

LGTM, An additional thought is whether an inner join would also work.

Test pushed

rubenada · 2025-11-11T08:08:36Z

@suibianwanwank @iwanttobepowerful are there other remarks for this change? Shall I squash commits to prepare the merge?

suibianwanwank

LGTM!

sonarqubecloud · 2025-11-11T11:42:47Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
95.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

rubenada force-pushed the CALCITE-7266 branch from 062bd70 to e9f9a19 Compare November 4, 2025 10:39

rubenada changed the title ~~[CALCITE-7266] Optimize the "well-known count bug" fix~~ [CALCITE-7266] Optimize the "well-known count bug" correction Nov 4, 2025

iwanttobepowerful reviewed Nov 4, 2025

View reviewed changes

core/src/test/java/org/apache/calcite/sql2rel/RelDecorrelatorTest.java Show resolved Hide resolved

iwanttobepowerful reviewed Nov 5, 2025

View reviewed changes

core/src/main/java/org/apache/calcite/sql2rel/RelDecorrelator.java Show resolved Hide resolved

suibianwanwank reviewed Nov 6, 2025

View reviewed changes

rubenada marked this pull request as ready for review November 11, 2025 08:06

suibianwanwank approved these changes Nov 11, 2025

View reviewed changes

[CALCITE-7266] Optimize the "well-known count bug" correction

f3e7b9c

rubenada force-pushed the CALCITE-7266 branch from 7a565a0 to f3e7b9c Compare November 11, 2025 11:19

rubenada added the LGTM-will-merge-soon Overall PR looks OK. Only minor things left. label Nov 11, 2025

rubenada merged commit 0f148c7 into apache:main Nov 12, 2025
21 of 36 checks passed

Conversation

rubenada commented Nov 4, 2025

Uh oh!

Uh oh!

suibianwanwank commented Nov 4, 2025

Uh oh!

rubenada commented Nov 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

suibianwanwank commented Nov 5, 2025

Uh oh!

Uh oh!

rubenada commented Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rubenada commented Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

suibianwanwank Nov 6, 2025

Choose a reason for hiding this comment

Uh oh!

rubenada Nov 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rubenada Nov 6, 2025

Choose a reason for hiding this comment

Uh oh!

suibianwanwank Nov 6, 2025

Choose a reason for hiding this comment

Uh oh!

rubenada Nov 6, 2025

Choose a reason for hiding this comment

Uh oh!

rubenada commented Nov 11, 2025

Uh oh!

suibianwanwank left a comment

Choose a reason for hiding this comment

Uh oh!

sonarqubecloud bot commented Nov 11, 2025

Quality Gate passed

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

rubenada commented Nov 4, 2025 •

edited

Loading

rubenada commented Nov 5, 2025 •

edited

Loading

rubenada commented Nov 5, 2025 •

edited

Loading

rubenada Nov 6, 2025 •

edited

Loading