[CALCITE-6749] RelMdUtil#setAggChildKeys may return an incorrect result by rubenada · Pull Request #4115 · apache/calcite

rubenada · 2024-12-27T14:56:21Z

see https://issues.apache.org/jira/browse/CALCITE-6749

rubenada · 2024-12-27T14:57:19Z

As a consequence of the fix, there are a couple of plan changes on *.iq tests, but the result is still correct.

mihaibudiu · 2024-12-27T19:11:03Z

core/src/test/java/org/apache/calcite/rel/metadata/RelMdUtilTest.java

+  /** Test case for
+   * <a href="https://issues.apache.org/jira/browse/CALCITE-6749">[CALCITE-6749]
+   * RelMdUtil#setAggChildKeys may return an incorrect result</a>. */
+  @Test void testSetAggChildKeys() {


Frankly, this is hard to review.
I saw the discussion in the JIRA, but it's not obvious at all that "9" is the right answer.
I wonder whether having the full plan here would actually be easier - at least in a comment.

Basically we need to convert: "field index X on the Aggregate corresponds to which field index Y on the Aggregate's input?"
If we have this plan

LogicalAggregate(group=[{0}], EXPR$1=[COUNT(DISTINCT $1)]) LogicalProject(DEPTNO=[$9], JOB=[$2]) LogicalJoin(condition=[=($7, $9)], joinType=[right]) LogicalTableScan(table=[[CATALOG, SALES, EMP]]) LogicalTableScan(table=[[CATALOG, SALES, DEPT]])

The Aggregate rowType has two fields (0 and 1), the first one (index 0) corresponds to the group 0, and the second one (index 1) corresponds to the argCall COUNT(DISTINCT $1). In this case, if we want to convert into the input's, we'll have 0 => 0, and 1 => 1; there is no problem on the current code.

However, if we have this equivalent plan (same query plan, but without the intermediate Project):

LogicalAggregate(group=[{9}], EXPR$1=[COUNT(DISTINCT $2)]) LogicalJoin(condition=[=($7, $9)], joinType=[right]) LogicalTableScan(table=[[CATALOG, SALES, EMP]]) LogicalTableScan(table=[[CATALOG, SALES, DEPT]])

Now the Aggregate still has two fields (0 and 1), the first one (index 0) corresponds to the group 9, and the second one (index 1) corresponds to the argCall COUNT(DISTINCT $2). In this case, if we want to convert into the input's, we'll have 0 => 9, and 1 => 2; the current code will work fine only for the argCall conversion, but not for the group conversion.

Notice I'm just applying here the same conversion that exists already e.g. when propagating the RelMdColumnOrigins computation past an Aggregate (in terms of our second example: "if we want to get origin of the column 0 of the Aggregate, we need to compute the origin of the column 9 of the Aggregate's input"). The auxiliary method RelMdUtil.setAggChildKeys (used by RelMdDistinctRowCount and RelMdPopulationSize for Aggregates) should behave in the same way as RelMdColumnOrigin for this conversion.

core/src/main/java/org/apache/calcite/rel/metadata/RelMdUtil.java

sonarqubecloud · 2025-01-07T08:58:58Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
100.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

mihaibudiu reviewed Dec 27, 2024

View reviewed changes

arkanovicz reviewed Jan 2, 2025

View reviewed changes

core/src/main/java/org/apache/calcite/rel/metadata/RelMdUtil.java Show resolved Hide resolved

mihaibudiu approved these changes Jan 6, 2025

View reviewed changes

mihaibudiu added the LGTM-will-merge-soon Overall PR looks OK. Only minor things left. label Jan 6, 2025

[CALCITE-6749] RelMdUtil#setAggChildKeys may return an incorrect result

848d144

rubenada force-pushed the CALCITE-6749 branch from 86c6c91 to 848d144 Compare January 7, 2025 08:16

rubenada merged commit c686e2e into apache:main Jan 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CALCITE-6749] RelMdUtil#setAggChildKeys may return an incorrect result#4115

[CALCITE-6749] RelMdUtil#setAggChildKeys may return an incorrect result#4115
rubenada merged 1 commit intoapache:mainfrom
rubenada:CALCITE-6749

rubenada commented Dec 27, 2024

Uh oh!

rubenada commented Dec 27, 2024

Uh oh!

mihaibudiu Dec 27, 2024

Uh oh!

rubenada Dec 30, 2024 •

edited

Loading

Uh oh!

Uh oh!

sonarqubecloud bot commented Jan 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

rubenada commented Dec 27, 2024

Uh oh!

rubenada commented Dec 27, 2024

Uh oh!

mihaibudiu Dec 27, 2024

Choose a reason for hiding this comment

Uh oh!

rubenada Dec 30, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sonarqubecloud bot commented Jan 7, 2025

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

rubenada Dec 30, 2024 •

edited

Loading