Skip to content

[GLUTEN-8432][CH]Remove duplicate output attributes of aggregate's child#8450

Merged
lgbo-ustc merged 4 commits intoapache:mainfrom
bigo-sg:bug_8432
Jan 17, 2025
Merged

[GLUTEN-8432][CH]Remove duplicate output attributes of aggregate's child#8450
lgbo-ustc merged 4 commits intoapache:mainfrom
bigo-sg:bug_8432

Conversation

@lgbo-ustc
Copy link
Contributor

@lgbo-ustc lgbo-ustc commented Jan 7, 2025

What changes were proposed in this pull request?

(Please fill in changes proposed in this fix)

Fixes: #8432

CH doesn' support duplicate columns in a block. Normally, there is no duplicate attributes in the output of an aggregate's child. There is one exception, when AQE is enabled, an shuffle resutl may be reused. If one of the aggregate function is count(distinct) and its arguments contain the grouping keys, duplicate attributes would be introduced. And we cannot erase the duplicate attributes by #8164, since the rule cannot be applied.

How was this patch tested?

(Please explain how this patch was tested. E.g. unit tests, integration tests, manual tests)

manual tests

(If this patch involves UI changes, please attach a screenshot; otherwise, remove this)

@github-actions
Copy link

github-actions bot commented Jan 7, 2025

#8432

@github-actions
Copy link

github-actions bot commented Jan 7, 2025

Run Gluten Clickhouse CI on x86

@lgbo-ustc lgbo-ustc changed the title [GLUTEN-8432][CH]Remove duplicated output attributes of aggregate's child [GLUTEN-8432][CH]Remove duplicate output attributes of aggregate's child Jan 7, 2025
@github-actions
Copy link

github-actions bot commented Jan 7, 2025

Run Gluten Clickhouse CI on x86

@zzcclp
Copy link
Contributor

zzcclp commented Jan 7, 2025

It better to add some test cases for this scenario?

@github-actions
Copy link

Run Gluten ClickHouse CI on ARM

1 similar comment
@github-actions
Copy link

Run Gluten ClickHouse CI on ARM

@github-actions
Copy link

Run Gluten ClickHouse CI on ARM

@github-actions
Copy link

Run Gluten ClickHouse CI on ARM

@github-actions
Copy link

Run Gluten ClickHouse CI on ARM

@github-actions
Copy link

Run Gluten ClickHouse CI on ARM

@github-actions
Copy link

Run Gluten ClickHouse CI on ARM

Copy link
Contributor

@zzcclp zzcclp left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@lgbo-ustc lgbo-ustc merged commit 17d017e into apache:main Jan 17, 2025
5 checks passed
baibaichen pushed a commit to baibaichen/gluten that referenced this pull request Feb 1, 2025
…ild (apache#8450)

* Try to remove grouping keys in arguments of count(distinct)

* adjust child's output in CHHashAggregateExecTransformer

* update

* update
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[CH] org.apache.gluten.exception.GlutenException: Not found column 8Ð'ï

2 participants