[GLUTEN-12172][CH] Fix group limit first array result offset by lgbo-ustc · Pull Request #12173 · apache/gluten

lgbo-ustc · 2026-05-28T12:04:32Z

What changes are proposed in this pull request?

RowNumGroupArraySorted writes aggregate results into a newly created ColumnArray. For the first output row the array offsets vector can be empty, but insertResultInto read result_array_offsets.back() before appending the first offset. That is undefined behavior and can crash when aggregate top-k writes its first result.

Treat an empty offsets vector as having previous offset 0 before appending the next cumulative offset. Add a ClickHouse backend regression test that forces row_number top-k through the aggregate group limit path and validates the first array result row against vanilla Spark.

closed #12172

How was this patch tested?

UTs

Was this patch authored or co-authored using generative AI tooling?

co-authored using generative AI tooling

RowNumGroupArraySorted writes aggregate results into a newly created ColumnArray. For the first output row the array offsets vector can be empty, but insertResultInto read result_array_offsets.back() before appending the first offset. That is undefined behavior and can crash when aggregate top-k writes its first result. Treat an empty offsets vector as having previous offset 0 before appending the next cumulative offset. Add a ClickHouse backend regression test that forces row_number top-k through the aggregate group limit path and validates the first array result row against vanilla Spark.

github-actions · 2026-05-28T12:13:31Z

Run Gluten Clickhouse CI on x86

github-actions · 2026-05-29T05:59:29Z

Run Gluten Clickhouse CI on x86

zzcclp

LGTM

github-actions Bot added the CLICKHOUSE label May 28, 2026

[CH] Stabilize group limit empty offsets test

b598112

zzcclp approved these changes May 29, 2026

View reviewed changes

zzcclp merged commit d87c2b0 into main May 29, 2026
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GLUTEN-12172][CH] Fix group limit first array result offset#12173

[GLUTEN-12172][CH] Fix group limit first array result offset#12173
zzcclp merged 2 commits into
mainfrom
bug_group_limit_empty_offsets

lgbo-ustc commented May 28, 2026

Uh oh!

github-actions Bot commented May 28, 2026

Uh oh!

github-actions Bot commented May 29, 2026

Uh oh!

zzcclp left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

lgbo-ustc commented May 28, 2026

What changes are proposed in this pull request?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

github-actions Bot commented May 28, 2026

Uh oh!

github-actions Bot commented May 29, 2026

Uh oh!

zzcclp left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants