Fix aggregate group by count by mijoharas · Pull Request #645 · pgdogdev/pgdog

mijoharas · 2025-12-03T17:07:50Z

Addresses #638.

I had a bit of spare time, and was curious about this.

So, it turns out the issue is, most of the time, an incorrectly parsed group_by. The group_by expects an index into the target_list, so I made that work for the cases where the select and the group by clauses use the exact same table_name.column_name or the same column_name.

This feels a little bit janky to me, but 🤷 I figured I'd raise it for feedback, especially as I see a draft PR for a rewritten query engine.
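To make the "index into the target_list" idea concrete, here is a minimal sketch of that kind of lookup. It is not the code from this PR: `ColumnRef`, `Target`, and `group_by_index` are hypothetical names invented for illustration, and the matching rule (an unqualified select column matches any qualifier in the group by, but a qualified select column needs the same qualifier) is an assumption inferred from the session output in this description.

```rust
/// A column reference such as `user_id` or `example.user_id`.
/// Hypothetical type for illustration; not pgdog's actual AST.
#[derive(Debug, Clone, PartialEq)]
struct ColumnRef {
    table: Option<String>,
    name: String,
}

impl ColumnRef {
    fn new(table: Option<&str>, name: &str) -> Self {
        Self {
            table: table.map(str::to_string),
            name: name.to_string(),
        }
    }
}

/// One entry of the SELECT target list: a plain column, or anything else
/// (an aggregate such as `count(1)`, an expression, and so on).
#[derive(Debug, Clone, PartialEq)]
enum Target {
    Column(ColumnRef),
    Other,
}

/// Resolve a GROUP BY column to its zero-based position in the target list.
///
/// Assumed rule: names must be equal, and the target must either carry no
/// table qualifier or carry the same qualifier as the GROUP BY column.
/// Returns the first matching position, or `None` if nothing matches.
fn group_by_index(targets: &[Target], group: &ColumnRef) -> Option<usize> {
    targets.iter().position(|target| match target {
        Target::Column(col) => {
            col.name == group.name && (col.table.is_none() || col.table == group.table)
        }
        Target::Other => false,
    })
}
```

Under these assumptions, `select count(1), user_id ... group by example.user_id` resolves to index 1, while `select example.user_id, count(1) ... group by user_id` resolves to nothing, which lines up with the one failing qualifier combination in the session.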

(michael)@127.0.0.1:6432 16:54:06 [repro_sharded]
> select count(1), user_id from example group by example.user_id;
 count | user_id
-------+---------
     6 |       1
     3 |       2
     4 |       3
(3 rows)

(michael)@127.0.0.1:6432 16:54:07 [repro_sharded]
> select count(1), example.user_id from example group by example.user_id;
 count | user_id
-------+---------
     6 |       1
     3 |       2
     4 |       3
(3 rows)

(michael)@127.0.0.1:6432 16:54:14 [repro_sharded]
> select example.user_id, count(1), example.user_id from example group by example.user_id;
unexpected field count in "D" message
(michael)@127.0.0.1:6432 16:54:21 [repro_sharded]
> select example.user_id, count(1) from example group by example.user_id;
 user_id | count
---------+-------
       1 |     6
       2 |     3
       3 |     4
(3 rows)

(michael)@127.0.0.1:6432 16:54:28 [repro_sharded]
> select user_id, count(1) from example group by example.user_id;
 user_id | count
---------+-------
       3 |     4
       1 |     6
       2 |     3
(3 rows)

(michael)@127.0.0.1:6432 16:54:32 [repro_sharded]
> select example.user_id, count(1) from example group by user_id;
 user_id | count
---------+-------
         |    13
(1 row)

(michael)@127.0.0.1:6432 16:54:37 [repro_sharded]
> select user_id, count(1) from example group by user_id;
 user_id | count
---------+-------
       3 |     4
       1 |     6
       2 |     3
(3 rows)

See the session above for results, including some cases where it still fails to work correctly: where we duplicate the column in the select, and where we specify inconsistent example.user_id / user_id. (NOTE: it only fails if we're more specific in the select than in the group_by.)
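One plausible reading of the single `| 13` row is that, when no group-by index is resolved, every partial count from every shard collapses into one group (6 + 3 + 4 = 13). That is an inference from the output, not a confirmed diagnosis. The sketch below shows the effect with a hypothetical `merge_counts` helper and made-up per-shard rows chosen so the totals match the session; none of it is pgdog code.

```rust
use std::collections::BTreeMap;

/// Merge partial `count` rows returned by several shards.
///
/// Each row is a list of integer columns. `group_by` is the resolved index of
/// the grouping column, if any; `count_idx` is the index of the count column.
/// With `group_by == None`, all rows share the key `None` and are summed
/// together into a single group.
fn merge_counts(
    shards: &[Vec<Vec<i64>>],
    group_by: Option<usize>,
    count_idx: usize,
) -> BTreeMap<Option<i64>, i64> {
    let mut merged = BTreeMap::new();
    for row in shards.iter().flatten() {
        // Use the grouping column's value as the key when we know where it is.
        let key = group_by.map(|idx| row[idx]);
        *merged.entry(key).or_insert(0) += row[count_idx];
    }
    merged
}
```

With two hypothetical shards returning `[[1, 4], [2, 3]]` and `[[1, 2], [3, 4]]` as `(user_id, count)` rows, `merge_counts(&shards, Some(0), 1)` yields counts of 6, 3, and 4 for users 1, 2, and 3, while `merge_counts(&shards, None, 1)` yields one group with a count of 13.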

codecov · 2025-12-03T17:12:13Z

Codecov Report

❌ Patch coverage is 93.11927% with 15 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
pgdog/src/frontend/router/parser/aggregate.rs	93.11%	15 Missing ⚠️


mijoharas · 2025-12-03T17:12:51Z

Oh, and this would probably want some tests if we go with something like this. 😅

levkk

Looks very reasonable. I remember building support for this actually, but I'm not entirely sure where it went. Maybe I'm wrong, it's been a few months. Would you be able to add a quick test, just to make sure the parsing logic works as you'd expect? You can put it in the same file and assert that it detected the correct column index. Cheers!

mijoharas · 2025-12-03T17:45:45Z

Ok, nice, I'll throw those in tomorrow.

mijoharas · 2025-12-04T14:08:23Z

Alright, I cleaned it up a bit, and added some tests. Should be good. There might be more tests than we actually want, so feel free to delete (a few of them cover the same ground).

levkk

Awesome!


mijoharas marked this pull request as draft December 3, 2025 17:12

levkk reviewed Dec 3, 2025

mijoharas added 6 commits December 4, 2025 11:24

Fix aggregate count to parse columns.

0588b32

Clean up aggregate count fix a bit.

355523f

Remove unnecessary clone.

2be9188

Cargo format.

937b9a5

Write tests for group by column parsing.

52c56c9

Switch to checking order in aggregate count and test.

31336bc

mijoharas force-pushed the fix-aggregate-group-by-count branch from b1fea01 to 31336bc December 4, 2025 13:55

mijoharas marked this pull request as ready for review December 4, 2025 13:55

levkk approved these changes Dec 4, 2025

levkk merged commit 345bd28 into pgdogdev:main Dec 4, 2025
8 checks passed

levkk merged 6 commits into pgdogdev:main from meetcleo:fix-aggregate-group-by-count
