
[CALCITE-3866] "numeric field overflow" when running the generated SQL in PostgreSQL #1867

Merged: 1 commit merged into apache:master on May 9, 2020

Conversation

wenhuitang (Contributor)

Pull request for https://issues.apache.org/jira/browse/CALCITE-3866
As for the aggregate function SUM, its return type is usually equal to its operand's type, but if the operand's type has a precision, the precision of the result may be larger than the operand's. So maybe we can set the precision to the maximum value when the operand's type has a precision, especially for the DECIMAL type.
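
A minimal sketch of the idea (not the actual patch; the method name inferSumType and the nullability handling are illustrative), using Calcite's type factory to widen a DECIMAL operand to the type system's maximum numeric precision:

```java
import org.apache.calcite.rel.type.RelDataType;
import org.apache.calcite.rel.type.RelDataTypeFactory;
import org.apache.calcite.sql.type.SqlTypeName;

// Illustrative only: when inferring SUM's return type, widen a DECIMAL
// operand to the type system's maximum numeric precision, so that SUM over
// a DECIMAL(p, s) column cannot overflow the declared result type.
static RelDataType inferSumType(RelDataTypeFactory typeFactory,
    RelDataType operandType) {
  if (operandType.getSqlTypeName() == SqlTypeName.DECIMAL) {
    int maxPrecision = typeFactory.getTypeSystem().getMaxNumericPrecision();
    RelDataType widened = typeFactory.createSqlType(
        SqlTypeName.DECIMAL, maxPrecision, operandType.getScale());
    return typeFactory.createTypeWithNullability(widened,
        operandType.isNullable());
  }
  return operandType; // non-decimal operands keep their own type
}
```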

XuQianJin-Stars (Contributor)

LGTM +1

hsyuan (Member) left a comment


+1

hsyuan added the LGTM-will-merge-soon label ("Overall PR looks OK. Only minor things left.") on May 4, 2020
DonnyZone (Contributor)

Hi @wenhuitang, could you please rebase and resolve the conflict? We can merge it into 1.23.0.
Moreover, I suggest simplifying the title and commit message to:
"numeric field overflow" when running the generated SQL in PostgreSQL

DonnyZone (Contributor) commented May 7, 2020

I investigated the return type of the SUM function in Hive [1] and Spark [2]. It seems they both compute the precision as Math.min(MAX_PRECISION, inputPrecision + 10). Maybe we can align with them.
@wenhuitang @XuQianJin-Stars @hsyuan WDYT?

[1] https://github.com/apache/hive/blob/54b87999fa0c23f9902faae609e7441e0693a22b/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFSum.java#L247

[2] https://github.com/apache/spark/blob/272d229005b7166ab83bbb8f44a4d5e9d89424a1/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/Sum.scala#L56

wenhuitang changed the title from "[CALCITE-3866] ReturnTypes.AGG_SUM may cause "numeric field overflow" on PostgreSQL when generate the sql after using the rule AggregateJoinTransposeRule.EXTENDED" to "[CALCITE-3866] "numeric field overflow" when running the generated SQL in PostgreSQL" on May 9, 2020

wenhuitang (Contributor, Author)

Thanks a lot. I have rebased this PR. As for computing the precision as Math.min(MAX_PRECISION, inputPrecision + 10), I did not find any theory or standard behind it. To avoid the problem completely, it would be OK to use the maximum precision.
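
For comparison, a hedged sketch of the two candidate rules discussed above (MAX_PRECISION and inputPrecision are placeholder names; 38 is the DECIMAL precision limit in Hive and Spark, whereas Calcite's default type system uses a lower maximum):

```java
// Placeholder constant; in Hive/Spark MAX_PRECISION is 38 for DECIMAL.
static final int MAX_PRECISION = 38;

// Hive/Spark rule: widen by 10 digits, capped at the maximum precision.
// The extra 10 digits leave headroom for summing roughly 10^10 rows.
static int hiveSparkSumPrecision(int inputPrecision) {
  return Math.min(MAX_PRECISION, inputPrecision + 10);
}

// Rule adopted in this PR: always use the maximum precision, which rules
// out "numeric field overflow" entirely at the cost of a wider declared type.
static int maxSumPrecision(int inputPrecision) {
  return MAX_PRECISION;
}
```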

DonnyZone (Contributor)

LGTM!

DonnyZone merged commit e081c5b into apache:master on May 9, 2020