[LIVY-754][THRIFT] Encode precision and scale for decimal type. #288

Closed · wants to merge 3 commits

Conversation

@wypoon (Contributor) commented Apr 2, 2020

What changes were proposed in this pull request?

When an org.apache.livy.thriftserver.session.DataType.DECIMAL is converted to an org.apache.hive.service.rpc.thrift.TTypeDesc for sending a Thrift response to a client request for result set metadata, the TTypeDesc contains a TPrimitiveTypeEntry(TTypeId.DECIMAL_TYPE) without TTypeQualifiers (which are needed to carry the precision and scale).
With this change, we include the qualifiers in the TPrimitiveTypeEntry. We use both the name and the DataType of a field type to construct the TTypeDesc. We can do this without changing the existing internal representation for data types, because the precision and scale can be obtained from the name of the decimal type.
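
For reference, a minimal sketch of how such qualifiers can be attached using the Hive Thrift API (an illustration under the assumptions above, not the exact patch; the decimalTypeDesc helper is hypothetical):

import java.util.{HashMap => JHashMap}
import org.apache.hive.service.rpc.thrift._

// Hypothetical helper: build a TTypeDesc for a decimal column whose
// precision and scale were parsed from a name like "decimal(9,2)".
def decimalTypeDesc(precision: Int, scale: Int): TTypeDesc = {
  // Qualifiers are a map from well-known keys to i32/string values.
  val qualifiers = new JHashMap[String, TTypeQualifierValue]()
  qualifiers.put(TCLIServiceConstants.PRECISION, TTypeQualifierValue.i32Value(precision))
  qualifiers.put(TCLIServiceConstants.SCALE, TTypeQualifierValue.i32Value(scale))

  // Attach the qualifiers to the primitive DECIMAL entry.
  val primitive = new TPrimitiveTypeEntry(TTypeId.DECIMAL_TYPE)
  primitive.setTypeQualifiers(new TTypeQualifiers(qualifiers))

  val typeDesc = new TTypeDesc()
  typeDesc.addToTypes(TTypeEntry.primitiveEntry(primitive))
  typeDesc
}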

How was this patch tested?

Used beeline to connect to the Thrift server and ran a select from a table with a column of decimal type.
Also extended an existing integration test; see the sketch below.
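
For illustration, a rough sketch of the kind of check such a test can make through JDBC metadata (the URL, table name, and the decimal(9,2) column are placeholder assumptions, not taken from the patch):

import java.sql.DriverManager

// Placeholder URL; a real test would point at the Livy Thrift server under test.
val conn = DriverManager.getConnection("jdbc:hive2://localhost:10090/default")
try {
  // Assumes a table with a first column declared decimal(9,2).
  val rs = conn.createStatement().executeQuery("SELECT amount FROM decimals_test")
  val md = rs.getMetaData
  // With the qualifiers encoded, the client sees the declared precision
  // and scale rather than defaults.
  assert(md.getColumnTypeName(1) == "decimal")
  assert(md.getPrecision(1) == 9)
  assert(md.getScale(1) == 2)
} finally {
  conn.close()
}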

@wypoon changed the title from "[LIVY-754][THRIFT] Encode precision and decimal for decimal type." to "[LIVY-754][THRIFT] Encode precision and scale for decimal type." on Apr 2, 2020
@codecov-io commented Apr 2, 2020

Codecov Report

Merging #288 into master will increase coverage by 0.06%.
The diff coverage is n/a.


@@             Coverage Diff              @@
##             master     #288      +/-   ##
============================================
+ Coverage     68.19%   68.26%   +0.06%     
- Complexity      964      965       +1     
============================================
  Files           104      104              
  Lines          5952     5952              
  Branches        900      900              
============================================
+ Hits           4059     4063       +4     
+ Misses         1314     1310       -4     
  Partials        579      579              
Impacted Files                                           Coverage Δ               Complexity Δ
.../scala/org/apache/livy/sessions/SessionState.scala   61.11% <0.00%> (ø)        2.00% <0.00%> (ø%)
...ain/scala/org/apache/livy/utils/SparkYarnApp.scala    75.00% <0.00%> (+1.25%)   40.00% <0.00%> (ø%)
...in/java/org/apache/livy/rsc/driver/JobWrapper.java    88.57% <0.00%> (+5.71%)   9.00% <0.00%> (+1.00%)

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update ee7fdfc...6f5b968.

@andrasbeni (Contributor) left a comment

@wypoon thanks for the fix. The change in general looks good to me. However, I have a few comments you may want to consider.

// name can be one of
// 1. decimal
// 2. decimal(p)
// 3. decimal(p, s)
@andrasbeni (Contributor) commented on the snippet above:

Are decimal and decimal(p) actually possible? I understand these forms can be used to declare the type, but based on org.apache.spark.sql.types.DecimalType, I don't think the JSON omits scale or precision.

I might be wrong here. If so, then I believe the parsing logic should be tested for decimal and decimal(p) as well.
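
For example, a minimal spark-shell sketch with a hypothetical decimal(9,2) column shows both values serialized in the schema JSON:

import org.apache.spark.sql.types._

// A one-field schema with an explicit precision and scale.
val schema = StructType(Seq(StructField("amount", DecimalType(9, 2))))
println(schema.json)
// {"type":"struct","fields":[{"name":"amount","type":"decimal(9,2)","nullable":true,"metadata":{}}]}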

@wypoon (Contributor, Author) replied:

In the Hive that I used, I do not actually encounter decimal or decimal(p). I defined a Hive table with columns of each of those variants, and a "desc table" reports them as decimal(10,0) and decimal(p,0) respectively. So the JSON that Spark generates, which DataTypeUtils.schemaFromSparkJson consumes, only contains the third variant.
Nevertheless, I decided to handle all 3 variants purely as a defensive measure. It may be redundant, but it doesn't hurt.

@wypoon (Contributor, Author) commented Apr 2, 2020:

scala> def f(name: String): (Int, Int) = {
     |       if (name == "decimal") {
     |         (10, 0)
     |       } else {
     |         val suffix = name.substring(7)
     |         require(suffix.startsWith("(") && suffix.endsWith(")"),
     |           name + " is not of the form decimal(<precision>,<scale>)")
     |         val parts = suffix.substring(1, suffix.length - 1).split(",")
     |         if (parts.length == 1) {
     |           (parts(0).trim.toInt, 0)
     |         } else {
     |           (parts(0).trim.toInt, parts(1).trim.toInt)
     |         }
     |       }
     | }
f: (name: String)(Int, Int)

scala> f("decimal")
res0: (Int, Int) = (10,0)

scala> f("decimal(7)")
res1: (Int, Int) = (7,0)

scala> f("decimal(9, 2)")
res2: (Int, Int) = (9,2)

scala> f("decimal_type")
java.lang.IllegalArgumentException: requirement failed: decimal_type is not of the form decimal(<precision>,<scale>)
  at scala.Predef$.require(Predef.scala:224)
  at f(<console>:28)
  ... 49 elided

@wypoon (Contributor, Author) added:

I feel that it is overkill to write a unit test just for that block of code. The above suffices.

@wypoon commented Apr 3, 2020

@mgaido91 @jerryshao can you please review?

@mgaido91 (Contributor) left a comment

The change seems fine to me, just a minor style comment. Could you please add tests for all the possible cases, though? Thanks.

@wypoon commented Apr 6, 2020

Added a couple more cases to the integration test.

@mgaido91 (Contributor) left a comment

LGTM

@wypoon commented May 20, 2020

@mgaido91 can you please merge this (since you have already approved it)?
I thought it was already merged, but it appears that it isn't.

@mgaido91 closed this in 3b9bbef on May 20, 2020