[SPARK-17018][SQL] literals.sql for testing literal parsing #14598

petermaxlee · 2016-08-11T07:58:02Z

What changes were proposed in this pull request?

This patch adds literals.sql for testing literal parsing end-to-end in SQL.

How was this patch tested?

The patch itself is only about adding test cases.

petermaxlee · 2016-08-11T07:58:57Z

sql/core/src/test/resources/sql-tests/results/literals.sql.out

+
+
+-- !query 5
+select 32768 S


Shouldn't this throw an exception?

cc @cloud-fan / @rxin / @hvanhovell

Why? This will create Integer literal 32768 aliased as S. select 32768S does throw an exception.

petermaxlee · 2016-08-11T08:09:31Z

sql/core/src/test/resources/sql-tests/results/literals.sql.out

+
+
+-- !query 13
+select 1D, 1 D, 1.2D, 1e10, 1.5e5, .10 D, 0.10 D, .1e5


there is a bug here too.

I would expect .10 D to be parsed as double, not decimal.

D is a double literal (like in Hive). So this checks out.

petermaxlee · 2016-08-11T08:17:20Z

sql/core/src/test/resources/sql-tests/results/literals.sql.out

+-- !query 7 schema
+struct<>
+-- !query 7 output
+org.apache.spark.sql.catalyst.parser.ParseException


This exception message can be better. It doesn't actually say out of range.

Hmmmm - this is funny. The exception/message is produced by java.lang.Long.parseLong(...), but that doesn't seem to produce something sensible. I was expecting the something similar to the error java.lang.Short.parseShort(...) produces.

Should we parse integral literals as BigInteger, and then turn them into appropriate types? That way we have more control.

We do that already for 'untyped' (without a suffix) integral literals. I like your suggestion (this means we can also control the exception for Short better), could you open an PR for this?

Sure. Will do.

Nevermind this one. We have someone working on it.

SparkQA · 2016-08-11T09:58:33Z

Test build #63593 has finished for PR 14598 at commit 15d2eec.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-08-11T10:02:38Z

Test build #63596 has finished for PR 14598 at commit c40c957.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-08-11T10:05:56Z

Test build #63598 has finished for PR 14598 at commit d60b2bb.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-08-11T10:14:48Z

Test build #63597 has finished for PR 14598 at commit 243cd39.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

hvanhovell · 2016-08-11T13:45:28Z

sql/core/src/test/resources/sql-tests/inputs/literals.sql

+select 1234567890123456789012345678901234567890.0;
+
+-- super large scientific notation numbers should still be valid doubles
+select 123456789012345678901234567890123456789e10, 123456789012345678901234567890123456789.1e10;


Shouldn't we also add really large double, 1E309 for instance (that will actually evaluate to positive infinity).

Let me add that.

hvanhovell · 2016-08-11T13:54:00Z

This is pretty cool :)

I am comparing this to the ExpressionParserSuite. Shouldn't we add support for more complex string cases, intervals and a type constructors, see: https://github.com/apache/spark/blob/master/sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/parser/ExpressionParserSuite.scala#L331-L498. If we do we can move these tests out of the ExpressionParserSuite.

petermaxlee · 2016-08-11T18:59:14Z

I have updated this to include more string literals and added timestmap/date/interval parsing. That said, I didn't add all the test cases for interval because there were a large number, and I felt those are best left for parser unit tests.

petermaxlee · 2016-08-11T19:02:34Z

I also didn't include \b and \0 parsing. Otherwise github shows the result file as binary and refuse to display the diff, which makes it more difficult to review.

hvanhovell · 2016-08-11T20:33:11Z

sql/core/src/test/resources/sql-tests/inputs/literals.sql

+-- invalid timestamp
+select timestamp '2016-33-11 20:54:00.000';
+
+-- internal


NIT: interval? :)

SparkQA · 2016-08-11T20:48:15Z

Test build #63625 has finished for PR 14598 at commit 00bc63a.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-08-11T20:51:15Z

Test build #63626 has finished for PR 14598 at commit 5565144.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

rxin · 2016-08-11T20:54:41Z

I'm going to merge this in master/2.0.

## What changes were proposed in this pull request? This patch adds literals.sql for testing literal parsing end-to-end in SQL. ## How was this patch tested? The patch itself is only about adding test cases. Author: petermaxlee <petermaxlee@gmail.com> Closes #14598 from petermaxlee/SPARK-17018-2. (cherry picked from commit cf93678) Signed-off-by: Reynold Xin <rxin@databricks.com>

[SPARK-17018][SQL] literals.sql for testing literal parsing

15d2eec

petermaxlee reviewed Aug 11, 2016
View reviewed changes

Add more floating point literals

c40c957

petermaxlee reviewed Aug 11, 2016
View reviewed changes

remove spaces

243cd39

petermaxlee reviewed Aug 11, 2016
View reviewed changes

Large decimals

d60b2bb

petermaxlee mentioned this pull request Aug 11, 2016

[SPARK-17013][SQL] handle corner case for negative integral literal #14599

Closed

hvanhovell reviewed Aug 11, 2016
View reviewed changes

petermaxlee added 2 commits August 11, 2016 10:56

Merge branch 'master' into SPARK-17018-2

3f0b006

Add more literals

00bc63a

petermaxlee added 2 commits August 11, 2016 12:00

remove \b

bd72b4e

Remove \0

5565144

hvanhovell reviewed Aug 11, 2016
View reviewed changes

Fix typo

457d8da

asfgit closed this in cf93678 Aug 11, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-17018][SQL] literals.sql for testing literal parsing #14598

[SPARK-17018][SQL] literals.sql for testing literal parsing #14598

petermaxlee commented Aug 11, 2016

petermaxlee Aug 11, 2016

hvanhovell Aug 11, 2016

petermaxlee Aug 11, 2016

hvanhovell Aug 11, 2016 •

edited

Loading

petermaxlee Aug 11, 2016

hvanhovell Aug 11, 2016

petermaxlee Aug 11, 2016

hvanhovell Aug 11, 2016

petermaxlee Aug 11, 2016

hvanhovell Aug 11, 2016

SparkQA commented Aug 11, 2016

SparkQA commented Aug 11, 2016

SparkQA commented Aug 11, 2016

SparkQA commented Aug 11, 2016

hvanhovell Aug 11, 2016

petermaxlee Aug 11, 2016

hvanhovell commented Aug 11, 2016

petermaxlee commented Aug 11, 2016

petermaxlee commented Aug 11, 2016

hvanhovell Aug 11, 2016

SparkQA commented Aug 11, 2016

SparkQA commented Aug 11, 2016

rxin commented Aug 11, 2016



		-- !query 13
		select 1D, 1 D, 1.2D, 1e10, 1.5e5, .10 D, 0.10 D, .1e5

[SPARK-17018][SQL] literals.sql for testing literal parsing #14598

[SPARK-17018][SQL] literals.sql for testing literal parsing #14598

Conversation

petermaxlee commented Aug 11, 2016

What changes were proposed in this pull request?

How was this patch tested?

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hvanhovell Aug 11, 2016 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

SparkQA commented Aug 11, 2016

SparkQA commented Aug 11, 2016

SparkQA commented Aug 11, 2016

SparkQA commented Aug 11, 2016

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hvanhovell commented Aug 11, 2016

petermaxlee commented Aug 11, 2016

petermaxlee commented Aug 11, 2016

Choose a reason for hiding this comment

SparkQA commented Aug 11, 2016

SparkQA commented Aug 11, 2016

rxin commented Aug 11, 2016

hvanhovell Aug 11, 2016 •

edited

Loading