[SPARK-29034][SQL] PostgreSQL dialect conformance for string constants with C-style escapes #26715

yaooqinn · 2019-11-29T13:06:34Z

What changes were proposed in this pull request?

On one hand, we now use spark.sql.parser.escapedStringLiterals to control whether to escape string literal or not.
On the other hand, we use spark.sql.dialect to choose spark or PostgreSQL dialect. When we use the PostgreSQL dialect, we should obey the C-style escape behavior of PostgreSQL.

Supported

An escape string constant is specified by writing the letter E (upper or lower case) just before the opening single quote, e.g., E'foo'.

Not supported

When continuing an escape string constant across lines, write E only before the first opening quote.

Because PostgreSQL follows the SQL standard that is

Two string constants that are only separated by whitespace with at least one newline are concatenated and effectively treated as if the string had been written as one constant.

, which is hard to follow in Spark's Parser.

Why are the changes needed?

PostgreSQL dialect conformance

Does this PR introduce any user-facing change?

yes, when we use the PostgreSQL dialect, we use 'E' to define an escape string constant

How was this patch tested?

add ut.

… with C-style escapes

yaooqinn · 2019-11-29T13:32:17Z

retest this please

SparkQA · 2019-11-29T17:16:12Z

Test build #114629 has finished for PR 26715 at commit 7ae10e9.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2019-11-29T22:46:29Z

Test build #114634 has finished for PR 26715 at commit db93221.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

yaooqinn · 2019-12-18T10:18:33Z

closed because pg dialect removed

yaooqinn added 2 commits November 29, 2019 20:45

[SPARK-29034][SQL] PostgreSQL dialect conformace for string constants…

d0a478f

… with C-style escapes

nit

7ae10e9

import

8431fb0

mv tests

db93221

dongjoon-hyun added the SQL label Dec 5, 2019

yaooqinn closed this Dec 18, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-29034][SQL] PostgreSQL dialect conformance for string constants with C-style escapes #26715

[SPARK-29034][SQL] PostgreSQL dialect conformance for string constants with C-style escapes #26715

yaooqinn commented Nov 29, 2019 •

edited

yaooqinn commented Nov 29, 2019

SparkQA commented Nov 29, 2019

SparkQA commented Nov 29, 2019

yaooqinn commented Dec 18, 2019

[SPARK-29034][SQL] PostgreSQL dialect conformance for string constants with C-style escapes #26715

[SPARK-29034][SQL] PostgreSQL dialect conformance for string constants with C-style escapes #26715

Conversation

yaooqinn commented Nov 29, 2019 • edited

What changes were proposed in this pull request?

Supported

Not supported

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

yaooqinn commented Nov 29, 2019

SparkQA commented Nov 29, 2019

SparkQA commented Nov 29, 2019

yaooqinn commented Dec 18, 2019

yaooqinn commented Nov 29, 2019 •

edited