Skip to content

[SPARK-55991] Fix unicode related SQL text corruption with parameters#54798

Closed
srielau wants to merge 2 commits intoapache:masterfrom
srielau:emoji
Closed

[SPARK-55991] Fix unicode related SQL text corruption with parameters#54798
srielau wants to merge 2 commits intoapache:masterfrom
srielau:emoji

Conversation

@srielau
Copy link
Contributor

@srielau srielau commented Mar 14, 2026

What changes were proposed in this pull request?

Fix parameter substitution code to be mindful of unicode supplemental characters

Why are the changes needed?

Emojies (and other special characters) cause corruption of the SQL text if parameter markers are substiution due to offset issues. codepoint vs character

Does this PR introduce any user-facing change?

No

How was this patch tested?

Wrote new testcases

Was this patch authored or co-authored using generative AI tooling?

YEs Claude Opus 4.6 high

@cloud-fan cloud-fan closed this in 4d79768 Mar 14, 2026
cloud-fan pushed a commit that referenced this pull request Mar 14, 2026
### What changes were proposed in this pull request?

Fix parameter substitution code to be mindful of unicode supplemental characters

### Why are the changes needed?

Emojies (and other special characters) cause corruption of the SQL text if parameter markers are substiution due to offset issues. codepoint vs character

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

Wrote new testcases

### Was this patch authored or co-authored using generative AI tooling?

YEs Claude Opus 4.6 high

Closes #54798 from srielau/emoji.

Authored-by: Serge Rielau <serge@rielau.com>
Signed-off-by: Wenchen Fan <wenchen@databricks.com>
(cherry picked from commit 4d79768)
Signed-off-by: Wenchen Fan <wenchen@databricks.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants