[SPARK-54289][SQL][FOLLOW-UP] Make Merge Into update assignment by field default for UPDATE SET * and align configs #53199

szehon-ho · 2025-11-24T21:16:34Z

What changes were proposed in this pull request?

Follow up of: #53149

Make the update assignment by field the Spark 4.1 behavior. For context, the case to allow assignment key and value to be different struct for MERGE INTO is new in Spark 4.1 so we have a chance to define the behavior. In Spark, nested fields are usually treated as top level column so it should follow the behavior: see #53149 (comment)

Why are the changes needed?

See above

Does this PR introduce any user-facing change?

No, this feature is unreleased (allowing assignment source to be of different struct type as target)

How was this patch tested?

Existing unit test

Was this patch authored or co-authored using generative AI tooling?

No

dongjoon-hyun

+1, LGTM.
Merged to master/4.1.

…eld default for UPDATE SET * and align configs ### What changes were proposed in this pull request? Follow up of: #53149 1. Make the update assignment by field the Spark 4.1 behavior. For context, the case to allow assignment key and value to be different struct for MERGE INTO is new in Spark 4.1 so we have a chance to define the behavior. In Spark, nested fields are usually treated as top level column so it should follow the behavior: see #53149 (comment) 2. Rename existing config to control the struct type compatibility check in assignment. We do not need to mention 'source' as actually the assignment can be to anything, not necessarily to source table. ### Why are the changes needed? See above ### Does this PR introduce _any_ user-facing change? No, this feature is unreleased (allowing assignment source to be of different struct type as target) ### How was this patch tested? Existing unit test ### Was this patch authored or co-authored using generative AI tooling? No Closes #53199 from szehon-ho/merge_schema_evolution_update_nested_follow. Authored-by: Szehon Ho <szehon.apache@gmail.com> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org> (cherry picked from commit 9846dd8) Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>

dongjoon-hyun · 2025-11-24T23:45:06Z

Thank you, @szehon-ho .

cloud-fan · 2025-11-25T01:33:52Z

late LGTM

dongjoon-hyun · 2025-11-25T01:39:53Z

Thank you, @cloud-fan .

…eld default for UPDATE SET * and align configs ### What changes were proposed in this pull request? Follow up of: apache#53149 1. Make the update assignment by field the Spark 4.1 behavior. For context, the case to allow assignment key and value to be different struct for MERGE INTO is new in Spark 4.1 so we have a chance to define the behavior. In Spark, nested fields are usually treated as top level column so it should follow the behavior: see apache#53149 (comment) 2. Rename existing config to control the struct type compatibility check in assignment. We do not need to mention 'source' as actually the assignment can be to anything, not necessarily to source table. ### Why are the changes needed? See above ### Does this PR introduce _any_ user-facing change? No, this feature is unreleased (allowing assignment source to be of different struct type as target) ### How was this patch tested? Existing unit test ### Was this patch authored or co-authored using generative AI tooling? No Closes apache#53199 from szehon-ho/merge_schema_evolution_update_nested_follow. Authored-by: Szehon Ho <szehon.apache@gmail.com> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>

Remove config

a38c520

github-actions bot added the SQL label Nov 24, 2025

szehon-ho mentioned this pull request Nov 24, 2025

[SPARK-54289][SQL] Allow MERGE INTO to preserve existing struct fields for UPDATE SET * when source struct has less nested fields than target struct #53149

Closed

dongjoon-hyun approved these changes Nov 24, 2025

View reviewed changes

dongjoon-hyun closed this in 9846dd8 Nov 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-54289][SQL][FOLLOW-UP] Make Merge Into update assignment by field default for UPDATE SET * and align configs #53199

[SPARK-54289][SQL][FOLLOW-UP] Make Merge Into update assignment by field default for UPDATE SET * and align configs #53199

szehon-ho commented Nov 24, 2025 •

edited

Loading

Uh oh!

dongjoon-hyun left a comment

Uh oh!

dongjoon-hyun commented Nov 24, 2025

Uh oh!

cloud-fan commented Nov 25, 2025

Uh oh!

dongjoon-hyun commented Nov 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[SPARK-54289][SQL][FOLLOW-UP] Make Merge Into update assignment by field default for UPDATE SET * and align configs #53199

[SPARK-54289][SQL][FOLLOW-UP] Make Merge Into update assignment by field default for UPDATE SET * and align configs #53199

Conversation

szehon-ho commented Nov 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

dongjoon-hyun left a comment

Choose a reason for hiding this comment

Uh oh!

dongjoon-hyun commented Nov 24, 2025

Uh oh!

cloud-fan commented Nov 25, 2025

Uh oh!

dongjoon-hyun commented Nov 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

szehon-ho commented Nov 24, 2025 •

edited

Loading