[SPARK-55848][SQL] Fix incorrect dedup results with SPJ partial clustering by naveenp2708 · Pull Request #54679 · apache/spark

naveenp2708 · 2026-03-08T06:41:42Z

What changes were proposed in this pull request?

This PR fixes a data correctness bug where SPJ with partial clustering produces incorrect results for post-join dedup operations (dropDuplicates and Window-based row_number dedup).

The root cause: KeyGroupedPartitioning.satisfies0() delegates to super.satisfies0() (from HashPartitioningLike), which also matches ClusteredDistribution and returns true — short-circuiting the isPartiallyClustered guard. This means EnsureRequirements never inserts an Exchange before downstream dedup operators when partial clustering is active.

The fix adds an isPartiallyClustered flag to KeyGroupedPartitioning and restructures satisfies0() to check ClusteredDistribution first, returning false when partially clustered. EnsureRequirements then automatically inserts the necessary Exchange.

Why are the changes needed?

Without this fix, any Spark user running SPJ with partial clustering enabled who applies dropDuplicates() or Window-based dedup on the join output gets silently inflated results. This affects production pipelines using Iceberg, Delta Lake, or any DataSource V2 connector with bucketed tables.

Does this PR introduce any user-facing change?

Yes. Queries using SPJ with partial clustering followed by dedup operations will now return correct results. An additional shuffle may be introduced for these specific query patterns. Queries without post-join dedup are unaffected.

How was this patch tested?

New tests in KeyGroupedPartitioningSuite: SPARK-54378 dropDuplicates, SPARK-54378 Window dedup, SPARK-55848 dropDuplicates
All 69 tests in KeyGroupedPartitioningSuite pass
All 26 tests in EnsureRequirementsSuite pass
All 98 tests in DistributionAndOrderingSuite pass

Was this patch authored or co-authored using generative AI tooling?

No.

…Distribution

naveenp2708 · 2026-03-08T06:44:15Z

@peter-toth @szehon-ho @cloud-fan @chirag-s-db

This PR addresses the correctness issue discussed in #54378. Test cases included as requested by @peter-toth.

peter-toth · 2026-03-08T06:50:53Z

Can you please revert formatting changes? Now this PR shows +2,876 -1,961 lines of diff, but most of them seems unnecessary.

…ering When SPJ partial clustering splits a partition across multiple tasks, post-join dedup operators (dropDuplicates, Window row_number) produce incorrect results because KeyGroupedPartitioning.satisfies0() incorrectly reports satisfaction of ClusteredDistribution via super.satisfies0() short-circuiting the isPartiallyClustered guard. This fix adds an isPartiallyClustered flag to KeyGroupedPartitioning and restructures satisfies0() to check ClusteredDistribution first, returning false when partially clustered. EnsureRequirements then inserts the necessary Exchange. Plain SPJ joins without dedup are unaffected. Closes apache#54378

naveenp2708 · 2026-03-08T07:09:26Z

@peter-toth Done — rebased and cleaned up the formatting changes. The diff should now show only the actual fix (+192 -35).

peter-toth · 2026-03-08T07:20:15Z

@peter-toth Done — rebased and cleaned up the formatting changes. The diff should now show only the actual fix (+192 -35).

Thank you. Let me review this later today or tomorrow morning.

peter-toth

Can you please add new test case similar to the existing SPARK-53322: checkpointed scans avoid shuffles for aggregates, because after this fix there should be a shuffle added when a checkpointed KeyGroupedPartitioning partitioning is isPartiallyClustered, but we have a node above the checkpoint that requires ClusteredDistribution.

peter-toth · 2026-03-09T08:57:15Z

    }
  }

+  test("[SPARK-54378] dropDuplicates after SPJ with partial clustering should give correct " +


Can you please start the new test names with SPARK-55848:

peter-toth · 2026-03-09T08:58:52Z

+
+    Seq(true, false).foreach { partiallyClustered =>
+      withSQLConf(
+          SQLConf.REQUIRE_ALL_CLUSTER_KEYS_FOR_CO_PARTITION.key -> false.toString,


Do we need to turn REQUIRE_ALL_CLUSTER_KEYS_FOR_CO_PARTITION off for this test?

V2_BUCKETING_PUSH_PART_VALUES_ENABLED is enabled by default so we can omit it.

peter-toth · 2026-03-09T09:00:06Z

+             |FROM testcat.ns.$items i
+             |JOIN testcat.ns.$purchases p ON i.id = p.item_id
+             |""".stripMargin)
+        checkAnswer(df, Seq(Row(1), Row(2), Row(3)))


Can you please check the presence of shuffles and the number of partitons of scans are the expected?

peter-toth · 2026-03-09T09:00:35Z

+    }
+  }
+
+  test("[SPARK-54378] Window dedup after SPJ with partial clustering should give correct " +


peter-toth · 2026-03-09T09:00:54Z

+    }
+  }
+
+  test("SPARK-55848: dropDuplicates after SPJ with partial clustering should produce " +


Is this test different to the first one?

peter-toth · 2026-03-09T09:44:19Z

+    // requires all rows with the same key to be co-located in a single task. Without this
+    // guard, downstream operators such as dropDuplicates or Window functions would skip
+    // their required shuffle and produce incorrect results.
+    // See SPARK-54378 / SPARK-55848.


Is SPARK-54378 related to this issue?

peter-toth · 2026-03-09T20:06:02Z

@naveenp2708, I've merged #54330 to master which causes lots of conflicts in this PR.
I would suggest splitting this PR into 2 PRs. The first one should target master and add the regression tests, the second one should target branch-4.1 and add both the fix and the tests as well.

naveenp2708 · 2026-03-10T03:52:17Z

Thanks for the thorough review @peter-toth! I'll address all feedback and split into two PRs:

PR 1 (targeting master):

Tests only (regression tests for the fix in [SPARK-55535][SPARK-55092][SQL] Refactor KeyGroupedPartitioning and Storage Partition Join #54330)
Rename tests to SPARK-55848:
Remove duplicate test
Remove unnecessary SQLConf settings
Add plan assertions (shuffle presence + partition counts)
Add checkpointed scan test

PR 2 (targeting branch-4.1):

Fix (isPartiallyClustered) + all tests

Closing this PR and opening the new ones shortly.

naveenp2708 added 3 commits February 28, 2026 15:18

[SPARK-54378] Fix incorrect dedup results with SPJ partiallyClustered…

5de557f

…Distribution

Merge branch 'apache:master' into master

a308ad6

Merge branch 'apache:master' into master

fd1c96e

naveenp2708 changed the title ~~Fix/spj partial clustering dedup~~ [SPARK-55848][SQL] Fix incorrect dedup results with SPJ partial clustering Mar 8, 2026

peter-toth mentioned this pull request Mar 8, 2026

[SPARK-55535][SPARK-55092][SQL] Refactor KeyGroupedPartitioning and Storage Partition Join #54330

Closed

naveenp2708 force-pushed the fix/spj-partial-clustering-dedup branch from 5368d8f to 4f9752e Compare March 8, 2026 07:04

peter-toth requested changes Mar 9, 2026

View reviewed changes

naveenp2708 requested a review from peter-toth March 10, 2026 00:13

naveenp2708 closed this Mar 10, 2026

Conversation

naveenp2708 commented Mar 8, 2026

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

naveenp2708 commented Mar 8, 2026

Uh oh!

peter-toth commented Mar 8, 2026

Uh oh!

naveenp2708 commented Mar 8, 2026

Uh oh!

peter-toth commented Mar 8, 2026

Uh oh!

peter-toth left a comment

Choose a reason for hiding this comment

Uh oh!

peter-toth Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

peter-toth Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

peter-toth Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

peter-toth Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

peter-toth Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

peter-toth Mar 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

peter-toth commented Mar 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

naveenp2708 commented Mar 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

peter-toth Mar 9, 2026 •

edited

Loading

peter-toth commented Mar 9, 2026 •

edited

Loading