[SPARK-52353][SQL] Fix bug with wrong constraints in LogicalRDDs referencing previous iterations #51056

Pajaraja
Contributor

What changes were proposed in this pull request?

Remove constraints and statistics from LogicalRDDs used to store results of previous iterations.

Why are the changes needed?

In some cases, the LogicalRDDs used to memoize the results of the previous iteration carry incorrect constraints and statistics, so the optimizer prunes the filter, which leads to infinite recursion. This change fixes the bug by removing those constraints and statistics.
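
The failure mode can be illustrated with a toy model in plain Python. This is only a sketch and is not Spark code: `Relation`, `step`, `run`, the string-valued constraint `"n < 3"`, and the `MAX_ITERATIONS` cap are all hypothetical names invented for illustration, and the way the stale constraint gets attached here is an assumption rather than a description of Spark's internals. The sketch shows how a constraint that no longer holds for the stored rows lets a filter be pruned, so the loop never reaches an empty result.

```python
from dataclasses import dataclass, field

MAX_ITERATIONS = 20  # safety cap so the buggy variant stops in this sketch


@dataclass
class Relation:
    """Toy stand-in for a stored previous-iteration result (not Spark's LogicalRDD)."""
    rows: list
    # Predicates the toy optimizer is told already hold for every row.
    constraints: set = field(default_factory=set)


def step(prev, keep_constraints):
    """One recursive step, modelling: SELECT n + 1 FROM prev WHERE n < 3."""
    # Toy filter pruning: skip the filter if the input claims to satisfy it.
    filter_pruned = "n < 3" in prev.constraints
    rows = prev.rows if filter_pruned else [n for n in prev.rows if n < 3]
    out = [n + 1 for n in rows]
    # "n < 3" held for the filtered input rows, not for the output n + 1.
    # Carrying it forward is the (assumed) source of the stale constraint.
    carried = {"n < 3"} if keep_constraints else set()
    return Relation(out, carried)


def run(keep_constraints):
    """Iterate until a step produces no rows; return the number of steps taken."""
    current = Relation([1])
    iterations = 0
    while current.rows and iterations < MAX_ITERATIONS:
        current = step(current, keep_constraints)
        iterations += 1
    return iterations
```

In this model, `run(keep_constraints=False)` stops after 3 steps because the filter eventually removes every row, while `run(keep_constraints=True)` only stops at the `MAX_ITERATIONS` cap of 20, since the filter is pruned from the second step onward.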

Does this PR introduce any user-facing change?

No.

How was this patch tested?

The failing test case was added as a golden file test. The test requires disabling the optimization that converts small results into a LocalRelation.

Was this patch authored or co-authored using generative AI tooling?

No.

github-actions bot added the SQL label May 30, 2025
HyukjinKwon changed the title from "[SPARK-52353] Fix bug with wrong constraints in LogicalRDDs referencing previous iterations" to "[SPARK-52353][SQL] Fix bug with wrong constraints in LogicalRDDs referencing previous iterations" Jun 1, 2025
Pajaraja closed this Jun 6, 2025