-
Notifications
You must be signed in to change notification settings - Fork 28.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-32838][SQL]Check DataSource insert command path with actual path #30057
Conversation
Test build #129842 has finished for PR 30057 at commit
|
Kubernetes integration test starting |
Kubernetes integration test starting |
Kubernetes integration test status success |
Kubernetes integration test status success |
Kubernetes integration test starting |
Kubernetes integration test status success |
Test build #129873 has finished for PR 30057 at commit
|
Kubernetes integration test starting |
Kubernetes integration test status success |
Test build #129875 has finished for PR 30057 at commit
|
Kubernetes integration test starting |
Kubernetes integration test status success |
Test build #129935 has finished for PR 30057 at commit
|
throw new AnalysisException( | ||
s"Cannot overwrite a path that is also being read from.") | ||
} | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If we don't care about dynamicPartitionFilter, we can just add a check rule about this issue on SparkPlan using selectedpartition
.
retest this please |
Kubernetes integration test starting |
Kubernetes integration test status success |
Test build #129943 has finished for PR 30057 at commit
|
ping @dongjoon-hyun @cloud-fan |
cc @viirya |
Can you describe clearly in the PR description what bug this fixes? The PR description is important for reviewers to understand the issue quickly. Please describe the issue the PR tries to fix in details. Thanks. |
Test build #136640 has finished for PR 30057 at commit
|
Kubernetes integration test unable to build dist. exiting with code: 1 |
retest this please |
Test build #136710 has finished for PR 30057 at commit
|
Kubernetes integration test unable to build dist. exiting with code: 1 |
Kubernetes integration test starting |
Kubernetes integration test status failure |
Test build #136751 has finished for PR 30057 at commit
|
retest this please |
Kubernetes integration test unable to build dist. exiting with code: 1 |
Test build #136849 has finished for PR 30057 at commit
|
Test build #751743803 for PR 30057 at commit |
Kubernetes integration test starting |
Kubernetes integration test status failure |
Test build #137415 has finished for PR 30057 at commit
|
Gentle ping |
We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable. |
gentle ping @cloud-fan @viirya |
retest this please |
Kubernetes integration test starting |
Kubernetes integration test status success |
Test build #142666 has finished for PR 30057 at commit
|
@AngersZhuuuu any update about this, thanks! |
@AngersZhuuuu @cloud-fan We are facing this same issue with Spark 3.2. can we get this PR merged if there are no concerns? |
What changes were proposed in this pull request?
Currently, we verify path in
DataSourceAnalysis
For partition table, we can know that both
outputPath
and collectedinputPaths
are table's , not partition's path.Actually, when we overwrite table A's partition
a1
select from partitiona2
, then path won't conflict. But it compare both use table's path, then throw AnalysisException.So we need to check this after Optimizer(got partition pushed down), If we don't care about
dynamicPartitionFilter
, we can just add a check rule about this issue on SparkPlan usingFileSourceScanExec.selectedPartitions
.Why are the changes needed?
Fix bug
Does this PR introduce any user-facing change?
No
How was this patch tested?
UT