[SPARK-21272] SortMergeJoin LeftAnti does not update numOutputRows #18494
Would it be possible to add a test case to SQLMetricsSuite?
cc @hvanhovell
Yeah, it's better to add a test to SQLMetricsSuite for this.
ok to test
Test build #79047 has finished for PR 18494 at commit
add to whitelist
Test build #79069 has finished for PR 18494 at commit
Unfortunately, the comments in the test cases of
test("SortMergeJoin(left-anti) metrics when right table is empty") {
  val testDataForJoin = testData2.filter("a < 0") // Empty but not optimized to EmptyRDD
  withTempView("testDataForJoin") {
    testDataForJoin.createOrReplaceTempView("testDataForJoin")
    val df = spark.sql(
      "SELECT * FROM testData2 ANTI JOIN testDataForJoin ON testData2.a = testDataForJoin.a")
    testSparkPlanMetrics(df, 1, Map(
      0L -> ("SortMergeJoin", Map("number of output rows" -> 6L)))
    )
  }
}
Test build #79467 has finished for PR 18494 at commit
LGTM
Thanks! Merging to master/2.2.
## What changes were proposed in this pull request?

Updating numOutputRows metric was missing from one return path of LeftAnti SortMergeJoin.

## How was this patch tested?

Non-zero output rows manually seen in metrics.

Author: Juliusz Sompolski <julek@databricks.com>

Closes #18494 from juliuszsompolski/SPARK-21272.
What changes were proposed in this pull request?
The numOutputRows metric was not being updated on one return path of the LeftAnti SortMergeJoin; this change adds the missing update.
How was this patch tested?
Manually verified that the metrics now show a non-zero number of output rows.
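To illustrate the class of bug this pull request describes, here is a minimal, self-contained sketch of a left anti join in which a row can be emitted from two separate code paths, each of which has to bump the output-row counter. Everything in it is hypothetical: `LeftAntiMetricSketch`, `Counter`, and `leftAnti` are illustration-only names, the matching is done with an in-memory `groupBy` rather than a sort-merge scan, and none of it is Spark's actual SortMergeJoin code.

```scala
object LeftAntiMetricSketch {
  // Hypothetical stand-in for a metric; not Spark's SQLMetric class.
  final class Counter {
    var value: Long = 0L
    def add(n: Long): Unit = value += n
  }

  // Left anti join on the first tuple element: keep a left row only if no
  // right row has the same key AND satisfies the extra join condition.
  def leftAnti(
      left: Seq[(Int, Int)],
      right: Seq[(Int, Int)],
      condition: ((Int, Int), (Int, Int)) => Boolean,
      numOutputRows: Counter): Seq[(Int, Int)] = {
    val rightByKey = right.groupBy(_._1)
    left.filter { l =>
      rightByKey.get(l._1) match {
        case None =>
          // Emit path 1: no right rows share the key at all.
          // Forgetting the counter update here leaves the metric too low.
          numOutputRows.add(1)
          true
        case Some(candidates) if !candidates.exists(r => condition(l, r)) =>
          // Emit path 2: keys match, but the extra condition rejects every candidate.
          numOutputRows.add(1)
          true
        case _ =>
          // A matching right row exists, so the left row is dropped.
          false
      }
    }
  }
}
```

With six left rows and an empty right side (the shape of the test suggested in the conversation above), every row is emitted through path 1, so the counter should end at 6. If the update on that path were dropped, the join result would stay the same while the reported metric would read 0.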