-
Notifications
You must be signed in to change notification settings - Fork 28k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-9487] Use the same num. worker threads in Java/Scala unit tests #15848
Conversation
Just to be clear, from the discussion in the JIRA, it seems And.. it seems
is missed? |
Yes to your first comment on PageViewStream, for the other one TestSQLContext it was unfortunately missed, I'lol add that with the next pull request which will also contain the code fixes to the Python unit tests, is it ok to merge this one without the things you mention. |
Also I really would love to avoid having to rebase yet again due to more needed commits :) to this pull request |
@skanjila rebases are a fact of life, but they're not hard. I doubt it will be an issue here. |
Jenkins test this please |
Jenkins add to whitelist |
Also please fix the title to [SPARK-9487], and remove Python reference |
Test build #68529 has finished for PR 15848 at commit
|
Test build #68531 has finished for PR 15848 at commit
|
Test build #3422 has finished for PR 15848 at commit
|
Test build #68897 has finished for PR 15848 at commit
|
Assert.assertEquals(expected, result); | ||
|
||
|
||
for (int i=0;i<expected.size();i++) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This only asserts that the actual result is a superset of the expected results. It won't catch the case that it contains unexpected values. You could check both ways.
@skanjila are you proceeding with this? if not can you close it for now? |
Yes the plan is to comment out the failures for now but keep the rest of the changes, fix the Python unit tests with local[4] and then come back and fix the rest of the failures, let me know if you see issues with this approach |
This is actually what I was worried of initially. So, you mean you are going to fix the failure tests here in this PR at the end? |
The plan is to comment out the tests causing Jenkins failures for now, fix the Python unit tests to work with local[4] and then come back and fix these other Jenkins failures, make sense? |
In my personal opinion, I think we should fix here together. Maybe we could
I think making them ignored sounds pretty unsafe if they are a lot. They exist to test regression and we would not be able to detect them until they are actually tested. It is so close for Spark 2.1 release and I am worried if we meet unexpected test failures when it is actually backported. If this PR blocks the release, it might be acceptable if this is the only choice but it seems not. |
Assert.assertTrue(resultTupleList.containsAll(expectedTupleList)); | ||
} | ||
|
||
//Assert.assertEquals(expected, result); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
BTW, I think we might not need to leave this comment as it is able to be tracked down via blame button or history.
@HyujkinKwon I saw at least 3 to 4 failures in unit tests that happened in Jenkins but not locally (meaning the test succeeded locally). I completely agree with you about not marking something ignored as it has the potential to come back and cause more work later. It may take longer to fix all the failures to use local[4] than the 2.1 release. Am wondering if it makes sense to keep this pull request open, I'd like to keep working on this but focused towards fixing the Pyspatl failures at this point . |
Sorry I meant Pyspark failures from unit tests , pardon my typos |
I saw several PRs time to time to run automatic Jenkins tests (with the title including If this PR is not supposed to be updated like few weeks, I think it is reasonable to close this for now. |
I've changed the underlying trait for LogisticRegressionSuite and OneVsRestSuite to use local[2], will add in python changes next , want to get this PR across with this set of changes |
Test build #69297 has finished for PR 15848 at commit
|
@skanjila We would be able to close this if there are no updates for now. |
HyukjinKwon I've been out of the country the last 2 weeks, that's why you don't see updates, I plan to resume work on this next week by making unit tests in the sql module more reliable, once that passes I'll pick more modules |
@skanjila Could we close this for now? |
Yes by all means |
What changes were proposed in this pull request?
Changed code to use local[4] instead of local[2] when creating SparkConf, Fixed JavaAPISuite unit test, added TestSQLContext
How was this patch tested?
Ran unit tests across core, mllib,external, streaming without issues, note that this is the second version of the pull request as the first one got messed up by doing rebase incorrectly. Once this gets committed I will work on python pieces next.
Please review https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark before opening a pull request.