
[SPARK-22332][ML][TEST] Fix NaiveBayes unit test that occasionally fails (caused by non-deterministic test dataset) #19558

Closed
wants to merge 3 commits

Conversation

WeichenXu123 (Contributor) commented Oct 23, 2017

What changes were proposed in this pull request?

Fix a NaiveBayes unit test that occasionally fails: set a seed for `BrzMultinomial.sample` so that `generateNaiveBayesInput` produces a deterministic dataset.
(Without a fixed seed, the generated dataset is random, so the fitted model can occasionally exceed the test's tolerance, triggering the failure.)
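The idea behind the fix can be sketched in a short, self-contained Python analogue (the actual patch is Scala code seeding Breeze's multinomial sampler; the function and parameter names below are hypothetical, chosen to mirror `generateNaiveBayesInput`):

```python
import random

def generate_naive_bayes_input(pi, theta, n_points, n_trials, seed):
    """Hypothetical analogue of generateNaiveBayesInput: draw (label, feature
    counts) pairs; fixing the seed makes the dataset deterministic."""
    rnd = random.Random(seed)  # fixed seed: every run yields the same dataset
    data = []
    for _ in range(n_points):
        # pick a class label according to the prior pi
        label = rnd.choices(range(len(pi)), weights=pi)[0]
        # multinomial draw of n_trials feature counts over theta[label]
        counts = [0] * len(theta[label])
        for _ in range(n_trials):
            counts[rnd.choices(range(len(theta[label])), weights=theta[label])[0]] += 1
        data.append((label, counts))
    return data

pi = [0.2, 0.8]
theta = [[0.1, 0.9], [0.7, 0.3]]
run1 = generate_naive_bayes_input(pi, theta, 20, 10, seed=42)
run2 = generate_naive_bayes_input(pi, theta, 20, 10, seed=42)
assert run1 == run2  # same seed -> identical dataset across runs
```

With an identical dataset on every run, the fitted model's parameters are also identical, so they can no longer drift past the test tolerance on an unlucky draw.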

How was this patch tested?

Manually ran the tests multiple times and checked that the output models contained the same values each time.

SparkQA commented Oct 23, 2017

Test build #82974 has finished for PR 19558 at commit de780e3.

  • This patch fails to build.
  • This patch merges cleanly.
  • This patch adds no public classes.

SparkQA commented Oct 23, 2017

Test build #82978 has finished for PR 19558 at commit be2606b.

  • This patch fails to build.
  • This patch merges cleanly.
  • This patch adds no public classes.

SparkQA commented Oct 23, 2017

Test build #82983 has finished for PR 19558 at commit 331d026.

  • This patch fails Scala style tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

SparkQA commented Oct 23, 2017

Test build #82985 has finished for PR 19558 at commit 3ea8c50.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

WeichenXu123 (Contributor, Author) commented

cc @jkbradley @MrBago

jkbradley (Member) commented

LGTM
Tested locally, and it fixed the non-determinism.
Merging with master and branch-2.2
Thanks @WeichenXu123 !

asfgit pushed a commit that referenced this pull request Oct 25, 2017
…se by test dataset not deterministic)


Author: WeichenXu <weichen.xu@databricks.com>

Closes #19558 from WeichenXu123/fix_nb_test_seed.

(cherry picked from commit 841f1d7)
Signed-off-by: Joseph K. Bradley <joseph@databricks.com>
@asfgit asfgit closed this in 841f1d7 Oct 25, 2017
MatthewRBruce pushed a commit to Shopify/spark that referenced this pull request Jul 31, 2018
@WeichenXu123 WeichenXu123 deleted the fix_nb_test_seed branch April 24, 2019 21:18
3 participants