[SPARK-15150][EXAMPLE][DOC] Update LDA examples #12927
zhengruifeng wants to merge 13 commits into apache:master
Test build #57865 has finished for PR 12927 at commit
Test build #57867 has finished for PR 12927 at commit
@dongjoon-hyun Sorry for cc'ing you so often, but it would be great to hear your thoughts. I saw your PRs fixing some examples with the component
Test build #57968 has finished for PR 12927 at commit
println(s"The lower bound on the log likelihood of the entire corpus: $ll")
println(s"The upper bound on perplexity: $lp")
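The two printed quantities are related: a lower bound on the corpus log likelihood yields an upper bound on perplexity. A minimal sketch of that relationship follows, assuming the common convention that log perplexity is the negated log-likelihood bound divided by the number of tokens in the corpus. The function name and the numbers are hypothetical and are not taken from the PR.

```python
import math


def log_perplexity(log_likelihood_bound: float, token_count: float) -> float:
    """Return the log perplexity implied by a log-likelihood bound.

    Assumption: log perplexity = -(log-likelihood bound) / (token count).
    A lower bound on the log likelihood therefore gives an upper bound
    on the (log) perplexity.
    """
    if token_count <= 0:
        raise ValueError("token_count must be positive")
    return -log_likelihood_bound / token_count


# Hypothetical values for illustration only.
ll = -800.0      # lower bound on the log likelihood of the corpus
tokens = 200.0   # total number of tokens in the corpus
lp = log_perplexity(ll, tokens)

print(f"log perplexity bound: {lp}")
print(f"perplexity bound: {math.exp(lp)}")
```

Lower values are better for perplexity, while higher values are better for the log-likelihood bound.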
Test build #57991 has finished for PR 12927 at commit
Test build #58088 has finished for PR 12927 at commit
Test build #58089 has finished for PR 12927 at commit
Test build #58101 has finished for PR 12927 at commit
Test build #58102 has finished for PR 12927 at commit
Test build #58316 has finished for PR 12927 at commit
Test build #58317 has finished for PR 12927 at commit
Test build #58329 has finished for PR 12927 at commit
Minor comment on some comments, otherwise LGTM. @yanboliang?
Test build #58345 has finished for PR 12927 at commit
Please rebase your PR onto current master and fix the merge conflicts.
Test build #58356 has finished for PR 12927 at commit
LGTM. Thanks!
Merged to master and branch-2.0. Thanks!
## What changes were proposed in this pull request?

1. Create a libsvm-type dataset for LDA: `data/mllib/sample_lda_libsvm_data.txt`
2. Add a Python example
3. Read the data file directly in the examples
4. Also, change to `SparkSession` in `aft_survival_regression.py`

## How was this patch tested?

Manual tests: `./bin/spark-submit examples/src/main/python/ml/lda_example.py`

Author: Zheng RuiFeng <ruifengz@foxmail.com>

Closes #12927 from zhengruifeng/lda_pe.

(cherry picked from commit d88afab)
Signed-off-by: Nick Pentreath <nickp@za.ibm.com>
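To illustrate what a "libsvm-type dataset" looks like, here is a minimal sketch of parsing one line in that format, assuming the usual LIBSVM layout of `label index:value ...` with 1-based feature indices. The sample line and the helper name `parse_libsvm_line` are hypothetical. They are not taken from `data/mllib/sample_lda_libsvm_data.txt`, and the examples in this PR read the file through Spark rather than with hand-written parsing.

```python
from typing import Dict, Tuple


def parse_libsvm_line(line: str) -> Tuple[float, Dict[int, float]]:
    """Parse one 'label index:value ...' line.

    Returns the label and a sparse mapping from zero-based feature
    index to value. For LDA, each value can be read as a term count.
    """
    parts = line.split()
    label = float(parts[0])
    features: Dict[int, float] = {}
    for token in parts[1:]:
        index, value = token.split(":")
        # LIBSVM indices are 1-based; shift to 0-based for convenience.
        features[int(index) - 1] = float(value)
    return label, features


# Hypothetical document: term 1 occurs twice, term 3 once, term 7 four times.
sample = "0 1:2 3:1 7:4"
label, term_counts = parse_libsvm_line(sample)
print(label, term_counts)
```

Storing only the nonzero entries keeps a bag-of-words corpus compact, since each document typically uses a small fraction of the vocabulary.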