
Conversation

@feynmanliang (Contributor)

@jkbradley Exposes the variational log likelihood bound through the public API as logLikelihood. Also adds unit tests, DRYs up LDASuite, and includes the unit tests mentioned in #7760.
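
For readers of this thread, a minimal usage sketch of the new method (assuming the Spark 1.5-era spark.mllib API, an existing SparkContext `sc`, and the online optimizer, which yields a LocalLDAModel):

```scala
import org.apache.spark.mllib.clustering.{LDA, LocalLDAModel}
import org.apache.spark.mllib.linalg.{Vector, Vectors}
import org.apache.spark.rdd.RDD

// Toy corpus: (docId, term-count vector) pairs over a 3-word vocabulary.
val corpus: RDD[(Long, Vector)] = sc.parallelize(Seq(
  (0L, Vectors.dense(1.0, 2.0, 0.0)),
  (1L, Vectors.dense(0.0, 1.0, 3.0))
))

// The online optimizer produces a LocalLDAModel.
val model = new LDA()
  .setK(2)
  .setOptimizer("online")
  .run(corpus)
  .asInstanceOf[LocalLDAModel]

// Variational lower bound on log p(corpus); higher is better.
val ll = model.logLikelihood(corpus)

// Per-token log perplexity derived from the same bound (see discussion below).
val lp = model.logPerplexity(corpus)
```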

@SparkQA

SparkQA commented Jul 30, 2015

Test build #39078 has finished for PR 7801 at commit f0996d8.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

Member
This is a lower bound, not an upper bound, on the log likelihood.
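
For context, the quantity exposed here is the evidence lower bound (ELBO) from variational inference; in the Online LDA paper's notation, for any variational distribution q over the latent variables,

```latex
\log p(w \mid \alpha, \eta)
  \;\ge\;
  \mathbb{E}_q\left[\log p(w, z, \theta, \beta \mid \alpha, \eta)\right]
  - \mathbb{E}_q\left[\log q(z, \theta, \beta)\right]
```

so the reported logLikelihood can understate, but never overstate, the true log likelihood.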

@jkbradley (Member)

This looks good, but it made me realize that gensim defines log perplexity as the negation of the log perplexity in the Online LDA paper's Eq 15. I prefer the Online LDA paper way, since low perplexity sounds like a good thing to me (whereas with gensim, high perplexity is better). I checked Stanford NLP, and they also say lower perplexity is better. Can you please modify the logPerplexity code to negate the returned value?
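
A minimal sketch of the requested sign convention (a hypothetical helper for illustration, not the actual patch):

```scala
// Online LDA paper, Eq 15: perplexity = exp(-bound / tokenCount), so the
// per-token log perplexity is the negated bound and lower values are better.
def logPerplexity(bound: Double, totalTokenCount: Long): Double =
  -bound / totalTokenCount
```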

@feynmanliang force-pushed the SPARK-9481-logLikelihood branch from f0996d8 to 6d1b2c9 on July 31, 2015.
@SparkQA

SparkQA commented Jul 31, 2015

Test build #39238 has finished for PR 7801 at commit 6d1b2c9.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@jkbradley (Member)

LGTM, thanks! Merging with master.

@asfgit closed this in a8340fa on Jul 31, 2015.
@feynmanliang deleted the SPARK-9481-logLikelihood branch on August 3, 2015.
@feynmanliang changed the title from "[SPARK-9481] Add logLikelihood to LocalLDAModel" to "[SPARK-9481][MLlib] Add logLikelihood to LocalLDAModel" on Aug 10, 2015.