-
Notifications
You must be signed in to change notification settings - Fork 28.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-11940][PYSPARK][ML] Python API for ml.clustering.LDA PR2 #12723
Conversation
CC: @yanboliang Would you have time to take a look at this PR? Thanks! |
Test build #57081 has finished for PR 12723 at commit
|
pass | ||
|
||
|
||
class LDA(JavaEstimator, HasFeaturesCol, HasMaxIter, HasSeed, HasCheckpointInterval, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@inherit_doc
Made one pass, thanks. |
…erit_doc to LDA and models
@yanboliang Thanks! I think I addressed everything so far. |
Test build #57163 has finished for PR 12723 at commit
|
I saw you put |
I've wondered about the save/load in doc tests. On the one hand, it's nice to have that example, but on the other hand, it's going to be an extra save/load test for every run of the Jenkins tests (once we beef up the actual unit tests). I'll add save/load for now but we should re-evaluate in the future. |
Updated! |
Test build #57268 has finished for PR 12723 at commit
|
test this please |
Test build #57281 has finished for PR 12723 at commit
|
Ready now? |
LGTM, thanks! |
Thanks @yanboliang and @zjffdu ! |
What changes were proposed in this pull request?
pyspark.ml API for LDA
This replaces [https://github.com//pull/10242]
How was this patch tested?