Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LDA model persistence #12

Closed
trpstra opened this issue Jan 2, 2021 · 2 comments
Closed

LDA model persistence #12

trpstra opened this issue Jan 2, 2021 · 2 comments

Comments

@trpstra
Copy link

trpstra commented Jan 2, 2021

Thanks for this library, it seems really useful.
I have been playing around a bit with a feature extractor pipeline of countvectoriser and tfidf transformer feeding into an LDA transformer, but I can't seem to save the Fit'ed pipeline to disk and reload it later to Transform new docs. Looking at the serialized pipeline in json, it seems the vocabulary is there, as well as the tokenizer info and various LDA params, but I don't see the induced topics (matrices). Maybe this is a problem with the way I serialized it? If you can point to a working example of how to properly serialize a trained LDA model and re-use it later, that would be great.
Thanks again!

@trpstra trpstra changed the title LDA model persisting LDA model persistence Jan 2, 2021
@james-bowman
Copy link
Owner

james-bowman commented Jan 3, 2021 via email

@trpstra
Copy link
Author

trpstra commented Jan 4, 2021

Thanks, that makes sense. I will have a look.

@trpstra trpstra closed this as completed Jan 4, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants