Optimise LdaModel.show_topics performance to only call get_lambda once #1006

nicolaslecrique · 2016-11-11T10:04:52Z

This function calls show_topic for each topic, which calls get_topic_terms which calls self.state.get_lambda().

self.state.get_lambda() call is very expensive, it is not dependent on a topic and can be called once for all.
On my setup (500 topics, 100k words, learned with 500k documents), calling show_topics takes 30sec, with 28sec inside the 500 get_lambda() calls

souravsingh · 2016-11-13T06:29:19Z

I would like to work on the issue. How do I start?

tmylk · 2016-11-13T08:23:58Z

Let's finish your other PRs first @souravsingh :)

bhargavvader · 2016-11-16T16:04:45Z

@tmylk I tried a fair amount to try and fix this but the only solutions which I could think of are not very pretty.

We could maybe have an option to pass the lambda values, so instead of calling it within the function each time, the matrix is just passed around and we can pick up the row we need.

The other option is to completely avoid calling show_topic in show_topics, and just write up code to do what is happening and keep self.state.get_lambda() only once in the beginning of the function. This also works, but is not really pretty, and there is code duplication.

Both the methods I tried work and pass tests without having to call get_lambda multiple times, but don't seem like the best solution, so I did not open a PR. If one of them sounds okay, I'll open a PR for it.

Any other suggestions?

tmylk · 2016-11-17T14:16:55Z

The optimised show_topics should not call the show_topic. That is a good solution

nicolaslecrique · 2017-02-11T09:52:54Z

Thanks a lot ! I'll check this

tmylk added bug Issue described a bug difficulty easy Easy issue: required small fix labels Nov 11, 2016

tmylk changed the title ~~fix LdaModel.show_topics performance~~ Optimise LdaModel.show_topics performance to only call get_lambda once Nov 11, 2016

nicolaslecrique mentioned this issue Nov 13, 2016

remove TopicModel.get_topics monkey patch gator-life/gator.life#79

Open

bhargavvader mentioned this issue Nov 23, 2016

Optimised show_topics #1028

Merged

tmylk closed this as completed in 0b2f6b8 Nov 27, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimise LdaModel.show_topics performance to only call get_lambda once #1006

Optimise LdaModel.show_topics performance to only call get_lambda once #1006

nicolaslecrique commented Nov 11, 2016

souravsingh commented Nov 13, 2016

tmylk commented Nov 13, 2016

bhargavvader commented Nov 16, 2016

tmylk commented Nov 17, 2016

nicolaslecrique commented Feb 11, 2017

Optimise LdaModel.show_topics performance to only call get_lambda once #1006

Optimise LdaModel.show_topics performance to only call get_lambda once #1006

Comments

nicolaslecrique commented Nov 11, 2016

souravsingh commented Nov 13, 2016

tmylk commented Nov 13, 2016

bhargavvader commented Nov 16, 2016

tmylk commented Nov 17, 2016

nicolaslecrique commented Feb 11, 2017