Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parallel computation of multiple coherences #77

Open
3 tasks
MichaelRoeder opened this issue Jul 19, 2022 · 0 comments
Open
3 tasks

Parallel computation of multiple coherences #77

MichaelRoeder opened this issue Jul 19, 2022 · 0 comments

Comments

@MichaelRoeder
Copy link
Member

User story

As a researcher, I am interested in calculating several coherence values for a given topic. At the moment, I have to rerun Palmetto several times. Each time, I provide the set of all topics and one of the coherence measures for which I would like to get the values. I would appreciate it if I could simply provide a list of coherence measures and Palmetto would give me all the results at once.

Another advantage would be the time that could be saved. Since the index is the performance bottleneck, it might be faster to simply calculate all coherence values for a single topic while the necessary counts are in the main memory instead of reading them again and again from the index.

Development steps

  • Adapt the inner workflow of Palmetto
  • Update the CLI interface to allow a list of coherence measure names instead of a single name
  • Update the web service to allow the same functionality (optional)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant