Topic Modeling Workflow in Python

tmw - Topic Modeling Workflow

DOI: 10.5281/zenodo.32490

What is tmw?

tmw is a Python module for topic modeling that also covers some preprocessing of texts and some postprocessing of topic model data. This set of functions is experimental, both in nature and in quality.

Requirements

  • tmw has been developed for and tested only on Linux (Ubuntu 14.04).
  • Python 3 (tested with 3.4), Mallet (tested with 2.0.7) and TreeTagger with the desired parameter files.
  • Python 3 packages numpy, pandas, matplotlib, lxml, scipy, seaborn, wordcloud.
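Before running tmw, it can help to confirm that the requirements above are actually available. The following is a minimal sketch, not part of tmw itself. The helper names (`missing_packages`, `missing_tools`, `python_ok`) are hypothetical, and the executable names `mallet` and `tree-tagger` are assumptions about how the tools are installed on your system; adjust them to your setup.

```python
import importlib.util
import shutil
import sys

# Python packages listed in the requirements above.
REQUIRED_PACKAGES = ["numpy", "pandas", "matplotlib", "lxml",
                     "scipy", "seaborn", "wordcloud"]

# Assumed executable names for Mallet and TreeTagger; these depend on
# how the tools were installed and may differ on your machine.
EXTERNAL_TOOLS = ["mallet", "tree-tagger"]


def missing_packages(names):
    """Return the package names that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def missing_tools(names):
    """Return the executable names that are not found on the PATH."""
    return [name for name in names if shutil.which(name) is None]


def python_ok(minimum=(3, 4)):
    """Check that the running interpreter is at least the given version."""
    return sys.version_info[:2] >= minimum


if __name__ == "__main__":
    print("Python version ok:", python_ok())
    print("Missing packages:", missing_packages(REQUIRED_PACKAGES))
    print("Missing tools:", missing_tools(EXTERNAL_TOOLS))
```

The check only reports whether something is installed, not which version. It does not verify the tested versions (Mallet 2.0.7, Python 3.4) or the presence of TreeTagger parameter files.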

Usage

Acknowledgements

Most of the code was written by Christof Schöch, with significant contributions by Daniel Schlör. The project is inspired by Allen Riddell's TAToM tutorials. We are grateful to the authors of all the existing packages and code that this project reuses.

Contact