Topics is a gentle introduction to Topic Modeling. It provides a convenient, modular workflow that can be entirely controlled from within and which comes with a well documented Jupyter notebook, integrating three of the most popular LDA implementations: Gensim, MALLET, and lda. Users not yet familiar with working with Python scripts can test basic topic modeling in a Flask-based GUI demonstrator.
At the moment, there are three Jupyter notebooks:
- Download and install the latest version of WinPython.
- Download and install Git.
- Open the WinPython PowerShell Prompt.exe in your WinPython folder and type
git clone https://github.com/DARIAH-DE/Topics.git
to clone Topics into your WinPython folder. - Type
cd .\Topics
in WinPython PowerShell to navigate to the Topics folder. 5a. Either: Typepip install .
in Winpython PowerShell to install packages required by Topics 5b. Or: Typepip install -r requirements.txt
in Winpython PowerShell to install Topics with additional development packages. - Type
jupyter notebook
in WinPython PowerShell to open Jupyter, select one of the files with suffix.ipynb
and follow the instructions. - Note: For the development packages the Python module future is needed. Depending in your WinPython and your Windows version you might have to install future manually.
- Therefore, download the latest future-x.xx.x-py3-none-any.whl.
- Open the WinPython Control Panel.exe in your WinPython folder.
- Install the future-wheel via the WinPython Control Panel.exe.
- Download and install Git.
- Open the command-line interface, type
git clone https://github.com/DARIAH-DE/Topics.git
to clone Topics into your working directory. - Note: The distribution packages
libfreetype6-dev
andlibpng-dev
and a compiler for C++, e.g. gcc have to be installed. - Open the command-line interface, navigate to the folder Topics and type
pip install . --user
to install the required packages. - Install Jupyter and run it by typing
jupyter notebook
in the command-line. - Access the folder Topics through Jupyter in your browser, select one of the files with suffix
.ipynb
and follow the instructions.
- Download and unzip MALLET.
- Set the environment variable for MALLET.
For more detailed instructions, have a look at this.