This repo implements the strategies outlined in Principled Analysis of Energy Discourse across Domains with Thesaurus-based Automatic Topic Labeling (2021) and is an easy way to implement an automatic labelling technique for LDA-style topics or lists of keywords using a pre-defined thesaurus.
If any questions arise please don't hesitate to contact me at:
tscelsi@student.unimelb.edu.au
pip install -e dtm_toolkit
from dtm_toolkit.auto_labelling import AutoLabel
...
In the examples folder dtm_toolkit/examples
, we show some examples of analying using the dtm analysis module of the toolkit, and also some simple examples of the automatic topic labelling procedure.
Within the toolkit we mainly provide tools for preprocessing text, automatic labelling and the creation of valid input for the dynamic topic model (DTM) as implemented here.