Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
ambiguous_sentences_topic_modelling.ipynb		ambiguous_sentences_topic_modelling.ipynb
country_topic_model_bert.ipynb		country_topic_model_bert.ipynb
process_countries.py		process_countries.py
requirements.txt		requirements.txt

README.md

bert-based-topic-model

In this approach we use BERTopic modelling on a corpus of country-based evaluation reports. The corpus is scraped and generated using scripts found in scrape-tool repo.

setup

To run the Jupyter notebook; first install the dependencies

pip install -r requirements.txt

the run the Jupyter interface in the source directory

jupyter lab

country_topic_model_bert.ipynb

This notebook implements guided topic modeling using BERTopic and a dictionary to seed and guid the modeling.

ambiguous_sentences_topic_modelling.ipynb

This notebook analyses the set of sentences labelled as ambiguous to find latent topics in the ambiguity. It applies BERTopic and SentenceTransformer. KeyBERTInspired is used as a representation model to better name the found topics.

dataset files

The notebook will attempt to download the corpus file and dictionary file from the democracy-dataset repo. The corpus is a CSV file structured as sentence, country, year and source where sentence is a sentence from the report, country, year and source identify where the sentece was extracted from.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bert-based-topic-model

bert-based-topic-model

README.md

README.md

ambiguous_sentences_topic_modelling.ipynb

ambiguous_sentences_topic_modelling.ipynb

country_topic_model_bert.ipynb

country_topic_model_bert.ipynb

process_countries.py

process_countries.py

requirements.txt

requirements.txt

README.md

bert-based-topic-model

setup

country_topic_model_bert.ipynb

ambiguous_sentences_topic_modelling.ipynb

dataset files

Files

bert-based-topic-model

Directory actions

More options

Directory actions

More options

Latest commit

History

bert-based-topic-model

Folders and files

parent directory

bert-based-topic-model

setup

country_topic_model_bert.ipynb

ambiguous_sentences_topic_modelling.ipynb

dataset files