This repository contains some basic topic modelling (using the gensim library) in Python. The corpus is based on the Manchester child language corpus (downloaded from the CHILDES website: http://childes.psy.cmu.edu/).
To create the LDA model and save the to file, run:
python model.py lda
Or if you prefer to run LSI/LSA, run:
python model.py lsi
To generate the figures and save them to file, run:
python clustering.py
##Dendrogram
##PCA