a python implementation of latent dirichlet allocation(lda) using gibbs sampling algorithm
Python
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
images init Jul 10, 2016
README.md init Jul 10, 2016
dataset.txt init Jul 10, 2016
dataset_cn.txt init Jul 10, 2016
main.py init Jul 10, 2016
stopwords.dic init Jul 10, 2016

README.md

LDA (Latent Dirichlet Allocation)

This is a python implementation of LDA using gibbs sampling algorithm.

The following picture shows the top 10 words in the 10 topics (set K = 10) generated by this algorithm over 16 sentences about one piece on wikipedia.

res1

The following picture shows the top 10 words in the 10 topics (set K = 10) generated by this algorithm over 5000 chinese sina social news.

res2

The following picture shows the top 10 words in the 30 topics (set K = 30) generated by this algorithm over 5000 chinese sina social news.

res3

Author