a python implementation of latent dirichlet allocation(lda) using gibbs sampling algorithm
Python
Latest commit 49341f5 Jul 10, 2016 @laserwave init
Permalink
Failed to load latest commit information.
images
README.md
dataset.txt
dataset_cn.txt
main.py
stopwords.dic

README.md

LDA (Latent Dirichlet Allocation)

This is a python implementation of LDA using gibbs sampling algorithm.

The following picture shows the top 10 words in the 10 topics (set K = 10) generated by this algorithm over 16 sentences about one piece on wikipedia.

res1

The following picture shows the top 10 words in the 10 topics (set K = 10) generated by this algorithm over 5000 chinese sina social news.

res2

The following picture shows the top 10 words in the 30 topics (set K = 30) generated by this algorithm over 5000 chinese sina social news.

res3

Author