Term Project
Sequential Version
Author: Kai Zhen
Date: 10/05/2016 11:45pm
http://pages.iu.edu/~zhenk/lda_appendix.pdf
Arguments: harpLDAinput2.dat toyoutput.txt toyoutput2.txt 1000 0.5 0.1 30 10 5
=====================Top 10 words for topic 0=====================
protests
man
tiananmen
party
beijing
china
square
tanks
chinese
tank
=====================Top 10 words for topic 1=====================
learning
bayesian
human
data
models
cognitive
computational
concepts
language
people
LDA is more for the situation where each topic is relatively as powerful as others. If there are 4 docs mainly for topic A, 4 for topic B, and only 1 for topic C. Then there is an issue.