Latent Dirichlet Allocation Algorithm
This is a Java implementation of Latent Dirichlet Allocation (LDA) algorithm using Gibbs sampling technique.
To learn about how LDA works, you can read David Blei & Andrew Ng paper:
One essential problem in NLP is to build a generative model to describe a collection of documents. In particular, one might be interested in extracting underlying topics from a collection of documents. LDA models a document as a bag of words in which words (w's) are the observed variables while topics are latent variables.
How to build/run LDA project?
LDA is a Maven project. If you use Eclipse with Maven plugin, you need to import LDA as a Maven project and build it from there.
For more information on how to import a Maven project check this thread:
Or, watch the following video: https://www.youtube.com/watch?v=BlkgrXb3L0c
For technical questions you can reach Kazem at k[DOT]jahanbakhsh[AT]gamil[DOT]com