Latent Dirichlet Allocation (LDA) implementation in Java.
Java
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
resources first commit Oct 14, 2014
src first commit Oct 14, 2014
README.md
pom.xml first commit Oct 14, 2014

README.md

Latent Dirichlet Allocation Algorithm

This is a Java implementation of Latent Dirichlet Allocation (LDA) algorithm using Gibbs sampling technique.

To learn about how LDA works, you can read David Blei & Andrew Ng paper:

http://machinelearning.wustl.edu/mlpapers/paper_files/BleiNJ03.pdf

One essential problem in NLP is to build a generative model to describe a collection of documents. In particular, one might be interested in extracting underlying topics from a collection of documents. LDA models a document as a bag of words in which words (w's) are the observed variables while topics are latent variables.

Contributor

Kazem Jahanbakhsh

How to build/run LDA project?

LDA is a Maven project. If you use Eclipse with Maven plugin, you need to import LDA as a Maven project and build it from there.

For more information on how to import a Maven project check this thread:

http://stackoverflow.com/questions/2061094/importing-maven-project-into-eclipse

Or, watch the following video: https://www.youtube.com/watch?v=BlkgrXb3L0c

Contact

For technical questions you can reach Kazem at k[DOT]jahanbakhsh[AT]gamil[DOT]com