Skip to content

soheil-mp/Latent-Dirichlet-Allocation-LDA

Repository files navigation

Latent Dirichlet Allocation (LDA)

LDA is used to classify texts to a specific topic. It builds topics per document model and words per topic model. We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of discrete data such as text corpora. LDA is a three-level hierarchical Bayesian model, in which each item of a collection is modeled as a finite mixture over an underlying set of topics. Each topic is, in turn, modeled as an infinite mixture over an underlying set of topic probabilities. In the context of text modeling, the topic probabilities provide an explicit representation of a document. Read more in "Latent Dirichlet Allocation" paper.

Fig.1 - The intuitions behind latent Dirichlet allocation. (image taken here)

Result

We used a dataset for news report. You can see the result of generated topics for the gieven report here:

Fig.2 - The output result for generated topics.

About

Generating topics for any given news report using LDA

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published