Skip to content

Example of Topic Coherence Calculation for LDA model in Apache Spark

License

Notifications You must be signed in to change notification settings

gnupinguin/spark-lda-coherence

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

spark-lda-coherence

Example of Topic Coherence Calculation for LDA (Latent Dirichlet allocation) model in Apache Spark.

The example uses Pointwise mutual information (PMI) for topic coherence calculation. In details, it uses Intrinsic UMass measure. Helpful articles:

How to use

There is example of usage in CoherenceTest file.

Also, you can compile the project and add it dependency to your project:

Example, publish to local Maven: sbt publishM2

And next:

libraryDependencies += "io.github.gnupinguin" %% "ldacoherence_2.12" % "1.0"

About

Example of Topic Coherence Calculation for LDA model in Apache Spark

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages