A research package for topic modeling
Java Python
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
examples
gradle/wrapper
src
.travis.yml
Fugue.py
LICENSE
README.md
build.gradle
gradlew
gradlew.bat
settings.gradle

README.md

Fugue

Build Status codecov.io Dependency Status Code Climate Codacy Badge GitHub license

A Topic Modeling Package

Fugue is a research oriented topic modeling package for text mining and machine learning practitioners, designed for understanding models and algorithms. The philosophy of the package is to include mature models and algorithms that represent the state-of-the-art of research in topic modeling.

Models/Algorithms

  • Latent Dirichlet allocation (LDA) with collapsed Gibbs sampling
    • Using "Estimate theta" method for computing perplexity in test documents.
    • Using multiple MCMC chains to average results.
    • Slice sampling for hyper-parameter sampling
    • Optimizain methods for hyper-parameter tuning

Roadmap

We are expecting v1.0 in late 2016. See Detailed Plan.

Prerequisites

  • Gradle
  • JDK
  • Python 2.7

The design of the package is to limit the prerequesite packages and therefore you can easily experiment new models with minimum environments.

Platforms

  • Mac/Linux

How to Compile

$ python Fugue.py

Run AP example

$ python Fugue.py --task train