Implementation of the paper "Decomposed Normalized Maximum Likelihood Codelength Criterion for Selecting Hierarchical Latent Variable Models".
Parameters:
K: topic size, i.e. the complexity (model order) of the model.
V: vocabulary size, the number of unique words in the given documents.
D: document size, the number of documents.
alpha: the hyperparameter of the Dirichlet prior on the topic distribution of each document.
beta: the hyperparameter of the Dirichlet prior on the word distribution of each topic.
All artificial data is generated by the generator in model/topic_model/ArtificialDataGenerator.py. Example usage:
dd = LDAArtificialDataGenerator(K, V, alpha, beta, k_noise=0, noise_alpha=1, noise_beta=0.1, random_state=None)
X = dd.generate_artificial_data(D, N, noise_threshold=0.0)
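For readers unfamiliar with the generative process such a generator implements, here is a minimal standalone sketch of LDA data generation using only the standard library. The function and variable names are illustrative assumptions, not the repo's API; the repo's generator additionally supports noise topics, which are omitted here.

```python
import random

def sample_dirichlet(alpha_vec, rng):
    # Draw from a Dirichlet distribution via normalized Gamma samples.
    g = [rng.gammavariate(a, 1.0) for a in alpha_vec]
    s = sum(g)
    return [x / s for x in g]

def generate_lda_corpus(K, V, D, N, alpha, beta, seed=0):
    """Generate D documents of N words each from the LDA generative process.

    Returns a D x V matrix of word counts (list of lists).
    """
    rng = random.Random(seed)
    # Per-topic word distributions phi_k ~ Dirichlet(beta).
    phi = [sample_dirichlet([beta] * V, rng) for _ in range(K)]
    X = []
    for _ in range(D):
        # Per-document topic distribution theta_d ~ Dirichlet(alpha).
        theta = sample_dirichlet([alpha] * K, rng)
        counts = [0] * V
        for _ in range(N):
            z = rng.choices(range(K), weights=theta)[0]   # latent topic Z
            w = rng.choices(range(V), weights=phi[z])[0]  # observed word
            counts[w] += 1
        X.append(counts)
    return X

X = generate_lda_corpus(K=3, V=10, D=5, N=20, alpha=0.5, beta=0.1)
```

Each row of X is a bag-of-words count vector for one document, which is the input format the learner below expects.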
The actual learner is LatentDirichletAllocation in model/topic_model/LDA/VB/online_lda.py.
LatentDirichletAllocationWithSample is a thin wrapper around LatentDirichletAllocation that samples the latent variable Z in order to compute the decomposed NML (DNML) code length.
LatentDirichletAllocationWithScore is a thin wrapper around LatentDirichletAllocationWithSample that computes several model selection criteria, such as DNML, a-NML, AIC, BIC, and the VB criterion, based on the sampled Z.
learner = LatentDirichletAllocationWithScore(verbose=0, learning_method='batch', evaluate_every=evaluate_every, perp_tol=perp_tol, max_iter=max_iter)
learner.fit(X)
criterion_scores = learner.score_new(X)
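Model selection then amounts to fitting one learner per candidate K and picking the K that minimizes the chosen criterion (code lengths and penalized likelihoods are minimized). The sketch below assumes the scores for each K have been collected into a dict mapping criterion names to values; that layout, the criterion names, and the numbers are illustrative assumptions, not the repo's API.

```python
def select_model_order(scores_by_K, criterion="DNML"):
    """Return the candidate K whose criterion value is smallest.

    scores_by_K maps K -> {criterion_name: value}; smaller is better
    for code-length and penalized-likelihood criteria.
    """
    return min(scores_by_K, key=lambda K: scores_by_K[K][criterion])

# Hypothetical criterion values for candidate K = 2..4 (numbers made up).
scores = {
    2: {"DNML": 1523.4, "AIC": 1601.2, "BIC": 1650.8},
    3: {"DNML": 1490.1, "AIC": 1612.7, "BIC": 1698.3},
    4: {"DNML": 1511.9, "AIC": 1640.0, "BIC": 1755.2},
}
best_K = select_model_order(scores, criterion="DNML")  # -> 3
```

Note that different criteria can disagree: on these made-up numbers, AIC would pick K=2 while DNML picks K=3.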
model/topic_model/LDA/LDA_R.py
model/relation_model/DataGenerater.py
model/relation_model/SBM/SBM_R.py: a thin wrapper around the R package blockmodels.
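For the relation-model side, the stochastic block model (SBM) generative process can be sketched in a few lines of standard-library Python. This is an illustration of the model the files above work with, not the repo's DataGenerater.py API; all names and arguments are assumptions.

```python
import random

def generate_sbm(block_probs, block_sizes, seed=0):
    """Sample an undirected adjacency matrix from a stochastic block model.

    block_probs[a][b] is the edge probability between blocks a and b;
    block_sizes gives the number of nodes in each block.
    Returns the adjacency matrix and the node-to-block labels.
    """
    rng = random.Random(seed)
    # Assign consecutive nodes to each block.
    labels = []
    for b, size in enumerate(block_sizes):
        labels.extend([b] * size)
    n = len(labels)
    A = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            # Edge (i, j) is Bernoulli with the blocks' connection probability.
            if rng.random() < block_probs[labels[i]][labels[j]]:
                A[i][j] = A[j][i] = 1
    return A, labels

A, labels = generate_sbm(
    block_probs=[[0.9, 0.1], [0.1, 0.9]],  # dense within blocks, sparse between
    block_sizes=[3, 3],
)
```

Here the block labels play the role of the latent variable Z, so the same DNML decomposition applies as in the LDA case.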