KIDS-DMM (Keyword Informative Discriminative Scheme Dirichlet Multinomial Mixture)

KIDS-DMM uses an open-source Java package to implement the algorithm.

1. Requirements

Java （Version=1.8）

2. Datasets

We procided the following three short text datasets for evaluation, SearchSnippets, GoogleNews, and Biomedical. All of corpus files and the corresponding label files have been prepared in the path ./datasets according to the survey, Short Text Topic Modeling Techniques, Applications, and Performance: A Survey.

Taking SearchSnippets as an example, the dataset file path is as follows.

datasets

SearchSnippets

word_wiki

SearchSnippets.txt

SearchSnippets_label.txt

SearchSnippets_vocab.txt

SearchSnippets_Word2VecSim.txt

For the corresponding word_wiki and the word2VecSim, you can download from this following this paper.

3. Run and Evaluate KIDS-DMM

bash run.sh

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.idea		.idea
datasets		datasets
lib		lib
out		out
results		results
src		src
KIDS-DMM.iml		KIDS-DMM.iml
README.md		README.md
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KIDS-DMM (Keyword Informative Discriminative Scheme Dirichlet Multinomial Mixture)

1. Requirements

2. Datasets

3. Run and Evaluate KIDS-DMM

About

Releases

Packages

Languages

rwang16/KIDS-DMM

Folders and files

Latest commit

History

Repository files navigation

KIDS-DMM (Keyword Informative Discriminative Scheme Dirichlet Multinomial Mixture)

1. Requirements

2. Datasets

3. Run and Evaluate KIDS-DMM

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages