Find file History
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
..
Failed to load latest commit information.
5k_words.vocab
README.txt
README_TOOLS.txt
makeLM
prep.awk
wsj5kc.Z.DMP

README.txt

---------------------------------------------
What is this?
---------------------------------------------
wsj5k.DMP is a 5000 word language model suitable for use with Sphinx-3 and
Sphinx-4.  The file is in the CMU binary (DMP) format. It contains:

    4,988 unigrams
1,529,984 bigrams
7,851,482 trigrams

This language model was created using the CMU-Cambridge Statistical Language
Modeling toolkit and data from the LDC. This is a closed vocabulary model.
Good-Turing discounting was applied.