Carmel-Repository

This is a repository for all finite-state machines that are compatible with the Carmel toolkit (http://www.isi.edu/licensed-sw/carmel).

FSA	Description
none yet	none yet

WFSA	Description
wfsa001	English letter 2-gram model, P(e)
wfsa002	English phoneme 3-gram model, P(epron); plus generic n-gram WFSA builder
wfsa003	Letter 3-gram models for dozens of languages, built from UNDHR data
wfsa004	Letter 3-gram model for English, built from Penn Treebank plaintext
wfsa005	Pronounceable grapheme segment 3-gram model for English, built from the aligned CMU Pronunciation Dictionary
wfsa006	Accepts any word of alphabet [A-Z] except for the word "FILTER." Includes code for building any such filter.

FST	Description
fst001	English-to-Cipher letter substitutor, fully connected
fst002	English-to-Phonemes, built from CMU Pron Dict
fst002a	English-to-Phonemes, with fixes
fst003	Pinyin-if-to-pinyin, bi-directional

WFST	Description
wfst001	English-to-French word-to-word translator, P(f\|e)
wfst002	French-to-English word-to-word translator, P(e\|f)
wfst003	Pinyin-to-Chinese bi-directional P(p\|c) and P(c\|p)
wfst004	Pinyin-if-to-Epron, Pinyin initial/final to IPA (English Pronounciation)
wfst005	English-to-Syllable-Stress, includes original CMU pronunciation lexicon
wfst006	English phonemes to grapheme (spell out), useful for OOV
wfst007	English graphemes to pronounceable grapheme segment sequences
wfst008	Adds start and end tags to English pronounceable grapheme segment sequences
wfst009	English pronounceable grapheme segment sequences to phoneme (pronouncer), useful for OOV

Cascade	Description
casc001	English-to-English word-to-word paraphraser, P(e\|e)
casc002	English letter-substitution cipher solver
casc003	English-to-Chinglish
casc004	English word to pronounciation
casc005	Symbol clustering using EM (any # of clusters, any n-gram order)

Name		Name	Last commit message	Last commit date
Latest commit History 66 Commits
casc001		casc001
casc002		casc002
casc003		casc003
casc004		casc004
casc005		casc005
fst001		fst001
fst002		fst002
fst002a		fst002a
fst003		fst003
japanese_transliteration		japanese_transliteration
wfsa001		wfsa001
wfsa002		wfsa002
wfsa003		wfsa003
wfsa004		wfsa004
wfsa005		wfsa005
wfsa006		wfsa006
wfst001		wfst001
wfst002		wfst002
wfst003		wfst003
wfst004		wfst004
wfst005		wfst005
wfst006		wfst006
wfst007		wfst007
wfst008		wfst008
wfst009		wfst009
README.md		README.md

lizuyao2010/Carmel-Repository