As a newcomer to Kaldi, I am sometimes confused by its naming conventions. Below is a note on common Kaldi abbreviations, with their full names and examples.
Abbreviation | Full name | Example |
---|---|---|
egs | examples | kaldi/egs/wsj |
tri | triphone model | tri1, tri2, tri3 |
feat | (acoustic) feature | feats.scp |
cmvn | cepstral mean and variance normalization | cmvn.scp |
spk | speaker | utt2spk |
utt | utterance | utt2spk |
mdl | model | final.mdl |
sat | speaker adaptive training | train_sat.sh |
lda | linear discriminant analysis | train_lda_mllt.sh |
mllt | maximum likelihood linear transform | train_lda_mllt.sh |
acwt | acoustic model weight | post_decode_acwt |
lmwt | language model weight | lmwt |
lexiconp | lexicon with pronunciation probabilities | lexiconp.txt |
hires | high-resolution | mfcc_hires.conf |
conf | configuration | mfcc.conf |
HCLG | HMM, Context-dependency, Lexicon, and Grammar | HCLG.fst |
bg | bigram | |
pr | pruned | |
bd | big dictionary | |
nosp | no silence probabilities and pronunciation probabilities | lang_tmp_nosp |
tgpr | pruned trigram (language model) | lang_tmp_tgpr |
sp | speed-perturbed | exp/ali_train_set_sp |
lores | low-resolution | |
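
Because most Kaldi file and directory names are just these abbreviations joined by underscores, a name can often be decoded piece by piece. The snippet below is a minimal sketch of that idea, not part of Kaldi itself: `ABBREVIATIONS` is a hand-copied subset of the table above, and `expand_name` is a hypothetical helper. It assumes that tokens are separated by `_`, `.`, or `/`, that a `2` between letters means "to" (as in `utt2spk`), and that trailing digits (as in `tri1`) are only stage numbers.

```python
import re

# Subset of the table above, copied by hand (illustrative, not exhaustive).
ABBREVIATIONS = {
    "egs": "examples",
    "tri": "triphone model",
    "cmvn": "cepstral mean and variance normalization",
    "spk": "speaker",
    "utt": "utterance",
    "mdl": "model",
    "sat": "speaker adaptive training",
    "lda": "linear discriminant analysis",
    "mllt": "maximum likelihood linear transform",
    "hires": "high-resolution",
    "conf": "configuration",
    "nosp": "no silence probabilities and pronunciation probabilities",
    "tgpr": "pruned trigram (language model)",
    "sp": "speed-perturbed",
}

# Assumed separators: '_', '.', '/', and a '2' sitting between two letters.
_SEPARATORS = re.compile(r"[_./]|(?<=[a-z])2(?=[a-z])")


def expand_name(name):
    """Return (token, meaning) pairs; meaning is None for unknown tokens."""
    pairs = []
    for token in _SEPARATORS.split(name):
        if not token:
            continue
        # Drop a trailing stage number before the lookup, e.g. "tri1" -> "tri".
        key = re.sub(r"\d+$", "", token)
        pairs.append((token, ABBREVIATIONS.get(key)))
    return pairs


if __name__ == "__main__":
    for name in ["utt2spk", "train_lda_mllt.sh", "lang_tmp_nosp", "tri1"]:
        print(name)
        for token, meaning in expand_name(name):
            print(f"  {token}: {meaning or '(not in table)'}")
```

Tokens such as `train` or `lang` are ordinary words rather than abbreviations from the table, so the sketch reports them as unknown instead of guessing.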