Describe the bug
The n-gram training stage in espnet/egs2/TEMPLATE/slu1/slu.sh uses only the first token of each transcript.
Current command in the script:
cut -f 2 -d " " ${data_feats}/lm_train.txt | lmplz -S "20%" --discount_fallback -o ${ngram_num} - >${ngram_exp}/${ngram_num}gram.arpa
Because -f 2 selects only the second space-separated field, this takes only the first token of each transcript for n-gram training.
It seems the command should be the following, which takes all the tokens in the transcripts:
cut -f 2- -d " " ${data_feats}/lm_train.txt | lmplz -S "20%" --discount_fallback -o ${ngram_num} - >${ngram_exp}/${ngram_num}gram.arpa
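To make the difference concrete, here is a minimal sketch that runs both cut invocations on a small sample. The file contents and utterance IDs are hypothetical, and the sketch assumes lm_train.txt holds one utterance per line as an ID followed by space-separated tokens; lmplz is left out so the snippet runs without KenLM installed.

```shell
# Hypothetical sample, assuming "utt_id token token ..." lines separated by single spaces.
lm_train=$(mktemp)
printf '%s\n' 'utt1 turn on the light' 'utt2 play some music' > "$lm_train"

# Current command: -f 2 keeps only field 2, the first token of each transcript.
first_only=$(cut -f 2 -d " " "$lm_train")

# Proposed command: -f 2- keeps field 2 through the end of the line.
all_tokens=$(cut -f 2- -d " " "$lm_train")

echo "--- cut -f 2 ---"
echo "$first_only"
echo "--- cut -f 2- ---"
echo "$all_tokens"

rm -f "$lm_train"
```

With this sample, the first form prints only "turn" and "play", while the second prints both full transcripts without the utterance IDs.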
@rxpwang Thank you for bringing this to my attention. I am currently reviewing the issue and will open a PR if necessary to address it as soon as possible.