-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ngram scorer update #1992
Ngram scorer update #1992
Conversation
qmpzzpmq
commented
Jun 1, 2020
•
edited
edited
- glitch fix
- added an option for full or part ngram scorer for decoding acceleration
- and transform it to local score
- RESULT update, lower test CER from 6.7% to 6.6%
ngram scorer | dev set decoding time(s) |
---|---|
full | 9860 |
part | 4090 |
ngram local score
Codecov Report
@@ Coverage Diff @@
## develop #1992 +/- ##
===========================================
- Coverage 62.24% 61.98% -0.26%
===========================================
Files 258 258
Lines 21907 21997 +90
===========================================
Hits 13635 13635
- Misses 8272 8362 +90
Continue to review full report at Codecov.
|
@sw005320 |
Cool!
You can ignore this. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
espnet/bin/asr_recog.py
Outdated
type=str, | ||
default="full", | ||
choices=("full", "part"), | ||
help="ngram scorer choices", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you explain a bit more about the full and part scorer options?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If the ngram is set as a part scorer, similar to CTC scorer, the ngram scorer only score to topID.
but ngram is set as a full scorer, the ngram score all ID.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh sorry, I meant to say that it’s better to add such descriptions. Probably in the help message?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
good idea, I will add it.
| Sum/Avg | 14326 205341 | 94.1 5.7 0.2 0.1 6.0 41.7 | | ||
exp/train_sp_pytorch_train_pytorch_transformer_lr1.0/decode_test_decode_pytorch_transformer_lm0.7_4gram_0.3/result.txt | ||
| SPKR | # Snt # Wrd | Corr Sub Del Ins Err S.Err | | ||
| Sum/Avg | 7176 104765 | 93.5 6.3 0.2 0.1 6.6 44.6 | | ||
``` | ||
- only e2e model | ||
exp/train_sp_pytorch_train_pytorch_transformer_lr1.0/decode_dev_decode_pytorch_transformer/result.txt | ||
| SPKR | # Snt # Wrd | Corr Sub Del Ins Err S.Err | | ||
| Sum/Avg | 14326 205341 | 93.6 6.2 0.2 0.1 6.5 45.6 | | ||
exp/train_sp_pytorch_train_pytorch_transformer_lr1.0/decode_test_decode_pytorch_transformer/result.txt |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These lines should be in the coding mode. Add "```", i.e.,
exp/train_sp_pytorch_train_pytorch_transformer_lr1.0/decode_dev_decode_pytorch_transformer/result.txt
| SPKR | # Snt # Wrd | Corr Sub Del Ins Err S.Err |
| Sum/Avg | 14326 205341 | 93.6 6.2 0.2 0.1 6.5 45.6 |
exp/train_sp_pytorch_train_pytorch_transformer_lr1.0/decode_test_decode_pytorch_transformer/result.txt
| SPKR | # Snt # Wrd | Corr Sub Del Ins Err S.Err |
| Sum/Avg | 7176 104765 | 92.7 7.1 0.2 0.1 7.4 49.8 |
Sorry to ask many things. |
@sw005320 I will try |
@sw005320 done |
Many thanks! |