New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Zipformer for Common Voice #997
Conversation
BTW it seems that |
There's an ongoing PR about that |
@@ -90,7 +90,7 @@ def compute_fbank_commonvoice_splits(args): | |||
subset = "train" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you please add a RESULTS.md
to document the results, pre-trained models, tensorboard logs?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sure
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
@@ -0,0 +1,105 @@ | |||
# Copyright 2021 Xiaomi Corp. (authors: Fangjun Kuang) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you replace it with a symlink?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also for other files like joiner.py
and model.py
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sure.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks!
Data Preparation
The
prepare.sh
prepares the English (en) dataset of version 13.0 by default.Result
To reproduce the above result, use the following commands for training:
and the following commands for decoding:
Pretrained model is available at
https://huggingface.co/yfyeung/icefall-asr-cv-corpus-13.0-2023-03-09-en-pruned-transducer-stateless7-2023-04-17
The tensorboard log for training is available at
https://tensorboard.dev/experiment/j4pJQty6RMOkMJtRySREKw/