New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Conv2dSubsampling1 module and test it in AphasiaBank ASR recipe #4892
Conversation
Codecov Report
@@ Coverage Diff @@
## master #4892 +/- ##
==========================================
- Coverage 76.61% 76.57% -0.04%
==========================================
Files 603 603
Lines 53677 53737 +60
==========================================
+ Hits 41122 41151 +29
- Misses 12555 12586 +31
Flags with carried forward coverage won't be shown. Click here to find out more.
📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Better results!
Looks good to me in general!
Just one concern about the name of the new module, Conv2dSubsampling1
doesn't do subsampling actually.
Oh right, what's your suggestion? Should we just call it |
egs2/aphasiabank/asr1/README.md
Outdated
@@ -9,6 +9,24 @@ | |||
- Git hash: `39c1ec0509904f16ac36d25efc971e2a94ff781f` | |||
- Commit date: `Wed Dec 21 12:50:18 2022 -0500` | |||
|
|||
## asr_train_asr_ebranchformer_small_wavlm_large1 | |||
|
|||
- [train_asr_ebranchformer_small_wavlm_large.yaml](conf/tuning/train_asr_ebranchformer_small_wavlm_large.yaml) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you add a short note here to explain the difference? E.g., this config uses xxx as the input layer which does not perform downsampling.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Are this config name and path correct? It looks same as the previous one.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks! Fixed in 4734d75
I think |
Thanks a lot! |
Conv2dSubsampling1
module is similar toConv2dSubsampling
except the subsampling ratio is 1:1.AphasiaBank English ASR experiments using E-Branchformer+WavLM showed a CER decrease from 17.7 to 17.0 by switching from
Conv2dSubsampling2
toConv2dSubsampling1
.The user can specify
input_layer: conv2d1
in the encoder configuration to use this module.