single language track setups #4895

DanBerrebbi · 2023-01-30T16:59:16Z

I changed some single language langs :
I removed pol because similar to rus and nob because similar to swe,
I added French because there was no roman language(french, spanish, italian, portuguese ...) and added Swahili because there was no African language.
For dataset selections, it is summarized on the last page of https://docs.google.com/document/d/1sb8SyDjcMf7FDiZHH8wVcZ0EADtXdNBF3LpA9Cu0I1k/edit

For test sets format, I think that it is good to keep only one test set per language with all the datasets of this lang. This way it is an easy decoding process and then WE can split it the decoded file to have scores per dataset and so compute metrics for domain shifts ... . So we have flexibility for scoring and the user has a simple process.

Points to be discussed :

lang choices
VoxPopuli not working
Should we use speed pert ? In my opinion no

…mSUPERB

ftshijt

Looks good to me. Please follow our discussion and fix the CI issues. Let's accelerate the process~ Thanks @DanBerrebbi

egs2/msuperb/asr1/local/single_lang_data_prep.py

codecov · 2023-01-31T05:21:08Z

Codecov Report

Merging #4895 (2bf7dd2) into master (e37ee27) will increase coverage by 3.48%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master    #4895      +/-   ##
==========================================
+ Coverage   73.10%   76.58%   +3.48%     
==========================================
  Files         603      603              
  Lines       53709    53737      +28     
==========================================
+ Hits        39264    41155    +1891     
+ Misses      14445    12582    -1863

Flag	Coverage Δ
test_integration_espnet1	`66.33% <ø> (ø)`
test_integration_espnet2	`47.60% <ø> (ø)`
test_python	`66.45% <ø> (+3.55%)`	⬆️
test_utils	`23.35% <ø> (?)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
espnet2/uasr/espnet_model.py	`0.00% <0.00%> (ø)`
espnet/nets/pytorch_backend/e2e_vc_transformer.py	`86.72% <0.00%> (+0.11%)`	⬆️
espnet/nets/pytorch_backend/rnn/attentions.py	`98.12% <0.00%> (+0.13%)`	⬆️
espnet/nets/pytorch_backend/e2e_vc_tacotron2.py	`80.48% <0.00%> (+0.15%)`	⬆️
espnet/nets/chainer_backend/e2e_asr_transformer.py	`69.59% <0.00%> (+0.20%)`	⬆️
espnet/nets/pytorch_backend/lm/seq_rnn.py	`86.88% <0.00%> (+0.21%)`	⬆️
espnet2/bin/asr_transducer_inference.py	`94.04% <0.00%> (+0.39%)`	⬆️
espnet2/svs/espnet_model.py	`6.25% <0.00%> (+0.52%)`	⬆️
espnet2/enh/layers/dnn_beamformer.py	`97.74% <0.00%> (+0.56%)`	⬆️
espnet2/diar/espnet_model.py	`96.29% <0.00%> (+0.61%)`	⬆️
... and 49 more

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

ftshijt · 2023-01-31T14:02:41Z

Many thanks! Looks great to me.

Dan Georges Berrebbi added 2 commits January 30, 2023 11:47

single lang tracks

ff34129

Merge branch 'mSUPERB' of https://github.com/DanBerrebbi/espnet into …

d48f00f

…mSUPERB

mergify bot added the ESPnet2 label Jan 30, 2023

ftshijt reviewed Jan 31, 2023

View reviewed changes

egs2/msuperb/asr1/local/single_lang_data_prep.py Outdated Show resolved Hide resolved

Dan Georges Berrebbi added 3 commits January 30, 2023 23:10

easy to launch for users

56be2dc

easy to launch for users

c5103f9

easy to launch for users

2bf7dd2

sw005320 added ASR Automatic speech recogntion Recipe labels Jan 31, 2023

sw005320 added this to the v.202301 milestone Jan 31, 2023

ftshijt merged commit 6a83c97 into espnet:master Jan 31, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

single language track setups #4895

single language track setups #4895

DanBerrebbi commented Jan 30, 2023

ftshijt left a comment

codecov bot commented Jan 31, 2023 •

edited

ftshijt commented Jan 31, 2023

single language track setups #4895

single language track setups #4895

Conversation

DanBerrebbi commented Jan 30, 2023

ftshijt left a comment

Choose a reason for hiding this comment

codecov bot commented Jan 31, 2023 • edited

Codecov Report

ftshijt commented Jan 31, 2023

codecov bot commented Jan 31, 2023 •

edited