Closed CHiME-7 DASR adding evaluation inference + adding support to use diarization baseline "pre-computed" JSONs (new PR) #5228

popcornell · 2023-06-12T21:50:41Z

New PR replacing #5183, which had problems caused by an error when applying black.
@simpleoier I opened this PR and closed the other one.

codecov · 2023-06-12T22:25:34Z

Codecov Report

Merging #5228 (a959fce) into master (096e2bb) will decrease coverage by 0.01%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master    #5228      +/-   ##
==========================================
- Coverage   74.43%   74.43%   -0.01%     
==========================================
  Files         642      642              
  Lines       57611    57605       -6     
==========================================
- Hits        42885    42880       -5     
+ Misses      14726    14725       -1

Flag                      Coverage Δ
test_integration_espnet1  66.28% <ø> (ø)
test_integration_espnet2  47.52% <ø> (ø)
test_python               65.14% <ø> (-0.01%) ⬇️
test_utils                23.28% <ø> (ø)

Flags with carried forward coverage won't be shown.

see 5 files with indirect coverage changes


popcornell · 2023-06-14T10:48:40Z

@simpleoier can you take a look? It is important for the challenge that this gets merged ASAP.
Also, I am now modifying only challenge-related files, so it should be good to go.

simpleoier

Sorry for the late response. Please ping me on Slack.

simpleoier · 2023-06-14T18:06:21Z

egs2/chime7_task1/asr1/run.sh

+  asr_tt_set="kaldi/chime6/dev/gss kaldi/dipco/dev/gss/ kaldi/mixer6/dev/gss/"
+elif
+  [ $decode_only == "eval" ]; then
+  # apply gss only on dev


The comment should be updated to say eval, right?
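
For reference, the dev/eval branch discussed above can be sketched as a small function. This is only an illustration, not the code in run.sh: the helper name `select_asr_tt_set` is hypothetical, and the eval paths are an assumption obtained by replacing `dev` with `eval` in the dev paths from the diff. The variable is quoted so that an empty `decode_only` does not break the `[ ... ]` test.

```bash
#!/usr/bin/env bash
# Hypothetical helper mirroring the dev/eval branch in the diff above.
# Assumption: eval sets follow the same layout as the dev sets.
select_asr_tt_set() {
  local decode_only="$1"
  if [ "${decode_only}" = "dev" ]; then
    # apply gss only on dev
    echo "kaldi/chime6/dev/gss kaldi/dipco/dev/gss kaldi/mixer6/dev/gss"
  elif [ "${decode_only}" = "eval" ]; then
    # apply gss only on eval
    echo "kaldi/chime6/eval/gss kaldi/dipco/eval/gss kaldi/mixer6/eval/gss"
  fi
  # Any other value (including empty) prints nothing.
}
```

Calling `select_asr_tt_set eval` prints the three eval paths, while an empty argument prints nothing, leaving the caller free to fall back to the full training pipeline.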

simpleoier · 2023-06-14T18:19:26Z

egs2/chime7_task1/asr1/run.sh

@@ -80,9 +80,15 @@ asr_batch_size=$(calc_int 128*$ngpu) # reduce 128 bsz if you get OOMs errors
 asr_max_lr=$(calc_float $ngpu/10000.0)
 asr_warmup=$(calc_int 40000.0/$ngpu)

-if [ $decode_only -eq 1 ]; then
+if [ $decode_only == "dev" ]; then


Will you add a check if decode_only is neither dev nor eval?
I feel decode_only is a bit confusing. How about using skip_train (default: false) to control whether to process the training set, and test_sets to let users define the gss and asr_tt sets? For example:

test_sets="chime6_dev dipco_dev mixer6_dev"
gss_dsets+="${test_sets}"
test_sets_list=(${test_sets// / })
asr_tt_sets=
for dset in ${test_sets_list[@]}; do
  asr_tt_sets+=$(echo "kaldi/${dset}/gss " | tr "_" "/")
done

This is nice, but it would require too many changes to the codebase at this stage.

yeah, indeed. We may not need to update this at the moment.
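
For anyone revisiting this suggestion later, here is a minimal self-contained sketch of the proposed mapping. It is not part of the merged recipe: `skip_train` and `test_sets` are the option names proposed in the review and do not exist in run.sh, and the sketch uses bash parameter expansion in place of the `echo | tr` pipeline.

```bash
#!/usr/bin/env bash
# Hypothetical options proposed in the review; the merged recipe keeps --decode-only.
skip_train=false
test_sets="chime6_dev dipco_dev mixer6_dev"

if [ "${skip_train}" = true ]; then
  echo "Skipping training-set preparation"
fi

# Derive the GSS and ASR test-set lists from test_sets.
gss_dsets=""
asr_tt_sets=""
for dset in ${test_sets}; do
  gss_dsets+="${dset} "
  # Replace every "_" with "/": chime6_dev becomes kaldi/chime6/dev/gss
  asr_tt_sets+="kaldi/${dset//_//}/gss "
done

# Drop the trailing space left by the loop.
gss_dsets="${gss_dsets% }"
asr_tt_sets="${asr_tt_sets% }"

echo "gss_dsets=${gss_dsets}"
echo "asr_tt_sets=${asr_tt_sets}"
```

With this layout, switching to the eval sets would only require passing a different `test_sets` value, which is the main appeal of the proposal.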

simpleoier · 2023-06-14T18:21:08Z

egs2/chime7_task1/asr1/local/data.sh

@@ -26,7 +26,7 @@ background_snrs="20:10:15:5:0"
 gss_dsets=$(echo $gss_dsets | tr "," " ") # split by commas


-if [ $decode_only == 1 ]; then
+if [ -n "$decode_only" ]; then


We can use ${skip_train} and ${test_sets} instead of decode_only.

As before, we cannot change the recipe extensively at this point. I added the check for the argument.
Note that this would also require changing both README.md files and notifying participants.
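
The check committed in this PR is not shown in the thread, so the following is only a sketch of what such a validation could look like. The function name `check_decode_only` is hypothetical, and it assumes that an empty value means "run the full recipe including training", as the `[ -n "$decode_only" ]` test in data.sh suggests.

```bash
#!/usr/bin/env bash
# Hypothetical validation helper; the check actually committed in the PR may differ.
# Assumption: an empty decode_only means the full recipe (training included) is run.
check_decode_only() {
  local decode_only="$1"
  case "${decode_only}" in
    ""|dev|eval)
      return 0
      ;;
    *)
      echo "Error: decode_only must be 'dev', 'eval' or empty, got '${decode_only}'" >&2
      return 1
      ;;
  esac
}
```

A recipe could call `check_decode_only "${decode_only}" || exit 1` right after option parsing, so that an old-style invocation such as `--decode-only 1` fails early with a clear message.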

simpleoier · 2023-06-14T18:21:30Z

egs2/chime7_task1/diar_asr1/README.md

 you can run:
 ```bash
 ./run.sh --chime7-root YOUR_PATH_TO_CHiME7_ROOT --stage 2 --ngpu YOUR_NUMBER_OF_GPUs \
 --use-pretrained popcornell/chime7_task1_asr1_baseline \
---decode-only 1 --gss-max-batch-dur 30-360-DEPENDING_ON_GPU_MEM \
+--decode-only dev --gss-max-batch-dur 30-360-DEPENDING_ON_GPU_MEM \


Use skip_train and test_sets instead of decode-only.

popcornell · 2023-06-14T19:41:34Z

I added a check for the decode_only argument, but we cannot make extensive changes at this point.
This is fine the way it is. Making further changes is too much hassle: we would need to retest everything, and it would confuse participants.
We should merge this recipe.

adding changes

aaef96a

mergify bot added ESPnet2 README labels Jun 12, 2023

popcornell mentioned this pull request Jun 12, 2023

CHiME-7 DASR adding evaluation inference + adding support to use diarization baseline "pre-computed" JSONs #5183

Closed

simpleoier reviewed Jun 14, 2023


popcornell added 2 commits June 14, 2023 21:38

added check for decode_only argument

a130763

added quotes

a959fce

simpleoier added the auto-merge label Jun 14, 2023

mergify bot merged commit 2839fb7 into espnet:master Jun 14, 2023
24 of 25 checks passed
