CHiME-7 DASR fixes from participants feedback #4999

popcornell · 2023-03-12T22:57:02Z

These fixes mainly address @kamo-naoyuki's comments.

I have added mixer6 dev ihm.
Re-formatted the Kaldi utterance IDs so that utterances are easier to debug: as suggested, they now all follow {speaker}-{record_id}-{start}_{end}-{mic-channel}.
Other small fixes too.
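To make the utterance-ID pattern above concrete, here is a minimal sketch of how such an ID could be assembled. The helper name `make_utt_id` and the sample values (speaker, session, time stamps, microphone channel) are illustrative assumptions; the recipe's actual padding and time units may differ.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the recipe): compose an utterance ID
# following {speaker}-{record_id}-{start}_{end}-{mic-channel}.
# Assumption: start/end are passed in as already-formatted strings.
make_utt_id() {
  local speaker="$1" record_id="$2" start="$3" end="$4" mic_channel="$5"
  printf '%s-%s-%s_%s-%s\n' \
    "${speaker}" "${record_id}" "${start}" "${end}" "${mic_channel}"
}

# Illustrative values only.
make_utt_id P05 S02 0001234 0001567 U01.CH1
```

Putting the speaker first means a plain lexical sort of the IDs also groups utterances by speaker, which is convenient when inspecting Kaldi-style data directories.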

Added an evaluation script.

I also added the memory figures from @boeddeker to the README.

codecov · 2023-03-13T12:26:48Z

Codecov Report

Merging #4999 (656038f) into master (aeddd8e) will decrease coverage by 0.39%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master    #4999      +/-   ##
==========================================
- Coverage   77.03%   76.65%   -0.39%     
==========================================
  Files         606      606              
  Lines       53755    53721      -34     
==========================================
- Hits        41409    41178     -231     
- Misses      12346    12543     +197

Flag	Coverage Δ
test_integration_espnet1	`66.29% <ø> (ø)`
test_integration_espnet2	`47.96% <ø> (ø)`
test_python	`66.39% <ø> (-0.48%)`	⬇️
test_utils	`23.01% <ø> (-0.27%)`	⬇️

see 21 files with indirect coverage changes

mergify · 2023-03-15T16:26:14Z

This pull request is now in conflict :(

kamo-naoyuki · 2023-03-21T22:52:13Z

egs2/chime7_task1/asr1/README.md

+either the "style" of the baseline GSS ones or the ones belonging to close-talk mics.
+
+To evaluate the new enhanced data, e.g. `data/chime6/dev/my_enhanced`, you can `run ./asr.sh --feats_type raw_copy --skip_train true --test_sets chime6/dev/enhanced`.
+


Sorry, I wrote them really roughly.

Actually, we can't run asr.sh directly, because it requires train_set etc.

I think it's better to create a wrapper script for this purpose, e.g. named eval.sh, that runs asr.sh with the required options.

It's enough to pick one of my ideas 1 and 2.

It would also be good to include an example of how to run eval.sh,

e.g.

utils/copy_data_dir.sh data/kaldi/chime6/dev/gss data/kaldi/chime6/dev/my_enhanced
# rewrite wav.scp
./eval.sh --data_sets "kaldi/chime6/dev/my_enhanced"
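The wrapper idea above could be sketched roughly as follows. This is only a dry-run illustration: `build_eval_cmd`, the positional arguments, and the placeholder train/valid set names are assumptions, and no eval.sh exists in this PR. It prints the asr.sh command it would run rather than executing it.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of an eval.sh-style wrapper around asr.sh.
# It only prints the command (dry run). The train/valid names are
# placeholders, passed because asr.sh requires a train_set etc.
build_eval_cmd() {
  local data_sets="$1"
  local train_set="${2:-placeholder_train}"
  local valid_set="${3:-placeholder_valid}"
  echo ./asr.sh \
    --feats_type raw_copy \
    --skip_train true \
    --train_set "${train_set}" \
    --valid_set "${valid_set}" \
    --test_sets "${data_sets}"
}

# Dry run for the enhanced set discussed in this thread.
build_eval_cmd "kaldi/chime6/dev/my_enhanced"
```

A real wrapper would also need to pass whatever model and config options the baseline uses. Note that `echo` drops quoting, so a space-separated list of test sets would have to be re-quoted before actually running the command.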

I think these should be done in the next PR.

Let me delete the last bit then.

I think the diarization part has priority, so please just keep this comment in mind for now.

I think that the following could actually do it:

run.sh --stage 3 --asr-tt-set kaldi/chime6/dev/gss --decode-only 1 --use-pretrained popcornell/chime7_task1_asr1_baseline --asr-dprep-stage 4

Yeah, I think so. That's also okay, but it seems a little complicated.

Also note that run.sh does not provide an option for --feats_type raw_copy.

Another idea is to copy the dumpdir:

utils/copy_data_dir.sh dump/kaldi/chime6/dev/gss dump/kaldi/chime6/dev/my_enhanced
echo raw > dump/kaldi/chime6/dev/my_enhanced/feats_type
# Rewrite dump/kaldi/chime6/dev/my_enhanced/wav.scp
# --skip_data_prep true --skip_train true --test_sets kaldi/chime6/dev/my_enhanced

So, assuming there is no ffmpeg or sox piping, I could use raw_copy to avoid the copying, right?

Yes, that's right.

popcornell · 2023-03-22T15:28:08Z

Can we merge ?

kamo-naoyuki · 2023-03-22T21:13:28Z

By the way, could you also put the number of utterances for each set at the final state in the README? In particular, I'd like to know the number for the training set, since this recipe is complicated and I wonder whether I generated everything correctly.

kamo-naoyuki · 2023-03-22T21:20:05Z

Please let me know when you think it is ready, then I'll merge quickly.

popcornell · 2023-03-22T22:52:56Z

I have added the details of the number of utterances for each dataset.
I think you can merge

kamo-naoyuki · 2023-03-22T22:57:47Z

Thanks many!

popcornell · 2023-03-22T23:03:59Z

Well, thank you actually !

popcornell · 2023-03-23T10:27:30Z

Tests fail because some link is not reachable

desh2608 · 2023-04-15T13:29:53Z

egs2/chime7_task1/asr1/local/data.sh

-  for dset in chime6 dipco; do
-    lhotse kaldi export -p ${manifests_root}/${dset}/${dset_part}/${dset}-${mic}_recordings_${dset_part}.jsonl.gz  ${manifests_root}/${dset}/${dset_part}/${dset}-${mic}_supervisions_${dset_part}.jsonl.gz data/kaldi/${dset}/${dset_part}/${mic}
+  for dset in chime6 dipco mixer6; do
+    lhotse kaldi export ${manifests_root}/${dset}/${dset_part}/${dset}-${mic}_recordings_${dset_part}.jsonl.gz  ${manifests_root}/${dset}/${dset_part}/${dset}-${mic}_supervisions_${dset_part}.jsonl.gz data/kaldi/${dset}/${dset_part}/${mic}


I had to add the --prefix-spk-id flag to lhotse kaldi export; otherwise the created data dir was not sorted by speaker ID.
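A quick way to see the sorting issue described above is to check whether the speaker column of a utt2spk file is in sorted order. The sketch below is an assumption-laden illustration, not part of the recipe: the helper name `is_sorted_by_spk` and the toy utterance and speaker IDs are made up for the demo.

```shell
#!/usr/bin/env bash
# Hypothetical check: succeed (exit 0) if the speaker column (field 2)
# of a Kaldi-style utt2spk file is in sorted order under the C locale.
is_sorted_by_spk() {
  awk '{print $2}' "$1" | LC_ALL=C sort -c 2>/dev/null
}

# Toy data: without a speaker prefix, sorting by utterance ID can leave
# the speaker column unsorted; with the prefix, both orders agree.
demo_dir="$(mktemp -d)"
printf 'S02_0001 P05\nS02_0002 P01\n' > "${demo_dir}/utt2spk_plain"
printf 'P01-S02_0002 P01\nP05-S02_0001 P05\n' > "${demo_dir}/utt2spk_prefixed"

for f in utt2spk_plain utt2spk_prefixed; do
  if is_sorted_by_spk "${demo_dir}/${f}"; then
    echo "${f}: sorted by speaker"
  else
    echo "${f}: NOT sorted by speaker"
  fi
done
rm -rf "${demo_dir}"
```

In this toy case only the prefixed file passes the check, which mirrors why prefixing utterance IDs with the speaker ID keeps the exported data directory consistently sorted.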

popcornell added 9 commits March 9, 2023 17:07

add common error about sox

dfecbbf

add double quotes for cmd_gss

df4009f

memory consumption figures by boedekker

3097e0a

add quotes on cmd

0e20b29

changed to round

3aa5301

exit if wav-rev not found

0e181cf

implement fixes suggested by naoyuki kamo

d42b9f2

implement fixes suggested by naoyuki kamo

122447e

Merge branch 'master' of https://github.com/espnet/espnet

a2c4ccb

mergify bot added ESPnet2 README labels Mar 12, 2023

popcornell added 7 commits March 13, 2023 00:37

applied black, updated README

9457648

changed error log for selection. now more clear

f24a197

addressed linters errors

f13d49c

added results.

02983f4

added remarks about GPU in shared mode

c3b301b

do not include by default close-talk

d51b3e7

parsing2json for scoring

51fef16

popcornell added 9 commits March 13, 2023 15:52

added uem generation for mixer6 dev

b9ee140

added uem generation for dipco too for consistency

686a4c0

more informative

affaefc

sort to make it consistent

d9d1b6b

results updated

e400953

added RESULTS.md

e7bdb86

Merge branch 'master' into chime7task1

8ba4856

added inference.log

01d9b3d

updated README.md

e5ee10f

mergify bot added the conflicts label Mar 15, 2023

Merge branch 'chime7task1' of https://github.com/popcornell/espnet in…

0d6cc58

…to chime7task1

kamo-naoyuki reviewed Mar 21, 2023

popcornell and others added 7 commits March 21, 2023 23:59

fixing the command to evaluate

8b542c9

need to be string

51a6c58

[pre-commit.ci] auto fixes from pre-commit.com hooks

bacb17b

for more information, see https://pre-commit.ci

Merge remote-tracking branch 'samco/chime7task1' into chime7task1

ede3556

# Conflicts: # egs2/chime7_task1/asr1/README.md

if diarization file has no words use a placeholder

26cdf47

added option for using forced alignment

e7539cf

not hasattr

f60bf51

popcornell added 2 commits March 22, 2023 16:35

Merge branch 'master' into chime7task1

484e8f2

Merge branch 'master' into chime7task1

b1c37ff

popcornell commented Mar 22, 2023

popcornell added 3 commits March 22, 2023 21:24

mentioning that all datasets can be used

7cdcb51

Merge branch 'chime7task1' of https://github.com/popcornell/espnet in…

f3ed9c2

…to chime7task1

implemented naoyuki fix

5f32093

popcornell and others added 2 commits March 22, 2023 23:51

added report on the utterances

8c6019d

[pre-commit.ci] auto fixes from pre-commit.com hooks

1a9f35e

for more information, see https://pre-commit.ci

kamo-naoyuki added the auto-merge Enable auto-merge label Mar 22, 2023

Merge branch 'master' into chime7task1

1cec2c4

popcornell added 2 commits March 23, 2023 12:56

fi missing

abfa62f

Merge branch 'chime7task1' of https://github.com/popcornell/espnet in…

656038f

…to chime7task1

mergify bot merged commit b579d70 into espnet:master Mar 23, 2023
24 checks passed

desh2608 reviewed Apr 15, 2023
