Add ConferencingSpeech 2021 recipe to egs2 #4192

Emrys365 · 2022-03-23T16:00:41Z

ConferencingSpeech 2021 is a far-field multi-channel speech enhancement challenge.

The training data includes several English and Mandarin corpora, while test data are semi-real and real recordings in Mandarin.

This PR adds the enh recipe for task2 (Multi-channel speech enhancement with multiple distributed microphone arrays).

sw005320

some minor comments

sw005320 · 2022-03-23T16:11:09Z

egs2/README.md

@@ -19,6 +19,7 @@ See: https://espnet.github.io/espnet/espnet2_tutorial.html#recipes-using-espnet2
 | chime4                  | The 4th CHiME Speech Separation and Recognition Challenge                               | ASR/Multichannel ASR    | ENG                   | http://spandh.dcs.shef.ac.uk/chime_challenge/chime2016/                                                      |              |
 | cmu_indic               | CMU INDIC                                                                               | TTS                     | 7 languages           | http://festvox.org/cmu_indic/                                                                                |              |
 | commonvoice             | The Mozilla Common Voice                                                                | ASR                     | 13 languages          | https://voice.mozilla.org/datasets                                                                           |              |
+| conferencingspeech21    | Far-field Multi-channel Speech Enhancement Challenge for Video Conferencing (ConferencingSpeech 2021) | SE                     | 2 languages         | https://tea-lab.qq.com/conferencingspeech-2021                                                               |              |


2 languages -> ENG, CMN?

OK. This is clearer.

egs2/conferencingspeech21/enh1/local/data.sh

sw005320 · 2022-03-23T16:18:36Z

@LiChenda, can I ask you to review this PR?

LiChenda · 2022-03-24T10:46:05Z

@sw005320 Sure, I can review this PR.

LiChenda · 2022-03-30T11:58:54Z

egs2/conferencingspeech21/enh1/local/config_from_generated.py

+            path_noise = noise_data[path_noise]
+            out.write(
+                f"{path_clean} {start_time} {path_noise} "
+                f"/path/{args.tag}/{path_rir}.wav {snr} {scale}\n"


Does "/path/" exist on filesystem?

This is a placeholder. We don't actually use this path for simulation, because this Python script is intended for generating the simulation configuration from already generated data.
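For illustration, a line written by the `out.write(...)` call in the diff above can be read back with a small parser. This is only a sketch under assumptions, not code from the PR: the `SimulationEntry` type and both helper names are hypothetical, fields are assumed to be whitespace-separated with no spaces inside paths, and `start_time` is kept as text because its type is not shown in the diff.

```python
from typing import NamedTuple


class SimulationEntry(NamedTuple):
    """One configuration line; field names follow the variables in the diff."""

    path_clean: str
    start_time: str  # kept as text: unit and type are not shown in the diff
    path_noise: str
    path_rir: str
    snr: float
    scale: float


def parse_entry(line: str) -> SimulationEntry:
    """Split one whitespace-separated line into its six fields (hypothetical helper)."""
    fields = line.split()
    if len(fields) != 6:
        raise ValueError(f"expected 6 fields, got {len(fields)}: {line!r}")
    path_clean, start_time, path_noise, path_rir, snr, scale = fields
    return SimulationEntry(
        path_clean, start_time, path_noise, path_rir, float(snr), float(scale)
    )


def uses_placeholder_rir(entry: SimulationEntry) -> bool:
    """True when the RIR path still carries the "/path/" placeholder prefix."""
    return entry.path_rir.startswith("/path/")


# Made-up example line; the file names and numbers are not from the PR.
example = "clean.wav 0.5 noise.wav /path/train/rir_0001.wav 5.0 0.9"
entry = parse_entry(example)
```

A check such as `uses_placeholder_rir(entry)` makes it explicit that the RIR field is not meant to be opened as a real file.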

LiChenda

LGTM!

codecov · 2022-04-02T09:09:57Z

Codecov Report

Merging #4192 (bc486fd) into master (56c1c0d) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master    #4192   +/-   ##
=======================================
  Coverage   80.71%   80.71%           
=======================================
  Files         453      453           
  Lines       39575    39575           
=======================================
  Hits        31944    31944           
  Misses       7631     7631

| Flag | Coverage Δ |
|---|---|
| test_integration_espnet1 | `67.13% <ø> (ø)` |
| test_integration_espnet2 | `50.05% <ø> (ø)` |
| test_python | `67.16% <ø> (ø)` |
| test_utils | `24.45% <ø> (ø)` |

Flags with carried forward coverage won't be shown.

Emrys365 · 2022-04-02T10:59:53Z

~~The error in https://github.com/espnet/espnet/runs/5798659465?check_suite_focus=true#step:8:6228 seems not related to this PR.~~
Now it is all clear.

Emrys365 · 2022-04-11T06:09:33Z

~~The error in https://github.com/espnet/espnet/runs/5962161616?check_suite_focus=true#step:7:762 seems not related to this PR either.~~

# /__w/espnet/espnet/test_utils/test_update_json_sh.bats: line 580: jsondiff: command not found

sw005320 · 2022-04-12T14:34:57Z

Thanks, @Emrys365!
Can you also prepare the README file with its result and upload the model link?

Emrys365 · 2022-04-12T15:10:58Z

> Thanks, @Emrys365! Can you also prepare the README file with its result and upload the model link?

I was not sure whether it is appropriate to upload my local model, because it was trained on a local branch of espnet in which a customized Preprocessor was defined and used in espnet2/train/preprocessor.py.

This customized Preprocessor was intended for on-the-fly data simulation, which cannot be achieved with the current main branch.
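To make "on-the-fly data simulation" concrete, here is a minimal sketch of the general idea: a fresh noisy mixture is drawn each time a clean sample is requested, instead of being read from pre-generated files. It is not the customized Preprocessor mentioned above, whose code is not shown in this PR. The function names are hypothetical, the signals are single-channel lists of floats, the noise is assumed to be at least as long as the clean signal, and room impulse responses are left out entirely.

```python
import math
import random
from typing import List, Sequence, Tuple


def power(signal: Sequence[float]) -> float:
    """Mean squared amplitude of a signal."""
    return sum(v * v for v in signal) / len(signal)


def mix_at_snr(
    clean: Sequence[float], noise: Sequence[float], snr_db: float
) -> List[float]:
    """Add noise to clean, scaled so the clean-to-noise power ratio is snr_db."""
    noise = noise[: len(clean)]  # assumption: noise is at least as long as clean
    p_clean, p_noise = power(clean), power(noise)
    if p_noise == 0.0:
        return list(clean)  # silent noise: nothing to add
    # Solve 10 * log10(p_clean / (gain**2 * p_noise)) == snr_db for gain.
    gain = math.sqrt(p_clean / (p_noise * 10 ** (snr_db / 10)))
    return [c + gain * n for c, n in zip(clean, noise)]


def simulate_on_the_fly(
    clean: Sequence[float],
    noise_pool: Sequence[Sequence[float]],
    snr_range: Tuple[float, float],
    rng: random.Random,
) -> Tuple[List[float], float]:
    """Draw a random noise and SNR on each call, so every epoch sees new mixtures."""
    noise = rng.choice(noise_pool)
    snr_db = rng.uniform(*snr_range)
    return mix_at_snr(clean, noise, snr_db), snr_db
```

Passing a seeded `random.Random` keeps a run reproducible while still varying the mixture from call to call.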

sw005320 · 2022-04-13T00:03:43Z

I see.

Add ConferencingSpeech 2021 recipe

1068f18

Emrys365 added the Recipe, ESPnet2, and SE (Speech enhancement) labels Mar 23, 2022

Update db.sh for ConferencingSpeech

cf796f1

sw005320 added this to the v.0.10.7 milestone Mar 23, 2022

sw005320 reviewed Mar 23, 2022


mergify bot added the README label Mar 23, 2022

LiChenda reviewed Mar 30, 2022


LiChenda approved these changes Mar 31, 2022


Emrys365 added 5 commits March 31, 2022 23:51

Update ConferencingSpeech21 recipe

909e858

Update ConferencingSpeech21 recipe

ec7be4f

Merge branch 'master' of github.com:espnet/espnet into complex_support

a4bc176

Merge branch 'master' of github.com:espnet/espnet into complex_support

9e43edb

Fix symbolic links

9489122

Merge branch 'master' of github.com:espnet/espnet into complex_support

6119e32

Emrys365 mentioned this pull request Apr 10, 2022

Add AISHELL-4 ENH recipe #4249

Merged

kan-bayashi modified the milestones: v.0.10.7, v.202205 Apr 11, 2022

Merge branch 'master' of github.com:espnet/espnet into complex_support

bc486fd

sw005320 merged commit d263372 into espnet:master Apr 12, 2022
