Add Libriheavy small and medium ASR2 recipes #5512

akreal · 2023-10-30T13:14:18Z

What?

ASR2 recipe for the Libriheavy medium subset.

Why?

I'm running experiments with a causal LM and ASR2 on this dataset and would like to have a comparison point without a causal LM.

Codecov Report

Merging #5512 (10c3bb8) into master (0d0ab98) will decrease coverage by 12.34%.
Report is 5 commits behind head on master.
The diff coverage is n/a.

@@             Coverage Diff             @@
##           master    #5512       +/-   ##
===========================================
- Coverage   70.31%   57.97%   -12.34%     
===========================================
  Files         711      710        -1     
  Lines       65757    65670       -87     
===========================================
- Hits        46237    38074     -8163     
- Misses      19520    27596     +8076

| Flag | Coverage Δ | |
|---|---|---|
| test_integration_espnet2 | `48.61% <ø> (ø)` | |
| test_python_espnet1 | `?` | |
| test_python_espnet2 | `51.36% <ø> (+<0.01%)` | ⬆️ |
| test_utils | `22.19% <ø> (ø)` | |

Flags with carried forward coverage won't be shown.

see 130 files with indirect coverage changes

akreal · 2023-11-03T13:31:01Z

I also added the recipe for the small subset. It should be ready for review now.

The official baseline seems to do case-sensitive scoring on the non-normalized text, so I added `-s` to `score_opts` in these recipes.

sclite skips all lines starting with `**` and prints a warning for all lines starting with `*` (it is hardcoded as a comment character). I've added removal of leading `*` characters from texts in the scoring stage of `asr2.sh`. I've manually verified that it works as intended on the dev set, and it should have no effect on other recipes.
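For illustration only, the leading-asterisk cleanup described above could look roughly like the following Python sketch. It is not the code added to `asr2.sh`, which may be implemented differently (for example with a shell one-liner). The helper names, the Kaldi-style `<utt_id> <transcript>` line layout, and the choice to also drop whitespace right after the asterisks are all assumptions made for this example.

```python
import re

# Hypothetical pattern: one or more asterisks at the very start of a transcript,
# plus any whitespace directly after them (an assumption of this sketch).
_LEADING_STARS = re.compile(r"^\*+\s*")


def strip_leading_asterisks(text: str) -> str:
    """Remove asterisks at the start of a transcript; keep all other asterisks."""
    return _LEADING_STARS.sub("", text)


def clean_text_line(line: str) -> str:
    """Clean one '<utt_id> <transcript>' line (assumed layout), keeping the ID as-is."""
    utt_id, _, text = line.rstrip("\n").partition(" ")
    return f"{utt_id} {strip_leading_asterisks(text)}"
```

Only asterisks at the start of the transcript are touched, so a recipe whose texts never begin with `*` would pass through such a filter unchanged.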

sw005320

Thanks, @akreal!

akreal marked this pull request as draft October 30, 2023 13:14

mergify bot added ESPnet2 README labels Oct 30, 2023

ftshijt added Recipe ASR Automatic speech recogntion labels Oct 30, 2023

ftshijt added this to the v.202312 milestone Oct 30, 2023

akreal force-pushed the libriheavy-medium-asr2 branch from 258b47c to 86ed3a7 Compare October 30, 2023 14:59

Add Libriheavy small and medium ASR2 recipes

fee35fd

akreal force-pushed the libriheavy-medium-asr2 branch from 86ed3a7 to fee35fd Compare November 3, 2023 13:16

akreal changed the title ~~[WIP] Add Libriheavy medium ASR2 recipe~~ [WIP] Add Libriheavy small and medium ASR2 recipes Nov 3, 2023

akreal changed the title ~~[WIP] Add Libriheavy small and medium ASR2 recipes~~ Add Libriheavy small and medium ASR2 recipes Nov 3, 2023

akreal marked this pull request as ready for review November 3, 2023 13:18

akreal and others added 2 commits November 6, 2023 08:09

Merge branch 'master' into libriheavy-medium-asr2

51e459c

Merge branch 'master' into libriheavy-medium-asr2

10c3bb8

sw005320 approved these changes Nov 8, 2023

sw005320 added the auto-merge Enable auto-merge label Nov 8, 2023

mergify bot merged commit d610dbc into espnet:master Nov 8, 2023
25 checks passed
