Libri100 recipe for standalone Transducer #4698

b-flo · 2022-10-07T12:53:35Z

Add scripts + configs for streaming and offline Transducer. The second model is training, I'll add results to the README and update both models to hf.
~~Also, I added some minor fixes to the PR, it won't impact previous versions in any way.~~

I made a separate directory for asr_transducer1 here because we need some separation between the two versions of Transducer in ESPnet2 but I don't really have an opinion on the design. It makes sense because we defined this version as a new "task" but we could also keep asr1 and use some prefixes for the standalone version.

codecov · 2022-10-07T13:17:37Z

Codecov Report

Merging #4698 (ce88312) into master (3297e10) will increase coverage by 0.63%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master    #4698      +/-   ##
==========================================
+ Coverage   74.36%   74.99%   +0.63%     
==========================================
  Files         654      655       +1     
  Lines       58347    58546     +199     
==========================================
+ Hits        43391    43909     +518     
+ Misses      14956    14637     -319

Flag	Coverage Δ
test_integration_espnet1	`66.24% <ø> (-0.05%)`	⬇️
test_integration_espnet2	`47.65% <ø> (+0.70%)`	⬆️
test_python	`65.28% <ø> (+0.09%)`	⬆️
test_utils	`23.27% <ø> (-0.01%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

see 15 files with indirect coverage changes

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

b-flo · 2022-10-16T09:10:24Z

@ftshijt @sw005320 If this design is okay, the PR can be merged. Otherwise I'll make the requested changes.

I'll update HF links in another PR, I'm currently running some additional experiments.

b-flo · 2022-10-16T09:12:34Z

egs2/librispeech_100/asr_transducer1/run.sh

+    --ngpu 1 \
+    --nj 32 \
+    --inference_nj 32 \
+    --nbpe 500 \


BPE size is 500 vs 5000 for the CTC-Att baseline model. I guess we can further improve results with bigger BPE size but I don't have enough resource for that.

sw005320 · 2022-10-16T12:55:27Z

@ftshijt @sw005320 If this design is okay, the PR can be merged. Otherwise I'll make the requested changes.

I'll update HF links in another PR, I'm currently running some additional experiments.

Sorry, I'll have some reviews after the ICASSP deadline. Can you wait for a week or so?

mergify · 2022-10-21T11:15:59Z

This pull request is now in conflict :(

egs2/librispeech_100/asr_transducer1/run.sh

b-flo · 2023-02-10T14:37:16Z

It's more stable on my side so I'll start training different model architectures and settings before opening the corresponding PR. I'll use this branch (and message) to keep track of the best models.

Model	Mode	Num. params.	BPE	dev_clean (%WER)	dev_other (%WER)	test_clean (%WER)	test_other (%WER)
E-Branchformer/Transformer CTC-Att	offline	38.47M	5000	6.1	16.7	6.3	17.0

Conformer/RNN (old)	offline	30.56M	500	5.9	17.6	6.4	17.9
Conformer/RNN (new)	offline	30.53M	500	5.8	16.9	6.0	17.0
E-Branchformer/RNN (new)	offline	29.12M	500	5.7	16.8	6.0	17.1
E-Branchformer/MEGA (tmp)	offline	40.66M	500	5.7	16.5	6.0	16.7

for more information, see https://pre-commit.ci

…ansducer_v1.1

b-flo · 2023-06-22T08:37:18Z

@sw005320 If we are okay with the design (i.e.: adding a asr_transducer1 for standalone Transducer recipes), I think the PR can be merged.

Pre-trained model links will be added after I finish experiments with RWKV and I need to re-train the models anyways. At least, users have some examples for the standalone version and Librispeech-100 for now (which was asked before).

b-flo added 6 commits October 6, 2022 14:55

add zero pad for convolution (ref: icefall)

a623908

fix aux. lm loss, reporter variable and doc

c4ef9cf

parser -> group for add_argument

4d045df

add libri-100 asr transducer task

df6a0de

typo

2db74a9

add streaming transducer recipe

8a4c5ea

mergify bot added ESPnet2 README labels Oct 7, 2022

b-flo added this to the v.202211 milestone Oct 7, 2022

b-flo added Recipe RNNT (RNN) transducer related issue and removed README labels Oct 7, 2022

mergify bot added the README label Oct 7, 2022

b-flo mentioned this pull request Oct 7, 2022

asr.py vs asr_transducer.py #4691

Closed

add offline model results

508f339

b-flo commented Oct 16, 2022

View reviewed changes

b-flo mentioned this pull request Oct 20, 2022

Small changes for standalone Transducer #4722

Merged

mergify bot added the conflicts label Oct 21, 2022

b-flo added 2 commits October 21, 2022 11:18

fix conflict

9f4f12d

Merge branch 'master' into std_transducer_v1.1

ce53063

mergify bot removed the conflicts label Oct 21, 2022

add sub factor param for new version

359ac2d

sw005320 reviewed Nov 30, 2022

View reviewed changes

egs2/librispeech_100/asr_transducer1/run.sh Outdated Show resolved Hide resolved

kan-bayashi modified the milestones: v.202211, v.202301 Dec 11, 2022

kan-bayashi removed this from the v.202301 milestone Feb 1, 2023

kan-bayashi added this to the v.202303 milestone Feb 1, 2023

b-flo added 2 commits February 10, 2023 14:03

Merge branch 'master' into std_transducer_v1.1

ec7f361

update offline model and run script

9e5005d

b-flo added 2 commits February 14, 2023 09:46

add ebranchformer config and results

17af213

rework README

aa033d1

b-flo mentioned this pull request Apr 25, 2023

Standalone Transducer v1.1 #5140

Merged

kan-bayashi modified the milestones: v.202303, v.202307 May 1, 2023

b-flo and others added 5 commits June 22, 2023 08:05

Merge branch 'master' into std_transducer_v1.1

6048cbb

add MEGA config and results

29c5487

[pre-commit.ci] auto fixes from pre-commit.com hooks

9890efe

for more information, see https://pre-commit.ci

merge some lines for readibility

b541bd1

Merge remote-tracking branch 'origin/std_transducer_v1.1' into std_tr…

ce88312

…ansducer_v1.1

kan-bayashi modified the milestones: v.202307, v.202312 Aug 3, 2023

kan-bayashi modified the milestones: v.202310, v.202312 Oct 25, 2023

kan-bayashi modified the milestones: v.202312, v.202405 Feb 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Libri100 recipe for standalone Transducer #4698

Libri100 recipe for standalone Transducer #4698

b-flo commented Oct 7, 2022 •

edited

codecov bot commented Oct 7, 2022 •

edited

b-flo commented Oct 16, 2022

b-flo Oct 16, 2022

sw005320 commented Oct 16, 2022

mergify bot commented Oct 21, 2022

b-flo commented Feb 10, 2023 •

edited

b-flo commented Jun 22, 2023 •

edited

Libri100 recipe for standalone Transducer #4698

Are you sure you want to change the base?

Libri100 recipe for standalone Transducer #4698

Conversation

b-flo commented Oct 7, 2022 • edited

codecov bot commented Oct 7, 2022 • edited

Codecov Report

b-flo commented Oct 16, 2022

b-flo Oct 16, 2022

Choose a reason for hiding this comment

sw005320 commented Oct 16, 2022

mergify bot commented Oct 21, 2022

b-flo commented Feb 10, 2023 • edited

b-flo commented Jun 22, 2023 • edited

b-flo commented Oct 7, 2022 •

edited

codecov bot commented Oct 7, 2022 •

edited

b-flo commented Feb 10, 2023 •

edited

b-flo commented Jun 22, 2023 •

edited