rnn_transducer #231

arthur-compton · 2021-10-28T09:50:56Z

I've been running the examples in the "conformer/" and in the "rnn_transducer/" directories and comparing the models with those already provided on drive.

The conformer training works as expected, and the results of the model I trained are almost identical to those of the pretrained model (I am using the three LibriSpeech training sets, 960 hours in total).

The training in the rnn_transducer example, however, doesn't converge to anything usable. I've tried both the configuration in the codebase and the slightly different configuration on drive. In both cases the loss decreases only slightly during training, far too little for the final model to have learnt much.

My guess is that something is broken in the rnn_transducer example. Has anyone tried it out with a recent version of the code? I've tried versions 1.0.3 (TF2.6), 1.0.1, and 1.0.0 (TF2.4.1); in all cases the training doesn't converge.

Any suggestion is very much appreciated!

maxeduc · 2022-09-30T03:11:45Z

Just tried this repo, and agreed. It seems either the wrong config file was uploaded or there's a regression in the repo (or TensorFlow). Any tips on what's happening here would be greatly appreciated.

yiqiaoc11 · 2023-01-21T23:34:15Z

Same issue here. Adam with warmstep-40000 didn't learn anything. @usimarit, could you take a look at the code?

nglehuy added the "need to reproduce" label (Need a code or time to reproduce the issue) Sep 2, 2022

yiqiaoc11 mentioned this issue Feb 8, 2023

training problem with rnn_transducer #279

Open
