
Pre-trained models tracker #85

Closed · 3 tasks done
SeanNaren opened this issue Jun 12, 2017 · 24 comments

@SeanNaren
Owner

SeanNaren commented Jun 12, 2017

We need to train a DeepSpeech model on each of the datasets provided. The overall architecture is captured by this command:

python train.py  --rnn_type gru --hidden_size 800 --hidden_layers 5 --checkpoint --visdom --train_manifest /path/to/train_manifest.csv --val_manifest /path/to/val_manifest.csv --epochs 100 --num_workers $(nproc) --cuda

In the above command, replace the manifest paths with the correct paths for the dataset. A few notes:

  • No noise injection or other augmentations for the pre-trained models
  • Train until convergence (hopefully you should get a nice smooth training curve!)
  • For smaller datasets, you may need to reduce the learning rate annealing by adding the --learning_anneal flag and setting it to a smaller value, like 1.01 (see the example command below). For larger datasets, the default is fine (up to around 4.5k hours from internal testing on the deepspeech.torch version)
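
For example, a smaller-dataset run (e.g. AN4) might pair the command above with the annealing flag appended; the flag name is assumed to follow the underscore convention of the other options, and the manifest paths are placeholders:

python train.py --rnn_type gru --hidden_size 800 --hidden_layers 5 --learning_anneal 1.01 --checkpoint --train_manifest /path/to/train_manifest.csv --val_manifest /path/to/val_manifest.csv --epochs 100 --num_workers $(nproc) --cuda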

A release of the DeepSpeech package will be cut containing the models, and a reference to the latest release will be added to the README so the latest models are easy to find!

Progress tracker for datasets:

  • AN4
  • TEDLium
  • LibriSpeech

Let me know if you plan on working on running any of these, and I'll update the ticket with details!

@ryanleary
Collaborator

I was planning on adding SortaGrad back in before the training, if that seems reasonable. It definitely seems to help with convergence.

I'll take on an4 and LibriSpeech.
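
(For context, SortaGrad, from the Deep Speech 2 paper, simply presents utterances in order of increasing duration for the first epoch and shuffles afterwards. A minimal sketch of the idea, not this repo's sampler code, assuming each sample carries a duration field:)

```python
import random

# Minimal sketch of the SortaGrad ordering, not the repo's sampler:
# train on utterances sorted by duration for the first epoch, then shuffle.
# `samples` is assumed to be a list of (audio_path, transcript_path, duration).
def epoch_order(samples, epoch):
    if epoch == 0:
        return sorted(samples, key=lambda s: s[2])  # shortest utterances first
    shuffled = list(samples)
    random.shuffle(shuffled)
    return shuffled
```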

@SeanNaren
Owner Author

SeanNaren commented Jun 12, 2017

@ryanleary definitely, does #83 work well for you? Not sure if you had time to test it; it seems like a better solution regarding memory usage! I've got some time now to test, so I'll report back.

EDIT: pulled the branch in now; it does a fair job of keeping memory usage low by bucketing similarly sized utterances and sampling from those buckets instead! Will update the master branch as soon as the changes are addressed.

@ryanleary
Collaborator

This model is kind of large for an4. Having difficulty getting it to converge. Were you able to get it to converge in the past?

@SeanNaren
Owner Author

@ryanleary I'll check once I'm back home, but I have gotten the full architecture to converge (albeit not the best score possible).

@ryanleary
Collaborator

ryanleary commented Jun 12, 2017

That was, presumably, with the torch version that had batch norm though, right?


@SeanNaren
Owner Author

That's true... I'll try this as soon as I can!

Just FYI, the easiest place to contact me directly will probably be the PyTorch Slack channel... send me a direct message there if you need me ASAP! If you need an invite, feel free to email me at my GitHub email address.

@ryanleary
Collaborator

Kicked off a 1000 hr libri training. Will know later tonight if convergence looks promising. Will probably take at least a few days to converge since I only have 2x Titan Xs for it.

@SeanNaren
Owner Author

An update on progress: I'm currently blocking this on an updated architecture that is better suited to production environments and the size of the datasets we're dealing with; the current architecture is slightly too large!

@SeanNaren
Owner Author

SeanNaren commented Jun 12, 2017

Currently sitting at around 40M parameters with these settings:

python benchmark.py --rnn_type gru --hidden_size 800 --hidden_layers 5

@SeanNaren
Owner Author

I've updated the params after speaking to @ryanleary! Will try to get training going on the TEDLium corpus.

@SiddGururani
Contributor

If it's possible, could the people training the models also plot the loss on the validation set? I'm curious to see if it's just me that's getting this negative correlation between the WER and the validation loss (issue #78).

@ryanleary
Collaborator

ryanleary commented Jun 17, 2017

The AN4 model is complete. LibriSpeech is still in progress. Below are the current evaluations:

| Corpus  | Test Set   | Network  | WER    | CER    |
|---------|------------|----------|--------|--------|
| an4     | an4-test   | 5x800gru | 10.521 | 4.772  |
| libri1k | libri-val  | 5x800gru | 20.758 | 7.787  |
| libri1k | libri-test | 5x800gru | 22.088 | 8.194  |
| libri1k | test-clean | 5x800gru | 11.546 | 3.538  |
| libri1k | test-other | 5x800gru | 31.813 | 12.483 |

@SiddGururani
Contributor

@ryanleary Any updates on the librispeech training?

@ryanleary
Collaborator

ryanleary commented Jun 24, 2017

I stopped the training after 44 epochs due to diminishing returns. I think the training may have slowed due to #100. Will probably retrain at some point in the future, but the model is good enough for now.

| Corpus  | Test Set   | Network  | WER    | CER    |
|---------|------------|----------|--------|--------|
| libri1k | libri-val  | 5x800gru | 20.512 | 7.687  |
| libri1k | libri-test | 5x800gru | 21.686 | 8.064  |
| libri1k | test-clean | 5x800gru | 11.203 | 3.362  |
| libri1k | test-other | 5x800gru | 31.312 | 12.286 |

@SeanNaren
Owner Author

@ryanleary thanks! What are libri-val/libri-test? I'm not sure which test sets these are.

@ryanleary
Collaborator

ryanleary commented Jun 24, 2017

libri-val is dev-clean.tar.gz and dev-other.tar.gz combined.
libri-test is test-clean.tar.gz and test-other.tar.gz combined.
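
(Assuming the manifests are the usual CSV lists with one utterance per line, the combined validation manifest could be built with something like the line below; the filenames are just placeholders:)

cat dev-clean_manifest.csv dev-other_manifest.csv > libri_val_manifest.csv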

@SeanNaren SeanNaren removed the Blocked label Jun 24, 2017
@SeanNaren SeanNaren self-assigned this Jun 24, 2017
@slbinilkumar

| Corpus  | Test Set   | Network  | WER    | CER    |
|---------|------------|----------|--------|--------|
| an4     | an4-test   | 5x800gru | 10.521 | 4.772  |
| libri1k | libri-val  | 5x800gru | 20.758 | 7.787  |
| libri1k | libri-test | 5x800gru | 22.088 | 8.194  |
| libri1k | test-clean | 5x800gru | 11.546 | 3.538  |
| libri1k | test-other | 5x800gru | 31.813 | 12.483 |

For libri1k test-clean, what is your validation set, and what training parameters did you use to get this result (libri1k | test-clean | 5x800gru | 11.546 | 3.538)? How long did it take to converge, and how many epochs did you train for to achieve this?

@ryanleary
Collaborator

They're 5 layers of 800-dim bidirectional GRU RNNs. Everything else is more or less default. The libri1k model trained for 44 epochs, which took several days on 2x Titan X GPUs.

The combined dev-clean and dev-other was used for validation, and the results of evaluation on that dev set are listed as libri-val. test-clean is the 'test set, "clean" speech' from http://www.openslr.org/12/.
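
(For illustration only, not the repo's actual model code: the recurrent stack described above is roughly the following. The input feature size of 161 is an assumption, and the real model also has a convolutional front end, batch norm, and a fully connected output layer over the character vocabulary.)

```python
import torch
import torch.nn as nn

# Rough sketch of "5 layers of 800-dim bidirectional GRU"; not the repo's
# model definition. Input feature dimension (161) is assumed.
rnn = nn.GRU(input_size=161, hidden_size=800, num_layers=5, bidirectional=True)

x = torch.randn(200, 8, 161)  # dummy (time, batch, features) input
out, h = rnn(x)
print(out.shape)              # torch.Size([200, 8, 1600]) -- 2 * 800 from bidirectionality
```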

@SeanNaren
Owner Author

Coming to the end of training a model on TEDLium; then I'll start on VoxForge!

@SeanNaren
Owner Author

I've removed VoxForge from the pre-trained nets, since it doesn't have its own validation dataset; I propose we do a combined training run over all the open-sourced datasets instead.

@ryanleary any chance you could send me your trained models on Slack so I can verify that they work on the master branch? Then I'll create a release to put the pre-trained networks up.

@ybzhou

ybzhou commented Aug 23, 2017

@ryanleary were the results obtained with the greedy decoder?

@daksunt

daksunt commented Aug 24, 2017

Hi, when will the pre-trained networks be released?

@ryanleary
Collaborator

@ybzhou yes.
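
(For context, greedy decoding just takes the argmax label at every frame, collapses repeats, and drops CTC blanks. A minimal sketch with an assumed label set, not the decoder code used for the numbers above:)

```python
import torch

# Minimal sketch of greedy (best-path) CTC decoding; the label set is assumed.
def greedy_decode(log_probs, labels, blank=0):
    """log_probs: (time, num_classes) tensor of per-frame log probabilities."""
    best_path = torch.argmax(log_probs, dim=1).tolist()
    decoded, prev = [], blank
    for idx in best_path:
        # Collapse repeated symbols, then drop CTC blanks.
        if idx != prev and idx != blank:
            decoded.append(labels[idx])
        prev = idx
    return "".join(decoded)

labels = "_'ABCDEFGHIJKLMNOPQRSTUVWXYZ "  # '_' (index 0) acts as the blank here
log_probs = torch.randn(50, len(labels)).log_softmax(dim=1)
print(greedy_decode(log_probs, labels))
```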

@SeanNaren
Owner Author

Models are now provided under releases here. Hopefully a production model based on the formatted data sources in this library will be training shortly.

Huge thanks to @ryanleary for training most of these models :)
