
Pre-trained models #59

Closed
danielhauagge opened this issue May 12, 2017 · 13 comments

@danielhauagge

Any pre-trained models available?

@SeanNaren changed the title from "Pre trained model" to "Pre-trained models" on May 15, 2017
@SeanNaren reopened this on May 15, 2017
@SeanNaren
Owner

Currently not available, will get to this as soon as I can :)

@ryanleary
Collaborator

I have a decent-ish Libri model I can upload somewhere if you'd like.

@SeanNaren
Owner

@ryanleary that would be awesome! Has it been trained with the latest checkpoint system? That would make integration easier.

@ryanleary
Collaborator

ryanleary commented May 18, 2017

Yes. It's definitely a preliminary model, but somewhat functional. Trained for 11 epochs on the 1,000-hour LibriSpeech set with augmentation.

Model name:          deepspeech_11.pth.tar
DeepSpeech version:  0.0.1

Recurrent Neural Network Properties
  RNN Type:          lstm
  RNN Layers:        4
  RNN Size:          400
  Classes:           29

Model Features
  Labels:            _'ABCDEFGHIJKLMNOPQRSTUVWXYZ
  Sample Rate:       16000
  Window Type:       hamming
  Window Size:       0.02
  Window Stride:     0.01

Training Information
  Epochs:            11
  Min Loss:          15.670
  Min CER:           8.914
  Min WER:           23.752
Test Set    WER      CER
clean       14.295    4.391
noisy       35.354   14.285
combined    25.302    9.562
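
As an aside for anyone reproducing this: the feature parameters above (16 kHz audio, 20 ms Hamming windows, 10 ms stride) fully specify the spectrogram front end. A minimal sketch of that preprocessing in PyTorch, as an illustration of the listed settings rather than the repo's exact loader:

import torch

def spectrogram(waveform, sample_rate=16000, window_size=0.02, window_stride=0.01):
    # 0.02 s * 16000 Hz = 320-sample window; 0.01 s stride = 160-sample hop.
    n_fft = int(sample_rate * window_size)
    hop = int(sample_rate * window_stride)
    window = torch.hamming_window(n_fft)
    # Magnitude STFT; result shape is (freq_bins, time_steps) = (161, T).
    stft = torch.stft(waveform, n_fft=n_fft, hop_length=hop,
                      win_length=n_fft, window=window, return_complex=True)
    return stft.abs()

The actual data loader may additionally apply log compression and per-utterance normalisation; treat this as the shape of the computation, not its exact details.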

Shall we start up a wiki for this kind of thing as well as other documentation?

@SeanNaren
Owner

SeanNaren commented May 18, 2017

Really good idea, will get to it ASAP and open a PR to get this together!

EDIT: @ryanleary, to keep things simple, do you think a new file in the repo named PRETRAINED.md would suffice?

@ryanleary
Collaborator

Oops, missed the edit. That's probably alright. My only thought for a wiki was that it wouldn't require a PR every time there's a new model. I don't really have a strong preference, though. Do you have a preference for where I upload the model above?
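
(Wherever it ends up hosted, the .pth.tar above should be readable as a standard PyTorch serialisation, so the metadata in the summary can be read back with something like the sketch below. The key names here are guesses at a typical checkpoint layout, not a documented schema.)

import torch

# map_location='cpu' lets you inspect the checkpoint without a GPU.
package = torch.load("deepspeech_11.pth.tar", map_location="cpu")

# Hypothetical keys, assuming the checkpoint is a plain dict of metadata
# plus weights; adjust to whatever the checkpoint system actually stores.
print(package.get("version"))     # e.g. 0.0.1
print(package.get("epoch"))       # e.g. 11
weights = package.get("state_dict")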

@SeanNaren
Owner

SeanNaren commented May 30, 2017

That's a good idea, @ryanleary! I'm going to push to get the skip_rnn branch merged into PyTorch, because I want all the models, at least on initial release, to use the pure DS2 architecture (which requires skip_rnn to be implemented).

Then I'll open a new issue to keep track of models trained!

@ryanleary
Collaborator

Sure thing. Definitely looking forward to getting full batch norm support. Will retrain once we have a build of pytorch that supports it.

@ryanleary
Collaborator

Since the skip_input work appears to be stalled, did you want to do this now or continue to wait?

@SeanNaren
Owner

SeanNaren commented Jun 11, 2017

@ryanleary, I'll create a new issue with a plan for what needs to be done for the networks; my initial thought is that skip input isn't viable long-term without cuDNN support. My reasoning is that the DS architecture already takes a long time to train, and not utilising cuDNN slows things down drastically.

It will be even worse when NVIDIA's Volta GPUs come out and we can't utilise the new hardware. As a result I think the 'vanilla' architecture will have to stray a bit and use batch norm on top of the cuDNN RNN (architectures etc. will be outlined in the issue!).
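
To make the "batch norm on top of the cuDNN RNN" idea concrete, here's a sketch of what such a layer could look like (illustrative, not the final design; it assumes (time, batch, feature) inputs and flattens time and batch so BatchNorm1d normalises over all timesteps):

import torch.nn as nn

class BatchRNN(nn.Module):
    # Batch norm applied to the input of a cuDNN-backed RNN layer.
    def __init__(self, input_size, hidden_size, rnn_type=nn.LSTM):
        super().__init__()
        self.batch_norm = nn.BatchNorm1d(input_size)
        # nn.LSTM / nn.GRU dispatch to cuDNN on the GPU, unlike hand-rolled
        # cells, which is the whole point of this compromise.
        self.rnn = rnn_type(input_size, hidden_size, bidirectional=True)

    def forward(self, x):
        # x: (time, batch, features)
        t, n = x.size(0), x.size(1)
        x = self.batch_norm(x.view(t * n, -1)).view(t, n, -1)
        x, _ = self.rnn(x)
        return x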

@SeanNaren
Owner

@ryanleary, and whoever else has input into this: does it make sense to train all models, regardless of dataset, on the full DS2 architecture (or as close to it as possible)?

@ryanleary
Collaborator

I think that's certainly ideal, but we can update models in the future. Having some pretrained models that match what's currently implemented will at least let people experiment with a model that's better than a toy.

As an aside, I'm personally looking forward more to getting BatchNorm and lookahead convolutions implemented and moving toward the "Production" DeepSpeech implementation. It should be easier to train, and it looks like it only costs about a 5% relative performance hit [Spectrogram -> 2D conv -> 2D conv -> GRU -> GRU -> GRU (forward-only) -> 1D row conv -> FC].
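
A rough sketch of that forward-only pipeline, with placeholder dimensions (the kernel sizes and hidden size are illustrative guesses, and the lookahead is approximated as a depthwise 1D conv over future timesteps):

import torch.nn as nn
import torch.nn.functional as F

class Lookahead(nn.Module):
    # Row convolution: each channel mixes a few *future* timesteps so a
    # forward-only GRU stack still sees limited right context.
    def __init__(self, features, context=20):
        super().__init__()
        self.context = context
        self.conv = nn.Conv1d(features, features, kernel_size=context,
                              groups=features, bias=False)

    def forward(self, x):
        # x: (batch, time, features) -> (batch, features, time) for Conv1d.
        x = x.transpose(1, 2)
        x = F.pad(x, (0, self.context - 1))  # pad on the future side only
        return self.conv(x).transpose(1, 2)

class ProductionDS(nn.Module):
    # Spectrogram -> 2D conv -> 2D conv -> 3x forward-only GRU -> row conv -> FC
    def __init__(self, freq_bins=161, hidden=800, classes=29):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=(41, 11), stride=(2, 2)), nn.ReLU(),
            nn.Conv2d(32, 32, kernel_size=(21, 11), stride=(2, 1)), nn.ReLU())
        f = (freq_bins - 41) // 2 + 1          # freq bins after first conv
        f = (f - 21) // 2 + 1                  # ...and after the second
        self.rnn = nn.GRU(32 * f, hidden, num_layers=3)  # unidirectional
        self.lookahead = Lookahead(hidden)
        self.fc = nn.Linear(hidden, classes)

    def forward(self, spect):
        # spect: (batch, 1, freq_bins, time)
        x = self.conv(spect)
        b, c, f, t = x.size()
        x = x.view(b, c * f, t).permute(2, 0, 1)   # (time, batch, features)
        x, _ = self.rnn(x)
        x = self.lookahead(x.permute(1, 0, 2))      # (batch, time, hidden)
        return self.fc(x)                           # per-frame class logits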

@SeanNaren
Owner

@ryanleary agreed. In my head, getting the beam search language model integrated (taking it from the TF fork in the other issue) is the main step towards production DS, and probably the biggest at this stage!
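
For context on what the beam search + LM would replace: the baseline is greedy (best-path) CTC decoding, which collapses repeats and drops blanks. A tiny sketch over the label set from the model summary above (assuming '_' is the CTC blank at index 0, and that the 29th class is a space that doesn't render in the summary):

# Label set from the model summary; the trailing space is an assumption,
# since the summary lists 29 classes but only 28 characters render.
LABELS = "_'ABCDEFGHIJKLMNOPQRSTUVWXYZ "

def greedy_decode(frame_argmax, blank=0):
    # Best-path CTC: collapse consecutive repeats, then drop blanks.
    out, prev = [], blank
    for idx in frame_argmax:        # per-frame argmax over the class logits
        if idx != prev and idx != blank:
            out.append(LABELS[idx])
        prev = idx
    return "".join(out)

# e.g. greedy_decode([0, 9, 9, 0, 10, 0]) == "HI"

A beam search with a language model keeps multiple hypotheses and rescores them with LM probabilities instead of committing to the per-frame argmax, which is where most of the WER gain comes from.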

I've opened a new ticket at #85 to track progress on pre-trained models, so I'll close this one.
