Add Silero Speech-To-Text models #153

Conversation
Deploy preview for pytorch-hub-preview ready! Built with commit 78f39ca

Maybe I did something wrong, but I cannot see our model's page here.

Hi, @ailzhang @bertmaher @wconstab, could you please help with assigning the right person to review this submission?
@snakers4
Thanks for contributing!
If you move your md file out of docs, you should be able to see it on the webpage.
Also, once you do that, the code snippet in the markdown will be executed in CI, so you probably also need a working example there (e.g. test_files).
Hi,
Moved it to the root folder.
Added a small validation dataset download so that it works end-to-end; tested it locally.
Fixed the imports and path issues.
Looks like everything is fixed now.
Great addition!
I have one important comment about dependencies (so that the hub example works in Google Colab) and another comment that is a suggestion.
Please take a look.
```python
# see https://github.com/snakers4/silero-models for utils and more examples

device = torch.device('cpu')  # gpu also works, but our models are fast enough for CPU
model, decoder, utils = torch.hub.load(github='snakers4/silero-models',
```
Would this require omegaconf and torchaudio to be installed?
If so, you should add a cell like in https://github.com/pytorch/hub/blob/master/nvidia_deeplearningexamples_waveglow.md#example which pip-installs these extra packages.
That will make the example instantly runnable in Google Colab, which is really valuable!
Hi,

> would this require omegaconf and torchaudio to be installed?

Yes, it would. That is why I needed to include them in your CI environment above. Actually, in Colab I also needed to install soundfile, since we are using it as the backend for torchaudio.

> if so, you should add a cell like in
> that will make the thing instantly run in Google Colab, and is really valuable!

Yeah, this totally makes sense. By the way, you can see a more extended Colab version here. I will add this shortly.
snakers4_silero-models_stt.md (Outdated)
```python
 read_audio,
 prepare_model_input) = utils  # see function signature for details

torch.hub.download_url_to_file('http://www.openslr.org/resources/83/midlands_english_female.zip',
```
Instead of downloading a zip file, extracting it, and then running a batch of wav files through, it would be much nicer and more illustrative to download a single wav file and process it, like:

```python
torch.hub.download_url_to_file('some download url for speech.wav', dst='speech.wav')
input = prepare_model_input(['speech.wav'], device=device)
output = model(input)
```
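For contrast, the zip-based flow being discussed boils down to extract-then-glob. A self-contained sketch using a locally created stand-in zip (the real example downloads `midlands_english_female.zip` over the network, which is skipped here):

```python
import glob
import os
import tempfile
import zipfile

# Stand-in for the downloaded dataset zip: create a tiny archive with
# empty fake .wav entries, then extract and glob it the same way the
# example feeds a batch of file paths to prepare_model_input.
os.chdir(tempfile.mkdtemp())
with zipfile.ZipFile('speech.zip', 'w') as zf:
    zf.writestr('clips/a.wav', b'')
    zf.writestr('clips/b.wav', b'')
with zipfile.ZipFile('speech.zip') as zf:
    zf.extractall()

files = sorted(glob.glob('clips/*.wav'))
print(files)  # the list of wav paths that would form the batch
```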
One of the reasons why I am downloading the whole validation dataset and running a batch is to demonstrate that the models are actually quite fast on CPU as well as on GPU.
But I see your point.
I will add a default example with one file, but I would nevertheless keep the utils, because batching may be an issue for a first-time user, and I would like to solve that.
I will add the changes shortly.
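The batching concern above is largely about handling variable-length audio. A minimal pure-Python sketch of what a `prepare_model_input`-style helper has to do (`pad_batch` is a hypothetical name; the real util also reads and resamples the audio files):

```python
def pad_batch(signals, pad_value=0.0):
    """Pad variable-length 1-D signals to a common length so they
    can be stacked into a single batch tensor."""
    max_len = max(len(s) for s in signals)
    return [list(s) + [pad_value] * (max_len - len(s)) for s in signals]

batch = pad_batch([[0.1, 0.2, 0.3], [0.4]])
print([len(s) for s in batch])  # every padded signal has the same length
```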
Thanks for the great contribution. It should go live sometime today, maybe in a couple of hours.

Checked here; I guess it is not there yet. Looking forward to the release! On a side note, as a small self-funded team, we feel extremely proud to have made something worthy of inclusion in this hub.

Apparently we've moved to a once-per-day update of the site now. It should be live by tomorrow morning.

It is live now.
Please kindly review our Speech-To-Text models.