Skip to content

andrenatal/DeepSpeech

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Project DeepSpeech

Project DeepSpeech is an open source Speech-To-Text engine that uses a model trained by machine learning techniques, based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow project to facilitate implementation.

Prerequisites

Recommendations

If you have a capable (nVidia) GPU, it is highly recommended to install TensorFlow with GPU support. Training will likely be significantly quicker than using the CPU.

Training a model

Open a terminal, change to the directory of the DeepSpeech checkout and run ipython DeepSpeech.ipynb. This should open your default browser with the DeepSpeech notebook. From here, you can inspect the notebook and alter any variables with regards to what dataset is used, how many training iterations are run and the default values of the training parameters.

Once you are satisfied with the settings, you can begin to train a model by selecting 'Cell' from the notebook menu bar and choosing 'Run All'.

Exporting a model for serving

If the ds_export_dir environment variable is set, or the export_dir variable is set manually, a model will have been exported to this directory during training. If training has been performed without exporting a model, a model can be exported by setting the variable to the directory you'd like to export to (e.g. export_dir = os.path.join(checkpoint_dir, 'export')) and running the model exporting cell manually. If the notebook has been restarted since training, you will need to run all the cells above the training cell first before running the export cell, to declare and initialise the required variables and functions.

Refer to the corresponding README.md for information on building and running the client.

About

ipython notebook for DeepSpeech

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 47.8%
  • Jupyter Notebook 45.5%
  • HTML 6.2%
  • Shell 0.5%