openshift-ml-demo

This is a machine learning demo that uses deepspeech and tensorflow to recognize voice recording in english and translate using google translate to the desired language. Output of what was recognized in the recording and translation done to stdout.

The inital work using deepspeech and google translate was done by melwitt.

We are not using GPU in this demo but rather CPU. Performance would be likely higher using GPU. To use gpu the deepspeech-gpu python module is needed instead of deepspeech and of course nvidia driver and nvidia cuda framework, otherwise there should be no difference.

Prerequisites

A wave file recorded in English using 16Khz and mono. Should be saved as 16bit PCM.
Python2
Python2-devel
Development tools and libraries (gcc, etc)
Python modules
- deepspeech or deepspeech-gpu
- googletrans
- jamspell
- numpy
- webrtcvad

Note: all prerequisites are taken care of by the Dockerfile so this is just provided for information if you want to build this machine learning stack yourself.

Language Support

The translation language can be anything supported by google translate. Use the ISO-639-1 code. Language packs for en, de, es, pl and fr are installed out-of-the-box. If you would like additional languages you need to update Dockerfile and add them.

Running in Docker

$ docker pull ktenzer/deepspeech:latest

$ docker run -i -t <image id>

Running in OpenShift

To deploy and run on OpenShift you can use the template or deploy from Dockerfile. The template will also create a persistent storage volume so you need persistent storage for that. The models which are large (several GB) are stored on volume. This allows for a fast build. Using Dockerfile means that the models end up in the container themselves. The models are downloaded the first time the container is built and run.

Clone repository

$ git clone https://github.com/ktenzer/openshift-ml-demo.git

Create new project

$ oc new-project ml-demo

Create template in ml-demo project

$ oc create -f openshift-ml-demo/yaml/openshift4-ml-demo.yaml

Deploy template from project

$ oc new-app --template=ml-demo -n ml-demo

Deploy just using Docker in OpenShift

$ oc new-app https://github.com/ktenzer/openshift-ml-demo.git

Usage

Once deployed either with OpenShift or Docker use the provided python script stream_with_sentences.py in the container under the /app/repo directory. It takes a translation language, path to a wave file and path to pre-built machine learned models. The script will decode the audo file and output audo to text and at the same time translate to the translation language. Accuracy and error correction on text are also performed.

We have provided a demo.wav audio sample for testing also located under the /app/repo directory.

Note: The machine learned models are very basic and as such accuracy is not nearly as high as it could be. I estimate around 60%. For real-world usage a much more robust model is needed.

$ /deepspeech/openshift-ml-demo/python2 stream_with_sentences.py --lang <en|de|es|pl> --file </path/to/.wav> --models </path/to/models>

Name		Name	Last commit message	Last commit date
Latest commit History 122 Commits
static		static
yaml		yaml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
demo.wav		demo.wav
package.json		package.json
requirements.txt		requirements.txt
server.js		server.js
startup.sh		startup.sh
stream_with_sentences.py		stream_with_sentences.py
translate.py		translate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

openshift-ml-demo

Prerequisites

Language Support

Running in Docker

Running in OpenShift

Clone repository

Create new project

Create template in ml-demo project

Deploy template from project

Deploy just using Docker in OpenShift

Usage

About

Releases

Packages

Contributors 2

Languages

License

ktenzer/openshift-ml-demo

Folders and files

Latest commit

History

Repository files navigation

openshift-ml-demo

Prerequisites

Language Support

Running in Docker

Running in OpenShift

Clone repository

Create new project

Create template in ml-demo project

Deploy template from project

Deploy just using Docker in OpenShift

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages