Speech-to-Text---Python

Simple speech to text for Python. Run the script, say some things into your microphone, and then see what you said (or an approximation).

Powered by pyaudio and Sphinx.

Installation

Sphinxbase

Download sphinxbase and extract the files.

Now, run:

cd sphinxbase
./configure;make clean all;make install
cd python
python setup.py install

You may need to use sudo for make install or python setup.py install.

Pocketsphinx

Download pocketsphinx and extract the files.

Now, run:

cd pocketsphinx
./configure;make clean all;make install
cd python
python setup.py install

Packages (Linux only)

Now, run:

cd speech_to_text
sudo xargs -a apt-packages.txt apt-get install

Pyaudio

Now, download the right version of pyaudio and install it.

Language files

If you want to speak english, you need to get the english language model and the english acoustic model.

You will need to put the acoustic model into scribe/hmm, and the language model into scribe/lm.

The filetree should look like this for english:

speech_to_Text
├── dict
│   └── cmu07a.dic
├── hmm
│   ├── feat.params
│   ├── feature_transform
│   ├── mdef
│   ├── means
│   ├── mixture_weights
│   ├── noisedict
│   ├── README
│   ├── transition_matrices
│   └── variances
├── lm
│   └── cmusphinx-5.0-en-us.lm.dmp

For other languages, check here, or see below on training your own model. If you use different language models, acoustic models, or dictionaries, you will want to change these paths in recognizer.py:

hmdir = os.path.join(BASE_PATH, "hmm")
lmdir = os.path.join(BASE_PATH, "lm/cmusphinx-5.0-en-us.lm.dmp")
dictd = os.path.join(BASE_PATH, "dict/cmu07a.dic")

Run

To run, you just have to:

cd speech_to_text
python speech_to_text.py

Please use the below in line command if the above python script does n't give the proper output.

Note: It should run in the pocketspinx source directory.

pocketsphinx_continuous -infile i/p audio file.wav -hmm /home/ubuntu/Downloads/speech-recognizer/hmm -lm /home/ubuntu/Downloads/speech-recognizer/lm/cmusphinx-5.0-en-us.lm.dmp -dict /home/ubuntu/Downloads/speech-recognizer/dict/cmu07a.dic > o/p file.txt

You should be able to talk for a few seconds, after which it will spend some time processing, and the show you what you said.

Configure

There are some options that you can modify at the top of speech_to_text.py. The easiest one to modify is RECORD_SECONDS.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speech-to-Text---Python

Installation

Sphinxbase

Pocketsphinx

Packages (Linux only)

Pyaudio

Language files

Run

Configure

More reading

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README		README
README.md		README.md
Speech_to_Text.py		Speech_to_Text.py
apt-packages.txt		apt-packages.txt
cmu07a.dic		cmu07a.dic
cmusphinx-5.0-en-us.lm.dmp		cmusphinx-5.0-en-us.lm.dmp
feat.params		feat.params
feature_transform		feature_transform
mdef		mdef
means		means
mixture_weights		mixture_weights
noisedict		noisedict
transition_matrices		transition_matrices
variances		variances

vs-sakthi/Speech-to-Text---Python

Folders and files

Latest commit

History

Repository files navigation

Speech-to-Text---Python

Installation

Sphinxbase

Pocketsphinx

Packages (Linux only)

Pyaudio

Language files

Run

Configure

More reading

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages