This a beginners exploration into Speech Recognition in Python.
Inspiration derived from RealPython's Article, "The Ultimate Guide To Speech Recognition With Python."
Packages Used:
- Python v3.6.4
- PyDub
- PyAudio v0.2.11 + PortAudio v19.0
- SpeechRecognition v3.8.1
- Google Cloud Speech API v0.36.0
- Auto Convert Incorrect Filetypes
- Process Multiple Files Using RegEx
- Live Audio Capability
- Video Captioning + Output Log with Timestamps
- Reduce Noise via DSP Filters before Analysis
- Combine Seperate Microphone and Audio File Functions with a Param
- Use
try
andExcept
block with UnknownValueError for Calls to the API - Add Multi-Language Support (Vietnamese, Spanish)
TYPE INTRO HERE
- Run
$ python -m speech_recognition
to do microphone based live Speech Recognition
Intruction one
enter command here
Intruction Two
enter command here
Then command here
.
[API reference](link here)