Skip to content

Beginner Exploration: Speech Recognition in Python

Notifications You must be signed in to change notification settings

AllenDuong/SpeechPy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 

Repository files navigation

SpeechPy

This a beginners exploration into Speech Recognition in Python.
Inspiration derived from RealPython's Article, "The Ultimate Guide To Speech Recognition With Python."

Packages Used:

  • Python v3.6.4
  • PyDub
  • PyAudio v0.2.11 + PortAudio v19.0
  • SpeechRecognition v3.8.1
  • Google Cloud Speech API v0.36.0

To Do

  • Auto Convert Incorrect Filetypes
  • Process Multiple Files Using RegEx
  • Live Audio Capability
  • Video Captioning + Output Log with Timestamps
  • Reduce Noise via DSP Filters before Analysis
  • Combine Seperate Microphone and Audio File Functions with a Param
  • Use try and Except block with UnknownValueError for Calls to the API
  • Add Multi-Language Support (Vietnamese, Spanish)

TYPE INTRO HERE

Notes:

  • Run $ python -m speech_recognition to do microphone based live Speech Recognition

Getting started

Intruction one

enter command here

Intruction Two

enter command here

Then command here.

Usage

[API reference](link here)

About

Beginner Exploration: Speech Recognition in Python

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages