deepspeech-stt

Introduction

A slim Python client for Mozilla's DeepSpeech speech-to-text

Usage

from src.deepspeech_stt import deepspeech_predict

ouput_text: str = deepspeech_predict(
  wav_file_path,
  batch_after_silence=True,
  silence_threshold=45, # 45db
  filters=["logmmse_denoise", "butter_bandpass_filter"]
)

Parameter	Default	Description
`wave_filename`	`None`	Path to wave file
`batch_after_silence`	`True`	Create batch from input splitting after natural gaps of silence
`silence_threshold`	`50`	The threshold (in decibels) below reference to consider as silence
`filters`	`None`	List of signal filters to apply as pre-processing: `butter_bandpass_filter`, `high_pass_filter`, `low_pass_filter`, `logmmse_denoise`
See notebook for examples

Installation

Download Mozilla's DeepSpeech 0.7.4 pre-trained model (~200mb)

Then run:

poetry install
poetry shell

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.github/worflows		.github/worflows
model		model
notebooks		notebooks
samples		samples
src		src
tests		tests
.flake8		.flake8
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
mypy.ini		mypy.ini
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

deepspeech-stt

Introduction

Usage

Installation

About

Releases

Packages

Languages

crodriguez1a/deepspeech-stt

Folders and files

Latest commit

History

Repository files navigation

deepspeech-stt

Introduction

Usage

Installation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages