VoiceWave-Inference

VoiceWave is a speech-to-text model trained on the LJ-Speech Dataset. It is implemented in Python.

Dataset

The model is trained on the LJ-Speech Dataset, a public-domain speech dataset of 13,100 short audio clips in which a single speaker reads passages from 7 non-fiction books.
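
Because the model was trained on LJ-Speech clips, it can help to check how closely an input recording resembles them before sending it for transcription. The sketch below uses only the standard library and assumes the published LJ-Speech clip format (22,050 Hz, 16-bit, mono WAV). Whether this repository resamples or converts other inputs is not stated in this README, so treat the check as a hint rather than a requirement. The function names are illustrative and not part of the project.

```python
import wave

# Assumption: LJ-Speech clips are 22,050 Hz, 16-bit, mono WAV files.
# Whether this repository converts other formats is not documented here.
EXPECTED_RATE = 22050
EXPECTED_CHANNELS = 1
EXPECTED_SAMPLE_WIDTH = 2  # bytes per sample, i.e. 16-bit PCM


def describe_wav(path):
    """Return basic format information for a PCM WAV file."""
    with wave.open(path, "rb") as clip:
        frames = clip.getnframes()
        rate = clip.getframerate()
        return {
            "sample_rate": rate,
            "channels": clip.getnchannels(),
            "sample_width": clip.getsampwidth(),
            "duration_seconds": frames / rate,
        }


def matches_lj_speech_format(info):
    """Check a describe_wav() result against the assumed LJ-Speech format."""
    return (
        info["sample_rate"] == EXPECTED_RATE
        and info["channels"] == EXPECTED_CHANNELS
        and info["sample_width"] == EXPECTED_SAMPLE_WIDTH
    )
```

For example, matches_lj_speech_format(describe_wav("my_clip.wav")) returns True for a hypothetical clip my_clip.wav recorded as 22,050 Hz, 16-bit mono audio.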

Requirements

Python 3.7 or later
Other dependencies listed in requirements.txt
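
The Python version requirement can be checked before installing anything. The following is a minimal sketch using only the standard library. The helper name python_is_supported is hypothetical and is not part of this repository.

```python
import sys

# Minimum version taken from the Requirements list above.
MIN_PYTHON = (3, 7)


def python_is_supported(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True when the (major, minor) version meets the minimum."""
    return tuple(version_info[:2]) >= minimum


if __name__ == "__main__":
    if not python_is_supported():
        found = ".".join(str(part) for part in sys.version_info[:3])
        sys.exit("Python 3.7 or later is required, found {}".format(found))
    print("Python version OK")
```

Comparing only the major and minor numbers keeps the check independent of patch releases.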

Installation

Step 1: Clone this repository.
Step 2: Install the dependencies using pip:

pip install -r requirements.txt

Usage

Step 1: Download the model from the Releases section of the GitHub repository.
Step 2: Extract the model and place it in the root directory of the project.
Step 3: Start the server by running the following command in the root directory of the project:

uvicorn server:app --reload
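
The launch step can also be scripted from Python, which is handy when the server should start with the same interpreter that has the dependencies installed. This is a sketch under assumptions: the helper names build_server_command and run_server are hypothetical, and it relies on uvicorn being runnable as a module (python -m uvicorn), which takes the same arguments as the uvicorn command.

```python
import subprocess
import sys


def build_server_command(app="server:app", reload=True):
    """Build the argument list equivalent to `uvicorn server:app --reload`."""
    # Running uvicorn as a module uses the current interpreter, and
    # therefore the packages installed from requirements.txt.
    command = [sys.executable, "-m", "uvicorn", app]
    if reload:
        command.append("--reload")
    return command


def run_server():
    """Start the server and block until it is stopped (e.g. with Ctrl+C)."""
    # Assumption: this is called from the project root, where server.py
    # and the extracted model are located.
    return subprocess.run(build_server_command()).returncode
```

Calling run_server() from the project root has the same effect as typing the command by hand. The --reload flag restarts the server when source files change, so it is mainly useful during development, and build_server_command(reload=False) omits it.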

License

This project is licensed under the MIT License. See the LICENCE file for details.

