VoiceWave is a Speech to Text model that is trained on the LJ-Speech Dataset. It is implemented using Python.
The model is trained on the LJ-Speech Dataset, a public domain speech dataset consisting of 13,100 short audio clips of a single speaker reading passages from 7 non-fiction books.
- Python 3.7 or later
- Other dependencies listed in
requirements.txt
- Clone this repository
- Install the dependencies using pip:
pip install -r requirements.txt
- Step 1: Download the model from the github under the releases section.
- Step 2: Extract the model and place it in the root directory of the project.
- step 3: Run the command
uvicorn server:app --reload
in the root directory of the project.
This project is licensed under the MIT License - see the LICENSE file for details.