SoundWave: Speech Assistance For People With Stuttering

Stuttering is a speech disorder that disrupts fluency, causing pauses, repetitions, and difficulty in verbal communication. People who stutter often face challenges in social, educational, and professional settings because of these interruptions. Existing speech assistance tools lack real-time processing and personalization, which limits the immediate support they can provide. This project aims to develop SoundWave, a speech assistance system for people who stutter that uses artificial intelligence to improve communication fluency.

The system works in stages. Speech-to-Text (STT) uses the Vosk ASR model to convert spoken words into text. Next-Word Prediction, powered by GPT-2, anticipates the user’s intended word from the preceding context, reducing pauses and hesitations. Text-to-Speech (TTS) uses YourTTS to convert the predicted text into speech while maintaining natural intonation. Voice Cloning ensures the generated speech retains the user’s own voice, making interactions more personal.

SoundWave is intended as an assistive tool that helps people who stutter communicate more fluently. It has potential applications in therapy, education, and professional environments.
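The flow described above can be sketched as three pluggable stages. This is only an illustration of how the stages connect, not the code in `server.py`: the `SpeechPipeline` class and the `fake_*` stand-in stages are hypothetical names, and it assumes the completed phrase (context plus predicted word) is what gets spoken. In the real system the stand-ins would be replaced by Vosk, GPT-2, and YourTTS.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class SpeechPipeline:
    """Hypothetical wiring of the three stages; not the project's actual API."""
    transcribe: Callable[[bytes], str]    # STT stage: audio -> text
    predict_next: Callable[[str], str]    # prediction stage: context -> next word
    synthesize: Callable[[str], bytes]    # TTS stage: text -> audio

    def assist(self, audio: bytes) -> bytes:
        context = self.transcribe(audio).strip()
        word = self.predict_next(context).strip()
        # Assumption: the spoken output is the context plus the predicted word.
        completed = f"{context} {word}".strip()
        return self.synthesize(completed)


# Stand-in stages so the sketch runs without any models installed.
def fake_stt(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_predict(context: str) -> str:
    return "water" if context.endswith("glass of") else ""


def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = SpeechPipeline(fake_stt, fake_predict, fake_tts)
print(pipeline.assist(b"I would like a glass of"))
# prints b'I would like a glass of water'
```

Keeping each stage behind a plain callable makes it possible to swap or test one model at a time.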

Features

Speech-to-Text (STT): Converts the user’s spoken words into text using Automatic Speech Recognition (ASR).
Next-Word Prediction: Predicts the next word the user is likely to say, based on context, using the GPT-2 model.
Text-to-Speech (TTS): Converts the predicted text into natural-sounding speech using YourTTS.
Voice Cloning: Generates speech in the user’s own voice, rather than a robotic or generic one, from minimal training data.
Real-Time Inference: Optimized for fast and efficient speech synthesis.
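As an illustration of the next-word prediction feature, here is a minimal sketch using the Hugging Face `transformers` library with the public `gpt2` checkpoint. It assumes `torch` and `transformers` are installed; the function names `predict_next_words` and `clean_candidates` are hypothetical and may not match how the project calls GPT-2. Because GPT-2 predicts sub-word tokens, the candidates are filtered down to alphabetic pieces, and some of them may still be word fragments.

```python
def clean_candidates(tokens: list[str]) -> list[str]:
    """Strip whitespace, drop non-alphabetic pieces, and remove
    case-insensitive duplicates while keeping the original order."""
    seen, words = set(), []
    for token in tokens:
        word = token.strip()
        if word.isalpha() and word.lower() not in seen:
            seen.add(word.lower())
            words.append(word)
    return words


def predict_next_words(context: str, k: int = 5) -> list[str]:
    """Return up to k candidate next words for the given context."""
    # Imported lazily so the helper above works without these packages.
    import torch
    from transformers import GPT2LMHeadModel, GPT2TokenizerFast

    tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
    model = GPT2LMHeadModel.from_pretrained("gpt2")
    model.eval()

    input_ids = tokenizer(context, return_tensors="pt").input_ids
    with torch.no_grad():
        # Logits for the token that would follow the last context token.
        next_token_logits = model(input_ids).logits[0, -1]

    # Take extra raw tokens, since cleaning discards punctuation and duplicates.
    top_ids = torch.topk(next_token_logits, k * 4).indices.tolist()
    pieces = [tokenizer.decode([token_id]) for token_id in top_ids]
    return clean_candidates(pieces)[:k]
```

Calling `predict_next_words("I would like a glass of")` returns a short list of candidates; the first call downloads the `gpt2` weights if they are not already cached. A real-time deployment would load the tokenizer and model once at startup rather than on every call.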

Getting Started

Installation

Clone the repository:

```
git clone https://github.com/sreenaddh/SoundWave.git
cd SoundWave
```

Install dependencies:
```
pip install -r requirements.txt
```

Run the application:

```
python server.py  # Run the local server
cd soundwave
flutter run  # Run the app on a PC
```

