ClearSpeak

Developed by Pavan Kumar, ClearSpeak is a Python application that utilizes Google's Speech-to-Text API for real-time audio transcription. The application includes a user-friendly graphical interface built with Tkinter, designed to provide clear transcription of human speech while filtering out background noise.

Features

Real-Time Transcription: Instantaneous transcription of speech from the microphone.
Noise Filtering: Distinguishes between human speech and background noise.
User Interface: Easy-to-use GUI for starting and stopping transcription.

Prerequisites

Before starting, ensure you have the following:

Python 3.x installed.
An active Google Cloud Platform (GCP) account.
Speech-to-Text API enabled in your GCP account.
Your GCP service account key file downloaded.

Setup and Installation

Clone the Repository

git clone https://github.com/ascender1729/ClearSpeak.git
cd ClearSpeak

Environment Setup

Create a virtual environment to manage your project's dependencies:

python -m venv myenv
.\myenv\Scripts\Activate.ps1  # On Windows
source myenv/bin/activate  # On Unix or MacOS

Install Dependencies

Install the required libraries:

pip install google-cloud-speech pyaudio

Configure GCP Credentials

Set your credentials to authenticate with Google Cloud:

os.environ['GOOGLE_APPLICATION_CREDENTIALS'] = 'path_to_your_service_account_key.json'

Running the Application

Execute the application with:

python transcribe.py

How to Use

Click "Start Transcription" to begin.
Click "Stop Transcription" to end. The transcribed text will be displayed in the application window.

Troubleshooting

For pyaudio installation issues:

pip install pipwin
pipwin install pyaudio

Contributing

To contribute:

Fork the repository.
Create a new branch (git checkout -b feature/YourFeature).
Commit your changes (git commit -m 'Add YourFeature').
Push to the branch (git push origin feature/YourFeature).
Create a new Pull Request.

License

This project is available under the MIT License.

Contact

Pavan Kumar - pavankumard.pg19.ma@nitp.ac.in

LinkedIn: linkedin.com/in/im-pavankumar

Project Link: ClearSpeak

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
LICENSE		LICENSE
README.md		README.md
transcribe.py		transcribe.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LICENSE

LICENSE

README.md

README.md

transcribe.py

transcribe.py

Repository files navigation

ClearSpeak

Table of Contents

Features

Prerequisites

Setup and Installation

Clone the Repository

Environment Setup

Install Dependencies

Configure GCP Credentials

Running the Application

How to Use

Troubleshooting

Contributing

License

Contact

About

Languages

License

ascender1729/ClearSpeak

Folders and files

Latest commit

History

LICENSE

LICENSE

README.md

README.md

transcribe.py

transcribe.py

Repository files navigation

ClearSpeak

Table of Contents

Features

Prerequisites

Setup and Installation

Clone the Repository

Environment Setup

Install Dependencies

Configure GCP Credentials

Running the Application

How to Use

Troubleshooting

Contributing

License

Contact

About

Topics

Resources

License

Stars

Watchers

Forks

Languages