Skip to content

Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...

Notifications You must be signed in to change notification settings

sambhavdurishetty001/Audio-to-Audio-translation

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Speech and Speech Translation App

This is a Flask web app that allows you to translate audio and video files to an audio output in any languag of your choice using OpenAI's whisper, gpt-3.5-turbo and Eleven labs. It uses the pytube, moviepy, and pydub libraries to download and process the videos, and nltk for tokenizing the text.

Watch a simple demo of the project:

6428e6f5e849b34fcd6d5b53_webm.1.mp4

Installation

Clone this repository and navigate to the project directory:

bash Copy code git clone https://github.com//.git cd Create a virtual environment and install the required packages:

''' Copy code python -m venv venv source venv/bin/activate # on Windows, use "venv\Scripts\activate" pip install -r requirements.txt '''

Usage

''' Run the app locally using Flask: arduino Copy code export FLASK_APP=app.py export FLASK_ENV=development flask run ''' Navigate to http://localhost:5000/ in your web browser to access the app.

Features

Upload audio or video files or links. Transcribe audio or video and generate a text output. Chuck words of over 3000 tokens. Use the OpenAI API to translate. Use Elevenlabs API to convert text to audio.

Dependencies

Flask Flask-SocketIO Flask-Bootstrap PyTube moviepy pydub nltk OpenAI API key Elevenlabs API key

Contributing

Contributions to this project are welcome. To contribute, please follow these steps: Fork this repository. Create a new branch: git checkout -b my-new-branch Make your changes and commit them: git commit -m "Add some feature" Push to the branch: git push origin my-new-branch Create a new pull request. Please include a clear description of your changes and their purpose.

About

Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 65.8%
  • HTML 15.4%
  • JavaScript 10.0%
  • CSS 8.8%