Automatic Translations Project - Complete Guide

If you want to see a video about how to use this repo, check this video

Prerequisites

Ensure that Python 3.x is installed:

python --version

FFmpeg

FFmpeg is required for audio and video processing. Follow the steps below to install it:

Windows

Download the FFmpeg executable from the official FFmpeg website.
Extract the downloaded zip file to a folder (e.g., C:\ffmpeg).
Add the bin directory to your system's PATH:
- Open the Start Menu, search for "Environment Variables", and select "Edit the system environment variables".
- Click on "Environment Variables".
- Under "System variables", find the Path variable and click "Edit".
- Click "New" and add the path to the bin directory (e.g., C:\ffmpeg\bin).
- Click "OK" to save the changes.

Unix/MacOS

Install FFmpeg using a package manager:

# For Ubuntu/Debian
sudo apt update
sudo apt install ffmpeg

# For MacOS using Homebrew
brew install ffmpeg

Environment Setup

Create and activate a virtual environment:

py -m venv venv
source venv/Scripts/activate  # Windows
source venv/bin/activate      # Unix/MacOS

Install the dependencies:

pip install -r requirements.txt

Configuration

Duplicate .env.example to .env and add the credentials:

cp .env.example .env

Using `main.py`

Place the videos in /videos and run:

python main.py --language [TARGET_LANGUAGE] --action [ACTION]

--language or -l: Specifies the target language for the translation (default is English).
--action or -a: Defines the last action to perform (options: extract, transcribe, translate, all).

Explanation of Actions

Extract: Extracts the audio from the source video
Transcribe: Transcribes the source video into vtt or json format
Translate: Translates the transcription into the target language
All: Performs all the above actions and generates a new version of the video in the target language using ElevenLabs (you must have the API KEY)

Using `record.py`

Record, transcribe, and translate audio in real-time:

Choose the microphone ID to record: First, the application will show you the available devices, just type the ID of the one you want to use.
Set the target language: Then, you will be asked to type the language you want to translate to, it can be any language.
After that, just press Enter to start recording and you're done!

Using `voice_assistant.py`

This script acts as a voice assistant that records audio, transcribes it, generates responses using OpenAI, and converts the responses into speech. It allows users to select audio devices, voices, and system prompts. The script also logs the times for transcription, response generation, and speech generation, and concatenates audio files into a single conversation file.

Using `notetaker.py`

This script records audio from a selected microphone and transcribes it. The transcription is then reformatted into a more readable format using markdown syntax. The script saves the formatted transcription to a markdown file.

Output

main.py: MP3 audio, transcription, translation, and translated audio in /output, additionally the translated video if you selected all.
record.py: WAV recordings, transcriptions, translations, and translated audio in recording_sessions.
voice_assistant.py: WAV recordings, transcriptions, responses, and concatenated audio files in notes.
notetaker.py: WAV recordings and formatted transcriptions in notes.

Additional Information

.gitignore excludes venv and output.
Scripts ignore .gitkeep in /videos.

Follow this guide for efficient use of the Automatic Translations Project.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automatic Translations Project - Complete Guide

Prerequisites

FFmpeg

Windows

Unix/MacOS

Environment Setup

Configuration

Using `main.py`

Explanation of Actions

Using `record.py`

Using `voice_assistant.py`

Using `notetaker.py`

Output

Additional Information

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
docs		docs
images		images
notes		notes
output		output
recording_sessions		recording_sessions
src		src
videos		videos
.env.example		.env.example
.gitignore		.gitignore
main.py		main.py
notetaker.py		notetaker.py
readme.ES.md		readme.ES.md
readme.md		readme.md
record.py		record.py
requirements.txt		requirements.txt
voice_assistant.py		voice_assistant.py

Folders and files

Latest commit

History

Repository files navigation

Automatic Translations Project - Complete Guide

Prerequisites

FFmpeg

Windows

Unix/MacOS

Environment Setup

Configuration

Using main.py

Explanation of Actions

Using record.py

Using voice_assistant.py

Using notetaker.py

Output

Additional Information

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Using `main.py`

Using `record.py`

Using `voice_assistant.py`

Using `notetaker.py`

Packages