Skip to content

germanocastanho/audio-transcriber

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

2 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸŽ™οΈ Audio Transcriber

Audio Transcriber is a Python script that transcribes audio and video files into text using OpenAI's transcription models. Specifically designed for Brazilian Portuguese content, it automatically infers speakers from context and formats the transcription appropriately. Perfect for any audio/video content! πŸ“„

πŸš€ Main Features

  • Easy Transcription: πŸ”Š Convert audio/video files to text with a simple command-line interface.
  • Brazilian Focused: πŸ‡§πŸ‡· Optimized for accurate transcription of Brazilian Portuguese content.
  • Context Inference: πŸ‘₯ Automatically identifies speakers and formats dialogue appropriately.
  • Markdown Output: πŸ“ Saves transcriptions as simple but useful markdown files with unique IDs.
  • Format Flexibility: πŸŽ₯ Supports various file formats, including MP3, MP4, WAV, M4A and more.

βœ… Prerequisites

πŸ› οΈ Local Installation

# Clone the repository
git clone https://github.com/germanocastanho/audio-transcriber.git

# Navigate to the directory
cd audio-transcriber

# Set up a virtual environment
python3 -m venv .venv
source .venv/bin/activate

# Install dependencies
pip install -r requirements.txt

# Set up your API key
echo "OPENAI_API_KEY=YOUR_API_KEY" > .env

# Run the "main.py" script
python3 main.py

πŸ“œ Free Software

Distributed under the GNU GPL v3, ensuring freedom - as in "free speech" - to use, modify, and redistribute the software, as long as these freedoms are preserved in any derivative versions. By using or contributing, you support the free software philosophy and help build a libertarian technological environment! ✊

About

Python script that transcribes audio and video files πŸŽ™οΈ

Topics

Resources

License

Stars

Watchers

Forks

Languages