Audio Transcriber is a Python script that transcribes audio and video files into text using OpenAI's transcription models. Specifically designed for Brazilian Portuguese content, it automatically infers speakers from context and formats the transcription appropriately. Perfect for any audio/video content! π
- Easy Transcription: π Convert audio/video files to text with a simple command-line interface.
- Brazilian Focused: π§π· Optimized for accurate transcription of Brazilian Portuguese content.
- Context Inference: π₯ Automatically identifies speakers and formats dialogue appropriately.
- Markdown Output: π Saves transcriptions as simple but useful markdown files with unique IDs.
- Format Flexibility: π₯ Supports various file formats, including MP3, MP4, WAV, M4A and more.
- Python 3.12+, available through the official website.
- OpenAI API Key, obtainable from the OpenAI platform.
# Clone the repository
git clone https://github.com/germanocastanho/audio-transcriber.git
# Navigate to the directory
cd audio-transcriber
# Set up a virtual environment
python3 -m venv .venv
source .venv/bin/activate
# Install dependencies
pip install -r requirements.txt
# Set up your API key
echo "OPENAI_API_KEY=YOUR_API_KEY" > .env
# Run the "main.py" script
python3 main.py
Distributed under the GNU GPL v3, ensuring freedom - as in "free speech" - to use, modify, and redistribute the software, as long as these freedoms are preserved in any derivative versions. By using or contributing, you support the free software philosophy and help build a libertarian technological environment! β