Speech to Text Transcription

Simple Python script to transcribe audio files using OpenAI's API with word-level timestamps.

Setup

Install dependencies:

pip install -r requirements.txt

Set your OpenAI API key:

export OPENAI_API_KEY="your-api-key-here"

Or create a .env file:

OPENAI_API_KEY=your-api-key-here

Usage

python transcribe.py <audio_file> [output_file]

Examples

# Transcribe audio.mp4 and save to audio.txt
python transcribe.py audio.mp4

# Transcribe audio.mp4 and save to custom output file
python transcribe.py audio.mp4 output.txt

Features

Supports MP4, MP3, WAV, and other audio formats
Word-level timestamps (character-aware)
Simple command-line interface
Automatically saves transcriptions to text files

Output Format

When using word-level timestamps, the output includes:

Full transcript text
Word-level timestamps with start and end times

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
transcribe.py		transcribe.py
transkr1.m4a		transkr1.m4a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Speech to Text Transcription

Setup

Usage

Examples

Features

Output Format

About

Uh oh!

Releases

Packages

Languages

blackdorn/speechtotext

Folders and files

Latest commit

History

Repository files navigation

Speech to Text Transcription

Setup

Usage

Examples

Features

Output Format

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages