whisper-notes

Transcribe audio recordings and process text documents into structured, categorized markdown using OpenAI Whisper and the Anthropic API. Designed as an intake pipeline for Obsidian vaults.

Requirements

macOS with Apple Silicon (or any platform that supports Whisper)
uv (Python package manager)
ffmpeg
An Anthropic API key

Setup

# Install uv and ffmpeg (if not already installed)
brew install uv ffmpeg

# Install Python dependencies
uv sync

# Add your Anthropic API key
cp .env.example .env
# Then open .env and replace the placeholder with your key

Usage

# Process a single audio file
uv run main.py recording.m4a

# Process a text document
uv run main.py notes.txt

# Process all supported files in a folder (recursive)
uv run main.py recordings/

# Use a more accurate Whisper model
uv run main.py recording.m4a --model medium

# Use a different Claude model for formatting
uv run main.py recording.m4a --claude-model opus

# Skip Claude formatting entirely (raw transcription only, audio files only)
uv run main.py recording.m4a --no-format

# Reprocess files that were already processed
uv run main.py recordings/ --force

When processing a directory, you'll see a preview of what will be processed before confirming:

Files to process: 3
  Audio: 2
  Documents: 1
  Estimated API calls: 3 (Claude)
  Whisper transcriptions: 2

Proceed? [Y/n]

Files that have already been processed are automatically skipped on subsequent runs (use --force to override).

Output

Processed files are written to a processed/ directory with structured frontmatter:

---
title: weightlifting-goals
date: 2026-03-21
processed: 2026-03-21T14:32:00
status: raw
medium: audio
---

## Summary

This recording covers a discussion about weightlifting goals for the
upcoming quarter, including specific targets for squat and deadlift
numbers and thoughts on training split adjustments.

## Cleaned Transcription

I want to increase my squat to 300 lbs by the end of the quarter.
For deadlifts, I'm going to focus on form over weight for now.

---

## Raw Transcription

i want to increase my squat to like 300 lbs by the end of the quarter
um for deadlifts im going to focus on form over weight for now...

For audio files, an archival copy of the markdown is also written next to the source audio, and the audio file is renamed to match the generated title.

For text documents, the source file is left untouched (no archival copy, no rename).

Index

Each run updates processed/_index.md — a table tracking all processed files, their source paths, dates, and vault assignments. Unsupported files found during collection are listed separately.

Vault categorization

Claude assigns each processed file to a vault category from config.json. Default vaults:

fantasy-writing
game-development
self-observations
software-engineering

If Claude suggests a vault that doesn't exist, you'll be prompted to add it. Edit config.json directly to add or remove vaults.

Supported formats

Audio: .m4a .mp3 .wav .mp4 .mov .m4v

Documents: .txt .md .docx .rtf

Whisper models

Model	Speed	Accuracy	Best for
`tiny`	Fastest	Low	Quick drafts
`base`	Fast	Good	Clear speech
`small`	Moderate	Better	Accents, noisy audio (default)
`medium`	Slow	Great	High accuracy needed
`large`	Slowest	Best	Maximum accuracy

The model is downloaded automatically on first use and cached locally.

Claude models

Model	Speed	Cost	Best for
`haiku`	Fastest	Lowest	Quick processing
`sonnet`	Moderate	Mid	Good balance (default)
`opus`	Slowest	Highest	Best quality formatting

Development

# Run tests
uv run pytest tests/ -v

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
tests		tests
whisper_notes		whisper_notes
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
CLAUDE.md		CLAUDE.md
README.md		README.md
config.example.json		config.example.json
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

whisper-notes

Requirements

Setup

Usage

Output

Index

Vault categorization

Supported formats

Whisper models

Claude models

Development

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

whisper-notes

Requirements

Setup

Usage

Output

Index

Vault categorization

Supported formats

Whisper models

Claude models

Development

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages