audio-mcp

Remote MCP server for audio processing, deployed to Aurora and reachable only over Tailscale.

What it does

Transcribe audio from YouTube URLs, HTTP(S) audio URLs, inline base64 payloads, or previously uploaded files. Backends: Groq Whisper (cloud, default) or local faster-whisper on CPU.
Generate audio from text, with three backends: piper (local, free, Polish default voice), Google Cloud TTS Standard, OpenAI gpt-4o-mini-tts. Polish text is normalised (URLs and long hashes removed, acronyms respelled phonetically).
Job artefacts (transcription.json, transcription.txt, audio.mp3) are downloadable via HTTP using URLs returned in each tool response.

Connecting clients

All clients connect over Tailscale to https://audio-mcp.uaru-teeth.ts.net/mcp.

Claude Desktop

Edit ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "audio-mcp": {
      "type": "http",
      "url": "https://audio-mcp.uaru-teeth.ts.net/mcp"
    }
  }
}

Claude Code

Edit ~/.claude/.mcp.json:

{
  "mcpServers": {
    "audio-mcp": {
      "type": "http",
      "url": "https://audio-mcp.uaru-teeth.ts.net/mcp",
      "timeout": 900000
    }
  }
}

The raised timeout covers long CPU transcriptions.

Development

uv sync
make dev           # run server locally
make test          # run tests with coverage
make lint          # ruff check

Requirements: Python 3.12, uv, ffmpeg.

Architecture

FastAPI + FastMCP v3 — HTTP + MCP protocol
Groq — cloud transcription (Whisper)
faster-whisper — local CPU transcription
piper — local TTS (Polish, free)
Google Cloud TTS / OpenAI TTS — cloud TTS options
SQLite (aiosqlite) — job and upload metadata
Tailscale — secure network access

Name		Name	Last commit message	Last commit date
Latest commit History 74 Commits
.github/workflows		.github/workflows
app		app
docs		docs
scripts		scripts
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
.python-version		.python-version
Dockerfile		Dockerfile
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

audio-mcp

What it does

Connecting clients

Claude Desktop

Claude Code

Development

Architecture

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

audio-mcp

What it does

Connecting clients

Claude Desktop

Claude Code

Development

Architecture

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages