
Distill — Live Speaker Translation

Distill is a local-first live translation setup made of a Chrome extension and a FastAPI backend. The extension captures tab audio, streams it to the backend over WebSocket, and plays translated audio back through your selected output device.
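Conceptually, the extension slices captured PCM audio into fixed-size chunks before streaming them over the WebSocket. A minimal sketch of that chunking step (the chunk size and raw-PCM framing here are assumptions for illustration, not values taken from the extension):

```python
def chunk_pcm(pcm: bytes, chunk_size: int = 4096) -> list[bytes]:
    """Split a raw PCM byte buffer into fixed-size chunks for streaming.

    chunk_size is a hypothetical value; the real extension may use
    different framing entirely.
    """
    return [pcm[i:i + chunk_size] for i in range(0, len(pcm), chunk_size)]

# Example: 10,000 bytes of silence -> two full 4096-byte chunks plus a tail.
chunks = chunk_pcm(b"\x00" * 10_000)
print(len(chunks), len(chunks[-1]))  # 3 1808
```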

Current Project Shape

  • backend/ is the active server. It handles translation sessions, settings, and profile storage.
  • extension/ is the active Chrome extension you load into Chrome.
  • User profiles are stored locally in SQLite at backend/profiles.db.
  • web/ still contains older prototype assets and Convex-related code, but it is not part of the current local setup documented here.

Architecture

  1. The extension captures tab audio in Chrome.
  2. Audio is streamed to ws://localhost:8000/ws/translate.
  3. The backend uses either AzureTranslationClient or AzureConversationClient for the STT path, depending on STT_PROVIDER.
  4. The backend produces translated text from the incoming speech.
  5. In the live extension flow, translated speech is synthesized with AzureTtsClient.
  6. Translated audio is streamed back to the extension for playback.
  7. The extension plays the translated audio through the selected output device.

Quick Start

1. Start the backend

cd backend
cp .env.example .env
uv sync
uv run uvicorn main:app --reload --port 8000

Once the backend is running:

  • Health check: http://localhost:8000/health
  • Local dashboard: http://localhost:8000/

2. Add your API keys

Edit backend/.env and set the keys required for the path you are running.

For the current live Azure path:

  • AZURE_SPEECH_KEY
  • AZURE_SPEECH_REGION

Other keys used elsewhere in the backend:

  • SPEECHMATICS_API_KEY
  • MINIMAX_API_KEY
  • SUPERMEMORY_API_KEY

The backend also reads provider and tuning settings such as:

  • STT_PROVIDER
  • TTS_PROVIDER
  • TRANSLATION_TRIGGER_CHAR_THRESHOLD
  • AZURE_SEGMENTATION_SILENCE_MS
  • SPEECHMATICS_MAX_DELAY
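Putting these together, backend/.env might look roughly like the fragment below. Every value shown is a placeholder or an assumed illustrative default; check .env.example for the real names and defaults:

```ini
# Azure Speech (required for the live path)
AZURE_SPEECH_KEY=your-azure-speech-key
AZURE_SPEECH_REGION=westeurope

# Optional providers used elsewhere in the backend
SPEECHMATICS_API_KEY=
MINIMAX_API_KEY=
SUPERMEMORY_API_KEY=

# Provider selection and tuning (values here are illustrative, not defaults)
STT_PROVIDER=azure
TTS_PROVIDER=azure
TRANSLATION_TRIGGER_CHAR_THRESHOLD=40
AZURE_SEGMENTATION_SILENCE_MS=500
SPEECHMATICS_MAX_DELAY=2
```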

3. Build the Chrome extension

cd extension
npm install
npm run build

Then load it in Chrome:

  1. Open chrome://extensions
  2. Enable Developer mode
  3. Click Load unpacked
  4. Select extension/dist

4. Use it

  1. Open a tab with audio, such as Google Meet or YouTube
  2. Click the Distill extension icon
  3. Choose source and target language
  4. Choose an output device
  5. Start translation

Audio Routing

Translated audio plays through the output device selected in the extension.

Profiles And Storage

Profiles are managed by the backend and stored locally in SQLite. The relevant API routes are:

  • GET /api/profiles
  • POST /api/profiles
  • GET /api/voice-profile
  • POST /api/voice-profile
  • PATCH /api/voice-status

The storage layer lives in backend/services/profile_store.py.
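The storage layer can be pictured as a thin wrapper around sqlite3. The schema and method names below are assumptions sketched for illustration; the actual implementation is in backend/services/profile_store.py:

```python
import sqlite3

class ProfileStore:
    """Minimal sketch of a local SQLite profile store.

    The real schema in backend/services/profile_store.py may differ;
    the columns here are assumed for illustration.
    """

    def __init__(self, path: str = "profiles.db"):
        self.conn = sqlite3.connect(path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS profiles ("
            " id INTEGER PRIMARY KEY AUTOINCREMENT,"
            " name TEXT NOT NULL,"
            " target_lang TEXT NOT NULL)"
        )

    def create(self, name: str, target_lang: str) -> int:
        cur = self.conn.execute(
            "INSERT INTO profiles (name, target_lang) VALUES (?, ?)",
            (name, target_lang),
        )
        self.conn.commit()
        return cur.lastrowid

    def list(self) -> list[tuple[int, str, str]]:
        return self.conn.execute(
            "SELECT id, name, target_lang FROM profiles"
        ).fetchall()

# In-memory example; the backend persists to backend/profiles.db on disk.
store = ProfileStore(":memory:")
store.create("default", "es")
print(store.list())  # [(1, 'default', 'es')]
```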

Notes

  • The extension expects the backend on localhost:8000.
  • The backend serves a small local dashboard at / for health and some settings.
  • The current live extension flow uses Azure for the active STT and TTS path.
  • Some legacy Convex files still exist under web/, but they are not part of the supported setup documented here.

Tech Stack

  • Extension: React, TypeScript, Vite, Chrome MV3
  • Backend: Python, FastAPI, WebSocket, SQLite, uv
  • Live speech path: Azure Speech Translation or Azure Conversation Transcriber, plus Azure Speech Synthesis
  • Other integrated services in the backend: Speechmatics, MiniMax, Supermemory
