Your invisible desktop AI assistant. Powered by Backboard.
```bash
git clone [repository-url]
cd caddy
npm install
```

Create a `.env` file in the root:

```
BACKBOARD_API_KEY=your_api_key_here
```

The app requires these permissions (System Settings > Privacy & Security):
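The backend reads this key from the environment. A minimal sketch of that lookup (the helper name and error message here are illustrative, not taken from the app's source):

```python
import os

def get_backboard_key(env=None):
    """Fetch the Backboard API key, failing fast with a clear error if missing."""
    env = os.environ if env is None else env
    key = env.get("BACKBOARD_API_KEY")
    if not key:
        raise RuntimeError("BACKBOARD_API_KEY is not set; add it to your .env file")
    return key
```

Failing at startup with an explicit message is friendlier than a confusing SDK error later.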
| Permission | Why | Grant via |
|---|---|---|
| Screen Recording | Screenshot capture and screen watch OCR | Privacy & Security > Screen Recording |
| Microphone | Live audio transcription | Prompted on first use |
| Accessibility | Global keyboard shortcuts (Cmd+B, Cmd+H) | Privacy & Security > Accessibility |
If shortcuts or screenshots aren't working, check that the Electron app (or Terminal, during development) is listed and enabled under each permission.
By default the app records your microphone. To capture system audio (e.g., a Zoom call, a recorded meeting), install BlackHole and create a Multi-Output Device so you can hear audio AND the app can capture it simultaneously.
1. Install BlackHole
```bash
brew install blackhole-2ch
```

2. Create a Multi-Output Device

- Open Audio MIDI Setup (Spotlight search or `/Applications/Utilities/`)
- Click + in the bottom-left corner > Create Multi-Output Device
- Check both:
- Your headphones or speakers
- BlackHole 2ch
- Rename it to something like "Headphones + BlackHole" (right-click the name)
3. Route audio
- Set your Mac's Sound Output to the new Multi-Output Device (System Settings > Sound > Output)
- In the app, click the ▾ dropdown next to the mic button and select BlackHole 2ch as the audio input
Now audio plays through your headphones (so you can hear) and is simultaneously routed to BlackHole (so the app can transcribe it).
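Apps that enumerate audio devices typically pick out BlackHole by name. A sketch of that lookup, operating on plain dicts shaped like the entries `sounddevice.query_devices()` returns (whether Caddy actually uses that library is an assumption):

```python
def find_input_device(devices, name_substring="BlackHole"):
    """Return the index of the first capture-capable device whose name matches."""
    for i, dev in enumerate(devices):
        if name_substring.lower() in dev["name"].lower() and dev["max_input_channels"] > 0:
            return i
    return None  # no matching input device found
```

The `max_input_channels > 0` check matters: the Multi-Output Device itself is output-only, so only the BlackHole 2ch entry qualifies as an input.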
```bash
./start.sh
```

This starts the transcription server on port 5111 and the Electron app together.
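Roughly, the start script launches the two processes and hands back their handles. A Python sketch of the same idea (the transcription server's path and entry point are assumptions; the real `start.sh` may differ):

```python
import subprocess

# Assumed commands; check start.sh for the actual invocations.
COMMANDS = [
    ["python", "transcribe/server.py", "--port", "5111"],  # transcription server
    ["npm", "start"],                                      # Electron app
]

def launch(commands, spawn=subprocess.Popen):
    """Spawn each command and return the handles so a caller can wait or kill them."""
    return [spawn(cmd) for cmd in commands]
```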
Caddy is an Electron overlay that sits invisibly on top of your screen. It captures screenshots, transcribes audio, and routes everything through Backboard for LLM analysis and memory.
| Component | Role |
|---|---|
| `electron/` | Invisible overlay, screenshot capture, keyboard shortcuts |
| `api/` | Flask backend — all LLM + memory via Backboard SDK |
| `transcribe/` | Local Whisper transcription + Tesseract OCR server |
| `src/` | React frontend (queue, solutions, chat) |
| Shortcut | Action |
|---|---|
| `Cmd+B` | Toggle window visibility |
| `Cmd+H` | Take screenshot |
| `Cmd+Enter` | Get solution |
| `Cmd+Arrow Keys` | Move window |
| `Cmd+Q` | Quit |
- Invisible overlay — translucent, always-on-top, undetectable
- Screenshot analysis — capture anything on screen, get instant AI answers
- Audio intelligence — real-time transcription and analysis
- Contextual chat — ask follow-up questions with full conversation memory
- Model selector — switch between any LLM provider available on Backboard
- Screen watch — periodic OCR + LLM analysis of what's on screen
- Cross-platform — macOS, Windows, Linux
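The screen-watch feature boils down to a periodic capture → OCR → analyze loop. One tick of that loop might look like this sketch; the skip-unchanged-text optimization is an assumption, not confirmed app behavior:

```python
def watch_step(capture, ocr, analyze, last_text=""):
    """One screen-watch tick: OCR the current screen and analyze it only
    when the visible text has changed since the previous tick."""
    text = ocr(capture())
    if text and text != last_text:
        return analyze(text), text   # new analysis result, updated state
    return None, last_text           # nothing new on screen; skip the LLM call
```

Deduplicating on the OCR text keeps the loop from re-sending an unchanged screen to the LLM every interval.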
If the app doesn't start, make sure nothing is using ports 5111 (transcription) or 5180 (vite dev server):
```bash
lsof -i :5111 -i :5180
```

For Sharp/Python build errors:

```bash
rm -rf node_modules package-lock.json
SHARP_IGNORE_GLOBAL_LIBVIPS=1 npm install --ignore-scripts
npm rebuild sharp
```

ISC