KenKaiii/brah


🎙️ Brah

Your AI that actually sees and does.


Brah is a desktop voice assistant that lives in the corner of your screen. Talk to it and it listens, looks at your screen, controls your computer, and gets things done, all in real time through the OpenAI Realtime API.

It's not just a chatbot. It takes screenshots and reasons about what's on screen, drives your mouse and keyboard, automates the browser, and manages your tasks and calendar, hands-free.


🧠 Why this exists

Most voice assistants can talk but can't do. Brah closes that gap: it sees your actual screen, controls your actual computer, and runs real tools, so a conversation turns into action instead of just suggestions.


✨ What it actually does

Realtime voice

Low-latency voice in, voice out, powered by the OpenAI Realtime API. Live transcription and natural barge-in let you interrupt and steer mid-sentence.
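
Barge-in can be pictured as a small state machine. The sketch below is not taken from the Brah source. It assumes the Realtime API server events `response.created`, `response.done`, and `input_audio_buffer.speech_started`, plus the client event `response.cancel`. The names `BargeInState`, `Step`, `onServerEvent`, and `clearPlayback` are hypothetical.

```typescript
// Hypothetical sketch of barge-in handling; not Brah's actual implementation.
// Event names are assumed from the OpenAI Realtime API.

type ServerEvent = { type: string };
type ClientEvent = { type: "response.cancel" };

interface BargeInState {
  assistantSpeaking: boolean; // true between response.created and response.done
}

interface Step {
  state: BargeInState;
  send: ClientEvent[]; // client events to forward to the Realtime session
  clearPlayback: boolean; // whether locally queued assistant audio should be dropped
}

function onServerEvent(state: BargeInState, event: ServerEvent): Step {
  switch (event.type) {
    case "response.created":
      return { state: { assistantSpeaking: true }, send: [], clearPlayback: false };
    case "response.done":
      return { state: { assistantSpeaking: false }, send: [], clearPlayback: false };
    case "input_audio_buffer.speech_started":
      // The user started talking. If the assistant is mid-reply, interrupt it.
      if (state.assistantSpeaking) {
        return {
          state: { assistantSpeaking: false },
          send: [{ type: "response.cancel" }],
          clearPlayback: true,
        };
      }
      return { state, send: [], clearPlayback: false };
    default:
      // Every other event leaves the barge-in state untouched.
      return { state, send: [], clearPlayback: false };
  }
}
```

Depending on the session's turn-detection settings, the server may already cancel the in-flight response on its own. In that case only the local playback queue would need clearing.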

Sees your screen

Take screenshots of any window or display and have the model analyze what's there: visible text, UI elements, errors, and the obvious next action.

Computer use

Two modes:

  • Browser mode: A sandboxed Chromium browser (via Playwright) for web tasks
  • OS mode: Full OS-level control (via nut.js) to drive the real mouse and keyboard
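
One way to picture the split between the two modes is a tagged action type routed to a per-mode driver. This is a minimal sketch under assumptions, not Brah's code: the action shapes, the `Drivers` interface, and `dispatch` are all hypothetical. Real drivers would wrap Playwright and nut.js and would almost certainly be asynchronous.

```typescript
// Hypothetical sketch: routing an agent action to browser mode or OS mode.

type BrowserAction =
  | { mode: "browser"; kind: "goto"; url: string }
  | { mode: "browser"; kind: "click"; selector: string };

type OsAction =
  | { mode: "os"; kind: "moveMouse"; x: number; y: number }
  | { mode: "os"; kind: "typeText"; text: string };

type Action = BrowserAction | OsAction;

interface Drivers {
  browser: (action: BrowserAction) => string; // would wrap Playwright in a real app
  os: (action: OsAction) => string; // would wrap nut.js in a real app
}

// The `mode` tag decides which driver handles the action, so a browser
// action can never reach the real mouse and keyboard by accident.
function dispatch(action: Action, drivers: Drivers): string {
  switch (action.mode) {
    case "browser":
      return drivers.browser(action);
    case "os":
      return drivers.os(action);
  }
}
```

Keeping the mode in the type means the compiler rejects, for example, a `moveMouse` action handed to the browser driver.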

Planner

Built-in tasks and calendar the agent can create, update, and remind you about.
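
The create, update, and remind operations could be modelled roughly as follows. The `PlannerTask` shape and the three helper functions are hypothetical and only illustrate the idea. Brah's real planner schema is not described here.

```typescript
// Hypothetical sketch of a planner data model; not Brah's actual schema.

interface PlannerTask {
  id: string;
  title: string;
  done: boolean;
  remindAt?: number; // epoch milliseconds; absent means no reminder
}

// Returns a new list with the task appended (the input list is not mutated).
function createTask(tasks: PlannerTask[], task: PlannerTask): PlannerTask[] {
  return [...tasks, task];
}

// Returns a new list where the task with the given id has the patch applied.
function updateTask(
  tasks: PlannerTask[],
  id: string,
  patch: Partial<Omit<PlannerTask, "id">>,
): PlannerTask[] {
  return tasks.map((t) => (t.id === id ? { ...t, ...patch } : t));
}

// Unfinished tasks whose reminder time has arrived.
function dueReminders(tasks: PlannerTask[], now: number): PlannerTask[] {
  return tasks.filter(
    (t) => !t.done && t.remindAt !== undefined && t.remindAt <= now,
  );
}
```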

Web search & fetch

Pulls live information and reads pages on demand.


🚀 Getting started

git clone https://github.com/KenKaiii/brah.git
cd brah
npm install
npm start

Sign in to OpenAI from inside the app to start a Realtime session. That's it.


🔒 Privacy

  • Data (planner, activity, screenshots) is stored locally on your machine
  • Credentials are encrypted via your system keychain (Electron safeStorage)
  • No analytics, no telemetry
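
For context on the credentials bullet: Electron's `safeStorage` module exposes `isEncryptionAvailable()`, `encryptString()`, and `decryptString()`. The sketch below shows one way a wrapper around it might look. The `SafeStorageLike` interface and the `sealSecret` and `openSecret` helpers are hypothetical, and how Brah actually stores its credentials is an assumption here.

```typescript
// Hypothetical wrapper around an Electron-safeStorage-shaped object.
// `B` stands for the encrypted blob type (a Buffer with the real module).

interface SafeStorageLike<B> {
  isEncryptionAvailable(): boolean;
  encryptString(plainText: string): B;
  decryptString(encrypted: B): string;
}

// Encrypts a secret, refusing to continue if the OS keychain is unavailable
// so that a credential is never silently written in plain text.
function sealSecret<B>(storage: SafeStorageLike<B>, secret: string): B {
  if (!storage.isEncryptionAvailable()) {
    throw new Error("OS-level encryption is unavailable; refusing to store the secret");
  }
  return storage.encryptString(secret);
}

// Decrypts a blob previously produced by sealSecret.
function openSecret<B>(storage: SafeStorageLike<B>, sealed: B): string {
  return storage.decryptString(sealed);
}
```

In an Electron main process, the real `safeStorage` object could be passed in as `storage`. Taking it as a parameter keeps the helpers testable outside Electron.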

🛠️ For developers

npm run check   # format + lint (Biome)
npm test        # check + Node test suite
npm run build:mac

Stack: Electron + OpenAI Realtime API + Playwright + nut.js

Code signing is auto-discovered from your keychain (or CSC_* env vars); with no certificate the macOS build falls back to ad-hoc / unsigned.
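
The fallback described above can be read as a three-way decision. The sketch below is only an illustration: `resolveSigningMode` and `SigningMode` are hypothetical names, the precedence of `CSC_*` variables over the keychain is an assumption, and treating any non-empty `CSC_*` variable as a certificate hint is a simplification. The real logic lives in the build tooling.

```typescript
// Hypothetical sketch of the signing fallback; not the actual build logic.

type SigningMode =
  | "certificate-from-env"
  | "certificate-from-keychain"
  | "ad-hoc";

function resolveSigningMode(
  env: Record<string, string | undefined>,
  keychainIdentities: string[],
): SigningMode {
  // Assumption: any non-empty CSC_* variable means a certificate was supplied.
  const hasCscVar = Object.keys(env).some(
    (key) => key.startsWith("CSC_") && (env[key] ?? "") !== "",
  );
  if (hasCscVar) return "certificate-from-env";
  if (keychainIdentities.length > 0) return "certificate-from-keychain";
  // No certificate anywhere: fall back to an ad-hoc / unsigned build.
  return "ad-hoc";
}
```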


👥 Community


📄 License

MIT


An AI that sees your screen, runs your computer, and gets things done, hands-free.
