Kirtos 🤖

A local-first, voice-driven AI assistant for macOS — think Jarvis for your desktop.

Kirtos listens to your voice, understands intent, and executes actions on your machine — all without sending your data to the cloud (except for LLM calls).

✨ Features

🎤 Voice Control — Talk naturally via LiveKit + Deepgram STT
🧠 Intent Classification — Fast regex classifier + LLM fallback (OpenRouter)
🔊 Natural TTS — Premium voice responses via Cartesia
💬 WhatsApp Integration — Send/read messages by contact name via Baileys
🌐 Browser Control — Search, play YouTube, open websites
💻 System Control — Volume, brightness, apps, Do Not Disturb, notifications
📱 iMessage — Send messages to contacts via macOS
🎵 Music — Play local music files
📚 Knowledge — Wikipedia search
😄 Fun — Jokes on demand

📁 Project Structure

kirtos/
├── agent/          # Node.js backend — intent parsing, executors, services
│   ├── src/
│   │   ├── executor/   # Intent handlers (system, browser, whatsapp, etc.)
│   │   ├── services/   # Core services (STT, TTS, classifier, WhatsApp)
│   │   └── policy/     # Security engine, intent definitions, permissions
│   └── .env.example    # ← Copy to .env and fill in your API keys
├── app/            # React + Vite frontend — voice UI with WebGL orb
├── docs/           # Architecture documentation
├── start.sh        # Start both agent + app
└── .env.template   # Root env template

🚀 Quick Start

Prerequisites

macOS (required — uses native AppleScript for system control)
Node.js 18+
API keys for: LiveKit, Deepgram, OpenRouter, Cartesia

Setup

# 1. Clone the repo
git clone https://github.com/techieujjwal/community-dashboard.git kirtos
cd kirtos

# 2. Install dependencies
npm run install:all

# 3. Set up environment variables
cp agent/.env.example agent/.env
# Edit agent/.env with your API keys

# 4. Start the app
bash start.sh

The agent runs on http://localhost:3001 and the UI opens at http://localhost:5173.

WhatsApp Setup (Optional)

Start the agent
Say "connect whatsapp"
Scan the QR code in the terminal with your phone
You can now send/read messages by voice!

🗣️ Example Commands

Command	What it does
"What is JavaScript?"	Wikipedia search
"Play lofi beats on YouTube"	Opens YouTube with the video
"Set brightness to 50%"	Adjusts screen brightness
"Send whatsapp to Utkarsh hi"	Sends WhatsApp message
"Read my whatsapp messages"	Reads recent messages
"Tell me a joke"	Fetches a random joke
"Open Terminal"	Launches an app
"Play music"	Plays local music
"Enable Do Not Disturb"	Toggles Focus mode

🔐 Security

All API keys are stored in agent/.env (never committed)
WhatsApp session credentials in .whatsapp-auth/ (gitignored)
Intent execution uses a policy engine with role-based permissions
Sensitive actions (WhatsApp send, shell exec) require confirmation

🧠 NLP Architecture

Kirtos uses a 3-tier "Waterfall" Intent Classification approach for latency vs. intelligence trade-offs:

Fast Regex Classifier (Tier 1): ~1ms latency. Handles simple, highly predictive commands (e.g., "Set volume to 50").
Local NLP Classifier (Tier 2): ~5ms latency. Runs a local ML model (Python API) offline for more flexible command patterns.
LLM Fallback (Tier 3): High latency, maximum intelligence. Uses OpenRouter / OpenAI / DigitalOcean for complex conversational interactions, context resolution, and multi-step plans.

🛠 Tech Stack

Layer	Technology
Voice Transport	LiveKit WebRTC
Speech-to-Text	Deepgram
Text-to-Speech	Cartesia
Intent Parsing	Fast regex + OpenRouter LLM
WhatsApp	Baileys (no Docker needed)
Frontend	React + Vite + WebGL
Backend	Node.js + Fastify + WebSocket

📄 License

ISC

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
agent		agent
app		app
docs		docs
.env.template		.env.template
.gitignore		.gitignore
PROJECT_BRIEF.md		PROJECT_BRIEF.md
README.md		README.md
antigravity.yaml		antigravity.yaml
package.json		package.json
postcss.config.js		postcss.config.js
start.sh		start.sh
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kirtos 🤖

✨ Features

📁 Project Structure

🚀 Quick Start

Prerequisites

Setup

WhatsApp Setup (Optional)

🗣️ Example Commands

🔐 Security

🧠 NLP Architecture

🛠 Tech Stack

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Kirtos 🤖

✨ Features

📁 Project Structure

🚀 Quick Start

Prerequisites

Setup

WhatsApp Setup (Optional)

🗣️ Example Commands

🔐 Security

🧠 NLP Architecture

🛠 Tech Stack

📄 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages