AI Assistant Orchestrator

A production-grade AI agent platform that accepts natural language commands over SMS, HTTP, CLI, Telegram, and Discord — and executes them as validated, auditable actions. Built with TypeScript, Claude (Anthropic), and SQLite.

Features

Multi-Channel Input

SMS via Twilio (HMAC-SHA1 verified)
REST API (POST /command)
CLI (npm run cli -- "your command")
Telegram bot (long-polling)
Discord bot

LLM Planning Pipeline

Every command goes through a structured pipeline:

User Input → Claude (JSON planner) → Zod schema validation → Tool allowlist check → Execute → SQLite audit log

Claude never runs code directly. It outputs a strict JSON action plan that is validated against registered schemas before anything executes.

RAG — Knowledge Base

Upload documents and ask questions about them. The assistant automatically retrieves relevant context and injects it into every LLM prompt.

Upload .txt, .md, .pdf files via POST /kb/upload
Ingest web pages via POST /kb/url
Documents are chunked (500 chars, 100-char overlap) and indexed with TF-IDF vectors
At query time, cosine similarity retrieves top-3 relevant chunks
Context is silently prepended to the planner prompt — no extra user step required

Multi-Step Research Agent (ReAct Loop)

A Reasoning + Acting agent that iteratively searches the web before synthesizing an answer:

Decides what to search based on the task
Executes a web.search call, observes results
Evaluates: is this enough, or do I need more?
Repeats up to 3 iterations
Synthesizes a final comprehensive answer

"research how WebSockets compare to SSE"
→ searches "WebSocket vs SSE performance"
→ searches "SSE use cases 2024"
→ synthesizes → returns formatted answer

Specialized Sub-Agents

Domain-specific planners invoked via agent.delegate:

Agent	Purpose
`job`	Job search planning, email triage, application tracking
`study`	Study schedules, flashcards, progress tracking
`fitness`	Workout plans, reminders, habit tracking
`discordOps`	Discord channel management and announcements
`research`	Multi-step web research with ReAct loop

Tool Registry (25+ tools)

All tools are registered with name, permission level, confirmation level, and a Zod schema. Unknown tools are rejected at the validator before execution.

Category	Tools
Tasks	`tasks.create`, `tasks.list`, `tasks.complete`, `tasks.delete`
Reminders	`reminders.create`, `reminders.list`, `reminders.delete`
Notes	`notes.create`, `notes.search`, `notes.list`
Knowledge Base	`kb.ingest`, `kb.search`, `kb.list`
Web	`web.search` (Brave), `web.summarize`
Email	`email.list`, `email.read`, `email.send`, `email.summarize`, `email.archive`, `email.triage`
Messaging	`sms.reply`, `telegram.reply`, `discord.post`, `assistant.reply`
Utility	`weather.current`, `briefing.get`, `history.query`, `agent.delegate`

Security Model

Tool allowlist: only registered tools can execute — unknown tool names are rejected
Zod .strict() validation: extra fields on any action are rejected
Confirmation levels: none (auto-run), soft (auto-run with log), hard (user must reply "yes")
Max actions cap: plans with more than 5 actions are rejected
Twilio HMAC-SHA1 verification on /sms/inbound
API key auth (X-ASSISTANT-KEY) on /phone/webhook
Rate limiting: 30 req/min per IP

Audit Trail

Every command, plan, and action result is persisted to SQLite via Prisma. Browse with npm run db:studio.

Tech Stack

Layer	Technology
Runtime	Node.js 20+, TypeScript (ESM)
LLM	Anthropic Claude (`claude-sonnet-4-5`)
Database	SQLite via Prisma ORM
API	Express.js
SMS	Twilio
Messaging	Telegram Bot API, Discord.js
Email	Gmail API (OAuth2)
Web Search	Brave Search API
File Parsing	multer (upload), pdf-parse (PDF text extraction)
Validation	Zod

Architecture

src/
├── agent/           # Claude integration: planner, system prompt, JSON parser
├── executor/        # Pipeline orchestration, allowlist validation, confirmations
├── tools/
│   ├── schemas/     # Zod schemas for every tool
│   ├── implementations/  # Execution logic — each returns { ok, data, summary }
│   └── registry.ts  # Tool registration: name, permission, confirmation, schema, execute
├── subagents/       # Specialized planners (job, study, fitness, discordOps, research)
├── rag/             # RAG system: chunker, TF-IDF vectorizer, retriever
├── api/             # Express route handlers
├── store/           # Prisma client + schema (Command, Action, Task, Note, Document…)
├── scheduler/       # Cron-based reminder firing
├── telegram/        # Telegram bot
├── discord/         # Discord bot
└── ui/              # Web dashboard

Quick Start

# Install dependencies
npm install

# Set up environment
cp .env.example .env
# Add your CLAUDE_API_KEY at minimum

# Create database
npm run db:push

# Start server
npm run dev

# Send a command
npm run cli -- "create task finish the slides by Friday"
npm run cli -- "remind me to call mom at 6pm"
npm run cli -- "research how transformers work in NLP"

Upload a Document to the Knowledge Base

# Upload a file
curl -X POST http://localhost:3500/kb/upload \
  -F "file=@resume.pdf" \
  -F "title=My Resume"

# Ingest a web page
curl -X POST http://localhost:3500/kb/url \
  -H "Content-Type: application/json" \
  -d '{"url": "https://docs.example.com", "title": "Project Docs"}'

# Now ask about it — RAG context is injected automatically
npm run cli -- "what's my experience with React?"

# List all documents
curl http://localhost:3500/kb/docs

Environment Variables

Variable	Required	Description
`CLAUDE_API_KEY`	Yes	Anthropic API key
`ASSISTANT_KEY`	Yes	Auth key for `/phone/webhook`
`DATABASE_URL`	No	SQLite path (default: `file:./data.db`)
`TWILIO_ACCOUNT_SID`	No	Twilio SMS
`TWILIO_AUTH_TOKEN`	No	Twilio SMS
`TWILIO_PHONE_NUMBER`	No	Twilio SMS
`TELEGRAM_BOT_TOKEN`	No	Telegram bot
`DISCORD_BOT_TOKEN`	No	Discord bot
`DISCORD_WEBHOOK_URL`	No	Discord webhook for posts
`BRAVE_SEARCH_API_KEY`	No	Web search (required for research agent)
`GMAIL_CLIENT_ID`	No	Gmail integration
`GMAIL_CLIENT_SECRET`	No	Gmail integration
`DRY_RUN`	No	Set `true` to log actions without executing

API Reference

Method	Path	Description
`POST`	`/command`	Send a natural language command
`POST`	`/sms/inbound`	Twilio webhook
`GET`	`/history`	Query command history
`GET`	`/confirm/pending`	List actions awaiting confirmation
`POST`	`/confirm`	Confirm or deny a pending action
`POST`	`/kb/upload`	Upload `.txt`, `.md`, or `.pdf` to knowledge base
`POST`	`/kb/url`	Ingest a web page into knowledge base
`GET`	`/kb/docs`	List all knowledge base documents
`DELETE`	`/kb/docs/:id`	Delete a document
`GET`	`/health`	Health check
`GET`	`/`	Web dashboard

How to Add a New Tool

Define a Zod schema in src/tools/schemas/<name>.ts
Write the implementation in src/tools/implementations/<name>.ts — return { ok, data, summary }
Register in src/tools/registry.ts with name, description, permission, confirmation, schema, execute
Add an example to src/agent/systemPrompt.ts

Follow the <namespace>.<verb> convention (e.g. calendar.create, slack.post).

Database Schema

Command  →  Action[]        (full audit trail of every plan + execution)
Task                        (to-dos with status and due date)
Reminder                    (scheduled notifications)
Note                        (tagged quick notes)
Bookmark                    (saved URLs)
Document  →  DocumentChunk[]  (knowledge base with TF-IDF vectors)

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.github/workflows		.github/workflows
bin		bin
docs		docs
scripts		scripts
src		src
.env.example		.env.example
.gitignore		.gitignore
1.md		1.md
CLAUDE.md		CLAUDE.md
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Assistant Orchestrator

Features

Multi-Channel Input

LLM Planning Pipeline

RAG — Knowledge Base

Multi-Step Research Agent (ReAct Loop)

Specialized Sub-Agents

Tool Registry (25+ tools)

Security Model

Audit Trail

Tech Stack

Architecture

Quick Start

Upload a Document to the Knowledge Base

Environment Variables

API Reference

How to Add a New Tool

Database Schema

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Assistant Orchestrator

Features

Multi-Channel Input

LLM Planning Pipeline

RAG — Knowledge Base

Multi-Step Research Agent (ReAct Loop)

Specialized Sub-Agents

Tool Registry (25+ tools)

Security Model

Audit Trail

Tech Stack

Architecture

Quick Start

Upload a Document to the Knowledge Base

Environment Variables

API Reference

How to Add a New Tool

Database Schema

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages