Version: 4.1 (Production-Grade with Four Foundations) Status: Authoritative Quick Nav: index.md - AI-optimized navigation
"Tell Butler what matters. Butler handles the rest, safely, consistently, across your real digital world."
Every feature must answer ONE of four questions:
| Question | Service | What it determines |
|---|---|---|
| WHO? | Identity | Voice face, user profile |
| WHERE + WHEN + WHAT around? | Context | Location, device proximity, time, environment |
| WHAT WANT? | Intent | Command, question, request, conversation |
| HOW RESPOND? | Response | Spoken, visual, notification, action |
Butler ONLY responds when explicitly prompted or when monitoring critical context:
- ✅ Responding to explicit commands
- ✅ Answering direct questions
- ✅ Following user-set reminders
- ✅ Acting on detected emergencies
- ❌ Unsolicited news alerts
- ❌ Random fun facts
- ❌ Unprompted recommendations
- ❌ Tracking without consent
Reference: perfect-design.md - Complete four foundations documentation
| Service | Purpose |
|---|---|
| Gateway | REST API, idempotency, streaming |
| Auth | JWT, passkeys, JWKS |
| Orchestrator | Durable execution, interrupts |
| Memory | Temporal model, entity resolution |
| ML | Retrieval → ranking cascade |
| Tools | Capability runtime, policy |
| Security | Policy engine, OPA |
| Realtime | Typed events, delivery classes |
| Device | Capability-based, health connectors |
| Data | Domain schema, outbox, RLS |
| Vision | Stacked perception, verification |
| Audio | Dual-STT, TTS stack |
| Communication | Policy layer, SLOs |
| Observability | Platform, workflow telemetry |
| Search | Full-text + semantic |
| Plugins | MCP-first tool extensions |
| (Reserved) | Future expansion |
| (Reserved) | Future expansion |
Gateway → Auth → Orchestrator → Memory → Tools
Butler optimally distributes work between client and server:
| Client (latency + privacy) | Server (intelligence + memory) |
|---|---|
| Wake word detection | ASR (Automatic Speech Recognition) |
| VAD (Voice Activity Detection) | Intent classification |
| Sensor data collection | User profile management |
| Local embedding extraction | Cross-device context |
| Matter/IoT control | Response generation |
| Audio playback | Memory storage |
Reference: client-server-split.md - Complete split mapping
| Layer | Technology |
|---|---|
| Vision AI | GroundingDINO, SAM 2, InsightFace |
| Audio AI | ECAPA-TDNN, Silero VAD, Whisper |
| Backend | FastAPI, PostgreSQL, Redis, Kafka |
| Infrastructure | Kubernetes, NGINX, Prometheus |
| Mobile | React Native (Expo) |
| Category | Description | Link |
|---|---|---|
| AI Index | Optimized for AI agents | index.md |
| Governance | Constitution, rules, models | 00-governance |
| Core Docs | BRD → PRD → TRD → HLD → LLD | 01-core |
| 18 Services | All service specifications | 02-services |
| Reference | API, workflows, plugins | 03-reference |
| Operations | Runbooks, security, deployment | 04-operations |
| Development | Setup, build order | 05-development |
| Pattern | Description |
|---|---|
| Four-state health | STARTING → HEALTHY → DEGRADED → UNHEALTHY |
| RFC 9457 errors | Problem Details format |
| 18 services | Gateway through Plugins |
| Macro/Routine/Workflow | Three execution layers |
| Service boundaries | Gateway NEVER calls Memory |
When docs conflict, resolve in this order:
- 00-governance/platform-constitution.md (Highest)
- 00-governance/system-design-rules.md
- 01-core/BRD.md → PRD.md → TRD.md → HLD.md → LLD.md
- 02-services/*.md
- 03-reference/*.md
- 04-operations/*.md
- Code
- KISS - Keep It Simple, Stupid
- SOLID - Clean boundaries
- Modular monolith - Extraction-ready
- Event-driven - Async over sync
- Security-first - Trust by default
| Metric | Target |
|---|---|
| Users | 1M |
| RPS (peak) | 10K |
| Latency P95 | <1.5s |
| Availability | 99.9% |
For AI agents, start with:
- index.md - Navigation
- 00-governance/platform-constitution.md - Thesis
- 01-core/HLD.md - Architecture
For engineers, start with:
- 05-development/SETUP.md - Local setup
- 01-core/HLD.md - Architecture
- 05-development/build-order.md - Build sequence
| Channel | Contact |
|---|---|
| Engineering | #butler-engineering |
| Security | security@butler.lasmoid.ai |
| Documentation | docs@butler.lasmoid.ai |
| Need | File |
|---|---|
| Navigation | index.md - Start here |
| System overview | perfect-design.md - Four foundations |
| Architecture | client-server-split.md - Client vs server |
| Capabilities | butler-capability-matrix.md - 177 features |
| Setup | cross-platform-deployment-matrix.md - Deployment |
| Services | infrastructure-architecture-spec.md - Infrastructure |
| Metric | Target |
|---|---|
| Users | 1M |
| RPS (peak) | 10K |
| Latency P95 | <1.5s |
| Availability | 99.9% |
| Channel | Contact |
|---|---|
| Engineering | #butler-engineering |
| Security | security@butler.lasmoid.ai |
| Documentation | docs@butler.lasmoid.ai |
Document owner: Architecture Team Version: 4.1 (Production-Grade with Four Foundations) Last Updated: 2026-04-20