OpenPilot — AI Personal Computer Assistant & Agent Runtime Platform

OpenPilot is a single-process, all-in-one AI agent runtime platform supporting multi-model, multi-channel, and multi-agent collaboration. It provides a complete HTTP/WebSocket API gateway and a Web control panel. Designed as a personal computer assistant, it covers a wide range of daily life and work tasks.

🎯 What It Can Do: Life & Work Task Checklist

📁 File & Document Processing

Read/Write Local Files — Read, create, and edit any text file on your computer (code, config, notes, etc.)
Code Patching — Apply precise patches to code files, supporting OpenAI-format patches
Batch File Operations — Bulk rename, move, and organize files via Shell tools

💻 Shell Commands & System Operations

Execute Terminal Commands — Run any Shell command on your computer (install software, compile projects, manage processes, etc.)
Background Process Management — Start, monitor, and stop background services
System Info Queries — Check disk space, memory usage, network status, etc.

🌐 Network & Information Retrieval

HTTP Requests — Call any API, fetch web content, download data
Web Browsing — Automated browser operations (open pages, take screenshots, click, execute JS)
Information Search — Search and retrieve real-time information via web tools

🖥️ Screen & Screenshots

Desktop Screenshots — One-click screen capture (macOS native screencapture)
Screen Recording — Record screen video clips (macOS)

🧠 Memory & Knowledge Management

Long-term Memory — Persistently store personal preferences, frequently used info, and work habits via USER.md
Session Search — Full-text search of conversation history (SQLite FTS5) to quickly find past discussions
Session Management — Create, switch, compact, and delete conversation sessions to keep context clean

💬 Multi-Channel Messaging

Telegram Bot — Chat with your AI assistant anytime, anywhere via Telegram (supports DM and groups)
Discord Bot — Use the AI assistant in Discord servers (supports DM, channels, threads)
Web Chat Interface — Built-in browser chat panel with streaming output and tool call display
Cross-Channel Session Isolation — Conversations across different channels are isolated for privacy

🤖 Multi-Agent Collaboration

Multi-Agent Management — Create AI agents with different roles (coding assistant, writing assistant, analyst, etc.)
Agent Routing — Automatically assign agents based on channel, user, or group
Sub-Agents — Main agents can spawn sub-agents for complex tasks (depth limits, concurrency control)
Agent Identity — Each agent has its own name, avatar, and system prompt

🔮 PolyOracle — AI Prediction Market Analysis

Real-time Market Data — Connects to Polymarket Gamma API for live prediction market quotes
AI Probability Analysis — AI independently evaluates event probabilities and compares with market prices to find +EV opportunities
Signal Recording — All analysis results are persisted to the database for historical tracking
Scheduled Auto-Scanning — Cron job scans markets every 4 hours (configurable)
Push Notifications — +EV opportunities are automatically pushed to Telegram/Discord (24h dedup)
Visual Dashboard — Market quotes, AI signals, Cron status, and notification settings in one place

📊 Quant Copilot — AI-Powered Stock Analysis

Technical Analysis — Automated technical indicator calculation (SMA, RSI, MACD, Bollinger Bands) via Python/pandas
AI-Driven Insights — LLM analyzes technical data and generates buy/sell/hold signals with confidence scores
Finnhub News Integration — Fetches latest market news for sentiment-aware analysis
Configurable Watchlist — Set your stock watchlist and signal threshold via the Web UI
Auto Python Environment — Python venv and dependencies are automatically installed on startup
Agent Model Routing — Uses the model configured for the quant-analyst agent, with fallback chain support

📈 Auto Quant Trading — Automated Quantitative Trading Pipeline

⏰ Scheduled Task Scheduling

Cron Scheduler — Persistent scheduled task system based on node-cron
Database Storage — Task definitions stored in SQLite, manageable via UI
Manual Trigger — Support manual task execution via API or UI
Concurrency Control — Prevents duplicate execution of the same task

🔧 System Configuration & Management

Web Control Panel — Full management interface covering chat, sessions, models, channels, agents, skills, config, cron, usage, audit, etc.
Dynamic Configuration — Modify system config in real-time via UI or API, no manual file editing needed
Field-Level Descriptions — Config UI fetches schema metadata to display labels and help text for each field
Multi-Model Switching — Supports 11+ AI providers, 30+ models, freely switchable at runtime
API Key Rotation — Multiple keys per provider with automatic switching + cooldown mechanism
Failover — Automatic fallback to alternative models when the primary model fails
Audit Logging — All tool invocations are traceable

🖼️ Image Generation

Multi-Engine Support — Supports Qwen (Tongyi Wanxiang), Stability AI, OpenAI DALL·E, local Stable Diffusion
Natural Language to Image — Describe the desired image, AI generates it automatically
Auto-Send — Generated images are automatically sent to Telegram/Discord via PendingFiles

📄 Document Generation

PDF Generation — Markdown content auto-converted to PDF (supports CJK, code highlighting, tables)
PPT Generation — JSON slide data auto-generated into PowerPoint files (theme configurable)
Auto-Send — Generated documents are automatically sent to the user

🎤 Voice Interaction (STT/TTS)

Speech-to-Text (STT) — Supports Google Gemini, OpenAI Whisper, DashScope Qwen Omni, and more
Text-to-Speech (TTS) — Supports Edge TTS (free) and OpenAI TTS
Voice Message Loop — Voice messages auto-transcribed → AI processes → voice reply (inbound mode)
No Text Confusion — Text commands get text replies; voice commands get voice replies

🎬 Video Editing (Phase 1 MVP)

Video Probing — Extract video metadata via ffprobe (duration, resolution, codec, frame rate, bitrate)
Video Trimming — Trim video clips by specifying time ranges
Video Speed Change — Supports 0.25x to 4.0x speed (video + audio sync)
Timeline JSON — LLM outputs structured editing instructions; tools perform deterministic FFmpeg rendering
Fail-Fast — Returns clear error immediately when FFmpeg is unavailable
Auto-Send — Rendered videos are automatically sent to the user via PendingFiles

🧩 Skill Extensions

Community Skill Market — Search and install community-contributed skills (ClawHub + SkillsMP dual source)
One-Click Install — Install new skills from the community market with one click
Skill Management — Enable/disable installed skills

🔒 Security & Approvals

Policy Engine — Tool-level allow/deny/require-approval policies
Execution Approval — Dangerous operations (Shell, file writes) require human approval in production mode
DM Security Policy — Supports open/allowlist/pairing/disabled modes
API Key Masking — Sensitive information in config is automatically redacted

📋 Typical Use Cases

Scenario	How It Works
Get prediction market opportunities every morning	Cron scheduled scan → AI analysis → Telegram push
Analyze stock technical indicators	Quant Copilot → Python technical analysis → AI signal generation
Remotely operate your computer from phone	Send message via Telegram/Discord → AI executes Shell commands
Organize project files	Describe needs in chat → AI reads/writes files + executes commands
Monitor website changes	Scheduled task + browser screenshot + push notification
Code review & modification	Discuss in chat → AI reads code → generates patches
Query real-time information	AI calls HTTP tools to fetch API data
Manage multiple AI roles	Create different agents, bind to different channels/groups
Remember personal preferences	AI auto-writes to USER.md, auto-loads in next conversation
Generate promotional images	Describe the scene → AI calls image generation → auto-sends to chat
Generate PDF reports	Provide content → AI generates Markdown → auto-converts to PDF
Create PPT presentations	Describe outline → AI generates slide JSON → auto-generates .pptx
Send voice messages to AI	Send voice on Telegram → STT transcription → AI processes → voice reply
Edit video clips	Send video + "trim 1:20-1:40" → AI generates Timeline → FFmpeg renders
Auto quant trading	Stock scan → AI signal → confidence filter → auto order → stop-loss monitor → notification
Paper trading test	Configure paper mode → orders route to Longport simulated API → verify strategy without real money

Architecture

┌─────────────────────────────────────────────────────────┐
│                    Control UI (React)                    │
│              Vite + TailwindCSS + Zustand                │
└──────────────────────┬──────────────────────────────────┘
                       │ HTTP / WebSocket
┌──────────────────────▼──────────────────────────────────┐
│                   Gateway (Express)                      │
│         REST API · WebSocket Stream · Static Serve       │
├──────────────────────────────────────────────────────────┤
│                   Pi Agent Runtime                        │
│        ReAct Loop · Tool Pipeline · Context Guard        │
├──────────┬───────────┬───────────┬───────────────────────┤
│  Model   │  Session  │   Tool    │     Channel           │
│  Manager │  Manager  │  Executor │     Manager           │
│          │           │           │                       │
│ OpenAI   │ SQLite    │ File      │ Telegram (grammy)     │
│ Anthropic│ JSONL     │ Network   │ Discord (discord.js)  │
│ Google   │ LRU Cache │ Shell     │ Slack                 │
│ DeepSeek │           │ Browser   │                       │
│ Ollama   │           │ Patch     │ CommandLane           │
│ Qwen     │           │ Memory    │ InboundDebouncer      │
│ 11+ prov │           │ Screen    │ PairingStore          │
│          │           │ SubAgent  │                       │
│          │           │ Polymarket│                       │
│          │           │ Stock     │                       │
│          │           │ Image     │                       │
│          │           │ Document  │                       │
│          │           │ Voice     │                       │
│          │           │ Video     │                       │
│          │           │ Trading   │                       │
├──────────┴───────────┴───────────┴───────────────────────┤
│  AgentManager · SubagentRegistry · PolicyEngine · Audit  │
├──────────────────────────────────────────────────────────┤
│  CronScheduler · PolymarketScanner · StockScanner        │
│  NotificationService · VoiceService · ImageRouter        │
│  AutoTradingPipeline · TradingGateway · StopLossManager  │
│  SignalEvaluator · StrategyEngine · PortfolioManager     │
├──────────────────────────────────────────────────────────┤
│              Sandbox · PluginManager · Skills             │
└──────────────────────────────────────────────────────────┘

Tech Stack

Layer	Technology
Backend Runtime	Node.js + TypeScript
HTTP Framework	Express 5
WebSocket	ws
Database	SQLite (better-sqlite3) + JSONL audit logs
Frontend Framework	React 19 + Vite 7
Styling	TailwindCSS 4
State Management	Zustand 5
Code Editor	Monaco Editor
Charts	Recharts
Telegram	grammy
Discord	discord.js v14
AI SDK	openai · @anthropic-ai/sdk · @google/generative-ai
Scheduled Tasks	node-cron
Voice Synthesis	edge-tts (Python)
Stock Analysis	yfinance + pandas (Python)
Trading	longport SDK (Longport OpenAPI)
Video Processing	FFmpeg / FFprobe
Backend Testing	Jest + ts-jest + fast-check (property-based)
Frontend Testing	Vitest + Testing Library
Build	tsc (backend) + Vite (frontend)

Feature Completion Status

Core Runtime

Pi Agent ReAct loop (multi-turn tool calls + auto-retry + failover)
Multi-model support: 11+ providers, 30+ models, auto-discovered at runtime
Auth Profile rotation (multiple keys per provider with auto-switch + cooldown)
Failover chain (automatic fallback to alternative models on failure)
Context window guard (auto-compacts session when limit exceeded)
Concurrency limiter (overload protection)

Session Management

SQLite persistence + LRU memory cache
JSONL parallel audit log writing
Session CRUD (create, load, save, delete)
Session compaction
Daily token quota

Tool System

14 built-in tool categories: File, Network, Shell, Browser, Patch, Memory, SubAgent, Screen, Polymarket, Stock, Trading, Image, Document, Voice, Video
PolicyEngine (allow/deny/require-approval)
AuditLogger audit logging
Tool catalog (38+ tools, 16 categories, grouped by Profile)
Execution approval queue (human approval in production mode)

Multi-Agent

AgentManager: agent CRUD + file management (SOUL.md / IDENTITY.md, etc.)
Agent-level model + system prompt overrides
SubagentRegistry: sub-agent lifecycle management (depth limits, concurrency control)
Agent routing bindings (7-level priority: peer > peer.parent > guild+roles > guild > team > account > channel > default)

Multi-Channel

ChannelManager unified channel management (register, connect, disconnect, reconnect, health check)
Telegram full implementation (grammy, DM/group/supergroup, Bot commands)
Discord full implementation (discord.js v14, DM/channel/thread, attachments, message chunking)
Slack framework implementation
Gateway adapter path (reads multi-account config from config.json5)
InboundDebouncer inbound message debouncing
PairingStore device pairing
SecurityGate DM security policy (open/allowlist/pairing/disabled)
Outbound message chunked delivery

PolyOracle — AI Prediction Market Analysis

Gamma API real-time market data integration
AI probability analysis (single Analyst architecture, supports qwen/gemini/deepseek, etc.)
Signal storage (market_signals table)
+EV opportunity identification (edge ≥ 5% threshold, configurable)
Scheduled auto-scanning (CronScheduler + PolymarketScanner)
Telegram/Discord push notifications (24h dedup)
Visual dashboard (5 tabs: About, Markets, Signals, Cron, Notification Settings)
Metric tooltip explanations + usage guide

Quant Copilot — AI Stock Analysis

StockScanner service with Python-based technical analysis (SMA, RSI, MACD, Bollinger Bands)
AI-driven signal generation (buy/sell/hold with confidence scores)
Finnhub API integration for market news and sentiment
Configurable watchlist and signal threshold via Web UI
Auto Python venv setup and dependency installation on startup
Agent-level model routing (quant-analyst agent config → fallback chain → auto-detect)
Stock analysis tools registered in tool catalog
Unit tests and property-based tests

Auto Quant Trading — Automated Trading Pipeline

Scheduled Task System

CronScheduler (node-cron + SQLite persistence)
Task CRUD (create, update, delete, enable/disable)
Manual trigger execution
Concurrency guard (prevents duplicate task runs)
Handler registration mechanism (extensible for new task types)

Notification Service

NotificationService (Telegram/Discord push)
Signal notifications (+EV opportunity push)
Scan summary notifications
System alert notifications
24h dedup (notified_at field)

Image Generation

ImageRouter multi-engine routing (Qwen/Stability/OpenAI/local SD)
Auto-save to ~/.openpilot/generated/
PendingFiles mechanism for auto-sending to Channel

Document Generation

PDF generation (Markdown → HTML → PDF, supports Puppeteer rendering)
PPT generation (JSON slides → .pptx, theme configurable)
PendingFiles mechanism for auto-sending to Channel

Voice Interaction (STT/TTS)

VoiceService unified voice service
STT multi-engine: Google Gemini, OpenAI Whisper, DashScope Qwen Omni
TTS multi-engine: Edge TTS (free), OpenAI TTS
Voice message loop (inbound mode: voice in → voice out)
OGG/Opus → MP3 auto-transcoding (ffmpeg)
DashScope SSE streaming response parsing

Video Editing (Phase 1 MVP)

FFmpeg Guardian (Fail-Fast dependency check)
video_probe_tool (ffprobe metadata probing)
video_edit_tool (Timeline JSON → FFmpeg deterministic rendering)
Timeline JSON validator (format, time range, speed range)
FFmpeg command builder (trim, speed_up, add_subtitle)
PendingFiles mechanism for auto-sending to Channel
VideoConfig integration (ffmpegPath, outputDir, renderTimeout)

API Gateway

REST API: 60+ endpoints (sessions, chat, models, agents, channels, config, skills, cron, polymarket, stocks, trading, etc.)
WebSocket streaming chat (stream_start → stream_chunk → tool_call_start → tool_call_result → stream_end)
Concurrent request guard (no parallel requests for the same session)
Request rate limiting + input validation + security middleware (Helmet)
Container health probes (/healthz, /readyz)
Static asset serving (Control UI SPA)

Control UI Panel

Configuration System

JSON5 config file (~/.openpilot/config.json5)
Environment variable overrides
API dynamic read/write + persistence
API Key masking protection
Deep merge updates
Schema-driven field labels and descriptions in UI

Skill System

Built-in skill status reports
Community skill dual source (ClawHub + SkillsMP)
Keyword search + AI semantic search
One-click install

Test Coverage

Backend: 40+ test suites, 970+ test cases passing
Frontend: 20 test suites, 154 test cases passing
Multi-agent collaboration tests (single-channel 21 cases + cross-channel 20 cases)
Discord integration tests (43 cases)
E2E production readiness tests
StockScanner unit tests + property-based tests
Stock tools unit tests + property-based tests
Trading module: 12 test suites (unit + property-based tests with fast-check)
Tool catalog tests

Project Structure

openpilot/
├── src/                        # Backend source code
│   ├── index.ts                # Entry point, full bootstrap flow
│   ├── api/                    # Express API gateway + WebSocket
│   │   ├── server.ts           # 60+ REST endpoints + WS streaming
│   │   ├── tradingRoutes.ts    # Trading API endpoints (orders, config, pipeline, stop-loss)
│   │   └── middleware.ts       # Rate limiting, input validation, security
│   ├── runtime/                # AI runtime
│   │   ├── AIRuntime.ts        # Core execution engine (retry, failover, concurrency)
│   │   └── sandbox.ts          # Sandbox isolation
│   ├── pi-agent/               # Pi Agent ReAct loop
│   │   ├── PiAgent.ts          # ReAct main loop
│   │   └── PiSession.ts        # Session transcript management
│   ├── models/                 # Model providers
│   │   ├── ModelManager.ts     # Model discovery, config, rotation, failover
│   │   ├── OpenAIProvider.ts   # OpenAI / compatible APIs
│   │   ├── AnthropicProvider.ts
│   │   └── GeminiProvider.ts   # Google Generative AI
│   ├── session/                # Session persistence
│   │   ├── SessionManager.ts   # SQLite + LRU Cache + JSONL
│   │   └── database.ts         # Schema initialization
│   ├── channels/               # Multi-channel system
│   │   ├── types.ts            # Channel plugin abstraction layer
│   │   ├── ChannelManager.ts   # Unified management + routing + health check
│   │   ├── TelegramChannel.ts  # Telegram (grammy)
│   │   ├── DiscordChannel.ts   # Discord (discord.js v14)
│   │   ├── SlackChannel.ts     # Slack
│   │   ├── CommandLane.ts      # Concurrency control lane
│   │   ├── InboundDebouncer.ts # Inbound debouncing
│   │   └── PairingStore.ts     # Device pairing
│   ├── agents/                 # Agent management
│   │   ├── AgentManager.ts     # CRUD + files + identity
│   │   ├── SubagentRegistry.ts # Sub-agent lifecycle
│   │   └── types.ts            # AgentInfo types
│   ├── tools/                  # Tool system
│   │   ├── ToolExecutor.ts     # Executor + hook chain
│   │   ├── PolicyEngine.ts     # Policy engine
│   │   ├── auditHook.ts        # Audit logging
│   │   ├── toolCatalog.ts      # Tool catalog (38+ tools, 16 categories)
│   │   ├── fileTools.ts        # File operations
│   │   ├── networkTools.ts     # HTTP requests
│   │   ├── shellTools.ts       # Shell commands
│   │   ├── browserTools.ts     # Browser automation
│   │   ├── patchTools.ts       # Code patching
│   │   ├── memoryTools.ts      # Persistent memory
│   │   ├── screenTools.ts      # Screen capture/recording
│   │   ├── polymarketTools.ts  # Polymarket market tools
│   │   ├── stockTools.ts       # Stock analysis tools
│   │   ├── imageTools.ts       # Image generation (multi-engine)
│   │   ├── documentTools.ts    # Document generation (PDF/PPT)
│   │   ├── voiceTools.ts       # Voice tools (STT/TTS/status)
│   │   ├── videoTools.ts       # Video editing (probe/render)
│   │   └── subAgentTools.ts    # Sub-agent invocation
│   ├── cron/                   # Scheduled tasks
│   │   └── CronScheduler.ts    # Cron scheduler (SQLite persistence)
│   ├── services/               # Business services
│   │   ├── PolymarketScanner.ts # Market scanning + AI analysis
│   │   ├── StockScanner.ts     # Stock technical analysis + AI signals
│   │   ├── StrategyEngine.ts   # Technical indicator strategy engine
│   │   ├── PortfolioManager.ts # Position tracking + P&L analysis
│   │   ├── SignalTracker.ts    # Signal lifecycle management
│   │   ├── NotificationService.ts # Push notifications (Telegram/Discord)
│   │   ├── VoiceService.ts     # Voice service (STT/TTS multi-engine)
│   │   ├── ImageRouter.ts      # Image generation routing (multi-engine)
│   │   └── trading/            # Automated trading system
│   │       ├── types.ts        # Trading type definitions
│   │       ├── tradingSchema.ts # DB schema (orders, stop-loss, audit, etc.)
│   │       ├── AutoTradingPipeline.ts # Signal polling + order orchestration
│   │       ├── TradingGateway.ts # Unified entry point (paper/live routing)
│   │       ├── LongportAdapter.ts # Longport broker API adapter
│   │       ├── SignalEvaluator.ts # Confidence filtering + dedup
│   │       ├── QuantityCalculator.ts # Order sizing (fixed/kelly)
│   │       ├── StopLossManager.ts # Stop-loss/take-profit monitoring
│   │       ├── TradeNotifier.ts # Trade notification formatting
│   │       ├── OrderManager.ts  # Order CRUD + status transitions
│   │       ├── RiskController.ts # Pre-order risk checks
│   │       ├── PaperTradingEngine.ts # Local paper engine fallback
│   │       └── PositionSyncer.ts # Position sync with broker
│   ├── skills/                 # Skill system
│   │   ├── community.ts        # Community skills (ClawHub + SkillsMP)
│   │   └── types.ts
│   ├── config/                 # Configuration system
│   │   └── index.ts            # JSON5 loading + env var overrides + persistence
│   ├── plugins/                # Plugin system
│   │   └── PluginManager.ts
│   ├── types/                  # Core type definitions
│   │   └── index.ts
│   └── logger.ts               # Structured logging
├── scripts/                    # Python scripts
│   ├── stock_analysis.py       # Technical indicator calculation (SMA, RSI, MACD, BB)
│   ├── backtest_engine.py      # Strategy backtesting engine
│   └── requirements.txt        # Python dependencies (yfinance, pandas)
├── frontend/                   # Frontend Control UI
│   ├── src/
│   │   ├── App.tsx             # Main app + routing
│   │   ├── components/
│   │   │   ├── chat/           # Chat components (message list, input, tool calls)
│   │   │   ├── charts/         # Chart components (K-line chart)
│   │   │   ├── session/        # Session list
│   │   │   ├── model/          # Model selector
│   │   │   ├── common/         # Common components (confirm dialog, error banner, progress bar)
│   │   │   ├── layout/         # Layout (sidebar, topbar)
│   │   │   ├── views/          # Page views (20+: Chat, Sessions, Trading Dashboard, Auto Trading, etc.)
│   │   │   └── tools/          # Audit log component
│   │   ├── stores/             # Zustand state management (uiStore, tradingStore)
│   │   ├── services/           # API client
│   │   ├── hooks/              # Custom hooks
│   │   └── types/              # Frontend types
│   └── index.html
├── data/                       # Data directory
│   ├── sessions.db             # SQLite database
│   └── sessions-jsonl/         # JSONL audit logs
├── dist/                       # Backend compiled output
└── frontend/dist/              # Frontend build output → dist/control-ui/

Quick Start

Requirements

Node.js >= 20
npm >= 9
Python >= 3.10 (for stock analysis and voice features; auto-configured on startup)

Installation

# Clone the project
git clone https://github.com/chenjy16/OpenPilot.git
cd openpilot

# Install backend dependencies
npm install

# Install frontend dependencies
cd frontend && npm install && cd ..

Configuration

# Copy the environment variable template
cp .env.example .env

# Edit .env, configure at least one AI provider API Key:
#   OPENAI_API_KEY=sk-...
#   ANTHROPIC_API_KEY=sk-ant-...
#   GOOGLE_AI_API_KEY=AIza...

# (Optional) Configure channel Bot tokens:
#   TELEGRAM_BOT_TOKEN=...
#   DISCORD_BOT_TOKEN=...

# (Optional) Configure Finnhub API Key for stock analysis:
#   FINNHUB_API_KEY=...
#   (Can also be set via the Web UI under Quant Copilot config section)

# (Optional) Configure Longport for auto quant trading:
#   Credentials are configured via the Web UI Trading Config panel
#   (app_key, app_secret, paper_access_token from https://open.longbridge.com)

Advanced configuration uses JSON5 format, located at ~/.openpilot/config.json5:

{
  // Channel configuration
  channels: {
    telegram: { enabled: true, token: "..." },
    discord: { enabled: true, token: "..." },
  },
  // Gateway configuration
  gateway: {
    port: 3000,
    bind: "loopback",  // loopback | lan | auto | custom
  },
  // Custom model providers
  models: {
    providers: {
      qwen: {
        apiKey: "sk-...",
        baseUrl: "https://dashscope.aliyuncs.com/compatible-mode/v1",
        models: [{ id: "qwen3.5-flash", name: "qwen3.5-flash" }],
      },
    },
  },
  // PolyOracle notification config
  polymarket: {
    notify: {
      enabled: true,
      telegram: { chatId: "your-chat-id" },
      minEdge: 0.10,
    },
  },
  // Quant Copilot config
  stockAnalysis: {
    finnhubApiKey: "your-finnhub-api-key",
    watchlist: "AAPL,GOOGL,MSFT,TSLA",
    signalThreshold: 0.6,
  },
  // Voice configuration
  voice: {
    stt: { model: "qwen3-omni-flash/qwen3-omni-flash", language: "zh" },
    tts: { auto: "inbound", model: "edge/default", voice: "zh-CN-XiaoxiaoNeural" },
  },
  // Video editing configuration
  video: {
    outputDir: "~/.openpilot/generated/video",
    renderTimeout: 120000,
  },
}

Build

# Build backend
npm run build

# Build frontend (outputs to dist/control-ui/)
cd frontend && npx vite build && cd ..

Run

# Start the service (single process, includes API + WebSocket + Control UI)
node dist/index.js

After startup:

Control UI: http://127.0.0.1:3000 (browser access)
API Gateway: http://127.0.0.1:3000/api/
WebSocket: ws://127.0.0.1:3000/ws
Health Check: http://127.0.0.1:3000/healthz

Testing

# Backend tests
npx jest --testPathIgnorePatterns="database.test" --no-coverage

# Frontend tests
cd frontend && npx vitest --run

Deployment

Single Process (Recommended)

NODE_ENV=production node dist/index.js

systemd Service

[Unit]
Description=OpenPilot AI Agent Platform
After=network.target

[Service]
Type=simple
User=openpilot
WorkingDirectory=/opt/openpilot
ExecStart=/usr/bin/node dist/index.js
Restart=on-failure
RestartSec=5
Environment=NODE_ENV=production

[Install]
WantedBy=multi-user.target

Docker

FROM node:20-slim
WORKDIR /app
COPY package*.json ./
RUN npm ci --production
COPY dist/ ./dist/
COPY scripts/ ./scripts/
COPY frontend/dist/ ./dist/control-ui/
EXPOSE 3000
HEALTHCHECK CMD curl -f http://localhost:3000/healthz || exit 1
CMD ["node", "dist/index.js"]

Network Binding Modes

Mode	Bind Address	Use Case
`loopback`	127.0.0.1	Local development (default)
`lan`	0.0.0.0	LAN access
`auto`	0.0.0.0	Auto-detect
`custom`	Custom	Specify `gateway.customBindHost`

API Overview

Category	Endpoint	Description
Health	`GET /healthz`	Container liveness probe
Health	`GET /readyz`	Readiness probe (includes DB check)
Chat	`POST /api/chat`	Synchronous chat
Chat	`WS /ws`	Streaming chat (WebSocket)
Sessions	`GET/POST/DELETE /api/sessions`	Session CRUD
Models	`GET /api/models`	Model catalog
Agents	`GET/POST/PUT/DELETE /api/agents`	Agent CRUD
Channels	`GET /api/channels`	Channel status
Skills	`GET /api/skills/community/search`	Community search
Config	`GET/PUT /api/config`	System config read/write
Config	`GET /api/config/schema`	Config field metadata (labels, descriptions)
Cron	`GET/POST/PUT/DELETE /api/cron/jobs`	Scheduled tasks
Polymarket	`GET /api/polymarket/markets`	Market data
Polymarket	`GET /api/polymarket/signals`	AI signals
Polymarket	`POST /api/polymarket/scan`	Manual scan trigger
Stocks	`POST /api/stocks/analyze`	Stock technical analysis + AI signal
Trading	`GET /api/trading/orders`	Order list (with filter)
Trading	`POST /api/trading/orders`	Place order
Trading	`GET/PUT /api/trading/config`	Trading config (mode, auto-trade, quantity, etc.)
Trading	`GET /api/trading/account`	Account info (paper or live)
Trading	`GET /api/trading/positions`	Current positions
Trading	`GET /api/trading/pipeline/status`	Auto-trading pipeline status
Trading	`GET /api/trading/pipeline/signals`	Recent processed signals
Trading	`GET /api/trading/stop-loss`	Active stop-loss monitors
Usage	`GET /api/usage`	Token usage statistics
Audit	`GET /api/audit-logs`	Tool audit logs

Supported Models

Provider	Models
OpenAI	gpt-5.2, gpt-5.1, gpt-5, gpt-4o, o3, o4-mini, etc.
Anthropic	claude-opus-4, claude-sonnet-4, claude-haiku-4.5, etc.
Google	gemini-3-pro, gemini-2.5-pro, gemini-2.5-flash, gemini-2.0-flash, etc.
DeepSeek	deepseek-chat, deepseek-reasoner
Qwen	qwen3.5-plus, qwen3.5-flash (via OpenAI-compatible API)
Ollama	Local models (auto-discovered)
OpenRouter / Together / Moonshot / Doubao / MiniMax	Via OpenAI-compatible API

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
docs		docs
frontend		frontend
scripts		scripts
skills/summarize		skills/summarize
src		src
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
jest.config.js		jest.config.js
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Folders and files

Latest commit

History

Repository files navigation