Trio Core

Open-source camera intelligence for your network

ONVIF discovery + RTSP streaming + YOLO detection + VLM scene understanding.
One pip install. Runs on any Mac, Linux, or Windows machine.

What is Trio Core?

Trio Core is the open-source camera agent for Trio AI. It runs on your local network, discovers cameras via ONVIF, and either:

Relays video to Trio Cloud for cloud ingest, analysis, and dashboards
Runs AI locally for live monitoring, counting, and scene understanding on your own hardware

┌──────────────────────────────────────────────────────────┐
│  Your Network                                            │
│                                                          │
│  IP Camera ──RTSP──► Trio Core ──HTTPS──► Trio Cloud     │
│                      (this repo)         (paid, $99/cam) │
│                          │                               │
│                          └── or run AI locally           │
│                             (free, open source)          │
└──────────────────────────────────────────────────────────┘

Core capabilities:

Discover — Auto-find cameras on your network via ONVIF and resolve RTSP URLs
Monitor — Live RTSP camera analysis with watch prompts, object counts, and event digests
Relay — Stream RTSP, webcam, or video-file sources to Trio Cloud over HTTP MPEG-TS
Analyze — Run scene understanding on images and videos from the CLI or API
Serve — Expose local inference APIs for detection and description
Tailscale auto-proxy — Works through Tailscale networks automatically

Quick Start

# Install
pip install 'trio-core[mlx]'      # Apple Silicon local AI
pip install 'trio-core[cuda]'     # NVIDIA GPU local AI
pip install trio-core             # Discovery, relay, API

# Check your setup
trio doctor

# Discover cameras on your network
trio discover

# Start local monitoring (Apple Silicon default)
trio cam --rtsp rtsp://admin:pass@192.168.1.100/stream

# Or relay to Trio Cloud
trio relay --camera rtsp://admin:pass@192.168.1.100/stream \
           --token YOUR_TOKEN

Two Modes

Mode 1: Cloud Relay (Trio Cloud customers)

Trio Core takes an RTSP stream, webcam, or video file, registers a camera with Trio Cloud, and pushes HTTP MPEG-TS to the cloud ingest endpoint. All AI processing happens in the cloud.

trio relay --camera rtsp://admin:pass@192.168.1.100/stream \
           --token YOUR_TOKEN

What Cloud does: session management → ingest → analysis → dashboard

Mode 2: Local AI (open-source users)

Run everything locally on your own machine. No cloud needed, no subscription.

# Default local monitor (Apple Silicon)
trio cam --rtsp rtsp://admin:pass@192.168.1.100/stream

# Count objects
trio cam --rtsp rtsp://... --count

# Smart event digest
trio cam --rtsp rtsp://... --digest

# Analyze a saved image or video
trio analyze photo.jpg -q "What's here?"

Note: Use --model and --backend to override the default local model selection. Local mode gives you real-time detection and descriptions, but no persistent memory, entity tracking, historical analytics, or dashboard. For those features, use Trio Cloud.

Features

ONVIF Camera Discovery

trio discover
# Found 2 camera(s):
#   [1] Reolink RLC-810A (192.168.1.100)
#       RTSP: rtsp://192.168.1.100:554/h264Preview_01_main
#   [2] Hikvision DS-2CD2143 (192.168.1.101)
#       RTSP: rtsp://192.168.1.101:554/Streaming/Channels/101

Tailscale Auto-Proxy

If you use Tailscale, Trio Core automatically detects when the macOS network extension blocks camera access and creates a transparent TCP proxy:

trio cam --rtsp rtsp://admin:pass@192.168.1.100/stream
# Tailscale detected — starting TCP proxy via system Python...
# Proxy: 127.0.0.1:15554 → 192.168.1.100:554
# (continues normally, user sees no difference)

YOLO Object Detection

Built-in YOLOv10n (ONNX, 9MB) with tiled detection for accuracy:

trio cam --rtsp rtsp://... --count
# [14:23:46] People: 3, Vehicles: 2
# [14:24:12] People: 5, Vehicles: 2 (+2 people)

VLM Scene Description

Supports multiple local AI configurations:

Mode	Command	Notes
Default local monitor	`trio cam --rtsp ...`	Uses the built-in default model for live monitoring
Custom local model	`trio cam --rtsp ... --model <MODEL_ID>`	Override the Hugging Face model ID
Transformers backend	`trio analyze photo.jpg --backend transformers --model Qwen/Qwen2.5-VL-3B-Instruct`	CUDA or CPU
Adapter / fine-tune	`trio cam --rtsp ... --adapter ./adapter_dir`	Load a LoRA adapter directory

CLI

trio discover                                 # Find cameras via ONVIF
trio cam --rtsp rtsp://... --count            # Live monitor + object counts
trio cam --host 192.168.1.100 -p pass         # Resolve RTSP via ONVIF + monitor
trio cam --rtsp rtsp://... --digest           # Event timeline with scene understanding
trio relay --camera rtsp://... --token ...    # Relay to Trio Cloud
trio relay --discover -p pass --token ...     # Discover a camera and relay it
trio serve                                    # Start inference API server
trio analyze photo.jpg -q "What's here?"      # Analyze a single image or video
trio webcam -w "person at the door"           # Webcam monitor with alerts
trio smoke                                    # End-to-end smoke test
trio doctor                                   # Diagnose setup issues
trio device                                   # Show hardware info
trio claw --camera rtsp://... --gateway ws://127.0.0.1:18789  # OpenClaw node

API Reference

Start the local inference server:

trio serve                          # default: 0.0.0.0:8100
trio serve --port 9000              # custom port
TRIO_API_KEY=secret trio serve      # enable auth

`POST /api/inference/detect`

YOLO object detection.

curl -X POST http://localhost:8100/api/inference/detect \
  -H "Content-Type: application/json" \
  -d '{"image_b64": "'$(base64 -i photo.jpg)'"}'

{
  "people_count": 3, "vehicle_count": 1,
  "by_class": {"person": 3, "car": 1},
  "crops_b64": [{"class": "person", "bbox": [100, 50, 200, 300], "confidence": 0.92}],
  "elapsed_ms": 45
}

`POST /api/inference/describe`

VLM scene description.

curl -X POST http://localhost:8100/api/inference/describe \
  -H "Content-Type: application/json" \
  -d '{"image_b64": "'$(base64 -i photo.jpg)'", "prompt": "Describe what you see."}'

`POST /api/inference/crop-describe`

Combined: YOLO detects → crop → VLM describes each entity → full scene description.

Python SDK

from trio_core import TrioCore, EngineConfig

engine = TrioCore()
engine.load()

result = engine.analyze_video("photo.jpg", "What do you see?")
print(result.text)

Supported Models

Tier 1 — Full optimization (native loading + visual token compression + KV reuse)

Model	Params	4-bit VRAM	Best for
Qwen3-VL-8B	8B	~5GB	Recommended — best accuracy
Qwen2.5-VL-3B	3B	~2GB	Fast, lightweight
Qwen3.5	0.8-9B	0.5-5G	Flexible range
InternVL3	1-2B	1-1.6G	Tiny devices

Tier 2 — Inference only (via mlx-vlm)

Gemma 3n, SmolVLM2, Phi-4, FastVLM, and any model supported by mlx-vlm.

Architecture

                          Trio Core
                              |
              +---------------+---------------+
              |                               |
         YOLO Pipeline                   VLM Pipeline
              |                               |
    YOLOv10n ONNX (9MB)            Qwen/Claude/GPT/any LLM
    tiled 2x2 detection              native MLX loading
    ByteTrack tracking               ToMe token compression
              |                       KV cache reuse
              |                               |
              +---------------+---------------+
                              |
              +-------+-------+-------+
              |       |       |       |
          /detect  /describe  /crop   Relay
                              -describe  to Cloud

Trio Cloud integration

When connected to Trio Cloud, Edge is just a lightweight relay:

Camera → Edge (RTSP pull + compress) → Trio Cloud (all AI in cloud)

Edge sends ~50-100 KB/s per camera. No GPU needed on the edge device.

Configuration

Variable	Default	Description
`TRIO_MODEL`	`Qwen3-VL-8B-4bit`	HuggingFace model ID
`TRIO_YOLO_MODEL`	(auto-downloaded)	Path to YOLO ONNX model
`TRIO_API_KEY`	(none)	Bearer token for API auth
`TRIO_CLOUD_URL`	(none)	Trio Cloud API URL for relay mode
`TRIO_CLOUD_TOKEN`	(none)	Trio Cloud auth token

See src/trio_core/config.py for all options.

Troubleshooting

Problem	Solution
`trio discover` finds no cameras	Make sure cameras are on the same subnet. Some routers block multicast.
Camera found but can't connect	Check username/password. Try `trio cam --rtsp rtsp://admin:pass@IP/stream` directly.
Tailscale blocking camera access	Trio Core auto-detects this and creates a proxy. If it doesn't work, try `trio doctor`.
First run slow	Model download (~2-5 GB). Subsequent runs start instantly.
Out of memory	Use a smaller model: `TRIO_MODEL=mlx-community/Qwen2.5-VL-3B-Instruct-4bit`

Run trio doctor to diagnose most issues.

License

Apache 2.0 — see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 260 Commits
.github		.github
adapters/surveillance-qwen35-2b		adapters/surveillance-qwen35-2b
assets		assets
data/eval_benchmark		data/eval_benchmark
deploy		deploy
docs/superpowers/specs		docs/superpowers/specs
examples		examples
experiments		experiments
models/yolov10n/onnx		models/yolov10n/onnx
research		research
scripts		scripts
src/trio_core		src/trio_core
surveillance_vqa		surveillance_vqa
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
GLOSSARY.md		GLOSSARY.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Trio Core

What is Trio Core?

Quick Start

Two Modes

Mode 1: Cloud Relay (Trio Cloud customers)

Mode 2: Local AI (open-source users)

Features

ONVIF Camera Discovery

Tailscale Auto-Proxy

YOLO Object Detection

VLM Scene Description

CLI

API Reference

`POST /api/inference/detect`

`POST /api/inference/describe`

`POST /api/inference/crop-describe`

Python SDK

Supported Models

Tier 1 — Full optimization (native loading + visual token compression + KV reuse)

Tier 2 — Inference only (via mlx-vlm)

Architecture

Trio Cloud integration

Configuration

Troubleshooting

License

About

Uh oh!

Releases 7

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Trio Core

What is Trio Core?

Quick Start

Two Modes

Mode 1: Cloud Relay (Trio Cloud customers)

Mode 2: Local AI (open-source users)

Features

ONVIF Camera Discovery

Tailscale Auto-Proxy

YOLO Object Detection

VLM Scene Description

CLI

API Reference

POST /api/inference/detect

POST /api/inference/describe

POST /api/inference/crop-describe

Python SDK

Supported Models

Tier 1 — Full optimization (native loading + visual token compression + KV reuse)

Tier 2 — Inference only (via mlx-vlm)

Architecture

Trio Cloud integration

Configuration

Troubleshooting

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 7

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

`POST /api/inference/detect`

`POST /api/inference/describe`

`POST /api/inference/crop-describe`

Packages