Your browser becomes a private AI computer.
The Local AI Mission Console is a minimal, elegant, and powerful demonstration of local Edge AI. It bridges the gap between browser-side execution (using the RunAnywhere Web SDK) and local backend inference (using Ollama), creating a fully private, multimodal AI pipeline.
Traditional AI applications send your voice, text, and images to centralized servers. The Local AI Mission Console flips this model: it brings the AI directly to your hardware.
By running Vision-Language Models (VLMs) and Text-to-Speech (TTS) locally, it guarantees:
- Low Latency: No network round-trips to a remote server.
- Absolute Privacy: Your camera feed and data never leave your machine.
- Offline Capability: Works completely decoupled from the cloud.
This project pioneers a hybrid "Local-Edge" architecture, optimally distributing heavy AI workloads across your machine's resources:
```mermaid
graph TD
    A[Camera Feed] -->|Frames| B[Web Browser App]
    B -->|Base64 Image| C{{Ollama Local API}}
    C -->|Qwen2.5-VL / Llama3| D[Scene Description]
    D -->|Text| B
    B -->|Sherpa-ONNX WASM| E[Piper TTS Engine]
    E -->|PCM Audio| F[Browser AudioContext]
    style B fill:#f9f,stroke:#333,stroke-width:2px
    style C fill:#bbf,stroke:#333,stroke-width:2px
    style E fill:#bfb,stroke:#333,stroke-width:2px
```
- Perception: The browser safely accesses your webcam and extracts a high-quality frame.
- Reasoning (Ollama): Because Vision-Language Models (like `qwen2.5-vl`) are GPU-intensive, the heavy lifting is offloaded to Ollama running locally on your hardware. This keeps the browser lightweight.
- Synthesis (RunAnywhere / WebAssembly): The text response is streamed back to the browser, where the RunAnywhere Web SDK initializes a WebAssembly-compiled Sherpa-ONNX engine. It uses a Piper `en_US-amy-low` voice model to synthesize studio-quality speech entirely within the browser's memory.
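The Reasoning stage maps onto a single POST to Ollama's local `/api/generate` endpoint, passing the captured frame as a base64 image. A minimal TypeScript sketch — the function names are illustrative, not the app's actual pipeline code:

```typescript
// Sketch of the Reasoning stage: send a captured frame to Ollama's
// local generate endpoint and get back a scene description.
interface OllamaRequest {
  model: string;
  prompt: string;
  images: string[]; // base64-encoded frames, without the "data:" prefix
  stream: boolean;
}

// Pure request builder; model name and prompt are illustrative defaults.
function buildSceneRequest(frameBase64: string, model = "qwen2.5-vl"): OllamaRequest {
  return {
    model,
    prompt: "Describe what you see in this camera frame in two sentences.",
    images: [frameBase64],
    stream: false,
  };
}

async function describeFrame(frameBase64: string): Promise<string> {
  const res = await fetch("http://localhost:11434/api/generate", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(buildSceneRequest(frameBase64)),
  });
  const data = await res.json();
  return data.response; // Ollama returns the full text in `response` when stream=false
}
```

With `stream: false` the whole description arrives in one JSON payload; a streaming variant would read the response body line by line instead.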
To achieve flawless in-browser TTS, the app uses an advanced WASM filesystem hydration technique:
- The app fetches a single `.tgz.bin` archive containing the Piper `.onnx` model, `tokens.txt`, and over 350 files of `espeak-ng-data`.
- This archive is manually extracted directly into the Sherpa-ONNX virtual filesystem.
- The TTS provider is accessed via the SDK's `ExtensionPoint` registry to guarantee a true physical singleton, preventing Vite module-duplication errors.
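The hydration step can be sketched as a minimal tar walk over the (already gunzipped) archive, writing each entry into the engine's virtual filesystem. The `writeFile` callback below stands in for whatever write API the WASM module actually exposes (e.g. Emscripten's `FS.writeFile`) — that detail is an assumption:

```typescript
// Minimal tar-entry iterator: tar files are a sequence of 512-byte headers,
// each followed by the file body padded to a 512-byte boundary.
function extractTar(
  buf: Uint8Array,
  writeFile: (name: string, data: Uint8Array) => void,
): void {
  const dec = new TextDecoder();
  let off = 0;
  while (off + 512 <= buf.length) {
    const header = buf.subarray(off, off + 512);
    // Entry name: first 100 bytes, NUL-padded.
    const name = dec.decode(header.subarray(0, 100)).replace(/\0.*$/, "");
    if (!name) break; // two all-zero blocks mark end of archive
    // Size field: 12 bytes at offset 124, octal ASCII.
    const size =
      parseInt(dec.decode(header.subarray(124, 136)).replace(/\0.*$/, "").trim(), 8) || 0;
    const body = buf.subarray(off + 512, off + 512 + size);
    // Typeflag at offset 156: '0' or NUL means a regular file.
    if (header[156] === 48 || header[156] === 0) writeFile(name, body);
    off += 512 + Math.ceil(size / 512) * 512; // skip body plus padding
  }
}
```

In the real app the gunzip step would come first (e.g. via `DecompressionStream("gzip")`), and `writeFile` would also create any intermediate directories the `espeak-ng-data` tree needs.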
- Ollama: Download and install Ollama.
- Vision Model: Pull a robust VLM (we recommend Qwen for vision):

  ```
  ollama run qwen2.5-vl
  ```

  (Note: you can use any model Ollama supports; the UI will autodetect them.)
- Camera Access: A webcam is required.
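The autodetection mentioned above can be done via Ollama's `/api/tags` endpoint, which lists every locally pulled model. A small sketch (the app's actual implementation may differ):

```typescript
// Ollama's /api/tags response contains a `models` array; each entry's
// `name` is the tag you pass to /api/generate.
interface OllamaTag {
  name: string;
}

// Pure helper: pull the model names out of the /api/tags payload.
function extractModelNames(payload: { models: OllamaTag[] }): string[] {
  return payload.models.map((m) => m.name);
}

async function listLocalModels(base = "http://localhost:11434"): Promise<string[]> {
  const res = await fetch(`${base}/api/tags`);
  return extractModelNames(await res.json());
}
```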
1. Clone the repository and install dependencies:

   ```
   npm install
   ```

2. Run the development server:

   ```
   npm run dev
   ```

3. Open the app at `http://localhost:5173`, select your model, and click "Analyze Scene"!
The Local AI Mission Console is designed as a foundation. Here are ways you can extend it:
- Continuous Monitoring (Dashcam Mode): Modify `pipeline.ts` to run a `setInterval` loop, creating an AI that narrates what it sees every 10 seconds.
- Security Guard: Add an instruction to the Ollama prompt: "Only respond if you see a person; describe their clothing." Tie this to an alert system.
- Accessibility Aide: Deploy this on a mobile browser. Users can point their phone at signs or documents, and the local AI reads it aloud to them.
- Local RAG Integration: Instead of just describing the scene, pass the description to a local ChromaDB instance to retrieve context before generating the speech.
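The dashcam idea can be sketched as a guarded `setInterval` loop. Here `captureFrame`, `describeFrame`, and `speak` are hypothetical stand-ins for the existing stages in `pipeline.ts`:

```typescript
// Dashcam-mode sketch: run the capture → describe → speak pipeline on a
// fixed interval, skipping a tick if the previous run is still in flight.
function startDashcam(
  captureFrame: () => Promise<string>,            // returns a base64 frame
  describeFrame: (b64: string) => Promise<string>, // VLM scene description
  speak: (text: string) => Promise<void>,          // local TTS playback
  intervalMs = 10_000,
): () => void {
  let busy = false;
  const id = setInterval(async () => {
    if (busy) return; // don't queue overlapping inference runs
    busy = true;
    try {
      const text = await describeFrame(await captureFrame());
      await speak(text);
    } finally {
      busy = false;
    }
  }, intervalMs);
  return () => clearInterval(id); // call the returned function to stop
}
```

The `busy` flag matters on slower hardware: if a VLM call takes longer than the interval, ticks are dropped rather than piled up behind each other.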
- [RunAnywhere Web SDK](https://github.com/RunanywhereAI/runanywhere-sdks/tree/main/sdk/runanywhere-web): Standardized APIs for bridging WASM AI, Audio/Video capture, and device capabilities.
- Ollama: Local LLM/VLM inference engine.
- Vite & TypeScript: Fast, typed frontend tooling.
- Piper/Sherpa-ONNX: Blazing fast offline Text-to-Speech.