HandOff 🤲

Hand work off to an AI agent that actually uses the web for you — and gets smarter every run.

HandOff is a Chrome extension that combines 9 LLM providers and the Ark Vision Engine with a self-learning engine. It sees, clicks, types, and verifies — and remembers what worked, what failed, and how to do it better next time.

Features

🖱️ Visual Computer Use — AI sees your screen and interacts with any website
👁️ Ark Vision Engine — Proprietary vision engine for screenshot-based perception
🧠 Self-Learning Engine — Execution memory, failure analysis, and auto-skill generation
⚡ Auto-Skills — Repeated workflows become reusable skills with reliability tracking
🔄 Hybrid Brain — Adaptive mode switching between DOM, Vision, Memory, and Skill execution
⏸️ Human-in-the-Loop — Pause, resume, or override agent actions anytime
📊 Live Action Feed — Watch every step the agent takes in real-time
🔒 Privacy First — Everything runs locally in your browser, no external servers

What Makes HandOff Different

Feature	HandOff	Other Agents
Memory	Persistent — remembers every run	Stateless — forgets everything
Failures	Analyzes root cause, generates fix strategies, never repeats same mistake	Retries blindly
Skills	3+ successful runs → auto-generates reusable skill	Manual scripting
Vision	Ark Vision (self-hosted) + 9 LLM providers (cloud) with automatic fallback	Single model, no fallback
Transparency	Full dashboard: skills, site profiles, failure analysis	Black box

Use Cases

Form Filling — "Fill this CRM with my contact data"
Web Research — "Find all AI events in the next 60 days"
Data Extraction — "Extract all items into a structured table"
Workspace Cleanup — "Organize this board by priority"
Dashboard Audits — "Check all metrics and flag anomalies"

Quick Start

1. Install Dependencies

cd HandOff
npm install

2. Build the Extension

npm run build

3. Load in Chrome

Open chrome://extensions/
Enable "Developer mode" (top right)
Click "Load unpacked"
Select the dist folder

4. Configure

Click the HandOff extension icon
Open Settings (gear icon)
Enter your Gemini API key
(Optional) Enable Ark Vision Engine and set your server endpoint
Save

5. Start Using

Navigate to any website
Click the HandOff icon to open the side panel
Describe what you want done
Watch the agent work — and learn!

Ark Vision Engine (Optional)

Ark Vision gives HandOff a dedicated vision model for screenshot-based perception. It's optional — any LLM provider works great on its own, but Ark adds a self-hosted, privacy-first vision layer.

In HandOff Settings: enable Ark Vision Engine → set your server endpoint → click Test to verify.

If the server goes down mid-task, HandOff automatically falls back to the selected LLM provider.

Development

# Start dev server with hot reload
npm run dev

# Build for production
npm run build

Architecture

Chrome Extension (Manifest V3)
│
├─ Side Panel (React + Tailwind + Zustand)
│   ├─ Task Input
│   ├─ Action Feed (with learning step indicators)
│   ├─ Controls (Pause/Resume/Stop)
│   ├─ Learning Panel (skills, memory, site profiles)
│   └─ Settings (LLM config + Ark Vision toggle)
│
├─ Background Worker
│   ├─ Agent Core (orchestration + state machine)
│   ├─ Gemini Client (cloud LLM)
│   ├─ Ark Vision Client (self-hosted vision engine)
│   └─ Self-Learning Engine
│       ├─ Execution Memory (traces + patterns)
│       ├─ Failure Learning (analysis + fix strategies)
│       ├─ Auto-Skill Engine (workflow → reusable skill)
│       └─ Hybrid Brain (mode selection per action)
│
└─ Content Script
    └─ Action Executor (click/type/scroll/navigate)

Self-Learning Loop

Task starts
  → executionMemory.startTrace()
  → hybridBrain.decideMode() → picks Skill / Vision / DOM / Memory
  → If proven skill exists → skill-guided execution
  → Each action → executionMemory.recordAction()
  → On failure → failureLearning.analyze() → generates fix → retries with learned correction
  → On success → hybridBrain.recordModeOutcome() → updates site reliability
Task ends
  → executionMemory.completeTrace()
  → autoSkill.detectNewSkills()
  → Next time same task → faster, more reliable, zero re-teaching

Tech Stack

AI: 9 LLM providers + Ark Vision Engine (self-hosted, optional)
UI: React 18 + Tailwind CSS + Lucide Icons
State: Zustand
Build: Vite
Storage: Chrome Storage API (all local, no backend)
Extension: Chrome Manifest V3

Roadmap

License

MIT

Built with ❤️ for the future of work.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
icons		icons
public		public
scripts		scripts
src		src
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
manifest.json		manifest.json
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
sidepanel.html		sidepanel.html
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HandOff 🤲

Features

What Makes HandOff Different

Use Cases

Quick Start

1. Install Dependencies

2. Build the Extension

3. Load in Chrome

4. Configure

5. Start Using

Ark Vision Engine (Optional)

Development

Architecture

Self-Learning Loop

Tech Stack

Roadmap

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

HandOff 🤲

Features

What Makes HandOff Different

Use Cases

Quick Start

1. Install Dependencies

2. Build the Extension

3. Load in Chrome

4. Configure

5. Start Using

Ark Vision Engine (Optional)

Development

Architecture

Self-Learning Loop

Tech Stack

Roadmap

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages