An open-source Chrome extension for AI-powered browser automation with natural language commands. Similar to OpenAI Operator/Atlas.
- 🤖 Multi-Agent System - Planner, Navigator, and Validator agents work together
- 💬 Natural Language Commands - Just tell it what you want to do
- 🎯 Floating Toolbar - Quick access from any web page
- 📱 Side Panel UI - Full chat interface with action history
- 🔌 Multiple LLM Providers - OpenAI, Anthropic, Gemini, Ollama
- 🔒 Privacy First - Your API keys, your data
Coming soon
- Clone the repository: `git clone https://github.com/jasimea/xbrowser.git`, then `cd xbrowser`
- Install dependencies: `npm install`
- Build the extension: `npm run build`
- Load in Chrome:
  - Go to `chrome://extensions/`
  - Enable "Developer mode"
  - Click "Load unpacked"
  - Select the `dist/` folder
# Start development server with hot reload
npm run dev
# Type check
npm run type-check
# Build for production
npm run build

┌─────────────────────────────────────────────────────────────────┐
│                          Task Executor                          │
├─────────────────────────────────────────────────────────────────┤
│                                                                 │
│    ┌──────────────┐    ┌──────────────┐    ┌──────────────┐     │
│    │   Planner    │───>│  Navigator   │───>│  Validator   │     │
│    │    Agent     │    │    Agent     │    │    Agent     │     │
│    └──────────────┘    └──────────────┘    └──────────────┘     │
│            │                   │                   │            │
│       Strategy &            Execute             Verify          │
│        Planning             Actions             Results         │
│                                                                 │
└─────────────────────────────────────────────────────────────────┘
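The flow in the diagram can be sketched roughly as three stages chained by a task executor. This is only an illustration under stated assumptions: the `Plan`, `ActionResult`, `Verdict`, and `Agents` types, the `runTask` function, and the stub agents are all hypothetical names, not XBrowser's actual interfaces. Real agents would call an LLM asynchronously; the stubs here are synchronous to keep the example self-contained.

```typescript
// Hypothetical shapes for illustration; XBrowser's real types may differ.
interface Plan { task: string; steps: string[]; }
interface ActionResult { step: string; ok: boolean; }
interface Verdict { success: boolean; failedSteps: string[]; }

interface Agents {
  plan(task: string): Plan;                               // Planner: strategy and planning
  navigate(plan: Plan): ActionResult[];                   // Navigator: execute actions
  validate(plan: Plan, results: ActionResult[]): Verdict; // Validator: verify results
}

// Runs the three stages in order: Planner -> Navigator -> Validator.
function runTask(task: string, agents: Agents): Verdict {
  const plan = agents.plan(task);
  const results = agents.navigate(plan);
  return agents.validate(plan, results);
}

// Stub agents standing in for LLM-backed ones (assumption: no network calls).
const stubAgents: Agents = {
  // Splits the task on " then " to fake a multi-step plan.
  plan: (task) => ({ task, steps: task.split(" then ").map((s) => s.trim()) }),
  // Pretends every non-empty step succeeds.
  navigate: (plan) => plan.steps.map((step) => ({ step, ok: step.length > 0 })),
  // Succeeds only if no step failed.
  validate: (_plan, results) => {
    const failedSteps = results.filter((r) => !r.ok).map((r) => r.step);
    return { success: failedSteps.length === 0, failedSteps };
  },
};

const verdict = runTask("open the login page then click the login button", stubAgents);
console.log(verdict.success); // prints true
```

Keeping the executor independent of how each agent is implemented makes it possible to swap a stub for an LLM-backed agent without touching the control flow.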
- TypeScript - Type-safe code
- React 18 - UI components
- Tailwind CSS - Styling
- Vite - Build tool
- LangChain.js - AI framework
- Zustand - State management
- Zod - Schema validation
- Click the XBrowser extension icon
- Go to the Settings tab
- Enter your API keys for the LLM providers you want to use
- Assign models to each agent (Planner, Navigator, Validator)
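As a rough sketch of what the settings described above might look like as data, the example below maps each agent to a provider and model and checks which agents still lack an API key. The `Settings` shape, the `rolesMissingKeys` helper, and the model names are hypothetical placeholders, not the extension's actual storage format. It also assumes a local Ollama setup needs no API key.

```typescript
// Providers and agent roles named in this README; everything else is hypothetical.
type Provider = "openai" | "anthropic" | "gemini" | "ollama";
type AgentRole = "planner" | "navigator" | "validator";

interface ModelAssignment { provider: Provider; model: string; }

interface Settings {
  apiKeys: Partial<Record<Provider, string>>;  // only the providers you use
  agents: Record<AgentRole, ModelAssignment>;  // one model per agent
}

// Returns the agent roles whose provider has no API key configured.
// Assumption: Ollama runs locally and does not require a key.
function rolesMissingKeys(settings: Settings): AgentRole[] {
  const roles: AgentRole[] = ["planner", "navigator", "validator"];
  return roles.filter((role) => {
    const { provider } = settings.agents[role];
    if (provider === "ollama") return false;
    const key = settings.apiKeys[provider];
    return key === undefined || key.trim() === "";
  });
}

// Placeholder key and model names, for illustration only.
const exampleSettings: Settings = {
  apiKeys: { openai: "sk-placeholder" },
  agents: {
    planner: { provider: "openai", model: "placeholder-planner-model" },
    navigator: { provider: "anthropic", model: "placeholder-navigator-model" },
    validator: { provider: "ollama", model: "placeholder-local-model" },
  },
};

console.log(rolesMissingKeys(exampleSettings)); // prints [ 'navigator' ]
```

A check like this could run before a task starts, so a missing key surfaces as a settings prompt instead of a failed request partway through a task.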
Type directly in the floating toolbar at the top of any page:
- "Click the login button"
- "Fill the form with test data"
- "Extract all product prices"
Click the extension icon to open the side panel for:
- Full conversation history
- Action history and replay
- Detailed settings
- Phase 2: Storage & Configuration Layer
- Phase 3: LLM Provider Factory
- Phase 4: Browser Automation Layer
- Phase 5: Action System (20+ actions)
- Phase 6: Multi-Agent System
- Phase 7: Message Passing & Service Worker
- Phase 8: Side Panel UI
- Phase 9: Polish & Testing
Contributions are welcome! Please read our contributing guidelines first.
MIT License - see LICENSE for details.
- Inspired by Nanobrowser
- Architecture influenced by BrowserOS
- Built with LangChain.js