BrowserMind is a browser automation agent built on top of browser-use. giving you a single interface to run any browser task using the LLM of your choice. Point it at a task, pick a model, and let it run.
Form automation Β· Data scraping Β· Complex workflows Β· Automation
| Feature | Description |
|---|---|
| Multi-LLM Support | Works with Anthropic (Claude), OpenAI (GPT-4), Groq, Google (Gemini), and Ollama |
| Speed Optimized | Built-in speed optimization for fast task completion with minimal steps |
| Easy Setup | Minimal configurationβjust set your API keys and go |
| Headless Browsing | Efficient headless browser automation out of the box |
| Error Handling | Graceful fallbacks and error management |
| Extensible Tools | Built on Browser-Use's comprehensive tool ecosystem |
git clone https://github.com/rojansapkota/BrowserMind.git
cd BrowserMind
pip install -r requirements.txtCreate a .env file in the project root:
# Choose one or more providers
GROQ_API_KEY=your_groq_api_key
ANTHROPIC_API_KEY=your_anthropic_api_key
OPENAI_API_KEY=your_openai_api_key
GOOGLE_API_KEY=your_google_api_keypython main.pyWhen prompted, enter a task:
Enter your task for the agent: Find the latest AI news on Hacker News
The agent will automatically browse and complete the task for you.
- Python 3.11.9
# 1. Clone the repository
git clone https://github.com/rojansapkota/BrowserMind.git
cd BrowserMind
# 2. Install dependencies
pip install -r requirements.txt
# 3. Install Chromium (if not already installed)
# Browser-Use handles this automatically, but you can manually install:
# python -m playwright install chromium
# 4. Set up your environment variables
cp .env.example .env # Create from template (optional)
# Edit .env with your API keysgrocery-use-large.mp4
pc-use-large.mp4
| Provider | Setup | Model |
|---|---|---|
| Groq | Set GROQ_API_KEY |
meta-llama/llama-4-scout-17b-16e-instruct |
| Anthropic | Set ANTHROPIC_API_KEY |
claude-3-5-sonnet-20240620 |
| OpenAI | Set OPENAI_API_KEY |
gpt-4.1 |
Set GOOGLE_API_KEY |
gemini-2.0-flash-lite |
|
| Ollama | Run locally via Ollama | llama3.2:latest (customizable) |
# API Keys
GROQ_API_KEY=your_key_here
ANTHROPIC_API_KEY=your_key_here
OPENAI_API_KEY=your_key_here
GOOGLE_API_KEY=your_key_here
# Ollama (for local models)
OLLAMA_BASE_URL=http://localhost:11434Edit main.py to customize:
# Browser window size
window_size={'width': 1280, 'height': 720}
# Page load wait time
minimum_wait_page_load_time=0.1
# Wait between actions
wait_between_actions=0.1
# Temperature (0 = deterministic, higher = more creative)
temperature=0.0from main import run_agent
import asyncio
# Simple task
asyncio.run(run_agent("Find the current Bitcoin price", provider='groq'))
# Use different providers
asyncio.run(run_agent("Fill out this form", provider='anthropic'))
asyncio.run(run_agent("Scrape product listings", provider='openai'))import asyncio
from main import initialize_agent
async def automated_workflow():
query = "Navigate to google.com and search for 'Browser Mind'"
agent, browser_session = initialize_agent(query, provider='groq')
try:
result = await agent.run()
print(f"Task completed: {result}")
finally:
await browser_session.close()
asyncio.run(automated_workflow())- Form Filling: Automatically complete and submit web forms
- Data Scraping: Extract structured data from websites
- Research: Gather information from multiple sources
- Monitoring: Watch for changes and alerts
- Automation: Repetitive browser tasks
CUSTOM_PROMPT = """
Your custom instructions here.
Focus on speed and accuracy.
"""
agent = Agent(
task=query,
llm=llm,
browser_session=browser_session,
extend_system_message=CUSTOM_PROMPT
)# Pull a model first
ollama pull mistral
# Then use it in main.py
return ChatOllama(model='mistral')| Issue | Solution |
|---|---|
| API Key Error | Ensure your .env file is in the project root and API key is valid |
| Browser Won't Open | Check if Chromium is installed; run playwright install chromium |
| Timeout Errors | Increase minimum_wait_page_load_time in the configuration |
| Module Not Found | Run pip install -r requirements.txt again |
- Browser-Use Documentation: docs.browser-use.com
- Browser-Use GitHub: github.com/browser-use/browser-use
- Playwright Documentation: playwright.dev
Contributions are welcome! Please feel free to:
- Report bugs via Issues
- Submit pull requests with improvements
- Share your use cases and examples
This project is licensed under the MIT License - see the LICENSE file for details.
Made with β€οΈ for the OpenSource community

