Doppelganger Banner

Doppelganger — Browser Automation for Everyone

Doppelganger is a self‑hosted, block-first automation control plane built for teams that want predictable, auditable browser workflows without pushing sensitive data to third‑party SaaS. It bundles a React/Vite frontend, an Express/Playwright backend, helper scripts, and optional CLI tooling so you can sketch blocks, inject JavaScript, rotate proxies, and run everything locally.

Demo run

What You Get

  • Block‑based automation — build flows with actions like click, type, wait, hover, and execute JavaScript against modern pages.
  • Task API + CLI — trigger saved tasks via HTTP (POST /api/tasks/:id/api) or npx doppelganger while passing variables and securing runs with the API key you control.
  • Captures & storage — automatically store screenshots/recordings and cookies; view them in the captures tab, reset storage, or download built assets.
  • Proxy management — host, rotate, or import HTTP/SOCKS proxies, flag a default, and toggle rotation per task.
  • Security-first — session authentication, IP allowlists, secret management, and audit trails live entirely inside your environment.
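
As a sketch of the Task API in practice, the run endpoint can be called from Node 18+ with the built-in fetch. The base URL, environment variable names, task id, and variable names below are placeholders, not part of the project:

```javascript
// Illustrative sketch: trigger a saved task over HTTP from Node 18+.
// Build the request separately so its shape is easy to inspect.
function buildRunRequest(baseUrl, taskId, variables, apiKey) {
  return {
    url: `${baseUrl}/api/tasks/${encodeURIComponent(taskId)}/api`,
    init: {
      method: 'POST',
      headers: { 'content-type': 'application/json', 'x-api-key': apiKey },
      body: JSON.stringify({ variables }),
    },
  };
}

async function runTask(taskId, variables) {
  // DOPPELGANGER_URL and DOPPELGANGER_API_KEY are placeholder env var names.
  const base = process.env.DOPPELGANGER_URL ?? 'http://localhost:11345';
  const { url, init } = buildRunRequest(
    base, taskId, variables, process.env.DOPPELGANGER_API_KEY ?? ''
  );
  const res = await fetch(url, init);
  if (!res.ok) throw new Error(`Task run failed: ${res.status}`);
  return res.json();
}

// Example: runTask('my-task-id', { query: 'books' }).then(console.log);
```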

Architecture Snapshot

  1. Frontend

    • Vite with React (TypeScript) drives /dashboard, /tasks, /settings, /executions, and /captures.
    • The Settings screen is tabbed (System, Data, Proxies) and houses panels for API keys, user agents, layout, storage, and version info.
    • Components call /api/* endpoints through the Vite dev proxy (see vite.config.mts), sharing APP_VERSION via src/utils/appInfo.ts.
  2. Backend

    • server.js (Express) handles auth (/api/auth), task metadata, hooks into Playwright, and exposes /api/settings/* for runtime configuration.
    • Requirements: Node 18+ (LTS), Playwright bundled via npm install.
    • Storage is plain‑file: data/ for proxies and allowlists, public/captures for visuals, storage_state.json for cookies.
  3. Scripts & automation

    • scripts/postinstall.js runs whenever dependencies are installed; keep an eye on it if you customize the install flow.
    • agent.js, headful.js, scrape.js expose specialized runners; the CLI binary bin/cli.js wires them for npx doppelganger.
  4. Code layout highlights

    • src/App.tsx glues together routing, alerts, and the sidebar that links dashboards, tasks, and settings.
    • src/components houses reusable panels (API keys, storage, captures, proxies) that map directly to backend endpoints.
    • server.js embeds all HTTP handlers in one file; use the data/ helpers for proxies, API keys, and user agent preferences if you customize behavior.

Getting Started

Docker (Recommended)

Docker Compose (Multi-arch / ARM / Apple Silicon)

The easiest way to run Doppelganger on any architecture (including M1/M2/M3 Macs) is via Docker Compose.

  1. Clone the repository:
git clone https://github.com/mnemosynestack/doppelganger.git
cd doppelganger
  2. Start the services:
docker compose up --build -d

This starts the app on http://localhost:11345 and the VNC viewer on http://localhost:54311.

Docker Run (Standard)

docker pull mnemosyneai/doppelganger
docker run -d \
  --name doppelganger \
  -p 11345:11345 \
  -p 54311:54311 \
  -e SESSION_SECRET=replace_with_long_random_value \
  -v $(pwd)/data:/app/data \
  -v $(pwd)/public:/app/public \
  -v $(pwd)/storage_state.json:/app/storage_state.json \
  mnemosyneai/doppelganger

Visit http://localhost:11345. Stop/start with docker stop/start doppelganger.

The first visit loads the login/setup screen. After you create the admin account and sign in, the dashboard replaces the login view and stays visible for as long as the session remains valid; returning users are redirected straight to the dashboard until they explicitly log out or the session expires.

Local Development (npm)

  1. Install dependencies:
npm install
  2. Launch the backend and frontend:
npm run server
npm run dev

Frontend calls /api via the Vite proxy defined in vite.config.mts; the backend listens on process.env.VITE_BACKEND_PORT (default 11345).
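
The proxy wiring can look roughly like this. This is an illustrative sketch of the dev-proxy entry, not the project's actual vite.config.mts:

```javascript
// Illustrative sketch only; the real vite.config.mts may differ.
export default {
  server: {
    port: Number(process.env.VITE_DEV_PORT) || 5173,
    proxy: {
      // Forward /api/* from the dev server to the Express backend.
      '/api': {
        target: `http://localhost:${process.env.VITE_BACKEND_PORT || 11345}`,
        changeOrigin: true,
      },
    },
  },
};
```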

Install Release via npm

If you just want to run the packaged release (no source checkout), install the published npm package and run doppelganger directly.

npm install -g @doppelgangerdev/doppelganger
doppelganger

Or use npx:

npx @doppelgangerdev/doppelganger

If you prefer not to install globally, clone the repo, run npm install to pull dependencies, and then run npx @doppelgangerdev/doppelganger inside that folder; npx then resolves the package from the local registry/cache while delivering the same dashboard experience.

Set SESSION_SECRET and optionally mount data/, public/, and storage_state.json (match the Docker volume layout). The CLI spins up the same Express/Playwright stack and opens the browser-based dashboard at http://localhost:11345 unless you override PORT.

Session Secret

Set SESSION_SECRET before any run. A quick generator:

node -e "console.log(require('crypto').randomBytes(32).toString('hex'))"

Configuration

  • SESSION_SECRET: signs session cookies. Required.
  • ALLOWED_IPS: comma-separated list for basic IP allowlisting. Default: none (open).
  • TRUST_PROXY: honor X-Forwarded-* headers when behind a reverse proxy. Default: 0.
  • ALLOW_PRIVATE_NETWORKS: allow scraping local/private IPs (SSRF risk). Default: true.
  • VITE_DEV_PORT: port for the frontend dev server. Default: 5173.
  • VITE_BACKEND_PORT: backend port for proxying and scripts. Default: 11345.
  • DB_TYPE: optional database type overriding disk storage; set to postgres to use PostgreSQL.
  • DB_POSTGRESDB_HOST, DB_POSTGRESDB_PORT, DB_POSTGRESDB_USER, DB_POSTGRESDB_PASSWORD: PostgreSQL host, port, username, and password (each required when DB_TYPE is postgres).
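
Pulled together, a .env-style file for these variables might look like the following. All values are placeholders, and the DB_* entries only apply when DB_TYPE is postgres:

```shell
# Placeholder values; adjust for your deployment.
SESSION_SECRET=replace_with_long_random_value
ALLOWED_IPS=203.0.113.10,198.51.100.7
TRUST_PROXY=1
ALLOW_PRIVATE_NETWORKS=false
VITE_BACKEND_PORT=11345
# Only needed when opting into PostgreSQL instead of disk storage:
DB_TYPE=postgres
DB_POSTGRESDB_HOST=db.internal
DB_POSTGRESDB_PORT=5432
DB_POSTGRESDB_USER=doppelganger
DB_POSTGRESDB_PASSWORD=change_me
```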

Proxy rotation also respects data/proxies.json (see below), and data/allowed_ips.json works as an alternate allowlist format.

Advanced Configuration

  • PLAYWRIGHT_BROWSERS_PATH (or set PLAYWRIGHT_CHROMIUM_EXECUTABLE_PATH) when using a shared Playwright installation.
  • NODE_ENV=production enables the bundled dist/ client and reduces console verbosity.
  • HOST=0.0.0.0 allows binding beyond localhost inside Docker containers, while PORT overrides the Express listen port (defaults to 11345).
  • Set LOG_LEVEL to debug if you need more Playwright or proxy diagnostics; you can also pass it through a custom wrapper when running node server.js.
  • Headful mode: the headful/visible browser binds to 54311, so open that port alongside 11345 when running headful.js or other headful flows.

UI Walkthrough

  • Dashboard — quick stats, recent runs, and a “New Task” entry point (block or agent).
  • Task Editor — drag blocks (click, type, wait, scroll, press, JavaScript); toggle “Rotate Proxies”; run/stop tasks; inspect results with pins & logs.
  • Captures — review screenshots/recordings stored under public/captures; delete individually or refresh.
  • Executions — historical runs with detail drill-down and the ability to re-run or download results.
  • Settings
    • System tab: regenerate or copy API key, select user agent, adjust layout ratio, view/copy version (VersionPanel), and clear storage.
    • Data tab: manage captures and cookies.
    • Proxies tab: add/import proxies, set defaults, toggle rotation, and inspect host vs saved entries.

CLI & Agent Mode

  • Use npx doppelganger (or npm run cli) to launch the interactive CLI that shows tasks, status, and logs.
  • Behind the scenes, bin/cli.js can invoke agent.js, headful.js, or scrape.js depending on the runtime mode (--agent, --headful, --scrape).
  • Run node agent.js --help to see flags like --task, --browser, or --version. These runners share the same settings (API key, proxies, storage) as the web UI.
  • When connecting via the API key, prefer Authorization: Bearer <key> so reverse proxies can normalize headers; the CLI also accepts a --api-key flag for scripted runs.

Agent capabilities

  • Tasks use the JSON schema outlined in AGENT_SPEC.md, including mode/modes (agent/block), wait times, selectors, and stealth flags.
  • Support for all action types in the spec (click, type, wait, press, scroll, javascript, csv, hover, merge, screenshot, if/else/end, loops, foreach, stop, set, on_error, start), so you can encode complex flows.
  • Variable templating ({$var}), structured conditions, and helpers such as exists(), text(), and block outputs make tasks reusable and data-driven.
  • Extraction scripts run in the browser context after the page renders; you can return JSON/CSV by reading DOM nodes directly as documented in AGENT_SPEC.md.
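
As a rough illustration of the shape only — field names here are guesses, and AGENT_SPEC.md is the authority — a block-mode task might look like:

```json
{
  "mode": "block",
  "variables": { "query": "books" },
  "blocks": [
    { "type": "goto", "url": "https://example.com/search?q={$query}" },
    { "type": "wait", "ms": 1500 },
    { "type": "javascript", "code": "return document.title;" }
  ]
}
```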

Proxies

Proxies can be defined via the UI or data/proxies.json:

[
  "http://user:pass@proxy1.example.com:8000",
  { "server": "socks5://proxy2.example.com:1080", "label": "data center" }
]

  • host is always available and represents your machine’s default IP.
  • Rotation settings (round-robin or random) live in the Settings screen and persist through the backend endpoints.
  • Import/export operations live behind /api/settings/proxies/import.

API Surface

Doppelganger exposes a comprehensive REST API for integration with agents (like OpenClaw) or custom automation scripts. All endpoints are hosted locally, typically on port 11345.

Authentication: If enabled, provide the x-api-key header or Authorization: Bearer <key>. For internal network use, this may be optional depending on your settings.

Task Management API

  • GET /api/tasks: List all saved automation profiles.
  • POST /api/tasks: Create a new task profile.
  • PUT /api/tasks/:id: Update an existing task profile.
  • POST /api/tasks/:id/api: Execute a predefined task. Pass {"variables": {}} in the body to override execution variables dynamically.

Execution & Logging API

  • GET /api/executions: Retrieve paginated logs of all past runs.
  • GET /api/executions/:id: View the exact steps, result JSON, and configuration state of a specific run.

Data Management API

  • GET /api/data/captures: List generated screenshots, videos, and downloads.
  • DELETE /api/data/captures/:name: Delete a specific capture.
  • POST /api/clear-screenshots: Remove all files in public/captures.
  • POST /api/clear-cookies: Delete storage_state.json.

Task Scripting Tips

  • Use JavaScript blocks to scrape structured data:
    return document.querySelectorAll('article').length;
  • Keep CSS selectors narrow; the block-based editor surfaces #, ., and attribute hints.
  • When running headlessly, toggle headful.js or agent.js depending on whether you need a visible browser for debugging.
  • Set task.variables via the API to re-use generic workflows across multiple domains.
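
Expanding on the JavaScript-block tip above, a block that returns structured data can isolate its DOM mapping in a helper. The selectors and field names here are hypothetical; adapt them to the target page:

```javascript
// Hypothetical extraction helper for a JavaScript block. In Doppelganger the
// block runs in the page context, where `document` is the real DOM; keeping
// the DOM walk in one function makes the mapping easy to adapt and test.
function extractArticles(doc) {
  return Array.from(doc.querySelectorAll('article')).map((el) => ({
    title: el.querySelector('h2')?.textContent?.trim() ?? '',
    link: el.querySelector('a')?.getAttribute('href') ?? '',
  }));
}
// Inside the block, end with: return extractArticles(document);
```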

Workflow Recipe

  1. Design a task in the editor starting with a goto block and a wait block to give pages time to render.
  2. Add conditional javascript blocks to test for specific DOM elements; use the retry/timer controls per block.
  3. Attach extract (JSON output) or screenshot actions before submitting so you can inspect results in the Captures tab.
  4. Toggle “Rotate Proxies” if you need egress diversity and pick a default proxy on Settings → Proxies.
  5. Save the task, pin results you care about, and use the POST /api/tasks/:id/api endpoint with a body like {"variables":{"query":"books"}} to run it from automation tools.

Testing & Validation

  • Run npm run build before packaging for production; the dist/ folder contains the compiled assets.
  • Backend logging writes to the console; capture output from server.js for debugging proxies, authentication, or Playwright failures.
  • Playwright logs are visible in the running Node process and under node_modules/.cache when using the CLI.

Troubleshooting

  • “Session expired” in the UI: confirm SESSION_SECRET is consistent and cookies aren’t blocked by your browser.
  • Proxy import fails: inspect data/proxies.json for valid URLs; the backend validates server as a string.
  • API key lost: copy from Settings → System tab.

Data Lifecycle

  • Captures land in public/captures; regular cleanups can be scripted via POST /api/clear-screenshots.
  • Cookies live in storage_state.json. Back up this file before clearing cookies via the UI or /api/clear-cookies.
  • Proxy lists, user-agent preferences, and settings persist under data/ (look for proxies.json, allowed_ips.json, etc.) — treat this directory as your config source control.
  • Use Storage controls in Settings to clear data after experimentation cycles, and keep layouts or version info tracked via localStorage as shown in src/components/SettingsScreen.tsx.

Maintenance

  • The project is governed by the GNU General Public License v3.0, which grants rights for distribution and modification as per the GPLv3 terms.
  • Keep data/ and storage_state.json backed up if you rely on historical cookies or proxies.
  • Release updates by pulling mnemosyneai/doppelganger (Docker) or npm i @doppelgangerdev/doppelganger (npm). The Settings view always displays the current package version.
  • Contributions: follow .github/ templates, respect CONTRIBUTING.md, and run available lint/test scripts if you touch critical areas.

Roadmap

  • Settings shortcuts — the System tab already exposes API key regeneration, user agent selection, and layout preferences so operators can tune them without leaving the UI.
  • Storage cleanup — the Settings data tab lets you clear captures and cookies, and the backend exposes /api/clear-screenshots and /api/clear-cookies.
  • IP rotation tooling — build a settings workflow for importing proxies and automatically rotating them.
  • API key workflow — the API key panel already supports regenerating and copying keys via /api/settings/api-key, so secure API access is ready without extra setup.
  • Task proxy rotation toggle — the “Rotate Proxies” option in each task ties into the Settings rotation controls, enabling rotation per execution.
  • Action key combos — add modifier shortcuts (e.g., Ctrl+Click, Shift+Scroll) so tasks can more closely mirror real user interactions.
  • Click-and-drag block — add an action that does drag gestures (selecting text, moving items) so tasks can simulate click-and-drag flows.
  • Recording controls — Task editor now exposes a “Disable automated recording” switch in the general settings panel so workflows can skip video capture on a per-task basis.
  • File downloads — add explicit support for agent tasks to download files (PDFs, CSVs, etc.) directly from target pages, then surface those downloads in the UI so users can preview or export them without sifting through captures.
  • Stateless mode — Tasks now have a “Stateless execution” toggle alongside the recording controls so each run can skip storage_state.json, ensuring no cookies or local storage persist between executions for that workflow.
  • Adblocking filters — add controls so execution contexts can enable built-in ad/malware filtering (e.g., via hosts file overrides or request blocking) to reduce noise on sensitive sites.
  • Extraction response mode — add a Settings switch so users can choose whether the UI returns HTML+data (for debugging) or data-only payloads when extraction scripts run.
  • Folder organization — group tasks, assets, and captures into named folders so operators can browse, filter, and download collections per workflow.
  • Stable capture retention — add filtering, pinning, and archiving in captures tab so teams can keep compliance records.
  • Workspace templates — allow saving and sharing workspace presets (layout + default proxies/agents) so new team members can onboard with pre-configured setups.
  • Geo-targeted exits — allow choosing proxy regions for tasks so you can pin the apparent location before running a job.
  • Complete anti-detection coverage — follow browserscan.net's anti-detection checklist (fingerprints, headers, fonts, WebRTC, etc.) so automated runs mimic real browsers across task executions.
  • Session recording redaction — add toggles to redact sensitive fields (passwords, credit cards) from recordings/logs before storing them.
  • Two-factor authentication — add optional TOTP/second-factor support to Settings/Auth so operators can lock down the UI with 2FA.
  • AI-assisted fixing — add an “AI auto-fix” helper that suggests layout, selector, and proxy tweaks after failed runs, letting teams approve or discard the proposed changes without switching contexts.
  • Companion app — build a lightweight companion app that mirrors critical dashboard notifications (failures, capture completions, proxy issues) so operators can stay informed without opening the full UI.
  • Community presets hub — build a marketplace where users can publish task/workspace presets, browse and download others’ submissions, and choose to offer each preset either for free or as a paid template so creators can monetize standalone workflows while keeping the free option available.
  • Database Tab / Local CRM — add a built-in spreadsheet-like interface for viewing and managing extracted data (CRM-style) entirely within the app, without requiring external tools.
  • iframe interaction support — add the ability to target and interact with elements inside iframes in the task editor.
  • Autosave — automatically persist task changes and editor state at regular intervals so operators don't lose work on long-running or complex workflow designs.
  • Highlight tool — add a feature to highlight elements on the page (similar to a browser's inspect tool) to easily pick selectors and build workflows.

Security Considerations

  • Never commit your SESSION_SECRET, API keys, or storage_state.json into shared repositories.
  • Use ALLOWED_IPS/data/allowed_ips.json to gate the UI when deploying to a network-exposed host.
  • Rotate API keys periodically via Settings, and log all automation runs through the Executions tab for audit purposes.
  • Playwright runs inside the same Node process; keep dependencies up to date and rebuild node_modules after significant OS patches.

Community

  • Report issues or request features via the GitHub repo issue tracker.
  • Follow the authors on https://github.com/mnemosynestack for releases.
  • Share automation recipes with other self-hosted users in your org, but respect the license for sharing infrastructure.
  • Join the community on Discord.

Support the Project

If you find this project helpful, please consider supporting its development. Your contributions help keep the project maintained and the lights on!


Other ways to help:

  • Star the repository to help others find it.
  • Share the project with your network.
  • Contribute to the code or documentation.

About

Doppelganger is a self-hosted, free browser automation and scraping platform built on Playwright, designed to handle everything from simple scraping to complex, human-like browser interactions. It provides a visual task editor, a structured JSON task format, and advanced execution modes that go far beyond traditional scrapers.
