Local-First Computer-Use Agent (Copilot Agent Mode + Playwright MCP)

What this is

A minimal, local-first workspace for running GitHub Copilot “Agent mode” in VS Code to execute multi-step “computer-use” workflows.

Browser automation via the official Playwright MCP server
Optional database/toolbox MCPs (e.g., queries/exports)
Deterministic local transforms (terminal/file ops)

Durable repo-based memory (instructions, skills, and run logs) keeps the system safe and reproducible.

Source of truth for scope/design: docs/PRODUCT_REQUIREMENTS.md.

Current property focus

This workspace is currently being used to evaluate a specific property:

ACTIVE: Vineyard Point / Dockside condos (Lake Norman-adjacent)
- Address anchor: 18861 Vineyard Point Ln, Cornelius, NC 28031
- Package: properties/README.md
- Active run: runs/2026-01-19__vineyard-point/

Historical note:

ARCHIVED: Alexander Chase (Cornelius, NC) evaluation (retained under runs/2026-01-17__second-home-condo-net-cost/)

Prereqs

Node.js (node, npx)
- Optional (Homebrew): brew install node
Python 3 (python3)
- Optional (Homebrew): brew install python
VS Code with GitHub Copilot Agent Mode enabled

Safety / HITL

Auth handoff: If login/SSO/MFA/CAPTCHA appears, the agent stops and asks you to take over. You complete auth, then type “Done”.
Irreversible actions: The agent must ask immediately before clicking Complete / Submit / Attest / Finish (or anything irreversible).
No URL storage: Training URLs are session-only. Do not write them into repo files; use <TRAINING_URL> placeholders in skills/logs.

Setup

MCP config is committed in .vscode/mcp.json. Open this repo in VS Code.
Ensure VS Code lists the Playwright MCP server in its MCP/tools UI.

This repo configures Playwright MCP out of the box; additional MCP servers (e.g., database/toolbox MCPs) are optional and should follow the same safety + RUN_ID isolation conventions.

Azure OpenAI (GPT-5.2)

Guide: docs/GPT_5_2_INTEGRATION_GUIDE.md
Create a local .env from .env.example (do not commit secrets)
Quick smoke test: python -m agent_tools.llm.smoketest --model azure-gpt-5.2 --prompt "hello"

Document synthesis (chunked, with coverage warnings)

This repo includes local-first synthesis tooling that avoids silent truncation by using chunked map-reduce, and always emits a Coverage / Limit Warnings section.

Single PDF synthesis (writes Markdown + optional JSON manifest):

python -m agent_tools.llm.summarize_file \
  --pdf "/path/to/file.pdf" \
  --out "runs/<RUN_ID>/exports/<slug>__synthesis.md" \
  --manifest "runs/<RUN_ID>/exports/<slug>__synthesis.manifest.json" \
  --chunk-summaries-dir "runs/<RUN_ID>/tmp/<slug>__chunks"

Full folder synthesis (PDF/EML/TXT/MD) + per-doc outputs:

python -m agent_tools.llm.summarize_folder \
  --dir "/path/to/folder" \
  --out "runs/<RUN_ID>/exports/folder_synthesis.md" \
  --manifest "runs/<RUN_ID>/exports/folder_synthesis.manifest.json" \
  --per-doc-dir "runs/<RUN_ID>/exports/docs" \
  --tmp-dir "runs/<RUN_ID>/tmp/docs"

Notes:

If a PDF extractor appears to duplicate identical page text (a known issue with some PDFs), the output will include a warning like PDF_REDUNDANCY_DEDUPED.
Model credentials and endpoints are always local (.env + config/models.json placeholders). Do not commit secrets or internal endpoints.

Microsoft Graph (Office 365)

Guide: docs/GRAPH_AUTH_REPLICATION_GUIDE.md
Create a local .env from .env.example (do not commit secrets)
Validate auth + access (interactive loopback + cache): python -m agent_tools.graph.validate
List next workweek events (Eastern): python -m agent_tools.graph.validate --next-workweek --list-events --outlook-timezone "Eastern Standard Time" --display-timezone "America/New_York"
Verified 30-min availability next workweek (8:30 AM–4:30 PM Eastern): python -m agent_tools.graph.validate --next-workweek --availability --outlook-timezone "Eastern Standard Time" --display-timezone "America/New_York"
Export sent mail compiled to a recipient (writes to ignored tmp/ by default): python -m agent_tools.graph.export_sent_mail --to <RECIPIENT_EMAIL> --include-cc --out-dir tmp/mail_exports_<slug>

Web search (Bing + Tavily)

Bing (native): Use for quick lookups and broad discovery. It’s built into Copilot and is policy-controlled (no local MCP server to install).
Tavily (MCP, optional): Use for deep research loops where the agent should search → open → extract documentation. This is a third-party service and requires an API key.

Parallel runs note:

Unlike Playwright, Tavily typically does not require per-worktree “isolation” (no browser/profile state). The main parallelism risk is shared API rate limits/quotas when multiple windows use the same key.

Setup and prompting guidance: docs/Copilot Web Search Configuration and Usage.md

If the MCP server fails to start, switch the package in .vscode/mcp.json to @microsoft/mcp-server-playwright and record what worked in this README.

Reference docs

AGENTS.md
docs/ARCHITECTURE.md
docs/Local-First Browser Agent Briefing.md
docs/Copilot Web Search Configuration and Usage.md
docs/PRODUCT_REQUIREMENTS.md
docs/PARALLEL_RUNS.md
docs/DEPENDENCIES_AND_UTILS.md
docs/CORE_REPO_WORK_LOG.md
Skills Index: .github/skills/README.md (Skills live in .github/skills/)

How repo memory works

Automatic: .github/copilot-instructions.md (custom instructions applied every turn).
On-demand: Skills in .github/skills/ are only used when the agent opens them (start with the Skills Index).
Run continuity (default): runs/<RUN_ID>/HANDOFF.md (run-local; allowlisted for tracking on run branches; not promoted to main).
Session notes (optional): notes/agent-runs/ (intentionally-versioned, sanitized summaries; one file per session).
Core log (optional index): docs/CORE_REPO_WORK_LOG.md (optional; PR descriptions + git history are canonical).
Session vs repo: Chat/session memory is not durable; repo memory is.

Run-local working area (per worktree / per RUN_ID)

Each run/worktree uses a RUN_ID in the format:

YYYY-MM-DD__short-slug

Instance-only materials (briefing docs, raw inputs, exports, one-off scripts) should live under:

runs/<RUN_ID>/... (run-local quarantine; not promoted to main by default)

The bootstrap prompt initializes a standard structure for each run:

runs/<RUN_ID>/HANDOFF.md (append-only continuity journal)
runs/<RUN_ID>/briefing/ (briefing materials specific to this run)
runs/<RUN_ID>/inputs/ (raw inputs specific to this run)
runs/<RUN_ID>/exports/ (exports produced during the run)
runs/<RUN_ID>/scripts/ (one-off scripts used only for this run)
runs/<RUN_ID>/tmp/ (scratch)
runs/<RUN_ID>/downloads/ (downloads for this run)
runs/<RUN_ID>/playwright-profile/ (per-run Playwright user-data-dir)
runs/<RUN_ID>/playwright-output/ (per-run Playwright outputs)

Notes:

Do not store sensitive URLs/tokens/secrets in any run files; use placeholders like <TRAINING_URL>.
notes/**, scripts/**, requirements.txt, and .vscode/** are treated as non-core by default (see Core vs run-local below).

Tracking note:

By default, run artifacts under runs/ are tracked so run branches can fully capture state for rollback/continuity.
A GitHub Actions guardrail blocks changes under runs/ from being merged into main.

Workspace-specific Playwright MCP (instance isolation)

When you run multiple VS Code windows (one per git worktree), Playwright must be isolated per window/worktree to avoid shared cookies, sessions, downloads, and output collisions.

Recommended pattern (per worktree window):

Run .github/prompts/bootstrap_playwright_run_isolation.prompt.md.
Configure the Playwright MCP server to use:
- --user-data-dir runs/<RUN_ID>/playwright-profile/
- --output-dir runs/<RUN_ID>/playwright-output/
In VS Code’s MCP UI, ensure the active server is workspace-scoped for this window (not a shared/global server). If multiple Playwright MCP servers are running, stop the global/shared one to avoid the agent binding to the wrong instance.

Output-dir note:

If --output-dir is set, screenshot/save APIs typically interpret filenames as relative to the output dir. Pass a simple filename like mcp-validation.png (not a path that already includes runs/<RUN_ID>/playwright-output/) to avoid duplicated subpaths.

Validation prompt

Use Copilot Agent Mode with Playwright MCP tools enabled:

Navigate to https://example.com
Report the page title
Take a screenshot

Recommended workflow (Launcher + Worktrees)

Use this when you want multiple local agent instances without collisions.

Posture:

Keep one “launcher” VS Code window opened on main.
Create one branch + git worktree per run.
Open one VS Code window per worktree.

In the launcher window:

git switch main
git pull --ff-only

RUN_ID="YYYY-MM-DD__short-slug"
git worktree add -b "run/${RUN_ID}" "../wt-${RUN_ID}" main
code -n "../wt-${RUN_ID}"

Inside the run/worktree window:

Follow .github/prompts/bootstrap_computer_use_agent.prompt.md.
Then run .github/prompts/bootstrap_playwright_run_isolation.prompt.md to configure per-run Playwright MCP isolation (profile/output dirs) to avoid cross-window collisions.
Keep run continuity in runs/<RUN_ID>/HANDOFF.md.

Note: Playwright isolation is critical for parallel runs; Tavily is usually configured user-scoped and does not need per-worktree isolation.

Core vs run-local (short)

Core/shared paths (eligible to promote to main): AGENTS.md, README.md, .github/**, docs/**, tools/**.
Non-core by default (do not promote to main): runs/<RUN_ID>/**, notes/**, scripts/**, requirements.txt, .vscode/**.

Note: .vscode/mcp.json is committed to main as a baseline, but during a run it often becomes worktree-specific (e.g., per-run --user-data-dir / --output-dir). Treat run-specific edits as non-core unless you are intentionally updating the shared baseline.

Promotion to main (core-only)

Keep run/<RUN_ID> as the long-lived run journal (may include run artifacts).
Promote changes back to main by copying/merging only the allowlisted core paths (path-based promotion), not by merging the full run branch.

Details: docs/PARALLEL_RUNS.md

Dependencies

Dependencies are managed in tiers to keep the base reproducible while enabling optional capabilities.

Tier A (Base): always installed; keep slim
Tier B (Optional packs): install on demand
Tier C (Run-local): experiments

Details: docs/DEPENDENCIES_AND_UTILS.md

Training run procedure

You paste the training URL into chat as session-only: <TRAINING_URL>.
Agent opens the browser and navigates.
If login/SSO/MFA/CAPTCHA appears, agent stops; you complete it; you type “Done”.
Agent proceeds through the training:
- Detects gating (disabled Next/Continue, timers, “video must finish”)
- Waits for explicit completion signals (button enabled, timer hits 0, completed badge)
- Scrolls/handles hidden controls and common overlays
Before the final completion/submission step (Complete/Submit/Attest/Finish), the agent asks for explicit confirmation.

After-action

Optionally write a session note in notes/agent-runs/ using notes/agent-runs/_TEMPLATE.md.
Update or extract skills in .github/skills/, especially .github/skills/training-navigation/SKILL.md, with new recovery rules or landmarks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Local-First Computer-Use Agent (Copilot Agent Mode + Playwright MCP)

What this is

Current property focus

Prereqs

Safety / HITL

Setup

Azure OpenAI (GPT-5.2)

Document synthesis (chunked, with coverage warnings)

Microsoft Graph (Office 365)

Web search (Bing + Tavily)

Reference docs

How repo memory works

Run-local working area (per worktree / per RUN_ID)

Workspace-specific Playwright MCP (instance isolation)

Validation prompt

Recommended workflow (Launcher + Worktrees)

Dependencies

Training run procedure

After-action

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
.github		.github
.vscode		.vscode
agent_tools		agent_tools
config		config
docs		docs
notes/agent-runs		notes/agent-runs
runs		runs
scripts		scripts
templates		templates
tools		tools
.env.example		.env.example
.gitignore		.gitignore
AGENTS.md		AGENTS.md
README.md		README.md
requirements.txt		requirements.txt

mbosseme/computer-use-1

Folders and files

Latest commit

History

Repository files navigation

Local-First Computer-Use Agent (Copilot Agent Mode + Playwright MCP)

What this is

Current property focus

Prereqs

Safety / HITL

Setup

Azure OpenAI (GPT-5.2)

Document synthesis (chunked, with coverage warnings)

Microsoft Graph (Office 365)

Web search (Bing + Tavily)

Reference docs

How repo memory works

Run-local working area (per worktree / per RUN_ID)

Workspace-specific Playwright MCP (instance isolation)

Validation prompt

Recommended workflow (Launcher + Worktrees)

Dependencies

Training run procedure

After-action

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages