Skip to content

realbrowser 0.3.0

Latest

Choose a tag to compare

@darkamenosa darkamenosa released this 13 May 20:31
· 7 commits to main since this release
c8eabfd

This release moves Realbrowser from 0.2.1 to 0.3.0. The release goal is
Claude Code support, faster and lower-token browsing, and a more stable plugin
runtime across macOS, Linux, and Windows.

Added:

  • Claude Code plugin support via .claude-plugin/plugin.json.
  • Claude Code skill layout under skills/realbrowser/, with the CLI, wrappers,
    skill instructions, and references colocated for plugin discovery.
  • Portable launch wrappers for macOS/Linux shells, Windows cmd.exe, and
    PowerShell.
  • read tree: compact ARIA accessibility tree via CDP with refs for actions.
    Use --interactive/-i, --compact/-c, --depth/-d, --diff/-D,
    and --selector to keep reads bounded.
  • Task and session tab registries so agents reuse one same-origin tab instead
    of creating duplicate tabs after a failed action or across nested sessions.
  • tab done and tab close --mine cleanup paths that only touch agent-owned
    tabs.
  • action scroll, read autocomplete, and read overlay for generic browsing
    flows that need scroll containers, transient UI, or overlay diagnostics.
  • Cross-platform focus capture/restore for explicit stopped-profile launches:
    macOS via AppleScript, Linux via xdotool when present, and Windows via
    PowerShell.
  • Windows process-tree cleanup for anonymous browser sessions.

Changed:

  • The runtime version is now 0.3.0 everywhere.
  • Browsing guidance is target-first and site-agnostic: acquire one target, read
    compact state, act once, and verify with a diff or page wait.
  • SKILL.md stays concise and delegates command/reference detail to files under
    references/.
  • read tree -i -c is the default interaction reader; broad HTML, full-page
    screenshots, and large body dumps are fallback/debug tools.
  • Large reads, network bodies, HAR files, screenshots, and traces are directed
    to --out so local tools can inspect them without spending model context.
  • tab ensure --background uses only background-safe CDP/browser paths. It now
    probes live browserContextIds with chrome://version, closes mismatches, and
    refuses OS Chrome launch for already-running browser-scoped profiles.
  • Signed-in profile handling prefers direct DevToolsActivePort WebSocket
    reuse to avoid repeated Chrome approval prompts.
  • Profile ownership is monotonic once proven, preventing later label/cache
    writes from downgrading a verified tab.
  • Profile routing uses chrome://version Profile Path readback when profile
    ownership is not otherwise proven. Wrong-profile or unverifiable probe tabs
    are closed before the requested URL is loaded.
  • Session-registry hits are reusable but not automatically considered mine,
    so cleanup commands do not close sibling task tabs.
  • Health checks report stale/dead CDP sockets accurately so callers can restart
    the daemon instead of timing out repeatedly.
  • Help and workflow examples use neutral hosts and labels such as
    example.com, app.example, app, and project-a.

Fixed:

  • wait ready --screenshot <path> no longer treats the output path as a CSS
    selector.
  • action state --root <ref> resolves ref-based and CSS roots instead of always
    falling back to the active root.
  • ARIA refs are assigned from finalized nodes, preventing ref mismatch when CDP
    backend-id pushes partially fail.
  • ARIA role matching no longer misclassifies roles such as tabpanel and
    tablist.
  • action submit, action click, action hover, action fill, and
    action select argument validation accepts explicit --ref where supported.
  • Screenshot sizing env parsing no longer parses the same env var twice.
  • readTargetProfileDir handles Chrome's newline-rendered Profile Path
    output and paths with spaces.
  • CDP-created background tabs no longer fall through to OS profile launch after
    a strict profile verification failure.
  • Dead code and duplicated in-page helper code were removed from the CLI.

Compatibility:

  • agents/openai.yaml remains for Codex compatibility.
  • The CLI remains a zero-dependency standalone Node script.

Validated:

  • node --check skills/realbrowser/scripts/realbrowser.mjs
  • skills/realbrowser/scripts/realbrowser --version
  • skills/realbrowser/scripts/realbrowser self-test
  • git diff --check
  • Anonymous background browse smoke against https://example.com