Skip to content

Releases: remorses/usecomputer

usecomputer@0.1.3

24 Mar 15:58

Choose a tag to compare

  1. Kitty Graphics Protocol supportscreenshot now emits the PNG inline to stdout using the Kitty Graphics Protocol. Set AGENT_GRAPHICS=kitty and the image lands directly in the AI model's context window — no separate file-read step needed:

    AGENT_GRAPHICS=kitty usecomputer screenshot ./shot.png --json
    # { ..., "agentGraphics": true }

    Works with kitty-graphics-agent, an OpenCode plugin that intercepts the escape sequences and injects them as LLM-visible image attachments. Add to opencode.json:

    { "plugin": ["kitty-graphics-agent"] }
  2. Aligned table output for list commandsdisplay list, window list, and desktop list now render as aligned, human-readable tables. JSON mode (--json) is unchanged.

  3. Fixed agentGraphics JSON field — the field now reflects actual Kitty emission success. Empty PNG files and I/O errors correctly report false.

usecomputer@0.1.1

24 Mar 09:19

Choose a tag to compare

  1. Fixed Linux native builds — standalone executable now links libc correctly, fixing "C allocator is only available when linking against libc" errors
  2. Fixed native host builds — build script omits -Dtarget when building for the host platform so Zig finds system libraries (X11, libpng, etc)

usecomputer@0.1.0

24 Mar 09:18

Choose a tag to compare

  1. Standalone executable — ships as a self-contained binary, no Node.js required at runtime

  2. Linux X11 screenshot support — capture screens on Linux via XShm with XGetImage fallback on XWayland

  3. Screenshot coord-map and scaling — longest edge scaled to 1568 px with coordMap field for accurate pointer remapping:

    usecomputer screenshot ./shot.png --json
    usecomputer click -x 400 -y 220 --coord-map "0,0,1600,900,1568,882"
  4. New debug-point command — draw a red marker at a mapped coordinate to validate clicks before executing:

    usecomputer debug-point -x 400 -y 220 --coord-map "0,0,1600,900,1568,882"
  5. Keyboard synthesistype and press commands for text input and key chords:

    usecomputer type "hello from usecomputer"
    usecomputer press "cmd+s"
    cat ./notes.txt | usecomputer type --stdin --chunk-size 4000
  6. Native scroll support — scroll in any direction at any screen position:

    usecomputer scroll --direction down --amount 5
  7. Library exports — import as a Node.js library for use in custom agent harnesses:

    import * as usecomputer from 'usecomputer'
    const shot = await usecomputer.screenshot({ path: './shot.png', display: null, window: null, region: null, annotate: null })
  8. OpenAI and Anthropic computer-use examples — README includes full agentic loop examples for both providers