v0.2.0

@github-actions github-actions released this 14 Mar 03:00
· 13 commits to main since this release

🛸 Atlas v0.2.0: Computer Use, Smart Actions & File Search

Major update: native screen control, fast action routing, file search, and a bunch of quality-of-life improvements.

What's new

  • 🖥 Native computer control via Gemini Computer Use API: the agent sees the screen and performs actions (clicks, typing, scrolling, navigation)
  • ⚡ Fast system actions without screenshots: opening/closing apps, volume control, system info
  • 🧠 Three-way request classification: conversation, quick action, or screen interaction
  • 📋 Task planning: the agent breaks down complex requests into steps with a progress bar in the UI
  • 📂 Local file search: find files on disk with results displayed in Search Island (Open / Reveal actions)
  • 🔄 Migrated from nut-js to robotjs for better performance and reliability
  • 🔧 Session debug logger: detailed per-request logs with timers, togglable in Settings
  • ⚠️ TTS error handling: dismissible warnings for quota, auth, and rate-limit errors, with auto-disable on quota exhaustion
  • πŸ” Search query normalization β€” strips paths, wildcards, and invalid patterns from LLM-generated queries
  • 🎨 UI improvements β€” progress bar for multi-step tasks, file search results, new Settings options
    and more
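
To illustrate the search query normalization item above, here is a rough TypeScript sketch. It is not taken from the Atlas source: `normalizeSearchQuery` is a hypothetical helper, and the specific rules (keep only the last path segment, drop `*` and `?`, drop characters that Windows does not allow in file names) are assumptions about what "strips paths, wildcards, and invalid patterns" might mean in practice.

```typescript
// Hypothetical helper, for illustration only; not the actual Atlas implementation.
function normalizeSearchQuery(raw: string): string {
  // Assumption: only the last non-empty path segment is useful as a search term,
  // so "C:\Users\me\Docs\report*.pdf" is reduced to "report*.pdf".
  const segments = raw.split(/[\\/]+/).filter((s) => s.trim().length > 0);
  const lastSegment = segments.pop() ?? "";

  return lastSegment
    .replace(/[*?]/g, "")     // drop wildcard characters
    .replace(/[<>:"|]/g, "")  // drop characters that are invalid in Windows file names
    .replace(/\s+/g, " ")     // collapse runs of whitespace
    .trim();
}

// Example usage with the kind of input an LLM might generate.
console.log(normalizeSearchQuery("C:\\Users\\me\\Docs\\report*.pdf"));
console.log(normalizeSearchQuery("  budget   2024?? "));
```

A pure string-to-string function like this is easy to unit test and can run before the query reaches whatever search backend is in use.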

Getting started

  1. Download the .exe below
  2. Launch Atlas → click tray icon → Settings
  3. Paste your Gemini API key → set recommended models in Settings → LLM tab:
    • Text: gemini-3.1-flash-lite-preview
    • Vision: gemini-3-flash-preview (paid) / gemini-3.1-flash-lite-preview (free)
  4. Press Ctrl+Space → go 🚀

⚠️ Windows only. macOS & Linux support is planned.