Skip to content

v0.3.0 — Context-Aware Smart Session

Choose a tag to compare

@cnshsliu cnshsliu released this 12 May 07:50
· 1 commit to main since this release

What's New in v0.3.0

Smart Context Engine

  • Browser context: Captures visible viewport text (including input fields) when typing in Safari/Chrome/Edge
  • Word/Pages/TextEdit context: Captures text around cursor position for document editing
  • tmux context: Uses tmux capture-pane for real-time terminal content (no longer reads JSONL files in tmux)
  • Frontmost-app dispatch: Context source is determined by the active application — no priority chains

Hotkey Overhaul

  • Unified tap + PTT: Both styles work simultaneously — double-click Option (tap) or hold Option+Space (PTT)
  • Language slots: ⌥1/2/3/4 for temporary language, ⌃⌥1/2/3/4 for permanent switch
  • No mode setting needed: Auto-detected based on user action

Clipboard Preservation

  • User clipboard content is automatically saved and restored after each paste (500ms delay)
  • Works correctly with segmented output — clipboard always returns to user's original content

Pause Segmentation Improvements

  • Segment history carried into LLM post-processing for contextual awareness
  • RMS noise rejection: silent segments are discarded before Whisper (prevents hallucination garbage)
  • Enhanced hallucination detection: parenthesized noise words, repeated patterns

Settings & Reliability

  • Resilient per-field JSON decoding — a single field type mismatch no longer nukes all settings
  • Corrupted config files are backed up before reset
  • LLM prompts now include confusable tech term pairs (Color↔Cluster, Server↔Service, etc.)

Localization

  • All 9 languages updated for new hotkey descriptions