Skip to content

VoiceVibeCode v0.8.1

Choose a tag to compare

@cnshsliu cnshsliu released this 27 May 15:26
· 1 commit to main since this release

Bug Fixes

  • Fixed glow/audio freeze during long recordings — the floating mic indicator would stop glowing and audio level tracking would freeze during extended recording sessions. Root cause: voice activity timestamps were not updated while a segment was being processed, causing false "long silence" triggers that cascaded into broken segmentation. Voice activity is now always tracked regardless of processing state.

  • Fixed text loss after watchdog recovery — when the audio engine died and was automatically recovered by the watchdog, any buffered audio was discarded during restart. The watchdog now saves accumulated audio before restarting the engine and processes it after recovery, so no spoken text is lost.

  • Fixed potential hang on stop — when stopping recording while a segment was still being processed (Whisper/LLM), the stop handler would wait indefinitely. Now has a 30-second safety timeout that force-resumes if processing gets stuck.

New Features

  • LLM-based voice command detection — when LLM post-processing is enabled, voice commands (tmux, edit shortcuts, execution intent) are now detected by the LLM via structured JSON output instead of regex. This provides better accuracy, handles Whisper misrecognitions, and correctly distinguishes between executing a command and talking about it. Falls back to regex if LLM JSON parsing fails 3 times.

  • tmux last-window command — new voice command to switch to the previously active tmux window: "上一个窗口" / "返回窗口" / "last window" / "previous window"

  • Anti-false-positive command detection — the LLM now only recognizes commands when the user is requesting immediate execution, not when describing or mentioning the command in conversation.

Languages

  • All 9 languages updated with new voice command descriptions