Skip to content
KNOOP edited this page Jun 26, 2026 · 5 revisions

Voice Control

Voice control is the core feature of Ava, allowing you to control smart home devices by speaking.

Compatible with Android 5-16.


How It Works

┌─────────────┐    ┌─────────────┐    ┌─────────────┐    ┌─────────────┐
│  You speak  │ -> │ Ava records │ -> │Home Assistant│ -> │ Ava plays   │
│ wake word + │    │ sends audio │    │   speech    │    │   voice     │
│   command   │    │             │    │ recognition │    │   reply     │
└─────────────┘    └─────────────┘    └─────────────┘    └─────────────┘

Detailed Flow:

  1. Standby: Ava continuously listens for wake word (local processing, no internet)
  2. Wake Detection: When wake word detected, plays prompt sound, starts recording
  3. Audio Transmission: Recording sent to Home Assistant via ESPHome protocol
  4. Speech Recognition: Home Assistant's voice assistant performs speech-to-text
  5. Intent Processing: Home Assistant understands intent and executes action
  6. Speech Synthesis: Home Assistant generates voice response
  7. Playback: Ava receives and plays voice response

Topics

Topic Description
Wake Word Engines microWakeWord vs vsWakeWord — architecture, comparison, engine switching, visual feedback, stop words
Custom Wake Words Import custom wake word models via in-app library or APK injection — microWakeWord and vsWakeWord formats
Voiceprint Recognition On-device speaker identification — Manual mode (enrollment-based gating) and Automatic mode (passive learning)
Audio Event Detection Local detection of household sounds — alarm, baby crying, doorbell, glass breaking, siren, cough, speech
Audio Configuration Microphone settings, noise suppression, echo cancellation, gain, continuous conversation, subtitles, mute
Settings Summary & Services Complete settings reference, Home Assistant services, and FAQ

Back to Home

Clone this wiki locally