Skip to content

EchoTwin v0.1.0

Choose a tag to compare

@AlanY1an AlanY1an released this 12 Jun 06:11
· 5 commits to main since this release

First public release.

What works today

  • One-on-one voice conversation in a cloned voice, sub-second mouth-to-ear (best 361 ms, typically 0.6–1.1 s measured live)
  • Streaming ASR (sherpa-onnx zipformer) with speculative ASR/LLM execution, pre-opened TTS sockets, cached fillers
  • Barge-in, tool calls (time/date/weather), hot-swappable personas with per-persona Fish Audio TTS tuning
  • Per-turn cost ledger with daily/monthly budget caps
  • Bilingual prompt system: persona-level language: zh|en switches every LLM-facing prompt

Experimental

  • Organic multi-party mode: three-layer addressee pipeline (table-lookup reflexes → LLM arbiter with room context → golden-set-tested heuristics)

See docs/SETUP.md to go from zero to a talking bot in ~15 minutes.