Skip to content

docs: add demo audio + comparison table to README#21

Merged
StuBehan merged 1 commit into
mainfrom
docs/readme-demo-and-comparison
Apr 30, 2026
Merged

docs: add demo audio + comparison table to README#21
StuBehan merged 1 commit into
mainfrom
docs/readme-demo-and-comparison

Conversation

@StuBehan
Copy link
Copy Markdown
Collaborator

Summary

Two visibility wins for the public face of the repo, no code changes.

Changes

`docs/demo.wav` — 5 second voice sample at the top of the README

Generated locally with `Stackvox.synthesize()` using the cached Kokoro model: `af_heart` saying "Hello from stackvox." then `bf_emma` saying "Offline, Apache two-point-zero, CPU real-time." Concatenated with a 350ms gap. 234 KB as 24 kHz PCM_16 — small enough to commit and serve from the repo via GitHub's raw URL. Linked under the tagline so first-time visitors can hear what they'd be installing without doing anything.

If you'd rather replace it with something else (different voices, longer, your own recording), drop in a new `docs/demo.wav` — the README link uses a relative path.

README "How does this compare to other TTS?" section

Honest comparison table covering offline support, voice quality, latency, license, and best-for one-liner across:

  • stackvox (Kokoro-82M)
  • macOS `say`
  • `espeak-ng`
  • Piper
  • Coqui TTS
  • OpenAI / ElevenLabs / etc.

Helps prospective users decide quickly whether stackvox is what they want — and explicitly calls out that voice quality alone isn't a reason to switch off Piper, but the resident daemon + bash helper for sub-15ms shell-side speech is the actual differentiator. Sets honest expectations.

Test plan

  • CI passes (lint, format, mypy, tests, PR-title, commit lint).
  • Demo audio link in the rendered README plays / downloads on github.com.
  • Comparison table renders cleanly in dark and light themes.

Two visibility wins for the public face of the repo:

 - docs/demo.wav: 5 seconds of two stackvox voices (af_heart and
   bf_emma) speaking the tagline, generated locally with the cached
   Kokoro model. 234 KB at 24kHz PCM_16 — small enough to commit and
   serve via the GitHub raw URL. Linked from the top of the README so
   first-time visitors can hear the tool without installing anything.

 - README "How does this compare to other TTS?" section comparing
   stackvox against `say`, `espeak-ng`, Piper, Coqui TTS, and the
   cloud APIs across offline/quality/latency/license. The honest
   pitch: voice quality alone isn't a reason to switch off Piper, but
   the resident daemon plus bash helper for sub-15ms shell-side speech
   is the differentiator for shell-driven workflows.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@StuBehan StuBehan force-pushed the docs/readme-demo-and-comparison branch from d11b291 to b4b98d3 Compare April 30, 2026 11:43
@StuBehan StuBehan merged commit ce39be2 into main Apr 30, 2026
9 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant