docs: add demo audio + comparison table to README by StuBehan · Pull Request #21 · StackOneHQ/stackvox

StuBehan · 2026-04-30T11:43:13Z

Summary

Two visibility wins for the public face of the repo, no code changes.

Changes

`docs/demo.wav` — 5 second voice sample at the top of the README

Generated locally with `Stackvox.synthesize()` using the cached Kokoro model: `af_heart` saying "Hello from stackvox." then `bf_emma` saying "Offline, Apache two-point-zero, CPU real-time." Concatenated with a 350ms gap. 234 KB as 24 kHz PCM_16 — small enough to commit and serve from the repo via GitHub's raw URL. Linked under the tagline so first-time visitors can hear what they'd be installing without doing anything.

If you'd rather replace it with something else (different voices, longer, your own recording), drop in a new `docs/demo.wav` — the README link uses a relative path.

README "How does this compare to other TTS?" section

Honest comparison table covering offline support, voice quality, latency, license, and best-for one-liner across:

stackvox (Kokoro-82M)
macOS `say`
`espeak-ng`
Piper
Coqui TTS
OpenAI / ElevenLabs / etc.

Helps prospective users decide quickly whether stackvox is what they want — and explicitly calls out that voice quality alone isn't a reason to switch off Piper, but the resident daemon + bash helper for sub-15ms shell-side speech is the actual differentiator. Sets honest expectations.

Test plan

CI passes (lint, format, mypy, tests, PR-title, commit lint).
Demo audio link in the rendered README plays / downloads on github.com.
Comparison table renders cleanly in dark and light themes.

Two visibility wins for the public face of the repo: - docs/demo.wav: 5 seconds of two stackvox voices (af_heart and bf_emma) speaking the tagline, generated locally with the cached Kokoro model. 234 KB at 24kHz PCM_16 — small enough to commit and serve via the GitHub raw URL. Linked from the top of the README so first-time visitors can hear the tool without installing anything. - README "How does this compare to other TTS?" section comparing stackvox against `say`, `espeak-ng`, Piper, Coqui TTS, and the cloud APIs across offline/quality/latency/license. The honest pitch: voice quality alone isn't a reason to switch off Piper, but the resident daemon plus bash helper for sub-15ms shell-side speech is the differentiator for shell-driven workflows. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

StuBehan force-pushed the docs/readme-demo-and-comparison branch from d11b291 to b4b98d3 Compare April 30, 2026 11:43

StuBehan merged commit ce39be2 into main Apr 30, 2026
9 checks passed

github-actions Bot mentioned this pull request Apr 30, 2026

chore(main): release 0.4.0 #22

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: add demo audio + comparison table to README#21

docs: add demo audio + comparison table to README#21
StuBehan merged 1 commit into
mainfrom
docs/readme-demo-and-comparison

StuBehan commented Apr 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

StuBehan commented Apr 30, 2026

Summary

Changes

`docs/demo.wav` — 5 second voice sample at the top of the README

README "How does this compare to other TTS?" section

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant