Skip to content

sudomichael/openspeech

Repository files navigation

OpenSpeech

An open directory of open-source text-to-speech models. Every voice reads the same three scripts so you can actually compare them.

License: MIT PRs Welcome


Why

ElevenLabs is expensive. Open-source TTS has caught up — but the models are scattered across GitHub, Hugging Face, and a dozen comparison blog posts. Finding one and actually hearing it usually means cloning a repo, fighting CUDA, and praying the weights still download.

OpenSpeech fixes that:

  • Every voice reads the same three scripts.
  • Specs, license, and install in one place.
  • Filter by license, VRAM, language, capability.
  • Add a new model with one PR.

What's inside

  • 22 curated models from the broader OSS TTS ecosystem
  • ~40 voices across them
  • 3 standardized scripts per voice: neutral, emotional, numbers
  • Side-by-side audio so you can pick the model that fits your use case

Stack

Run locally

git clone https://github.com/sudomichael/openspeech
cd openspeech
npm install
npm run dev

The samples are committed to public/samples/ — you don't need a Replicate token just to browse.

Generate new samples

To add a new model or regenerate existing ones:

echo "REPLICATE_API_TOKEN=r8_..." > .env.local
npx tsx scripts/generate-replicate.ts          # full run
npx tsx scripts/generate-replicate.ts --missing  # only missing samples

See CONTRIBUTING.md for the full workflow.

Contributing

We want this to be the obvious place to compare open-source TTS. Adding a model is one PR.

Credits

License

MIT for the code. Each model retains its own license — see the individual model cards.

About

Open directory of open-source TTS models. Every voice reads the same three scripts so you can actually compare them.

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors