About OpenSpeech

ElevenLabs is expensive. Open-source TTS has caught up — but the models are scattered across GitHub, Hugging Face, and a dozen comparison blog posts. Finding one and actually hearing it usually means cloning a repo, fighting CUDA, and praying the weights still download.

OpenSpeech is an experiment to fix that:

  • Every voice reads the same three scripts.
  • Specs, license, and install in one place.
  • Filter by license, VRAM, language, capability.
  • Add a model with one PR.

Models curated from awesome-ai-voice. Samples generated via Replicate. Not affiliated with any model author.

The three scripts

Neutral — baseline naturalness with no tricky words.

Emotional — tests prosody and expressiveness.

Numbers & Dates — the common failure mode for every TTS model.