Text to Speech AI Model & Provider Leaderboard
Analysis and comparison of Text to Speech generation models & API providers. Artificial Analysis has analyzed text to speech models and hosting providers across quality, generation time, and price. For further details, see our methodology page.
Text to speech models & providers compared: Standard, OpenAI TTS, HD, OpenAI TTS, Studio, Google Cloud TTS, Journey, Google Cloud TTS, Neural2, Google Cloud TTS, WaveNet, Google Cloud TTS, Standard, Google Cloud TTS, Long-form, Amazon Polly, Neural, Amazon Polly, Standard, Amazon Polly, Neural, Microsoft Azure, MetaVoice v1, XTTS v2, StyleTTS 2, OpenVoice v2, Sonic English (Oct '24), Cartesia, Turbo v2.5, ElevenLabs, Multilingual v2, ElevenLabs, and LMNT.
Highlights
Summary Analysis
Quality vs. Price
Quality vs. Speed
Speed vs. Price
Quality
Quality Arena ELO (Text to Speech Arena)
Arena Win Rate
Participate in the Speech Arena to contribute to the crowdsourced quality evaluations
Speed
Characters Per Second
Speed Factor
Characters Per Second, Variance
Characters per Second, Over Time
Price
Price
Streaming
Streaming Support
Provider | Streaming Support |
---|---|
OpenAI | |
Amazon | |
Azure | |
Replicate | |
ElevenLabs | |
LMNT |
Provider | Model | Streaming support | Footnotes | Model Arena ELO | Characters per Second | Price per 1M Characters (USD) | Further Details |
---|---|---|---|---|---|---|---|
OpenAI | HD, OpenAI TTS | 1192 | 58.8 | $30.00 | |||
ElevenLabs | Multilingual v2, ElevenLabs | 1160 | 77.3 | $206.00 | |||
ElevenLabs | Turbo v2.5, ElevenLabs | 1151 | 347.6 | $103.00 | |||
OpenAI | Standard, OpenAI TTS | 1151 | 84.7 | $15.00 | |||
Cartesia | Sonic English (Oct '24), Cartesia | 1136 | 39.1 | $46.70 | |||
Microsoft Azure | Neural, Microsoft Azure | 1083 | 272.4 | $15.00 | |||
Amazon Bedrock | Long-form, Amazon Polly | 1076 | 345.8 | $100.00 | |||
Studio, Google Cloud TTS | 1060 | 282.7 | $160.00 | ||||
Journey, Google Cloud TTS | 1017 | 113.8 | $160.00 | ||||
LMNT | LMNT | 996 | 306.0 | $43.60 | |||
Replicate | OpenVoice v2 | 981 | 9.7 | $10.89 | |||
Replicate | XTTS v2 | 928 | 35.5 | $29.64 | |||
Replicate | StyleTTS 2 | 914 | 2.3 | $2.84 | |||
WaveNet, Google Cloud TTS | 899 | 440.8 | $16.00 | ||||
Amazon Bedrock | Neural, Amazon Polly | 891 | 462.2 | $16.00 | |||
Standard, Google Cloud TTS | 862 | 529.6 | $4.00 | ||||
Neural2, Google Cloud TTS | 860 | 582.0 | $16.00 | ||||
Replicate | MetaVoice v1 | 810 | 0.6 | $123.97 | |||
Amazon Bedrock | Standard, Amazon Polly | 807 | 1075.4 | $4.00 |