Stay connected with us on X, Discord, and LinkedIn to stay up to date with future analysis

Category

Accent

Frequently Asked Questions

Inworld TTS 1 Max currently leads the Text to Speech Arena with an Elo score of 1162.

The top Text to Speech models by Elo rating are: 1. Inworld TTS 1 Max (Elo 1162), 2. Inworld TTS 1.5 Max (Elo 1115), 3. TTS-1 (Elo 1111), 4. Speech-02-Turbo (Elo 1107), 5. Multilingual v2 (Elo 1105). Rankings are based on blind user votes in the Speech Arena.

Models are ranked using an Elo rating system derived from user votes in blind comparisons in the Speech Arena. Users listen to pairs of speech samples generated from the same text and choose which sounds more natural. Higher Elo scores indicate a model produces speech preferred more often by listeners. Vote in the Speech Arena

Kokoro 82M v1.0 is the most affordable at $0.65 per 1M characters with an Elo score of 1060. Other affordable options include StyleTTS 2 at $2.82 per 1M characters.

Kokoro 82M v1.0 is the highest-ranked open weights model on the Text to Speech Leaderboard with an Elo score of 1060. There are 12 open weights models out of 61 total.

The top open weights Text to Speech models are: 1. Kokoro 82M v1.0 (Elo 1060), 2. Maya1 (Elo 1030), 3. Fish Speech 1.5 (Elo 1026).

You can filter by the following categories: Knowledge Sharing, Assistants, Entertainment, and Customer Service, and the following accents: US and UK.