| Provider | Model | Whisper version | Footnotes | Word Error Rate (%) | Median Speed Factor | Price (USD per 1000 minutes) | Further Details |
|---|---|---|---|---|---|---|---|
| | Whisper Large v2 | large-v2 | | 10.6% | 33.2 | 6.00 | |
| | Whisper Large v2 | large-v2 | | 10.6% | 33.8 | 6.00 | |
| | Whisper Large v3 | large-v3 | | 10.3% | 298.1 | 0.50 | |
| | Incredibly Fast Whisper | large-v3 | | 10.3% | 63.5 | 1.49 | |
| | Whisper Large v2 | large-v2 | | 11.2% | 2.6 | 3.47 | |
| | Whisper Large v3 | large-v3 | | 10.3% | 2.8 | 4.23 | |
| | WhisperX | large-v3 | | 10.9% | 7.6 | 1.09 | |
| | Whisper (M) | medium | | 12.8% | | 2.68 | |
| | Whisper (S) | small | | 17.0% | | 1.37 | |
| | Whisper Large v3 | large-v3 | | 10.3% | 247.9 | 1.85 | |
| | Distil-Whisper | | | 13.0% | 350.4 | 0.33 | |
| | Whisper Large v3 | large-v3 | | 10.3% | 129.6 | 1.15 | |
| | Whisper Large v3 | large-v3 | | 10.3% | 103.6 | 0.45 | |
| | Whisper Large v3 Turbo | v3 Turbo | | 12.0% | 364.4 | 0.67 | |
| | Whisper Large v3 | large-v3 | | 11.2% | 271.7 | 1.00 | |
| | Whisper Large v3 Turbo | v3 Turbo | | 13.7% | 475.9 | 1.00 | |
| | Whisper-Large-v3 | large-v3 | | 10.8% | 57.9 | 1.67 | |
| | Whisper Large v3 | large-v3 | | 10.3% | 179.6 | 1.50 | |
| | Universal-1 | | | 8.7% | 85.8 | 6.17 | |
| | Nano | | | 12.7% | 84.6 | 2.00 | |
| | Universal-2 | | | 8.6% | 85.2 | 6.17 | |
| | Standard | | | 12.6% | 17.7 | 13.33 | |
| | Enhanced | | | 8.1% | 17.5 | 6.70 | |
| | Azure AI Speech Service | | | 12.6% | | 16.67 | |
| | Nova-2 | | | 15.1% | 167.4 | 4.30 | |
| | Base | | | 26.1% | 177.7 | 12.50 | |
| | Whisper Large v2 | large-v2 | | 10.6% | 57.3 | 4.80 | |
| | Nova-3 | | | 12.8% | 156.9 | 4.30 | |
| | Gladia | whisper-v2-variant | | 12.9% | | 10.20 | |
| | Amazon Transcribe | | | 11.2% | 20.0 | 24.00 | |
| | Fish Speech to Text | | | 19.1% | | 0.00 | |
| | Rev AI | | | 0.0% | | 20.00 | |
| | Chirp 2 | | | 9.8% | 20.4 | 16.00 | |
| | Chirp | | | 12.4% | 13.4 | 16.00 | |
| | Scribe | | | 7.7% | 52.4 | 6.67 | |
| | Gemini 2.0 Flash | | | 14.9% | 59.5 | 1.40 | |
| | Gemini 2.0 Flash Lite | | | 14.5% | 58.7 | 0.19 | |
| | GPT-4o Transcribe | | | 8.9% | 35.7 | 6.00 | |
| | GPT-4o Mini Transcribe | | | 13.2% | 41.0 | 3.00 | |
| | Granite 3.3 8B | | | 8.4% | | 0.00 | |
| | Parakeet RNNT 1.1B | | | 6.1% | 6.8 | 1.91 | |
| | Voxtral Mini | | | 10.4% | 67.2 | 1.00 | |
| | Voxtral Small | | | 8.8% | 54.5 | 4.00 | |
| | Voxtral Small | | | 0.0% | 23.8 | 3.00 | |
| | Voxtral Mini | | | 0.0% | 74.1 | 1.00 | |