Provider | Model | Whisper version | Footnotes | Word Error Rate (%) | Median Speed Factor | Price (USD per 1000 minutes) | Further Details |
---|---|---|---|---|---|---|---|
Whisper Large v2 | large-v2 | 10.6% | 31.5 | 6.00 | |||
Whisper Large v2 | large-v2 | 10.6% | 33.7 | 6.00 | |||
![]() | Whisper Large v3 | large-v3 | 10.3% | 328.1 | 0.50 | ||
Incredibly Fast Whisper | large-v3 | 10.3% | 63.4 | 1.49 | |||
Whisper Large v2 | large-v2 | 11.2% | 2.3 | 3.47 | |||
Whisper Large v3 | large-v3 | 10.3% | 2.7 | 4.23 | |||
WhisperX | large-v3 | 10.9% | 18.5 | 1.09 | |||
Whisper (M) | medium | 12.8% | 2.68 | ||||
Whisper (S) | small | 17.0% | 1.37 | ||||
![]() | Whisper Large v3 | large-v3 | 10.3% | 213.7 | 1.85 | ||
![]() | Distil-Whisper | 13.0% | 247.5 | 0.33 | |||
![]() | Whisper Large v3 | large-v3 | 10.3% | 96.0 | 0.45 | ||
![]() | Whisper Large v3 | large-v3 | 10.3% | 146.8 | 1.15 | ||
![]() | Whisper Large v3 Turbo | v3 Turbo | 12.0% | 241.0 | 0.67 | ||
![]() | Whisper Large v3 | large-v3 | 11.2% | 402.9 | 1.00 | ||
![]() | Whisper Large v3 Turbo | v3 Turbo | 13.7% | 471.9 | 1.00 | ||
![]() | Whisper-Large-v3 | large-v3 | 10.8% | 166.4 | 1.67 | ||
Whisper Large v3 | large-v3 | 10.3% | 180.4 | 1.50 | |||
![]() | Universal-1 | 8.7% | 84.7 | 6.17 | |||
![]() | Nano | 12.7% | 59.3 | 2.00 | |||
![]() | Universal-2 | 8.6% | 84.9 | 6.17 | |||
![]() | Standard | 12.6% | 17.6 | 13.33 | |||
![]() | Enhanced | 8.1% | 17.6 | 6.70 | |||
Azure AI Speech Service | 12.6% | 16.67 | |||||
![]() | Nova-2 | 15.1% | 152.7 | 4.30 | |||
![]() | Base | 26.1% | 159.3 | 12.50 | |||
![]() | Whisper Large v2 | large-v2 | 10.6% | 33.0 | 4.80 | ||
![]() | Nova-3 | 12.8% | 146.6 | 4.30 | |||
Gladia | whisper-v2-variant | 12.9% | 32.8 | 10.20 | |||
Amazon Transcribe | 11.2% | 19.8 | 24.00 | ||||
Fish Speech to Text | 19.1% | 0.00 | |||||
![]() | Rev AI | 0.0% | 20.00 | ||||
Chirp 2 | 9.8% | 19.8 | 16.00 | ||||
Chirp | 12.4% | 13.3 | 16.00 | ||||
Scribe | 7.7% | 40.6 | 6.67 | ||||
Gemini 2.0 Flash | 14.9% | 59.7 | 1.40 | ||||
Gemini 2.0 Flash Lite | 14.5% | 58.9 | 0.19 | ||||
GPT-4o Transcribe | 8.9% | 31.6 | 6.00 | ||||
GPT-4o Mini Transcribe | 13.2% | 37.3 | 3.00 | ||||
Granite 3.3 8B | 8.4% | 0.00 | |||||
Granite 3.3 8B | 0.0% | 1.20 | |||||
Parakeet RNNT 1.1B | 6.1% | 6.9 | 1.91 | ||||
![]() | Voxtral Mini | 10.4% | 70.6 | 1.00 | |||
![]() | Voxtral Small | 8.8% | 60.2 | 4.00 | |||
![]() | Voxtral Small | 0.0% | 23.5 | 3.00 | |||
![]() | Voxtral Mini | 0.0% | 79.6 | 1.00 |