Provider | Model | Whisper version | Footnotes | Word Error Rate (%) | Median Speed Factor | Price (USD per 1000 minutes) | Further Details |
---|---|---|---|---|---|---|---|
Whisper Large v2 | large-v2 | 10.6% | 36.6 | 6.00 | |||
Whisper Large v2 | large-v2 | 10.6% | 33.3 | 6.00 | |||
![]() | Whisper Large v3 | large-v3 | 10.3% | 313.0 | 0.50 | ||
Incredibly Fast Whisper | large-v3 | 10.3% | 63.7 | 1.49 | |||
Whisper Large v2 | large-v2 | 11.2% | 2.4 | 3.47 | |||
Whisper Large v3 | large-v3 | 10.3% | 3.1 | 4.23 | |||
WhisperX | large-v3 | 10.9% | 7.5 | 1.09 | |||
Whisper (M) | medium | 12.8% | 2.68 | ||||
Whisper (S) | small | 17.0% | 1.37 | ||||
![]() | Whisper Large v3 | large-v3 | 10.3% | 304.4 | 1.85 | ||
![]() | Distil-Whisper | 13.0% | 352.0 | 0.33 | |||
![]() | Whisper Large v3 | large-v3 | 10.3% | 96.5 | 0.45 | ||
![]() | Whisper Large v3 | large-v3 | 10.3% | 146.8 | 1.15 | ||
![]() | Whisper Large v3 Turbo | v3 Turbo | 12.0% | 384.7 | 0.67 | ||
Whisper Large v3 | large-v3 | 11.2% | 407.8 | 1.00 | |||
Whisper Large v3 Turbo | v3 Turbo | 13.7% | 492.9 | 1.00 | |||
![]() | Whisper-Large-v3 | large-v3 | 10.8% | 110.5 | 1.67 | ||
Whisper Large v3 | large-v3 | 10.3% | 187.3 | 1.50 | |||
![]() | Universal-1 | 8.7% | 84.5 | 6.17 | |||
![]() | Nano | 12.7% | 58.7 | 2.00 | |||
![]() | Universal-2 | 8.6% | 85.6 | 6.17 | |||
![]() | Standard | 12.6% | 17.6 | 13.33 | |||
![]() | Enhanced | 8.1% | 17.6 | 6.70 | |||
Azure AI Speech Service | 12.6% | 16.67 | |||||
![]() | Nova-2 | 15.1% | 149.6 | 4.30 | |||
![]() | Base | 26.1% | 173.0 | 12.50 | |||
![]() | Whisper Large v2 | large-v2 | 10.6% | 37.2 | 4.80 | ||
![]() | Nova-3 | 12.8% | 158.1 | 4.30 | |||
Gladia | whisper-v2-variant | 12.9% | 27.8 | 10.20 | |||
Amazon Transcribe | 11.2% | 20.3 | 24.00 | ||||
Fish Speech to Text | 19.1% | 0.00 | |||||
![]() | Rev AI | 0.0% | 20.00 | ||||
Chirp 2 | 9.8% | 19.0 | 16.00 | ||||
Chirp | 12.4% | 13.4 | 16.00 | ||||
Scribe | 7.7% | 45.7 | 6.67 | ||||
Gemini 2.0 Flash | 14.9% | 61.6 | 1.40 | ||||
Gemini 2.0 Flash Lite | 14.5% | 58.5 | 0.19 | ||||
GPT-4o Transcribe | 8.9% | 33.0 | 6.00 | ||||
GPT-4o Mini Transcribe | 13.2% | 37.2 | 3.00 | ||||
Granite 3.3 8B | 8.4% | 0.00 | |||||
Granite 3.3 8B | 0.0% | 1.20 | |||||
Parakeet RNNT 1.1B | 6.1% | 6.9 | 1.91 | ||||
![]() | Voxtral Mini | 10.4% | 72.1 | 1.00 | |||
![]() | Voxtral Small | 8.8% | 59.9 | 4.00 | |||
![]() | Voxtral Small | 0.0% | 22.8 | 3.00 | |||
![]() | Voxtral Mini | 0.0% | 62.0 | 1.00 |