| Provider | Model | Whisper version | Footnotes | Word Error Rate (%) | Median Speed Factor | Price (USD per 1000 minutes) | Further Details |
|---|---|---|---|---|---|---|---|
| | Whisper Large v2 | large-v2 | | 10.6% | 34.4 | 6.00 | |
| | Whisper Large v2 | large-v2 | | 10.6% | 33.5 | 6.00 | |
| | Whisper Large v3 | large-v3 | | 10.3% | 238.8 | 0.50 | |
| | Incredibly Fast Whisper | large-v3 | | 10.3% | 61.6 | 1.49 | |
| | Whisper Large v2 | large-v2 | | 11.2% | 2.5 | 3.47 | |
| | Whisper Large v3 | large-v3 | | 10.3% | 3.0 | 4.23 | |
| | WhisperX | large-v3 | | 10.9% | 10.6 | 1.09 | |
| | Whisper Large v3 | large-v3 | | 10.3% | 307.9 | 1.85 | |
| | Distil-Whisper | | | 13.0% | 390.3 | 0.33 | |
| | Whisper Large v3 | large-v3 | | 10.3% | 156.2 | 1.15 | |
| | Whisper Large v3 | large-v3 | | 10.3% | 98.0 | 0.45 | |
| | Whisper Large v3 Turbo | v3 Turbo | | 12.0% | 409.9 | 0.67 | |
| | Whisper Large v3 | large-v3 | | 11.2% | 267.4 | 1.00 | |
| | Whisper Large v3 Turbo | v3 Turbo | | 13.7% | 442.0 | 1.00 | |
| | Universal-1 | | | 8.7% | 86.1 | 6.17 | |
| | Nano | | | 12.7% | 84.8 | 2.00 | |
| | Universal-2 | | | 8.6% | 84.3 | 6.17 | |
| | Standard | | | 12.6% | 17.7 | 13.33 | |
| | Enhanced | | | 8.1% | 9.3 | 6.70 | |
| | Nova-2 | | | 15.1% | 148.3 | 4.30 | |
| | Base | | | 26.1% | 164.6 | 12.50 | |
| | Whisper Large v2 | large-v2 | | 10.6% | 30.2 | 4.80 | |
| | Nova-3 | | | 12.8% | 168.9 | 4.30 | |
| | Amazon Transcribe | | | 11.2% | 19.1 | 24.00 | |
| | Fish Speech to Text | | | 19.1% | 24.1 | 0.00 | |
| | Chirp 2 | | | 9.8% | 18.3 | 16.00 | |
| | Chirp | | | 12.4% | 14.3 | 16.00 | |
| | Scribe | | | 7.7% | 43.3 | 6.67 | |
| | Gemini 2.0 Flash | | | 14.9% | 60.3 | 1.40 | |
| | Gemini 2.0 Flash Lite | | | 14.5% | 61.9 | 0.19 | |
| | GPT-4o Transcribe | | | 8.9% | 37.8 | 6.00 | |
| | GPT-4o Mini Transcribe | | | 13.2% | 49.5 | 3.00 | |
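
Because prices are quoted per 1000 minutes of audio, it can help to convert them to the cost of a concrete workload. The following is a minimal sketch of that arithmetic; the helper name `cost_usd` is hypothetical, and the three prices are copied from the table above purely as examples.

```python
def cost_usd(price_per_1000_min: float, audio_minutes: float) -> float:
    """Return the cost in USD of transcribing `audio_minutes` of audio,
    given a price quoted in USD per 1000 minutes (as in the table)."""
    return price_per_1000_min * audio_minutes / 1000.0


# Example prices taken from the table (USD per 1000 minutes).
# The dictionary itself is illustrative, not part of the source data format.
prices = {
    "Scribe": 6.67,
    "GPT-4o Transcribe": 6.00,
    "Amazon Transcribe": 24.00,
}

if __name__ == "__main__":
    for model, price in prices.items():
        # 60 minutes = one hour of audio
        print(f"{model}: ${cost_usd(price, 60):.2f} per hour of audio")
```

The same function works for any duration, for example `cost_usd(price, 10_000)` for a 10,000-minute archive.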