Comparison of API provider performance across over 500 AI Model endpoints, including from OpenAI, Google, DeepSeek and others, across performance key metrics including price, output speed, latency, context window and more. For more details including relating to our methodology, see our FAQs.
Features | Model Intelligence | Price | Output tokens/s | Latency | End-to-End Response Time | ||||
---|---|---|---|---|---|---|---|---|---|
Further Analysis | |||||||||
GPT-5 (high) | 400k | 69 | $3.44 | 134.2 | 71.84 | 75.57 | N/A | ||
![]() | GPT-5 (high) | 272k | 69 | $3.44 | 221.9 | 42.32 | 44.58 | N/A | |
GPT-5 (medium) | 400k | 68 | $3.44 | 168.3 | 35.81 | 38.78 | N/A | ||
![]() | GPT-5 (medium) | 272k | 68 | $3.44 | 225.6 | 22.16 | 24.38 | N/A | |
Grok 4 | 256k | 68 | $6.00 | 49.2 | 6.77 | 16.94 | N/A | ||
o3 | 200k | 67 | $3.50 | 217.6 | 10.84 | 13.14 | N/A | ||
![]() | o3 | 200k | 67 | $3.50 | 87.3 | 28.15 | 33.87 | N/A | |
o4-mini (high) | 200k | 65 | $1.93 | 108.9 | 40.03 | 44.63 | N/A | ||
![]() | o4-mini (high) | 200k | 65 | $1.93 | 164.7 | 23.75 | 26.78 | N/A | |
Gemini 2.5 Pro (AI_Studio) | 1m | 65 | $3.44 | 139.4 | 30.36 | 33.95 | N/A | ||
Gemini 2.5 Pro Vertex | 1m | 65 | $3.44 | 140.4 | 34.26 | 37.82 | N/A | ||
GPT-5 mini (medium) | 400k | 64 | $0.69 | 73.6 | 30.11 | 36.91 | N/A | ||
![]() | GPT-5 mini (medium) | 400k | 64 | $0.69 | 146.3 | 14.50 | 17.92 | N/A | |
![]() | Qwen3 235B 2507 (Reasoning) | 256k | 64 | $1.24 | 65.2 | 0.43 | 38.79 | 30.69 | |
![]() | Qwen3 235B 2507 (Reasoning) | 131k | 64 | $0.75 | 1,500.8 | 0.26 | 1.93 | 1.33 | |
Qwen3 235B 2507 (Reasoning) | 262k | 64 | $0.39 | 117.0 | 0.65 | 22.01 | 17.09 | ||
Qwen3 235B 2507 (Reasoning) (FP8) | 262k | 64 | $0.25 | 28.5 | 0.56 | 88.37 | 70.25 | ||
![]() | Qwen3 235B 2507 (Reasoning) | 131k | 64 | $0.97 | 49.0 | 0.81 | 51.86 | 40.84 | |
Qwen3 235B 2507 (Reasoning) (FP8) | 131k | 64 | $1.20 | 62.6 | 0.51 | 40.47 | 31.97 | ||
Qwen3 235B 2507 (Reasoning) | 262k | 64 | $1.24 | 60.0 | 0.30 | 41.97 | 33.34 | ||
GPT-5 (low) | 400k | 63 | $3.44 | 178.5 | 14.23 | 17.03 | N/A | ||
![]() | gpt-oss-120B (high) | 131k | 61 | $0.26 | 170.7 | 0.45 | 15.09 | 11.71 | |
![]() | gpt-oss-120B (high) | 131k | 61 | $0.36 | 3,229.2 | 0.28 | 1.05 | 0.62 | |
![]() | gpt-oss-120B (high) | 131k | 61 | $0.26 | 224.5 | 0.38 | 11.52 | 8.91 | |
gpt-oss-120B (high) Base | 128k | 61 | $0.26 | 193.6 | 0.52 | 13.44 | 10.33 | ||
gpt-oss-120B (high) Vertex | 131k | 61 | $0.26 | 231.0 | 0.18 | 11.00 | 8.66 | ||
![]() | gpt-oss-120B (high) | 131k | 61 | $0.10 | 193.0 | 0.22 | 13.17 | 10.36 | |
![]() | gpt-oss-120B (high) | 131k | 61 | $0.26 | 271.8 | 43.43 | 52.63 | 7.36 | |
gpt-oss-120B (high) | 131k | 61 | $0.26 | 304.3 | 0.44 | 8.66 | 6.57 | ||
gpt-oss-120B (high) | 131k | 61 | $0.18 | 222.8 | 0.21 | 11.43 | 8.98 | ||
![]() | gpt-oss-120B (high) | 131k | 61 | $0.20 | 198.7 | 0.69 | 13.27 | 10.06 | |
gpt-oss-120B (high) | 131k | 61 | $0.26 | 236.2 | 31.67 | 42.25 | 8.47 | ||
gpt-oss-120B (high) | 131k | 61 | $0.30 | 499.1 | 0.19 | 5.20 | 4.01 | ||
gpt-oss-120B (high) | 131k | 61 | $0.26 | 335.7 | 35.26 | 42.71 | 5.96 | ||
gpt-oss-120B (high) | 128k | 61 | $0.45 | 126.3 | 110.52 | 130.31 | 15.83 | ||
![]() | DeepSeek V3.1 (Reasoning) | 128k | 60 | $0.96 | 19.3 | 2.95 | 132.20 | 103.40 | |
DeepSeek V3.1 (Reasoning) (FP8) | 164k | 60 | $0.90 | 22.3 | 1.09 | 113.39 | 89.85 | ||
![]() | DeepSeek V3.1 (Reasoning) | 33k | 60 | $3.38 | 169.9 | 1.63 | 16.35 | 11.77 | |
![]() | Claude 4 Sonnet Thinking | 1m | 59 | $6.00 | 47.9 | 1.28 | 53.50 | 41.77 | |
Claude 4 Sonnet Thinking Vertex | 1m | 59 | $6.00 | 50.1 | 1.43 | 51.37 | 39.95 | ||
Claude 4 Sonnet Thinking | 1m | 59 | $6.00 | 47.6 | 1.73 | 54.28 | 42.04 | ||
DeepSeek R1 0528 | 164k | 59 | $0.92 | 40.0 | 0.38 | 62.84 | 49.97 | ||
![]() | DeepSeek R1 0528 | 64k | 59 | $0.96 | 19.1 | 2.92 | 133.98 | 104.85 | |
![]() | DeepSeek R1 0528 | 164k | 59 | $1.59 | 61.1 | 0.48 | 41.39 | 32.73 | |
DeepSeek R1 0528 | 164k | 59 | $3.00 | 91.2 | 2.43 | 29.83 | 21.92 | ||
DeepSeek R1 0528 | 164k | 59 | $1.20 | 24.0 | 0.64 | 104.80 | 83.33 | ||
DeepSeek R1 0528 Fast | 164k | 59 | $3.00 | 305.0 | 1.07 | 9.26 | 6.56 | ||
DeepSeek R1 0528 (Vertex) | 164k | 59 | $2.36 | 211.7 | 0.49 | 12.30 | 9.45 | ||
![]() | DeepSeek R1 0528 | 128k | 59 | $0.53 | 49.1 | 0.27 | 51.22 | 40.76 | |
![]() | DeepSeek R1 0528 | 128k | 59 | $2.36 | 89.1 | 0.54 | 28.61 | 22.45 | |
DeepSeek R1 0528 Fast | 164k | 59 | $4.25 | 231.0 | 0.54 | 11.36 | 8.66 | ||
DeepSeek R1 0528 | 164k | 59 | $0.91 | 83.5 | 0.38 | 30.33 | 23.96 | ||
![]() | DeepSeek R1 0528 | 164k | 59 | $1.15 | 35.0 | 0.59 | 71.96 | 57.10 | |
DeepSeek R1 0528 | 131k | 59 | $1.18 | 111.4 | 0.51 | 22.95 | 17.95 | ||
![]() | DeepSeek R1 0528 | 33k | 59 | $5.50 | 181.9 | 1.92 | 15.66 | 10.99 | |
DeepSeek R1 0528 | 164k | 59 | $4.00 | 274.5 | 0.90 | 10.00 | 7.29 | ||
DeepSeek R1 0528 (Throughput) | 164k | 59 | $0.96 | 46.5 | 0.74 | 54.45 | 42.97 | ||
Gemini 2.5 Flash (Reasoning) (AI_Studio) | 1m | 58 | $0.85 | 257.7 | 14.97 | 16.91 | N/A | ||
Gemini 2.5 Flash (Reasoning) (Vertex) | 1m | 58 | $0.85 | 316.9 | 12.60 | 14.18 | N/A | ||
Grok 3 mini Reasoning (high) | 131k | 58 | $0.35 | 186.8 | 0.57 | 13.95 | 10.71 | ||
Grok 3 mini Reasoning (high) Fast | 131k | 58 | $1.45 | 187.9 | 0.71 | 14.01 | 10.64 | ||
![]() | Grok 3 mini Reasoning (high) | 32k | 58 | $0.00 | 171.1 | 0.40 | 15.02 | 11.69 | |
![]() | GLM-4.5 (FP8) | 131k | 56 | $0.97 | 41.7 | 0.55 | 60.44 | 47.91 | |
GLM-4.5 | 131k | 56 | $0.91 | 57.8 | 0.60 | 43.85 | 34.60 | ||
![]() | Claude 4 Opus Thinking | 200k | 55 | $30.00 | 19.4 | 2.83 | 131.92 | 103.27 | |
Claude 4 Opus Thinking Vertex | 200k | 55 | $30.00 | 43.9 | 1.73 | 58.67 | 45.55 | ||
Claude 4 Opus Thinking | 200k | 55 | $30.00 | 44.5 | 1.79 | 58.00 | 44.96 | ||
Qwen3 30B 2507 (Reasoning) | 262k | 54 | $0.15 | 135.4 | 0.62 | 19.09 | 14.78 | ||
Qwen3 30B 2507 (Reasoning) (FP8) | 262k | 54 | $0.90 | 183.8 | 0.46 | 14.06 | 10.88 | ||
GPT-5 nano (medium) | 400k | 54 | $0.14 | 182.5 | 33.01 | 35.75 | N/A | ||
![]() | GPT-5 nano (medium) | 400k | 54 | $0.14 | 253.3 | 27.55 | 29.52 | N/A | |
![]() | GLM-4.5-Air | 128k | 53 | $0.32 | 105.1 | 1.11 | 24.90 | 19.04 | |
GLM-4.5-Air | 131k | 53 | $0.42 | 157.4 | 0.23 | 16.12 | 12.71 | ||
GLM-4.5-Air (FP8) | 131k | 53 | $0.42 | 197.1 | 0.36 | 13.04 | 10.15 | ||
![]() | Qwen3 235B 2507 (Non-reasoning) | 262k | 51 | $0.33 | 52.8 | 0.42 | 9.89 | N/A | |
![]() | Qwen3 235B 2507 (Non-reasoning) | 131k | 51 | $0.75 | 1,285.5 | 0.32 | 0.71 | N/A | |
Qwen3 235B 2507 (Non-reasoning) | 262k | 51 | $2.00 | 59.7 | 2.16 | 10.53 | N/A | ||
Qwen3 235B 2507 (Non-reasoning) | 262k | 51 | $0.30 | 59.4 | 0.66 | 9.07 | N/A | ||
Qwen3 235B 2507 (Non-reasoning) Vertex | 256k | 51 | $0.44 | 84.4 | 0.44 | 6.37 | N/A | ||
Qwen3 235B 2507 (Non-reasoning) (FP8) | 262k | 51 | $0.39 | 97.5 | 0.64 | 5.77 | N/A | ||
Qwen3 235B 2507 (Non-reasoning) | 262k | 51 | $0.25 | 17.9 | 0.53 | 28.49 | N/A | ||
![]() | Qwen3 235B 2507 (Non-reasoning) | 262k | 51 | $0.31 | 28.2 | 0.99 | 18.72 | N/A | |
Qwen3 235B 2507 (Non-reasoning) (FP8) | 131k | 51 | $0.40 | 55.3 | 0.55 | 9.58 | N/A | ||
Qwen3 235B 2507 (Non-reasoning) (FP8) | 262k | 51 | $0.30 | 33.9 | 0.29 | 15.02 | N/A | ||
![]() EXAONE 4.0 32B (Reasoning) | 131k | 51 | $0.70 | 83.1 | 0.27 | 30.36 | 24.07 |