Artificial Analysis LLM Performance Leaderboard
Independent performance benchmarks & pricing across API providers of LLMs. Definitions are below the table.
For further analysis and methodology, see artificialanalysis.ai.
For further analysis and methodology, see artificialanalysis.ai.
Features | Model Intelligence | Price | Output tokens/s | Latency | End-to-End Response Time | ||||
---|---|---|---|---|---|---|---|---|---|
Further Analysis | |||||||||
o4-mini (high) | 200k | 70 | $1.93 | 121.9 | 36.66 | 40.76 | N/A | ||
![]() | o4-mini (high) | 200k | 70 | $1.93 | 37.7 | 136.48 | 149.76 | N/A | |
Gemini 2.5 Pro | 1m | 69 | $3.44 | 145.4 | 38.88 | 42.32 | N/A | ||
o3 | 128k | 67 | $17.50 | 190.3 | 14.64 | 17.27 | N/A | ||
![]() | o3 | 128k | 67 | $17.50 | 55.6 | 66.56 | 75.56 | N/A | |
Grok 3 mini Reasoning (high) | 131k | 67 | $0.35 | 93.0 | 0.25 | 27.13 | 21.50 | ||
Grok 3 mini Reasoning (high) Fast | 131k | 67 | $1.45 | 226.5 | 0.24 | 11.27 | 8.83 | ||
o3-mini (high) | 200k | 66 | $1.93 | 146.5 | 44.43 | 47.84 | N/A | ||
![]() | o3-mini (high) | 200k | 66 | $1.93 | 189.8 | 39.21 | 41.84 | N/A | |
o3-mini | 200k | 63 | $1.93 | 145.0 | 15.68 | 19.13 | N/A | ||
![]() | o3-mini | 200k | 63 | $1.93 | 184.6 | 13.53 | 16.23 | N/A | |
Qwen3 235B A22B (Reasoning) Base | 41k | 62 | $0.30 | 35.1 | 0.61 | 71.74 | 56.90 | ||
Qwen3 235B A22B (Reasoning) | 128k | 62 | $0.10 | 77.3 | 0.59 | 32.92 | 25.86 | ||
Qwen3 235B A22B (Reasoning) (FP8) | 41k | 62 | $0.30 | 18.4 | 0.72 | 136.73 | 108.81 | ||
![]() | Qwen3 235B A22B (Reasoning) (FP8) | 128k | 62 | $0.35 | 32.2 | 0.72 | 78.36 | 62.11 | |
Qwen3 235B A22B (Reasoning) (FP8) | 41k | 62 | $0.30 | 29.7 | 0.27 | 84.31 | 67.23 | ||
![]() | Qwen3 235B A22B (Reasoning) (FP8) | 41k | 62 | $0.61 | 44.4 | 0.48 | 56.77 | 45.03 | |
o1 | 200k | 62 | $26.25 | 131.0 | 23.45 | 27.26 | N/A | ||
![]() | o1 | 200k | 62 | $26.25 | 110.3 | 24.64 | 29.17 | N/A | |
Gemini 2.5 Flash (Reasoning) (AI_Studio) | 1m | 60 | $0.99 | 344.2 | 8.70 | 10.15 | N/A | ||
![]() DeepSeek R1 | 164k | 60 | $0.95 | 38.4 | 0.35 | 74.54 | 61.16 | ||
![]() | ![]() DeepSeek R1 | 64k | 60 | $0.96 | 24.0 | 3.69 | 122.11 | 97.62 | |
![]() DeepSeek R1 | 128k | 60 | $2.00 | 90.4 | 0.97 | 32.46 | 25.96 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $2.36 | 58.7 | 0.43 | 48.91 | 39.96 | |
![]() DeepSeek R1 Base | 128k | 60 | $1.20 | 29.5 | 0.68 | 97.22 | 79.59 | ||
![]() DeepSeek R1 Fast | 128k | 60 | $3.00 | 82.1 | 0.65 | 35.33 | 28.59 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $3.99 | 83.8 | 0.44 | 34.41 | 28.00 | |
![]() | ![]() DeepSeek R1 | 128k | 60 | $2.36 | 95.9 | 0.52 | 30.20 | 24.47 | |
![]() DeepSeek R1 (Fast) | 164k | 60 | $4.25 | 114.9 | 0.68 | 25.47 | 20.43 | ||
![]() DeepSeek R1 (Turbo, FP4) | 33k | 60 | $1.50 | 116.9 | 0.32 | 24.68 | 20.08 | ||
![]() DeepSeek R1 | 64k | 60 | $0.96 | 24.9 | 0.54 | 114.82 | 94.21 | ||
![]() DeepSeek R1 | 128k | 60 | $4.00 | 94.4 | 0.46 | 30.63 | 24.87 | ||
![]() | ![]() DeepSeek R1 Turbo | 64k | 60 | $1.15 | 33.9 | 0.80 | 84.78 | 69.23 | |
![]() | ![]() DeepSeek R1 | 64k | 60 | $4.00 | 33.6 | 1.02 | 85.64 | 69.76 | |
![]() | ![]() DeepSeek R1 | 16k | 60 | $5.50 | 190.9 | 1.72 | 16.64 | 12.30 | |
![]() DeepSeek R1 | 128k | 60 | $4.00 | 98.3 | 0.60 | 29.56 | 23.88 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $7.00 | 30.1 | 0.53 | 94.98 | 77.86 | |
Qwen3 32B (Reasoning) Base | 41k | 59 | $0.15 | 35.9 | 0.60 | 70.26 | 55.73 | ||
Qwen3 32B (Reasoning) (FP8) | 41k | 59 | $0.15 | 45.6 | 0.59 | 55.44 | 43.88 | ||
![]() | Qwen3 32B (Reasoning) (FP8) | 128k | 59 | $0.19 | 25.7 | 0.89 | 98.24 | 77.88 | |
![]() | Qwen3 32B (Reasoning) | 8k | 59 | $0.50 | 334.3 | 0.52 | 8.00 | 5.98 | |
QwQ-32B | 131k | 58 | $0.20 | 46.4 | 1.05 | 65.49 | 53.67 | ||
QwQ-32B Base | 131k | 58 | $0.23 | 20.7 | 3.58 | 148.33 | 120.55 | ||
![]() | QwQ-32B | 131k | 58 | $0.65 | 85.2 | 0.33 | 35.44 | 29.24 | |
QwQ-32B | 131k | 58 | $0.90 | 138.0 | 0.52 | 22.19 | 18.05 | ||
QwQ-32B | 131k | 58 | $0.14 | 35.8 | 0.57 | 84.23 | 69.67 | ||
QwQ-32B | 131k | 58 | $0.32 | 403.7 | 0.24 | 7.65 | 6.17 | ||
![]() | QwQ-32B | 16k | 58 | $0.63 | 433.7 | 0.40 | 7.29 | 5.74 | |
QwQ-32B | 131k | 58 | $1.20 | 89.0 | 0.51 | 34.11 | 27.98 | ||
Qwen3 14B (Reasoning) Base | 41k | 56 | $0.12 | 87.3 | 0.52 | 29.15 | 22.90 | ||
Qwen3 14B (Reasoning) (FP8) | 128k | 56 | $0.12 | 85.3 | 0.54 | 29.86 | 23.46 | ||
![]() | Qwen3 14B (Reasoning) (FP8) | 128k | 56 | $0.12 | 45.2 | 0.79 | 56.13 | 44.27 | |
Qwen3 30B A3B (Reasoning) Fast | 41k | 56 | $0.45 | 136.7 | 0.56 | 18.85 | 14.63 | ||
Qwen3 30B A3B (Reasoning) Base | 41k | 56 | $0.15 | 87.9 | 0.52 | 28.97 | 22.76 | ||
Qwen3 30B A3B (Reasoning) | 131k | 56 | $0.90 | 149.8 | 0.57 | 17.26 | 13.36 | ||
Qwen3 30B A3B (Reasoning) (FP8) | 41k | 56 | $0.15 | 37.5 | 0.56 | 67.27 | 53.37 | ||
![]() | Qwen3 30B A3B (Reasoning) (FP8) | 128k | 56 | $0.19 | 59.4 | 0.66 | 42.76 | 33.67 | |
o1-mini | 128k | 54 | $1.93 | 202.7 | 10.79 | 13.25 | N/A | ||
![]() | o1-mini | 128k | 54 | $1.93 | 250.1 | 9.27 | 11.27 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 64k | 53 | $0.48 | 24.9 | 3.69 | 23.77 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.45 | 103.2 | 1.22 | 6.06 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.25 | 29.2 | 1.18 | 18.32 | N/A | ||
![]() DeepSeek V3 (Mar' 25) Fast | 128k | 53 | $3.00 | 77.2 | 0.68 | 7.16 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $0.75 | 20.9 | 0.65 | 24.55 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $0.80 | 74.9 | 0.45 | 7.13 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $2.00 | 125.9 | 0.50 | 4.47 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 160k | 53 | $0.90 | 98.6 | 0.56 | 5.63 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $0.52 | 19.6 | 0.51 | 25.96 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $0.57 | 27.7 | 0.97 | 19.03 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 8k | 53 | $1.13 | 263.0 | 0.62 | 2.52 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.25 | 149.1 | 0.39 | 3.75 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $1.25 | 18.4 | 0.55 | 27.78 | N/A | |
GPT-4.1 mini | 1m | 53 | $0.70 | 73.1 | 0.56 | 7.40 | N/A | ||
![]() | GPT-4.1 mini | 1m | 53 | $0.70 | 193.0 | 0.63 | 3.22 | N/A | |
GPT-4.1 | 1m | 53 | $3.50 | 120.4 | 0.56 | 4.71 | N/A | ||
![]() | GPT-4.1 | 1m | 53 | $3.50 | 158.9 | 0.80 | 3.95 | N/A | |
![]() DeepSeek R1 Distill Qwen 32B | 128k | 52 | $0.14 | 45.0 | 0.30 | 55.81 | 44.41 | ||
![]() | ![]() DeepSeek R1 Distill Qwen 32B | 64k | 52 | $0.30 | 20.7 | 1.15 | 121.65 | 96.40 | |
![]() | Qwen3 8B (Reasoning) (FP8) | 128k | 51 | $0.06 | 69.7 | 0.72 | 36.59 | 28.69 | |
Grok 3 | 131k | 51 | $6.00 | 48.6 | 0.46 | 10.76 | N/A | ||
Grok 3 Fast | 131k | 51 | $10.00 | 103.2 | 0.35 | 5.20 | N/A | ||
Llama 4 Maverick (FP8) | 1m | 51 | $0.28 | 110.5 | 0.34 | 4.86 | N/A | ||
![]() | Llama 4 Maverick | 128k | 51 | $0.42 | 190.7 | 0.47 | 3.09 | N/A | |
Llama 4 Maverick Vertex | 524k | 51 | $0.00 | 122.2 | 0.32 | 4.42 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.20 | 126.7 | 0.24 | 4.19 | N/A | |
![]() | Llama 4 Maverick (FP8) | 128k | 51 | $0.61 | 62.3 | 0.34 | 8.36 | N/A | |
Llama 4 Maverick | 1m | 51 | $0.39 | 160.1 | 0.46 | 3.58 | N/A | ||
Llama 4 Maverick (FP8) | 131k | 51 | $0.30 | 95.6 | 0.49 | 5.72 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.34 | 69.8 | 0.56 | 7.72 | N/A | |
Llama 4 Maverick | 128k | 51 | $0.30 | 560.5 | 0.18 | 1.07 | N/A | ||
![]() | Llama 4 Maverick | 8k | 51 | $0.92 | 784.5 | 0.39 | 1.03 | N/A | |
Llama 4 Maverick (FP8) | 524k | 51 | $0.41 | 97.6 | 0.21 | 5.33 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.35 | 158.8 | 0.39 | 3.54 | N/A | |
GPT-4o (March 2025) | 128k | 50 | $7.50 | 120.5 | 0.37 | 4.52 | N/A | ||
Gemini 2.0 Pro Experimental (AI Studio) | 2m | 49 | $0.00 | 23.3 | 17.40 | 38.84 | N/A | ||
![]() | ![]() DeepSeek R1 Distill Qwen 14B | 64k | 49 | $0.15 | 19.6 | 0.88 | 128.46 | 102.07 | |
![]() DeepSeek R1 Distill Qwen 14B | 128k | 49 | $1.60 | 169.2 | 0.38 | 15.16 | 11.82 | ||
Gemini 2.5 Flash (AI_Studio) | 1m | 49 | $0.26 | 266.1 | 0.38 | 2.26 | N/A | ||
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.30 | 62.0 | 0.30 | 40.62 | 32.26 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 66k | 48 | $0.94 | 2,396.9 | 0.22 | 1.26 | 0.83 | |
![]() DeepSeek R1 Distill Llama 70B Base | 128k | 48 | $0.38 | 49.9 | 0.61 | 50.73 | 40.09 | ||
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.34 | 31.0 | 0.31 | 81.06 | 64.60 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 32k | 48 | $0.39 | 72.6 | 0.64 | 35.06 | 27.53 | |
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.81 | 417.0 | 0.17 | 6.17 | 4.80 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 16k | 48 | $0.88 | 312.4 | 1.65 | 9.66 | 6.40 | |
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $2.00 | 120.8 | 0.40 | 21.09 | 16.55 | ||
![]() | Claude 3.7 Sonnet | 200k | 48 | $6.00 | 47.0 | 1.05 | 11.68 | N/A | |
Claude 3.7 Sonnet | 200k | 48 | $6.00 | 78.1 | 1.04 | 7.44 | N/A | ||
Gemini 2.0 Flash Vertex | 1m | 48 | $0.26 | 231.5 | 0.28 | 2.44 | N/A | ||
Gemini 2.0 Flash (AI Studio) | 1m | 48 | $0.17 | 233.4 | 0.36 | 2.50 | N/A | ||
Qwen3 4B (Reasoning) Fast | 41k | 47 | $0.12 | 157.6 | 0.51 | 16.37 | 12.69 | ||
![]() | Qwen3 4B (Reasoning) (FP8) | 128k | 47 | $0.00 | 127.8 | 0.64 | 20.20 | 15.65 | |
![]() | ![]() Reka Flash 3 | 128k | 47 | $0.35 | 56.0 | 0.96 | 45.59 | 35.71 | |
Gemini 2.0 Flash (exp) (AI Studio) | 1m | 46 | $0.00 | 231.7 | 0.24 | 2.40 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) | 66k | 46 | $0.48 | 24.9 | 3.51 | 23.58 | N/A | |
![]() DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $0.25 | 31.5 | 1.24 | 17.09 | N/A | ||
![]() DeepSeek V3 (Dec '24) | 128k | 46 | $0.75 | 21.1 | 0.69 | 24.35 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) | 128k | 46 | $2.00 | 80.9 | 0.45 | 6.63 | N/A | |
![]() DeepSeek V3 (Dec '24) | 128k | 46 | $1.31 | 50.2 | 0.82 | 10.78 | N/A | ||
![]() DeepSeek V3 (Dec '24) | 64k | 46 | $0.59 | 21.9 | 0.46 | 23.27 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) Turbo | 64k | 46 | $0.63 | 30.6 | 0.83 | 17.18 | N/A | |
![]() | ![]() DeepSeek V3 (Dec '24) | 64k | 46 | $0.89 | 30.8 | 0.83 | 17.08 | N/A | |
![]() DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $1.25 | 146.4 | 0.39 | 3.81 | N/A | ||
Qwen2.5 Max | 32k | 45 | $2.80 | 49.5 | 1.26 | 11.35 | N/A | ||
Gemini 1.5 Pro (Sep) (Vertex) | 2m | 45 | $2.19 | 92.9 | 0.42 | 5.80 | N/A | ||
Gemini 1.5 Pro (Sep) (AI Studio) | 2m | 45 | $2.19 | 92.5 | 0.41 | 5.81 | N/A | ||
![]() | Claude 3.5 Sonnet (Oct) | 200k | 44 | $6.00 | 42.4 | 1.08 | 12.88 | N/A | |
Claude 3.5 Sonnet (Oct) Vertex | 200k | 44 | $6.00 | 79.4 | 0.81 | 7.10 | N/A | ||
Claude 3.5 Sonnet (Oct) | 200k | 44 | $6.00 | 75.3 | 2.47 | 9.10 | N/A | ||
![]() Sonar | 127k | 43 | $1.00 | 101.5 | 1.67 | 6.60 | N/A | ||
Llama 4 Scout | 1m | 43 | $0.14 | 109.5 | 0.25 | 4.82 | N/A | ||
![]() | Llama 4 Scout | 32k | 43 | $0.70 | 2,863.7 | 0.22 | 0.40 | N/A | |
![]() | Llama 4 Scout | 128k | 43 | $0.29 | 160.4 | 0.49 | 3.61 | N/A | |
Llama 4 Scout Vertex | 1m | 43 | $0.00 | 129.4 | 0.35 | 4.21 | N/A | ||
![]() | Llama 4 Scout | 1m | 43 | $0.10 | 111.5 | 0.23 | 4.71 | N/A | |
![]() | Llama 4 Scout | 128k | 43 | $0.34 | 36.1 | 0.34 | 14.20 | N/A | |
Llama 4 Scout | 1m | 43 | $0.26 | 149.8 | 0.51 | 3.84 | N/A | ||
Llama 4 Scout | 131k | 43 | $0.15 | 73.7 | 0.49 | 7.27 | N/A | ||
![]() | Llama 4 Scout | 131k | 43 | $0.20 | 76.3 | 0.62 | 7.17 | N/A | |
Llama 4 Scout | 131k | 43 | $0.17 | 550.5 | 0.40 | 1.30 | N/A | ||
![]() | Llama 4 Scout | 8k | 43 | $0.47 | 776.4 | 0.79 | 1.43 | N/A | |
Llama 4 Scout | 328k | 43 | $0.28 | 116.0 | 0.19 | 4.50 | N/A | ||
![]() | Llama 4 Scout | 128k | 43 | $0.71 | 90.3 | 0.44 | 5.98 | N/A | |
![]() Sonar Pro | 200k | 43 | $6.00 | 82.9 | 2.52 | 8.55 | N/A | ||
QwQ 32B-Preview | 33k | 43 | $0.26 | 45.3 | 0.33 | 55.49 | 44.13 | ||
QwQ 32B-Preview | 33k | 43 | $1.20 | 87.3 | 0.50 | 29.15 | 22.92 | ||
![]() | ![]() Nova Premier | 1m | 43 | $5.00 | 63.1 | 0.82 | 8.74 | N/A | |
GPT-4o (Nov '24) | 128k | 41 | $4.38 | 114.5 | 0.55 | 4.92 | N/A | ||
![]() | GPT-4o (Nov '24) | 128k | 41 | $4.38 | 121.1 | 1.20 | 5.33 | N/A | |
Gemini 2.0 Flash-Lite (Feb '25) (AI Studio) | 1m | 41 | $0.13 | 202.1 | 0.29 | 2.77 | N/A | ||
Llama 3.3 70B (FP8) | 128k | 41 | $0.17 | 46.6 | 0.29 | 11.01 | N/A | ||
![]() | Llama 3.3 70B | 33k | 41 | $0.94 | 2,433.4 | 0.22 | 0.43 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.40 | 31.8 | 1.17 | 16.88 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.71 | 159.1 | 0.56 | 3.70 | N/A | |
Llama 3.3 70B Fast | 128k | 41 | $0.38 | 132.7 | 0.56 | 4.33 | N/A | ||
Llama 3.3 70B Base | 128k | 41 | $0.20 | 32.6 | 0.64 | 15.99 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.50 | 138.1 | 0.38 | 4.00 | N/A | |
![]() | Llama 3.3 70B | 128k | 41 | $0.71 | 54.3 | 0.42 | 9.63 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.90 | 161.9 | 0.48 | 3.57 | N/A | ||
Llama 3.3 70B (Turbo, FP8) | 128k | 41 | $0.20 | 29.6 | 0.27 | 17.15 | N/A | ||
Llama 3.3 70B | 128k | 41 | $0.27 | 15.7 | 0.65 | 32.51 | N/A | ||
Llama 3.3 70B | 128k | 41 | $0.60 | 162.9 | 0.41 | 3.48 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.20 | 37.7 | 0.61 | 13.86 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.64 | 309.4 | 0.36 | 1.97 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.75 | 457.8 | 0.30 | 1.39 | N/A | |
Llama 3.3 70B Turbo | 128k | 41 | $0.88 | 121.4 | 0.41 | 4.53 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.70 | 30.7 | 0.42 | 16.70 | N/A | |
GPT-4.1 nano | 1m | 41 | $0.17 | 215.0 | 0.39 | 2.71 | N/A | ||
![]() | GPT-4.1 nano | 1m | 41 | $0.17 | 69.1 | 1.04 | 8.28 | N/A | |
GPT-4o (May '24) | 128k | 41 | $7.50 | 113.6 | 0.40 | 4.80 | N/A | ||
![]() | GPT-4o (May '24) | 128k | 41 | $7.50 | 115.1 | 0.83 | 5.18 | N/A | |
Llama 3.1 405B (FP8) | 128k | 40 | $0.80 | 32.2 | 0.33 | 15.87 | N/A | ||
Llama 3.1 405B | 128k | 40 | $9.50 | 19.2 | 0.98 | 27.08 | N/A | ||
Llama 3.1 405B | 128k | 40 | $4.00 | 87.4 | 1.11 | 6.83 | N/A | ||
![]() | Llama 3.1 405B Standard | 128k | 40 | $2.40 | 29.8 | 1.81 | 18.56 | N/A | |
![]() | Llama 3.1 405B Latency Optimized | 128k | 40 | $3.00 | 86.9 | 0.43 | 6.18 | N/A | |
Llama 3.1 405B Base | 128k | 40 | $1.50 | 32.4 | 0.70 | 16.11 | N/A | ||
Llama 3.1 405B Vertex | 128k | 40 | $7.75 | 29.0 | 0.40 | 17.63 | N/A | ||
![]() | Llama 3.1 405B | 128k | 40 | $8.00 | 31.6 | 0.46 | 16.28 | N/A | |
Llama 3.1 405B | 128k | 40 | $3.00 | 87.9 | 0.58 | 6.27 | N/A | ||
Llama 3.1 405B | 33k | 40 | $0.90 | 25.6 | 0.43 | 19.98 | N/A | ||
![]() | Llama 3.1 405B | 16k | 40 | $6.25 | 173.7 | 1.63 | 4.51 | N/A | |
Llama 3.1 405B | 128k | 40 | $7.50 | 38.3 | 0.79 | 13.86 | N/A | ||
Llama 3.1 405B Turbo | 128k | 40 | $3.50 | 71.3 | 0.44 | 7.45 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.40 | 30.5 | 1.36 | 17.73 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.20 | 24.6 | 0.69 | 21.06 | N/A | ||
Qwen2.5 72B Fast | 131k | 40 | $0.38 | 66.7 | 0.54 | 8.04 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.90 | 69.7 | 0.47 | 7.64 | N/A | ||
Qwen2.5 72B | 33k | 40 | $0.27 | 33.6 | 0.59 | 15.49 | N/A | ||
Qwen2.5 72B Turbo | 131k | 40 | $1.20 | 37.9 | 0.73 | 13.91 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.00 | 42.8 | 1.14 | 12.82 | N/A | ||
![]() | ![]() MiniMax-Text-01 | 1m | 40 | $0.42 | 35.1 | 0.87 | 15.10 | N/A | |
Phi-4 | 16k | 40 | $0.15 | 118.9 | 0.50 | 4.71 | N/A | ||
![]() | Phi-4 | 16k | 40 | $0.22 | 39.5 | 0.43 | 13.08 | N/A | |
Phi-4 | 16k | 40 | $0.09 | 41.1 | 0.50 | 12.67 | N/A | ||
![]() Command A | 256k | 40 | $4.38 | 56.1 | 0.24 | 9.15 | N/A | ||
Gemini 1.5 Flash (Sep) (Vertex) | 1m | 39 | $0.13 | 188.5 | 0.20 | 2.85 | N/A | ||
Gemini 1.5 Flash (Sep) (AI Studio) | 1m | 39 | $0.13 | 187.3 | 0.30 | 2.97 | N/A | ||
![]() | ![]() Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 69.6 | 0.50 | 7.69 | N/A | |
![]() | ![]() Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 36.7 | 0.49 | 14.11 | N/A | |
![]() | Qwen3 1.7B (Reasoning) (FP8) | 32k | 38 | $0.00 | 167.7 | 0.60 | 15.51 | 11.92 | |
Gemma 3 27B | 128k | 38 | $0.07 | 33.4 | 0.68 | 15.63 | N/A | ||
Grok Beta | 128k | 38 | $7.50 | 66.5 | 0.29 | 7.81 | N/A | ||
![]() | ![]() Pixtral Large | 128k | 37 | $3.00 | 39.0 | 0.58 | 13.38 | N/A | |
Qwen2.5 Instruct 32B Fast | 128k | 37 | $0.20 | 82.9 | 0.53 | 6.56 | N/A | ||
Qwen2.5 Instruct 32B Base | 128k | 37 | $0.10 | 38.7 | 0.60 | 13.52 | N/A | ||
Llama 3.1 Nemotron 70B (FP8) | 128k | 37 | $0.17 | 48.7 | 0.31 | 10.57 | N/A | ||
Llama 3.1 Nemotron 70B Base | 128k | 37 | $0.20 | 39.6 | 0.67 | 13.29 | N/A | ||
Llama 3.1 Nemotron 70B Fast | 128k | 37 | $0.38 | 70.8 | 0.55 | 7.61 | N/A | ||
Llama 3.1 Nemotron 70B | 128k | 37 | $0.27 | 27.5 | 0.63 | 18.82 | N/A | ||
![]() | ![]() Nova Pro | 300k | 37 | $1.40 | 156.8 | 0.36 | 3.55 | N/A | |
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 38.5 | 0.43 | 13.43 | N/A | |
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 33.5 | 0.44 | 15.36 | N/A | |
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 35.7 | 0.50 | 14.50 | N/A | |
Qwen2.5 Coder 32B | 33k | 36 | $0.09 | 41.1 | 0.31 | 12.47 | N/A | ||
Qwen2.5 Coder 32B | 131k | 36 | $0.20 | 57.0 | 0.91 | 9.68 | N/A | ||
Qwen2.5 Coder 32B | 33k | 36 | $0.10 | 51.6 | 0.25 | 9.94 | N/A | ||
Qwen2.5 Coder 32B | 131k | 36 | $0.80 | 81.9 | 0.45 | 6.55 | N/A | ||
GPT-4o mini | 128k | 36 | $0.26 | 68.6 | 0.48 | 7.76 | N/A | ||
![]() | GPT-4o mini | 128k | 36 | $0.26 | 118.9 | 0.93 | 5.14 | N/A | |
Llama 3.1 70B (FP8) | 128k | 35 | $0.17 | 48.9 | 0.29 | 10.53 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.40 | 132.4 | 1.14 | 4.91 | N/A | ||
![]() | Llama 3.1 70B Standard | 128k | 35 | $0.72 | 31.6 | 0.64 | 16.48 | N/A | |
![]() | Llama 3.1 70B Latency Optimized | 128k | 35 | $0.90 | 134.3 | 0.31 | 4.03 | N/A | |
Llama 3.1 70B Base | 128k | 35 | $0.20 | 10.9 | 0.83 | 46.71 | N/A | ||
Llama 3.1 70B Fast | 128k | 35 | $0.38 | 145.5 | 0.54 | 3.98 | N/A | ||
Llama 3.1 70B Vertex | 128k | 35 | $0.00 | 72.7 | 0.28 | 7.17 | N/A | ||
![]() | Llama 3.1 70B | 128k | 35 | $2.90 | 53.1 | 0.43 | 9.85 | N/A | |
Llama 3.1 70B | 128k | 35 | $0.90 | 157.4 | 0.42 | 3.60 | N/A | ||
Llama 3.1 70B (Turbo, FP8) | 128k | 35 | $0.20 | 35.9 | 0.28 | 14.23 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.27 | 15.0 | 0.58 | 33.94 | N/A | ||
![]() | Llama 3.1 70B | 32k | 35 | $0.19 | 54.5 | 1.06 | 10.23 | N/A | |
Llama 3.1 70B Turbo | 128k | 35 | $0.88 | 118.9 | 0.42 | 4.62 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.90 | 127.4 | 0.50 | 4.42 | N/A | ||
![]() | ![]() Mistral Small 3.1 | 128k | 35 | $0.15 | 74.3 | 0.37 | 7.10 | N/A | |
![]() Mistral Small 3.1 Vertex | 128k | 35 | $0.15 | 53.7 | 0.18 | 9.49 | N/A | ||
![]() | ![]() Mistral Small 3 | 32k | 35 | $0.15 | 160.9 | 0.28 | 3.39 | N/A | |
![]() Mistral Small 3 | 32k | 35 | $0.09 | 74.6 | 0.21 | 6.92 | N/A | ||
![]() Mistral Small 3 | 32k | 35 | $0.80 | 95.2 | 0.20 | 5.45 | N/A | ||
![]() | Claude 3 Opus | 200k | 35 | $30.00 | 25.6 | 1.22 | 20.77 | N/A | |
Claude 3 Opus Vertex | 200k | 35 | $30.00 | 22.2 | 2.40 | 24.93 | N/A | ||
Claude 3 Opus | 200k | 35 | $30.00 | 27.5 | 1.07 | 19.28 | N/A | ||
![]() | Claude 3.5 Haiku Standard | 200k | 35 | $1.60 | 57.3 | 1.28 | 10.01 | N/A | |
![]() | Claude 3.5 Haiku Latency Optimized | 200k | 35 | $2.00 | 96.0 | 0.50 | 5.71 | N/A | |
Claude 3.5 Haiku Vertex | 200k | 35 | $1.60 | 65.6 | 2.49 | 10.11 | N/A | ||
Claude 3.5 Haiku | 200k | 35 | $1.60 | 65.6 | 0.96 | 8.58 | N/A | ||
![]() | ![]() DeepSeek R1 Distill Llama 8B | 32k | 34 | $0.04 | 52.7 | 0.71 | 48.12 | 37.93 | |
Gemma 3 12B | 128k | 34 | $0.06 | 28.0 | 0.70 | 18.58 | N/A | ||
Gemini 1.5 Pro (May) (Vertex) | 2m | 34 | $2.19 | 65.6 | 0.35 | 7.97 | N/A | ||
Gemini 1.5 Pro (May) (AI Studio) | 2m | 34 | $2.19 | 66.6 | 0.44 | 7.95 | N/A | ||
Qwen Turbo | 1m | 34 | $0.09 | 110.3 | 1.12 | 5.66 | N/A | ||
![]() | Llama 3.2 90B (Vision) | 128k | 33 | $0.72 | 60.6 | 0.49 | 8.74 | N/A | |
Llama 3.2 90B (Vision) Vertex | 128k | 33 | $0.00 | 32.4 | 0.20 | 15.66 | N/A | ||
Llama 3.2 90B (Vision) | 33k | 33 | $0.36 | 15.0 | 0.54 | 33.80 | N/A | ||
Llama 3.2 90B (Vision) Turbo | 128k | 33 | $1.20 | 29.6 | 0.26 | 17.14 | N/A | ||
Qwen2 72B | 33k | 33 | $0.90 | 41.3 | 0.49 | 12.60 | N/A | ||
Qwen2 72B | 131k | 33 | $0.00 | 31.1 | 1.38 | 17.48 | N/A | ||
![]() | ![]() Nova Lite | 300k | 33 | $0.10 | 282.5 | 0.31 | 2.08 | N/A | |
Gemini 1.5 Flash-8B AI Studio | 1m | 31 | $0.07 | 284.2 | 0.18 | 1.94 | N/A | ||
![]() Jamba 1.5 Large | 256k | 29 | $3.50 | 68.0 | 0.51 | 7.86 | N/A | ||
![]() | ![]() Jamba 1.5 Large | 256k | 29 | $3.50 | 51.0 | 0.67 | 10.48 | N/A | |
![]() Jamba 1.6 Large | 256k | 29 | $3.50 | 50.0 | 0.62 | 10.62 | N/A | ||
Gemini 1.5 Flash (May) (Vertex) | 1m | 28 | $0.13 | 313.1 | 0.26 | 1.86 | N/A | ||
Gemini 1.5 Flash (May) (AI Studio) | 1m | 28 | $0.13 | 306.6 | 0.26 | 1.89 | N/A | ||
![]() | ![]() Nova Micro | 130k | 28 | $0.06 | 300.5 | 0.29 | 1.95 | N/A | |
![]() Yi-Large | 32k | 28 | $3.00 | 67.7 | 0.39 | 7.77 | N/A | ||
![]() | Claude 3 Sonnet | 200k | 28 | $6.00 | 43.6 | 0.83 | 12.30 | N/A | |
Claude 3 Sonnet | 200k | 28 | $6.00 | 61.0 | 0.58 | 8.78 | N/A | ||
![]() | ![]() Codestral (Jan '25) | 256k | 28 | $0.45 | 122.0 | 0.30 | 4.40 | N/A | |
![]() Codestral (Jan '25) Vertex | 128k | 28 | $0.45 | 147.3 | 0.15 | 3.54 | N/A | ||
Llama 3 70B | 8k | 27 | $1.18 | 35.2 | 0.40 | 14.59 | N/A | ||
Llama 3 70B | 8k | 27 | $0.40 | 20.4 | 1.40 | 25.93 | N/A | ||
![]() | Llama 3 70B | 8k | 27 | $2.86 | 47.3 | 0.39 | 10.96 | N/A | |
![]() | Llama 3 70B | 8k | 27 | $2.90 | 18.7 | 0.74 | 27.46 | N/A | |
Llama 3 70B | 8k | 27 | $0.27 | 33.1 | 0.49 | 15.58 | N/A | ||
![]() | Llama 3 70B | 8k | 27 | $0.57 | 19.9 | 1.02 | 26.11 | N/A | |
Llama 3 70B | 8k | 27 | $0.64 | 334.5 | 0.24 | 1.74 | N/A | ||
Llama 3 70B (Reference, FP16) | 8k | 27 | $0.90 | 117.1 | 0.70 | 4.97 | N/A | ||
Llama 3 70B (Turbo, FP8) | 8k | 27 | $0.88 | 104.3 | 0.44 | 5.23 | N/A | ||
![]() | ![]() Mistral Small (Sep '24) | 33k | 27 | $0.30 | 84.5 | 0.33 | 6.25 | N/A | |
![]() | Phi-4 Multimodal | 128k | 27 | $0.00 | 22.8 | 0.35 | 22.32 | N/A | |
Qwen2.5 Coder 7B Fast | 131k | 27 | $0.04 | 210.6 | 0.49 | 2.86 | N/A | ||
Qwen2.5 Coder 7B Base | 131k | 27 | $0.01 | 199.8 | 0.50 | 3.00 | N/A | ||
![]() | ![]() Mistral Large (Feb '24) | 33k | 26 | $6.00 | 30.5 | 0.46 | 16.87 | N/A | |
![]() | ![]() Mistral Large (Feb '24) | 33k | 26 | $6.00 | 43.6 | 0.38 | 11.86 | N/A | |
![]() | ![]() Mixtral 8x22B | 65k | 26 | $3.00 | 52.8 | 0.33 | 9.81 | N/A | |
![]() Mixtral 8x22B Base | 65k | 26 | $0.60 | 75.8 | 0.54 | 7.14 | N/A | ||
![]() Mixtral 8x22B Fast | 65k | 26 | $1.05 | 99.5 | 0.54 | 5.57 | N/A | ||
![]() Mixtral 8x22B | 65k | 26 | $1.20 | 84.5 | 0.36 | 6.28 | N/A | ||
![]() Mixtral 8x22B | 65k | 26 | $1.20 | 90.4 | 0.37 | 5.90 | N/A | ||
![]() | Phi-4 Mini | 128k | 26 | $0.12 | 221.6 | 0.26 | 2.51 | N/A | |
![]() | Phi-4 Mini | 128k | 26 | $0.00 | 58.6 | 0.33 | 8.86 | N/A | |
![]() | Phi-3 Medium 14B | 128k | 25 | $0.30 | 52.5 | 0.41 | 9.93 | N/A | |
Gemma 3 4B | 128k | 24 | $0.03 | 149.4 | 0.25 | 3.59 | N/A | ||
![]() | Claude 2.1 | 200k | 24 | $12.00 | 29.4 | 1.66 | 18.69 | N/A | |
Claude 2.1 | 200k | 24 | $12.00 | 14.0 | 0.90 | 36.52 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.03 | 140.7 | 0.22 | 3.78 | N/A | ||
![]() | Llama 3.1 8B | 33k | 24 | $0.10 | 2,151.6 | 0.26 | 0.49 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.10 | 441.7 | 0.76 | 1.89 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.22 | 93.5 | 0.35 | 5.70 | N/A | |
Llama 3.1 8B Fast | 128k | 24 | $0.04 | 185.1 | 0.51 | 3.22 | N/A | ||
Llama 3.1 8B Base | 128k | 24 | $0.03 | 62.2 | 0.57 | 8.60 | N/A | ||
Llama 3.1 8B Vertex | 128k | 24 | $0.00 | 120.2 | 0.18 | 4.34 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.38 | 225.8 | 0.29 | 2.51 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.20 | 256.2 | 0.29 | 2.24 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.04 | 48.9 | 0.39 | 10.61 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.10 | 451.4 | 0.30 | 1.40 | N/A | ||
![]() | Llama 3.1 8B | 16k | 24 | $0.03 | 75.0 | 0.66 | 7.32 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.06 | 892.8 | 0.19 | 0.75 | N/A | ||
![]() | Llama 3.1 8B | 16k | 24 | $0.13 | 1,170.4 | 0.24 | 0.67 | N/A | |
Llama 3.1 8B Turbo | 128k | 24 | $0.18 | 162.9 | 0.28 | 3.35 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.15 | 469.4 | 0.19 | 1.26 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.18 | 61.5 | 0.45 | 8.58 | N/A | |
![]() | ![]() Pixtral 12B | 128k | 23 | $0.15 | 101.6 | 0.30 | 5.22 | N/A | |
![]() Pixtral 12B | 128k | 23 | $0.10 | 79.8 | 0.55 | 6.81 | N/A | ||
![]() | ![]() Mistral Small (Feb '24) | 33k | 23 | $1.50 | 83.8 | 0.28 | 6.24 | N/A | |
![]() | ![]() Mistral Small (Feb '24) | 33k | 23 | $1.50 | 87.7 | 0.38 | 6.08 | N/A | |
![]() | ![]() Mistral Medium | 33k | 23 | $4.09 | 56.8 | 0.38 | 9.19 | N/A | |
![]() | ![]() Ministral 8B | 128k | 22 | $0.10 | 134.4 | 0.28 | 4.00 | N/A | |
Gemma 2 9B Fast | 8k | 22 | $0.04 | 170.2 | 0.48 | 3.41 | N/A | ||
Gemma 2 9B Base | 8k | 22 | $0.03 | 163.1 | 0.49 | 3.55 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.04 | 21.8 | 0.65 | 23.61 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.20 | 703.1 | 0.22 | 0.93 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.30 | 132.3 | 0.21 | 3.99 | N/A | ||
![]() LFM 40B | 32k | 22 | $0.15 | 164.1 | 0.17 | 3.21 | N/A | ||
![]() | ![]() Command-R+ | 128k | 21 | $6.00 | 47.6 | 0.47 | 10.99 | N/A | |
![]() Command-R+ | 128k | 21 | $4.38 | 49.2 | 0.25 | 10.43 | N/A | ||
Llama 3 8B | 8k | 21 | $0.10 | 79.9 | 0.39 | 6.65 | N/A | ||
![]() | Llama 3 8B | 8k | 21 | $0.38 | 103.9 | 0.31 | 5.12 | N/A | |
![]() | Llama 3 8B | 8k | 21 | $0.38 | 73.6 | 0.35 | 7.14 | N/A | |
Llama 3 8B | 8k | 21 | $0.04 | 110.9 | 0.22 | 4.73 | N/A | ||
![]() | Llama 3 8B | 8k | 21 | $0.04 | 58.8 | 0.83 | 9.34 | N/A | |
Llama 3 8B | 8k | 21 | $0.06 | 1,340.7 | 0.31 | 0.68 | N/A | ||
Llama 3 8B | 8k | 21 | $0.20 | 190.5 | 0.49 | 3.11 | N/A | ||
![]() | ![]() Codestral (May '24) | 33k | 20 | $0.30 | 108.1 | 0.31 | 4.93 | N/A | |
![]() Aya Expanse 32B | 128k | 20 | $0.75 | 120.0 | 0.16 | 4.33 | N/A | ||
![]() | ![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 47.6 | 0.48 | 10.99 | N/A | |
![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 69.0 | 0.22 | 7.47 | N/A | ||
![]() | ![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 29.1 | 0.63 | 17.83 | N/A | |
![]() | ![]() Ministral 3B | 128k | 20 | $0.04 | 223.0 | 0.27 | 2.51 | N/A | |
![]() | ![]() Mistral NeMo | 128k | 20 | $0.15 | 144.0 | 0.29 | 3.76 | N/A | |
![]() Mistral NeMo Fast | 128k | 20 | $0.12 | 159.8 | 0.53 | 3.66 | N/A | ||
![]() Mistral NeMo Base | 128k | 20 | $0.06 | 42.8 | 0.62 | 12.31 | N/A | ||
![]() Mistral NeMo | 128k | 20 | $0.06 | 58.7 | 0.26 | 8.79 | N/A | ||
Llama 3.2 3B (FP8) | 128k | 20 | $0.02 | 222.9 | 0.21 | 2.45 | N/A | ||
Llama 3.2 3B | 128k | 20 | $0.10 | 104.1 | 0.95 | 5.76 | N/A | ||
![]() | Llama 3.2 3B | 128k | 20 | $0.15 | 71.6 | 0.46 | 7.44 | N/A | |
Llama 3.2 3B Base | 128k | 20 | $0.01 | 123.1 | 0.51 | 4.57 | N/A | ||
Llama 3.2 3B | 128k | 20 | $0.02 | 113.1 | 0.17 | 4.59 | N/A | ||
![]() | Llama 3.2 3B | 32k | 20 | $0.04 | 107.0 | 0.57 | 5.25 | N/A | |
![]() | Llama 3.2 3B | 8k | 20 | $0.10 | 1,590.6 | 0.23 | 0.54 | N/A | |
Llama 3.2 3B Turbo | 128k | 20 | $0.06 | 155.7 | 0.31 | 3.52 | N/A | ||
![]() DeepSeek R1 Distill Qwen 1.5B | 128k | 19 | $0.18 | 389.4 | 0.20 | 6.62 | 5.14 | ||
![]() Jamba 1.5 Mini | 256k | 18 | $0.25 | 175.4 | 0.33 | 3.18 | N/A | ||
![]() | ![]() Jamba 1.5 Mini | 256k | 18 | $0.25 | 82.8 | 0.45 | 6.50 | N/A | |
![]() Jamba 1.6 Mini | 256k | 18 | $0.25 | 186.3 | 0.33 | 3.01 | N/A | ||
![]() | ![]() Mixtral 8x7B | 33k | 17 | $0.70 | 93.2 | 0.29 | 5.66 | N/A | |
![]() | ![]() Mixtral 8x7B | 33k | 17 | $0.51 | 62.8 | 0.32 | 8.28 | N/A | |
![]() Mixtral 8x7B Fast | 33k | 17 | $0.23 | 52.7 | 0.61 | 10.10 | N/A | ||
![]() Mixtral 8x7B Base | 33k | 17 | $0.12 | 19.1 | 0.64 | 26.81 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.24 | 94.9 | 0.23 | 5.49 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.60 | 51.6 | 0.33 | 10.02 | N/A | ||
![]() Aya Expanse 8B | 8k | 16 | $0.75 | 165.0 | 0.12 | 3.15 | N/A | ||
![]() | ![]() Command-R | 128k | 15 | $0.75 | 108.3 | 0.33 | 4.95 | N/A | |
![]() Command-R | 128k | 15 | $0.26 | 74.3 | 0.20 | 6.92 | N/A | ||
![]() | ![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 106.2 | 0.33 | 5.04 | N/A | |
![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 160.7 | 0.15 | 3.26 | N/A | ||
![]() | ![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 46.3 | 0.50 | 11.29 | N/A | |
![]() | ![]() Codestral-Mamba | 256k | 14 | $0.25 | 94.5 | 0.43 | 5.72 | N/A | |
![]() | ![]() Mistral 7B | 8k | 10 | $0.25 | 105.1 | 0.28 | 5.04 | N/A | |
![]() Mistral 7B | 8k | 10 | $0.04 | 97.7 | 0.20 | 5.32 | N/A | ||
![]() | ![]() Mistral 7B | 32k | 10 | $0.04 | 122.7 | 0.80 | 4.88 | N/A | |
![]() Mistral 7B | 8k | 10 | $0.20 | 180.7 | 0.15 | 2.92 | N/A | ||
![]() | Llama 3.2 1B | 128k | 10 | $0.10 | 117.8 | 0.46 | 4.70 | N/A | |
Llama 3.2 1B Base | 128k | 10 | $0.01 | 266.9 | 0.46 | 2.34 | N/A | ||
Llama 3.2 1B | 128k | 10 | $0.01 | 77.1 | 0.30 | 6.79 | N/A | ||
![]() | Llama 3.2 1B | 16k | 10 | $0.05 | 2,576.9 | 0.21 | 0.41 | N/A | |
Llama 2 Chat 7B | 4k | 8 | $0.10 | 135.8 | 0.36 | 4.04 | N/A | ||
![]() | Llama 3.2 11B (Vision) | 128k | $0.16 | 143.5 | 0.47 | 3.95 | N/A | ||
Llama 3.2 11B (Vision) | 128k | $0.06 | 50.2 | 0.25 | 10.20 | N/A | |||
Llama 3.2 11B (Vision) Turbo | 128k | $0.18 | 90.4 | 0.50 | 6.03 | N/A | |||
![]() | ![]() Mistral Saba | 32k | $0.30 | 90.8 | 0.32 | 5.83 | N/A | ||
![]() Sonar Reasoning | 127k | $2.00 | 74.1 | 1.59 | 35.31 | 26.98 | |||
Grok 3 mini Reasoning (low) | 131k | $0.35 | 116.2 | 0.25 | 21.77 | 17.22 | |||
Grok 3 mini Reasoning (low) Fast | 131k | $1.45 | 227.8 | 0.24 | 11.21 | 8.78 | |||
![]() | ![]() Reka Flash | 128k | $0.35 | 34.6 | 0.91 | 15.36 | N/A | ||
![]() | ![]() Reka Core | 128k | $2.00 | 27.7 | 0.92 | 18.96 | N/A | ||
![]() | ![]() Reka Flash (Feb '24) | 128k | $0.35 | 46.2 | 0.91 | 11.73 | N/A | ||
![]() | ![]() Reka Edge | 128k | $0.10 | 85.6 | 0.89 | 6.73 | N/A | ||
o1-preview | 128k | $26.25 | 160.8 | 20.66 | 23.77 | N/A | |||
![]() | o1-preview | 128k | $28.88 | 147.9 | 25.03 | 28.41 | N/A | ||
GPT-4o (Aug '24) | 128k | $4.38 | 103.6 | 0.39 | 5.22 | N/A | |||
![]() | GPT-4o (Aug '24) | 128k | $4.38 | 116.3 | 0.72 | 5.02 | N/A | ||
GPT-4 Turbo | 128k | $15.00 | 33.4 | 0.76 | 15.72 | N/A | |||
![]() | GPT-4 Turbo | 128k | $15.00 | 42.5 | 1.84 | 13.60 | N/A | ||
GPT-3.5 Turbo | 4k | $0.75 | 117.7 | 0.34 | 4.59 | N/A | |||
GPT-4 | 8k | $37.50 | 24.6 | 0.81 | 21.11 | N/A | |||
GPT-4.5 (Preview) | 128k | $93.75 | 63.1 | 1.05 | 8.97 | N/A | |||
Gemini 2.0 Flash-Lite (Preview) (AI Studio) | 1m | $0.13 | 199.6 | 0.30 | 2.80 | N/A | |||
Gemma 2 27B Fast | 8k | $0.26 | 86.0 | 0.55 | 6.37 | N/A | |||
Gemma 2 27B Base | 8k | $0.15 | 53.7 | 0.57 | 9.88 | N/A | |||
Gemma 2 27B | 8k | $0.80 | 88.5 | 0.24 | 5.90 | N/A | |||
![]() | Claude 3.5 Sonnet (June) | 200k | $6.00 | 42.1 | 1.01 | 12.89 | N/A | ||
Claude 3.5 Sonnet (June) Vertex | 200k | $6.00 | 80.1 | 0.79 | 7.03 | N/A | |||
Claude 3.5 Sonnet (June) | 200k | $6.00 | 79.9 | 0.76 | 7.01 | N/A | |||
![]() | Claude 3 Haiku | 200k | $0.50 | 98.0 | 1.02 | 6.12 | N/A | ||
Claude 3 Haiku | 200k | $0.50 | 141.5 | 0.52 | 4.05 | N/A | |||
![]() | Claude Instant | 100k | $1.20 | 54.3 | 0.56 | 9.77 | N/A | ||
Claude 2.0 | 100k | $12.00 | 30.6 | 0.86 | 17.18 | N/A | |||
![]() DeepSeek Coder V2 Lite Fast, FP8 | 128k | $0.12 | 108.9 | 0.63 | 5.23 | N/A | |||
![]() DeepSeek Coder V2 Lite Base, FP8 | 128k | $0.06 | 104.3 | 0.62 | 5.42 | N/A | |||
![]() OpenChat 3.5 | 8k | $0.06 | 45.3 | 0.41 | 11.46 | N/A | |||
![]() | ![]() Solar Mini | 4k | $0.15 | 81.5 | 1.05 | 7.19 | N/A | ||
![]() Jamba Instruct | 256k | $0.55 | 180.8 | 0.34 | 3.10 | N/A | |||
Qwen1.5 Chat 110B | 32k | $0.00 | 23.7 | 1.59 | 22.68 | N/A |
Key definitions
Context window: Maximum number of combined input & output tokens. Output tokens commonly have a significantly lower limit (varied by model).
Output Speed: Tokens per second received while the model is generating tokens (ie. after first chunk has been received from the API for models which support streaming).
Latency (Time to First Token): Time to first token received, in seconds, after API request sent. For reasoning models which share reasoning tokens, this will be the first reasoning token. For models which do not support streaming, this represents time to receive the completion.
Price: Price per token, represented as USD per million Tokens. Price is a blend of Input & Output token prices (3:1 ratio).
Output Price: Price per token generated by the model (received from the API), represented as USD per million Tokens.
Input Price: Price per token included in the request/message sent to the API, represented as USD per million Tokens.
Time period: Metrics are 'live' and are based on the past 72 hours of measurements, measurements are taken 8 times a day for single requests and 2 times per day for parallel requests.