Artificial Analysis LLM Performance Leaderboard
Independent performance benchmarks & pricing across API providers of LLMs. Definitions are below the table.
For further analysis and methodology, see artificialanalysis.ai.
For further analysis and methodology, see artificialanalysis.ai.
Features | Model Intelligence | Price | Output tokens/s | Latency | End-to-End Response Time | ||||
---|---|---|---|---|---|---|---|---|---|
Further Analysis | |||||||||
o4-mini (high) | 200k | 70 | $1.93 | 134.8 | 35.31 | 39.02 | N/A | ||
![]() | o4-mini (high) | 200k | 70 | $1.93 | 41.8 | 170.26 | 182.21 | N/A | |
Gemini 2.5 Pro Preview | 1m | 68 | $3.44 | 210.3 | 27.49 | 29.86 | N/A | ||
Grok 3 mini Reasoning (high) | 131k | 67 | $0.35 | 100.3 | 0.36 | 25.28 | 19.94 | ||
o3-mini (high) | 200k | 66 | $1.93 | 186.4 | 37.32 | 40.00 | N/A | ||
![]() | o3-mini (high) | 200k | 66 | $1.93 | 189.2 | 41.89 | 44.53 | N/A | |
o3-mini | 200k | 63 | $1.93 | 186.2 | 11.81 | 14.49 | N/A | ||
![]() | o3-mini | 200k | 63 | $1.93 | 197.8 | 14.09 | 16.61 | N/A | |
o1 | 200k | 62 | $26.25 | 67.5 | 43.30 | 50.70 | N/A | ||
![]() | o1 | 200k | 62 | $26.25 | 111.1 | 25.28 | 29.78 | N/A | |
![]() DeepSeek R1 | 164k | 60 | $0.95 | 35.7 | 0.55 | 80.35 | 65.79 | ||
![]() | ![]() DeepSeek R1 | 64k | 60 | $0.96 | 23.0 | 3.26 | 126.78 | 101.83 | |
![]() DeepSeek R1 | 128k | 60 | $2.00 | 30.3 | 1.48 | 95.40 | 77.42 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $2.36 | 62.3 | 0.45 | 46.12 | 37.65 | |
![]() DeepSeek R1 Base | 128k | 60 | $1.20 | 31.1 | 0.67 | 92.29 | 75.53 | ||
![]() DeepSeek R1 Fast | 128k | 60 | $3.00 | 82.5 | 0.69 | 35.19 | 28.44 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $3.99 | 71.1 | 0.57 | 40.60 | 33.01 | |
![]() | ![]() DeepSeek R1 | 128k | 60 | $2.36 | 95.3 | 0.54 | 30.42 | 24.64 | |
![]() DeepSeek R1 (Fast) | 164k | 60 | $4.25 | 111.9 | 0.81 | 26.26 | 20.98 | ||
![]() DeepSeek R1 (Turbo, FP4) | 33k | 60 | $1.50 | 133.3 | 0.30 | 21.66 | 17.61 | ||
![]() DeepSeek R1 | 64k | 60 | $0.96 | 14.1 | 0.57 | 202.19 | 166.21 | ||
![]() DeepSeek R1 | 128k | 60 | $4.00 | 73.3 | 0.56 | 39.42 | 32.04 | ||
![]() | ![]() DeepSeek R1 Turbo | 64k | 60 | $1.15 | 28.9 | 0.88 | 99.55 | 81.34 | |
![]() | ![]() DeepSeek R1 | 64k | 60 | $4.00 | 29.3 | 0.79 | 97.88 | 80.04 | |
![]() | ![]() DeepSeek R1 | 16k | 60 | $5.50 | 191.3 | 0.97 | 15.86 | 12.27 | |
![]() DeepSeek R1 | 128k | 60 | $4.00 | 89.0 | 0.69 | 32.68 | 26.37 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $7.00 | 29.0 | 0.58 | 98.90 | 81.05 | |
QwQ-32B | 131k | 58 | $0.20 | 26.0 | 1.58 | 116.46 | 95.68 | ||
QwQ-32B Fast | 131k | 58 | $0.75 | 52.1 | 0.71 | 58.16 | 47.85 | ||
QwQ-32B Base | 131k | 58 | $0.23 | 40.9 | 1.00 | 74.19 | 60.96 | ||
![]() | QwQ-32B | 131k | 58 | $0.65 | 83.8 | 0.49 | 36.20 | 29.74 | |
QwQ-32B | 131k | 58 | $0.90 | 141.2 | 0.33 | 21.51 | 17.64 | ||
QwQ-32B | 131k | 58 | $0.14 | 35.7 | 0.28 | 84.09 | 69.80 | ||
QwQ-32B | 131k | 58 | $0.32 | 398.1 | 0.29 | 7.80 | 6.26 | ||
![]() | QwQ-32B | 16k | 58 | $0.63 | 425.0 | 0.92 | 7.96 | 5.86 | |
QwQ-32B | 131k | 58 | $1.20 | 87.5 | 0.44 | 34.63 | 28.47 | ||
o1-mini | 128k | 54 | $1.93 | 214.4 | 9.95 | 12.28 | N/A | ||
![]() | o1-mini | 128k | 54 | $2.12 | 256.5 | 8.92 | 10.87 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 64k | 53 | $0.48 | 25.5 | 3.45 | 23.09 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.45 | 71.9 | 1.11 | 8.06 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.25 | 31.8 | 1.31 | 17.04 | N/A | ||
![]() DeepSeek V3 (Mar' 25) Fast | 128k | 53 | $3.00 | 92.3 | 0.67 | 6.08 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $0.75 | 34.2 | 0.63 | 15.27 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $0.80 | 77.1 | 0.57 | 7.06 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $2.00 | 66.5 | 0.69 | 8.21 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 160k | 53 | $0.90 | 73.7 | 0.84 | 7.62 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $0.52 | 11.3 | 0.75 | 44.97 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $0.63 | 31.1 | 0.88 | 16.93 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 8k | 53 | $1.13 | 264.9 | 0.62 | 2.50 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.25 | 34.2 | 2.69 | 17.31 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $1.25 | 18.8 | 0.74 | 27.27 | N/A | |
GPT-4.1 mini | 1m | 53 | $0.70 | 103.1 | 0.38 | 5.23 | N/A | ||
![]() | GPT-4.1 mini | 1m | 53 | $0.70 | 158.9 | 0.63 | 3.78 | N/A | |
GPT-4.1 | 1m | 53 | $3.50 | 91.2 | 0.46 | 5.94 | N/A | ||
![]() | GPT-4.1 | 1m | 53 | $3.50 | 113.6 | 0.82 | 5.22 | N/A | |
![]() DeepSeek R1 Distill Qwen 32B | 128k | 52 | $0.14 | 44.9 | 0.27 | 55.90 | 44.50 | ||
![]() | ![]() DeepSeek R1 Distill Qwen 32B | 64k | 52 | $0.30 | 20.3 | 1.10 | 124.10 | 98.41 | |
Grok 3 | 131k | 51 | $6.00 | 54.7 | 0.55 | 9.69 | N/A | ||
Llama 4 Maverick (FP8) | 1m | 51 | $0.30 | 130.1 | 0.42 | 4.26 | N/A | ||
Llama 4 Maverick Vertex | 524k | 51 | $0.00 | 127.3 | 0.38 | 4.30 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.20 | 133.1 | 0.46 | 4.22 | N/A | |
![]() | Llama 4 Maverick (FP8) | 128k | 51 | $0.61 | 62.1 | 0.34 | 8.38 | N/A | |
Llama 4 Maverick | 131k | 51 | $0.39 | 139.0 | 0.52 | 4.12 | N/A | ||
Llama 4 Maverick (FP8) | 131k | 51 | $0.30 | 106.3 | 0.56 | 5.26 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.36 | 65.3 | 0.65 | 8.31 | N/A | |
Llama 4 Maverick | 128k | 51 | $0.30 | 287.4 | 0.25 | 1.99 | N/A | ||
![]() | Llama 4 Maverick | 8k | 51 | $0.92 | 804.0 | 0.92 | 1.54 | N/A | |
Llama 4 Maverick (FP8) | 524k | 51 | $0.41 | 111.3 | 0.24 | 4.74 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.35 | 112.6 | 0.46 | 4.90 | N/A | |
GPT-4o (March 2025) | 128k | 50 | $7.50 | 194.3 | 0.51 | 3.08 | N/A | ||
Gemini 2.0 Pro Experimental (AI Studio) | 2m | 49 | $0.00 | 207.3 | 28.11 | 30.52 | N/A | ||
![]() | ![]() DeepSeek R1 Distill Qwen 14B | 64k | 49 | $0.15 | 44.8 | 0.72 | 56.54 | 44.66 | |
![]() DeepSeek R1 Distill Qwen 14B | 128k | 49 | $1.60 | 157.7 | 0.36 | 16.22 | 12.68 | ||
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.30 | 63.6 | 0.48 | 39.77 | 31.43 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 66k | 48 | $0.94 | 2,407.8 | 0.27 | 1.31 | 0.83 | |
![]() DeepSeek R1 Distill Llama 70B Base | 128k | 48 | $0.38 | 57.7 | 0.58 | 43.91 | 34.67 | ||
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.34 | 30.3 | 0.48 | 83.05 | 66.06 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 32k | 48 | $0.39 | 50.5 | 0.90 | 50.44 | 39.63 | |
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.81 | 388.8 | 0.19 | 6.62 | 5.14 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 16k | 48 | $0.88 | 303.6 | 1.56 | 9.79 | 6.59 | |
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $2.00 | 124.8 | 0.38 | 20.41 | 16.03 | ||
![]() | Claude 3.7 Sonnet | 200k | 48 | $6.00 | 39.6 | 1.05 | 13.67 | N/A | |
Claude 3.7 Sonnet | 200k | 48 | $6.00 | 75.6 | 1.15 | 7.77 | N/A | ||
Gemini 2.0 Flash Vertex | 1m | 48 | $0.26 | 241.5 | 0.28 | 2.35 | N/A | ||
Gemini 2.0 Flash (AI Studio) | 1m | 48 | $0.17 | 247.9 | 0.33 | 2.34 | N/A | ||
![]() | ![]() Reka Flash 3 | 128k | 47 | $0.35 | 56.3 | 0.98 | 45.35 | 35.50 | |
Gemini 2.0 Flash (exp) (AI Studio) | 1m | 46 | $0.00 | 243.0 | 0.29 | 2.34 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) | 66k | 46 | $0.48 | 25.6 | 3.22 | 22.77 | N/A | |
![]() DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $0.25 | 28.9 | 1.39 | 18.69 | N/A | ||
![]() DeepSeek V3 (Dec '24) | 128k | 46 | $0.75 | 24.7 | 0.68 | 20.92 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) | 128k | 46 | $2.00 | 72.9 | 0.61 | 7.47 | N/A | |
![]() DeepSeek V3 (Dec '24) | 128k | 46 | $1.31 | 54.2 | 0.68 | 9.90 | N/A | ||
![]() DeepSeek V3 (Dec '24) | 64k | 46 | $0.59 | 20.5 | 0.49 | 24.91 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) Turbo | 64k | 46 | $0.63 | 29.1 | 0.84 | 18.03 | N/A | |
![]() | ![]() DeepSeek V3 (Dec '24) | 64k | 46 | $0.89 | 29.2 | 0.85 | 17.99 | N/A | |
![]() DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $1.25 | 33.8 | 2.97 | 17.76 | N/A | ||
Qwen2.5 Max | 32k | 45 | $2.80 | 52.3 | 1.25 | 10.81 | N/A | ||
Gemini 1.5 Pro (Sep) (Vertex) | 2m | 45 | $2.19 | 93.7 | 0.60 | 5.93 | N/A | ||
Gemini 1.5 Pro (Sep) (AI Studio) | 2m | 45 | $2.19 | 94.3 | 0.55 | 5.85 | N/A | ||
![]() | Claude 3.5 Sonnet (Oct) | 200k | 44 | $6.00 | 47.4 | 0.79 | 11.33 | N/A | |
Claude 3.5 Sonnet (Oct) Vertex | 200k | 44 | $6.00 | 79.9 | 1.00 | 7.26 | N/A | ||
Claude 3.5 Sonnet (Oct) | 200k | 44 | $6.00 | 79.9 | 0.98 | 7.23 | N/A | ||
![]() Sonar | 127k | 43 | $1.00 | 76.2 | 2.41 | 8.97 | N/A | ||
Llama 4 Scout | 1m | 43 | $0.15 | 121.1 | 0.44 | 4.57 | N/A | ||
![]() | Llama 4 Scout | 32k | 43 | $0.70 | 2,584.7 | 0.31 | 0.50 | N/A | |
Llama 4 Scout Vertex | 1m | 43 | $0.00 | 132.4 | 0.38 | 4.15 | N/A | ||
![]() | Llama 4 Scout | 1m | 43 | $0.10 | 116.8 | 0.48 | 4.76 | N/A | |
![]() | Llama 4 Scout | 128k | 43 | $0.34 | 39.0 | 0.35 | 13.18 | N/A | |
Llama 4 Scout | 128k | 43 | $0.26 | 140.8 | 0.55 | 4.10 | N/A | ||
Llama 4 Scout | 131k | 43 | $0.15 | 110.7 | 0.26 | 4.77 | N/A | ||
![]() | Llama 4 Scout | 131k | 43 | $0.20 | 77.1 | 0.68 | 7.16 | N/A | |
Llama 4 Scout | 131k | 43 | $0.17 | 597.9 | 0.36 | 1.19 | N/A | ||
![]() | Llama 4 Scout | 8k | 43 | $0.47 | 739.1 | 0.73 | 1.41 | N/A | |
Llama 4 Scout | 328k | 43 | $0.28 | 114.7 | 0.21 | 4.56 | N/A | ||
![]() | Llama 4 Scout | 128k | 43 | $0.71 | 81.1 | 0.50 | 6.67 | N/A | |
![]() Sonar Pro | 200k | 43 | $6.00 | 61.0 | 2.97 | 11.17 | N/A | ||
QwQ 32B-Preview | 33k | 43 | $0.20 | 54.8 | 1.02 | 46.62 | 36.48 | ||
QwQ 32B-Preview | 33k | 43 | $0.26 | 45.9 | 0.26 | 54.71 | 43.57 | ||
QwQ 32B-Preview | 33k | 43 | $1.20 | 86.5 | 0.44 | 29.33 | 23.11 | ||
GPT-4o (Nov '24) | 128k | 41 | $4.38 | 146.6 | 0.58 | 3.99 | N/A | ||
![]() | GPT-4o (Nov '24) | 128k | 41 | $4.38 | 129.8 | 1.08 | 4.93 | N/A | |
Gemini 2.0 Flash-Lite (Feb '25) (AI Studio) | 1m | 41 | $0.13 | 196.8 | 0.28 | 2.82 | N/A | ||
Llama 3.3 70B (FP8) | 128k | 41 | $0.17 | 39.2 | 0.54 | 13.29 | N/A | ||
![]() | Llama 3.3 70B | 33k | 41 | $0.94 | 2,350.8 | 0.29 | 0.51 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.40 | 88.2 | 1.02 | 6.69 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.71 | 138.8 | 0.59 | 4.20 | N/A | |
Llama 3.3 70B Fast | 128k | 41 | $0.38 | 133.5 | 0.55 | 4.29 | N/A | ||
Llama 3.3 70B Base | 128k | 41 | $0.20 | 33.4 | 0.69 | 15.65 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.50 | 142.9 | 0.50 | 4.00 | N/A | |
![]() | Llama 3.3 70B | 128k | 41 | $0.71 | 48.7 | 0.45 | 10.71 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.90 | 178.4 | 0.45 | 3.25 | N/A | ||
Llama 3.3 70B (Turbo, FP8) | 128k | 41 | $0.20 | 34.2 | 0.27 | 14.91 | N/A | ||
Llama 3.3 70B | 128k | 41 | $0.27 | 29.1 | 0.47 | 17.62 | N/A | ||
Llama 3.3 70B | 128k | 41 | $0.60 | 185.7 | 0.41 | 3.11 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.39 | 91.6 | 0.62 | 6.08 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.64 | 302.0 | 0.39 | 2.04 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.75 | 460.3 | 0.30 | 1.38 | N/A | |
Llama 3.3 70B Turbo | 128k | 41 | $0.88 | 124.1 | 0.43 | 4.46 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.70 | 45.8 | 0.55 | 11.47 | N/A | |
GPT-4.1 nano | 1m | 41 | $0.17 | 253.1 | 0.40 | 2.38 | N/A | ||
![]() | GPT-4.1 nano | 1m | 41 | $0.17 | 333.0 | 0.50 | 2.00 | N/A | |
GPT-4o (May '24) | 128k | 41 | $7.50 | 101.6 | 0.56 | 5.48 | N/A | ||
![]() | GPT-4o (May '24) | 128k | 41 | $7.50 | 103.3 | 0.91 | 5.75 | N/A | |
Llama 3.1 405B (FP8) | 128k | 40 | $0.80 | 34.8 | 0.62 | 14.98 | N/A | ||
Llama 3.1 405B | 128k | 40 | $9.50 | 19.2 | 1.00 | 27.09 | N/A | ||
Llama 3.1 405B | 128k | 40 | $4.00 | 42.4 | 1.04 | 12.84 | N/A | ||
![]() | Llama 3.1 405B Standard | 128k | 40 | $2.40 | 30.8 | 1.83 | 18.05 | N/A | |
![]() | Llama 3.1 405B Latency Optimized | 128k | 40 | $3.00 | 31.0 | 2.13 | 18.28 | N/A | |
Llama 3.1 405B Base | 128k | 40 | $1.50 | 31.2 | 0.70 | 16.74 | N/A | ||
Llama 3.1 405B Vertex | 128k | 40 | $7.75 | 30.0 | 0.40 | 17.08 | N/A | ||
![]() | Llama 3.1 405B | 128k | 40 | $8.00 | 31.4 | 0.50 | 16.44 | N/A | |
Llama 3.1 405B | 128k | 40 | $3.00 | 89.4 | 0.60 | 6.20 | N/A | ||
Llama 3.1 405B | 33k | 40 | $0.90 | 24.9 | 0.48 | 20.56 | N/A | ||
![]() | Llama 3.1 405B | 16k | 40 | $6.25 | 177.9 | 1.24 | 4.06 | N/A | |
Llama 3.1 405B | 128k | 40 | $7.50 | 36.4 | 0.80 | 14.52 | N/A | ||
Llama 3.1 405B Turbo | 128k | 40 | $3.50 | 92.0 | 0.45 | 5.88 | N/A | ||
![]() | Llama 3.1 405B | 128k | 40 | $3.50 | 17.2 | 1.06 | 30.14 | N/A | |
Qwen2.5 72B | 131k | 40 | $0.40 | 30.9 | 1.58 | 17.78 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.20 | 18.3 | 0.79 | 28.13 | N/A | ||
Qwen2.5 72B Fast | 131k | 40 | $0.38 | 67.0 | 0.57 | 8.03 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.90 | 44.3 | 0.45 | 11.75 | N/A | ||
Qwen2.5 72B | 33k | 40 | $0.27 | 40.1 | 0.29 | 12.74 | N/A | ||
Qwen2.5 72B Turbo | 131k | 40 | $1.20 | 85.4 | 0.63 | 6.49 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.00 | 62.7 | 1.10 | 9.08 | N/A | ||
![]() | ![]() MiniMax-Text-01 | 1m | 40 | $0.42 | 34.2 | 0.90 | 15.54 | N/A | |
Phi-4 | 16k | 40 | $0.15 | 123.6 | 0.51 | 4.55 | N/A | ||
![]() | Phi-4 | 16k | 40 | $0.22 | 42.5 | 0.48 | 12.26 | N/A | |
Phi-4 | 16k | 40 | $0.09 | 41.0 | 0.25 | 12.46 | N/A | ||
![]() Command A | 256k | 40 | $4.38 | 55.0 | 0.26 | 9.34 | N/A | ||
Gemini 1.5 Flash (Sep) (Vertex) | 1m | 39 | $0.13 | 193.5 | 0.22 | 2.80 | N/A | ||
Gemini 1.5 Flash (Sep) (AI Studio) | 1m | 39 | $0.13 | 169.3 | 0.33 | 3.29 | N/A | ||
![]() | ![]() Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 39.0 | 0.45 | 13.27 | N/A | |
![]() | ![]() Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 36.3 | 0.52 | 14.30 | N/A | |
Gemma 3 27B | 128k | 38 | $0.07 | 39.1 | 0.41 | 13.19 | N/A | ||
Grok Beta | 128k | 38 | $7.50 | 63.8 | 0.30 | 8.13 | N/A | ||
![]() | ![]() Pixtral Large | 128k | 37 | $3.00 | 31.9 | 0.49 | 16.18 | N/A | |
Qwen2.5 Instruct 32B Fast | 128k | 37 | $0.20 | 79.0 | 0.54 | 6.86 | N/A | ||
Qwen2.5 Instruct 32B Base | 128k | 37 | $0.10 | 59.4 | 0.59 | 9.00 | N/A | ||
Llama 3.1 Nemotron 70B (FP8) | 128k | 37 | $0.17 | 43.2 | 0.52 | 12.09 | N/A | ||
Llama 3.1 Nemotron 70B Base | 128k | 37 | $0.20 | 40.8 | 0.64 | 12.91 | N/A | ||
Llama 3.1 Nemotron 70B Fast | 128k | 37 | $0.38 | 71.8 | 0.57 | 7.54 | N/A | ||
Llama 3.1 Nemotron 70B | 128k | 37 | $0.27 | 35.8 | 0.32 | 14.28 | N/A | ||
![]() | ![]() Nova Pro | 300k | 37 | $1.40 | 109.7 | 0.36 | 4.91 | N/A | |
![]() | ![]() Nova Pro Latency Optimized | 300k | 37 | $1.75 | 109.5 | 0.41 | 4.98 | N/A | |
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 35.6 | 0.54 | 14.60 | N/A | |
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 32.8 | 0.46 | 15.69 | N/A | |
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 35.9 | 0.54 | 14.46 | N/A | |
Qwen2.5 Coder 32B | 33k | 36 | $0.09 | 64.3 | 0.49 | 8.26 | N/A | ||
Qwen2.5 Coder 32B | 131k | 36 | $0.20 | 52.5 | 1.10 | 10.61 | N/A | ||
Qwen2.5 Coder 32B | 33k | 36 | $0.90 | 66.9 | 0.34 | 7.82 | N/A | ||
Qwen2.5 Coder 32B | 33k | 36 | $0.10 | 48.8 | 0.25 | 10.49 | N/A | ||
Qwen2.5 Coder 32B | 131k | 36 | $0.80 | 83.7 | 0.41 | 6.38 | N/A | ||
GPT-4o mini | 128k | 36 | $0.26 | 77.4 | 0.36 | 6.82 | N/A | ||
![]() | GPT-4o mini | 128k | 36 | $0.26 | 159.2 | 0.97 | 4.11 | N/A | |
Llama 3.1 70B (FP8) | 128k | 35 | $0.17 | 51.3 | 0.53 | 10.28 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.40 | 44.7 | 1.69 | 12.89 | N/A | ||
![]() | Llama 3.1 70B Standard | 128k | 35 | $0.72 | 31.6 | 0.64 | 16.48 | N/A | |
![]() | Llama 3.1 70B Latency Optimized | 128k | 35 | $0.90 | 31.6 | 0.81 | 16.65 | N/A | |
Llama 3.1 70B Base | 128k | 35 | $0.20 | 38.3 | 0.65 | 13.70 | N/A | ||
Llama 3.1 70B Fast | 128k | 35 | $0.38 | 148.0 | 0.55 | 3.93 | N/A | ||
Llama 3.1 70B Vertex | 128k | 35 | $0.00 | 72.6 | 0.25 | 7.14 | N/A | ||
![]() | Llama 3.1 70B | 128k | 35 | $2.90 | 53.1 | 0.46 | 9.88 | N/A | |
Llama 3.1 70B | 128k | 35 | $0.90 | 159.4 | 0.42 | 3.56 | N/A | ||
Llama 3.1 70B (Turbo, FP8) | 128k | 35 | $0.20 | 34.7 | 0.28 | 14.68 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.27 | 38.5 | 0.39 | 13.38 | N/A | ||
![]() | Llama 3.1 70B | 32k | 35 | $0.35 | 15.0 | 1.38 | 34.64 | N/A | |
Llama 3.1 70B Turbo | 128k | 35 | $0.88 | 113.4 | 0.41 | 4.82 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.90 | 126.6 | 0.50 | 4.45 | N/A | ||
![]() | ![]() Mistral Small 3.1 | 128k | 35 | $0.15 | 159.4 | 0.33 | 3.47 | N/A | |
![]() Mistral Small 3.1 Vertex | 128k | 35 | $0.15 | 208.3 | 0.17 | 2.57 | N/A | ||
![]() | ![]() Mistral Small 3 | 32k | 35 | $0.15 | 131.0 | 0.39 | 4.21 | N/A | |
![]() Mistral Small 3 | 32k | 35 | $0.09 | 63.2 | 0.27 | 8.18 | N/A | ||
![]() Mistral Small 3 | 32k | 35 | $0.80 | 96.6 | 0.25 | 5.43 | N/A | ||
![]() | Claude 3 Opus | 200k | 35 | $30.00 | 24.4 | 1.27 | 21.79 | N/A | |
Claude 3 Opus Vertex | 200k | 35 | $30.00 | 27.3 | 1.12 | 19.44 | N/A | ||
Claude 3 Opus | 200k | 35 | $30.00 | 28.1 | 1.13 | 18.94 | N/A | ||
![]() | Claude 3.5 Haiku Standard | 200k | 35 | $1.60 | 55.6 | 1.37 | 10.36 | N/A | |
![]() | Claude 3.5 Haiku Latency Optimized | 200k | 35 | $2.00 | 49.8 | 1.31 | 11.35 | N/A | |
Claude 3.5 Haiku Vertex | 200k | 35 | $1.60 | 65.9 | 0.65 | 8.23 | N/A | ||
Claude 3.5 Haiku | 200k | 35 | $1.60 | 65.9 | 6.29 | 13.88 | N/A | ||
![]() | ![]() DeepSeek R1 Distill Llama 8B | 32k | 34 | $0.04 | 54.0 | 0.71 | 47.05 | 37.07 | |
Gemma 3 12B | 128k | 34 | $0.06 | 43.6 | 0.46 | 11.93 | N/A | ||
Gemini 1.5 Pro (May) (Vertex) | 2m | 34 | $2.19 | 66.3 | 0.39 | 7.93 | N/A | ||
Gemini 1.5 Pro (May) (AI Studio) | 2m | 34 | $2.19 | 65.2 | 0.44 | 8.11 | N/A | ||
Qwen Turbo | 1m | 34 | $0.09 | 109.4 | 1.00 | 5.57 | N/A | ||
![]() | Llama 3.2 90B (Vision) | 128k | 33 | $0.72 | 60.3 | 0.51 | 8.80 | N/A | |
Llama 3.2 90B (Vision) Vertex | 128k | 33 | $0.00 | 30.4 | 0.18 | 16.63 | N/A | ||
Llama 3.2 90B (Vision) | 128k | 33 | $0.90 | 42.3 | 0.40 | 12.21 | N/A | ||
Llama 3.2 90B (Vision) | 33k | 33 | $0.36 | 37.8 | 0.42 | 13.66 | N/A | ||
Llama 3.2 90B (Vision) Turbo | 128k | 33 | $1.20 | 31.0 | 0.26 | 16.39 | N/A | ||
Qwen2 72B | 33k | 33 | $0.90 | 42.0 | 0.56 | 12.48 | N/A | ||
Qwen2 72B | 131k | 33 | $0.00 | 31.0 | 1.29 | 17.41 | N/A | ||
![]() | ![]() Nova Lite | 300k | 33 | $0.10 | 279.1 | 0.35 | 2.14 | N/A | |
Gemini 1.5 Flash-8B AI Studio | 1m | 31 | $0.07 | 284.9 | 0.21 | 1.97 | N/A | ||
![]() Jamba 1.5 Large | 256k | 29 | $3.50 | 66.3 | 0.56 | 8.10 | N/A | ||
![]() | ![]() Jamba 1.5 Large | 256k | 29 | $3.50 | 51.2 | 0.69 | 10.44 | N/A | |
![]() Jamba 1.6 Large | 256k | 29 | $3.50 | 63.5 | 0.53 | 8.41 | N/A | ||
Gemini 1.5 Flash (May) (Vertex) | 1m | 28 | $0.13 | 309.4 | 0.29 | 1.91 | N/A | ||
Gemini 1.5 Flash (May) (AI Studio) | 1m | 28 | $0.13 | 307.0 | 0.25 | 1.87 | N/A | ||
![]() | ![]() Nova Micro | 130k | 28 | $0.06 | 330.0 | 0.32 | 1.84 | N/A | |
![]() Yi-Large | 32k | 28 | $3.00 | 69.1 | 0.45 | 7.68 | N/A | ||
![]() | Claude 3 Sonnet | 200k | 28 | $6.00 | 59.1 | 0.73 | 9.19 | N/A | |
Claude 3 Sonnet | 200k | 28 | $6.00 | 61.6 | 0.55 | 8.68 | N/A | ||
![]() | ![]() Codestral (Jan '25) | 256k | 28 | $0.45 | 183.1 | 0.33 | 3.06 | N/A | |
![]() Codestral (Jan '25) Vertex | 128k | 28 | $0.45 | 149.5 | 0.15 | 3.49 | N/A | ||
Llama 3 70B | 8k | 27 | $1.18 | 45.4 | 0.42 | 11.43 | N/A | ||
Llama 3 70B | 8k | 27 | $0.40 | 32.7 | 1.03 | 16.32 | N/A | ||
![]() | Llama 3 70B | 8k | 27 | $2.86 | 50.9 | 0.42 | 10.25 | N/A | |
![]() | Llama 3 70B | 8k | 27 | $2.90 | 19.0 | 0.78 | 27.16 | N/A | |
Llama 3 70B | 8k | 27 | $0.90 | 154.3 | 0.41 | 3.65 | N/A | ||
Llama 3 70B | 8k | 27 | $0.27 | 35.6 | 0.55 | 14.60 | N/A | ||
![]() | Llama 3 70B | 8k | 27 | $0.57 | 33.0 | 0.63 | 15.80 | N/A | |
Llama 3 70B | 8k | 27 | $0.64 | 334.3 | 0.27 | 1.77 | N/A | ||
Llama 3 70B (Reference, FP16) | 8k | 27 | $0.90 | 130.1 | 0.67 | 4.52 | N/A | ||
Llama 3 70B (Turbo, FP8) | 8k | 27 | $0.88 | 20.8 | 0.38 | 24.44 | N/A | ||
![]() | ![]() Mistral Small (Sep '24) | 33k | 27 | $0.30 | 58.1 | 0.43 | 9.04 | N/A | |
![]() | Phi-4 Multimodal | 128k | 27 | $0.00 | 21.6 | 0.37 | 23.56 | N/A | |
Qwen2.5 Coder 7B Fast | 131k | 27 | $0.04 | 219.1 | 0.49 | 2.77 | N/A | ||
Qwen2.5 Coder 7B Base | 131k | 27 | $0.01 | 200.9 | 0.51 | 3.00 | N/A | ||
![]() | ![]() Mistral Large (Feb '24) | 33k | 26 | $6.00 | 30.6 | 0.54 | 16.89 | N/A | |
![]() | ![]() Mistral Large (Feb '24) | 33k | 26 | $6.00 | 42.0 | 0.43 | 12.34 | N/A | |
![]() | ![]() Mistral Large (Feb '24) | 33k | 26 | $6.00 | 40.1 | 0.51 | 12.98 | N/A | |
![]() | ![]() Mixtral 8x22B | 65k | 26 | $3.00 | 55.1 | 0.37 | 9.44 | N/A | |
![]() Mixtral 8x22B Base | 65k | 26 | $0.60 | 78.2 | 0.58 | 6.98 | N/A | ||
![]() Mixtral 8x22B Fast | 65k | 26 | $1.05 | 100.1 | 0.53 | 5.52 | N/A | ||
![]() Mixtral 8x22B | 65k | 26 | $1.20 | 75.5 | 0.44 | 7.07 | N/A | ||
![]() Mixtral 8x22B | 65k | 26 | $1.20 | 90.9 | 0.31 | 5.81 | N/A | ||
![]() | Phi-4 Mini | 128k | 26 | $0.12 | 222.0 | 0.43 | 2.69 | N/A | |
![]() | Phi-4 Mini | 128k | 26 | $0.00 | 56.3 | 0.34 | 9.22 | N/A | |
![]() | Phi-3 Medium 14B | 128k | 25 | $0.30 | 51.9 | 0.44 | 10.07 | N/A | |
Gemma 3 4B | 128k | 24 | $0.03 | 90.9 | 0.30 | 5.80 | N/A | ||
![]() | Claude 2.1 | 200k | 24 | $12.00 | 29.4 | 1.70 | 18.68 | N/A | |
Claude 2.1 | 200k | 24 | $12.00 | 14.0 | 0.86 | 36.59 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.03 | 135.4 | 0.43 | 4.12 | N/A | ||
![]() | Llama 3.1 8B | 33k | 24 | $0.10 | 2,146.2 | 0.29 | 0.53 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.10 | 73.3 | 1.06 | 7.89 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.22 | 91.9 | 0.36 | 5.80 | N/A | |
Llama 3.1 8B Fast | 128k | 24 | $0.04 | 184.8 | 0.49 | 3.20 | N/A | ||
Llama 3.1 8B Base | 128k | 24 | $0.03 | 66.5 | 0.53 | 8.05 | N/A | ||
Llama 3.1 8B Vertex | 128k | 24 | $0.00 | 118.8 | 0.17 | 4.38 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.38 | 226.0 | 0.31 | 2.52 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.20 | 217.4 | 0.32 | 2.62 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.04 | 47.2 | 0.24 | 10.83 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.10 | 463.5 | 0.38 | 1.46 | N/A | ||
![]() | Llama 3.1 8B | 16k | 24 | $0.05 | 71.2 | 0.74 | 7.77 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.06 | 899.6 | 0.21 | 0.77 | N/A | ||
![]() | Llama 3.1 8B | 16k | 24 | $0.13 | 1,173.1 | 0.21 | 0.64 | N/A | |
Llama 3.1 8B Turbo | 128k | 24 | $0.18 | 143.2 | 0.32 | 3.81 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.15 | 467.2 | 0.18 | 1.25 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.18 | 61.6 | 0.52 | 8.63 | N/A | |
![]() | ![]() Pixtral 12B | 128k | 23 | $0.15 | 106.4 | 0.34 | 5.04 | N/A | |
![]() Pixtral 12B | 128k | 23 | $0.10 | 79.6 | 0.64 | 6.92 | N/A | ||
![]() | ![]() Mistral Small (Feb '24) | 33k | 23 | $1.50 | 160.3 | 0.31 | 3.42 | N/A | |
![]() | ![]() Mistral Small (Feb '24) | 33k | 23 | $1.50 | 88.5 | 0.41 | 6.06 | N/A | |
![]() | ![]() Mistral Medium | 33k | 23 | $4.09 | 40.5 | 0.47 | 12.81 | N/A | |
![]() | ![]() Ministral 8B | 128k | 22 | $0.10 | 140.1 | 0.37 | 3.94 | N/A | |
Gemma 2 9B Fast | 8k | 22 | $0.04 | 169.5 | 0.49 | 3.44 | N/A | ||
Gemma 2 9B Base | 8k | 22 | $0.03 | 171.2 | 0.52 | 3.44 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.04 | 43.1 | 0.47 | 12.06 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.20 | 714.8 | 0.24 | 0.94 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.30 | 134.7 | 0.22 | 3.94 | N/A | ||
![]() LFM 40B | 32k | 22 | $0.15 | 169.2 | 0.43 | 3.38 | N/A | ||
![]() | ![]() Command-R+ | 128k | 21 | $6.00 | 47.7 | 0.49 | 10.98 | N/A | |
![]() Command-R+ | 128k | 21 | $4.38 | 52.0 | 0.28 | 9.90 | N/A | ||
Llama 3 8B | 8k | 21 | $0.10 | 81.3 | 0.48 | 6.63 | N/A | ||
![]() | Llama 3 8B | 8k | 21 | $0.38 | 103.2 | 0.31 | 5.15 | N/A | |
![]() | Llama 3 8B | 8k | 21 | $0.38 | 73.8 | 0.39 | 7.16 | N/A | |
Llama 3 8B | 8k | 21 | $0.04 | 67.9 | 0.29 | 7.65 | N/A | ||
![]() | Llama 3 8B | 8k | 21 | $0.04 | 44.5 | 0.88 | 12.12 | N/A | |
Llama 3 8B | 8k | 21 | $0.06 | 1,350.5 | 0.31 | 0.69 | N/A | ||
Llama 3 8B | 8k | 21 | $0.20 | 174.7 | 0.42 | 3.28 | N/A | ||
Gemini 1.0 Pro Vertex | 33k | 21 | $0.19 | 163.4 | 0.32 | 3.37 | N/A | ||
![]() | ![]() Codestral (May '24) | 33k | 20 | $0.30 | 100.1 | 0.40 | 5.39 | N/A | |
![]() Aya Expanse 32B | 128k | 20 | $0.75 | 120.4 | 0.16 | 4.31 | N/A | ||
![]() | ![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 47.7 | 0.50 | 10.99 | N/A | |
![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 70.8 | 0.24 | 7.30 | N/A | ||
![]() | ![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 50.3 | 0.62 | 10.56 | N/A | |
![]() DBRX | 33k | 20 | $1.13 | 69.7 | 0.56 | 7.73 | N/A | ||
![]() | ![]() Ministral 3B | 128k | 20 | $0.04 | 217.8 | 0.30 | 2.60 | N/A | |
![]() | ![]() Mistral NeMo | 128k | 20 | $0.15 | 128.3 | 0.37 | 4.26 | N/A | |
![]() Mistral NeMo Fast | 128k | 20 | $0.12 | 164.1 | 0.53 | 3.57 | N/A | ||
![]() Mistral NeMo Base | 128k | 20 | $0.06 | 26.6 | 0.75 | 19.53 | N/A | ||
![]() Mistral NeMo | 128k | 20 | $0.06 | 54.0 | 0.24 | 9.49 | N/A | ||
Llama 3.2 3B (FP8) | 128k | 20 | $0.02 | 215.9 | 0.45 | 2.77 | N/A | ||
Llama 3.2 3B | 128k | 20 | $0.10 | 216.7 | 0.85 | 3.16 | N/A | ||
![]() | Llama 3.2 3B | 128k | 20 | $0.15 | 72.4 | 0.47 | 7.38 | N/A | |
Llama 3.2 3B Base | 128k | 20 | $0.01 | 125.8 | 0.48 | 4.46 | N/A | ||
![]() | Llama 3.2 3B | 128k | 20 | $0.06 | 246.1 | 0.41 | 2.44 | N/A | |
Llama 3.2 3B | 128k | 20 | $0.02 | 144.1 | 0.21 | 3.68 | N/A | ||
![]() | Llama 3.2 3B | 32k | 20 | $0.04 | 71.0 | 0.67 | 7.71 | N/A | |
![]() | Llama 3.2 3B | 8k | 20 | $0.10 | 1,587.7 | 0.23 | 0.54 | N/A | |
Llama 3.2 3B Turbo | 128k | 20 | $0.06 | 162.2 | 0.33 | 3.41 | N/A | ||
![]() DeepSeek R1 Distill Qwen 1.5B | 128k | 19 | $0.18 | 339.1 | 0.26 | 7.63 | 5.90 | ||
![]() Jamba 1.5 Mini | 256k | 18 | $0.25 | 164.5 | 0.32 | 3.36 | N/A | ||
![]() | ![]() Jamba 1.5 Mini | 256k | 18 | $0.25 | 82.6 | 0.49 | 6.55 | N/A | |
![]() Jamba 1.6 Mini | 256k | 18 | $0.25 | 200.6 | 0.35 | 2.84 | N/A | ||
![]() | ![]() Mixtral 8x7B | 33k | 17 | $0.70 | 77.1 | 0.38 | 6.86 | N/A | |
![]() | ![]() Mixtral 8x7B | 33k | 17 | $0.51 | 59.5 | 0.33 | 8.73 | N/A | |
![]() Mixtral 8x7B Fast | 33k | 17 | $0.23 | 53.6 | 0.61 | 9.93 | N/A | ||
![]() Mixtral 8x7B Base | 33k | 17 | $0.12 | 53.7 | 0.60 | 9.91 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.50 | 177.9 | 0.33 | 3.14 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.24 | 99.0 | 0.21 | 5.26 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.63 | 95.2 | 0.48 | 5.73 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.60 | 53.1 | 0.43 | 9.85 | N/A | ||
![]() Aya Expanse 8B | 8k | 16 | $0.75 | 167.8 | 0.12 | 3.10 | N/A | ||
![]() | ![]() Command-R | 128k | 15 | $0.75 | 109.6 | 0.36 | 4.92 | N/A | |
![]() Command-R | 128k | 15 | $0.26 | 85.4 | 0.19 | 6.04 | N/A | ||
![]() | ![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 109.2 | 0.36 | 4.94 | N/A | |
![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 176.6 | 0.15 | 2.98 | N/A | ||
![]() | ![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 80.1 | 0.47 | 6.71 | N/A | |
![]() | ![]() Codestral-Mamba | 256k | 14 | $0.25 | 93.0 | 0.53 | 5.90 | N/A | |
![]() | ![]() Mistral 7B | 8k | 10 | $0.25 | 102.3 | 0.36 | 5.25 | N/A | |
![]() Mistral 7B | 8k | 10 | $0.04 | 77.1 | 0.22 | 6.71 | N/A | ||
![]() | ![]() Mistral 7B | 32k | 10 | $0.06 | 117.2 | 0.84 | 5.11 | N/A | |
![]() Mistral 7B | 8k | 10 | $0.20 | 173.8 | 0.18 | 3.06 | N/A | ||
![]() | Llama 3.2 1B | 128k | 10 | $0.10 | 118.5 | 0.45 | 4.67 | N/A | |
Llama 3.2 1B Base | 128k | 10 | $0.01 | 272.7 | 0.48 | 2.31 | N/A | ||
Llama 3.2 1B | 128k | 10 | $0.01 | 179.6 | 0.24 | 3.02 | N/A | ||
![]() | Llama 3.2 1B | 16k | 10 | $0.05 | 2,606.2 | 0.18 | 0.37 | N/A | |
Llama 2 Chat 7B | 4k | 8 | $0.10 | 132.7 | 0.53 | 4.30 | N/A | ||
o1-preview | 128k | $26.25 | 162.4 | 20.44 | 23.52 | N/A | |||
![]() | o1-preview | 128k | $28.88 | 133.1 | 26.72 | 30.47 | N/A | ||
GPT-4o (Aug '24) | 128k | $4.38 | 101.2 | 0.54 | 5.49 | N/A | |||
![]() | GPT-4o (Aug '24) | 128k | $4.38 | 114.9 | 0.82 | 5.17 | N/A | ||
![]() | o3 | 128k | $17.50 | 91.1 | 30.18 | 35.67 | N/A | ||
GPT-4.5 (Preview) | 128k | $93.75 | 58.0 | 1.14 | 9.76 | N/A | |||
![]() | Llama 3.2 11B (Vision) | 128k | $0.16 | 143.3 | 0.47 | 3.96 | N/A | ||
![]() | Llama 3.2 11B (Vision) | 128k | $0.15 | 83.3 | 0.44 | 6.44 | N/A | ||
Llama 3.2 11B (Vision) | 128k | $0.20 | 109.8 | 0.27 | 4.82 | N/A | |||
Llama 3.2 11B (Vision) | 128k | $0.06 | 48.8 | 0.22 | 10.47 | N/A | |||
Llama 3.2 11B (Vision) Turbo | 128k | $0.18 | 115.3 | 0.24 | 4.58 | N/A | |||
Gemma 2 27B Fast | 8k | $0.26 | 87.1 | 0.54 | 6.28 | N/A | |||
Gemma 2 27B Base | 8k | $0.15 | 54.5 | 0.59 | 9.76 | N/A | |||
Gemma 2 27B | 8k | $0.80 | 89.1 | 0.27 | 5.88 | N/A | |||
Gemini 2.5 Flash Preview (Reasoning) (AI_Studio) | 1m | $0.99 | 151.4 | 11.55 | 14.86 | N/A | |||
Gemini 2.5 Flash Preview (AI_Studio) | 1m | $0.26 | 213.1 | 10.71 | 13.06 | N/A | |||
![]() | Claude 3.5 Sonnet (June) | 200k | $6.00 | 45.3 | 0.91 | 11.95 | N/A | ||
Claude 3.5 Sonnet (June) Vertex | 200k | $6.00 | 80.1 | 0.92 | 7.16 | N/A | |||
Claude 3.5 Sonnet (June) | 200k | $6.00 | 80.2 | 0.71 | 6.94 | N/A | |||
![]() | Claude 3 Haiku | 200k | $0.50 | 107.3 | 1.00 | 5.66 | N/A | ||
Claude 3 Haiku | 200k | $0.50 | 137.7 | 0.42 | 4.05 | N/A | |||
![]() | ![]() Mistral Saba | 32k | $0.30 | 84.7 | 0.40 | 6.30 | N/A | ||
![]() DeepSeek Coder V2 Lite Fast, FP8 | 128k | $0.12 | 115.3 | 0.60 | 4.94 | N/A | |||
![]() DeepSeek Coder V2 Lite Base, FP8 | 128k | $0.06 | 110.4 | 0.58 | 5.11 | N/A | |||
![]() Sonar Reasoning | 127k | $2.00 | 77.7 | 2.04 | 34.23 | 25.75 | |||
![]() | ![]() Solar Mini | 4k | $0.15 | 65.2 | 1.09 | 8.76 | N/A | ||
![]() | ![]() Reka Flash | 128k | $0.35 | 46.2 | 0.96 | 11.79 | N/A | ||
![]() | ![]() Reka Core | 128k | $2.00 | 27.9 | 0.97 | 18.87 | N/A | ||
![]() | ![]() Reka Flash (Feb '24) | 128k | $0.35 | 46.1 | 0.94 | 11.79 | N/A | ||
![]() | ![]() Reka Edge | 128k | $0.10 | 84.0 | 0.95 | 6.90 | N/A | ||
Qwen1.5 Chat 110B | 32k | $0.00 | 29.6 | 1.25 | 18.14 | N/A | |||
GPT-4 Turbo | 128k | $15.00 | 33.8 | 0.74 | 15.54 | N/A | |||
![]() | GPT-4 Turbo | 128k | $15.00 | 46.2 | 1.54 | 12.36 | N/A | ||
GPT-4 | 8k | $37.50 | 24.6 | 0.71 | 21.02 | N/A | |||
Gemini 2.0 Flash-Lite (Preview) (AI Studio) | 1m | $0.13 | 200.4 | 0.28 | 2.78 | N/A | |||
Claude 2.0 | 100k | $12.00 | 30.7 | 0.86 | 17.16 | N/A | |||
![]() OpenChat 3.5 | 8k | $0.06 | 45.5 | 0.23 | 11.23 | N/A | |||
![]() Jamba Instruct | 256k | $0.55 | 168.8 | 0.37 | 3.33 | N/A |
Key definitions
Context window: Maximum number of combined input & output tokens. Output tokens commonly have a significantly lower limit (varied by model).
Output Speed: Tokens per second received while the model is generating tokens (ie. after first chunk has been received from the API for models which support streaming).
Latency (Time to First Token): Time to first token received, in seconds, after API request sent. For reasoning models which share reasoning tokens, this will be the first reasoning token. For models which do not support streaming, this represents time to receive the completion.
Price: Price per token, represented as USD per million Tokens. Price is a blend of Input & Output token prices (3:1 ratio).
Output Price: Price per token generated by the model (received from the API), represented as USD per million Tokens.
Input Price: Price per token included in the request/message sent to the API, represented as USD per million Tokens.
Time period: Metrics are 'live' and are based on the past 72 hours of measurements, measurements are taken 8 times a day for single requests and 2 times per day for parallel requests.