LLM API Providers Leaderboard - Comparison of over 100 LLM endpoints
Comparison and ranking of API provider performance for over 100 AI LLM Model endpoints across performance key metrics including price, output speed, latency, context window & others. For more details including relating to our methodology, see our FAQs.
API providers compared: OpenAI, Playground AI, Microsoft Azure, Mistral, Ideogram, DeepSeek, Amazon Bedrock, Hyperbolic, Groq, FriendliAI, Together.ai, Anthropic, Black Forest Labs, Perplexity, Google, Lambda Labs, Fireworks, Cerebras, Leonardo.Ai, Cohere, Recraft AI, Upstage, Simplismart, Speechmatics, Deepinfra, Fish Audio, Replicate, , Genmo, Nebius, Adobe, MiniMax, CentML, StepFun, Runpod, Zyphra, Murf AI, Speechify, Rev AI, AssemblyAI, fal.ai, Rime, kluster.ai, Prodia, Hume AI, Reka AI, Deepgram, Gladia, Baseten, Stability.ai, Midjourney, Reve, Databricks, ElevenLabs, IBM, Vivago AI, SambaNova, Dreamina, xAI, Cartesia, LMNT, PlayAI, 01.AI, Alibaba Cloud, Novita, AI21 Labs, and WaveSpeed.
Features | Model Intelligence | Price | Output tokens/s | Latency | End-to-End Response Time | ||||
---|---|---|---|---|---|---|---|---|---|
Further Analysis | |||||||||
o4-mini (high) | 200k | 70 | $1.93 | 149.9 | 38.82 | 42.16 | N/A | ||
![]() | o4-mini (high) | 200k | 70 | $1.93 | 97.3 | 83.41 | 88.55 | N/A | |
Gemini 2.5 Pro | 1m | 68 | $3.44 | 159.8 | 36.54 | 39.67 | N/A | ||
o3 | 128k | 67 | $17.50 | 177.9 | 22.46 | 25.27 | N/A | ||
![]() | o3 | 128k | 67 | $17.50 | 71.2 | 44.13 | 51.15 | N/A | |
Grok 3 mini Reasoning (high) | 131k | 67 | $0.35 | 170.8 | 0.23 | 14.87 | 11.71 | ||
Grok 3 mini Reasoning (high) Fast | 131k | 67 | $1.45 | 227.4 | 0.23 | 11.22 | 8.80 | ||
o3-mini (high) | 200k | 66 | $1.93 | 153.6 | 48.43 | 51.69 | N/A | ||
![]() | o3-mini (high) | 200k | 66 | $1.93 | 199.5 | 42.64 | 45.15 | N/A | |
o3-mini | 200k | 63 | $1.93 | 152.5 | 14.86 | 18.14 | N/A | ||
![]() | o3-mini | 200k | 63 | $1.93 | 209.2 | 12.40 | 14.79 | N/A | |
Qwen3 235B A22B (Reasoning) | 128k | 62 | $0.10 | 68.6 | 0.98 | 37.40 | 29.14 | ||
Qwen3 235B A22B (Reasoning) (FP8) | 41k | 62 | $0.30 | 20.7 | 0.62 | 121.21 | 96.47 | ||
o1 | 200k | 62 | $26.25 | 120.2 | 24.81 | 28.97 | N/A | ||
![]() | o1 | 200k | 62 | $26.25 | 112.8 | 25.42 | 29.86 | N/A | |
Gemini 2.5 Flash (Reasoning) (AI_Studio) | 1m | 60 | $0.99 | 341.8 | 8.07 | 9.54 | N/A | ||
![]() DeepSeek R1 | 164k | 60 | $0.95 | 38.3 | 0.35 | 74.64 | 61.24 | ||
![]() | ![]() DeepSeek R1 | 64k | 60 | $0.96 | 25.0 | 3.58 | 117.66 | 94.04 | |
![]() DeepSeek R1 | 128k | 60 | $2.00 | 86.7 | 1.12 | 33.95 | 27.06 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $2.36 | 76.3 | 0.42 | 37.75 | 30.77 | |
![]() DeepSeek R1 Base | 128k | 60 | $1.20 | 31.9 | 0.65 | 89.99 | 73.65 | ||
![]() DeepSeek R1 Fast | 128k | 60 | $3.00 | 81.0 | 0.65 | 35.79 | 28.97 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $3.99 | 84.4 | 0.45 | 34.17 | 27.80 | |
![]() | ![]() DeepSeek R1 | 128k | 60 | $2.36 | 115.9 | 0.56 | 25.13 | 20.25 | |
![]() DeepSeek R1 (Fast) | 164k | 60 | $4.25 | 112.7 | 0.64 | 25.90 | 20.82 | ||
![]() DeepSeek R1 (Turbo, FP4) | 33k | 60 | $1.50 | 155.2 | 0.26 | 18.61 | 15.13 | ||
![]() DeepSeek R1 | 64k | 60 | $0.96 | 27.8 | 0.40 | 102.95 | 84.55 | ||
![]() DeepSeek R1 | 128k | 60 | $4.00 | 85.5 | 0.51 | 33.79 | 27.44 | ||
![]() | ![]() DeepSeek R1 Turbo | 64k | 60 | $1.15 | 32.0 | 0.73 | 89.60 | 73.26 | |
![]() | ![]() DeepSeek R1 | 64k | 60 | $4.00 | 34.6 | 1.30 | 83.69 | 67.91 | |
![]() | ![]() DeepSeek R1 | 16k | 60 | $5.50 | 191.5 | 1.83 | 16.69 | 12.25 | |
![]() DeepSeek R1 | 128k | 60 | $4.00 | 99.7 | 0.58 | 29.13 | 23.53 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $7.00 | 29.6 | 0.54 | 96.85 | 79.40 | |
Qwen3 32B (Reasoning) Base | 41k | 59 | $0.15 | 35.4 | 0.61 | 71.15 | 56.43 | ||
Qwen3 32B (Reasoning) (FP8) | 128k | 59 | $0.15 | 25.7 | 0.63 | 98.03 | 77.92 | ||
![]() | Qwen3 32B (Reasoning) (FP8) | 128k | 59 | $0.19 | 32.9 | 0.88 | 76.83 | 60.75 | |
![]() | Qwen3 32B (Reasoning) | 8k | 59 | $0.50 | 333.6 | 0.49 | 7.98 | 5.99 | |
QwQ-32B | 131k | 58 | $0.20 | 54.6 | 1.38 | 56.15 | 45.61 | ||
QwQ-32B Fast | 131k | 58 | $0.75 | 82.0 | 0.54 | 37.01 | 30.37 | ||
QwQ-32B Base | 131k | 58 | $0.23 | 18.6 | 0.57 | 161.59 | 134.10 | ||
![]() | QwQ-32B | 131k | 58 | $0.65 | 78.5 | 0.35 | 38.43 | 31.71 | |
QwQ-32B | 131k | 58 | $0.90 | 155.1 | 0.43 | 19.72 | 16.07 | ||
QwQ-32B | 131k | 58 | $0.14 | 37.8 | 0.32 | 79.47 | 65.92 | ||
QwQ-32B | 131k | 58 | $0.32 | 432.1 | 0.16 | 7.08 | 5.76 | ||
![]() | QwQ-32B | 16k | 58 | $0.63 | 421.2 | 0.43 | 7.53 | 5.91 | |
QwQ-32B | 131k | 58 | $1.20 | 82.8 | 0.50 | 36.64 | 30.10 | ||
Qwen3 14B (Reasoning) (FP8) | 128k | 56 | $0.12 | 85.8 | 0.53 | 29.66 | 23.30 | ||
Qwen3 30B A3B (Reasoning) | 131k | 56 | $0.90 | 165.7 | 0.53 | 15.62 | 12.07 | ||
Qwen3 30B A3B (Reasoning) (FP8) | 41k | 56 | $0.15 | 89.5 | 0.55 | 28.50 | 22.36 | ||
o1-mini | 128k | 54 | $1.93 | 223.7 | 10.42 | 12.66 | N/A | ||
![]() | o1-mini | 128k | 54 | $1.93 | 270.7 | 9.04 | 10.89 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 64k | 53 | $0.48 | 25.9 | 3.45 | 22.76 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.45 | 81.8 | 0.95 | 7.07 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.25 | 27.7 | 1.34 | 19.38 | N/A | ||
![]() DeepSeek V3 (Mar' 25) Fast | 128k | 53 | $3.00 | 92.4 | 0.69 | 6.10 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $0.75 | 26.2 | 0.71 | 19.82 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $0.80 | 91.8 | 0.47 | 5.92 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $2.00 | 132.8 | 0.63 | 4.39 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 160k | 53 | $0.90 | 75.4 | 0.70 | 7.32 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $0.52 | 22.6 | 0.38 | 22.47 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $0.57 | 27.3 | 0.92 | 19.21 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 8k | 53 | $1.13 | 262.8 | 0.62 | 2.53 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.25 | 33.1 | 2.86 | 17.98 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $1.25 | 22.7 | 0.58 | 22.65 | N/A | |
GPT-4.1 mini | 1m | 53 | $0.70 | 85.4 | 0.54 | 6.40 | N/A | ||
![]() | GPT-4.1 mini | 1m | 53 | $0.70 | 213.4 | 0.68 | 3.02 | N/A | |
GPT-4.1 | 1m | 53 | $3.50 | 121.8 | 0.48 | 4.58 | N/A | ||
![]() | GPT-4.1 | 1m | 53 | $3.50 | 208.6 | 0.88 | 3.27 | N/A | |
![]() DeepSeek R1 Distill Qwen 32B | 128k | 52 | $0.14 | 44.8 | 0.28 | 56.11 | 44.66 | ||
![]() | ![]() DeepSeek R1 Distill Qwen 32B | 64k | 52 | $0.30 | 19.6 | 1.21 | 128.78 | 102.06 | |
Grok 3 | 131k | 51 | $6.00 | 42.5 | 0.48 | 12.24 | N/A | ||
Grok 3 Fast | 131k | 51 | $10.00 | 102.3 | 0.36 | 5.25 | N/A | ||
Llama 4 Maverick (FP8) | 1m | 51 | $0.28 | 128.4 | 0.21 | 4.10 | N/A | ||
![]() | Llama 4 Maverick | 128k | 51 | $0.42 | 183.7 | 0.45 | 3.17 | N/A | |
Llama 4 Maverick Vertex | 524k | 51 | $0.00 | 125.2 | 0.33 | 4.33 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.20 | 127.7 | 0.23 | 4.14 | N/A | |
![]() | Llama 4 Maverick (FP8) | 128k | 51 | $0.61 | 62.0 | 0.32 | 8.38 | N/A | |
Llama 4 Maverick | 1m | 51 | $0.39 | 161.3 | 0.44 | 3.54 | N/A | ||
Llama 4 Maverick (FP8) | 131k | 51 | $0.30 | 108.7 | 0.28 | 4.88 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.34 | 77.6 | 0.58 | 7.02 | N/A | |
Llama 4 Maverick | 128k | 51 | $0.30 | 558.1 | 0.19 | 1.09 | N/A | ||
![]() | Llama 4 Maverick | 8k | 51 | $0.92 | 784.9 | 0.37 | 1.01 | N/A | |
Llama 4 Maverick (FP8) | 524k | 51 | $0.41 | 106.5 | 0.22 | 4.92 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.35 | 125.8 | 0.40 | 4.37 | N/A | |
GPT-4o (March 2025) | 128k | 50 | $7.50 | 148.7 | 0.29 | 3.65 | N/A | ||
Gemini 2.0 Pro Experimental (AI Studio) | 2m | 49 | $0.00 | 33.1 | 16.52 | 31.65 | N/A | ||
![]() | ![]() DeepSeek R1 Distill Qwen 14B | 64k | 49 | $0.15 | 46.0 | 0.85 | 55.22 | 43.50 | |
![]() DeepSeek R1 Distill Qwen 14B | 128k | 49 | $1.60 | 170.0 | 0.39 | 15.09 | 11.76 | ||
Gemini 2.5 Flash (AI_Studio) | 1m | 48 | $0.26 | 266.9 | 0.27 | 2.15 | N/A | ||
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.30 | 60.5 | 0.30 | 41.65 | 33.08 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 66k | 48 | $0.94 | 2,221.9 | 0.26 | 1.38 | 0.90 | |
![]() DeepSeek R1 Distill Llama 70B Base | 128k | 48 | $0.38 | 49.6 | 0.60 | 51.01 | 40.33 | ||
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.34 | 32.2 | 0.58 | 78.16 | 62.07 | ||
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.81 | 414.3 | 0.16 | 6.20 | 4.83 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 16k | 48 | $0.88 | 296.3 | 1.53 | 9.97 | 6.75 | |
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $2.00 | 121.2 | 0.39 | 21.02 | 16.50 | ||
![]() | Claude 3.7 Sonnet | 200k | 48 | $6.00 | 48.9 | 1.03 | 11.26 | N/A | |
Claude 3.7 Sonnet | 200k | 48 | $6.00 | 78.1 | 1.01 | 7.42 | N/A | ||
Gemini 2.0 Flash Vertex | 1m | 48 | $0.26 | 235.6 | 0.26 | 2.38 | N/A | ||
Gemini 2.0 Flash (AI Studio) | 1m | 48 | $0.17 | 234.8 | 0.38 | 2.51 | N/A | ||
![]() | ![]() Reka Flash 3 | 128k | 47 | $0.35 | 56.8 | 0.96 | 44.95 | 35.19 | |
Gemini 2.0 Flash (exp) (AI Studio) | 1m | 46 | $0.00 | 239.2 | 0.26 | 2.35 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) | 66k | 46 | $0.48 | 25.9 | 3.48 | 22.79 | N/A | |
![]() DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $0.25 | 32.3 | 1.38 | 16.86 | N/A | ||
![]() DeepSeek V3 (Dec '24) | 128k | 46 | $0.75 | 23.3 | 0.68 | 22.09 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) | 128k | 46 | $2.00 | 84.9 | 0.48 | 6.37 | N/A | |
![]() DeepSeek V3 (Dec '24) | 128k | 46 | $1.31 | 56.4 | 0.85 | 9.71 | N/A | ||
![]() DeepSeek V3 (Dec '24) | 64k | 46 | $0.59 | 33.4 | 0.30 | 15.28 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) Turbo | 64k | 46 | $0.63 | 32.2 | 0.76 | 16.28 | N/A | |
![]() | ![]() DeepSeek V3 (Dec '24) | 64k | 46 | $0.89 | 33.7 | 0.82 | 15.67 | N/A | |
![]() DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $1.25 | 32.8 | 3.27 | 18.52 | N/A | ||
Qwen2.5 Max | 32k | 45 | $2.80 | 50.6 | 1.23 | 11.10 | N/A | ||
Gemini 1.5 Pro (Sep) (Vertex) | 2m | 45 | $2.19 | 94.3 | 1.21 | 6.51 | N/A | ||
Gemini 1.5 Pro (Sep) (AI Studio) | 2m | 45 | $2.19 | 93.7 | 16.66 | 22.00 | N/A | ||
![]() | Claude 3.5 Sonnet (Oct) | 200k | 44 | $6.00 | 46.4 | 0.98 | 11.76 | N/A | |
Claude 3.5 Sonnet (Oct) Vertex | 200k | 44 | $6.00 | 78.8 | 0.88 | 7.23 | N/A | ||
Claude 3.5 Sonnet (Oct) | 200k | 44 | $6.00 | 78.8 | 1.02 | 7.36 | N/A | ||
![]() Sonar | 127k | 43 | $1.00 | 135.5 | 1.67 | 5.37 | N/A | ||
Llama 4 Scout | 1m | 43 | $0.14 | 119.3 | 0.26 | 4.46 | N/A | ||
![]() | Llama 4 Scout | 32k | 43 | $0.70 | 2,796.0 | 0.24 | 0.42 | N/A | |
![]() | Llama 4 Scout | 128k | 43 | $0.29 | 162.1 | 0.52 | 3.60 | N/A | |
Llama 4 Scout Vertex | 1m | 43 | $0.00 | 129.4 | 0.36 | 4.23 | N/A | ||
![]() | Llama 4 Scout | 1m | 43 | $0.10 | 112.0 | 0.25 | 4.72 | N/A | |
![]() | Llama 4 Scout | 128k | 43 | $0.34 | 36.7 | 0.34 | 13.95 | N/A | |
Llama 4 Scout | 1m | 43 | $0.26 | 150.5 | 0.46 | 3.78 | N/A | ||
Llama 4 Scout | 131k | 43 | $0.15 | 85.7 | 0.26 | 6.10 | N/A | ||
![]() | Llama 4 Scout | 131k | 43 | $0.20 | 81.0 | 0.65 | 6.82 | N/A | |
Llama 4 Scout | 131k | 43 | $0.17 | 583.3 | 0.37 | 1.22 | N/A | ||
![]() | Llama 4 Scout | 8k | 43 | $0.47 | 781.3 | 0.77 | 1.41 | N/A | |
Llama 4 Scout | 328k | 43 | $0.28 | 112.4 | 0.28 | 4.73 | N/A | ||
![]() | Llama 4 Scout | 128k | 43 | $0.71 | 86.7 | 0.46 | 6.23 | N/A | |
![]() Sonar Pro | 200k | 43 | $6.00 | 96.3 | 1.98 | 7.17 | N/A | ||
QwQ 32B-Preview | 33k | 43 | $0.26 | 45.3 | 0.26 | 55.39 | 44.11 | ||
QwQ 32B-Preview | 33k | 43 | $1.20 | 82.9 | 0.56 | 30.72 | 24.13 | ||
![]() | ![]() Nova Premier | 1m | 43 | $5.00 | 66.6 | 0.80 | 8.32 | N/A | |
GPT-4o (Nov '24) | 128k | 41 | $4.38 | 141.8 | 0.53 | 4.06 | N/A | ||
![]() | GPT-4o (Nov '24) | 128k | 41 | $4.38 | 137.1 | 1.17 | 4.82 | N/A | |
Gemini 2.0 Flash-Lite (Feb '25) (AI Studio) | 1m | 41 | $0.13 | 215.4 | 0.27 | 2.59 | N/A | ||
Llama 3.3 70B (FP8) | 128k | 41 | $0.17 | 52.3 | 0.29 | 9.85 | N/A | ||
![]() | Llama 3.3 70B | 33k | 41 | $0.94 | 2,464.9 | 0.29 | 0.50 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.40 | 68.0 | 1.30 | 8.65 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.71 | 160.1 | 0.55 | 3.68 | N/A | |
Llama 3.3 70B Fast | 128k | 41 | $0.38 | 130.1 | 0.55 | 4.40 | N/A | ||
Llama 3.3 70B Base | 128k | 41 | $0.20 | 32.5 | 0.68 | 16.09 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.50 | 152.9 | 0.52 | 3.79 | N/A | |
![]() | Llama 3.3 70B | 128k | 41 | $0.71 | 52.9 | 0.44 | 9.89 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.90 | 198.8 | 0.53 | 3.05 | N/A | ||
Llama 3.3 70B (Turbo, FP8) | 128k | 41 | $0.20 | 30.4 | 0.26 | 16.71 | N/A | ||
Llama 3.3 70B | 128k | 41 | $0.27 | 11.9 | 0.78 | 42.69 | N/A | ||
Llama 3.3 70B | 128k | 41 | $0.60 | 183.0 | 0.37 | 3.10 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.20 | 24.6 | 0.72 | 21.04 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.64 | 425.3 | 0.35 | 1.53 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.75 | 470.3 | 0.30 | 1.36 | N/A | |
Llama 3.3 70B Turbo | 128k | 41 | $0.88 | 151.0 | 0.31 | 3.62 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.70 | 33.2 | 0.45 | 15.52 | N/A | |
GPT-4.1 nano | 1m | 41 | $0.17 | 231.1 | 0.46 | 2.62 | N/A | ||
![]() | GPT-4.1 nano | 1m | 41 | $0.17 | 209.1 | 0.61 | 3.00 | N/A | |
GPT-4o (May '24) | 128k | 41 | $7.50 | 103.4 | 0.52 | 5.35 | N/A | ||
![]() | GPT-4o (May '24) | 128k | 41 | $7.50 | 140.0 | 0.71 | 4.28 | N/A | |
Llama 3.1 405B (FP8) | 128k | 40 | $0.80 | 32.0 | 0.35 | 15.99 | N/A | ||
Llama 3.1 405B | 128k | 40 | $9.50 | 19.1 | 0.98 | 27.19 | N/A | ||
Llama 3.1 405B | 128k | 40 | $4.00 | 86.2 | 1.15 | 6.95 | N/A | ||
![]() | Llama 3.1 405B Standard | 128k | 40 | $2.40 | 31.1 | 1.81 | 17.87 | N/A | |
![]() | Llama 3.1 405B Latency Optimized | 128k | 40 | $3.00 | 88.9 | 0.43 | 6.05 | N/A | |
Llama 3.1 405B Base | 128k | 40 | $1.50 | 31.2 | 0.72 | 16.73 | N/A | ||
Llama 3.1 405B Vertex | 128k | 40 | $7.75 | 29.6 | 0.41 | 17.28 | N/A | ||
![]() | Llama 3.1 405B | 128k | 40 | $8.00 | 31.1 | 0.46 | 16.52 | N/A | |
Llama 3.1 405B | 128k | 40 | $3.00 | 89.3 | 0.47 | 6.07 | N/A | ||
Llama 3.1 405B | 33k | 40 | $0.90 | 25.2 | 0.47 | 20.29 | N/A | ||
![]() | Llama 3.1 405B | 16k | 40 | $6.25 | 174.8 | 1.60 | 4.46 | N/A | |
Llama 3.1 405B | 128k | 40 | $7.50 | 38.7 | 0.79 | 13.72 | N/A | ||
Llama 3.1 405B Turbo | 128k | 40 | $3.50 | 104.2 | 0.40 | 5.20 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.40 | 32.1 | 1.39 | 16.96 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.20 | 25.3 | 0.72 | 20.47 | N/A | ||
Qwen2.5 72B Fast | 131k | 40 | $0.38 | 66.9 | 0.55 | 8.02 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.90 | 69.6 | 0.46 | 7.65 | N/A | ||
Qwen2.5 72B | 33k | 40 | $0.27 | 41.1 | 0.32 | 12.50 | N/A | ||
Qwen2.5 72B Turbo | 131k | 40 | $1.20 | 100.1 | 0.53 | 5.53 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.00 | 42.9 | 1.15 | 12.80 | N/A | ||
![]() | ![]() MiniMax-Text-01 | 1m | 40 | $0.42 | 32.5 | 0.86 | 16.23 | N/A | |
Phi-4 | 16k | 40 | $0.15 | 119.5 | 0.51 | 4.69 | N/A | ||
![]() | Phi-4 | 16k | 40 | $0.22 | 44.6 | 0.42 | 11.64 | N/A | |
Phi-4 | 16k | 40 | $0.09 | 44.1 | 0.29 | 11.62 | N/A | ||
![]() Command A | 256k | 40 | $4.38 | 100.4 | 0.21 | 5.19 | N/A | ||
Gemini 1.5 Flash (Sep) (Vertex) | 1m | 39 | $0.13 | 196.8 | 0.19 | 2.73 | N/A | ||
Gemini 1.5 Flash (Sep) (AI Studio) | 1m | 39 | $0.13 | 171.5 | 0.29 | 3.21 | N/A | ||
![]() | ![]() Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 73.2 | 0.40 | 7.23 | N/A | |
![]() | ![]() Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 36.5 | 0.51 | 14.21 | N/A | |
Gemma 3 27B | 128k | 38 | $0.07 | 35.8 | 0.68 | 14.66 | N/A | ||
Grok Beta | 128k | 38 | $7.50 | 67.5 | 0.27 | 7.67 | N/A | ||
![]() | ![]() Pixtral Large | 128k | 37 | $3.00 | 42.0 | 0.36 | 12.27 | N/A | |
Qwen2.5 Instruct 32B Fast | 128k | 37 | $0.20 | 82.1 | 0.55 | 6.64 | N/A | ||
Qwen2.5 Instruct 32B Base | 128k | 37 | $0.10 | 60.0 | 0.56 | 8.90 | N/A | ||
Llama 3.1 Nemotron 70B (FP8) | 128k | 37 | $0.17 | 48.6 | 0.30 | 10.58 | N/A | ||
Llama 3.1 Nemotron 70B Base | 128k | 37 | $0.20 | 40.4 | 0.66 | 13.03 | N/A | ||
Llama 3.1 Nemotron 70B Fast | 128k | 37 | $0.38 | 70.9 | 0.55 | 7.60 | N/A | ||
Llama 3.1 Nemotron 70B | 128k | 37 | $0.27 | 35.2 | 0.33 | 14.54 | N/A | ||
![]() | ![]() Nova Pro | 300k | 37 | $1.40 | 150.9 | 0.36 | 3.68 | N/A | |
![]() | ![]() Nova Pro Latency Optimized | 300k | 37 | $1.75 | 131.8 | 0.55 | 4.34 | N/A | |
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 38.1 | 0.42 | 13.55 | N/A | |
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 32.3 | 0.45 | 15.93 | N/A | |
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 35.3 | 0.50 | 14.66 | N/A | |
Qwen2.5 Coder 32B | 33k | 36 | $0.09 | 40.8 | 0.34 | 12.60 | N/A | ||
Qwen2.5 Coder 32B | 131k | 36 | $0.20 | 50.4 | 1.30 | 11.23 | N/A | ||
Qwen2.5 Coder 32B | 33k | 36 | $0.10 | 52.5 | 0.26 | 9.78 | N/A | ||
Qwen2.5 Coder 32B | 131k | 36 | $0.80 | 86.9 | 0.49 | 6.24 | N/A | ||
GPT-4o mini | 128k | 36 | $0.26 | 60.3 | 0.36 | 8.65 | N/A | ||
![]() | GPT-4o mini | 128k | 36 | $0.26 | 158.7 | 0.86 | 4.01 | N/A | |
Llama 3.1 70B (FP8) | 128k | 35 | $0.17 | 46.7 | 0.31 | 11.02 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.40 | 166.6 | 1.13 | 4.13 | N/A | ||
![]() | Llama 3.1 70B Standard | 128k | 35 | $0.72 | 31.7 | 0.65 | 16.44 | N/A | |
![]() | Llama 3.1 70B Latency Optimized | 128k | 35 | $0.90 | 135.1 | 0.32 | 4.02 | N/A | |
Llama 3.1 70B Base | 128k | 35 | $0.20 | 24.9 | 0.70 | 20.78 | N/A | ||
Llama 3.1 70B Fast | 128k | 35 | $0.38 | 142.9 | 0.54 | 4.04 | N/A | ||
Llama 3.1 70B Vertex | 128k | 35 | $0.00 | 73.0 | 0.27 | 7.11 | N/A | ||
![]() | Llama 3.1 70B | 128k | 35 | $2.90 | 53.6 | 0.43 | 9.76 | N/A | |
Llama 3.1 70B | 128k | 35 | $0.90 | 161.2 | 0.67 | 3.77 | N/A | ||
Llama 3.1 70B (Turbo, FP8) | 128k | 35 | $0.20 | 37.5 | 0.27 | 13.61 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.27 | 22.4 | 0.55 | 22.90 | N/A | ||
![]() | Llama 3.1 70B | 32k | 35 | $0.19 | 27.6 | 1.56 | 19.71 | N/A | |
Llama 3.1 70B Turbo | 128k | 35 | $0.88 | 138.1 | 0.47 | 4.09 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.90 | 127.2 | 0.51 | 4.44 | N/A | ||
![]() | ![]() Mistral Small 3.1 | 128k | 35 | $0.15 | 170.7 | 0.28 | 3.21 | N/A | |
![]() Mistral Small 3.1 Vertex | 128k | 35 | $0.15 | 57.9 | 0.18 | 8.81 | N/A | ||
![]() | ![]() Mistral Small 3 | 32k | 35 | $0.15 | 143.3 | 0.32 | 3.81 | N/A | |
![]() Mistral Small 3 | 32k | 35 | $0.09 | 86.6 | 0.22 | 6.00 | N/A | ||
![]() Mistral Small 3 | 32k | 35 | $0.80 | 97.6 | 0.22 | 5.34 | N/A | ||
![]() | Claude 3 Opus | 200k | 35 | $30.00 | 25.5 | 1.20 | 20.79 | N/A | |
Claude 3 Opus Vertex | 200k | 35 | $30.00 | 22.2 | 2.39 | 24.90 | N/A | ||
Claude 3 Opus | 200k | 35 | $30.00 | 26.9 | 0.99 | 19.54 | N/A | ||
![]() | Claude 3.5 Haiku Standard | 200k | 35 | $1.60 | 54.9 | 1.19 | 10.30 | N/A | |
![]() | Claude 3.5 Haiku Latency Optimized | 200k | 35 | $2.00 | 96.2 | 0.48 | 5.68 | N/A | |
Claude 3.5 Haiku Vertex | 200k | 35 | $1.60 | 65.4 | 1.58 | 9.22 | N/A | ||
Claude 3.5 Haiku | 200k | 35 | $1.60 | 65.1 | 0.62 | 8.30 | N/A | ||
![]() | ![]() DeepSeek R1 Distill Llama 8B | 32k | 34 | $0.04 | 53.9 | 0.67 | 47.09 | 37.14 | |
Gemma 3 12B | 128k | 34 | $0.06 | 28.6 | 0.56 | 18.05 | N/A | ||
Gemini 1.5 Pro (May) (Vertex) | 2m | 34 | $2.19 | 67.9 | 0.36 | 7.73 | N/A | ||
Gemini 1.5 Pro (May) (AI Studio) | 2m | 34 | $2.19 | 67.7 | 0.43 | 7.82 | N/A | ||
Qwen Turbo | 1m | 34 | $0.09 | 107.3 | 1.03 | 5.70 | N/A | ||
![]() | Llama 3.2 90B (Vision) | 128k | 33 | $0.72 | 60.5 | 0.50 | 8.77 | N/A | |
Llama 3.2 90B (Vision) Vertex | 128k | 33 | $0.00 | 29.6 | 0.19 | 17.09 | N/A | ||
Llama 3.2 90B (Vision) | 33k | 33 | $0.36 | 23.9 | 0.52 | 21.45 | N/A | ||
Llama 3.2 90B (Vision) Turbo | 128k | 33 | $1.20 | 28.7 | 0.21 | 17.63 | N/A | ||
Qwen2 72B | 33k | 33 | $0.90 | 39.6 | 0.44 | 13.06 | N/A | ||
Qwen2 72B | 131k | 33 | $0.00 | 31.1 | 1.37 | 17.45 | N/A | ||
![]() | ![]() Nova Lite | 300k | 33 | $0.10 | 278.6 | 0.31 | 2.10 | N/A | |
Gemini 1.5 Flash-8B AI Studio | 1m | 31 | $0.07 | 275.4 | 0.19 | 2.01 | N/A | ||
![]() Jamba 1.5 Large | 256k | 29 | $3.50 | 67.4 | 0.55 | 7.97 | N/A | ||
![]() | ![]() Jamba 1.5 Large | 256k | 29 | $3.50 | 51.3 | 0.68 | 10.42 | N/A | |
![]() Jamba 1.6 Large | 256k | 29 | $3.50 | 64.6 | 0.56 | 8.30 | N/A | ||
Gemini 1.5 Flash (May) (Vertex) | 1m | 28 | $0.13 | 318.5 | 0.27 | 1.84 | N/A | ||
Gemini 1.5 Flash (May) (AI Studio) | 1m | 28 | $0.13 | 321.8 | 0.24 | 1.79 | N/A | ||
![]() | ![]() Nova Micro | 130k | 28 | $0.06 | 314.6 | 0.31 | 1.90 | N/A | |
![]() Yi-Large | 32k | 28 | $3.00 | 69.0 | 0.41 | 7.66 | N/A | ||
![]() | Claude 3 Sonnet | 200k | 28 | $6.00 | 50.1 | 0.76 | 10.74 | N/A | |
Claude 3 Sonnet | 200k | 28 | $6.00 | 61.4 | 0.57 | 8.72 | N/A | ||
![]() | ![]() Codestral (Jan '25) | 256k | 28 | $0.45 | 203.4 | 0.26 | 2.71 | N/A | |
![]() Codestral (Jan '25) Vertex | 128k | 28 | $0.45 | 151.6 | 0.14 | 3.44 | N/A | ||
Llama 3 70B | 8k | 27 | $1.18 | 46.5 | 0.38 | 11.13 | N/A | ||
Llama 3 70B | 8k | 27 | $0.40 | 20.6 | 1.47 | 25.70 | N/A | ||
![]() | Llama 3 70B | 8k | 27 | $2.86 | 43.7 | 0.40 | 11.85 | N/A | |
![]() | Llama 3 70B | 8k | 27 | $2.90 | 18.9 | 0.74 | 27.16 | N/A | |
Llama 3 70B | 8k | 27 | $0.27 | 33.6 | 0.47 | 15.36 | N/A | ||
![]() | Llama 3 70B | 8k | 27 | $0.57 | 20.7 | 1.11 | 25.24 | N/A | |
Llama 3 70B | 8k | 27 | $0.64 | 333.3 | 0.22 | 1.72 | N/A | ||
Llama 3 70B (Reference, FP16) | 8k | 27 | $0.90 | 144.7 | 0.64 | 4.10 | N/A | ||
Llama 3 70B (Turbo, FP8) | 8k | 27 | $0.88 | 149.2 | 0.35 | 3.70 | N/A | ||
![]() | ![]() Mistral Small (Sep '24) | 33k | 27 | $0.30 | 68.0 | 0.30 | 7.65 | N/A | |
![]() | Phi-4 Multimodal | 128k | 27 | $0.00 | 21.5 | 0.34 | 23.60 | N/A | |
Qwen2.5 Coder 7B Fast | 131k | 27 | $0.04 | 211.5 | 0.49 | 2.86 | N/A | ||
Qwen2.5 Coder 7B Base | 131k | 27 | $0.01 | 196.5 | 0.48 | 3.03 | N/A | ||
![]() | ![]() Mistral Large (Feb '24) | 33k | 26 | $6.00 | 30.6 | 0.48 | 16.81 | N/A | |
![]() | ![]() Mistral Large (Feb '24) | 33k | 26 | $6.00 | 44.2 | 0.39 | 11.70 | N/A | |
![]() | ![]() Mixtral 8x22B | 65k | 26 | $3.00 | 60.8 | 0.34 | 8.57 | N/A | |
![]() Mixtral 8x22B Base | 65k | 26 | $0.60 | 73.6 | 0.54 | 7.33 | N/A | ||
![]() Mixtral 8x22B Fast | 65k | 26 | $1.05 | 99.3 | 0.54 | 5.57 | N/A | ||
![]() Mixtral 8x22B | 65k | 26 | $1.20 | 90.6 | 0.32 | 5.85 | N/A | ||
![]() Mixtral 8x22B | 65k | 26 | $1.20 | 88.7 | 0.34 | 5.97 | N/A | ||
![]() | Phi-4 Mini | 128k | 26 | $0.12 | 222.5 | 0.26 | 2.51 | N/A | |
![]() | Phi-4 Mini | 128k | 26 | $0.00 | 57.5 | 0.32 | 9.02 | N/A | |
![]() | Phi-3 Medium 14B | 128k | 25 | $0.30 | 52.4 | 0.42 | 9.97 | N/A | |
Gemma 3 4B | 128k | 24 | $0.03 | 154.2 | 0.25 | 3.49 | N/A | ||
![]() | Claude 2.1 | 200k | 24 | $12.00 | 30.0 | 1.85 | 18.51 | N/A | |
Claude 2.1 | 200k | 24 | $12.00 | 14.1 | 0.86 | 36.43 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.03 | 142.3 | 0.23 | 3.75 | N/A | ||
![]() | Llama 3.1 8B | 33k | 24 | $0.10 | 2,165.0 | 0.31 | 0.54 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.10 | 416.3 | 0.94 | 2.14 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.22 | 92.8 | 0.36 | 5.75 | N/A | |
Llama 3.1 8B Fast | 128k | 24 | $0.04 | 185.7 | 0.48 | 3.17 | N/A | ||
Llama 3.1 8B Base | 128k | 24 | $0.03 | 62.0 | 0.53 | 8.60 | N/A | ||
Llama 3.1 8B Vertex | 128k | 24 | $0.00 | 119.9 | 0.17 | 4.34 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.38 | 225.6 | 0.29 | 2.51 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.20 | 274.1 | 0.32 | 2.15 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.04 | 51.4 | 0.52 | 10.26 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.10 | 440.6 | 0.34 | 1.47 | N/A | ||
![]() | Llama 3.1 8B | 16k | 24 | $0.03 | 77.5 | 0.65 | 7.10 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.06 | 899.7 | 0.16 | 0.72 | N/A | ||
![]() | Llama 3.1 8B | 16k | 24 | $0.13 | 1,168.2 | 0.22 | 0.65 | N/A | |
Llama 3.1 8B Turbo | 128k | 24 | $0.18 | 152.4 | 0.45 | 3.73 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.15 | 466.2 | 0.18 | 1.25 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.18 | 61.9 | 0.46 | 8.53 | N/A | |
![]() | ![]() Pixtral 12B | 128k | 23 | $0.15 | 89.6 | 0.30 | 5.88 | N/A | |
![]() Pixtral 12B | 128k | 23 | $0.10 | 73.6 | 0.73 | 7.52 | N/A | ||
![]() | ![]() Mistral Small (Feb '24) | 33k | 23 | $1.50 | 168.1 | 0.28 | 3.25 | N/A | |
![]() | ![]() Mistral Small (Feb '24) | 33k | 23 | $1.50 | 87.1 | 0.38 | 6.12 | N/A | |
![]() | ![]() Mistral Medium | 33k | 23 | $4.09 | 41.8 | 0.36 | 12.32 | N/A | |
![]() | ![]() Ministral 8B | 128k | 22 | $0.10 | 139.7 | 0.28 | 3.86 | N/A | |
Gemma 2 9B Fast | 8k | 22 | $0.04 | 160.7 | 0.49 | 3.60 | N/A | ||
Gemma 2 9B Base | 8k | 22 | $0.03 | 169.7 | 0.49 | 3.43 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.04 | 25.4 | 0.64 | 20.29 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.20 | 723.8 | 0.22 | 0.91 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.30 | 133.2 | 0.24 | 3.99 | N/A | ||
![]() LFM 40B | 32k | 22 | $0.15 | 165.1 | 0.18 | 3.20 | N/A | ||
![]() | ![]() Command-R+ | 128k | 21 | $6.00 | 48.0 | 0.47 | 10.88 | N/A | |
![]() Command-R+ | 128k | 21 | $4.38 | 49.2 | 0.26 | 10.42 | N/A | ||
Llama 3 8B | 8k | 21 | $0.10 | 81.1 | 0.36 | 6.53 | N/A | ||
![]() | Llama 3 8B | 8k | 21 | $0.38 | 103.7 | 0.31 | 5.13 | N/A | |
![]() | Llama 3 8B | 8k | 21 | $0.38 | 73.7 | 0.36 | 7.14 | N/A | |
Llama 3 8B | 8k | 21 | $0.04 | 114.0 | 0.21 | 4.59 | N/A | ||
![]() | Llama 3 8B | 8k | 21 | $0.04 | 60.9 | 0.80 | 9.01 | N/A | |
Llama 3 8B | 8k | 21 | $0.06 | 1,349.8 | 0.29 | 0.66 | N/A | ||
Llama 3 8B | 8k | 21 | $0.20 | 192.2 | 0.49 | 3.09 | N/A | ||
![]() | ![]() Codestral (May '24) | 33k | 20 | $0.30 | 107.0 | 0.30 | 4.98 | N/A | |
![]() Aya Expanse 32B | 128k | 20 | $0.75 | 121.4 | 0.16 | 4.28 | N/A | ||
![]() | ![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 47.5 | 0.48 | 11.00 | N/A | |
![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 68.6 | 0.24 | 7.53 | N/A | ||
![]() | ![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 28.9 | 0.65 | 17.96 | N/A | |
![]() | ![]() Ministral 3B | 128k | 20 | $0.04 | 226.7 | 0.27 | 2.47 | N/A | |
![]() | ![]() Mistral NeMo | 128k | 20 | $0.15 | 142.7 | 0.29 | 3.79 | N/A | |
![]() Mistral NeMo Fast | 128k | 20 | $0.12 | 161.5 | 0.51 | 3.61 | N/A | ||
![]() Mistral NeMo Base | 128k | 20 | $0.06 | 41.3 | 0.60 | 12.71 | N/A | ||
![]() Mistral NeMo | 128k | 20 | $0.06 | 58.3 | 0.25 | 8.83 | N/A | ||
Llama 3.2 3B (FP8) | 128k | 20 | $0.02 | 221.5 | 0.26 | 2.51 | N/A | ||
Llama 3.2 3B | 128k | 20 | $0.10 | 100.7 | 1.11 | 6.08 | N/A | ||
![]() | Llama 3.2 3B | 128k | 20 | $0.15 | 72.7 | 0.46 | 7.33 | N/A | |
Llama 3.2 3B Base | 128k | 20 | $0.01 | 23.8 | 0.59 | 21.58 | N/A | ||
![]() | Llama 3.2 3B | 128k | 20 | $0.06 | 249.0 | 0.25 | 2.26 | N/A | |
Llama 3.2 3B | 128k | 20 | $0.02 | 138.4 | 0.18 | 3.79 | N/A | ||
![]() | Llama 3.2 3B | 32k | 20 | $0.04 | 105.2 | 0.59 | 5.34 | N/A | |
![]() | Llama 3.2 3B | 8k | 20 | $0.10 | 1,591.0 | 0.19 | 0.51 | N/A | |
Llama 3.2 3B Turbo | 128k | 20 | $0.06 | 161.6 | 0.41 | 3.50 | N/A | ||
![]() DeepSeek R1 Distill Qwen 1.5B | 128k | 19 | $0.18 | 386.9 | 0.23 | 6.69 | 5.17 | ||
![]() Jamba 1.5 Mini | 256k | 18 | $0.25 | 179.8 | 0.29 | 3.08 | N/A | ||
![]() | ![]() Jamba 1.5 Mini | 256k | 18 | $0.25 | 82.8 | 0.46 | 6.51 | N/A | |
![]() Jamba 1.6 Mini | 256k | 18 | $0.25 | 199.3 | 0.34 | 2.85 | N/A | ||
![]() | ![]() Mixtral 8x7B | 33k | 17 | $0.70 | 91.1 | 0.30 | 5.79 | N/A | |
![]() | ![]() Mixtral 8x7B | 33k | 17 | $0.51 | 75.4 | 0.32 | 6.95 | N/A | |
![]() Mixtral 8x7B Fast | 33k | 17 | $0.23 | 52.8 | 0.60 | 10.08 | N/A | ||
![]() Mixtral 8x7B Base | 33k | 17 | $0.12 | 20.0 | 0.60 | 25.65 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.24 | 99.9 | 0.23 | 5.23 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.60 | 55.7 | 0.44 | 9.42 | N/A | ||
![]() Aya Expanse 8B | 8k | 16 | $0.75 | 167.4 | 0.12 | 3.11 | N/A | ||
![]() | ![]() Command-R | 128k | 15 | $0.75 | 109.6 | 0.34 | 4.90 | N/A | |
![]() Command-R | 128k | 15 | $0.26 | 75.4 | 0.19 | 6.83 | N/A | ||
![]() | ![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 109.6 | 0.34 | 4.90 | N/A | |
![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 176.1 | 0.15 | 2.99 | N/A | ||
![]() | ![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 46.3 | 0.51 | 11.31 | N/A | |
![]() | ![]() Codestral-Mamba | 256k | 14 | $0.25 | 94.7 | 0.43 | 5.71 | N/A | |
![]() | ![]() Mistral 7B | 8k | 10 | $0.25 | 106.9 | 0.30 | 4.98 | N/A | |
![]() Mistral 7B | 8k | 10 | $0.04 | 80.7 | 0.20 | 6.40 | N/A | ||
![]() | ![]() Mistral 7B | 32k | 10 | $0.04 | 118.5 | 0.78 | 5.00 | N/A | |
![]() Mistral 7B | 8k | 10 | $0.20 | 177.5 | 0.16 | 2.97 | N/A | ||
![]() | Llama 3.2 1B | 128k | 10 | $0.10 | 117.5 | 0.45 | 4.71 | N/A | |
Llama 3.2 1B Base | 128k | 10 | $0.01 | 271.2 | 0.48 | 2.32 | N/A | ||
Llama 3.2 1B | 128k | 10 | $0.01 | 120.5 | 0.35 | 4.50 | N/A | ||
![]() | Llama 3.2 1B | 16k | 10 | $0.05 | 2,604.0 | 0.21 | 0.40 | N/A | |
Llama 2 Chat 7B | 4k | 8 | $0.10 | 133.4 | 0.42 | 4.17 | N/A | ||
GPT-4o (Aug '24) | 128k | $4.38 | 102.8 | 0.53 | 5.39 | N/A | |||
![]() | GPT-4o (Aug '24) | 128k | $4.38 | 130.8 | 0.69 | 4.52 | N/A | ||
![]() | Llama 3.2 11B (Vision) | 128k | $0.16 | 143.4 | 0.46 | 3.95 | N/A | ||
![]() | Llama 3.2 11B (Vision) | 128k | $0.15 | 83.3 | 0.29 | 6.30 | N/A | ||
Llama 3.2 11B (Vision) | 128k | $0.06 | 53.1 | 0.44 | 9.85 | N/A | |||
Llama 3.2 11B (Vision) Turbo | 128k | $0.18 | 60.8 | 0.78 | 9.00 | N/A | |||
Gemma 2 27B Fast | 8k | $0.26 | 88.7 | 0.54 | 6.18 | N/A | |||
Gemma 2 27B Base | 8k | $0.15 | 53.4 | 0.57 | 9.94 | N/A | |||
Gemma 2 27B | 8k | $0.80 | 91.8 | 0.24 | 5.69 | N/A | |||
![]() | Claude 3 Haiku | 200k | $0.50 | 113.2 | 0.96 | 5.38 | N/A | ||
Claude 3 Haiku | 200k | $0.50 | 138.7 | 0.46 | 4.06 | N/A | |||
![]() | ![]() Mistral Saba | 32k | $0.30 | 92.4 | 0.31 | 5.72 | N/A | ||
![]() DeepSeek Coder V2 Lite Fast, FP8 | 128k | $0.12 | 109.4 | 0.62 | 5.19 | N/A | |||
![]() DeepSeek Coder V2 Lite Base, FP8 | 128k | $0.06 | 103.4 | 0.60 | 5.44 | N/A | |||
![]() Sonar Reasoning | 127k | $2.00 | 83.3 | 1.49 | 31.51 | 24.01 | |||
Grok 3 mini Reasoning (low) | 131k | $0.35 | 104.9 | 0.25 | 24.08 | 19.07 | |||
Grok 3 mini Reasoning (low) Fast | 131k | $1.45 | 232.3 | 0.24 | 11.00 | 8.61 | |||
![]() | ![]() Solar Mini | 4k | $0.15 | 23.8 | 1.73 | 22.75 | N/A | ||
![]() | ![]() Reka Flash | 128k | $0.35 | 38.2 | 0.94 | 14.03 | N/A | ||
![]() | ![]() Reka Core | 128k | $2.00 | 27.7 | 0.95 | 19.01 | N/A | ||
![]() | ![]() Reka Flash (Feb '24) | 128k | $0.35 | 45.9 | 0.93 | 11.81 | N/A | ||
![]() | ![]() Reka Edge | 128k | $0.10 | 85.7 | 0.84 | 6.68 | N/A | ||
Qwen1.5 Chat 110B | 32k | $0.00 | 23.7 | 1.61 | 22.71 | N/A | |||
o1-preview | 128k | $26.25 | 159.9 | 20.32 | 23.44 | N/A | |||
![]() | o1-preview | 128k | $28.88 | 152.6 | 23.51 | 26.78 | N/A | ||
GPT-4 Turbo | 128k | $15.00 | 46.1 | 0.61 | 11.45 | N/A | |||
![]() | GPT-4 Turbo | 128k | $15.00 | 45.4 | 1.54 | 12.54 | N/A | ||
GPT-4 | 8k | $37.50 | 32.0 | 0.66 | 16.29 | N/A | |||
GPT-4.5 (Preview) | 128k | $93.75 | 78.2 | 0.95 | 7.34 | N/A | |||
Gemini 2.0 Flash-Lite (Preview) (AI Studio) | 1m | $0.13 | 215.1 | 0.28 | 2.60 | N/A | |||
![]() | Claude 3.5 Sonnet (June) | 200k | $6.00 | 50.5 | 0.88 | 10.78 | N/A | ||
Claude 3.5 Sonnet (June) Vertex | 200k | $6.00 | 80.1 | 0.87 | 7.11 | N/A | |||
Claude 3.5 Sonnet (June) | 200k | $6.00 | 79.6 | 0.80 | 7.08 | N/A | |||
Claude 2.0 | 100k | $12.00 | 31.1 | 0.86 | 16.93 | N/A | |||
![]() OpenChat 3.5 | 8k | $0.06 | 52.7 | 0.48 | 9.98 | N/A | |||
![]() Jamba Instruct | 256k | $0.55 | 175.5 | 0.33 | 3.18 | N/A |