LLM API Providers Leaderboard - Comparison of over 100 LLM endpoints
Comparison and ranking of API provider performance for over 100 AI LLM Model endpoints across performance key metrics including price, output speed, latency, context window & others. For more details including relating to our methodology, see our FAQs.
API providers compared: OpenAI, Playground AI, Microsoft Azure, Ideogram, Mistral, Amazon Bedrock, DeepSeek, Hyperbolic, Groq, Together.ai, FriendliAI, Anthropic, Black Forest Labs, Perplexity, Google, Fireworks, Lambda Labs, Leonardo.Ai, Cerebras, Recraft AI, Cohere, Upstage, Simplismart, Speechmatics, Deepinfra, Fish Audio, Replicate, , Genmo, Nebius, Adobe, MiniMax, CentML, Runpod, StepFun, Zyphra, Murf AI, Speechify, Rev AI, AssemblyAI, fal.ai, Rime, kluster.ai, Prodia, Hume AI, Reka AI, Deepgram, Gladia, Stability.ai, Baseten, Midjourney, Reve, Databricks, ElevenLabs, Vivago AI, IBM, SambaNova, Dreamina, Parasail, xAI, Cartesia, LMNT, PlayAI, 01.AI, Alibaba Cloud, Novita, AI21 Labs, and WaveSpeed.
Features | Model Intelligence | Price | Output tokens/s | Latency | End-to-End Response Time | ||||
---|---|---|---|---|---|---|---|---|---|
Further Analysis | |||||||||
o4-mini (high) | 200k | 70 | $1.93 | 150.8 | 46.59 | 49.90 | N/A | ||
![]() | o4-mini (high) | 200k | 70 | $1.93 | 110.7 | 60.61 | 65.13 | N/A | |
Gemini 2.5 Pro | 1m | 69 | $3.44 | 152.1 | 37.36 | 40.64 | N/A | ||
o3 | 128k | 67 | $17.50 | 179.7 | 17.06 | 19.85 | N/A | ||
![]() | o3 | 128k | 67 | $17.50 | 89.8 | 39.11 | 44.68 | N/A | |
Grok 3 mini Reasoning (high) | 131k | 67 | $0.35 | 75.6 | 0.40 | 33.48 | 26.47 | ||
Grok 3 mini Reasoning (high) Fast | 131k | 67 | $1.45 | 110.1 | 0.42 | 23.12 | 18.16 | ||
o3-mini (high) | 200k | 66 | $1.93 | 162.4 | 40.24 | 43.32 | N/A | ||
![]() | o3-mini (high) | 200k | 66 | $1.93 | 182.2 | 64.00 | 66.74 | N/A | |
Gemini 2.5 Flash (May '25) (Reasoning) (AI_Studio) | 1m | 65 | $0.99 | 333.7 | 15.43 | 16.93 | N/A | ||
o3-mini | 200k | 63 | $1.93 | 166.5 | 13.89 | 16.90 | N/A | ||
![]() | o3-mini | 200k | 63 | $1.93 | 192.2 | 25.14 | 27.75 | N/A | |
![]() | Qwen3 235B A22B (Reasoning) (FP8) | 41k | 62 | $0.35 | 60.2 | 0.44 | 41.99 | 33.23 | |
Qwen3 235B A22B (Reasoning) Base | 33k | 62 | $0.30 | 24.8 | 0.59 | 101.26 | 80.54 | ||
Qwen3 235B A22B (Reasoning) | 128k | 62 | $0.10 | 63.3 | 0.67 | 40.14 | 31.57 | ||
Qwen3 235B A22B (Reasoning) (FP8) | 41k | 62 | $0.30 | 26.4 | 0.62 | 95.21 | 75.67 | ||
![]() | Qwen3 235B A22B (Reasoning) (FP8) | 128k | 62 | $0.35 | 44.0 | 0.66 | 57.44 | 45.42 | |
Qwen3 235B A22B (Reasoning) (FP8) | 41k | 62 | $0.30 | 32.2 | 0.77 | 78.31 | 62.03 | ||
![]() | Qwen3 235B A22B (Reasoning) (FP8) | 41k | 62 | $0.61 | 35.7 | 0.76 | 70.84 | 56.06 | |
Qwen3 235B A22B (Reasoning) | 131k | 62 | $2.63 | 70.4 | 1.16 | 36.70 | 28.43 | ||
o1 | 200k | 62 | $26.25 | 131.4 | 21.75 | 25.56 | N/A | ||
![]() | o1 | 200k | 62 | $26.25 | 111.5 | 24.69 | 29.17 | N/A | |
Llama 3.1 Nemotron Ultra 253B Reasoning Base | 131k | 61 | $0.90 | 42.8 | 0.66 | 59.10 | 46.75 | ||
Gemini 2.5 Flash (April '25) (Reasoning) (AI_Studio) | 1m | 60 | $0.99 | 351.2 | 8.51 | 9.93 | N/A | ||
![]() DeepSeek R1 | 164k | 60 | $0.95 | 38.7 | 0.42 | 74.01 | 60.66 | ||
![]() | ![]() DeepSeek R1 | 64k | 60 | $0.96 | 24.5 | 3.54 | 119.96 | 95.97 | |
![]() DeepSeek R1 | 128k | 60 | $2.00 | 89.8 | 0.97 | 32.66 | 26.12 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $2.36 | 185.5 | 0.37 | 15.72 | 12.65 | |
![]() DeepSeek R1 Base | 128k | 60 | $1.20 | 29.4 | 0.66 | 97.47 | 79.81 | ||
![]() DeepSeek R1 Fast | 128k | 60 | $3.00 | 56.6 | 0.69 | 50.95 | 41.43 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $2.99 | 70.9 | 0.54 | 40.68 | 33.09 | |
![]() | ![]() DeepSeek R1 | 128k | 60 | $2.36 | 102.4 | 0.52 | 28.31 | 22.91 | |
![]() DeepSeek R1 (Fast) | 164k | 60 | $4.25 | 232.4 | 0.37 | 12.62 | 10.10 | ||
![]() DeepSeek R1 (Turbo, FP4) | 33k | 60 | $1.50 | 175.1 | 0.29 | 16.54 | 13.40 | ||
![]() DeepSeek R1 | 64k | 60 | $0.92 | 47.4 | 0.33 | 60.40 | 49.52 | ||
![]() DeepSeek R1 | 128k | 60 | $4.00 | 91.7 | 0.45 | 31.48 | 25.58 | ||
![]() | ![]() DeepSeek R1 Turbo | 64k | 60 | $1.15 | 29.8 | 0.85 | 96.27 | 78.66 | |
![]() | ![]() DeepSeek R1 | 64k | 60 | $4.00 | 30.7 | 0.73 | 93.42 | 76.41 | |
![]() | ![]() DeepSeek R1 | 16k | 60 | $5.50 | 196.4 | 1.84 | 16.34 | 11.95 | |
![]() DeepSeek R1 | 128k | 60 | $4.00 | 98.6 | 0.69 | 29.56 | 23.81 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $7.00 | 34.2 | 0.73 | 83.87 | 68.54 | |
![]() | Qwen3 32B (Reasoning) (FP8) | 41k | 59 | $0.20 | 49.9 | 0.46 | 50.56 | 40.08 | |
![]() | Qwen3 32B (Reasoning) | 41k | 59 | $0.50 | 2,431.0 | 0.30 | 1.33 | 0.82 | |
Qwen3 32B (Reasoning) Base | 33k | 59 | $0.15 | 43.0 | 0.58 | 58.74 | 46.52 | ||
Qwen3 32B (Reasoning) (FP8) | 41k | 59 | $0.15 | 50.2 | 0.53 | 50.34 | 39.85 | ||
![]() | Qwen3 32B (Reasoning) (FP8) | 128k | 59 | $0.19 | 27.2 | 0.89 | 92.63 | 73.40 | |
![]() | Qwen3 32B (Reasoning) | 8k | 59 | $0.50 | 333.5 | 0.45 | 7.94 | 6.00 | |
Qwen3 32B (Reasoning) | 131k | 59 | $2.63 | 64.7 | 1.08 | 39.74 | 30.93 | ||
QwQ-32B | 131k | 58 | $0.20 | 114.4 | 1.16 | 27.30 | 21.77 | ||
QwQ-32B Fast | 131k | 58 | $0.75 | 77.2 | 0.52 | 39.28 | 32.28 | ||
QwQ-32B Base | 131k | 58 | $0.23 | 53.8 | 0.59 | 56.18 | 46.30 | ||
![]() | QwQ-32B | 131k | 58 | $0.65 | 80.7 | 0.35 | 37.39 | 30.85 | |
QwQ-32B | 131k | 58 | $0.90 | 168.2 | 0.48 | 18.27 | 14.81 | ||
QwQ-32B | 131k | 58 | $0.16 | 43.1 | 0.55 | 70.02 | 57.86 | ||
![]() | QwQ-32B | 33k | 58 | $0.18 | 35.6 | 0.66 | 84.57 | 69.88 | |
QwQ-32B | 131k | 58 | $0.32 | 400.7 | 0.31 | 7.78 | 6.22 | ||
![]() | QwQ-32B | 16k | 58 | $0.63 | 389.0 | 0.41 | 8.10 | 6.40 | |
QwQ-32B | 131k | 58 | $1.20 | 96.3 | 0.42 | 31.47 | 25.86 | ||
![]() | Claude 3.7 Sonnet Thinking | 200k | 57 | $6.00 | 46.6 | 1.69 | 36.10 | 23.69 | |
Claude 3.7 Sonnet Thinking | 200k | 57 | $6.00 | 88.2 | 1.74 | 19.95 | 12.54 | ||
Qwen3 14B (Reasoning) Base | 33k | 56 | $0.12 | 88.2 | 0.53 | 28.86 | 22.66 | ||
Qwen3 14B (Reasoning) (FP8) | 128k | 56 | $0.12 | 67.9 | 0.22 | 37.06 | 29.47 | ||
![]() | Qwen3 14B (Reasoning) (FP8) | 128k | 56 | $0.12 | 55.3 | 0.71 | 45.89 | 36.14 | |
Qwen3 14B (Reasoning) | 131k | 56 | $1.31 | 63.5 | 1.04 | 40.41 | 31.50 | ||
![]() | Qwen3 30B A3B (Reasoning) (FP8) | 41k | 56 | $0.20 | 144.8 | 0.36 | 17.63 | 13.82 | |
Qwen3 30B A3B (Reasoning) Fast | 33k | 56 | $0.45 | 137.7 | 0.54 | 18.70 | 14.52 | ||
Qwen3 30B A3B (Reasoning) Base | 33k | 56 | $0.15 | 125.8 | 0.51 | 20.38 | 15.90 | ||
Qwen3 30B A3B (Reasoning) | 131k | 56 | $0.90 | 129.9 | 0.66 | 19.91 | 15.40 | ||
Qwen3 30B A3B (Reasoning) (FP8) | 41k | 56 | $0.15 | 110.7 | 0.53 | 23.12 | 18.07 | ||
![]() | Qwen3 30B A3B (Reasoning) (FP8) | 128k | 56 | $0.19 | 178.9 | 0.66 | 14.63 | 11.18 | |
Qwen3 30B A3B (Reasoning) | 131k | 56 | $0.75 | 92.1 | 1.02 | 28.16 | 21.72 | ||
o1-mini | 128k | 54 | $1.93 | 234.8 | 9.46 | 11.59 | N/A | ||
![]() | o1-mini | 128k | 54 | $1.93 | 265.0 | 8.71 | 10.59 | N/A | |
Gemini 2.5 Flash (May '25) (AI_Studio) | 1m | 53 | $0.26 | 266.5 | 0.38 | 2.26 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 64k | 53 | $0.48 | 24.3 | 3.90 | 24.48 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.25 | 26.9 | 1.25 | 19.83 | N/A | ||
![]() DeepSeek V3 (Mar' 25) Fast | 128k | 53 | $3.00 | 90.0 | 0.66 | 6.21 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $0.75 | 23.5 | 0.65 | 21.97 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $0.80 | 16.1 | 0.48 | 31.46 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $2.00 | 71.7 | 0.46 | 7.44 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 160k | 53 | $0.90 | 261.5 | 0.39 | 2.30 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $0.45 | 36.1 | 0.58 | 14.42 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $0.57 | 26.2 | 0.83 | 19.94 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 8k | 53 | $3.38 | 169.3 | 1.00 | 3.95 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.25 | 105.1 | 0.51 | 5.27 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $1.25 | 19.4 | 0.54 | 26.38 | N/A | |
GPT-4.1 mini | 1m | 53 | $0.70 | 68.7 | 0.43 | 7.70 | N/A | ||
![]() | GPT-4.1 mini | 1m | 53 | $0.70 | 169.7 | 0.58 | 3.53 | N/A | |
GPT-4.1 | 1m | 53 | $3.50 | 113.1 | 0.57 | 4.99 | N/A | ||
![]() | GPT-4.1 | 1m | 53 | $3.50 | 193.7 | 0.70 | 3.28 | N/A | |
![]() DeepSeek R1 Distill Qwen 32B | 128k | 52 | $0.14 | 45.1 | 0.25 | 55.72 | 44.37 | ||
![]() | ![]() DeepSeek R1 Distill Qwen 32B | 64k | 52 | $0.30 | 20.8 | 1.17 | 121.59 | 96.33 | |
![]() | Qwen3 8B (Reasoning) (FP8) | 128k | 51 | $0.06 | 40.0 | 0.89 | 63.31 | 49.94 | |
Qwen3 8B (Reasoning) | 131k | 51 | $0.66 | 94.2 | 1.00 | 27.54 | 21.23 | ||
Grok 3 | 131k | 51 | $6.00 | 61.2 | 0.52 | 8.69 | N/A | ||
Grok 3 Fast | 131k | 51 | $10.00 | 95.7 | 0.58 | 5.80 | N/A | ||
Llama 4 Maverick (FP8) | 1m | 51 | $0.28 | 162.8 | 0.29 | 3.36 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.35 | 183.5 | 0.39 | 3.12 | N/A | |
![]() | Llama 4 Maverick | 128k | 51 | $0.42 | 291.0 | 0.52 | 2.24 | N/A | |
Llama 4 Maverick Vertex | 524k | 51 | $0.55 | 126.6 | 0.34 | 4.28 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.20 | 126.6 | 0.25 | 4.20 | N/A | |
![]() | Llama 4 Maverick (FP8) | 128k | 51 | $0.61 | 56.3 | 0.33 | 9.21 | N/A | |
Llama 4 Maverick | 1m | 51 | $0.39 | 179.5 | 0.60 | 3.38 | N/A | ||
Llama 4 Maverick (FP8) | 131k | 51 | $0.27 | 109.8 | 0.24 | 4.79 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.34 | 84.6 | 0.55 | 6.45 | N/A | |
Llama 4 Maverick | 128k | 51 | $0.30 | 267.8 | 0.32 | 2.18 | N/A | ||
![]() | Llama 4 Maverick | 131k | 51 | $0.92 | 789.7 | 0.38 | 1.02 | N/A | |
Llama 4 Maverick (FP8) | 524k | 51 | $0.41 | 122.7 | 0.19 | 4.27 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.35 | 154.7 | 0.62 | 3.85 | N/A | |
GPT-4o (March 2025) | 128k | 50 | $7.50 | 177.1 | 0.48 | 3.30 | N/A | ||
Gemini 2.0 Pro Experimental (AI Studio) | 2m | 49 | $0.00 | 28.0 | 16.88 | 34.75 | N/A | ||
![]() | ![]() DeepSeek R1 Distill Qwen 14B | 64k | 49 | $0.15 | 44.0 | 0.80 | 57.58 | 45.43 | |
![]() DeepSeek R1 Distill Qwen 14B | 128k | 49 | $1.60 | 170.5 | 0.32 | 14.98 | 11.73 | ||
Gemini 2.5 Flash (AI_Studio) | 1m | 49 | $0.26 | 283.9 | 0.38 | 2.14 | N/A | ||
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.30 | 64.3 | 0.31 | 39.19 | 31.10 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 66k | 48 | $0.94 | 2,300.2 | 0.24 | 1.33 | 0.87 | |
![]() DeepSeek R1 Distill Llama 70B Base | 128k | 48 | $0.38 | 58.9 | 0.57 | 43.02 | 33.96 | ||
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.17 | 32.1 | 0.59 | 78.51 | 62.33 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 32k | 48 | $0.80 | 101.4 | 0.67 | 25.33 | 19.73 | |
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.81 | 425.3 | 0.17 | 6.05 | 4.70 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 16k | 48 | $0.88 | 322.8 | 1.58 | 9.32 | 6.20 | |
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $2.00 | 121.0 | 0.36 | 21.03 | 16.53 | ||
![]() | Claude 3.7 Sonnet | 200k | 48 | $6.00 | 42.3 | 1.56 | 13.39 | N/A | |
Claude 3.7 Sonnet | 200k | 48 | $6.00 | 75.2 | 2.04 | 8.69 | N/A | ||
Gemini 2.0 Flash Vertex | 1m | 48 | $0.26 | 253.7 | 0.26 | 2.23 | N/A | ||
Gemini 2.0 Flash (AI Studio) | 1m | 48 | $0.17 | 240.2 | 0.35 | 2.43 | N/A | ||
Qwen3 4B (Reasoning) Fast | 33k | 47 | $0.12 | 157.8 | 0.48 | 16.32 | 12.68 | ||
![]() | Qwen3 4B (Reasoning) (FP8) | 128k | 47 | $0.00 | 120.2 | 0.70 | 21.50 | 16.64 | |
Qwen3 4B (Reasoning) | 131k | 47 | $0.40 | 100.5 | 0.99 | 25.87 | 19.90 | ||
![]() | ![]() Reka Flash 3 | 128k | 47 | $0.35 | 57.1 | 0.94 | 44.69 | 35.00 | |
Qwen3 235B A22B | 131k | 47 | $1.23 | 71.1 | 1.22 | 8.25 | N/A | ||
Gemini 2.0 Flash (exp) (AI Studio) | 1m | 46 | $0.00 | 252.1 | 0.24 | 2.23 | N/A | ||
![]() DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $0.25 | 29.3 | 1.23 | 18.31 | N/A | ||
![]() DeepSeek V3 (Dec '24) | 128k | 46 | $0.75 | 22.9 | 0.66 | 22.45 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) | 128k | 46 | $2.00 | 74.4 | 0.47 | 7.19 | N/A | |
![]() DeepSeek V3 (Dec '24) | 128k | 46 | $1.31 | 66.6 | 0.71 | 8.22 | N/A | ||
![]() DeepSeek V3 (Dec '24) | 64k | 46 | $0.51 | 27.2 | 0.32 | 18.72 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) Turbo | 64k | 46 | $0.63 | 29.9 | 0.79 | 17.49 | N/A | |
![]() | ![]() DeepSeek V3 (Dec '24) | 64k | 46 | $0.89 | 29.7 | 0.85 | 17.71 | N/A | |
![]() DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $1.25 | 93.2 | 0.50 | 5.87 | N/A | ||
Qwen2.5 Max | 32k | 45 | $2.80 | 50.4 | 1.27 | 11.19 | N/A | ||
Gemini 1.5 Pro (Sep) (Vertex) | 2m | 45 | $2.19 | 90.1 | 0.81 | 6.36 | N/A | ||
Gemini 1.5 Pro (Sep) (AI Studio) | 2m | 45 | $2.19 | 90.4 | 4.68 | 10.21 | N/A | ||
![]() | Claude 3.5 Sonnet (Oct) | 200k | 44 | $6.00 | 38.4 | 1.36 | 14.39 | N/A | |
Claude 3.5 Sonnet (Oct) Vertex | 200k | 44 | $6.00 | 77.7 | 0.80 | 7.24 | N/A | ||
Claude 3.5 Sonnet (Oct) | 200k | 44 | $6.00 | 73.6 | 1.69 | 8.48 | N/A | ||
Qwen3 32B | 131k | 44 | $1.23 | 65.5 | 1.08 | 8.72 | N/A | ||
![]() Sonar | 127k | 43 | $1.00 | 128.1 | 2.09 | 5.99 | N/A | ||
Llama 4 Scout | 1m | 43 | $0.14 | 116.2 | 0.29 | 4.59 | N/A | ||
![]() | Llama 4 Scout (FP8) | 158k | 43 | $0.19 | 118.6 | 0.35 | 4.57 | N/A | |
![]() | Llama 4 Scout | 32k | 43 | $0.70 | 2,759.0 | 0.24 | 0.42 | N/A | |
![]() | Llama 4 Scout | 128k | 43 | $0.29 | 163.6 | 0.49 | 3.54 | N/A | |
Llama 4 Scout Vertex | 1m | 43 | $0.36 | 131.4 | 0.34 | 4.14 | N/A | ||
![]() | Llama 4 Scout | 1m | 43 | $0.10 | 114.1 | 0.27 | 4.65 | N/A | |
![]() | Llama 4 Scout | 128k | 43 | $0.34 | 34.8 | 0.34 | 14.73 | N/A | |
Llama 4 Scout | 1m | 43 | $0.26 | 162.9 | 4.31 | 7.38 | N/A | ||
Llama 4 Scout | 131k | 43 | $0.14 | 49.1 | 0.62 | 10.80 | N/A | ||
![]() | Llama 4 Scout | 131k | 43 | $0.20 | 57.8 | 1.14 | 9.79 | N/A | |
Llama 4 Scout | 131k | 43 | $0.17 | 576.7 | 0.32 | 1.19 | N/A | ||
![]() | Llama 4 Scout | 8k | 43 | $0.47 | 794.5 | 0.92 | 1.55 | N/A | |
Llama 4 Scout | 328k | 43 | $0.28 | 113.0 | 0.19 | 4.61 | N/A | ||
![]() | Llama 4 Scout | 128k | 43 | $0.71 | 91.7 | 0.66 | 6.11 | N/A | |
![]() Sonar Pro | 200k | 43 | $6.00 | 88.2 | 2.45 | 8.12 | N/A | ||
QwQ 32B-Preview | 33k | 43 | $0.14 | 45.8 | 0.26 | 54.90 | 43.72 | ||
QwQ 32B-Preview | 33k | 43 | $1.20 | 96.9 | 0.46 | 26.26 | 20.64 | ||
![]() | ![]() Nova Premier | 1m | 43 | $5.00 | 63.6 | 0.82 | 8.68 | N/A | |
Qwen3 30B A3B | 131k | 43 | $0.35 | 92.7 | 1.07 | 6.46 | N/A | ||
GPT-4o (Nov '24) | 128k | 41 | $4.38 | 113.0 | 0.55 | 4.98 | N/A | ||
![]() | GPT-4o (Nov '24) | 128k | 41 | $4.38 | 129.6 | 1.11 | 4.97 | N/A | |
Gemini 2.0 Flash-Lite (Feb '25) (AI Studio) | 1m | 41 | $0.13 | 211.1 | 0.26 | 2.63 | N/A | ||
Llama 3.3 70B (FP8) | 128k | 41 | $0.17 | 57.8 | 0.37 | 9.01 | N/A | ||
![]() | Llama 3.3 70B | 131k | 41 | $1.20 | 443.0 | 0.37 | 1.50 | N/A | |
![]() | Llama 3.3 70B (FP8) | 131k | 41 | $0.28 | 111.3 | 0.46 | 4.95 | N/A | |
![]() | Llama 3.3 70B | 33k | 41 | $0.94 | 2,565.2 | 0.24 | 0.44 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.40 | 30.8 | 1.20 | 17.43 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.71 | 258.2 | 0.53 | 2.47 | N/A | |
Llama 3.3 70B Fast | 128k | 41 | $0.38 | 134.9 | 0.59 | 4.30 | N/A | ||
Llama 3.3 70B Base | 128k | 41 | $0.20 | 40.3 | 0.63 | 13.04 | N/A | ||
Llama 3.3 70B Vertex | 128k | 41 | $0.72 | 72.5 | 0.27 | 7.17 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.35 | 149.8 | 0.39 | 3.72 | N/A | |
![]() | Llama 3.3 70B | 128k | 41 | $0.71 | 49.2 | 0.45 | 10.62 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.90 | 134.9 | 2.22 | 5.92 | N/A | ||
Llama 3.3 70B (Turbo, FP8) | 128k | 41 | $0.12 | 33.1 | 0.25 | 15.34 | N/A | ||
Llama 3.3 70B | 128k | 41 | $0.27 | 27.4 | 0.53 | 18.75 | N/A | ||
Llama 3.3 70B | 128k | 41 | $0.60 | 169.5 | 0.41 | 3.35 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.20 | 107.3 | 0.64 | 5.30 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.64 | 442.0 | 0.22 | 1.36 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.75 | 460.8 | 0.35 | 1.43 | N/A | |
Llama 3.3 70B Turbo | 128k | 41 | $0.88 | 127.8 | 0.41 | 4.32 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.70 | 30.3 | 0.37 | 16.88 | N/A | |
GPT-4.1 nano | 1m | 41 | $0.17 | 115.5 | 0.33 | 4.66 | N/A | ||
![]() | GPT-4.1 nano | 1m | 41 | $0.17 | 224.2 | 0.59 | 2.82 | N/A | |
Qwen3 14B | 131k | 41 | $0.61 | 64.1 | 1.16 | 8.96 | N/A | ||
GPT-4o (May '24) | 128k | 41 | $7.50 | 127.8 | 0.46 | 4.37 | N/A | ||
![]() | GPT-4o (May '24) | 128k | 41 | $7.50 | 124.8 | 0.72 | 4.73 | N/A | |
Llama 3.1 405B (FP8) | 128k | 40 | $0.80 | 33.0 | 0.33 | 15.47 | N/A | ||
![]() | Llama 3.1 405B | 131k | 40 | $7.00 | 176.6 | 1.67 | 4.50 | N/A | |
Llama 3.1 405B | 128k | 40 | $4.00 | 92.0 | 1.08 | 6.52 | N/A | ||
![]() | Llama 3.1 405B Standard | 128k | 40 | $2.40 | 31.3 | 1.82 | 17.79 | N/A | |
![]() | Llama 3.1 405B Latency Optimized | 128k | 40 | $3.00 | 90.9 | 0.45 | 5.95 | N/A | |
Llama 3.1 405B Base | 128k | 40 | $1.50 | 33.0 | 0.66 | 15.81 | N/A | ||
Llama 3.1 405B Vertex | 128k | 40 | $7.75 | 28.2 | 0.44 | 18.14 | N/A | ||
![]() | Llama 3.1 405B | 128k | 40 | $8.00 | 31.4 | 0.48 | 16.39 | N/A | |
Llama 3.1 405B | 128k | 40 | $3.00 | 98.0 | 0.97 | 6.08 | N/A | ||
Llama 3.1 405B | 33k | 40 | $0.80 | 22.9 | 0.75 | 22.60 | N/A | ||
![]() | Llama 3.1 405B | 16k | 40 | $6.25 | 171.8 | 1.63 | 4.54 | N/A | |
Llama 3.1 405B | 128k | 40 | $7.50 | 37.3 | 0.88 | 14.27 | N/A | ||
Llama 3.1 405B Turbo | 128k | 40 | $3.50 | 102.5 | 0.41 | 5.28 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.40 | 29.3 | 1.75 | 18.81 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.20 | 23.4 | 0.68 | 22.04 | N/A | ||
Qwen2.5 72B Fast | 131k | 40 | $0.38 | 70.5 | 0.54 | 7.63 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.90 | 72.0 | 0.44 | 7.38 | N/A | ||
Qwen2.5 72B | 33k | 40 | $0.19 | 33.6 | 0.30 | 15.19 | N/A | ||
Qwen2.5 72B Turbo | 131k | 40 | $1.20 | 104.4 | 0.54 | 5.33 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.00 | 43.1 | 1.05 | 12.63 | N/A | ||
![]() | ![]() MiniMax-Text-01 | 1m | 40 | $0.42 | 34.1 | 0.90 | 15.54 | N/A | |
Phi-4 | 16k | 40 | $0.15 | 118.5 | 0.49 | 4.71 | N/A | ||
![]() | Phi-4 | 16k | 40 | $0.22 | 37.5 | 0.44 | 13.76 | N/A | |
Phi-4 | 16k | 40 | $0.09 | 41.3 | 0.31 | 12.43 | N/A | ||
![]() Command A | 256k | 40 | $4.38 | 94.7 | 0.21 | 5.49 | N/A | ||
Gemini 1.5 Flash (Sep) (Vertex) | 1m | 39 | $0.13 | 188.7 | 0.18 | 2.83 | N/A | ||
Gemini 1.5 Flash (Sep) (AI Studio) | 1m | 39 | $0.13 | 193.8 | 0.31 | 2.89 | N/A | ||
![]() | ![]() Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 71.9 | 0.43 | 7.39 | N/A | |
![]() | ![]() Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 35.1 | 0.53 | 14.77 | N/A | |
![]() | Qwen3 1.7B (Reasoning) (FP8) | 32k | 38 | $0.00 | 48.7 | 0.67 | 52.04 | 41.10 | |
Qwen3 1.7B (Reasoning) | 33k | 38 | $0.40 | 129.9 | 0.97 | 20.21 | 15.40 | ||
![]() | Gemma 3 27B | 131k | 38 | $0.29 | 64.2 | 0.46 | 8.25 | N/A | |
Gemma 3 27B | 128k | 38 | $0.13 | 36.1 | 0.37 | 14.23 | N/A | ||
Grok Beta | 128k | 38 | $7.50 | 67.0 | 0.29 | 7.76 | N/A | ||
![]() | ![]() Pixtral Large | 128k | 37 | $3.00 | 77.4 | 0.47 | 6.93 | N/A | |
Qwen2.5 Instruct 32B Fast | 128k | 37 | $0.20 | 86.9 | 0.53 | 6.29 | N/A | ||
Qwen2.5 Instruct 32B Base | 128k | 37 | $0.10 | 60.2 | 0.55 | 8.86 | N/A | ||
Llama 3.1 Nemotron 70B (FP8) | 128k | 37 | $0.17 | 50.4 | 0.26 | 10.19 | N/A | ||
Llama 3.1 Nemotron 70B Base | 128k | 37 | $0.20 | 37.6 | 0.62 | 13.93 | N/A | ||
Llama 3.1 Nemotron 70B Fast | 128k | 37 | $0.38 | 71.9 | 0.53 | 7.48 | N/A | ||
Llama 3.1 Nemotron 70B | 128k | 37 | $0.17 | 26.3 | 0.57 | 19.56 | N/A | ||
![]() | ![]() Nova Pro | 300k | 37 | $1.40 | 170.2 | 0.37 | 3.30 | N/A | |
Qwen3 8B | 131k | 37 | $0.31 | 95.1 | 1.04 | 6.30 | N/A | ||
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 37.6 | 0.44 | 13.74 | N/A | |
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 32.8 | 0.44 | 15.66 | N/A | |
Qwen2.5 Coder 32B | 33k | 36 | $0.09 | 43.3 | 0.33 | 11.88 | N/A | ||
Qwen2.5 Coder 32B | 131k | 36 | $0.20 | 47.0 | 0.99 | 11.63 | N/A | ||
Qwen2.5 Coder 32B | 33k | 36 | $0.08 | 47.4 | 0.52 | 11.07 | N/A | ||
Qwen2.5 Coder 32B | 131k | 36 | $0.80 | 79.9 | 0.52 | 6.78 | N/A | ||
GPT-4o mini | 128k | 36 | $0.26 | 66.4 | 0.49 | 8.02 | N/A | ||
![]() | GPT-4o mini | 128k | 36 | $0.26 | 152.2 | 0.85 | 4.14 | N/A | |
Llama 3.1 70B (FP8) | 128k | 35 | $0.17 | 49.3 | 0.27 | 10.42 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.40 | 159.3 | 0.87 | 4.01 | N/A | ||
![]() | Llama 3.1 70B Standard | 128k | 35 | $0.72 | 31.6 | 0.63 | 16.45 | N/A | |
![]() | Llama 3.1 70B Latency Optimized | 128k | 35 | $0.90 | 142.0 | 0.32 | 3.84 | N/A | |
Llama 3.1 70B Base | 128k | 35 | $0.20 | 30.9 | 0.64 | 16.81 | N/A | ||
Llama 3.1 70B Fast | 128k | 35 | $0.38 | 142.0 | 0.53 | 4.05 | N/A | ||
Llama 3.1 70B Vertex | 128k | 35 | $0.72 | 73.0 | 0.27 | 7.12 | N/A | ||
![]() | Llama 3.1 70B | 128k | 35 | $2.90 | 52.3 | 0.45 | 10.02 | N/A | |
Llama 3.1 70B | 128k | 35 | $0.90 | 158.8 | 0.39 | 3.54 | N/A | ||
Llama 3.1 70B (Turbo, FP8) | 128k | 35 | $0.14 | 38.4 | 0.24 | 13.27 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.27 | 40.3 | 0.28 | 12.67 | N/A | ||
![]() | Llama 3.1 70B | 32k | 35 | $0.19 | 13.8 | 1.38 | 37.57 | N/A | |
Llama 3.1 70B Turbo | 128k | 35 | $0.88 | 161.7 | 0.31 | 3.40 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.90 | 126.0 | 0.52 | 4.49 | N/A | ||
![]() | ![]() Mistral Small 3.1 | 128k | 35 | $0.15 | 113.9 | 0.29 | 4.68 | N/A | |
![]() | ![]() Mistral Small 3.1 | 128k | 35 | $0.15 | 81.7 | 0.37 | 6.49 | N/A | |
![]() Mistral Small 3.1 Vertex | 128k | 35 | $0.15 | 209.1 | 0.19 | 2.58 | N/A | ||
![]() | ![]() Mistral Small 3 | 32k | 35 | $0.15 | 160.8 | 0.29 | 3.40 | N/A | |
![]() Mistral Small 3 | 32k | 35 | $0.07 | 82.9 | 0.22 | 6.26 | N/A | ||
![]() Mistral Small 3 | 32k | 35 | $0.80 | 94.3 | 0.18 | 5.48 | N/A | ||
Qwen3 4B | 131k | 35 | $0.19 | 102.8 | 0.98 | 5.84 | N/A | ||
![]() | Claude 3 Opus | 200k | 35 | $30.00 | 25.4 | 1.22 | 20.91 | N/A | |
Claude 3 Opus Vertex | 200k | 35 | $30.00 | 21.9 | 2.44 | 25.27 | N/A | ||
Claude 3 Opus | 200k | 35 | $30.00 | 28.4 | 1.03 | 18.64 | N/A | ||
![]() | Claude 3.5 Haiku Standard | 200k | 35 | $1.60 | 56.8 | 1.22 | 10.03 | N/A | |
![]() | Claude 3.5 Haiku Latency Optimized | 200k | 35 | $2.00 | 96.1 | 0.50 | 5.70 | N/A | |
Claude 3.5 Haiku Vertex | 200k | 35 | $1.60 | 65.6 | 1.55 | 9.17 | N/A | ||
Claude 3.5 Haiku | 200k | 35 | $1.60 | 64.0 | 1.00 | 8.81 | N/A | ||
![]() | ![]() DeepSeek R1 Distill Llama 8B | 32k | 34 | $0.04 | 43.3 | 0.72 | 58.41 | 46.15 | |
Gemma 3 12B | 128k | 34 | $0.06 | 25.2 | 0.62 | 20.49 | N/A | ||
Gemini 1.5 Pro (May) (Vertex) | 2m | 34 | $2.19 | 65.8 | 0.35 | 7.95 | N/A | ||
Gemini 1.5 Pro (May) (AI Studio) | 2m | 34 | $2.19 | 69.0 | 0.42 | 7.67 | N/A | ||
Qwen2.5 Turbo | 1m | 34 | $0.09 | 108.8 | 1.09 | 5.69 | N/A | ||
![]() | Llama 3.2 90B (Vision) | 128k | 33 | $0.72 | 60.7 | 0.51 | 8.74 | N/A | |
Llama 3.2 90B (Vision) Vertex | 128k | 33 | $0.00 | 32.4 | 0.20 | 15.62 | N/A | ||
Llama 3.2 90B (Vision) | 33k | 33 | $0.36 | 39.0 | 0.53 | 13.34 | N/A | ||
Llama 3.2 90B (Vision) Turbo | 128k | 33 | $1.20 | 27.9 | 0.25 | 18.18 | N/A | ||
Qwen2 72B | 33k | 33 | $0.90 | 39.5 | 0.54 | 13.19 | N/A | ||
Qwen2 72B | 131k | 33 | $0.00 | 31.0 | 1.30 | 17.44 | N/A | ||
![]() | ![]() Nova Lite | 300k | 33 | $0.10 | 284.3 | 0.31 | 2.07 | N/A | |
Gemini 1.5 Flash-8B AI Studio | 1m | 31 | $0.07 | 277.4 | 0.20 | 2.01 | N/A | ||
![]() | ![]() Jamba 1.5 Large | 256k | 29 | $3.50 | 51.2 | 0.72 | 10.49 | N/A | |
![]() Jamba 1.6 Large | 256k | 29 | $3.50 | 62.6 | 0.62 | 8.61 | N/A | ||
Gemini 1.5 Flash (May) (Vertex) | 1m | 28 | $0.13 | 328.8 | 0.25 | 1.77 | N/A | ||
Gemini 1.5 Flash (May) (AI Studio) | 1m | 28 | $0.13 | 324.1 | 0.23 | 1.77 | N/A | ||
![]() | ![]() Nova Micro | 130k | 28 | $0.06 | 315.1 | 0.29 | 1.88 | N/A | |
![]() Yi-Large | 32k | 28 | $3.00 | 68.0 | 0.38 | 7.73 | N/A | ||
![]() | Claude 3 Sonnet | 200k | 28 | $6.00 | 43.6 | 0.79 | 12.26 | N/A | |
Claude 3 Sonnet | 200k | 28 | $6.00 | 59.8 | 0.72 | 9.08 | N/A | ||
![]() | ![]() Codestral (Jan '25) | 256k | 28 | $0.45 | 93.0 | 0.37 | 5.75 | N/A | |
![]() Codestral (Jan '25) Vertex | 128k | 28 | $0.45 | 150.0 | 0.15 | 3.48 | N/A | ||
Llama 3 70B | 8k | 27 | $0.40 | 19.0 | 1.42 | 27.70 | N/A | ||
![]() | Llama 3 70B | 8k | 27 | $2.86 | 47.1 | 0.39 | 11.00 | N/A | |
![]() | Llama 3 70B | 8k | 27 | $2.90 | 18.9 | 0.77 | 27.17 | N/A | |
Llama 3 70B | 8k | 27 | $0.33 | 33.3 | 0.49 | 15.52 | N/A | ||
![]() | Llama 3 70B | 8k | 27 | $0.57 | 18.4 | 1.09 | 28.23 | N/A | |
Llama 3 70B | 8k | 27 | $0.64 | 336.3 | 0.26 | 1.75 | N/A | ||
Llama 3 70B (Reference, FP16) | 8k | 27 | $0.88 | 102.3 | 0.76 | 5.65 | N/A | ||
Llama 3 70B (Turbo, FP8) | 8k | 27 | $0.88 | 84.9 | 0.46 | 6.35 | N/A | ||
![]() | ![]() Mistral Small (Sep '24) | 33k | 27 | $0.30 | 86.4 | 0.30 | 6.09 | N/A | |
![]() | Phi-4 Multimodal | 128k | 27 | $0.00 | 21.4 | 0.35 | 23.69 | N/A | |
Qwen2.5 Coder 7B Fast | 131k | 27 | $0.04 | 228.8 | 0.46 | 2.65 | N/A | ||
Qwen2.5 Coder 7B Base | 131k | 27 | $0.01 | 182.1 | 0.46 | 3.21 | N/A | ||
![]() | ![]() Mistral Large (Feb '24) | 33k | 26 | $6.00 | 30.0 | 0.46 | 17.15 | N/A | |
![]() | ![]() Mistral Large (Feb '24) | 33k | 26 | $6.00 | 45.4 | 0.38 | 11.39 | N/A | |
![]() | ![]() Mixtral 8x22B | 65k | 26 | $3.00 | 72.3 | 0.32 | 7.23 | N/A | |
![]() Mixtral 8x22B Base | 65k | 26 | $0.60 | 77.0 | 0.54 | 7.03 | N/A | ||
![]() Mixtral 8x22B Fast | 65k | 26 | $1.05 | 104.2 | 0.52 | 5.31 | N/A | ||
![]() Mixtral 8x22B | 65k | 26 | $1.20 | 75.1 | 0.36 | 7.02 | N/A | ||
![]() Mixtral 8x22B | 65k | 26 | $1.20 | 83.5 | 0.30 | 6.28 | N/A | ||
![]() | Phi-4 Mini | 128k | 26 | $0.12 | 222.5 | 0.27 | 2.52 | N/A | |
![]() | Phi-4 Mini | 128k | 26 | $0.00 | 58.2 | 0.34 | 8.93 | N/A | |
Qwen3 1.7B | 33k | 25 | $0.19 | 134.3 | 0.93 | 4.65 | N/A | ||
![]() | Phi-3 Medium 14B | 128k | 25 | $0.30 | 52.4 | 0.44 | 9.97 | N/A | |
Gemma 3 4B | 128k | 24 | $0.03 | 90.1 | 0.23 | 5.78 | N/A | ||
![]() | Claude 2.1 | 200k | 24 | $12.00 | 29.1 | 1.81 | 19.01 | N/A | |
Claude 2.1 | 200k | 24 | $12.00 | 13.8 | 0.94 | 37.10 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.03 | 137.2 | 0.25 | 3.90 | N/A | ||
![]() | Llama 3.1 8B | 131k | 24 | $0.20 | 1,126.0 | 0.33 | 0.77 | N/A | |
![]() | Llama 3.1 8B | 33k | 24 | $0.10 | 2,170.2 | 0.27 | 0.50 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.10 | 448.1 | 0.78 | 1.90 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.22 | 91.3 | 0.34 | 5.82 | N/A | |
Llama 3.1 8B Fast | 128k | 24 | $0.04 | 182.7 | 0.50 | 3.23 | N/A | ||
Llama 3.1 8B Base | 128k | 24 | $0.03 | 58.9 | 0.54 | 9.02 | N/A | ||
Llama 3.1 8B Vertex | 128k | 24 | $0.00 | 121.9 | 0.18 | 4.28 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.38 | 225.9 | 0.29 | 2.51 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.20 | 283.6 | 0.26 | 2.02 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.04 | 53.7 | 0.49 | 9.81 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.10 | 449.5 | 0.26 | 1.37 | N/A | ||
![]() | Llama 3.1 8B | 16k | 24 | $0.03 | 72.6 | 0.70 | 7.59 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.06 | 840.8 | 0.20 | 0.79 | N/A | ||
![]() | Llama 3.1 8B | 16k | 24 | $0.13 | 1,044.2 | 0.29 | 0.77 | N/A | |
Llama 3.1 8B Turbo | 128k | 24 | $0.18 | 171.1 | 0.24 | 3.17 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.15 | 454.6 | 0.15 | 1.25 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.18 | 61.3 | 0.44 | 8.60 | N/A | |
![]() | ![]() Pixtral 12B | 128k | 23 | $0.15 | 86.0 | 0.28 | 6.10 | N/A | |
![]() Pixtral 12B | 128k | 23 | $0.10 | 79.0 | 0.61 | 6.94 | N/A | ||
Qwen3 0.6B (Reasoning) | 33k | 23 | $0.40 | 210.0 | 0.94 | 12.85 | 9.53 | ||
![]() | ![]() Mistral Small (Feb '24) | 33k | 23 | $1.50 | 128.8 | 0.27 | 4.15 | N/A | |
![]() | ![]() Mistral Small (Feb '24) | 33k | 23 | $1.50 | 87.6 | 0.39 | 6.10 | N/A | |
![]() | ![]() Mistral Medium | 33k | 23 | $4.09 | 56.7 | 0.41 | 9.22 | N/A | |
![]() | ![]() Ministral 8B | 128k | 22 | $0.10 | 131.3 | 0.28 | 4.09 | N/A | |
Gemma 2 9B Fast | 8k | 22 | $0.04 | 168.2 | 0.49 | 3.46 | N/A | ||
Gemma 2 9B Base | 8k | 22 | $0.03 | 152.3 | 0.49 | 3.77 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.04 | 28.8 | 0.56 | 17.93 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.20 | 721.7 | 0.22 | 0.91 | N/A | ||
![]() LFM 40B | 32k | 22 | $0.15 | 162.4 | 0.19 | 3.27 | N/A | ||
![]() | ![]() Command-R+ | 128k | 21 | $6.00 | 47.7 | 0.50 | 10.97 | N/A | |
![]() Command-R+ | 128k | 21 | $4.38 | 48.6 | 0.27 | 10.57 | N/A | ||
![]() | Llama 3 8B | 8k | 21 | $0.38 | 103.9 | 0.30 | 5.11 | N/A | |
![]() | Llama 3 8B | 8k | 21 | $0.38 | 73.8 | 0.37 | 7.15 | N/A | |
Llama 3 8B | 8k | 21 | $0.04 | 110.5 | 0.22 | 4.75 | N/A | ||
![]() | Llama 3 8B | 8k | 21 | $0.04 | 59.6 | 0.84 | 9.22 | N/A | |
Llama 3 8B | 8k | 21 | $0.06 | 1,348.2 | 0.35 | 0.72 | N/A | ||
Llama 3 8B | 8k | 21 | $0.20 | 192.2 | 0.33 | 2.93 | N/A | ||
![]() | ![]() Codestral (May '24) | 33k | 20 | $0.30 | 103.7 | 0.30 | 5.13 | N/A | |
![]() Aya Expanse 32B | 128k | 20 | $0.75 | 120.8 | 0.16 | 4.30 | N/A | ||
![]() | ![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 48.0 | 0.48 | 10.90 | N/A | |
![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 64.6 | 0.23 | 7.97 | N/A | ||
![]() | ![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 29.1 | 0.66 | 17.85 | N/A | |
![]() | ![]() Ministral 3B | 128k | 20 | $0.04 | 225.0 | 0.26 | 2.48 | N/A | |
![]() | ![]() Mistral NeMo | 128k | 20 | $0.15 | 139.3 | 0.29 | 3.88 | N/A | |
![]() | ![]() Mistral NeMo (FP8) | 131k | 20 | $0.11 | 97.2 | 0.53 | 5.68 | N/A | |
![]() Mistral NeMo Fast | 128k | 20 | $0.12 | 157.6 | 0.51 | 3.68 | N/A | ||
![]() Mistral NeMo Base | 128k | 20 | $0.06 | 39.8 | 0.60 | 13.17 | N/A | ||
![]() Mistral NeMo | 128k | 20 | $0.04 | 60.2 | 0.24 | 8.54 | N/A | ||
Llama 3.2 3B (FP8) | 128k | 20 | $0.02 | 224.3 | 0.21 | 2.44 | N/A | ||
Llama 3.2 3B | 128k | 20 | $0.10 | 102.1 | 0.98 | 5.88 | N/A | ||
![]() | Llama 3.2 3B | 128k | 20 | $0.15 | 71.7 | 0.47 | 7.44 | N/A | |
Llama 3.2 3B Base | 128k | 20 | $0.01 | 127.9 | 0.48 | 4.39 | N/A | ||
Llama 3.2 3B | 128k | 20 | $0.01 | 109.4 | 0.18 | 4.75 | N/A | ||
![]() | Llama 3.2 3B | 32k | 20 | $0.04 | 107.3 | 0.59 | 5.25 | N/A | |
![]() | Llama 3.2 3B | 8k | 20 | $0.10 | 1,592.9 | 0.25 | 0.56 | N/A | |
Llama 3.2 3B Turbo | 128k | 20 | $0.06 | 164.4 | 0.28 | 3.32 | N/A | ||
![]() DeepSeek R1 Distill Qwen 1.5B | 128k | 19 | $0.18 | 387.7 | 0.23 | 6.68 | 5.16 | ||
![]() | ![]() Jamba 1.5 Mini | 256k | 18 | $0.25 | 82.5 | 0.49 | 6.55 | N/A | |
![]() Jamba 1.6 Mini | 256k | 18 | $0.25 | 181.4 | 0.34 | 3.09 | N/A | ||
![]() | ![]() Mixtral 8x7B | 33k | 17 | $0.70 | 89.0 | 0.35 | 5.97 | N/A | |
![]() | ![]() Mixtral 8x7B | 33k | 17 | $0.51 | 69.9 | 0.32 | 7.47 | N/A | |
![]() Mixtral 8x7B Fast | 33k | 17 | $0.23 | 125.7 | 0.49 | 4.47 | N/A | ||
![]() Mixtral 8x7B Base | 33k | 17 | $0.12 | 103.6 | 0.50 | 5.33 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.12 | 94.3 | 0.51 | 5.81 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.60 | 72.9 | 0.41 | 7.27 | N/A | ||
Qwen3 0.6B | 33k | 17 | $0.19 | 216.0 | 0.94 | 3.26 | N/A | ||
![]() Aya Expanse 8B | 8k | 16 | $0.75 | 167.5 | 0.15 | 3.13 | N/A | ||
![]() | ![]() Command-R | 128k | 15 | $0.75 | 109.3 | 0.33 | 4.90 | N/A | |
![]() Command-R | 128k | 15 | $0.26 | 68.6 | 0.20 | 7.49 | N/A | ||
![]() | ![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 109.8 | 0.33 | 4.88 | N/A | |
![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 164.1 | 0.16 | 3.21 | N/A | ||
![]() | ![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 46.6 | 0.51 | 11.24 | N/A | |
![]() | ![]() Codestral-Mamba | 256k | 14 | $0.25 | 95.1 | 0.48 | 5.73 | N/A | |
![]() | ![]() Mistral 7B | 8k | 10 | $0.25 | 104.1 | 0.27 | 5.08 | N/A | |
![]() | ![]() Mistral 7B | 8k | 10 | $0.16 | 92.1 | 0.31 | 5.73 | N/A | |
![]() Mistral 7B | 8k | 10 | $0.04 | 97.7 | 0.21 | 5.33 | N/A | ||
![]() | ![]() Mistral 7B | 32k | 10 | $0.04 | 122.2 | 0.81 | 4.90 | N/A | |
![]() Mistral 7B | 8k | 10 | $0.20 | 172.5 | 0.15 | 3.05 | N/A | ||
![]() | Llama 3.2 1B | 128k | 10 | $0.10 | 117.1 | 0.44 | 4.71 | N/A | |
Llama 3.2 1B Base | 128k | 10 | $0.01 | 58.3 | 0.51 | 9.09 | N/A | ||
Llama 3.2 1B | 128k | 10 | $0.01 | 117.4 | 0.31 | 4.56 | N/A | ||
![]() | Llama 3.2 1B | 16k | 10 | $0.05 | 2,586.2 | 0.18 | 0.38 | N/A | |
![]() | Llama 3.2 11B (Vision) | 128k | $0.16 | 143.3 | 0.47 | 3.96 | N/A | ||
Llama 3.2 11B (Vision) | 128k | $0.05 | 53.1 | 0.23 | 9.64 | N/A | |||
Llama 3.2 11B (Vision) Turbo | 128k | $0.18 | 117.7 | 0.14 | 4.39 | N/A | |||
![]() | ![]() Mistral Saba | 32k | $0.30 | 92.9 | 0.30 | 5.68 | N/A | ||
![]() Sonar Reasoning | 127k | $2.00 | 85.9 | 1.73 | 30.83 | 23.28 | |||
Grok 3 mini Reasoning (low) | 131k | $0.35 | 123.2 | 0.38 | 20.68 | 16.24 | |||
Grok 3 mini Reasoning (low) Fast | 131k | $1.45 | 201.7 | 0.41 | 12.80 | 9.91 | |||
![]() | ![]() Reka Flash | 128k | $0.35 | 46.2 | 0.90 | 11.72 | N/A | ||
![]() | ![]() Reka Core | 128k | $2.00 | 27.5 | 0.86 | 19.05 | N/A | ||
![]() | ![]() Reka Flash (Feb '24) | 128k | $0.35 | 46.0 | 0.89 | 11.76 | N/A | ||
![]() | ![]() Reka Edge | 128k | $0.10 | 85.9 | 0.80 | 6.62 | N/A | ||
o1-preview | 128k | $26.25 | 167.3 | 18.08 | 21.07 | N/A | |||
![]() | o1-preview | 128k | $28.88 | 164.9 | 19.17 | 22.20 | N/A | ||
GPT-4o (Aug '24) | 128k | $4.38 | 123.3 | 0.45 | 4.50 | N/A | |||
![]() | GPT-4o (Aug '24) | 128k | $4.38 | 131.2 | 0.58 | 4.39 | N/A | ||
GPT-4 Turbo | 128k | $15.00 | 36.4 | 0.71 | 14.47 | N/A | |||
![]() | GPT-4 Turbo | 128k | $15.00 | 44.5 | 1.52 | 12.77 | N/A | ||
GPT-3.5 Turbo | 4k | $0.75 | 116.3 | 0.38 | 4.68 | N/A | |||
GPT-4 | 8k | $37.50 | 27.8 | 0.76 | 18.76 | N/A | |||
GPT-4.5 (Preview) | 128k | $93.75 | 68.9 | 0.98 | 8.24 | N/A | |||
Gemini 2.0 Flash-Lite (Preview) (AI Studio) | 1m | $0.13 | 213.2 | 0.27 | 2.62 | N/A | |||
Gemma 2 27B Fast | 8k | $0.26 | 87.4 | 0.54 | 6.26 | N/A | |||
Gemma 2 27B Base | 8k | $0.15 | 49.7 | 0.57 | 10.64 | N/A | |||
Gemma 2 27B | 8k | $0.80 | 85.6 | 0.59 | 6.43 | N/A | |||
![]() | Claude 3.5 Sonnet (June) | 200k | $6.00 | 44.9 | 0.86 | 11.98 | N/A | ||
Claude 3.5 Sonnet (June) Vertex | 200k | $6.00 | 78.5 | 0.83 | 7.20 | N/A | |||
Claude 3.5 Sonnet (June) | 200k | $6.00 | 77.7 | 0.70 | 7.13 | N/A | |||
![]() | Claude 3 Haiku | 200k | $0.50 | 103.9 | 0.99 | 5.81 | N/A | ||
Claude 3 Haiku | 200k | $0.50 | 140.7 | 0.41 | 3.96 | N/A | |||
![]() | Claude Instant | 100k | $1.20 | 58.6 | 0.55 | 9.08 | N/A | ||
Claude 2.0 | 100k | $12.00 | 30.7 | 0.91 | 17.19 | N/A | |||
![]() DeepSeek Coder V2 Lite Fast, FP8 | 128k | $0.12 | 96.3 | 0.51 | 5.69 | N/A | |||
![]() DeepSeek Coder V2 Lite Base, FP8 | 128k | $0.06 | 103.6 | 0.52 | 5.35 | N/A | |||
![]() OpenChat 3.5 | 8k | $0.05 | 54.8 | 0.45 | 9.58 | N/A | |||
![]() | ![]() Solar Mini | 4k | $0.15 | 86.4 | 1.03 | 6.82 | N/A | ||
Qwen1.5 Chat 110B | 32k | $0.00 | 23.7 | 1.61 | 22.75 | N/A |