LLM API Providers Leaderboard - Comparison of over 100 LLM endpoints
Comparison and ranking of API provider performance across more than 100 LLM endpoints on key metrics including price, output speed, latency, context window, and others. For more details, including our methodology, see our FAQs.
API providers compared: OpenAI, Playground AI, Microsoft Azure, Ideogram, Mistral, Hyperbolic, Amazon Bedrock, DeepSeek, Groq, Together.ai, FriendliAI, Anthropic, Black Forest Labs, Perplexity, Google, Lambda Labs, Fireworks, Leonardo.Ai, Cerebras, Recraft AI, Cohere, Upstage, Simplismart, Speechmatics, Deepinfra, Fish Audio, Replicate, Genmo, Nebius, Adobe, MiniMax, CentML, Runpod, StepFun, Zyphra, Murf AI, Rev AI, Speechify, fal.ai, AssemblyAI, Rime, kluster.ai, Prodia, Hume AI, Reka AI, Deepgram, Gladia, Stability.ai, Baseten, Midjourney, Reve, Databricks, Snowflake, ElevenLabs, Vivago AI, IBM, SambaNova, Dreamina, Parasail, xAI, Cartesia, LMNT, PlayAI, 01.AI, Alibaba Cloud, Novita, AI21 Labs, and WaveSpeed.
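The latency, output speed, and end-to-end response time figures in the table below appear to be linked by simple arithmetic. The following is a minimal sketch of that relationship, not the site's stated methodology: the 500-token answer length is an assumption inferred from the published rows, the final numeric column is assumed to hold reasoning time in seconds, and the function and parameter names are hypothetical.

```python
# Assumption: the end-to-end figure corresponds to a 500-token answer.
# This value is inferred from the table rows, not stated on the page.
ASSUMED_OUTPUT_TOKENS = 500


def end_to_end_seconds(latency_s, tokens_per_s, reasoning_s=0.0,
                       output_tokens=ASSUMED_OUTPUT_TOKENS):
    """Estimate total response time in seconds.

    latency_s    -- time until the first token arrives
    tokens_per_s -- output speed once streaming has started
    reasoning_s  -- separately reported reasoning time (0 when listed as N/A)
    """
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return latency_s + reasoning_s + output_tokens / tokens_per_s


# Values copied from two rows of the table.
gpt_41_mini = end_to_end_seconds(latency_s=0.64, tokens_per_s=76.3)
grok_3_mini = end_to_end_seconds(latency_s=0.31, tokens_per_s=50.5,
                                 reasoning_s=39.60)

print(round(gpt_41_mini, 2))  # table lists 7.19
print(round(grok_3_mini, 2))  # table lists 49.81
```

Where reasoning time is listed as N/A for a reasoning model (for example the o-series rows), the latency figure is large, which suggests the reasoning time is already folded into latency for those endpoints.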
Model | Context Window | Model Intelligence | Price (USD per 1M tokens) | Output tokens/s | Latency (s) | End-to-End Response Time (s) | Reasoning Time (s)
---|---|---|---|---|---|---|---
o4-mini (high) | 200k | 70 | $1.93 | 147.6 | 39.12 | 42.51 | N/A
o4-mini (high) | 200k | 70 | $1.93 | 113.0 | 46.11 | 50.53 | N/A
Gemini 2.5 Pro | 1m | 69 | $3.44 | 153.5 | 32.50 | 35.76 | N/A
o3 | 128k | 67 | $17.50 | 212.1 | 12.46 | 14.82 | N/A
o3 | 128k | 67 | $17.50 | 87.9 | 34.77 | 40.45 | N/A
Grok 3 mini Reasoning (high) | 131k | 67 | $0.35 | 50.5 | 0.31 | 49.81 | 39.60
Grok 3 mini Reasoning (high) Fast | 131k | 67 | $1.45 | 149.2 | 0.38 | 17.13 | 13.41
o3-mini (high) | 200k | 66 | $1.93 | 173.2 | 41.91 | 44.79 | N/A
o3-mini (high) | 200k | 66 | $1.93 | 199.2 | 46.27 | 48.78 | N/A
Gemini 2.5 Flash (May '25) (Reasoning) (AI_Studio) | 1m | 65 | $0.99 | 359.7 | 15.13 | 16.52 | N/A
Gemini 2.5 Flash (May '25) (Reasoning) (Vertex) | 1m | 65 | $0.99 | 321.1 | 16.11 | 17.67 | N/A
o3-mini | 200k | 63 | $1.93 | 152.8 | 15.12 | 18.39 | N/A
o3-mini | 200k | 63 | $1.93 | 186.7 | 13.17 | 15.85 | N/A
Qwen3 235B (Reasoning) (FP8) | 41k | 62 | $0.35 | 64.3 | 0.45 | 39.32 | 31.09
Qwen3 235B (Reasoning) Base | 33k | 62 | $0.30 | 25.9 | 0.58 | 96.98 | 77.12
Qwen3 235B (Reasoning) | 128k | 62 | $0.10 | 93.2 | 0.50 | 27.32 | 21.46
Qwen3 235B (Reasoning) (FP8) | 41k | 62 | $0.30 | 27.2 | 0.60 | 92.47 | 73.50
Qwen3 235B (Reasoning) (FP8) | 128k | 62 | $0.35 | 28.0 | 0.83 | 89.97 | 71.31
Qwen3 235B (Reasoning) (FP8) | 41k | 62 | $0.30 | 28.0 | 0.39 | 89.63 | 71.40
Qwen3 235B (Reasoning) (FP8) | 41k | 62 | $0.61 | 36.9 | 0.87 | 68.56 | 54.15
Qwen3 235B (Reasoning) | 131k | 62 | $2.63 | 58.4 | 1.20 | 44.00 | 34.24
o1 | 200k | 62 | $26.25 | 130.8 | 20.22 | 24.04 | N/A
o1 | 200k | 62 | $26.25 | 107.5 | 26.75 | 31.40 | N/A
Llama 3.1 Nemotron Ultra 253B Reasoning Base | 131k | 61 | $0.90 | 42.2 | 0.64 | 59.92 | 47.42
Gemini 2.5 Flash (April '25) (Reasoning) (AI_Studio) | 1m | 60 | $0.99 | 400.0 | 7.60 | 8.85 | N/A
DeepSeek R1 | 164k | 60 | $0.95 | 39.5 | 0.36 | 72.44 | 59.42
DeepSeek R1 | 64k | 60 | $0.96 | 24.8 | 4.03 | 118.76 | 94.58
DeepSeek R1 | 128k | 60 | $2.00 | 221.3 | 0.85 | 13.72 | 10.60
DeepSeek R1 Base | 128k | 60 | $1.20 | 29.1 | 0.66 | 98.39 | 80.57
DeepSeek R1 Fast | 128k | 60 | $3.00 | 82.3 | 0.69 | 35.29 | 28.53
DeepSeek R1 | 128k | 60 | $2.99 | 76.0 | 0.49 | 37.98 | 30.90
DeepSeek R1 | 128k | 60 | $2.36 | 74.9 | 0.51 | 38.54 | 31.35
DeepSeek R1 (Fast) | 164k | 60 | $4.25 | 240.5 | 0.37 | 12.21 | 9.76
DeepSeek R1 (Turbo, FP4) | 33k | 60 | $1.50 | 185.1 | 0.22 | 15.60 | 12.68
DeepSeek R1 | 64k | 60 | $0.92 | 61.9 | 0.34 | 46.30 | 37.89
DeepSeek R1 | 128k | 60 | $4.00 | 90.5 | 0.48 | 31.95 | 25.95
DeepSeek R1 Turbo | 64k | 60 | $1.15 | 31.1 | 0.89 | 92.43 | 75.47
DeepSeek R1 | 64k | 60 | $4.00 | 30.2 | 1.02 | 95.34 | 77.76
DeepSeek R1 | 16k | 60 | $5.50 | 199.3 | 1.97 | 16.25 | 11.78
DeepSeek R1 | 128k | 60 | $4.00 | 103.8 | 0.73 | 28.16 | 22.61
DeepSeek R1 | 128k | 60 | $7.00 | 34.5 | 0.72 | 83.17 | 67.96
Qwen3 32B (Reasoning) (FP8) | 41k | 59 | $0.20 | 53.7 | 0.45 | 47.03 | 37.26
Qwen3 32B (Reasoning) | 41k | 59 | $0.50 | 2,502.3 | 0.26 | 1.26 | 0.80
Qwen3 32B (Reasoning) Base | 33k | 59 | $0.15 | 45.8 | 0.59 | 55.15 | 43.65
Qwen3 32B (Reasoning) (FP8) | 41k | 59 | $0.15 | 47.5 | 0.41 | 53.03 | 42.09
Qwen3 32B (Reasoning) (FP8) | 128k | 59 | $0.19 | 35.1 | 0.94 | 72.17 | 56.99
Qwen3 32B (Reasoning) | 8k | 59 | $0.50 | 332.9 | 0.42 | 7.93 | 6.01
Qwen3 32B (Reasoning) | 131k | 59 | $2.63 | 63.7 | 1.18 | 40.40 | 31.38
QwQ-32B | 131k | 58 | $0.20 | 124.4 | 1.06 | 25.09 | 20.02
QwQ-32B Base | 131k | 58 | $0.23 | 44.5 | 0.81 | 67.99 | 55.95
QwQ-32B | 131k | 58 | $0.65 | 82.1 | 0.34 | 36.75 | 30.32
QwQ-32B | 131k | 58 | $0.90 | 180.8 | 0.45 | 17.00 | 13.78
QwQ-32B | 131k | 58 | $0.16 | 54.2 | 0.24 | 55.44 | 45.97
QwQ-32B | 33k | 58 | $0.18 | 37.1 | 0.67 | 81.22 | 67.09
QwQ-32B | 131k | 58 | $0.32 | 404.2 | 0.26 | 7.66 | 6.16
QwQ-32B | 16k | 58 | $0.63 | 398.8 | 0.41 | 7.91 | 6.25
QwQ-32B | 131k | 58 | $1.20 | 98.0 | 0.43 | 30.95 | 25.41
Claude 4 Opus | 200k | 58 | $30.00 | 17.5 | 2.60 | 31.15 | N/A
Claude 4 Opus | 200k | 58 | $30.00 | 56.6 | 4.53 | 13.35 | N/A
Claude 3.7 Sonnet Thinking | 200k | 57 | $6.00 | 50.4 | 1.40 | 33.27 | 21.94
Claude 3.7 Sonnet Thinking | 200k | 57 | $6.00 | 86.5 | 2.45 | 21.00 | 12.77
Qwen3 14B (Reasoning) Base | 33k | 56 | $0.12 | 87.5 | 0.56 | 29.13 | 22.86
Qwen3 14B (Reasoning) (FP8) | 128k | 56 | $0.12 | 67.4 | 0.55 | 37.65 | 29.68
Qwen3 14B (Reasoning) (FP8) | 128k | 56 | $0.12 | 56.2 | 53.36 | 97.86 | 35.60
Qwen3 14B (Reasoning) | 131k | 56 | $1.31 | 63.4 | 1.08 | 40.49 | 31.53
Qwen3 30B A3B (Reasoning) (FP8) | 41k | 56 | $0.20 | 157.7 | 0.40 | 16.25 | 12.68
Qwen3 30B A3B (Reasoning) Fast | 33k | 56 | $0.45 | 131.2 | 0.52 | 19.57 | 15.24
Qwen3 30B A3B (Reasoning) Base | 33k | 56 | $0.15 | 125.8 | 0.52 | 20.39 | 15.89
Qwen3 30B A3B (Reasoning) | 131k | 56 | $0.90 | 162.7 | 0.60 | 15.97 | 12.29
Qwen3 30B A3B (Reasoning) (FP8) | 41k | 56 | $0.15 | 109.5 | 0.20 | 23.02 | 18.26
Qwen3 30B A3B (Reasoning) (FP8) | 128k | 56 | $0.19 | 180.2 | 0.62 | 14.50 | 11.10
Qwen3 30B A3B (Reasoning) | 131k | 56 | $0.75 | 92.5 | 1.05 | 28.08 | 21.62
o1-mini | 128k | 54 | $1.93 | 242.3 | 8.22 | 10.28 | N/A
o1-mini | 128k | 54 | $1.93 | 258.2 | 8.64 | 10.58 | N/A
Gemini 2.5 Flash (May '25) (AI_Studio) | 1m | 53 | $0.26 | 284.0 | 0.27 | 2.03 | N/A
Gemini 2.5 Flash (May '25) (Vertex) | 1m | 53 | $0.26 | 259.1 | 0.27 | 2.20 | N/A
DeepSeek V3 (Mar' 25) | 64k | 53 | $0.48 | 25.1 | 3.50 | 23.45 | N/A
DeepSeek V3 (Mar' 25) | 128k | 53 | $1.25 | 31.2 | 1.34 | 17.36 | N/A
DeepSeek V3 (Mar' 25) Fast | 128k | 53 | $3.00 | 91.9 | 0.65 | 6.08 | N/A
DeepSeek V3 (Mar' 25) | 128k | 53 | $0.75 | 20.3 | 0.65 | 25.23 | N/A
DeepSeek V3 (Mar' 25) | 164k | 53 | $0.80 | 33.6 | 0.47 | 15.33 | N/A
DeepSeek V3 (Mar' 25) | 128k | 53 | $2.00 | 74.2 | 0.53 | 7.27 | N/A
DeepSeek V3 (Mar' 25) | 160k | 53 | $0.90 | 260.5 | 0.41 | 2.33 | N/A
DeepSeek V3 (Mar' 25) | 164k | 53 | $0.45 | 23.8 | 0.42 | 21.46 | N/A
DeepSeek V3 (Mar' 25) | 128k | 53 | $0.57 | 30.9 | 0.83 | 16.98 | N/A
DeepSeek V3 (Mar' 25) | 8k | 53 | $3.38 | 168.8 | 1.82 | 4.78 | N/A
DeepSeek V3 (Mar' 25) | 128k | 53 | $1.25 | 121.5 | 0.52 | 4.63 | N/A
DeepSeek V3 (Mar' 25) | 164k | 53 | $1.25 | 21.6 | 0.78 | 23.92 | N/A
Claude 4 Sonnet | 200k | 53 | $6.00 | 54.7 | 1.23 | 10.37 | N/A
Claude 4 Sonnet | 200k | 53 | $6.00 | 82.1 | 1.39 | 7.48 | N/A
GPT-4.1 mini | 1m | 53 | $0.70 | 76.3 | 0.64 | 7.19 | N/A
GPT-4.1 mini | 1m | 53 | $0.70 | 186.3 | 0.59 | 3.28 | N/A
GPT-4.1 | 1m | 53 | $3.50 | 102.8 | 0.55 | 5.41 | N/A
GPT-4.1 | 1m | 53 | $3.50 | 209.5 | 0.71 | 3.10 | N/A
DeepSeek R1 Distill Qwen 32B | 128k | 52 | $0.14 | 49.4 | 0.22 | 50.82 | 40.48
DeepSeek R1 Distill Qwen 32B | 64k | 52 | $0.30 | 20.8 | 1.18 | 121.29 | 96.09
Qwen3 8B (Reasoning) (FP8) | 128k | 51 | $0.06 | 53.8 | 0.67 | 47.16 | 37.19
Qwen3 8B (Reasoning) | 131k | 51 | $0.66 | 94.4 | 1.03 | 27.51 | 21.19
Grok 3 | 131k | 51 | $6.00 | 79.0 | 0.43 | 6.76 | N/A
Grok 3 Fast | 131k | 51 | $10.00 | 84.3 | 0.45 | 6.38 | N/A
Llama 4 Maverick (FP8) | 1m | 51 | $0.28 | 162.7 | 0.24 | 3.31 | N/A
Llama 4 Maverick (FP8) | 1m | 51 | $0.35 | 190.4 | 0.38 | 3.00 | N/A
Llama 4 Maverick Vertex | 524k | 51 | $0.55 | 124.4 | 0.32 | 4.34 | N/A
Llama 4 Maverick (FP8) | 1m | 51 | $0.20 | 129.0 | 0.26 | 4.14 | N/A
Llama 4 Maverick (FP8) | 128k | 51 | $0.61 | 56.1 | 0.33 | 9.24 | N/A
Llama 4 Maverick | 1m | 51 | $0.39 | 177.7 | 0.64 | 3.45 | N/A
Llama 4 Maverick (FP8) | 131k | 51 | $0.27 | 119.5 | 0.22 | 4.40 | N/A
Llama 4 Maverick (FP8) | 1m | 51 | $0.34 | 86.2 | 0.57 | 6.37 | N/A
Llama 4 Maverick | 128k | 51 | $0.30 | 593.2 | 0.10 | 0.94 | N/A
Llama 4 Maverick | 131k | 51 | $0.92 | 791.4 | 0.40 | 1.03 | N/A
Llama 4 Maverick (FP8) | 524k | 51 | $0.41 | 123.7 | 0.32 | 4.36 | N/A
Llama 4 Maverick (FP8) | 1m | 51 | $0.35 | 154.4 | 0.60 | 3.84 | N/A
GPT-4o (March 2025) | 128k | 50 | $7.50 | 175.7 | 0.52 | 3.36 | N/A
Gemini 2.0 Pro Experimental (AI Studio) | 2m | 49 | $0.00 | 74.4 | 16.69 | 23.40 | N/A
DeepSeek R1 Distill Qwen 14B | 64k | 49 | $0.15 | 45.2 | 0.85 | 56.14 | 44.23
DeepSeek R1 Distill Qwen 14B | 128k | 49 | $1.60 | 170.1 | 0.35 | 15.05 | 11.76
Mistral Medium 3 | 128k | 49 | $0.80 | 65.6 | 0.49 | 8.11 | N/A
Mistral Medium 3 | 128k | 49 | $0.80 | 57.5 | 0.80 | 9.50 | N/A
Gemini 2.5 Flash (AI_Studio) | 1m | 49 | $0.26 | 314.2 | 0.31 | 1.90 | N/A
DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.30 | 66.7 | 0.26 | 37.76 | 30.00
DeepSeek R1 Distill Llama 70B | 66k | 48 | $0.94 | 2,274.4 | 0.29 | 1.39 | 0.88
DeepSeek R1 Distill Llama 70B Base | 128k | 48 | $0.38 | 59.8 | 0.59 | 42.41 | 33.46
DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.17 | 33.1 | 0.56 | 75.98 | 60.34
DeepSeek R1 Distill Llama 70B | 32k | 48 | $0.80 | 33.5 | 0.53 | 75.13 | 59.68
DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.81 | 421.4 | 0.16 | 6.10 | 4.75
DeepSeek R1 Distill Llama 70B | 16k | 48 | $0.88 | 303.3 | 1.52 | 9.77 | 6.59
DeepSeek R1 Distill Llama 70B | 128k | 48 | $2.00 | 125.2 | 0.38 | 20.34 | 15.97
Claude 3.7 Sonnet | 200k | 48 | $6.00 | 77.0 | 1.70 | 8.20 | N/A
Gemini 2.0 Flash Vertex | 1m | 48 | $0.26 | 239.3 | 0.26 | 2.35 | N/A
Gemini 2.0 Flash (AI Studio) | 1m | 48 | $0.17 | 232.9 | 0.32 | 2.47 | N/A
Qwen3 4B (Reasoning) Fast | 33k | 47 | $0.12 | 159.7 | 0.48 | 16.13 | 12.52
Qwen3 4B (Reasoning) (FP8) | 128k | 47 | $0.00 | 60.5 | 0.63 | 41.93 | 33.03
Qwen3 4B (Reasoning) | 131k | 47 | $0.40 | 100.3 | 0.97 | 25.90 | 19.95
Reka Flash 3 | 128k | 47 | $0.35 | 56.7 | 0.95 | 45.01 | 35.24
Qwen3 235B | 131k | 47 | $1.23 | 57.8 | 1.19 | 9.83 | N/A
Gemini 2.0 Flash (exp) (AI Studio) | 1m | 46 | $0.00 | 240.2 | 0.24 | 2.32 | N/A
DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $0.25 | 29.9 | 1.15 | 17.90 | N/A
DeepSeek V3 (Dec '24) | 128k | 46 | $0.75 | 23.0 | 0.63 | 22.33 | N/A
DeepSeek V3 (Dec '24) | 128k | 46 | $2.00 | 43.4 | 0.56 | 12.09 | N/A
DeepSeek V3 (Dec '24) | 128k | 46 | $1.31 | 86.0 | 0.70 | 6.51 | N/A
DeepSeek V3 (Dec '24) | 64k | 46 | $0.51 | 31.8 | 0.39 | 16.12 | N/A
DeepSeek V3 (Dec '24) Turbo | 64k | 46 | $0.63 | 29.2 | 0.81 | 17.93 | N/A
DeepSeek V3 (Dec '24) | 64k | 46 | $0.89 | 29.5 | 0.88 | 17.83 | N/A
DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $1.25 | 110.7 | 0.59 | 5.11 | N/A
Qwen2.5 Max | 32k | 45 | $2.80 | 40.6 | 1.34 | 13.64 | N/A
Gemini 1.5 Pro (Sep) (Vertex) | 2m | 45 | $2.19 | 93.0 | 0.51 | 5.89 | N/A
Gemini 1.5 Pro (Sep) (AI Studio) | 2m | 45 | $2.19 | 94.9 | 0.45 | 5.72 | N/A
Claude 3.5 Sonnet (Oct) Vertex | 200k | 44 | $6.00 | 83.6 | 1.28 | 7.26 | N/A
Claude 3.5 Sonnet (Oct) | 200k | 44 | $6.00 | 80.5 | 0.74 | 6.95 | N/A
Qwen3 32B | 131k | 44 | $1.23 | 65.0 | 1.08 | 8.78 | N/A
Sonar | 127k | 43 | $1.00 | 117.5 | 1.89 | 6.14 | N/A
Llama 4 Scout | 1m | 43 | $0.14 | 121.9 | 0.24 | 4.34 | N/A
Llama 4 Scout (FP8) | 158k | 43 | $0.19 | 120.9 | 0.38 | 4.52 | N/A
Llama 4 Scout | 32k | 43 | $0.70 | 2,775.3 | 0.24 | 0.42 | N/A
Llama 4 Scout Vertex | 1m | 43 | $0.36 | 131.6 | 0.35 | 4.15 | N/A
Llama 4 Scout | 1m | 43 | $0.10 | 117.4 | 0.24 | 4.50 | N/A
Llama 4 Scout | 128k | 43 | $0.34 | 33.2 | 0.35 | 15.41 | N/A
Llama 4 Scout | 1m | 43 | $0.26 | 162.7 | 0.62 | 3.69 | N/A
Llama 4 Scout | 131k | 43 | $0.14 | 43.0 | 0.49 | 12.10 | N/A
Llama 4 Scout | 131k | 43 | $0.20 | 58.3 | 0.80 | 9.37 | N/A
Llama 4 Scout | 131k | 43 | $0.17 | 619.4 | 0.18 | 0.98 | N/A
Llama 4 Scout | 8k | 43 | $0.47 | 791.0 | 1.69 | 2.32 | N/A
Llama 4 Scout | 328k | 43 | $0.28 | 122.7 | 0.19 | 4.26 | N/A
Llama 4 Scout | 128k | 43 | $0.71 | 99.4 | 0.67 | 5.70 | N/A
Sonar Pro | 200k | 43 | $6.00 | 82.4 | 2.59 | 8.66 | N/A
QwQ 32B-Preview | 33k | 43 | $0.14 | 49.5 | 0.25 | 50.79 | 40.43
QwQ 32B-Preview | 33k | 43 | $1.20 | 98.2 | 0.43 | 25.89 | 20.37
Qwen3 30B A3B | 131k | 43 | $0.35 | 92.2 | 1.01 | 6.43 | N/A
GPT-4o (Nov '24) | 128k | 41 | $4.38 | 144.1 | 0.52 | 3.99 | N/A
GPT-4o (Nov '24) | 128k | 41 | $4.38 | 134.1 | 1.20 | 4.93 | N/A
Gemini 2.0 Flash-Lite (Feb '25) (AI Studio) | 1m | 41 | $0.13 | 206.1 | 0.26 | 2.68 | N/A
Llama 3.3 70B (FP8) | 128k | 41 | $0.17 | 57.3 | 0.29 | 9.02 | N/A
Llama 3.3 70B | 131k | 41 | $1.20 | 451.7 | 0.42 | 1.52 | N/A
Llama 3.3 70B (FP8) | 131k | 41 | $0.28 | 115.6 | 0.44 | 4.76 | N/A
Llama 3.3 70B | 33k | 41 | $0.94 | 2,594.4 | 0.24 | 0.43 | N/A
Llama 3.3 70B | 128k | 41 | $0.40 | 30.0 | 1.17 | 17.86 | N/A
Llama 3.3 70B Fast | 128k | 41 | $0.38 | 141.6 | 0.53 | 4.06 | N/A
Llama 3.3 70B Base | 128k | 41 | $0.20 | 40.2 | 0.62 | 13.05 | N/A
Llama 3.3 70B Vertex | 128k | 41 | $0.72 | 72.5 | 0.28 | 7.18 | N/A
Llama 3.3 70B | 128k | 41 | $0.35 | 146.6 | 0.37 | 3.78 | N/A
Llama 3.3 70B | 128k | 41 | $0.71 | 58.7 | 0.43 | 8.96 | N/A
Llama 3.3 70B | 128k | 41 | $0.90 | 174.6 | 2.24 | 5.10 | N/A
Llama 3.3 70B (Turbo, FP8) | 128k | 41 | $0.12 | 31.5 | 0.24 | 16.14 | N/A
Llama 3.3 70B | 128k | 41 | $0.27 | 22.6 | 0.62 | 22.71 | N/A
Llama 3.3 70B | 128k | 41 | $0.60 | 194.4 | 0.39 | 2.97 | N/A
Llama 3.3 70B | 128k | 41 | $0.20 | 93.8 | 0.69 | 6.02 | N/A
Llama 3.3 70B | 128k | 41 | $0.64 | 441.9 | 0.22 | 1.35 | N/A
Llama 3.3 70B | 128k | 41 | $0.75 | 454.7 | 0.32 | 1.42 | N/A
Llama 3.3 70B Turbo | 128k | 41 | $0.88 | 146.5 | 0.36 | 3.77 | N/A
Llama 3.3 70B | 128k | 41 | $0.70 | 29.8 | 0.36 | 17.15 | N/A
GPT-4.1 nano | 1m | 41 | $0.17 | 152.8 | 0.33 | 3.60 | N/A
GPT-4.1 nano | 1m | 41 | $0.17 | 245.7 | 0.56 | 2.60 | N/A
Qwen3 14B | 131k | 41 | $0.61 | 63.9 | 1.06 | 8.88 | N/A
GPT-4o (May '24) | 128k | 41 | $7.50 | 114.7 | 0.58 | 4.94 | N/A
GPT-4o (May '24) | 128k | 41 | $7.50 | 141.4 | 0.66 | 4.20 | N/A
Llama 3.1 405B (FP8) | 128k | 40 | $0.80 | 32.8 | 0.32 | 15.57 | N/A
Llama 3.1 405B | 131k | 40 | $7.00 | 179.1 | 1.63 | 4.42 | N/A
Llama 3.1 405B | 128k | 40 | $4.00 | 95.1 | 1.14 | 6.40 | N/A
Llama 3.1 405B Base | 128k | 40 | $1.50 | 32.4 | 0.68 | 16.13 | N/A
Llama 3.1 405B Vertex | 128k | 40 | $7.75 | 28.9 | 0.42 | 17.71 | N/A
Llama 3.1 405B | 128k | 40 | $8.00 | 31.5 | 0.47 | 16.35 | N/A
Llama 3.1 405B | 128k | 40 | $3.00 | 38.9 | 0.88 | 13.75 | N/A
Llama 3.1 405B | 33k | 40 | $0.80 | 24.8 | 0.71 | 20.84 | N/A
Llama 3.1 405B | 16k | 40 | $6.25 | 179.7 | 1.61 | 4.40 | N/A
Llama 3.1 405B | 128k | 40 | $7.50 | 38.6 | 0.74 | 13.71 | N/A
Llama 3.1 405B Turbo | 128k | 40 | $3.50 | 100.2 | 0.49 | 5.48 | N/A
Qwen2.5 72B | 131k | 40 | $0.40 | 29.4 | 1.25 | 18.23 | N/A
Qwen2.5 72B | 131k | 40 | $0.20 | 31.6 | 0.67 | 16.50 | N/A
Qwen2.5 72B Fast | 131k | 40 | $0.38 | 68.3 | 0.56 | 7.88 | N/A
Qwen2.5 72B | 131k | 40 | $0.90 | 72.0 | 0.46 | 7.40 | N/A
Qwen2.5 72B | 33k | 40 | $0.19 | 34.3 | 0.60 | 15.19 | N/A
Qwen2.5 72B Turbo | 131k | 40 | $1.20 | 121.2 | 0.40 | 4.53 | N/A
Qwen2.5 72B | 131k | 40 | $0.00 | 58.0 | 1.23 | 9.85 | N/A
MiniMax-Text-01 | 1m | 40 | $0.42 | 31.7 | 0.91 | 16.70 | N/A
Phi-4 | 16k | 40 | $0.15 | 117.4 | 0.50 | 4.76 | N/A
Phi-4 | 16k | 40 | $0.22 | 40.9 | 0.45 | 12.68 | N/A
Phi-4 | 16k | 40 | $0.09 | 40.1 | 0.24 | 12.72 | N/A
Command A | 256k | 40 | $4.38 | 89.8 | 0.22 | 5.79 | N/A
Gemini 1.5 Flash (Sep) (Vertex) | 1m | 39 | $0.13 | 185.3 | 0.19 | 2.89 | N/A
Gemini 1.5 Flash (Sep) (AI Studio) | 1m | 39 | $0.13 | 191.5 | 0.29 | 2.90 | N/A
Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 61.2 | 0.45 | 8.61 | N/A
Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 31.7 | 0.53 | 16.31 | N/A
Qwen3 1.7B (Reasoning) (FP8) | 32k | 38 | $0.00 | 48.0 | 0.65 | 52.76 | 41.68
Qwen3 1.7B (Reasoning) | 33k | 38 | $0.40 | 130.1 | 0.95 | 20.17 | 15.37
Gemma 3 27B | 131k | 38 | $0.29 | 88.2 | 0.40 | 6.06 | N/A
Gemma 3 27B (AI_Studio) | 128k | 38 | $0.00 | 52.2 | 0.58 | 10.15 | N/A
Gemma 3 27B | 128k | 38 | $0.13 | 36.0 | 0.51 | 14.40 | N/A
Grok Beta | 128k | 38 | $7.50 | 67.6 | 0.31 | 7.71 | N/A
Pixtral Large | 128k | 37 | $3.00 | 66.6 | 0.39 | 7.90 | N/A
Qwen2.5 Instruct 32B Fast | 128k | 37 | $0.20 | 88.2 | 0.52 | 6.19 | N/A
Qwen2.5 Instruct 32B Base | 128k | 37 | $0.10 | 58.1 | 0.57 | 9.17 | N/A
Llama 3.1 Nemotron 70B (FP8) | 128k | 37 | $0.17 | 49.6 | 0.22 | 10.31 | N/A
Llama 3.1 Nemotron 70B Base | 128k | 37 | $0.20 | 39.3 | 0.64 | 13.37 | N/A
Llama 3.1 Nemotron 70B Fast | 128k | 37 | $0.38 | 74.3 | 0.53 | 7.26 | N/A
Llama 3.1 Nemotron 70B | 128k | 37 | $0.17 | 32.8 | 0.28 | 15.50 | N/A
Nova Pro | 300k | 37 | $1.40 | 167.0 | 0.36 | 3.35 | N/A
Qwen3 8B | 131k | 37 | $0.31 | 95.2 | 1.06 | 6.32 | N/A
Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 39.9 | 0.43 | 12.96 | N/A
Qwen2.5 Coder 32B | 33k | 36 | $0.09 | 43.4 | 0.31 | 11.82 | N/A
Qwen2.5 Coder 32B | 131k | 36 | $0.20 | 56.4 | 1.06 | 9.92 | N/A
Qwen2.5 Coder 32B | 33k | 36 | $0.08 | 50.1 | 0.24 | 10.22 | N/A
Qwen2.5 Coder 32B | 131k | 36 | $0.80 | 79.6 | 0.44 | 6.72 | N/A
GPT-4o mini | 128k | 36 | $0.26 | 72.9 | 0.47 | 7.32 | N/A
GPT-4o mini | 128k | 36 | $0.26 | 184.5 | 0.81 | 3.52 | N/A
Llama 3.1 70B (FP8) | 128k | 35 | $0.17 | 50.0 | 0.22 | 10.22 | N/A
Llama 3.1 70B | 128k | 35 | $0.40 | 134.3 | 0.95 | 4.67 | N/A
Llama 3.1 70B Base | 128k | 35 | $0.20 | 29.6 | 0.66 | 17.55 | N/A
Llama 3.1 70B Fast | 128k | 35 | $0.38 | 147.1 | 0.54 | 3.94 | N/A
Llama 3.1 70B Vertex | 128k | 35 | $0.72 | 72.5 | 0.27 | 7.17 | N/A
Llama 3.1 70B | 128k | 35 | $2.90 | 55.0 | 0.44 | 9.53 | N/A
Llama 3.1 70B | 128k | 35 | $0.90 | 166.4 | 0.37 | 3.38 | N/A
Llama 3.1 70B (Turbo, FP8) | 128k | 35 | $0.14 | 37.1 | 0.25 | 13.74 | N/A
Llama 3.1 70B | 128k | 35 | $0.27 | 41.2 | 0.22 | 12.35 | N/A
Llama 3.1 70B | 32k | 35 | $0.19 | 52.3 | 1.20 | 10.77 | N/A
Llama 3.1 70B Turbo | 128k | 35 | $0.88 | 152.0 | 0.37 | 3.66 | N/A
Llama 3.1 70B | 128k | 35 | $0.90 | 127.5 | 0.51 | 4.43 | N/A
Mistral Small 3.1 | 128k | 35 | $0.15 | 110.6 | 0.30 | 4.82 | N/A
Mistral Small 3.1 | 128k | 35 | $0.15 | 75.8 | 0.40 | 6.99 | N/A
Mistral Small 3.1 Vertex | 128k | 35 | $0.15 | 210.5 | 0.19 | 2.56 | N/A
Mistral Small 3 | 32k | 35 | $0.15 | 150.5 | 0.27 | 3.59 | N/A
Mistral Small 3 | 32k | 35 | $0.07 | 73.9 | 0.23 | 7.00 | N/A
Mistral Small 3 | 32k | 35 | $0.80 | 96.4 | 0.20 | 5.39 | N/A
Qwen3 4B | 131k | 35 | $0.19 | 102.6 | 1.04 | 5.91 | N/A
Claude 3 Opus Vertex | 200k | 35 | $30.00 | 21.9 | 2.84 | 25.63 | N/A
Claude 3 Opus | 200k | 35 | $30.00 | 28.3 | 1.01 | 18.71 | N/A
Claude 3.5 Haiku Vertex | 200k | 35 | $1.60 | 66.4 | 1.83 | 9.37 | N/A
Claude 3.5 Haiku | 200k | 35 | $1.60 | 64.9 | 0.79 | 8.49 | N/A
Devstral | 256k | 34 | $0.15 | 128.4 | 0.30 | 4.20 | N/A
DeepSeek R1 Distill Llama 8B | 32k | 34 | $0.04 | 41.5 | 0.68 | 60.92 | 48.19
Gemma 3 12B | 128k | 34 | $0.06 | 27.6 | 0.65 | 18.77 | N/A
Gemini 1.5 Pro (May) (Vertex) | 2m | 34 | $2.19 | 66.8 | 0.35 | 7.84 | N/A
Gemini 1.5 Pro (May) (AI Studio) | 2m | 34 | $2.19 | 67.3 | 0.41 | 7.84 | N/A
Qwen2.5 Turbo | 1m | 34 | $0.09 | 108.5 | 1.03 | 5.64 | N/A
Llama 3.2 90B (Vision) Vertex | 128k | 33 | $0.00 | 32.8 | 0.20 | 15.46 | N/A
Llama 3.2 90B (Vision) | 33k | 33 | $0.36 | 40.1 | 0.26 | 12.74 | N/A
Llama 3.2 90B (Vision) Turbo | 128k | 33 | $1.20 | 32.9 | 0.37 | 15.58 | N/A
Qwen2 72B | 33k | 33 | $0.90 | 39.5 | 0.52 | 13.19 | N/A
Qwen2 72B | 131k | 33 | $0.00 | 30.9 | 1.31 | 17.47 | N/A
Gemini 1.5 Flash-8B AI Studio | 1m | 31 | $0.07 | 282.2 | 0.20 | 1.97 | N/A
Jamba 1.5 Large | 256k | 29 | $3.50 | 50.8 | 0.69 | 10.54 | N/A
Jamba 1.6 Large | 256k | 29 | $3.50 | 64.3 | 0.58 | 8.36 | N/A
Gemini 1.5 Flash (May) (Vertex) | 1m | 28 | $0.13 | 321.4 | 0.26 | 1.82 | N/A
Gemini 1.5 Flash (May) (AI Studio) | 1m | 28 | $0.13 | 322.9 | 0.19 | 1.74 | N/A
Yi-Large | 32k | 28 | $3.00 | 66.8 | 0.37 | 7.85 | N/A
Claude 3 Sonnet | 200k | 28 | $6.00 | 60.8 | 0.84 | 9.06 | N/A
Codestral (Jan '25) | 256k | 28 | $0.45 | 110.0 | 0.30 | 4.85 | N/A
Codestral (Jan '25) Vertex | 128k | 28 | $0.45 | 150.1 | 0.15 | 3.48 | N/A
Llama 3 70B | 8k | 27 | $0.40 | 17.2 | 1.57 | 30.67 | N/A
Llama 3 70B | 8k | 27 | $2.90 | 18.9 | 0.77 | 27.16 | N/A
Llama 3 70B | 8k | 27 | $0.33 | 33.5 | 0.46 | 15.40 | N/A
Llama 3 70B | 8k | 27 | $0.57 | 16.0 | 1.18 | 32.51 | N/A
Llama 3 70B | 8k | 27 | $0.64 | 332.6 | 0.25 | 1.75 | N/A
Llama 3 70B (Reference, FP16) | 8k | 27 | $0.88 | 132.0 | 0.68 | 4.47 | N/A
Llama 3 70B (Turbo, FP8) | 8k | 27 | $0.88 | 134.9 | 0.40 | 4.11 | N/A
Mistral Small (Sep '24) | 33k | 27 | $0.30 | 89.4 | 0.32 | 5.91 | N/A
Phi-4 Multimodal | 128k | 27 | $0.00 | 20.5 | 0.35 | 24.78 | N/A
Qwen2.5 Coder 7B Fast | 131k | 27 | $0.04 | 227.8 | 0.49 | 2.68 | N/A
Qwen2.5 Coder 7B Base | 131k | 27 | $0.01 | 183.9 | 0.48 | 3.20 | N/A
Mistral Large (Feb '24) | 33k | 26 | $6.00 | 30.0 | 0.47 | 17.13 | N/A
Mixtral 8x22B | 65k | 26 | $3.00 | 61.2 | 0.32 | 8.48 | N/A
Mixtral 8x22B Base | 65k | 26 | $0.60 | 75.2 | 0.56 | 7.21 | N/A
Mixtral 8x22B Fast | 65k | 26 | $1.05 | 107.3 | 0.53 | 5.19 | N/A
Mixtral 8x22B | 65k | 26 | $1.20 | 91.2 | 0.30 | 5.79 | N/A
Phi-4 Mini | 128k | 26 | $0.12 | 209.9 | 0.29 | 2.67 | N/A
Phi-4 Mini | 128k | 26 | $0.00 | 58.5 | 0.34 | 8.88 | N/A
Qwen3 1.7B | 33k | 25 | $0.19 | 134.2 | 1.04 | 4.77 | N/A
Phi-3 Medium 14B | 128k | 25 | $0.30 | 53.2 | 0.43 | 9.83 | N/A
Gemma 3 4B | 128k | 24 | $0.03 | 83.0 | 0.25 | 6.27 | N/A
Claude 2.1 | 200k | 24 | $12.00 | 13.8 | 0.91 | 37.09 | N/A
Llama 3.1 8B | 128k | 24 | $0.03 | 142.6 | 0.20 | 3.71 | N/A
Llama 3.1 8B | 131k | 24 | $0.20 | 1,133.6 | 0.29 | 0.73 | N/A
Llama 3.1 8B | 33k | 24 | $0.10 | 2,163.9 | 0.22 | 0.45 | N/A
Llama 3.1 8B | 128k | 24 | $0.10 | 435.7 | 0.78 | 1.93 | N/A
Llama 3.1 8B Fast | 128k | 24 | $0.04 | 177.7 | 0.49 | 3.31 | N/A
Llama 3.1 8B Base | 128k | 24 | $0.03 | 62.9 | 0.54 | 8.49 | N/A
Llama 3.1 8B Vertex | 128k | 24 | $0.00 | 119.4 | 0.18 | 4.37 | N/A
Llama 3.1 8B | 128k | 24 | $0.38 | 211.7 | 0.30 | 2.66 | N/A
Llama 3.1 8B | 128k | 24 | $0.20 | 301.0 | 0.24 | 1.91 | N/A
Llama 3.1 8B | 128k | 24 | $0.04 | 56.0 | 0.24 | 9.16 | N/A
Llama 3.1 8B | 128k | 24 | $0.10 | 458.2 | 0.27 | 1.36 | N/A
Llama 3.1 8B | 16k | 24 | $0.03 | 75.1 | 0.66 | 7.31 | N/A
Llama 3.1 8B | 128k | 24 | $0.06 | 847.6 | 0.19 | 0.78 | N/A
Llama 3.1 8B | 16k | 24 | $0.13 | 1,177.4 | 0.22 | 0.65 | N/A
Llama 3.1 8B Turbo | 128k | 24 | $0.18 | 161.5 | 0.42 | 3.51 | N/A
Llama 3.1 8B | 128k | 24 | $0.15 | 469.0 | 0.17 | 1.24 | N/A
Llama 3.1 8B | 128k | 24 | $0.18 | 61.4 | 0.44 | 8.58 | N/A
Pixtral 12B | 128k | 23 | $0.15 | 87.0 | 0.28 | 6.03 | N/A
Pixtral 12B | 128k | 23 | $0.10 | 77.6 | 0.58 | 7.03 | N/A
Qwen3 0.6B (Reasoning) (FP8) | 32k | 23 | $0.00 | 46.2 | 0.68 | 54.84 | 43.32
Qwen3 0.6B (Reasoning) | 33k | 23 | $0.40 | 210.5 | 0.99 | 12.87 | 9.50
Mistral Small (Feb '24) | 33k | 23 | $1.50 | 124.8 | 0.27 | 4.27 | N/A
Mistral Small (Feb '24) | 33k | 23 | $1.50 | 88.0 | 0.39 | 6.06 | N/A
Mistral Medium | 33k | 23 | $4.09 | 82.7 | 0.39 | 6.44 | N/A
Ministral 8B | 128k | 22 | $0.10 | 134.7 | 0.27 | 3.99 | N/A
Gemma 2 9B Fast | 8k | 22 | $0.04 | 174.7 | 0.47 | 3.33 | N/A
Gemma 2 9B Base | 8k | 22 | $0.03 | 151.9 | 0.48 | 3.77 | N/A
Gemma 2 9B | 8k | 22 | $0.04 | 19.7 | 0.76 | 26.16 | N/A
Gemma 2 9B | 8k | 22 | $0.20 | 710.0 | 0.21 | 0.92 | N/A
LFM 40B | 32k | 22 | $0.15 | 161.3 | 0.15 | 3.25 | N/A
Command-R+ | 128k | 21 | $4.38 | 47.7 | 0.27 | 10.75 | N/A
Llama 3 8B | 8k | 21 | $0.38 | 73.8 | 0.37 | 7.15 | N/A
Llama 3 8B | 8k | 21 | $0.04 | 119.2 | 0.20 | 4.40 | N/A
Llama 3 8B | 8k | 21 | $0.04 | 59.2 | 0.76 | 9.21 | N/A
Llama 3 8B | 8k | 21 | $0.06 | 1,343.5 | 0.32 | 0.70 | N/A
Llama 3 8B | 8k | 21 | $0.20 | 190.1 | 0.31 | 2.94 | N/A
Codestral (May '24) | 33k | 20 | $0.30 | 114.5 | 0.31 | 4.68 | N/A
Aya Expanse 32B | 128k | 20 | $0.75 | 122.3 | 0.16 | 4.25 | N/A
Command-R+ (Apr '24) | 128k | 20 | $6.00 | 58.6 | 0.23 | 8.77 | N/A
Command-R+ (Apr '24) | 128k | 20 | $6.00 | 29.0 | 0.67 | 17.88 | N/A
Ministral 3B | 128k | 20 | $0.04 | 224.6 | 0.26 | 2.49 | N/A
Mistral NeMo | 128k | 20 | $0.15 | 139.3 | 0.29 | 3.88 | N/A
Mistral NeMo (FP8) | 131k | 20 | $0.11 | 96.9 | 0.52 | 5.68 | N/A
Mistral NeMo Fast | 128k | 20 | $0.12 | 155.7 | 0.50 | 3.71 | N/A
Mistral NeMo Base | 128k | 20 | $0.06 | 37.6 | 0.59 | 13.90 | N/A
Mistral NeMo | 128k | 20 | $0.04 | 59.3 | 0.22 | 8.65 | N/A
Llama 3.2 3B (FP8) | 128k | 20 | $0.02 | 224.8 | 0.20 | 2.42 | N/A
Llama 3.2 3B | 128k | 20 | $0.10 | 91.4 | 1.07 | 6.54 | N/A
Llama 3.2 3B Base | 128k | 20 | $0.01 | 126.3 | 0.48 | 4.44 | N/A
Llama 3.2 3B | 128k | 20 | $0.01 | 124.8 | 0.17 | 4.18 | N/A
Llama 3.2 3B | 32k | 20 | $0.04 | 94.1 | 0.56 | 5.88 | N/A
Llama 3.2 3B | 8k | 20 | $0.10 | 1,581.5 | 0.19 | 0.51 | N/A
Llama 3.2 3B Turbo | 128k | 20 | $0.06 | 165.1 | 0.31 | 3.34 | N/A
DeepSeek R1 Distill Qwen 1.5B | 128k | 19 | $0.18 | 388.2 | 0.24 | 6.68 | 5.15
Jamba 1.5 Mini | 256k | 18 | $0.25 | 82.7 | 0.49 | 6.54 | N/A
Jamba 1.6 Mini | 256k | 18 | $0.25 | 185.0 | 0.39 | 3.09 | N/A
Mixtral 8x7B | 33k | 17 | $0.70 | 88.5 | 0.29 | 5.94 | N/A
Mixtral 8x7B Fast | 33k | 17 | $0.23 | 127.9 | 0.50 | 4.41 | N/A
Mixtral 8x7B Base | 33k | 17 | $0.12 | 107.8 | 0.50 | 5.14 | N/A
Mixtral 8x7B | 33k | 17 | $0.12 | 91.4 | 0.50 | 5.97 | N/A
Mixtral 8x7B | 33k | 17 | $0.60 | 68.4 | 0.41 | 7.71 | N/A
Qwen3 0.6B | 33k | 17 | $0.19 | 214.9 | 0.99 | 3.32 | N/A
Aya Expanse 8B | 8k | 16 | $0.75 | 167.3 | 0.15 | 3.13 | N/A
Command-R | 128k | 15 | $0.26 | 66.4 | 0.20 | 7.73 | N/A
Command-R (Mar '24) | 128k | 15 | $0.75 | 163.4 | 0.17 | 3.23 | N/A
Command-R (Mar '24) | 128k | 15 | $0.75 | 46.5 | 0.51 | 11.26 | N/A
Codestral-Mamba | 256k | 14 | $0.25 | 94.1 | 0.42 | 5.74 | N/A
Mistral 7B | 8k | 10 | $0.25 | 106.4 | 0.28 | 4.98 | N/A
Mistral 7B | 8k | 10 | $0.04 | 87.7 | 0.16 | 5.86 | N/A
Mistral 7B | 32k | 10 | $0.04 | 126.8 | 0.82 | 4.76 | N/A
Mistral 7B | 8k | 10 | $0.20 | 178.4 | 0.17 | 2.97 | N/A
Llama 3.2 1B Base | 128k | 10 | $0.01 | 112.9 | 0.49 | 4.91 | N/A
Llama 3.2 1B | 128k | 10 | $0.01 | 57.3 | 0.28 | 9.00 | N/A
Llama 3.2 1B | 16k | 10 | $0.05 | 2,600.8 | 0.18 | 0.37 | N/A
Claude 4 Sonnet Thinking | 200k | | $6.00 | 43.2 | 1.19 | 59.02 | 46.26
Claude 4 Opus Thinking | 200k | | $30.00 | 13.0 | 3.11 | 196.04 | 154.35
Claude 4 Sonnet Thinking | 200k | | $6.00 | 81.3 | 1.20 | 31.95 | 24.60
Claude 4 Opus Thinking | 200k | | $30.00 | 57.8 | 5.06 | 48.30 | 34.60
Llama 3.2 11B (Vision) | 128k | | $0.05 | 57.2 | 0.22 | 8.96 | N/A
Llama 3.2 11B (Vision) Turbo | 128k | | $0.18 | 119.8 | 0.17 | 4.34 | N/A
Mistral Saba | 32k | | $0.30 | 94.8 | 0.29 | 5.57 | N/A
Sonar Reasoning | 127k | | $2.00 | 82.4 | 1.54 | 31.89 | 24.28
Grok 3 mini Reasoning (low) | 131k | | $0.35 | 123.9 | 0.26 | 20.44 | 16.15
Grok 3 mini Reasoning (low) Fast | 131k | | $1.45 | 209.4 | 0.30 | 12.24 | 9.55
Reka Flash | 128k | | $0.35 | 46.1 | 0.85 | 11.70 | N/A
Reka Core | 128k | | $2.00 | 27.6 | 0.84 | 18.95 | N/A
Reka Flash (Feb '24) | 128k | | $0.35 | 45.8 | 0.85 | 11.77 | N/A
Reka Edge | 128k | | $0.10 | 85.9 | 0.83 | 6.65 | N/A
o1-preview | 128k | | $26.25 | 165.0 | 21.78 | 24.81 | N/A
o1-preview | 128k | | $28.88 | 155.7 | 23.38 | 26.60 | N/A
GPT-4o (Aug '24) | 128k | | $4.38 | 114.2 | 0.58 | 4.96 | N/A
GPT-4o (Aug '24) | 128k | | $4.38 | 125.4 | 0.66 | 4.65 | N/A
GPT-4 Turbo | 128k | | $15.00 | 51.2 | 0.70 | 10.47 | N/A
GPT-4 Turbo | 128k | | $15.00 | 50.1 | 1.55 | 11.52 | N/A
GPT-3.5 Turbo | 4k | | $0.75 | 137.9 | 0.41 | 4.04 | N/A
GPT-4 | 8k | | $37.50 | 27.8 | 0.63 | 18.61 | N/A
GPT-4.5 (Preview) | 128k | | $93.75 | 77.9 | 0.98 | 7.40 | N/A
Gemini 2.0 Flash-Lite (Preview) (AI Studio) | 1m | | $0.13 | 200.7 | 0.27 | 2.76 | N/A
Gemma 2 27B Fast | 8k | | $0.26 | 86.1 | 0.51 | 6.32 | N/A
Gemma 2 27B Base | 8k | | $0.15 | 51.7 | 0.58 | 10.24 | N/A
Gemma 2 27B | 8k | | $0.80 | 91.2 | 0.27 | 5.75 | N/A
Claude 3.5 Sonnet (June) Vertex | 200k | | $6.00 | 83.9 | 1.31 | 7.27 | N/A
Claude 3.5 Sonnet (June) | 200k | | $6.00 | 80.9 | 0.65 | 6.83 | N/A
Claude 3 Haiku | 200k | | $0.50 | 143.9 | 0.52 | 4.00 | N/A
Claude 2.0 | 100k | | $12.00 | 31.0 | 1.01 | 17.16 | N/A
DeepSeek Coder V2 Lite Fast, FP8 | 128k | | $0.12 | 109.5 | 0.49 | 5.05 | N/A
DeepSeek Coder V2 Lite Base, FP8 | 128k | | $0.06 | 117.4 | 0.51 | 4.77 | N/A
OpenChat 3.5 | 8k | | $0.05 | 60.3 | 0.38 | 8.67 | N/A
Solar Mini | 4k | | $0.15 | 94.5 | 1.05 | 6.35 | N/A
Qwen1.5 Chat 110B | 32k | | $0.00 | 23.7 | 1.60 | 22.71 | N/A
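
For readers who want to sort or filter the endpoints above, the rows can be loaded into records. This is a minimal sketch, assuming each row lists model, context window, an optional intelligence score, price, output tokens/s, latency, end-to-end response time, and a final value taken to be reasoning time. The `Endpoint` class, its field names, and `parse_row` are hypothetical and are not part of any published API.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Endpoint:
    model: str
    context_window: str
    intelligence: Optional[int]   # missing for some rows
    price_usd: float
    tokens_per_s: float
    latency_s: float
    response_s: float
    reasoning_s: Optional[float]  # None when the table shows N/A


def _num(cell: str) -> float:
    """Convert cells such as '$0.50' or '2,502.3' to a float."""
    return float(cell.replace("$", "").replace(",", ""))


def parse_row(line: str) -> Endpoint:
    # Drop empty-image placeholders and blank cells left by the page export.
    cells = [c.replace("![]()", "").strip() for c in line.split("|")]
    cells = [c for c in cells if c]
    model, context, rest = cells[0], cells[1], cells[2:]
    intelligence = None
    if not rest[0].startswith("$"):  # a score precedes the price when present
        intelligence = int(rest[0])
        rest = rest[1:]
    price, speed, latency, response, reasoning = rest[:5]
    return Endpoint(
        model, context, intelligence,
        _num(price), _num(speed), _num(latency), _num(response),
        None if reasoning == "N/A" else _num(reasoning),
    )


# Three rows copied from the table, in both cell layouts it uses.
rows = [
    "![]() | Qwen3 32B (Reasoning) | 41k | 59 | $0.50 | 2,502.3 | 0.26 | 1.26 | 0.80 | |",
    "Gemini 2.5 Pro | 1m | 69 | $3.44 | 153.5 | 32.50 | 35.76 | N/A | ||",
    "Claude 4 Sonnet Thinking | 200k | $6.00 | 81.3 | 1.20 | 31.95 | 24.60 | |||",
]
endpoints = [parse_row(r) for r in rows]
fastest = max(endpoints, key=lambda e: e.tokens_per_s)
print(fastest.model, fastest.tokens_per_s)
```

Detecting the optional score by checking for a leading `$` keeps the parser tolerant of rows that have no intelligence value, at the cost of assuming that every price cell starts with a dollar sign.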