LLM API Providers Leaderboard - Comparison of over 100 LLM endpoints
Comparison and ranking of API provider performance for over 100 AI LLM Model endpoints across performance key metrics including price, output speed, latency, context window & others. For more details including relating to our methodology, see our FAQs.
API providers compared: OpenAI, Playground AI, Mistral, Microsoft Azure, Ideogram, DeepSeek, Amazon Bedrock, Hyperbolic, Groq, FriendliAI, Together.ai, Anthropic, Black Forest Labs, Perplexity, Google, Lambda Labs, Fireworks, Cerebras, Leonardo.Ai, Cohere, Recraft AI, Upstage, Simplismart, Speechmatics, Fish Audio, Deepinfra, Replicate, , Genmo, Nebius, Adobe, MiniMax, CentML, Runpod, StepFun, Zyphra, Murf AI, Speechify, Rev AI, AssemblyAI, fal.ai, Rime, kluster.ai, Prodia, Reka AI, Hume AI, Deepgram, Gladia, Stability.ai, Baseten, Midjourney, Reve, Databricks, ElevenLabs, IBM, Vivago AI, SambaNova, Dreamina, xAI, Cartesia, LMNT, PlayAI, 01.AI, Alibaba Cloud, Novita, AI21 Labs, and WaveSpeed.
Features | Model Intelligence | Price | Output tokens/s | Latency | End-to-End Response Time | ||||
---|---|---|---|---|---|---|---|---|---|
Further Analysis | |||||||||
o4-mini (high) | 200k | 70 | $1.93 | 129.7 | 52.47 | 56.33 | N/A | ||
![]() | o4-mini (high) | 200k | 70 | $1.93 | 85.9 | 57.72 | 63.54 | N/A | |
Gemini 2.5 Pro | 1m | 69 | $3.44 | 154.2 | 34.47 | 37.71 | N/A | ||
o3 | 128k | 67 | $17.50 | 213.3 | 15.64 | 17.98 | N/A | ||
![]() | o3 | 128k | 67 | $17.50 | 81.2 | 38.70 | 44.86 | N/A | |
Grok 3 mini Reasoning (high) | 131k | 67 | $0.35 | 102.8 | 0.41 | 24.73 | 19.45 | ||
Grok 3 mini Reasoning (high) Fast | 131k | 67 | $1.45 | 64.8 | 0.54 | 39.14 | 30.88 | ||
o3-mini (high) | 200k | 66 | $1.93 | 186.6 | 37.76 | 40.44 | N/A | ||
![]() | o3-mini (high) | 200k | 66 | $1.93 | 205.0 | 62.36 | 64.80 | N/A | |
o3-mini | 200k | 63 | $1.93 | 189.2 | 12.65 | 15.30 | N/A | ||
![]() | o3-mini | 200k | 63 | $1.93 | 216.4 | 13.12 | 15.43 | N/A | |
Qwen3 235B A22B (Reasoning) Base | 33k | 62 | $0.30 | 24.5 | 0.58 | 102.48 | 81.52 | ||
Qwen3 235B A22B (Reasoning) | 128k | 62 | $0.10 | 65.7 | 0.64 | 38.71 | 30.45 | ||
Qwen3 235B A22B (Reasoning) (FP8) | 41k | 62 | $0.30 | 27.7 | 0.63 | 90.74 | 72.08 | ||
![]() | Qwen3 235B A22B (Reasoning) (FP8) | 128k | 62 | $0.35 | 28.2 | 0.74 | 89.25 | 70.81 | |
Qwen3 235B A22B (Reasoning) (FP8) | 41k | 62 | $0.30 | 33.6 | 0.52 | 74.91 | 59.51 | ||
![]() | Qwen3 235B A22B (Reasoning) (FP8) | 41k | 62 | $0.61 | 34.4 | 0.47 | 73.05 | 58.06 | |
Qwen3 235B A22B (Reasoning) | 131k | 62 | $2.63 | 69.4 | 1.17 | 37.21 | 28.84 | ||
o1 | 200k | 62 | $26.25 | 134.0 | 22.47 | 26.20 | N/A | ||
![]() | o1 | 200k | 62 | $26.25 | 114.1 | 24.96 | 29.35 | N/A | |
Llama 3.1 Nemotron Ultra 253B Reasoning Base | 131k | 61 | $0.90 | 42.8 | 0.65 | 59.01 | 46.69 | ||
Gemini 2.5 Flash (Reasoning) (AI_Studio) | 1m | 60 | $0.99 | 332.0 | 7.86 | 9.37 | N/A | ||
![]() DeepSeek R1 | 164k | 60 | $0.95 | 38.7 | 0.36 | 73.95 | 60.66 | ||
![]() | ![]() DeepSeek R1 | 64k | 60 | $0.96 | 25.1 | 3.73 | 117.33 | 93.65 | |
![]() DeepSeek R1 | 128k | 60 | $2.00 | 92.2 | 1.02 | 31.90 | 25.46 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $2.36 | 169.1 | 0.37 | 17.20 | 13.88 | |
![]() DeepSeek R1 Base | 128k | 60 | $1.20 | 28.7 | 0.64 | 99.84 | 81.78 | ||
![]() DeepSeek R1 Fast | 128k | 60 | $3.00 | 67.2 | 0.64 | 43.03 | 34.94 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $3.99 | 70.9 | 0.52 | 40.65 | 33.09 | |
![]() | ![]() DeepSeek R1 | 128k | 60 | $2.36 | 105.3 | 0.51 | 27.55 | 22.29 | |
![]() DeepSeek R1 (Fast) | 164k | 60 | $4.25 | 162.6 | 0.54 | 18.05 | 14.43 | ||
![]() DeepSeek R1 (Turbo, FP4) | 33k | 60 | $1.50 | 177.6 | 0.31 | 16.33 | 13.21 | ||
![]() DeepSeek R1 | 64k | 60 | $0.96 | 46.3 | 0.36 | 61.82 | 50.66 | ||
![]() DeepSeek R1 | 128k | 60 | $4.00 | 90.2 | 0.46 | 32.02 | 26.01 | ||
![]() | ![]() DeepSeek R1 Turbo | 64k | 60 | $1.15 | 31.1 | 0.82 | 92.26 | 75.38 | |
![]() | ![]() DeepSeek R1 | 64k | 60 | $4.00 | 31.3 | 0.83 | 91.83 | 75.02 | |
![]() | ![]() DeepSeek R1 | 16k | 60 | $5.50 | 190.8 | 1.87 | 16.79 | 12.30 | |
![]() DeepSeek R1 | 128k | 60 | $4.00 | 101.3 | 0.65 | 28.75 | 23.17 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $7.00 | 34.6 | 0.73 | 83.08 | 67.89 | |
![]() | Qwen3 32B (Reasoning) | 41k | 59 | $0.50 | 2,394.2 | 0.31 | 1.36 | 0.84 | |
Qwen3 32B (Reasoning) Base | 33k | 59 | $0.15 | 41.6 | 0.58 | 60.61 | 48.02 | ||
Qwen3 32B (Reasoning) (FP8) | 41k | 59 | $0.15 | 46.4 | 0.49 | 54.34 | 43.08 | ||
![]() | Qwen3 32B (Reasoning) (FP8) | 128k | 59 | $0.19 | 26.8 | 0.90 | 94.09 | 74.55 | |
![]() | Qwen3 32B (Reasoning) | 8k | 59 | $0.50 | 334.1 | 0.41 | 7.89 | 5.99 | |
Qwen3 32B (Reasoning) | 131k | 59 | $2.63 | 64.9 | 1.08 | 39.58 | 30.80 | ||
QwQ-32B | 131k | 58 | $0.20 | 133.7 | 1.07 | 23.44 | 18.63 | ||
QwQ-32B Base | 131k | 58 | $0.23 | 53.8 | 0.59 | 56.18 | 46.30 | ||
![]() | QwQ-32B | 131k | 58 | $0.65 | 78.2 | 0.33 | 38.57 | 31.85 | |
QwQ-32B | 131k | 58 | $0.90 | 159.2 | 2.31 | 21.09 | 15.64 | ||
QwQ-32B | 131k | 58 | $0.14 | 47.3 | 0.57 | 63.76 | 52.63 | ||
![]() | QwQ-32B | 33k | 58 | $0.18 | 33.7 | 0.66 | 89.50 | 73.99 | |
QwQ-32B | 131k | 58 | $0.32 | 403.3 | 0.28 | 7.70 | 6.18 | ||
![]() | QwQ-32B | 16k | 58 | $0.63 | 402.6 | 0.42 | 7.85 | 6.19 | |
QwQ-32B | 131k | 58 | $1.20 | 98.0 | 0.47 | 30.99 | 25.42 | ||
![]() | Claude 3.7 Sonnet Thinking | 200k | 57 | $6.00 | 46.6 | 1.70 | 36.15 | 23.72 | |
Claude 3.7 Sonnet Thinking | 200k | 57 | $6.00 | 88.9 | 1.55 | 19.60 | 12.43 | ||
Qwen3 14B (Reasoning) Base | 33k | 56 | $0.12 | 88.6 | 0.52 | 28.74 | 22.57 | ||
Qwen3 14B (Reasoning) (FP8) | 128k | 56 | $0.12 | 67.2 | 0.21 | 37.43 | 29.77 | ||
![]() | Qwen3 14B (Reasoning) (FP8) | 128k | 56 | $0.12 | 56.2 | 0.71 | 45.21 | 35.60 | |
Qwen3 14B (Reasoning) | 131k | 56 | $1.31 | 63.6 | 1.04 | 40.35 | 31.44 | ||
Qwen3 30B A3B (Reasoning) Fast | 33k | 56 | $0.45 | 135.5 | 0.53 | 18.97 | 14.75 | ||
Qwen3 30B A3B (Reasoning) Base | 33k | 56 | $0.15 | 126.2 | 0.52 | 20.33 | 15.85 | ||
Qwen3 30B A3B (Reasoning) | 131k | 56 | $0.90 | 131.4 | 0.68 | 19.70 | 15.22 | ||
Qwen3 30B A3B (Reasoning) (FP8) | 41k | 56 | $0.15 | 115.5 | 0.49 | 22.14 | 17.32 | ||
![]() | Qwen3 30B A3B (Reasoning) (FP8) | 128k | 56 | $0.19 | 173.4 | 0.68 | 15.10 | 11.53 | |
Qwen3 30B A3B (Reasoning) | 131k | 56 | $0.75 | 92.2 | 1.01 | 28.13 | 21.69 | ||
o1-mini | 128k | 54 | $1.93 | 226.3 | 9.46 | 11.67 | N/A | ||
![]() | o1-mini | 128k | 54 | $1.93 | 263.7 | 9.47 | 11.37 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 64k | 53 | $0.48 | 25.0 | 3.64 | 23.66 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.25 | 26.4 | 1.27 | 20.21 | N/A | ||
![]() DeepSeek V3 (Mar' 25) Fast | 128k | 53 | $3.00 | 91.7 | 0.66 | 6.11 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $0.75 | 22.7 | 0.66 | 22.68 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $0.80 | 16.2 | 0.51 | 31.38 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $2.00 | 78.0 | 0.48 | 6.89 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 160k | 53 | $0.90 | 258.1 | 0.38 | 2.31 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $0.52 | 28.3 | 0.54 | 18.19 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $0.57 | 24.2 | 1.03 | 21.70 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 8k | 53 | $1.13 | 264.0 | 0.62 | 2.52 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.25 | 98.6 | 0.55 | 5.62 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $1.25 | 21.2 | 0.73 | 24.37 | N/A | |
GPT-4.1 mini | 1m | 53 | $0.70 | 78.9 | 0.46 | 6.80 | N/A | ||
![]() | GPT-4.1 mini | 1m | 53 | $0.70 | 198.0 | 0.57 | 3.09 | N/A | |
GPT-4.1 | 1m | 53 | $3.50 | 123.3 | 0.57 | 4.63 | N/A | ||
![]() | GPT-4.1 | 1m | 53 | $3.50 | 210.0 | 0.71 | 3.09 | N/A | |
![]() DeepSeek R1 Distill Qwen 32B | 128k | 52 | $0.14 | 46.2 | 0.25 | 54.32 | 43.25 | ||
![]() | ![]() DeepSeek R1 Distill Qwen 32B | 64k | 52 | $0.30 | 21.0 | 1.20 | 119.98 | 95.02 | |
![]() | Qwen3 8B (Reasoning) (FP8) | 128k | 51 | $0.06 | 39.1 | 0.81 | 64.73 | 51.13 | |
Qwen3 8B (Reasoning) | 131k | 51 | $0.66 | 94.2 | 0.97 | 27.51 | 21.23 | ||
Grok 3 | 131k | 51 | $6.00 | 51.5 | 0.53 | 10.23 | N/A | ||
Grok 3 Fast | 131k | 51 | $10.00 | 92.1 | 0.60 | 6.03 | N/A | ||
Llama 4 Maverick (FP8) | 1m | 51 | $0.28 | 122.8 | 0.42 | 4.49 | N/A | ||
![]() | Llama 4 Maverick | 128k | 51 | $0.42 | 299.2 | 0.59 | 2.26 | N/A | |
Llama 4 Maverick Vertex | 524k | 51 | $0.55 | 126.2 | 0.38 | 4.34 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.20 | 125.6 | 0.25 | 4.23 | N/A | |
![]() | Llama 4 Maverick (FP8) | 128k | 51 | $0.61 | 56.1 | 0.33 | 9.24 | N/A | |
Llama 4 Maverick | 1m | 51 | $0.39 | 179.3 | 0.52 | 3.31 | N/A | ||
Llama 4 Maverick (FP8) | 131k | 51 | $0.27 | 117.5 | 0.23 | 4.49 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.34 | 83.7 | 0.60 | 6.57 | N/A | |
Llama 4 Maverick | 128k | 51 | $0.30 | 277.2 | 0.34 | 2.14 | N/A | ||
![]() | Llama 4 Maverick | 64k | 51 | $0.92 | 793.9 | 0.37 | 0.99 | N/A | |
Llama 4 Maverick (FP8) | 524k | 51 | $0.41 | 127.3 | 0.20 | 4.13 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.35 | 163.1 | 0.65 | 3.71 | N/A | |
GPT-4o (March 2025) | 128k | 50 | $7.50 | 190.6 | 0.36 | 2.99 | N/A | ||
Gemini 2.0 Pro Experimental (AI Studio) | 2m | 49 | $0.00 | 31.1 | 16.85 | 32.94 | N/A | ||
![]() | ![]() DeepSeek R1 Distill Qwen 14B | 64k | 49 | $0.15 | 45.0 | 0.86 | 56.40 | 44.44 | |
![]() DeepSeek R1 Distill Qwen 14B | 128k | 49 | $1.60 | 170.8 | 0.33 | 14.97 | 11.71 | ||
Gemini 2.5 Flash (AI_Studio) | 1m | 49 | $0.26 | 267.3 | 0.40 | 2.27 | N/A | ||
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.30 | 62.6 | 0.30 | 40.26 | 31.96 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 66k | 48 | $0.94 | 2,178.6 | 0.31 | 1.46 | 0.92 | |
![]() DeepSeek R1 Distill Llama 70B Base | 128k | 48 | $0.38 | 56.8 | 0.58 | 44.57 | 35.19 | ||
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.34 | 32.0 | 0.45 | 78.49 | 62.43 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 32k | 48 | $0.39 | 90.1 | 0.69 | 28.45 | 22.20 | |
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.81 | 422.5 | 0.16 | 6.08 | 4.73 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 16k | 48 | $0.88 | 304.7 | 1.56 | 9.77 | 6.56 | |
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $2.00 | 123.1 | 0.40 | 20.71 | 16.24 | ||
![]() | Claude 3.7 Sonnet | 200k | 48 | $6.00 | 45.0 | 1.57 | 12.67 | N/A | |
Claude 3.7 Sonnet | 200k | 48 | $6.00 | 75.5 | 2.31 | 8.93 | N/A | ||
Gemini 2.0 Flash Vertex | 1m | 48 | $0.26 | 261.8 | 0.25 | 2.16 | N/A | ||
Gemini 2.0 Flash (AI Studio) | 1m | 48 | $0.17 | 240.6 | 0.36 | 2.44 | N/A | ||
Qwen3 4B (Reasoning) Fast | 33k | 47 | $0.12 | 157.8 | 0.48 | 16.32 | 12.67 | ||
![]() | Qwen3 4B (Reasoning) (FP8) | 128k | 47 | $0.00 | 49.1 | 0.69 | 51.60 | 40.72 | |
Qwen3 4B (Reasoning) | 131k | 47 | $0.40 | 100.5 | 0.98 | 25.85 | 19.90 | ||
![]() | ![]() Reka Flash 3 | 128k | 47 | $0.35 | 56.4 | 0.96 | 45.27 | 35.45 | |
Qwen3 235B A22B | 131k | 47 | $1.23 | 70.3 | 1.22 | 8.34 | N/A | ||
Gemini 2.0 Flash (exp) (AI Studio) | 1m | 46 | $0.00 | 259.5 | 0.24 | 2.17 | N/A | ||
![]() DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $0.25 | 32.2 | 1.18 | 16.70 | N/A | ||
![]() DeepSeek V3 (Dec '24) | 128k | 46 | $0.75 | 22.9 | 0.66 | 22.47 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) | 128k | 46 | $2.00 | 73.7 | 0.48 | 7.26 | N/A | |
![]() DeepSeek V3 (Dec '24) | 128k | 46 | $1.31 | 71.0 | 0.66 | 7.70 | N/A | ||
![]() DeepSeek V3 (Dec '24) | 64k | 46 | $0.59 | 29.7 | 0.33 | 17.14 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) Turbo | 64k | 46 | $0.63 | 31.1 | 0.84 | 16.94 | N/A | |
![]() | ![]() DeepSeek V3 (Dec '24) | 64k | 46 | $0.89 | 30.8 | 0.77 | 17.00 | N/A | |
![]() DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $1.25 | 99.9 | 0.49 | 5.50 | N/A | ||
Qwen2.5 Max | 32k | 45 | $2.80 | 50.9 | 1.26 | 11.07 | N/A | ||
Gemini 1.5 Pro (Sep) (Vertex) | 2m | 45 | $2.19 | 92.8 | 0.56 | 5.94 | N/A | ||
Gemini 1.5 Pro (Sep) (AI Studio) | 2m | 45 | $2.19 | 92.1 | 0.65 | 6.08 | N/A | ||
![]() | Claude 3.5 Sonnet (Oct) | 200k | 44 | $6.00 | 42.8 | 1.10 | 12.79 | N/A | |
Claude 3.5 Sonnet (Oct) Vertex | 200k | 44 | $6.00 | 77.8 | 0.73 | 7.15 | N/A | ||
Claude 3.5 Sonnet (Oct) | 200k | 44 | $6.00 | 77.7 | 0.88 | 7.32 | N/A | ||
Qwen3 32B | 131k | 44 | $1.23 | 65.5 | 1.06 | 8.69 | N/A | ||
![]() Sonar | 127k | 43 | $1.00 | 141.6 | 1.82 | 5.36 | N/A | ||
Llama 4 Scout | 1m | 43 | $0.14 | 119.6 | 0.27 | 4.45 | N/A | ||
![]() | Llama 4 Scout | 32k | 43 | $0.70 | 2,743.2 | 0.30 | 0.48 | N/A | |
![]() | Llama 4 Scout | 128k | 43 | $0.29 | 162.3 | 0.61 | 3.69 | N/A | |
Llama 4 Scout Vertex | 1m | 43 | $0.36 | 131.3 | 0.39 | 4.19 | N/A | ||
![]() | Llama 4 Scout | 1m | 43 | $0.10 | 115.1 | 0.28 | 4.62 | N/A | |
![]() | Llama 4 Scout | 128k | 43 | $0.34 | 35.7 | 0.34 | 14.34 | N/A | |
Llama 4 Scout | 1m | 43 | $0.26 | 162.5 | 2.74 | 5.82 | N/A | ||
Llama 4 Scout | 131k | 43 | $0.15 | 48.4 | 0.58 | 10.91 | N/A | ||
![]() | Llama 4 Scout | 131k | 43 | $0.20 | 66.3 | 0.92 | 8.47 | N/A | |
Llama 4 Scout | 131k | 43 | $0.17 | 567.7 | 0.34 | 1.22 | N/A | ||
![]() | Llama 4 Scout | 8k | 43 | $0.47 | 792.0 | 0.77 | 1.40 | N/A | |
Llama 4 Scout | 328k | 43 | $0.28 | 118.0 | 0.18 | 4.42 | N/A | ||
![]() | Llama 4 Scout | 128k | 43 | $0.71 | 95.0 | 0.67 | 5.93 | N/A | |
![]() Sonar Pro | 200k | 43 | $6.00 | 81.2 | 2.79 | 8.95 | N/A | ||
QwQ 32B-Preview | 33k | 43 | $0.26 | 47.5 | 0.27 | 52.95 | 42.14 | ||
QwQ 32B-Preview | 33k | 43 | $1.20 | 97.9 | 0.47 | 26.00 | 20.42 | ||
![]() | ![]() Nova Premier | 1m | 43 | $5.00 | 65.4 | 0.85 | 8.50 | N/A | |
Qwen3 30B A3B | 131k | 43 | $0.35 | 92.5 | 1.03 | 6.43 | N/A | ||
GPT-4o (Nov '24) | 128k | 41 | $4.38 | 138.6 | 0.46 | 4.07 | N/A | ||
![]() | GPT-4o (Nov '24) | 128k | 41 | $4.38 | 141.2 | 1.11 | 4.65 | N/A | |
Gemini 2.0 Flash-Lite (Feb '25) (AI Studio) | 1m | 41 | $0.13 | 211.4 | 0.27 | 2.64 | N/A | ||
Llama 3.3 70B (FP8) | 128k | 41 | $0.17 | 57.7 | 0.34 | 9.00 | N/A | ||
![]() | Llama 3.3 70B | 33k | 41 | $0.94 | 2,566.9 | 0.27 | 0.46 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.40 | 33.2 | 1.16 | 16.22 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.71 | 264.1 | 0.52 | 2.42 | N/A | |
Llama 3.3 70B Fast | 128k | 41 | $0.38 | 138.0 | 0.57 | 4.19 | N/A | ||
Llama 3.3 70B Base | 128k | 41 | $0.20 | 40.1 | 0.63 | 13.10 | N/A | ||
Llama 3.3 70B Vertex | 128k | 41 | $0.72 | 72.6 | 0.29 | 7.18 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.50 | 145.9 | 0.38 | 3.81 | N/A | |
![]() | Llama 3.3 70B | 128k | 41 | $0.71 | 52.6 | 0.45 | 9.95 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.90 | 121.3 | 5.31 | 9.43 | N/A | ||
Llama 3.3 70B (Turbo, FP8) | 128k | 41 | $0.20 | 32.2 | 0.25 | 15.75 | N/A | ||
Llama 3.3 70B | 128k | 41 | $0.27 | 30.4 | 0.32 | 16.75 | N/A | ||
Llama 3.3 70B | 128k | 41 | $0.60 | 167.6 | 0.39 | 3.38 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.20 | 108.4 | 0.63 | 5.24 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.64 | 418.7 | 0.23 | 1.42 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.75 | 447.8 | 0.33 | 1.44 | N/A | |
Llama 3.3 70B Turbo | 128k | 41 | $0.88 | 135.0 | 0.30 | 4.01 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.70 | 31.9 | 0.37 | 16.05 | N/A | |
GPT-4.1 nano | 1m | 41 | $0.17 | 196.1 | 0.33 | 2.88 | N/A | ||
![]() | GPT-4.1 nano | 1m | 41 | $0.17 | 210.6 | 0.65 | 3.03 | N/A | |
Qwen3 14B | 131k | 41 | $0.61 | 64.2 | 1.14 | 8.93 | N/A | ||
GPT-4o (May '24) | 128k | 41 | $7.50 | 154.2 | 0.51 | 3.75 | N/A | ||
![]() | GPT-4o (May '24) | 128k | 41 | $7.50 | 142.2 | 0.70 | 4.21 | N/A | |
Llama 3.1 405B (FP8) | 128k | 40 | $0.80 | 32.9 | 0.33 | 15.51 | N/A | ||
Llama 3.1 405B | 128k | 40 | $4.00 | 96.8 | 1.02 | 6.19 | N/A | ||
![]() | Llama 3.1 405B Standard | 128k | 40 | $2.40 | 30.7 | 1.82 | 18.12 | N/A | |
![]() | Llama 3.1 405B Latency Optimized | 128k | 40 | $3.00 | 86.2 | 0.43 | 6.23 | N/A | |
Llama 3.1 405B Base | 128k | 40 | $1.50 | 32.7 | 0.67 | 15.97 | N/A | ||
Llama 3.1 405B Vertex | 128k | 40 | $7.75 | 28.9 | 0.42 | 17.72 | N/A | ||
![]() | Llama 3.1 405B | 128k | 40 | $8.00 | 31.4 | 0.48 | 16.39 | N/A | |
Llama 3.1 405B | 128k | 40 | $3.00 | 94.5 | 10.38 | 15.67 | N/A | ||
Llama 3.1 405B | 33k | 40 | $0.90 | 23.5 | 0.74 | 22.00 | N/A | ||
![]() | Llama 3.1 405B | 16k | 40 | $6.25 | 169.4 | 1.62 | 4.57 | N/A | |
Llama 3.1 405B | 128k | 40 | $7.50 | 38.3 | 0.81 | 13.86 | N/A | ||
Llama 3.1 405B Turbo | 128k | 40 | $3.50 | 101.8 | 0.39 | 5.30 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.40 | 20.4 | 1.88 | 26.39 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.20 | 25.8 | 0.67 | 20.01 | N/A | ||
Qwen2.5 72B Fast | 131k | 40 | $0.38 | 70.5 | 0.54 | 7.63 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.90 | 71.3 | 0.54 | 7.56 | N/A | ||
Qwen2.5 72B | 33k | 40 | $0.27 | 33.6 | 0.28 | 15.16 | N/A | ||
Qwen2.5 72B Turbo | 131k | 40 | $1.20 | 100.7 | 0.50 | 5.46 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.00 | 51.1 | 1.07 | 10.85 | N/A | ||
![]() | ![]() MiniMax-Text-01 | 1m | 40 | $0.42 | 32.5 | 0.90 | 16.27 | N/A | |
Phi-4 | 16k | 40 | $0.15 | 117.7 | 0.52 | 4.77 | N/A | ||
![]() | Phi-4 | 16k | 40 | $0.22 | 32.8 | 0.46 | 15.71 | N/A | |
Phi-4 | 16k | 40 | $0.09 | 41.6 | 0.45 | 12.47 | N/A | ||
![]() Command A | 256k | 40 | $4.38 | 94.0 | 0.20 | 5.52 | N/A | ||
Gemini 1.5 Flash (Sep) (Vertex) | 1m | 39 | $0.13 | 192.2 | 0.18 | 2.78 | N/A | ||
Gemini 1.5 Flash (Sep) (AI Studio) | 1m | 39 | $0.13 | 195.9 | 0.31 | 2.86 | N/A | ||
![]() | ![]() Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 60.0 | 0.41 | 8.74 | N/A | |
![]() | ![]() Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 34.9 | 0.52 | 14.85 | N/A | |
![]() | Qwen3 1.7B (Reasoning) (FP8) | 32k | 38 | $0.00 | 48.4 | 0.66 | 52.34 | 41.34 | |
Qwen3 1.7B (Reasoning) | 33k | 38 | $0.40 | 131.1 | 0.90 | 19.98 | 15.26 | ||
Gemma 3 27B | 128k | 38 | $0.07 | 35.9 | 0.54 | 14.45 | N/A | ||
Grok Beta | 128k | 38 | $7.50 | 67.9 | 0.30 | 7.67 | N/A | ||
![]() | ![]() Pixtral Large | 128k | 37 | $3.00 | 82.3 | 0.37 | 6.44 | N/A | |
Qwen2.5 Instruct 32B Fast | 128k | 37 | $0.20 | 87.2 | 0.53 | 6.27 | N/A | ||
Qwen2.5 Instruct 32B Base | 128k | 37 | $0.10 | 61.0 | 0.55 | 8.74 | N/A | ||
Llama 3.1 Nemotron 70B (FP8) | 128k | 37 | $0.17 | 50.3 | 0.24 | 10.19 | N/A | ||
Llama 3.1 Nemotron 70B Base | 128k | 37 | $0.20 | 37.5 | 0.63 | 13.96 | N/A | ||
Llama 3.1 Nemotron 70B Fast | 128k | 37 | $0.38 | 71.7 | 0.52 | 7.50 | N/A | ||
Llama 3.1 Nemotron 70B | 128k | 37 | $0.27 | 27.0 | 0.43 | 18.94 | N/A | ||
![]() | ![]() Nova Pro | 300k | 37 | $1.40 | 171.4 | 0.36 | 3.28 | N/A | |
Qwen3 8B | 131k | 37 | $0.31 | 95.4 | 1.01 | 6.25 | N/A | ||
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 37.1 | 0.50 | 13.96 | N/A | |
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 33.4 | 0.45 | 15.41 | N/A | |
Qwen2.5 Coder 32B | 33k | 36 | $0.09 | 43.3 | 0.32 | 11.86 | N/A | ||
Qwen2.5 Coder 32B | 131k | 36 | $0.20 | 53.7 | 0.99 | 10.30 | N/A | ||
Qwen2.5 Coder 32B | 33k | 36 | $0.10 | 53.0 | 0.26 | 9.69 | N/A | ||
Qwen2.5 Coder 32B | 131k | 36 | $0.80 | 80.7 | 0.49 | 6.68 | N/A | ||
GPT-4o mini | 128k | 36 | $0.26 | 70.3 | 0.47 | 7.58 | N/A | ||
![]() | GPT-4o mini | 128k | 36 | $0.26 | 153.7 | 0.82 | 4.07 | N/A | |
Llama 3.1 70B (FP8) | 128k | 35 | $0.17 | 48.5 | 0.26 | 10.58 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.40 | 142.0 | 0.94 | 4.46 | N/A | ||
![]() | Llama 3.1 70B Standard | 128k | 35 | $0.72 | 31.6 | 0.64 | 16.44 | N/A | |
![]() | Llama 3.1 70B Latency Optimized | 128k | 35 | $0.90 | 136.6 | 0.32 | 3.98 | N/A | |
Llama 3.1 70B Base | 128k | 35 | $0.20 | 32.7 | 0.64 | 15.91 | N/A | ||
Llama 3.1 70B Fast | 128k | 35 | $0.38 | 140.7 | 0.54 | 4.10 | N/A | ||
Llama 3.1 70B Vertex | 128k | 35 | $0.72 | 72.4 | 0.27 | 7.17 | N/A | ||
![]() | Llama 3.1 70B | 128k | 35 | $2.90 | 53.9 | 0.45 | 9.72 | N/A | |
Llama 3.1 70B | 128k | 35 | $0.90 | 154.4 | 0.39 | 3.63 | N/A | ||
Llama 3.1 70B (Turbo, FP8) | 128k | 35 | $0.20 | 38.9 | 0.23 | 13.10 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.27 | 52.8 | 0.27 | 9.74 | N/A | ||
![]() | Llama 3.1 70B | 32k | 35 | $0.19 | 14.8 | 1.32 | 35.08 | N/A | |
Llama 3.1 70B Turbo | 128k | 35 | $0.88 | 134.4 | 0.32 | 4.04 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.90 | 125.6 | 0.49 | 4.47 | N/A | ||
![]() | ![]() Mistral Small 3.1 | 128k | 35 | $0.15 | 154.3 | 0.27 | 3.51 | N/A | |
![]() Mistral Small 3.1 Vertex | 128k | 35 | $0.15 | 210.5 | 0.17 | 2.54 | N/A | ||
![]() | ![]() Mistral Small 3 | 32k | 35 | $0.15 | 161.5 | 0.27 | 3.36 | N/A | |
![]() Mistral Small 3 | 32k | 35 | $0.09 | 86.6 | 0.20 | 5.98 | N/A | ||
![]() Mistral Small 3 | 32k | 35 | $0.80 | 95.6 | 0.18 | 5.41 | N/A | ||
Qwen3 4B | 131k | 35 | $0.19 | 102.9 | 0.98 | 5.83 | N/A | ||
![]() | Claude 3 Opus | 200k | 35 | $30.00 | 20.8 | 1.28 | 25.38 | N/A | |
Claude 3 Opus Vertex | 200k | 35 | $30.00 | 21.9 | 2.45 | 25.26 | N/A | ||
Claude 3 Opus | 200k | 35 | $30.00 | 27.5 | 1.02 | 19.21 | N/A | ||
![]() | Claude 3.5 Haiku Standard | 200k | 35 | $1.60 | 60.9 | 1.29 | 9.50 | N/A | |
![]() | Claude 3.5 Haiku Latency Optimized | 200k | 35 | $2.00 | 95.9 | 0.49 | 5.70 | N/A | |
Claude 3.5 Haiku Vertex | 200k | 35 | $1.60 | 65.8 | 1.32 | 8.92 | N/A | ||
Claude 3.5 Haiku | 200k | 35 | $1.60 | 64.3 | 0.89 | 8.66 | N/A | ||
![]() | ![]() DeepSeek R1 Distill Llama 8B | 32k | 34 | $0.04 | 52.8 | 0.69 | 48.06 | 37.90 | |
Gemma 3 12B | 128k | 34 | $0.06 | 19.8 | 0.71 | 25.92 | N/A | ||
Gemini 1.5 Pro (May) (Vertex) | 2m | 34 | $2.19 | 66.8 | 0.34 | 7.83 | N/A | ||
Gemini 1.5 Pro (May) (AI Studio) | 2m | 34 | $2.19 | 68.5 | 0.42 | 7.73 | N/A | ||
Qwen2.5 Turbo | 1m | 34 | $0.09 | 109.4 | 1.11 | 5.68 | N/A | ||
![]() | Llama 3.2 90B (Vision) | 128k | 33 | $0.72 | 60.5 | 0.51 | 8.78 | N/A | |
Llama 3.2 90B (Vision) Vertex | 128k | 33 | $0.00 | 32.7 | 0.20 | 15.49 | N/A | ||
Llama 3.2 90B (Vision) | 33k | 33 | $0.36 | 50.1 | 0.29 | 10.26 | N/A | ||
Llama 3.2 90B (Vision) Turbo | 128k | 33 | $1.20 | 28.5 | 0.23 | 17.75 | N/A | ||
Qwen2 72B | 33k | 33 | $0.90 | 40.4 | 0.46 | 12.82 | N/A | ||
Qwen2 72B | 131k | 33 | $0.00 | 31.0 | 1.31 | 17.42 | N/A | ||
![]() | ![]() Nova Lite | 300k | 33 | $0.10 | 284.3 | 0.30 | 2.06 | N/A | |
Gemini 1.5 Flash-8B AI Studio | 1m | 31 | $0.07 | 283.2 | 0.20 | 1.96 | N/A | ||
![]() | ![]() Jamba 1.5 Large | 256k | 29 | $3.50 | 51.1 | 0.69 | 10.47 | N/A | |
![]() Jamba 1.6 Large | 256k | 29 | $3.50 | 61.4 | 0.59 | 8.74 | N/A | ||
Gemini 1.5 Flash (May) (Vertex) | 1m | 28 | $0.13 | 329.2 | 0.25 | 1.77 | N/A | ||
Gemini 1.5 Flash (May) (AI Studio) | 1m | 28 | $0.13 | 326.8 | 0.23 | 1.76 | N/A | ||
![]() | ![]() Nova Micro | 130k | 28 | $0.06 | 327.2 | 0.30 | 1.82 | N/A | |
![]() Yi-Large | 32k | 28 | $3.00 | 67.8 | 0.41 | 7.79 | N/A | ||
![]() | Claude 3 Sonnet | 200k | 28 | $6.00 | 48.0 | 0.73 | 11.15 | N/A | |
Claude 3 Sonnet | 200k | 28 | $6.00 | 59.7 | 0.61 | 8.98 | N/A | ||
![]() | ![]() Codestral (Jan '25) | 256k | 28 | $0.45 | 122.7 | 0.32 | 4.39 | N/A | |
![]() Codestral (Jan '25) Vertex | 128k | 28 | $0.45 | 154.3 | 0.15 | 3.39 | N/A | ||
Llama 3 70B | 8k | 27 | $0.40 | 20.0 | 1.28 | 26.33 | N/A | ||
![]() | Llama 3 70B | 8k | 27 | $2.86 | 47.4 | 0.40 | 10.95 | N/A | |
![]() | Llama 3 70B | 8k | 27 | $2.90 | 18.9 | 0.77 | 27.16 | N/A | |
Llama 3 70B | 8k | 27 | $0.27 | 33.4 | 0.49 | 15.44 | N/A | ||
![]() | Llama 3 70B | 8k | 27 | $0.57 | 19.1 | 1.07 | 27.23 | N/A | |
Llama 3 70B | 8k | 27 | $0.64 | 334.9 | 0.26 | 1.75 | N/A | ||
Llama 3 70B (Reference, FP16) | 8k | 27 | $0.90 | 131.3 | 0.71 | 4.51 | N/A | ||
Llama 3 70B (Turbo, FP8) | 8k | 27 | $0.88 | 134.4 | 0.39 | 4.11 | N/A | ||
![]() | ![]() Mistral Small (Sep '24) | 33k | 27 | $0.30 | 88.2 | 0.30 | 5.97 | N/A | |
![]() | Phi-4 Multimodal | 128k | 27 | $0.00 | 21.5 | 0.35 | 23.59 | N/A | |
Qwen2.5 Coder 7B Fast | 131k | 27 | $0.04 | 230.2 | 0.46 | 2.63 | N/A | ||
Qwen2.5 Coder 7B Base | 131k | 27 | $0.01 | 191.9 | 0.49 | 3.09 | N/A | ||
![]() | ![]() Mistral Large (Feb '24) | 33k | 26 | $6.00 | 30.1 | 0.47 | 17.09 | N/A | |
![]() | ![]() Mistral Large (Feb '24) | 33k | 26 | $6.00 | 45.9 | 0.39 | 11.27 | N/A | |
![]() | ![]() Mixtral 8x22B | 65k | 26 | $3.00 | 59.9 | 0.32 | 8.67 | N/A | |
![]() Mixtral 8x22B Base | 65k | 26 | $0.60 | 77.2 | 0.53 | 7.01 | N/A | ||
![]() Mixtral 8x22B Fast | 65k | 26 | $1.05 | 104.2 | 0.52 | 5.31 | N/A | ||
![]() Mixtral 8x22B | 65k | 26 | $1.20 | 62.7 | 0.38 | 8.36 | N/A | ||
![]() Mixtral 8x22B | 65k | 26 | $1.20 | 88.3 | 0.31 | 5.97 | N/A | ||
![]() | Phi-4 Mini | 128k | 26 | $0.12 | 222.5 | 0.26 | 2.51 | N/A | |
![]() | Phi-4 Mini | 128k | 26 | $0.00 | 58.6 | 0.33 | 8.87 | N/A | |
Qwen3 1.7B | 33k | 25 | $0.19 | 134.8 | 0.93 | 4.64 | N/A | ||
![]() | Phi-3 Medium 14B | 128k | 25 | $0.30 | 52.2 | 0.44 | 10.01 | N/A | |
Gemma 3 4B | 128k | 24 | $0.03 | 92.6 | 0.22 | 5.62 | N/A | ||
![]() | Claude 2.1 | 200k | 24 | $12.00 | 29.5 | 1.78 | 18.74 | N/A | |
Claude 2.1 | 200k | 24 | $12.00 | 14.1 | 0.95 | 36.51 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.03 | 142.3 | 0.27 | 3.78 | N/A | ||
![]() | Llama 3.1 8B | 33k | 24 | $0.10 | 2,176.4 | 0.28 | 0.51 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.10 | 442.6 | 0.72 | 1.85 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.22 | 91.3 | 0.35 | 5.83 | N/A | |
Llama 3.1 8B Fast | 128k | 24 | $0.04 | 181.8 | 0.50 | 3.25 | N/A | ||
Llama 3.1 8B Base | 128k | 24 | $0.03 | 58.2 | 0.54 | 9.13 | N/A | ||
Llama 3.1 8B Vertex | 128k | 24 | $0.00 | 119.8 | 0.17 | 4.35 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.38 | 225.6 | 0.30 | 2.52 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.20 | 274.8 | 0.24 | 2.06 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.04 | 51.5 | 0.23 | 9.93 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.10 | 448.8 | 0.27 | 1.38 | N/A | ||
![]() | Llama 3.1 8B | 16k | 24 | $0.03 | 75.6 | 0.72 | 7.34 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.06 | 840.6 | 0.25 | 0.84 | N/A | ||
![]() | Llama 3.1 8B | 16k | 24 | $0.13 | 1,038.8 | 0.22 | 0.70 | N/A | |
Llama 3.1 8B Turbo | 128k | 24 | $0.18 | 176.5 | 0.23 | 3.06 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.15 | 455.2 | 0.14 | 1.24 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.18 | 61.4 | 0.44 | 8.58 | N/A | |
![]() | ![]() Pixtral 12B | 128k | 23 | $0.15 | 86.3 | 0.28 | 6.07 | N/A | |
![]() Pixtral 12B | 128k | 23 | $0.10 | 78.1 | 0.62 | 7.02 | N/A | ||
Qwen3 0.6B (Reasoning) | 33k | 23 | $0.40 | 209.7 | 0.94 | 12.86 | 9.54 | ||
![]() | ![]() Mistral Small (Feb '24) | 33k | 23 | $1.50 | 161.2 | 0.26 | 3.36 | N/A | |
![]() | ![]() Mistral Small (Feb '24) | 33k | 23 | $1.50 | 88.2 | 0.39 | 6.06 | N/A | |
![]() | ![]() Mistral Medium | 33k | 23 | $4.09 | 84.8 | 0.39 | 6.29 | N/A | |
![]() | ![]() Ministral 8B | 128k | 22 | $0.10 | 134.6 | 0.30 | 4.01 | N/A | |
Gemma 2 9B Fast | 8k | 22 | $0.04 | 169.0 | 0.47 | 3.43 | N/A | ||
Gemma 2 9B Base | 8k | 22 | $0.03 | 152.8 | 0.47 | 3.75 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.04 | 17.9 | 0.75 | 28.67 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.20 | 719.3 | 0.21 | 0.91 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.30 | 142.6 | 0.19 | 3.69 | N/A | ||
![]() LFM 40B | 32k | 22 | $0.15 | 160.9 | 0.17 | 3.27 | N/A | ||
![]() | ![]() Command-R+ | 128k | 21 | $6.00 | 47.5 | 0.47 | 11.00 | N/A | |
![]() Command-R+ | 128k | 21 | $4.38 | 49.3 | 0.26 | 10.40 | N/A | ||
![]() | Llama 3 8B | 8k | 21 | $0.38 | 104.0 | 0.30 | 5.11 | N/A | |
![]() | Llama 3 8B | 8k | 21 | $0.38 | 73.7 | 0.36 | 7.15 | N/A | |
Llama 3 8B | 8k | 21 | $0.04 | 115.2 | 0.20 | 4.54 | N/A | ||
![]() | Llama 3 8B | 8k | 21 | $0.04 | 60.1 | 0.87 | 9.19 | N/A | |
Llama 3 8B | 8k | 21 | $0.06 | 1,343.9 | 0.34 | 0.71 | N/A | ||
Llama 3 8B | 8k | 21 | $0.20 | 193.6 | 0.29 | 2.87 | N/A | ||
![]() | ![]() Codestral (May '24) | 33k | 20 | $0.30 | 104.8 | 0.31 | 5.08 | N/A | |
![]() Aya Expanse 32B | 128k | 20 | $0.75 | 120.4 | 0.17 | 4.32 | N/A | ||
![]() | ![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 47.3 | 0.46 | 11.04 | N/A | |
![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 64.8 | 0.23 | 7.94 | N/A | ||
![]() | ![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 29.0 | 0.65 | 17.92 | N/A | |
![]() | ![]() Ministral 3B | 128k | 20 | $0.04 | 230.4 | 0.26 | 2.43 | N/A | |
![]() | ![]() Mistral NeMo | 128k | 20 | $0.15 | 141.3 | 0.29 | 3.83 | N/A | |
![]() Mistral NeMo Fast | 128k | 20 | $0.12 | 159.1 | 0.50 | 3.64 | N/A | ||
![]() Mistral NeMo Base | 128k | 20 | $0.06 | 38.1 | 0.60 | 13.71 | N/A | ||
![]() Mistral NeMo | 128k | 20 | $0.06 | 62.3 | 0.21 | 8.23 | N/A | ||
Llama 3.2 3B (FP8) | 128k | 20 | $0.02 | 224.5 | 0.22 | 2.45 | N/A | ||
Llama 3.2 3B | 128k | 20 | $0.10 | 93.3 | 1.03 | 6.39 | N/A | ||
![]() | Llama 3.2 3B | 128k | 20 | $0.15 | 71.7 | 0.47 | 7.44 | N/A | |
Llama 3.2 3B Base | 128k | 20 | $0.01 | 128.3 | 0.50 | 4.40 | N/A | ||
Llama 3.2 3B | 128k | 20 | $0.02 | 107.4 | 0.18 | 4.83 | N/A | ||
![]() | Llama 3.2 3B | 32k | 20 | $0.04 | 88.9 | 0.65 | 6.27 | N/A | |
![]() | Llama 3.2 3B | 8k | 20 | $0.10 | 1,589.0 | 0.19 | 0.51 | N/A | |
Llama 3.2 3B Turbo | 128k | 20 | $0.06 | 164.5 | 0.28 | 3.32 | N/A | ||
![]() DeepSeek R1 Distill Qwen 1.5B | 128k | 19 | $0.18 | 384.8 | 0.23 | 6.72 | 5.20 | ||
![]() | ![]() Jamba 1.5 Mini | 256k | 18 | $0.25 | 82.4 | 0.46 | 6.53 | N/A | |
![]() Jamba 1.6 Mini | 256k | 18 | $0.25 | 175.4 | 0.31 | 3.16 | N/A | ||
![]() | ![]() Mixtral 8x7B | 33k | 17 | $0.70 | 87.2 | 0.35 | 6.09 | N/A | |
![]() | ![]() Mixtral 8x7B | 33k | 17 | $0.51 | 90.3 | 0.33 | 5.86 | N/A | |
![]() Mixtral 8x7B Fast | 33k | 17 | $0.23 | 126.0 | 0.50 | 4.47 | N/A | ||
![]() Mixtral 8x7B Base | 33k | 17 | $0.12 | 98.4 | 0.49 | 5.57 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.24 | 100.8 | 0.21 | 5.17 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.60 | 73.8 | 0.30 | 7.08 | N/A | ||
Qwen3 0.6B | 33k | 17 | $0.19 | 216.7 | 0.92 | 3.23 | N/A | ||
![]() Aya Expanse 8B | 8k | 16 | $0.75 | 166.9 | 0.14 | 3.14 | N/A | ||
![]() | ![]() Command-R | 128k | 15 | $0.75 | 108.8 | 0.34 | 4.94 | N/A | |
![]() Command-R | 128k | 15 | $0.26 | 67.6 | 0.20 | 7.59 | N/A | ||
![]() | ![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 108.9 | 0.33 | 4.92 | N/A | |
![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 163.2 | 0.17 | 3.23 | N/A | ||
![]() | ![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 46.2 | 0.50 | 11.32 | N/A | |
![]() | ![]() Codestral-Mamba | 256k | 14 | $0.25 | 93.9 | 0.45 | 5.78 | N/A | |
![]() | ![]() Mistral 7B | 8k | 10 | $0.25 | 105.0 | 0.27 | 5.03 | N/A | |
![]() | ![]() Mistral 7B | 8k | 10 | $0.16 | 93.3 | 0.29 | 5.65 | N/A | |
![]() Mistral 7B | 8k | 10 | $0.04 | 101.3 | 0.47 | 5.41 | N/A | ||
![]() | ![]() Mistral 7B | 32k | 10 | $0.04 | 121.6 | 0.81 | 4.92 | N/A | |
![]() Mistral 7B | 8k | 10 | $0.20 | 180.3 | 0.14 | 2.91 | N/A | ||
![]() | Llama 3.2 1B | 128k | 10 | $0.10 | 117.7 | 0.45 | 4.70 | N/A | |
Llama 3.2 1B Base | 128k | 10 | $0.01 | 25.0 | 0.57 | 20.59 | N/A | ||
Llama 3.2 1B | 128k | 10 | $0.01 | 137.5 | 0.27 | 3.91 | N/A | ||
![]() | Llama 3.2 1B | 16k | 10 | $0.05 | 2,615.4 | 0.18 | 0.37 | N/A | |
![]() | Llama 3.2 11B (Vision) | 128k | $0.16 | 143.3 | 0.48 | 3.97 | N/A | ||
Llama 3.2 11B (Vision) | 128k | $0.06 | 54.5 | 0.22 | 9.39 | N/A | |||
Llama 3.2 11B (Vision) Turbo | 128k | $0.18 | 117.2 | 0.14 | 4.41 | N/A | |||
![]() | ![]() Mistral Saba | 32k | $0.30 | 92.4 | 0.30 | 5.71 | N/A | ||
![]() Sonar Reasoning | 127k | $2.00 | 88.9 | 1.64 | 29.76 | 22.49 | |||
Grok 3 mini Reasoning (low) | 131k | $0.35 | 104.6 | 0.42 | 24.31 | 19.12 | |||
Grok 3 mini Reasoning (low) Fast | 131k | $1.45 | 172.2 | 0.47 | 14.99 | 11.61 | |||
![]() | ![]() Reka Flash | 128k | $0.35 | 46.2 | 0.89 | 11.70 | N/A | ||
![]() | ![]() Reka Core | 128k | $2.00 | 27.1 | 0.86 | 19.31 | N/A | ||
![]() | ![]() Reka Flash (Feb '24) | 128k | $0.35 | 46.0 | 0.89 | 11.77 | N/A | ||
![]() | ![]() Reka Edge | 128k | $0.10 | 84.7 | 0.82 | 6.72 | N/A | ||
o1-preview | 128k | $26.25 | 168.6 | 19.06 | 22.02 | N/A | |||
![]() | o1-preview | 128k | $28.88 | 173.1 | 16.77 | 19.66 | N/A | ||
GPT-4o (Aug '24) | 128k | $4.38 | 140.6 | 0.45 | 4.00 | N/A | |||
![]() | GPT-4o (Aug '24) | 128k | $4.38 | 141.0 | 0.54 | 4.09 | N/A | ||
GPT-4 Turbo | 128k | $15.00 | 43.0 | 0.79 | 12.42 | N/A | |||
![]() | GPT-4 Turbo | 128k | $15.00 | 54.1 | 1.43 | 10.67 | N/A | ||
GPT-3.5 Turbo | 4k | $0.75 | 134.0 | 0.41 | 4.14 | N/A | |||
GPT-4 | 8k | $37.50 | 30.0 | 0.86 | 17.53 | N/A | |||
GPT-4.5 (Preview) | 128k | $93.75 | 73.3 | 1.01 | 7.83 | N/A | |||
Gemini 2.0 Flash-Lite (Preview) (AI Studio) | 1m | $0.13 | 208.8 | 0.27 | 2.66 | N/A | |||
Gemma 2 27B Fast | 8k | $0.26 | 88.3 | 0.54 | 6.20 | N/A | |||
Gemma 2 27B Base | 8k | $0.15 | 50.0 | 0.58 | 10.58 | N/A | |||
Gemma 2 27B | 8k | $0.80 | 89.1 | 0.24 | 5.85 | N/A | |||
![]() | Claude 3.5 Sonnet (June) | 200k | $6.00 | 49.9 | 0.85 | 10.88 | N/A | ||
Claude 3.5 Sonnet (June) Vertex | 200k | $6.00 | 78.3 | 0.71 | 7.10 | N/A | |||
Claude 3.5 Sonnet (June) | 200k | $6.00 | 78.4 | 0.65 | 7.03 | N/A | |||
![]() | Claude 3 Haiku | 200k | $0.50 | 108.1 | 0.49 | 5.11 | N/A | ||
Claude 3 Haiku | 200k | $0.50 | 139.7 | 0.45 | 4.03 | N/A | |||
![]() | Claude Instant | 100k | $1.20 | 62.6 | 0.55 | 8.53 | N/A | ||
Claude 2.0 | 100k | $12.00 | 30.8 | 0.95 | 17.18 | N/A | |||
![]() DeepSeek Coder V2 Lite Fast, FP8 | 128k | $0.12 | 93.7 | 0.49 | 5.83 | N/A | |||
![]() DeepSeek Coder V2 Lite Base, FP8 | 128k | $0.06 | 104.3 | 0.53 | 5.32 | N/A | |||
![]() OpenChat 3.5 | 8k | $0.06 | 58.4 | 0.21 | 8.78 | N/A | |||
![]() | ![]() Solar Mini | 4k | $0.15 | 86.3 | 1.07 | 6.86 | N/A | ||
Qwen1.5 Chat 110B | 32k | $0.00 | 23.7 | 1.54 | 22.65 | N/A |