LLM API Providers Leaderboard - Comparison of over 100 LLM endpoints
Comparison and ranking of API provider performance for over 100 AI LLM Model endpoints across performance key metrics including price, output speed, latency, context window & others. For more details including relating to our methodology, see our FAQs.
API providers compared: OpenAI, Playground AI, Microsoft Azure, Ideogram, Mistral, Amazon Bedrock, DeepSeek, Hyperbolic, Groq, FriendliAI, Together.ai, Anthropic, Black Forest Labs, Perplexity, Google, Lambda Labs, Fireworks, Cerebras, Leonardo.Ai, Cohere, Recraft AI, Upstage, Simplismart, Speechmatics, Fish Audio, Deepinfra, , Replicate, Genmo, Nebius, Adobe, MiniMax, CentML, StepFun, Runpod, Zyphra, Murf AI, Speechify, Rev AI, AssemblyAI, fal.ai, Rime, kluster.ai, Prodia, Reka AI, Hume AI, Deepgram, Gladia, Stability.ai, Baseten, Midjourney, Reve, Databricks, ElevenLabs, Vivago AI, IBM, SambaNova, xAI, Cartesia, LMNT, PlayAI, 01.AI, Alibaba Cloud, Novita, AI21 Labs, and WaveSpeed.
Features | Model Intelligence | Price | Output tokens/s | Latency | End-to-End Response Time | ||||
---|---|---|---|---|---|---|---|---|---|
Further Analysis | |||||||||
Gemini 2.5 Pro Experimental | 1m | 68 | $3.44 | 204.3 | 26.77 | 29.22 | N/A | ||
o3-mini (high) | 200k | 66 | $1.93 | 181.2 | 43.60 | 46.36 | N/A | ||
![]() | o3-mini (high) | 200k | 66 | $1.93 | 127.1 | 54.70 | 58.64 | N/A | |
o3-mini | 200k | 63 | $1.93 | 179.4 | 12.10 | 14.89 | N/A | ||
![]() | o3-mini | 200k | 63 | $1.93 | 139.8 | 16.09 | 19.66 | N/A | |
o1 | 200k | 62 | $26.25 | 66.7 | 42.27 | 49.77 | N/A | ||
![]() | o1 | 200k | 62 | $26.25 | 112.7 | 25.95 | 30.39 | N/A | |
![]() DeepSeek R1 | 164k | 60 | $0.95 | 35.5 | 0.54 | 80.82 | 66.18 | ||
![]() | ![]() DeepSeek R1 | 64k | 60 | $0.96 | 22.5 | 3.07 | 129.40 | 104.14 | |
![]() DeepSeek R1 | 128k | 60 | $2.00 | 27.0 | 1.07 | 106.71 | 87.08 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $2.36 | 46.6 | 0.45 | 61.60 | 50.40 | |
![]() DeepSeek R1 Base | 128k | 60 | $1.20 | 26.3 | 0.65 | 108.96 | 89.29 | ||
![]() DeepSeek R1 Fast | 128k | 60 | $3.00 | 79.4 | 0.68 | 36.53 | 29.56 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $3.99 | 71.3 | 0.58 | 40.52 | 32.93 | |
![]() | ![]() DeepSeek R1 | 128k | 60 | $2.36 | 54.1 | 0.57 | 53.23 | 43.41 | |
![]() DeepSeek R1 (Fast) | 164k | 60 | $4.25 | 111.6 | 0.83 | 26.33 | 21.03 | ||
![]() DeepSeek R1 (Turbo, FP4) | 33k | 60 | $1.50 | 148.0 | 0.24 | 19.47 | 15.85 | ||
![]() DeepSeek R1 | 64k | 60 | $0.96 | 12.3 | 0.60 | 232.20 | 190.92 | ||
![]() DeepSeek R1 | 128k | 60 | $4.00 | 53.1 | 0.55 | 54.18 | 44.21 | ||
![]() | ![]() DeepSeek R1 Turbo | 64k | 60 | $1.15 | 28.2 | 0.94 | 101.89 | 83.22 | |
![]() | ![]() DeepSeek R1 | 64k | 60 | $4.00 | 29.6 | 0.87 | 96.90 | 79.16 | |
![]() | ![]() DeepSeek R1 | 16k | 60 | $5.50 | 192.1 | 0.97 | 15.79 | 12.22 | |
![]() DeepSeek R1 | 128k | 60 | $4.00 | 107.3 | 0.62 | 27.14 | 21.86 | ||
![]() | ![]() DeepSeek R1 | 128k | 60 | $7.00 | 29.0 | 0.62 | 98.89 | 81.01 | |
QwQ-32B | 131k | 58 | $0.20 | 32.2 | 1.22 | 94.16 | 77.40 | ||
QwQ-32B Base | 131k | 58 | $0.23 | 56.5 | 0.55 | 53.45 | 44.05 | ||
![]() | QwQ-32B | 131k | 58 | $0.65 | 83.9 | 0.50 | 36.15 | 29.70 | |
QwQ-32B | 131k | 58 | $0.90 | 116.6 | 0.58 | 26.23 | 21.36 | ||
QwQ-32B | 131k | 58 | $0.14 | 31.6 | 0.39 | 95.11 | 78.89 | ||
QwQ-32B | 131k | 58 | $0.32 | 403.5 | 0.16 | 7.57 | 6.17 | ||
![]() | QwQ-32B | 16k | 58 | $0.63 | 402.7 | 0.93 | 8.36 | 6.19 | |
QwQ-32B | 131k | 58 | $1.20 | 81.5 | 0.50 | 37.22 | 30.58 | ||
o1-mini | 128k | 54 | $1.93 | 210.6 | 9.54 | 11.92 | N/A | ||
![]() | o1-mini | 128k | 54 | $2.12 | 247.8 | 9.50 | 11.52 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 64k | 53 | $0.48 | 24.6 | 3.37 | 23.73 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.45 | 57.0 | 1.10 | 9.87 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.25 | 28.1 | 0.97 | 18.79 | N/A | ||
![]() DeepSeek V3 (Mar' 25) Fast | 128k | 53 | $3.00 | 92.5 | 0.65 | 6.05 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $0.75 | 34.0 | 0.62 | 15.33 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $0.80 | 76.8 | 0.55 | 7.06 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 160k | 53 | $0.90 | 59.9 | 0.93 | 9.27 | N/A | ||
![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $0.52 | 8.5 | 1.01 | 60.18 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $0.63 | 28.9 | 0.85 | 18.12 | N/A | |
![]() | ![]() DeepSeek V3 (Mar' 25) | 8k | 53 | $1.13 | 265.0 | 0.68 | 2.57 | N/A | |
![]() DeepSeek V3 (Mar' 25) | 128k | 53 | $1.25 | 33.1 | 2.63 | 17.76 | N/A | ||
![]() | ![]() DeepSeek V3 (Mar' 25) | 164k | 53 | $1.25 | 16.9 | 0.75 | 30.35 | N/A | |
GPT-4.1 mini | 1m | 53 | $0.70 | 229.2 | 0.41 | 2.59 | N/A | ||
GPT-4.1 | 1m | 53 | $3.50 | 132.3 | 0.40 | 4.18 | N/A | ||
![]() | GPT-4.1 | 1m | 53 | $3.50 | 121.5 | 0.75 | 4.86 | N/A | |
![]() DeepSeek R1 Distill Qwen 32B | 128k | 52 | $0.14 | 44.4 | 0.26 | 56.61 | 45.08 | ||
![]() | ![]() DeepSeek R1 Distill Qwen 32B | 64k | 52 | $0.30 | 20.9 | 1.12 | 120.91 | 95.83 | |
![]() DeepSeek R1 Distill Qwen 32B | 128k | 52 | $0.69 | 383.6 | 0.36 | 6.88 | 5.21 | ||
Llama 4 Maverick (FP8) | 1m | 51 | $0.30 | 132.2 | 0.42 | 4.20 | N/A | ||
Llama 4 Maverick Vertex | 524k | 51 | $0.00 | 125.6 | 0.35 | 4.33 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.20 | 132.0 | 0.44 | 4.22 | N/A | |
Llama 4 Maverick | 131k | 51 | $0.39 | 156.9 | 0.28 | 3.47 | N/A | ||
Llama 4 Maverick (FP8) | 131k | 51 | $0.30 | 107.5 | 0.51 | 5.17 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.36 | 69.2 | 0.63 | 7.85 | N/A | |
Llama 4 Maverick | 128k | 51 | $0.30 | 275.8 | 0.34 | 2.15 | N/A | ||
![]() | Llama 4 Maverick | 8k | 51 | $0.92 | 799.3 | 1.01 | 1.63 | N/A | |
Llama 4 Maverick (FP8) | 524k | 51 | $0.41 | 114.8 | 0.27 | 4.63 | N/A | ||
![]() | Llama 4 Maverick (FP8) | 1m | 51 | $0.35 | 107.2 | 0.51 | 5.17 | N/A | |
GPT-4o (March 2025) | 128k | 50 | $7.50 | 184.3 | 0.39 | 3.10 | N/A | ||
Gemini 2.0 Pro Experimental (AI Studio) | 2m | 49 | $0.00 | 198.9 | 27.21 | 29.72 | N/A | ||
![]() | ![]() DeepSeek R1 Distill Qwen 14B | 64k | 49 | $0.15 | 38.2 | 0.77 | 66.22 | 52.36 | |
![]() DeepSeek R1 Distill Qwen 14B | 128k | 49 | $1.60 | 136.4 | 0.38 | 18.70 | 14.66 | ||
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.30 | 64.1 | 0.43 | 39.44 | 31.21 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 66k | 48 | $0.94 | 2,499.9 | 0.24 | 1.24 | 0.80 | |
![]() DeepSeek R1 Distill Llama 70B Base | 128k | 48 | $0.38 | 57.3 | 0.60 | 44.23 | 34.90 | ||
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.34 | 31.6 | 0.29 | 79.49 | 63.36 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 32k | 48 | $0.39 | 46.4 | 0.96 | 54.84 | 43.10 | |
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.81 | 299.4 | 0.33 | 8.68 | 6.68 | ||
![]() | ![]() DeepSeek R1 Distill Llama 70B | 16k | 48 | $0.88 | 308.5 | 1.80 | 9.90 | 6.48 | |
![]() DeepSeek R1 Distill Llama 70B | 128k | 48 | $2.00 | 122.7 | 0.33 | 20.71 | 16.30 | ||
![]() | Claude 3.7 Sonnet | 200k | 48 | $6.00 | 37.9 | 1.01 | 14.22 | N/A | |
Claude 3.7 Sonnet | 200k | 48 | $6.00 | 77.0 | 0.89 | 7.39 | N/A | ||
Gemini 2.0 Flash Vertex | 1m | 48 | $0.26 | 240.4 | 0.29 | 2.37 | N/A | ||
Gemini 2.0 Flash (AI Studio) | 1m | 48 | $0.17 | 246.6 | 0.35 | 2.38 | N/A | ||
![]() | ![]() Reka Flash 3 | 128k | 47 | $0.35 | 55.7 | 0.94 | 45.85 | 35.92 | |
Gemini 2.0 Flash (exp) (AI Studio) | 1m | 46 | $0.00 | 244.8 | 0.29 | 2.33 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) | 66k | 46 | $0.48 | 24.1 | 3.26 | 24.03 | N/A | |
![]() DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $0.25 | 27.8 | 1.10 | 19.09 | N/A | ||
![]() DeepSeek V3 (Dec '24) | 128k | 46 | $0.75 | 25.5 | 0.66 | 20.30 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) | 128k | 46 | $2.00 | 86.0 | 0.50 | 6.31 | N/A | |
![]() DeepSeek V3 (Dec '24) | 128k | 46 | $1.31 | 66.9 | 0.81 | 8.27 | N/A | ||
![]() DeepSeek V3 (Dec '24) | 64k | 46 | $0.59 | 19.4 | 0.76 | 26.59 | N/A | ||
![]() | ![]() DeepSeek V3 (Dec '24) Turbo | 64k | 46 | $0.63 | 29.9 | 0.86 | 17.56 | N/A | |
![]() | ![]() DeepSeek V3 (Dec '24) | 64k | 46 | $0.89 | 29.8 | 0.84 | 17.63 | N/A | |
![]() DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $1.25 | 34.7 | 2.52 | 16.91 | N/A | ||
Qwen2.5 Max | 32k | 45 | $2.80 | 52.1 | 1.25 | 10.84 | N/A | ||
Gemini 1.5 Pro (Sep) (Vertex) | 2m | 45 | $2.19 | 96.6 | 0.50 | 5.68 | N/A | ||
Gemini 1.5 Pro (Sep) (AI Studio) | 2m | 45 | $2.19 | 95.9 | 0.48 | 5.70 | N/A | ||
Claude 3.5 Sonnet (Oct) Vertex | 200k | 44 | $6.00 | 79.1 | 0.92 | 7.25 | N/A | ||
Claude 3.5 Sonnet (Oct) | 200k | 44 | $6.00 | 79.4 | 0.90 | 7.20 | N/A | ||
![]() Sonar | 127k | 43 | $1.00 | 70.6 | 2.11 | 9.19 | N/A | ||
Llama 4 Scout | 1m | 43 | $0.15 | 121.1 | 0.40 | 4.54 | N/A | ||
![]() | Llama 4 Scout | 32k | 43 | $0.70 | 2,562.7 | 0.31 | 0.51 | N/A | |
Llama 4 Scout Vertex | 1m | 43 | $0.00 | 131.6 | 0.35 | 4.16 | N/A | ||
![]() | Llama 4 Scout | 1m | 43 | $0.10 | 113.0 | 0.45 | 4.88 | N/A | |
Llama 4 Scout | 128k | 43 | $0.26 | 138.6 | 0.33 | 3.94 | N/A | ||
Llama 4 Scout | 131k | 43 | $0.15 | 106.4 | 0.26 | 4.96 | N/A | ||
![]() | Llama 4 Scout | 131k | 43 | $0.20 | 75.5 | 0.65 | 7.27 | N/A | |
Llama 4 Scout | 131k | 43 | $0.17 | 604.7 | 0.34 | 1.16 | N/A | ||
![]() | Llama 4 Scout | 8k | 43 | $0.47 | 731.5 | 0.75 | 1.43 | N/A | |
Llama 4 Scout | 328k | 43 | $0.28 | 116.4 | 0.19 | 4.48 | N/A | ||
![]() | Llama 4 Scout | 128k | 43 | $0.71 | 55.2 | 0.52 | 9.57 | N/A | |
![]() Sonar Pro | 200k | 43 | $6.00 | 59.1 | 2.76 | 11.22 | N/A | ||
QwQ 32B-Preview | 33k | 43 | $0.20 | 55.2 | 0.87 | 46.16 | 36.24 | ||
QwQ 32B-Preview | 33k | 43 | $0.26 | 45.5 | 0.25 | 55.22 | 43.98 | ||
QwQ 32B-Preview | 33k | 43 | $1.20 | 74.9 | 0.51 | 33.89 | 26.71 | ||
GPT-4o (Nov '24) | 128k | 41 | $4.38 | 129.1 | 0.54 | 4.42 | N/A | ||
![]() | GPT-4o (Nov '24) | 128k | 41 | $4.38 | 118.7 | 1.27 | 5.48 | N/A | |
Gemini 2.0 Flash-Lite (Feb '25) (AI Studio) | 1m | 41 | $0.13 | 206.4 | 0.27 | 2.69 | N/A | ||
Llama 3.3 70B (FP8) | 128k | 41 | $0.17 | 39.0 | 0.51 | 13.34 | N/A | ||
![]() | Llama 3.3 70B | 33k | 41 | $0.94 | 2,581.6 | 0.24 | 0.43 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.40 | 57.0 | 1.05 | 9.83 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.71 | 137.7 | 0.59 | 4.22 | N/A | |
Llama 3.3 70B Fast | 128k | 41 | $0.38 | 136.2 | 0.55 | 4.22 | N/A | ||
Llama 3.3 70B Base | 128k | 41 | $0.20 | 33.7 | 0.67 | 15.53 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.50 | 145.1 | 0.48 | 3.93 | N/A | |
![]() | Llama 3.3 70B | 128k | 41 | $0.71 | 47.4 | 0.44 | 10.99 | N/A | |
Llama 3.3 70B | 128k | 41 | $0.90 | 179.3 | 0.49 | 3.28 | N/A | ||
Llama 3.3 70B (Turbo, FP8) | 128k | 41 | $0.20 | 32.3 | 0.57 | 16.04 | N/A | ||
Llama 3.3 70B | 128k | 41 | $0.27 | 27.9 | 0.38 | 18.32 | N/A | ||
Llama 3.3 70B | 128k | 41 | $0.60 | 149.8 | 0.45 | 3.78 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.39 | 40.1 | 0.61 | 13.09 | N/A | |
Llama 3.3 70B (Spec decoding) | 8k | 41 | $0.69 | 1,615.0 | 0.42 | 0.73 | N/A | ||
Llama 3.3 70B | 128k | 41 | $0.64 | 304.9 | 0.37 | 2.01 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.75 | 470.9 | 0.30 | 1.36 | N/A | |
Llama 3.3 70B Turbo | 128k | 41 | $0.88 | 116.1 | 0.32 | 4.63 | N/A | ||
![]() | Llama 3.3 70B | 128k | 41 | $0.70 | 41.3 | 0.61 | 12.72 | N/A | |
GPT-4.1 nano | 1m | 41 | $0.17 | 292.7 | 0.88 | 2.59 | N/A | ||
GPT-4o (May '24) | 128k | 41 | $7.50 | 102.9 | 0.54 | 5.40 | N/A | ||
![]() | GPT-4o (May '24) | 128k | 41 | $7.50 | 101.4 | 0.85 | 5.79 | N/A | |
Llama 3.1 405B (FP8) | 128k | 40 | $0.80 | 34.7 | 0.62 | 15.02 | N/A | ||
Llama 3.1 405B | 128k | 40 | $9.50 | 19.1 | 1.00 | 27.12 | N/A | ||
Llama 3.1 405B | 128k | 40 | $4.00 | 71.4 | 0.85 | 7.85 | N/A | ||
![]() | Llama 3.1 405B Standard | 128k | 40 | $2.40 | 30.9 | 1.83 | 17.99 | N/A | |
![]() | Llama 3.1 405B Latency Optimized | 128k | 40 | $3.00 | 29.2 | 2.12 | 19.23 | N/A | |
Llama 3.1 405B Base | 128k | 40 | $1.50 | 31.9 | 0.70 | 16.35 | N/A | ||
Llama 3.1 405B Vertex | 128k | 40 | $7.75 | 30.0 | 0.40 | 17.09 | N/A | ||
![]() | Llama 3.1 405B | 128k | 40 | $8.00 | 30.7 | 0.46 | 16.75 | N/A | |
Llama 3.1 405B | 128k | 40 | $3.00 | 91.0 | 0.52 | 6.01 | N/A | ||
Llama 3.1 405B | 33k | 40 | $0.90 | 20.6 | 0.70 | 25.00 | N/A | ||
![]() | Llama 3.1 405B | 16k | 40 | $6.25 | 183.7 | 1.26 | 3.98 | N/A | |
Llama 3.1 405B | 128k | 40 | $7.50 | 37.0 | 0.69 | 14.19 | N/A | ||
Llama 3.1 405B Turbo | 128k | 40 | $3.50 | 85.3 | 0.39 | 6.25 | N/A | ||
![]() | Llama 3.1 405B | 128k | 40 | $3.50 | 17.5 | 1.12 | 29.69 | N/A | |
Qwen2.5 72B | 131k | 40 | $0.40 | 19.7 | 1.63 | 27.06 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.20 | 24.0 | 0.71 | 21.53 | N/A | ||
Qwen2.5 72B Fast | 131k | 40 | $0.38 | 67.0 | 0.58 | 8.05 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.90 | 43.6 | 0.39 | 11.86 | N/A | ||
Qwen2.5 72B | 33k | 40 | $0.27 | 40.5 | 0.28 | 12.63 | N/A | ||
![]() | Qwen2.5 72B | 16k | 40 | $0.94 | 322.9 | 0.79 | 2.34 | N/A | |
Qwen2.5 72B Turbo | 131k | 40 | $1.20 | 93.6 | 0.40 | 5.74 | N/A | ||
Qwen2.5 72B | 131k | 40 | $0.00 | 62.8 | 1.19 | 9.14 | N/A | ||
![]() | ![]() MiniMax-Text-01 | 1m | 40 | $0.42 | 39.5 | 0.88 | 13.55 | N/A | |
Phi-4 | 16k | 40 | $0.15 | 118.3 | 0.50 | 4.72 | N/A | ||
![]() | Phi-4 | 16k | 40 | $0.22 | 44.2 | 0.47 | 11.79 | N/A | |
Phi-4 | 16k | 40 | $0.09 | 43.3 | 0.36 | 11.91 | N/A | ||
![]() Command A | 256k | 40 | $4.38 | 57.6 | 0.26 | 8.94 | N/A | ||
Gemini 1.5 Flash (Sep) (Vertex) | 1m | 39 | $0.13 | 191.8 | 0.22 | 2.82 | N/A | ||
Gemini 1.5 Flash (Sep) (AI Studio) | 1m | 39 | $0.13 | 158.2 | 0.31 | 3.47 | N/A | ||
![]() | ![]() Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 36.8 | 0.42 | 14.01 | N/A | |
![]() | ![]() Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 36.5 | 0.53 | 14.22 | N/A | |
Gemma 3 27B | 128k | 38 | $0.07 | 33.7 | 0.61 | 15.44 | N/A | ||
Grok Beta | 128k | 38 | $7.50 | 66.7 | 0.29 | 7.79 | N/A | ||
![]() | ![]() Pixtral Large | 128k | 37 | $3.00 | 32.2 | 0.44 | 15.95 | N/A | |
Qwen2.5 Instruct 32B Fast | 128k | 37 | $0.20 | 80.4 | 0.56 | 6.77 | N/A | ||
Qwen2.5 Instruct 32B Base | 128k | 37 | $0.10 | 59.1 | 0.57 | 9.03 | N/A | ||
Qwen2.5 Instruct 32B | 128k | 37 | $0.79 | 393.8 | 0.24 | 1.51 | N/A | ||
Llama 3.1 Nemotron 70B (FP8) | 128k | 37 | $0.17 | 38.8 | 0.55 | 13.43 | N/A | ||
Llama 3.1 Nemotron 70B Base | 128k | 37 | $0.20 | 40.7 | 0.63 | 12.91 | N/A | ||
Llama 3.1 Nemotron 70B Fast | 128k | 37 | $0.38 | 71.9 | 0.55 | 7.51 | N/A | ||
Llama 3.1 Nemotron 70B | 128k | 37 | $0.27 | 35.1 | 0.57 | 14.83 | N/A | ||
![]() | ![]() Nova Pro | 300k | 37 | $1.40 | 103.9 | 0.37 | 5.18 | N/A | |
![]() | ![]() Nova Pro Latency Optimized | 300k | 37 | $1.75 | 106.6 | 0.40 | 5.09 | N/A | |
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 37.7 | 0.49 | 13.76 | N/A | |
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 35.0 | 0.46 | 14.76 | N/A | |
![]() | ![]() Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 36.3 | 0.52 | 14.31 | N/A | |
Qwen2.5 Coder 32B | 33k | 36 | $0.09 | 64.9 | 0.49 | 8.19 | N/A | ||
Qwen2.5 Coder 32B | 131k | 36 | $0.20 | 14.3 | 0.91 | 35.86 | N/A | ||
Qwen2.5 Coder 32B | 33k | 36 | $0.90 | 67.3 | 0.34 | 7.77 | N/A | ||
Qwen2.5 Coder 32B | 33k | 36 | $0.10 | 48.3 | 0.36 | 10.71 | N/A | ||
Qwen2.5 Coder 32B | 131k | 36 | $0.79 | 387.7 | 0.37 | 1.66 | N/A | ||
![]() | Qwen2.5 Coder 32B | 16k | 36 | $0.63 | 357.1 | 0.75 | 2.15 | N/A | |
Qwen2.5 Coder 32B | 131k | 36 | $0.80 | 87.6 | 0.42 | 6.13 | N/A | ||
GPT-4o mini | 128k | 36 | $0.26 | 72.8 | 0.35 | 7.22 | N/A | ||
![]() | GPT-4o mini | 128k | 36 | $0.26 | 163.2 | 0.87 | 3.94 | N/A | |
Llama 3.1 70B (FP8) | 128k | 35 | $0.17 | 38.9 | 0.51 | 13.36 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.40 | 55.3 | 1.25 | 10.29 | N/A | ||
![]() | Llama 3.1 70B Standard | 128k | 35 | $0.72 | 31.6 | 0.64 | 16.49 | N/A | |
![]() | Llama 3.1 70B Latency Optimized | 128k | 35 | $0.90 | 31.6 | 0.83 | 16.67 | N/A | |
Llama 3.1 70B Base | 128k | 35 | $0.20 | 38.9 | 0.76 | 13.61 | N/A | ||
Llama 3.1 70B Fast | 128k | 35 | $0.38 | 150.2 | 0.55 | 3.87 | N/A | ||
Llama 3.1 70B Vertex | 128k | 35 | $0.00 | 72.0 | 0.28 | 7.23 | N/A | ||
![]() | Llama 3.1 70B | 128k | 35 | $2.90 | 54.4 | 0.45 | 9.65 | N/A | |
Llama 3.1 70B | 128k | 35 | $0.90 | 157.3 | 0.40 | 3.58 | N/A | ||
Llama 3.1 70B (Turbo, FP8) | 128k | 35 | $0.20 | 33.2 | 0.43 | 15.51 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.27 | 34.4 | 0.28 | 14.82 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.60 | 203.8 | 0.46 | 2.91 | N/A | ||
![]() | Llama 3.1 70B | 32k | 35 | $0.35 | 19.5 | 1.24 | 26.93 | N/A | |
![]() | Llama 3.1 70B | 128k | 35 | $0.75 | 474.5 | 3.55 | 4.60 | N/A | |
Llama 3.1 70B Turbo | 128k | 35 | $0.88 | 114.5 | 0.48 | 4.85 | N/A | ||
Llama 3.1 70B | 128k | 35 | $0.90 | 127.4 | 0.51 | 4.44 | N/A | ||
![]() | ![]() Mistral Small 3.1 | 128k | 35 | $0.15 | 108.5 | 0.32 | 4.93 | N/A | |
![]() Mistral Small 3.1 Vertex | 128k | 35 | $0.15 | 208.8 | 0.17 | 2.57 | N/A | ||
![]() | ![]() Mistral Small 3 | 32k | 35 | $0.15 | 138.2 | 0.39 | 4.01 | N/A | |
![]() Mistral Small 3 | 32k | 35 | $0.09 | 70.1 | 0.22 | 7.35 | N/A | ||
![]() Mistral Small 3 | 32k | 35 | $0.80 | 96.0 | 0.24 | 5.44 | N/A | ||
![]() | Claude 3 Opus | 200k | 35 | $30.00 | 25.6 | 1.28 | 20.80 | N/A | |
Claude 3 Opus Vertex | 200k | 35 | $30.00 | 28.3 | 1.11 | 18.80 | N/A | ||
Claude 3 Opus | 200k | 35 | $30.00 | 28.5 | 1.13 | 18.68 | N/A | ||
![]() | Claude 3.5 Haiku Latency Optimized | 200k | 35 | $2.00 | 57.6 | 1.30 | 9.98 | N/A | |
Claude 3.5 Haiku Vertex | 200k | 35 | $1.60 | 65.5 | 0.60 | 8.23 | N/A | ||
Claude 3.5 Haiku | 200k | 35 | $1.60 | 65.7 | 4.34 | 11.95 | N/A | ||
![]() | ![]() DeepSeek R1 Distill Llama 8B | 32k | 34 | $0.04 | 53.8 | 0.67 | 47.13 | 37.17 | |
Gemma 3 12B | 128k | 34 | $0.06 | 42.1 | 0.50 | 12.39 | N/A | ||
Gemini 1.5 Pro (May) (Vertex) | 2m | 34 | $2.19 | 65.7 | 0.38 | 7.99 | N/A | ||
Gemini 1.5 Pro (May) (AI Studio) | 2m | 34 | $2.19 | 64.8 | 0.47 | 8.18 | N/A | ||
Qwen Turbo | 1m | 34 | $0.09 | 110.3 | 1.09 | 5.62 | N/A | ||
![]() | Llama 3.2 90B (Vision) | 128k | 33 | $0.72 | 60.5 | 0.51 | 8.78 | N/A | |
Llama 3.2 90B (Vision) Vertex | 128k | 33 | $0.00 | 33.1 | 0.18 | 15.30 | N/A | ||
Llama 3.2 90B (Vision) | 128k | 33 | $0.90 | 38.0 | 0.41 | 13.57 | N/A | ||
Llama 3.2 90B (Vision) | 33k | 33 | $0.36 | 35.4 | 0.29 | 14.42 | N/A | ||
Llama 3.2 90B (Vision) | 8k | 33 | $0.90 | 267.5 | 0.32 | 2.19 | N/A | ||
Llama 3.2 90B (Vision) Turbo | 128k | 33 | $1.20 | 32.1 | 0.23 | 15.80 | N/A | ||
Qwen2 72B | 33k | 33 | $0.90 | 41.9 | 0.40 | 12.32 | N/A | ||
Qwen2 72B | 131k | 33 | $0.00 | 46.1 | 1.05 | 11.89 | N/A | ||
![]() | ![]() Nova Lite | 300k | 33 | $0.10 | 278.4 | 0.34 | 2.14 | N/A | |
Gemini 1.5 Flash-8B AI Studio | 1m | 31 | $0.07 | 287.1 | 0.22 | 1.96 | N/A | ||
![]() Jamba 1.5 Large | 256k | 29 | $3.50 | 65.9 | 0.56 | 8.15 | N/A | ||
![]() | ![]() Jamba 1.5 Large | 256k | 29 | $3.50 | 51.3 | 0.66 | 10.41 | N/A | |
![]() Jamba 1.6 Large | 256k | 29 | $3.50 | 57.8 | 0.52 | 9.16 | N/A | ||
Gemini 1.5 Flash (May) (Vertex) | 1m | 28 | $0.13 | 313.4 | 0.30 | 1.89 | N/A | ||
Gemini 1.5 Flash (May) (AI Studio) | 1m | 28 | $0.13 | 315.1 | 0.23 | 1.82 | N/A | ||
![]() | ![]() Nova Micro | 130k | 28 | $0.06 | 319.5 | 0.30 | 1.86 | N/A | |
![]() Yi-Large | 32k | 28 | $3.00 | 69.1 | 0.44 | 7.67 | N/A | ||
![]() | Claude 3 Sonnet | 200k | 28 | $6.00 | 63.1 | 0.75 | 8.67 | N/A | |
Claude 3 Sonnet | 200k | 28 | $6.00 | 60.3 | 0.50 | 8.78 | N/A | ||
![]() | ![]() Codestral (Jan '25) | 256k | 28 | $0.45 | 200.9 | 0.31 | 2.80 | N/A | |
![]() Codestral (Jan '25) Vertex | 128k | 28 | $0.45 | 152.9 | 0.15 | 3.42 | N/A | ||
Llama 3 70B | 8k | 27 | $1.18 | 47.9 | 0.42 | 10.85 | N/A | ||
Llama 3 70B | 8k | 27 | $0.40 | 33.8 | 0.77 | 15.58 | N/A | ||
![]() | Llama 3 70B | 8k | 27 | $2.86 | 54.4 | 0.42 | 9.61 | N/A | |
![]() | Llama 3 70B | 8k | 27 | $2.90 | 19.0 | 0.76 | 27.13 | N/A | |
Llama 3 70B | 8k | 27 | $0.90 | 148.4 | 0.41 | 3.78 | N/A | ||
Llama 3 70B | 8k | 27 | $0.27 | 40.1 | 0.30 | 12.77 | N/A | ||
![]() | Llama 3 70B | 8k | 27 | $0.57 | 32.5 | 0.69 | 16.05 | N/A | |
Llama 3 70B | 8k | 27 | $0.64 | 335.1 | 0.25 | 1.74 | N/A | ||
Llama 3 70B (Reference, FP16) | 8k | 27 | $0.90 | 142.8 | 0.64 | 4.14 | N/A | ||
Llama 3 70B (Turbo, FP8) | 8k | 27 | $0.88 | 22.8 | 0.34 | 22.31 | N/A | ||
![]() | ![]() Mistral Small (Sep '24) | 33k | 27 | $0.30 | 74.4 | 0.41 | 7.13 | N/A | |
![]() | Phi-4 Multimodal | 128k | 27 | $0.00 | 21.1 | 0.34 | 23.99 | N/A | |
Qwen2.5 Coder 7B Fast | 131k | 27 | $0.04 | 219.3 | 0.46 | 2.74 | N/A | ||
Qwen2.5 Coder 7B Base | 131k | 27 | $0.01 | 201.0 | 0.49 | 2.98 | N/A | ||
![]() | ![]() Mistral Large (Feb '24) | 33k | 26 | $6.00 | 32.0 | 0.48 | 16.11 | N/A | |
![]() | ![]() Mistral Large (Feb '24) | 33k | 26 | $6.00 | 43.9 | 0.42 | 11.81 | N/A | |
![]() | ![]() Mistral Large (Feb '24) | 33k | 26 | $6.00 | 39.6 | 0.48 | 13.10 | N/A | |
![]() | ![]() Mixtral 8x22B | 65k | 26 | $3.00 | 66.5 | 0.32 | 7.84 | N/A | |
![]() Mixtral 8x22B Base | 65k | 26 | $0.60 | 78.2 | 0.56 | 6.95 | N/A | ||
![]() Mixtral 8x22B Fast | 65k | 26 | $1.05 | 100.5 | 0.52 | 5.50 | N/A | ||
![]() Mixtral 8x22B | 65k | 26 | $1.20 | 76.4 | 0.36 | 6.90 | N/A | ||
![]() Mixtral 8x22B | 65k | 26 | $1.20 | 88.8 | 0.27 | 5.90 | N/A | ||
![]() | Phi-4 Mini | 128k | 26 | $0.12 | 218.5 | 0.43 | 2.72 | N/A | |
![]() | Phi-4 Mini | 128k | 26 | $0.00 | 53.2 | 0.36 | 9.75 | N/A | |
![]() | Phi-3 Medium 14B | 128k | 25 | $0.30 | 51.4 | 0.42 | 10.15 | N/A | |
Gemma 3 4B | 128k | 24 | $0.03 | 92.9 | 0.29 | 5.67 | N/A | ||
![]() | Claude 2.1 | 200k | 24 | $12.00 | 29.8 | 1.64 | 18.42 | N/A | |
Claude 2.1 | 200k | 24 | $12.00 | 14.1 | 0.81 | 36.35 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.03 | 135.0 | 0.43 | 4.13 | N/A | ||
![]() | Llama 3.1 8B | 33k | 24 | $0.10 | 2,141.3 | 0.25 | 0.48 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.10 | 66.4 | 0.84 | 8.37 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.22 | 91.0 | 0.37 | 5.86 | N/A | |
Llama 3.1 8B Fast | 128k | 24 | $0.04 | 181.3 | 0.49 | 3.25 | N/A | ||
Llama 3.1 8B Base | 128k | 24 | $0.03 | 66.5 | 0.51 | 8.03 | N/A | ||
Llama 3.1 8B Vertex | 128k | 24 | $0.00 | 118.9 | 0.17 | 4.38 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.38 | 226.4 | 0.31 | 2.52 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.20 | 213.2 | 0.35 | 2.69 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.04 | 49.3 | 0.25 | 10.39 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.10 | 468.9 | 0.39 | 1.45 | N/A | ||
![]() | Llama 3.1 8B | 16k | 24 | $0.05 | 65.8 | 0.76 | 8.35 | N/A | |
Llama 3.1 8B | 128k | 24 | $0.06 | 900.9 | 0.22 | 0.78 | N/A | ||
![]() | Llama 3.1 8B | 16k | 24 | $0.13 | 1,056.0 | 0.21 | 0.69 | N/A | |
Llama 3.1 8B Turbo | 128k | 24 | $0.18 | 149.2 | 0.22 | 3.58 | N/A | ||
Llama 3.1 8B | 128k | 24 | $0.15 | 468.1 | 0.17 | 1.24 | N/A | ||
![]() | Llama 3.1 8B | 128k | 24 | $0.18 | 62.2 | 0.53 | 8.56 | N/A | |
![]() | ![]() Pixtral 12B | 128k | 23 | $0.15 | 105.1 | 0.33 | 5.09 | N/A | |
![]() Pixtral 12B | 128k | 23 | $0.10 | 75.4 | 0.47 | 7.10 | N/A | ||
![]() | ![]() Mistral Small (Feb '24) | 33k | 23 | $1.50 | 124.3 | 0.32 | 4.34 | N/A | |
![]() | ![]() Mistral Small (Feb '24) | 33k | 23 | $1.50 | 88.3 | 0.40 | 6.06 | N/A | |
![]() | ![]() Mistral Medium | 33k | 23 | $4.09 | 41.2 | 0.40 | 12.52 | N/A | |
![]() | ![]() Ministral 8B | 128k | 22 | $0.10 | 147.9 | 0.35 | 3.73 | N/A | |
Gemma 2 9B Fast | 8k | 22 | $0.04 | 164.5 | 0.48 | 3.52 | N/A | ||
Gemma 2 9B Base | 8k | 22 | $0.03 | 173.2 | 0.52 | 3.40 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.04 | 40.3 | 0.52 | 12.92 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.20 | 708.5 | 0.22 | 0.93 | N/A | ||
Gemma 2 9B | 8k | 22 | $0.30 | 132.5 | 0.21 | 3.99 | N/A | ||
![]() LFM 40B | 32k | 22 | $0.15 | 174.7 | 0.46 | 3.32 | N/A | ||
![]() | ![]() Command-R+ | 128k | 21 | $6.00 | 48.5 | 0.48 | 10.78 | N/A | |
![]() Command-R+ | 128k | 21 | $4.38 | 52.7 | 0.29 | 9.78 | N/A | ||
Llama 3 8B | 8k | 21 | $0.10 | 78.7 | 0.45 | 6.80 | N/A | ||
![]() | Llama 3 8B | 8k | 21 | $0.38 | 104.4 | 0.32 | 5.11 | N/A | |
![]() | Llama 3 8B | 8k | 21 | $0.38 | 73.8 | 0.37 | 7.14 | N/A | |
Llama 3 8B | 8k | 21 | $0.04 | 104.2 | 0.21 | 5.01 | N/A | ||
![]() | Llama 3 8B | 8k | 21 | $0.04 | 50.4 | 0.85 | 10.76 | N/A | |
Llama 3 8B | 8k | 21 | $0.06 | 1,349.7 | 0.30 | 0.67 | N/A | ||
Llama 3 8B | 8k | 21 | $0.20 | 191.5 | 0.43 | 3.04 | N/A | ||
Gemini 1.0 Pro Vertex | 33k | 21 | $0.19 | 164.7 | 0.30 | 3.34 | N/A | ||
![]() | ![]() Codestral (May '24) | 33k | 20 | $0.30 | 102.3 | 0.34 | 5.23 | N/A | |
![]() Aya Expanse 32B | 128k | 20 | $0.75 | 121.4 | 0.15 | 4.27 | N/A | ||
![]() | ![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 48.5 | 0.49 | 10.80 | N/A | |
![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 77.2 | 0.23 | 6.71 | N/A | ||
![]() | ![]() Command-R+ (Apr '24) | 128k | 20 | $6.00 | 51.0 | 0.59 | 10.39 | N/A | |
![]() DBRX | 33k | 20 | $1.13 | 69.5 | 0.46 | 7.65 | N/A | ||
![]() | ![]() Ministral 3B | 128k | 20 | $0.04 | 237.2 | 0.31 | 2.42 | N/A | |
![]() | ![]() Mistral NeMo | 128k | 20 | $0.15 | 134.5 | 0.35 | 4.06 | N/A | |
![]() Mistral NeMo Fast | 128k | 20 | $0.12 | 166.5 | 0.49 | 3.50 | N/A | ||
![]() Mistral NeMo Base | 128k | 20 | $0.06 | 24.1 | 0.65 | 21.38 | N/A | ||
![]() Mistral NeMo | 128k | 20 | $0.06 | 57.8 | 0.22 | 8.87 | N/A | ||
Llama 3.2 3B (FP8) | 128k | 20 | $0.02 | 216.6 | 0.37 | 2.68 | N/A | ||
Llama 3.2 3B | 128k | 20 | $0.10 | 85.2 | 0.82 | 6.69 | N/A | ||
![]() | Llama 3.2 3B | 128k | 20 | $0.15 | 72.6 | 0.48 | 7.37 | N/A | |
Llama 3.2 3B Base | 128k | 20 | $0.01 | 125.7 | 0.50 | 4.48 | N/A | ||
![]() | Llama 3.2 3B | 128k | 20 | $0.06 | 246.3 | 0.43 | 2.46 | N/A | |
Llama 3.2 3B | 128k | 20 | $0.02 | 145.4 | 0.17 | 3.61 | N/A | ||
![]() | Llama 3.2 3B | 32k | 20 | $0.04 | 71.3 | 0.66 | 7.67 | N/A | |
Llama 3.2 3B | 8k | 20 | $0.06 | 1,512.9 | 0.36 | 0.69 | N/A | ||
![]() | Llama 3.2 3B | 8k | 20 | $0.10 | 1,590.9 | 0.18 | 0.50 | N/A | |
Llama 3.2 3B Turbo | 128k | 20 | $0.06 | 162.2 | 0.33 | 3.41 | N/A | ||
![]() DeepSeek R1 Distill Qwen 1.5B | 128k | 19 | $0.18 | 388.6 | 0.24 | 6.67 | 5.15 | ||
![]() Jamba 1.5 Mini | 256k | 18 | $0.25 | 168.7 | 0.32 | 3.28 | N/A | ||
![]() | ![]() Jamba 1.5 Mini | 256k | 18 | $0.25 | 82.9 | 0.47 | 6.50 | N/A | |
![]() Jamba 1.6 Mini | 256k | 18 | $0.25 | 204.0 | 0.35 | 2.80 | N/A | ||
![]() | ![]() Mixtral 8x7B | 33k | 17 | $0.70 | 96.7 | 0.35 | 5.52 | N/A | |
![]() | ![]() Mixtral 8x7B | 33k | 17 | $0.51 | 76.4 | 0.33 | 6.88 | N/A | |
![]() Mixtral 8x7B Fast | 33k | 17 | $0.23 | 54.2 | 0.60 | 9.82 | N/A | ||
![]() Mixtral 8x7B Base | 33k | 17 | $0.12 | 53.7 | 0.61 | 9.91 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.50 | 189.0 | 0.30 | 2.94 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.24 | 95.9 | 0.21 | 5.42 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.63 | 94.7 | 0.42 | 5.70 | N/A | ||
![]() Mixtral 8x7B | 33k | 17 | $0.60 | 49.0 | 0.45 | 10.66 | N/A | ||
![]() Aya Expanse 8B | 8k | 16 | $0.75 | 167.4 | 0.12 | 3.10 | N/A | ||
![]() | ![]() Command-R | 128k | 15 | $0.75 | 110.1 | 0.36 | 4.90 | N/A | |
![]() Command-R | 128k | 15 | $0.26 | 86.4 | 0.20 | 5.99 | N/A | ||
![]() | ![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 110.2 | 0.34 | 4.88 | N/A | |
![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 177.3 | 0.16 | 2.98 | N/A | ||
![]() | ![]() Command-R (Mar '24) | 128k | 15 | $0.75 | 81.0 | 0.44 | 6.62 | N/A | |
![]() | ![]() Codestral-Mamba | 256k | 14 | $0.25 | 95.0 | 0.43 | 5.69 | N/A | |
![]() | ![]() Mistral 7B | 8k | 10 | $0.25 | 111.3 | 0.33 | 4.82 | N/A | |
![]() Mistral 7B | 8k | 10 | $0.04 | 94.5 | 0.21 | 5.50 | N/A | ||
![]() | ![]() Mistral 7B | 32k | 10 | $0.06 | 117.8 | 0.81 | 5.05 | N/A | |
![]() Mistral 7B | 8k | 10 | $0.20 | 180.3 | 0.17 | 2.94 | N/A | ||
![]() | Llama 3.2 1B | 128k | 10 | $0.10 | 118.9 | 0.46 | 4.66 | N/A | |
Llama 3.2 1B Base | 128k | 10 | $0.01 | 271.7 | 0.47 | 2.31 | N/A | ||
Llama 3.2 1B | 128k | 10 | $0.01 | 180.5 | 0.24 | 3.01 | N/A | ||
Llama 3.2 1B | 8k | 10 | $0.04 | 3,398.0 | 0.47 | 0.62 | N/A | ||
![]() | Llama 3.2 1B | 16k | 10 | $0.05 | 2,583.2 | 0.20 | 0.39 | N/A | |
Llama 2 Chat 7B | 4k | 8 | $0.10 | 132.7 | 0.43 | 4.20 | N/A | ||
o1-preview | 128k | $26.25 | 149.2 | 20.94 | 24.29 | N/A | |||
![]() | o1-preview | 128k | $28.88 | 111.0 | 34.65 | 39.15 | N/A | ||
GPT-4o (Aug '24) | 128k | $4.38 | 108.8 | 0.57 | 5.17 | N/A | |||
![]() | GPT-4o (Aug '24) | 128k | $4.38 | 121.0 | 0.74 | 4.87 | N/A | ||
GPT-4.5 (Preview) | 128k | $93.75 | 58.3 | 1.12 | 9.70 | N/A | |||
![]() | Llama 3.2 11B (Vision) | 128k | $0.16 | 144.1 | 0.48 | 3.95 | N/A | ||
![]() | Llama 3.2 11B (Vision) | 128k | $0.15 | 85.6 | 0.41 | 6.25 | N/A | ||
Llama 3.2 11B (Vision) | 128k | $0.20 | 107.6 | 0.24 | 4.89 | N/A | |||
Llama 3.2 11B (Vision) | 128k | $0.06 | 45.8 | 0.25 | 11.18 | N/A | |||
Llama 3.2 11B (Vision) | 8k | $0.18 | 902.3 | 0.36 | 0.91 | N/A | |||
Llama 3.2 11B (Vision) Turbo | 128k | $0.18 | 122.9 | 0.15 | 4.22 | N/A | |||
Gemma 2 27B Fast | 8k | $0.26 | 85.6 | 0.52 | 6.36 | N/A | |||
Gemma 2 27B Base | 8k | $0.15 | 53.9 | 0.59 | 9.86 | N/A | |||
Gemma 2 27B | 8k | $0.80 | 88.7 | 0.30 | 5.94 | N/A | |||
![]() | Claude 3.5 Sonnet (June) | 200k | $6.00 | 44.2 | 0.84 | 12.14 | N/A | ||
Claude 3.5 Sonnet (June) Vertex | 200k | $6.00 | 80.1 | 0.97 | 7.21 | N/A | |||
Claude 3.5 Sonnet (June) | 200k | $6.00 | 79.8 | 0.62 | 6.89 | N/A | |||
![]() | Claude 3 Haiku | 200k | $0.50 | 113.2 | 0.83 | 5.25 | N/A | ||
Claude 3 Haiku | 200k | $0.50 | 144.3 | 0.38 | 3.85 | N/A | |||
![]() | ![]() Mistral Saba | 32k | $0.30 | 99.9 | 0.35 | 5.36 | N/A | ||
![]() DeepSeek Coder V2 Lite Fast, FP8 | 128k | $0.12 | 113.0 | 0.57 | 4.99 | N/A | |||
![]() DeepSeek Coder V2 Lite Base, FP8 | 128k | $0.06 | 108.7 | 0.59 | 5.19 | N/A | |||
![]() Sonar Reasoning | 127k | $2.00 | 81.1 | 1.80 | 32.63 | 24.66 | |||
![]() | ![]() Solar Mini | 4k | $0.15 | 64.4 | 1.08 | 8.84 | N/A | ||
![]() | ![]() Reka Flash | 128k | $0.35 | 46.1 | 0.91 | 11.76 | N/A | ||
![]() | ![]() Reka Core | 128k | $2.00 | 27.8 | 0.95 | 18.95 | N/A | ||
![]() | ![]() Reka Flash (Feb '24) | 128k | $0.35 | 44.9 | 0.97 | 12.11 | N/A | ||
![]() | ![]() Reka Edge | 128k | $0.10 | 82.8 | 0.89 | 6.93 | N/A | ||
Qwen1.5 Chat 110B | 32k | $0.00 | 29.8 | 1.09 | 17.85 | N/A | |||
GPT-4 Turbo | 128k | $15.00 | 37.8 | 0.67 | 13.90 | N/A | |||
![]() | GPT-4 Turbo | 128k | $15.00 | 52.1 | 1.34 | 10.94 | N/A | ||
GPT-4 | 8k | $37.50 | 23.7 | 0.73 | 21.80 | N/A | |||
Gemini 2.0 Flash-Lite (Preview) (AI Studio) | 1m | $0.13 | 207.6 | 0.26 | 2.67 | N/A | |||
Claude 2.0 | 100k | $12.00 | 31.1 | 0.83 | 16.92 | N/A | |||
![]() OpenChat 3.5 | 8k | $0.06 | 43.8 | 0.49 | 11.91 | N/A | |||
![]() Jamba Instruct | 256k | $0.55 | 178.2 | 0.32 | 3.12 | N/A |