LLM Leaderboard - Comparison of over 100 AI models from OpenAI, Google, DeepSeek & others
Intelligence
Gemini 3.1 Pro Preview and GPT-5.4 (xhigh) are the highest-intelligence models, followed by GPT-5.3 Codex (xhigh) and Claude Opus 4.6 (max).
Output Speed
Mercury 2 and Granite 4.0 H Small are the fastest models, followed by Granite 3.3 8B and Gemini 2.5 Flash-Lite.
Latency
Qwen3.5 0.8B and Qwen3.5 2B are the lowest-latency models, followed by NVIDIA Nemotron 3 Nano and Ministral 3 3B.
Price
Qwen3.5 0.8B (Non-reasoning) and Qwen3.5 0.8B (Reasoning) are the cheapest models, followed by Gemma 3n E4B and Qwen3.5 2B.
Context Window
Llama 4 Scout (10M) and Grok 4.1 Fast (2M) support the largest context windows, followed by the second Grok 4.1 Fast variant and Gemini 1.5 Pro (May).
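Model | Context Window | Intelligence Index | Blended Price (USD per 1M tokens) | Output Speed (tokens/s) | Latency (s) | End-to-End Response Time (s) | | Further Analysis |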
|---|---|---|---|---|---|---|---|---|
Gemini 3.1 Pro Preview | 1M | 57 | $4.50 | 115 | 33.27 | 37.61 | ||
GPT-5.4 (xhigh) | 1.05M | 57 | $5.63 | 82 | 198.43 | 204.50 | ||
GPT-5.3 Codex (xhigh) | 400k | 54 | $4.81 | 68 | 96.11 | 103.48 | ||
Claude Opus 4.6 (max) | 1M | 53 | $10.00 | 47 | 19.31 | 30.02 | ||
Muse Spark | 262k | 52 | -- | -- | -- | -- | ||
Claude Sonnet 4.6 (max) | 1M | 52 | $6.00 | 62 | 94.28 | 102.36 | ||
GLM-5.1 | 200k | 51 | $2.15 | 66 | 1.73 | 66.38 | ||
Qwen3.6 Plus | 1M | 50 | $1.13 | 50 | 2.34 | 124.26 | ||
GLM-5 | 200k | 50 | $1.55 | 72 | 1.70 | 51.64 | ||
MiniMax-M2.7 | 205k | 50 | $0.53 | 42 | 3.18 | 73.14 | ||
Grok 4.20 0309 v2 | 2M | 49 | $3.00 | 194 | 11.08 | 13.65 | ||
MiMo-V2-Pro | 1M | 49 | $1.50 | 69 | 2.81 | 52.62 | ||
GPT-5.4 mini (xhigh) | 400k | 49 | $1.69 | 167 | 8.14 | 11.14 | ||
Kimi K2.5 | 256k | 47 | $1.20 | 37 | 2.11 | 95.92 | ||
GLM-5-Turbo | 200k | 47 | -- | -- | -- | -- | ||
Claude Opus 4.6 | 1M | 46 | $10.00 | 40 | 1.82 | 14.33 | ||
Gemini 3 Flash | 1M | 46 | $1.13 | 153 | 8.04 | 11.31 | ||
Qwen3.5 397B A17B | 262k | 45 | $1.35 | 80 | 2.49 | 48.41 | ||
MiMo-V2-Omni-0327 | 256k | 45 | -- | -- | -- | -- | ||
Claude Sonnet 4.6 | 1M | 44 | $6.00 | 43 | 1.48 | 13.07 | ||
GPT-5.4 nano (xhigh) | 400k | 44 | $0.46 | 188 | 4.52 | 7.18 | ||
GLM-5.1 | 200k | 44 | $2.15 | 47 | 1.65 | 12.40 | ||
MiMo-V2-Omni | 256k | 43 | -- | -- | -- | -- | ||
GLM 5V Turbo | 200k | 43 | -- | -- | -- | -- | ||
Claude Sonnet 4.6 (Non-reasoning, Low Effort) | 1M | 43 | $6.00 | 42 | 1.42 | 13.25 | ||
Qwen3.5 27B | 262k | 42 | $0.82 | 87 | 5.69 | 34.38 | ||
DeepSeek V3.2 | 128k | 42 | $0.32 | 46 | 2.13 | 56.75 | ||
Qwen3.5 122B A10B | 262k | 42 | $1.10 | 121 | 2.36 | 23.08 | ||
MiMo-V2-Flash (Feb 2026) | 256k | 41 | $0.15 | 130 | 2.22 | 21.43 | ||
Gemini 3 Pro Preview (low) | 1M | 41 | $4.50 | -- | -- | -- | ||
GLM-5 | 200k | 41 | $1.55 | 48 | 2.33 | 12.83 | ||
Qwen3.5 397B A17B | 262k | 40 | $1.35 | 80 | 2.72 | 8.98 | ||
Qwen3 Max Thinking | 256k | 40 | $2.40 | 37 | 3.98 | 71.21 | ||
Gemma 4 31B | 256k | 39 | $0.00 | 36 | 1.70 | 71.58 | ||
Qwen3.5 Omni Plus | 256k | 39 | $1.50 | 48 | 2.54 | 12.88 | ||
Grok 4.1 Fast | 2M | 39 | $0.28 | 119 | 7.16 | 11.37 | ||
o3 | 200k | 38 | $3.50 | 88 | 11.05 | 16.74 | ||
GPT-5.4 nano | 400k | 38 | $0.46 | 187 | 4.32 | 7.00 | ||
Step 3.5 Flash | 256k | 38 | $0.15 | 165 | 2.35 | 17.49 | ||
GPT-5.4 mini (medium) | 400k | 38 | $1.69 | 166 | 12.87 | 15.89 | ||
Kimi K2.5 | 256k | 37 | $1.20 | 35 | 3.12 | 17.24 | ||
Qwen3.5 27B | 262k | 37 | $0.82 | 88 | 5.62 | 11.32 | ||
Qwen3.5 35B A3B | 262k | 37 | $0.69 | 185 | 2.07 | 15.58 | ||
Claude 4.5 Haiku | 200k | 37 | $2.00 | 99 | 14.79 | 19.87 | ||
NVIDIA Nemotron 3 Super | 1M | 36 | $0.41 | 172 | 0.96 | 15.48 | ||
Qwen3.5 122B A10B | 262k | 36 | $1.10 | 139 | 2.43 | 6.02 | ||
Nova 2.0 Pro Preview (medium) | 256k | 36 | $3.44 | 126 | 12.03 | 31.86 | ||
GPT-5.4 | 1.05M | 35 | $5.63 | 59 | 0.78 | 9.31 | ||
Gemini 3 Flash | 1M | 35 | $1.13 | 165 | 1.50 | 4.53 | ||
Gemini 2.5 Pro | 1M | 35 | $3.44 | 117 | 29.01 | 33.29 | ||
Nova 2.0 Lite (high) | 1M | 35* | $0.85 | 157 | 11.99 | 27.89 | ||
Gemini 3.1 Flash-Lite Preview | 1M | 34 | $0.56 | 186 | 9.07 | 11.76 | ||
Doubao Seed Code | 256k | 34 | -- | -- | -- | -- | ||
gpt-oss-120B (high) | 131k | 33 | $0.26 | 215 | 0.86 | 12.46 | ||
Mercury 2 | 128k | 33 | $0.38 | 848 | 3.79 | 4.38 | ||
Qwen3.5 9B | 262k | 32 | $0.10 | 105 | 0.65 | 24.35 | ||
Gemma 4 31B | 256k | 32 | -- | -- | -- | -- | ||
K-EXAONE | 256k | 32 | -- | -- | -- | -- | ||
DeepSeek V3.2 | 128k | 32 | $0.32 | 45 | 2.02 | 13.11 | ||
Grok 3 mini Reasoning (high) | 1M | 32 | $0.35 | 189 | 0.58 | 13.80 | ||
Nova 2.0 Pro Preview (low) | 256k | 32 | $3.44 | 133 | 10.04 | 28.90 | ||
Trinity Large Thinking | 512k | 32 | $0.40 | 101 | 1.04 | 25.89 | ||
Gemma 4 26B A4B | 256k | 31 | $0.20 | -- | -- | -- | ||
Claude 4.5 Haiku | 200k | 31 | $2.00 | 91 | 0.76 | 6.26 | ||
Qwen3.5 35B A3B | 262k | 31 | $0.69 | 173 | 2.06 | 4.94 | ||
MiMo-V2-Flash | 256k | 30 | $0.15 | 128 | 2.34 | 6.24 | ||
Nova 2.0 Lite (medium) | 1M | 30 | $0.85 | 164 | 16.04 | 31.24 | ||
DeepSeek V3.2 Speciale | 128k | 29 | -- | -- | -- | -- | ||
ERNIE 5.0 Thinking Preview | 128k | 29 | -- | -- | -- | -- | ||
Grok 4.20 0309 v2 | 2M | 29 | $3.00 | 180 | 0.55 | 3.33 | ||
Grok Code Fast 1 | 256k | 29 | $0.53 | 138 | 3.96 | 7.58 | ||
Qwen3 Coder Next | 256k | 28 | $0.60 | 145 | 1.86 | 5.31 | ||
Nova 2.0 Omni (medium) | 1M | 28 | $0.85 | -- | -- | -- | ||
Nemotron Cascade 2 30B A3B | 262k | 28 | -- | -- | -- | -- | ||
Qwen3.5 9B | 262k | 27 | $0.08 | 158 | 0.64 | 3.79 | ||
Mistral Small 4 | 256k | 27 | $0.26 | 145 | 3.30 | 20.49 | ||
Magistral Medium 1.2 | 128k | 27 | $2.75 | 86 | 1.68 | 30.86 | ||
Gemma 4 26B A4B | 256k | 27 | -- | -- | -- | -- | ||
Qwen3.5 4B | 262k | 27 | $0.06 | 190 | 0.68 | 13.82 | ||
DeepSeek R1 0528 | 128k | 27 | $2.36 | -- | -- | -- | ||
Qwen3 Next 80B A3B | 262k | 27 | $1.88 | 172 | 2.16 | 16.71 | ||
Solar Pro 3 | 128k | 26 | -- | -- | -- | -- | ||
Qwen3.5 Omni Flash | 256k | 26 | $0.28 | 164 | 1.92 | 4.97 | ||
Qwen3 Coder 480B | 262k | 25 | $3.00 | 57 | 2.96 | 11.73 | ||
Nova 2.0 Lite (low) | 1M | 25 | $0.85 | 173 | 10.16 | 24.63 | ||
gpt-oss-120B (low) | 131k | 24 | $0.26 | 213 | 0.84 | 12.56 | ||
gpt-oss-20B (high) | 131k | 24 | $0.09 | 251 | 0.70 | 10.66 | ||
GPT-5.4 nano | 400k | 24 | $0.46 | 185 | 0.61 | 3.32 | ||
NVIDIA Nemotron 3 Nano | 1M | 24 | $0.10 | 101 | 1.67 | 26.51 | ||
LongCat Flash Lite | 256k | 24 | $0.00 | 116 | 6.78 | 11.10 | ||
Grok 4.1 Fast | 2M | 24 | $0.28 | 131 | 0.57 | 4.38 | ||
K-EXAONE | 256k | 23 | -- | -- | -- | -- | ||
GPT-5.4 mini | 400k | 23 | $1.69 | 157 | 0.72 | 3.91 | ||
Nova 2.0 Omni (low) | 1M | 23 | $0.85 | -- | -- | -- | ||
Mi:dm K 2.5 Pro | 128k | 23 | -- | -- | -- | -- | ||
Nova 2.0 Pro Preview | 256k | 23 | $3.44 | 124 | 1.03 | 5.07 | ||
Mistral Large 3 | 256k | 23 | $0.75 | 38 | 1.68 | 14.91 | ||
Ring-1T | 128k | 23 | -- | -- | -- | -- | ||
Qwen3.5 4B | 262k | 23 | $0.06 | 199 | 0.82 | 3.34 | ||
INTELLECT-3 | 131k | 22 | -- | -- | -- | -- | ||
Devstral 2 | 256k | 22 | $0.00 | 71 | 0.90 | 7.91 | ||
Solar Open 100B | 128k | 22 | -- | -- | -- | -- | ||
Gemini 2.5 Flash-Lite (Sep) | 1M | 22 | $0.17 | -- | -- | -- | ||
Mistral Medium 3.1 | 128k | 21 | $0.80 | 100 | 1.37 | 6.37 | ||
gpt-oss-20B (low) | 131k | 21 | $0.09 | 215 | 0.73 | 12.37 | ||
Qwen3 Next 80B A3B | 262k | 20 | $0.88 | 163 | 2.24 | 5.31 | ||
Devstral Small 2 | 256k | 19 | $0.00 | 72 | 0.93 | 7.84 | ||
Gemini 2.5 Flash-Lite (Sep) | 1M | 19 | $0.17 | -- | -- | -- | ||
Motif-2-12.7B | 128k | 19 | -- | -- | -- | -- | ||
Ling-1T | 128k | 19 | -- | -- | -- | -- | ||
Nova Premier | 1M | 19 | $5.00 | 26 | 3.03 | 22.47 | ||
Gemma 4 E4B | 128k | 19 | -- | -- | -- | -- | ||
Llama Nemotron Super 49B v1.5 | 128k | 19 | $0.17 | 55 | 1.45 | 46.84 | ||
Mistral Small 4 | 256k | 19 | $0.26 | 117 | 1.82 | 6.07 | ||
Llama 3.3 Nemotron Super 49B | 128k | 18* | -- | -- | -- | -- | ||
Llama 4 Maverick | 1M | 18 | $0.49 | 111 | 1.03 | 5.55 | ||
Sarvam 105B (high) | 128k | 18 | $0.00 | 109 | 2.48 | 25.32 | ||
Magistral Small 1.2 | 128k | 18 | $0.75 | 161 | 0.85 | 16.37 | ||
Nova 2.0 Lite | 1M | 18 | $0.85 | 165 | 1.29 | 4.32 | ||
Llama 3.1 405B | 128k | 17 | $3.69 | 29 | 2.37 | 19.38 | ||
EXAONE 4.0 32B | 131k | 17 | -- | -- | -- | -- | ||
Nova 2.0 Omni | 1M | 17 | $0.85 | 192 | 1.09 | 3.70 | ||
DeepSeek R1 0528 Qwen3 8B | 32.8k | 16* | -- | -- | -- | -- | ||
Qwen3.5 2B | 262k | 16 | $0.04 | -- | -- | -- | ||
Nanbeige4.1-3B | 256k | 16 | -- | -- | -- | -- | ||
Ministral 3 14B | 256k | 16 | $0.20 | 108 | 0.72 | 5.34 | ||
DeepSeek R1 Distill Llama 70B | 128k | 16* | $0.88 | 39 | 2.96 | 67.68 | ||
Falcon-H1R-7B | 256k | 16 | -- | -- | -- | -- | ||
Ling-flash-2.0 | 128k | 16 | $0.25 | 62 | 2.43 | 10.46 | ||
Qwen3 Omni 30B A3B | 65.5k | 16 | $0.43 | 91 | 1.91 | 29.32 | ||
Step3 VL 10B | 65.5k | 15 | -- | -- | -- | -- | ||
Gemma 4 E2B | 128k | 15 | -- | -- | -- | -- | ||
Llama Nemotron Ultra | 128k | 15 | $0.90 | 41 | 2.52 | 63.41 | ||
ERNIE 4.5 300B A47B | 131k | 15 | $0.48 | 25 | 3.53 | 23.56 | ||
Solar Pro 2 | 65.5k | 15 | -- | -- | -- | -- | ||
NVIDIA Nemotron Nano 12B v2 VL | 128k | 15 | $0.30 | 139 | 0.64 | 18.65 | ||
Ministral 3 8B | 256k | 15 | $0.15 | 180 | 0.56 | 3.34 | ||
Gemma 4 E4B | 128k | 15 | -- | -- | -- | -- | ||
NVIDIA Nemotron Nano 9B V2 | 131k | 15 | $0.07 | 156 | 0.64 | 16.64 | ||
NVIDIA Nemotron 3 Nano 4B | 262k | 15 | -- | -- | -- | -- | ||
Qwen3.5 2B | 262k | 15 | $0.04 | 270 | 0.40 | 2.25 | ||
Llama Nemotron Super 49B v1.5 | 128k | 15 | $0.17 | 55 | 1.50 | 10.64 | ||
Llama 3.3 70B | 128k | 14 | $0.64 | 87 | 1.39 | 7.15 | ||
Llama 3.1 Nemotron Nano 4B v1.1 | 128k | 14* | -- | -- | -- | -- | ||
Kimi Linear 48B A3B Instruct | 1M | 14* | -- | -- | -- | -- | ||
Llama 3.3 Nemotron Super 49B | 128k | 14* | -- | -- | -- | -- | ||
Ring-flash-2.0 | 128k | 14 | $0.25 | 85 | 2.67 | 31.95 | ||
Solar Pro 2 | 65.5k | 14 | -- | -- | -- | -- | ||
Llama 4 Scout | 10M | 14 | $0.29 | 143 | 0.78 | 4.27 | ||
Command A | 256k | 13 | $4.38 | 32 | 2.14 | 17.82 | ||
Llama 3.1 Nemotron 70B | 128k | 13 | $1.20 | 46 | 1.89 | 12.82 | ||
NVIDIA Nemotron 3 Nano | 1M | 13 | $0.09 | 84 | 0.46 | 6.38 | ||
NVIDIA Nemotron Nano 9B V2 | 131k | 13 | $0.09 | 163 | 1.08 | 4.14 | ||
Sarvam 30B (high) | 65.5k | 12 | $0.00 | 142 | 1.91 | 19.49 | ||
Gemma 4 E2B | 128k | 12 | -- | -- | -- | -- | ||
R1 1776 | 128k | 12* | -- | -- | -- | -- | ||
Llama 3.2 90B (Vision) | 128k | 12* | $0.72 | 54 | 1.07 | 10.32 | ||
EXAONE 4.0 32B | 131k | 12 | -- | -- | -- | -- | ||
Ministral 3 3B | 256k | 11 | $0.10 | 278 | 0.47 | 2.27 | ||
Jamba 1.7 Large | 256k | 11 | $3.50 | 59 | 1.41 | 9.91 | ||
Granite 4.0 H Small | 128k | 11 | $0.11 | 449 | 10.25 | 11.36 | ||
Qwen3 Omni 30B A3B | 65.5k | 11 | $0.43 | 95 | 1.91 | 7.17 | ||
Qwen3.5 0.8B | 262k | 11 | $0.02 | -- | -- | -- | ||
LFM2 24B A2B | 32.8k | 10 | $0.05 | 52 | 0.55 | 10.13 | ||
Phi-4 | 16k | 10 | $0.22 | 34 | 2.15 | 16.76 | ||
Nova Micro | 130k | 10 | $0.06 | 285 | 0.79 | 2.55 | ||
NVIDIA Nemotron Nano 12B v2 VL | 128k | 10 | $0.30 | 143 | 1.07 | 4.56 | ||
Phi-4 Multimodal | 128k | 10* | $0.00 | 17 | 0.83 | 29.93 | ||
Qwen3.5 0.8B | 262k | 10 | $0.02 | 305 | 0.37 | 2.01 | ||
Jamba Reasoning 3B | 262k | 10 | -- | -- | -- | -- | ||
Reka Flash 3 | 128k | 10 | $0.35 | -- | -- | -- | ||
Ling-mini-2.0 | 131k | 9 | -- | -- | -- | -- | ||
Llama 3.2 11B (Vision) | 128k | 9 | $0.16 | 52 | 0.78 | 10.42 | ||
Phi-4 Mini | 128k | 8 | $0.00 | 43 | 0.83 | 12.47 | ||
Exaone 4.0 1.2B | 64k | 8 | -- | -- | -- | -- | ||
Exaone 4.0 1.2B | 64k | 8 | -- | -- | -- | -- | ||
LFM2.5-1.2B-Thinking | 32k | 8 | -- | -- | -- | -- | ||
Jamba 1.7 Mini | 258k | 8 | -- | -- | -- | -- | ||
LFM2 2.6B | 32.8k | 8 | $0.00 | -- | -- | -- | ||
LFM2.5-1.2B-Instruct | 32k | 8 | $0.00 | -- | -- | -- | ||
Granite 4.0 H 1B | 128k | 8 | -- | -- | -- | -- | ||
Gemma 3 270M | 32k | 8 | -- | -- | -- | -- | ||
Apertus 70B Instruct | 65.5k | 8 | $1.34 | -- | -- | -- | ||
Granite 4.0 Micro | 128k | 8 | -- | -- | -- | -- | ||
Granite 4.0 1B | 128k | 7 | -- | -- | -- | -- | ||
LFM2 8B A1B | 32.8k | 7 | $0.00 | -- | -- | -- | ||
LFM2.5-VL-1.6B | 32k | 6 | $0.00 | -- | -- | -- | ||
Granite 4.0 350M | 32.8k | 6 | -- | -- | -- | -- | ||
Apertus 8B Instruct | 65.5k | 6 | $0.13 | -- | -- | -- | ||
Granite 4.0 H 350M | 32.8k | 5 | -- | -- | -- | -- | ||
Tiny Aya Global | 8.19k | 5 | -- | -- | -- | -- | ||
GPT-5.4 Pro (xhigh) | 1.05M | -- | $67.50 | -- | -- | -- | ||
Gemini 3 Deep Think | 128k | -- | -- | -- | -- | -- | ||
Mi:dm K 2.5 Pro Preview | 128k | -- | -- | -- | -- | -- | ||
Frequently Asked Questions
Gemini 3.1 Pro Preview currently ranks #1 on the Artificial Analysis LLM Leaderboard with an Intelligence Index score of 57, out of 319 models ranked.
The top models by Intelligence Index are: 1. Gemini 3.1 Pro Preview (57), 2. GPT-5.4 (xhigh) (57), 3. GPT-5.3 Codex (xhigh) (54), 4. Claude Opus 4.6 (Adaptive Reasoning, Max Effort) (53), 5. Muse Spark (52).
Mercury 2 is the fastest at 848.5 tokens per second, followed by Granite 4.0 H Small (449.3 t/s) and Granite 3.3 8B (Non-reasoning) (335.6 t/s).
Qwen3.5 0.8B (Non-reasoning) is the most affordable at $0.02 per 1M tokens (blended 3:1 input-to-output), followed by Qwen3.5 0.8B (Reasoning) ($0.02) and Gemma 3n E4B Instruct ($0.03).
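The "blended 3:1 input-to-output" price can be read as a weighted average of the per-token input and output prices. The following is a minimal sketch under that assumption; the function name `blended_price` and the example prices are hypothetical and are not taken from the leaderboard.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of per-1M-token prices in USD.

    Assumes the 3:1 blend weights the input price three times as heavily
    as the output price; this is an interpretation, not a published formula.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Hypothetical prices: $1.00 per 1M input tokens, $3.00 per 1M output tokens.
example = blended_price(input_price=1.00, output_price=3.00)
print(f"${example:.2f} per 1M tokens")  # prints "$1.50 per 1M tokens"
```

Under this reading, a model whose output tokens cost three times its input tokens ends up with a blended price 1.5 times its input price.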
GLM-5.1 (Reasoning) is the highest-ranked open weights model with an Intelligence Index score of 51. There are 197 open weights models out of 319 total on the leaderboard.
The top open weights models by Intelligence Index are: 1. GLM-5.1 (Reasoning) (51), 2. GLM-5 (Reasoning) (50), 3. Kimi K2.5 (Reasoning) (47).
Gemini 3.1 Pro Preview leads among 159 reasoning models with an Intelligence Index score of 57. Reasoning models use extended thinking to solve complex problems before responding.
The leaderboard includes filters to narrow results by model type (reasoning vs non-reasoning), openness (open weights vs proprietary), and other criteria. You can also adjust prompt options to see how performance varies with different input lengths.
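For readers working from a copy of the table rather than the interactive page, a similar kind of filtering can be approximated offline. The sketch below parses a few rows in the pipe-separated layout used in the table and keeps models under a price ceiling, ranked by Intelligence Index. The column positions, the helper names `parse_number` and `parse_row`, and the $3.00 threshold are assumptions made for illustration; the site's own filters work on different criteria (model type, openness).

```python
from typing import Optional


def parse_number(cell: str) -> Optional[float]:
    """Convert a cell such as '$4.50', '57', or '35*' to a float; '--' becomes None."""
    text = cell.strip().lstrip("$").rstrip("*")
    if text in ("", "--"):
        return None
    return float(text)


def parse_row(line: str) -> dict:
    """Split one table row on '|' and name the first five cells (assumed order)."""
    cells = [c.strip() for c in line.split("|")]
    return {
        "model": cells[0],
        "context": cells[1],
        "intelligence": parse_number(cells[2]),
        "price": parse_number(cells[3]),
        "speed": parse_number(cells[4]),
    }


# Rows copied from the leaderboard table.
rows = [parse_row(line) for line in [
    "Gemini 3.1 Pro Preview | 1M | 57 | $4.50 | 115 | 33.27 | 37.61 | ||",
    "GLM-5.1 | 200k | 51 | $2.15 | 66 | 1.73 | 66.38 | ||",
    "MiniMax-M2.7 | 205k | 50 | $0.53 | 42 | 3.18 | 73.14 | ||",
    "Muse Spark | 262k | 52 | -- | -- | -- | -- | ||",
]]

# Keep models with a known price at or below $3.00, highest score first.
affordable = sorted(
    (r for r in rows if r["price"] is not None and r["price"] <= 3.00),
    key=lambda r: r["intelligence"],
    reverse=True,
)
print([r["model"] for r in affordable])  # prints ['GLM-5.1', 'MiniMax-M2.7']
```

Rows with `--` in the price column are skipped rather than treated as free, since a missing value is not the same as a price of zero.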
Click on any model name in the leaderboard to visit its dedicated comparison page with detailed charts covering intelligence, pricing, speed, latency, and more. You can also compare API providers for each model.