LLM Leaderboard - Comparison of over 100 AI models from OpenAI, Google, DeepSeek & others

Comparison and ranking the performance of over 100 AI models (LLMs) across key metrics including intelligence, price, performance and speed (output speed - tokens per second & latency - TTFT), context window & others.

For more details including relating to our methodology, see our FAQs.

Intelligence

Updated

Claude Fable 5 (with fallback) and Claude Opus 4.8 (max) are the highest intelligence models, followed by GPT-5.5 (xhigh) and Claude Opus 4.7 (max).

Output Speed

Mercury 2 and LFM2.5-1.2B-Instruct are the fastest models, followed by LFM2 1.2B and LFM2.5-VL-1.6B.

Latency

North Mini Code and Command A+ are the lowest latency models, followed by NVIDIA Nemotron 3 Nano and NVIDIA Nemotron Nano 12B v2 VL.

Price

Qwen3.5 0.8B and Qwen3.5 0.8B are the cheapest models, followed by Gemma 3n E4B and Nova Micro.

Context Window

Llama 4 Scout and Grok 4.20 0309 support the largest context windows, followed by Gemini 1.5 Pro (May) and Grok 4.1 Fast.

Further Analysis
Claude Fable 5 (with fallback)
1M
AnthropicAnthropic
60
$7.70
--
--
--
Claude Opus 4.8 (max)
1M
AnthropicAnthropic
56
$3.85
59
18.64
27.09
GPT-5.5 (xhigh)
922k
OpenAIOpenAI
55
$4.35
63
51.97
59.95
Claude Opus 4.7 (max)
1M
AnthropicAnthropic
54
$3.85
51
21.81
31.52
GPT-5.5 (high)
922k
OpenAIOpenAI
53
$4.35
58
24.46
33.06
GLM-5.2 (max)
1M
Z AIZ AI
51
$0.90
96
1.72
27.84
Gemini 3.5 Flash
1M
GoogleGoogle
50
$1.31
161
19.92
23.01
Claude Sonnet 4.6 (max)
1M
AnthropicAnthropic
47
$2.31
53
102.27
111.77
GPT-5.5 (medium)
922k
OpenAIOpenAI
47*
$4.35
60
7.86
16.20
Gemini 3.1 Pro Preview
1M
GoogleGoogle
46
$1.74
123
27.87
31.93
Qwen3.7 Max
1M
AlibabaAlibaba
46
$1.43
112
2.65
28.60
Gemini 3.5 Flash (medium)
1M
GoogleGoogle
45*
$1.31
161
18.56
21.67
MiniMax-M3
1M
MiniMaxMiniMax
44
$0.22
62
2.89
43.27
DeepSeek V4 Pro (Max)
1M
DeepSeekDeepSeek
44
$0.18
85
1.73
58.98
GPT-5.3 Codex (xhigh)
400k
OpenAIOpenAI
44*
$1.87
75
82.70
89.36
Muse Spark
262k
MetaMeta
43
--
--
--
--
Kimi K2.6
256k
KimiKimi
43
$0.70
44
2.31
116.07
Claude Opus 4.7 (Non-reasoning, high)
1M
AnthropicAnthropic
43*
$3.85
48
1.10
11.50
MiMo-V2.5-Pro
1M
XiaomiXiaomi
42
$0.18
53
2.55
49.81
Kimi K2.7 Code
256k
KimiKimi
42
$0.70
59
2.23
48.32
GPT-5.5 (low)
922k
OpenAIOpenAI
42*
$4.35
60
1.70
10.10
DeepSeek V4 Pro (High)
1M
DeepSeekDeepSeek
41*
$0.18
81
1.78
32.65
DeepSeek V4 Flash (Max)
1M
DeepSeekDeepSeek
40
$0.06
109
1.38
57.42
GLM-5.1
200k
Z AIZ AI
40
$0.90
84
1.52
52.87
MiMo-V2.5
1M
XiaomiXiaomi
40*
$0.06
81
2.50
33.46
GPT-5.4 mini (xhigh)
400k
OpenAIOpenAI
40
$0.65
174
6.96
9.84
Qwen3.6 Plus
1M
AlibabaAlibaba
40
$0.43
53
2.76
117.99
Qwen3.7 Plus
1M
AlibabaAlibaba
39
$0.25
52
2.73
50.81
GPT-5.4 nano (xhigh)
400k
OpenAIOpenAI
38
$0.18
157
6.63
9.82
MiniMax-M2.7
205k
MiniMaxMiniMax
38
$0.22
47
2.20
64.76
GLM-5-Turbo
200k
Z AIZ AI
38*
--
--
--
--
Nemotron 3 Ultra
262k
NVIDIANVIDIA
38
$0.58
172
1.14
17.28
Grok 4.3 (high)
1M
xAIxAI
38
$0.64
138
11.58
15.19
DeepSeek V4 Flash (High)
1M
DeepSeekDeepSeek
37*
$0.08
--
--
--
Qwen3.6 27B
262k
AlibabaAlibaba
37
$0.90
56
3.80
114.89
MiMo-V2-Omni-0327
256k
XiaomiXiaomi
36*
$0.34
78
3.00
34.98
Grok 4.3 (medium)
1M
xAIxAI
36*
$0.64
151
8.40
11.72
Claude Sonnet 4.6 (Non-reasoning)
1M
AnthropicAnthropic
36*
$2.31
43
1.24
12.90
Grok 4.3 (low)
1M
xAIxAI
35*
$0.64
118
4.65
8.88
GLM-5.1
200k
Z AIZ AI
35*
$0.90
55
1.95
11.02
MiMo-V2-Omni
256k
XiaomiXiaomi
35*
$0.00
79
2.65
34.43
Gemini 3.5 Flash (minimal)
1M
GoogleGoogle
35*
$1.31
167
0.90
3.89
Kimi K2.6
256k
KimiKimi
35*
$0.70
45
2.78
13.98
GLM 5V Turbo
200k
Z AIZ AI
34*
--
--
--
--
Claude Sonnet 4.6 (Non-reasoning, Low Effort)
1M
AnthropicAnthropic
34*
$2.31
44
1.28
12.72
Qwen3.5 397B A17B
262k
AlibabaAlibaba
34
$0.90
51
2.74
74.40
Hy3-preview
256k
TencentTencent
34*
$0.10
115
3.77
25.51
GPT-5.5 Instant (May 2026)
400k
OpenAIOpenAI
34*
$4.35
--
--
--
MiMo-V2-Flash (Feb 2026)
256k
XiaomiXiaomi
33*
$0.06
86
2.52
31.70
GPT-5.5 (Non-reasoning)
922k
OpenAIOpenAI
33*
$4.35
56
0.99
9.92
Qwen3.5 122B A10B
262k
AlibabaAlibaba
32
$0.68
135
2.51
20.97
Qwen3.5 397B A17B
262k
AlibabaAlibaba
32*
$0.90
52
2.65
12.34
Qwen3.6 35B A3B
262k
AlibabaAlibaba
32
$0.37
171
2.47
36.88
DeepSeek V4 Pro
1M
DeepSeekDeepSeek
31*
$0.18
79
1.78
8.11
Qwen3.5 Omni Plus
256k
AlibabaAlibaba
31*
$0.84
52
2.44
12.04
Ring-2.6-1T
262k
InclusionAIInclusionAI
31
$0.52
132
3.46
22.35
o3
200k
OpenAIOpenAI
30*
$1.55
136
6.55
10.23
GPT-5.4 nano
400k
OpenAIOpenAI
30*
$0.18
170
4.39
7.33
Mistral Medium 3.5
256k
MistralMistral
30
$1.16
138
2.17
20.31
GPT-5.4 mini (medium)
400k
OpenAIOpenAI
30*
$0.65
156
4.35
7.55
Step 3.7 Flash
256k
StepFunStepFun
30
$0.18
384
0.95
7.46
Claude 4.5 Haiku
200k
AnthropicAnthropic
30
$0.77
105
12.82
17.60
Gemma 4 31B
256k
GoogleGoogle
29
$0.00
35
1.08
65.65
Command A+
192k
CohereCohere
29*
$0.00
199
0.39
12.94
Qwen3.6 27B
262k
AlibabaAlibaba
29*
$0.90
62
3.74
11.86
DeepSeek V4 Flash
1M
DeepSeekDeepSeek
29*
$0.06
107
1.34
6.03
JT-35B-Flash
256k
China MobileChina Mobile
28*
--
--
--
--
Qwen3.5 122B A10B
262k
AlibabaAlibaba
28*
$0.68
156
2.47
5.68
MiMo-V2.5-Pro
1M
XiaomiXiaomi
28*
$0.58
56
2.79
11.67
Gemini 2.5 Pro
1M
GoogleGoogle
27*
$1.34
124
17.34
21.39
Hy3-preview
256k
TencentTencent
26*
$0.10
131
4.33
8.14
Ling-2.6-1T
262k
InclusionAIInclusionAI
26*
$0.52
--
--
--
Step 3.5 Flash 2603
256k
StepFunStepFun
26*
$0.06
205
1.17
13.36
Doubao Seed Code
256k
ByteDance SeedByteDance Seed
26*
--
--
--
--
Gemma 4 26B A4B
256k
GoogleGoogle
26
$0.14
--
--
--
NVIDIA Nemotron 3 Super
1M
NVIDIANVIDIA
25
$0.28
146
1.18
18.27
Mercury 2
128k
InceptionInception
25*
$0.14
900
3.60
4.15
Gemini 3.1 Flash-Lite
1M
GoogleGoogle
25
$0.22
286
5.39
7.14
Qwen3.5 9B
262k
AlibabaAlibaba
25*
$0.11
63
1.55
41.31
Gemma 4 31B
256k
GoogleGoogle
25*
$0.17
40
2.25
14.73
Grok 4.3 (Non-reasoning)
1M
xAIxAI
25
$0.64
129
0.71
4.59
K-EXAONE
256k
LG AI ResearchLG AI Research
25*
--
--
--
--
Trinity Large Thinking
512k
Arcee AIArcee AI
24*
$0.24
158
1.08
16.92
Qwen3.6 35B A3B
262k
AlibabaAlibaba
24*
$0.56
183
2.33
5.06
gpt-oss-120b (high)
131k
OpenAIOpenAI
24
$0.20
340
0.95
8.30
Claude 4.5 Haiku
200k
AnthropicAnthropic
24*
$0.77
89
0.79
6.39
Qwen3.5 35B A3B
262k
AlibabaAlibaba
23*
$0.42
172
2.42
5.33
MiMo-V2-Flash
256k
XiaomiXiaomi
23*
$0.12
93
4.02
9.41
EXAONE 4.5 33B
262k
LG AI ResearchLG AI Research
23*
--
--
--
--
HyperNova 60B 2605
131k
Multiverse ComputingMultiverse Computing
22*
$0.05
362
0.69
7.60
Gemma 4 12B
256k
GoogleGoogle
22*
$0.12
124
2.43
22.64
ERNIE 5.0 Thinking Preview
128k
BaiduBaidu
22*
--
--
--
--
Nova 2.0 Pro Preview (medium)
256k
AmazonAmazon
22
$1.47
126
12.53
32.33
Nemotron Cascade 2 30B A3B
1M
NVIDIANVIDIA
21*
--
--
--
--
Qwen3 Coder Next
256k
AlibabaAlibaba
21*
$0.43
63
2.00
9.97
Nova 2.0 Omni (medium)
1M
AmazonAmazon
21*
$0.52
--
--
--
Mistral Small 4
256k
MistralMistral
21*
$0.20
170
0.77
15.43
North Mini Code
256k
CohereCohere
21*
$0.00
151
0.33
16.88
Nova 2.0 Lite (high)
1M
AmazonAmazon
21*
$0.52
144
22.47
39.77
Qwen3.5 9B
262k
AlibabaAlibaba
20*
--
--
--
--
Magistral Medium 1.2
128k
MistralMistral
20*
$2.30
41
1.77
62.10
Gemma 4 26B A4B
256k
GoogleGoogle
20*
$0.16
40
1.93
14.32
Qwen3.5 4B
262k
AlibabaAlibaba
20*
$0.04
31
0.88
80.26
Qwen3 Next 80B A3B
262k
AlibabaAlibaba
20*
$1.05
174
2.24
16.58
Nova 2.0 Pro Preview (low)
256k
AmazonAmazon
20*
$2.13
120
9.73
30.52
Ling 2.6 Flash
262k
InclusionAIInclusionAI
19*
$0.06
--
--
--
Nova 2.0 Lite (medium)
1M
AmazonAmazon
19*
$0.52
139
24.73
42.73
Qwen3.5 Omni Flash
256k
AlibabaAlibaba
19*
$0.17
249
1.97
3.98
JT-MINI
128k
China MobileChina Mobile
19*
--
--
--
--
Nova 2.0 Lite (low)
1M
AmazonAmazon
18*
$0.52
141
15.41
33.15
gpt-oss-120b (low)
131k
OpenAIOpenAI
18*
$0.20
357
0.89
7.90
GPT-5.4 nano
400k
OpenAIOpenAI
18*
$0.18
164
0.59
3.63
NVIDIA Nemotron 3 Nano
1M
NVIDIANVIDIA
18*
$0.07
97
1.75
27.52
LongCat Flash Lite
256k
LongCatLongCat
17*
$0.00
--
--
--
K-EXAONE
256k
LG AI ResearchLG AI Research
17*
--
--
--
--
GPT-5.4 mini
400k
OpenAIOpenAI
17*
$0.65
147
0.68
4.09
Nova 2.0 Omni (low)
1M
AmazonAmazon
17*
$0.52
--
--
--
Nova 2.0 Pro Preview
256k
AmazonAmazon
16*
$2.13
122
1.09
5.20
Mi:dm K 2.5 Pro
128k
Korea TelecomKorea Telecom
16*
--
--
--
--
Mistral Large 3
256k
MistralMistral
16*
$0.60
51
1.18
11.04
Qwen3.5 4B
262k
AlibabaAlibaba
16*
$0.04
33
0.88
15.92
INTELLECT-3
131k
Prime IntellectPrime Intellect
16*
--
--
--
--
Devstral 2
256k
MistralMistral
15*
$0.00
43
1.46
13.12
Solar Open 100B
128k
UpstageUpstage
15*
--
--
--
--
Nemotron 3 Nano Omni 30B A3B Reasoning
256k
NVIDIANVIDIA
15*
$0.10
281
1.02
9.93
gpt-oss-20B (high)
131k
OpenAIOpenAI
15
$0.07
217
0.79
12.32
gpt-oss-20B (low)
131k
OpenAIOpenAI
14*
$0.07
224
0.85
12.02
Llama 4 Maverick
1M
MetaMeta
14
$0.34
93
0.99
6.38
Solar Pro 3
128k
UpstageUpstage
14
--
--
--
--
Qwen3 Next 80B A3B
262k
AlibabaAlibaba
14*
$0.65
177
2.23
5.05
Gemma 4 12B (Non-reasoning)
262k
GoogleGoogle
13*
$0.12
133
2.84
6.59
Devstral Small 2
256k
MistralMistral
13*
$0.00
56
1.22
10.14
Motif-2-12.7B
128k
Motif TechnologiesMotif Technologies
13*
--
--
--
--
Nova Premier
1M
AmazonAmazon
13*
$2.18
33
2.93
18.00
Gemma 4 E4B
128k
GoogleGoogle
12*
--
--
--
--
Llama Nemotron Super 49B v1.5
128k
NVIDIANVIDIA
12*
$0.13
50
1.24
51.60
Mistral Small 4
256k
MistralMistral
12*
$0.20
156
0.81
4.02
MiniCPM5-1B
128k
OpenBMBOpenBMB
12*
--
--
--
--
Magistral Small 1.2
128k
MistralMistral
12*
$0.60
108
0.88
24.12
Sarvam 105B (high)
128k
SarvamSarvam
12*
$0.04
97
2.11
27.78
Nova 2.0 Lite
1M
AmazonAmazon
12*
$0.52
132
1.30
5.10
MiniCPM5-1B
128k
OpenBMBOpenBMB
12*
--
--
--
--
EXAONE 4.0 32B
131k
LG AI ResearchLG AI Research
11*
--
--
--
--
Nova 2.0 Omni
1M
AmazonAmazon
11*
$0.52
--
--
--
Qwen3.5 2B
262k
AlibabaAlibaba
10*
$0.03
38
0.76
66.20
Nanbeige4.1-3B
256k
NanbeigeNanbeige
10*
--
--
--
--
Llama 4 Scout
10M
MetaMeta
10
$0.22
107
0.84
5.50
Ministral 3 14B
256k
MistralMistral
10*
$0.20
92
0.87
6.29
Falcon-H1R-7B
256k
TII UAETII UAE
10*
--
--
--
--
Qwen3 Omni 30B A3B
65.5k
AlibabaAlibaba
10*
$0.32
85
2.02
31.44
Step3 VL 10B
65.5k
StepFunStepFun
9*
--
--
--
--
Gemma 4 E2B
128k
GoogleGoogle
9*
--
--
--
--
Llama Nemotron Ultra
128k
NVIDIANVIDIA
9*
$0.72
51
2.39
50.95
ERNIE 4.5 300B A47B
131k
BaiduBaidu
9*
$0.36
--
--
--
Solar Pro 2
65.5k
UpstageUpstage
9*
--
--
--
--
NVIDIA Nemotron Nano 12B v2 VL
128k
NVIDIANVIDIA
9*
$0.24
283
0.43
9.27
Ministral 3 8B
256k
MistralMistral
9*
$0.15
102
0.70
5.61
Gemma 4 E4B
128k
GoogleGoogle
9*
--
--
--
--
Granite 4.1 30B
131k
IBMIBM
9
--
--
--
--
NVIDIA Nemotron Nano 9B V2
131k
NVIDIANVIDIA
9*
$0.05
93
5.14
32.13
NVIDIA Nemotron 3 Nano 4B
262k
NVIDIANVIDIA
9*
--
--
--
--
Qwen3.5 2B
262k
AlibabaAlibaba
9*
$0.03
31
0.80
16.87
Llama Nemotron Super 49B v1.5
128k
NVIDIANVIDIA
9*
$0.13
49
1.25
11.38
Llama 3.3 70B
128k
MetaMeta
9*
$0.59
92
1.61
7.04
Kimi Linear 48B A3B Instruct
1M
KimiKimi
9*
--
--
--
--
Llama 3.1 405B
128k
MetaMeta
9*
$3.13
66
2.36
9.94
LFM2.5-8B-A1B
32.8k
Liquid AILiquid AI
8*
$0.00
231
2.78
13.61
Ring-flash-2.0
128k
InclusionAIInclusionAI
8*
$0.18
--
--
--
Solar Pro 2
65.5k
UpstageUpstage
8*
--
--
--
--
Command A
256k
CohereCohere
8*
$3.25
73
1.59
8.40
Llama 3.1 Nemotron 70B
128k
NVIDIANVIDIA
8*
$1.20
304
5.00
6.64
NVIDIA Nemotron 3 Nano
1M
NVIDIANVIDIA
7*
$0.07
90
0.40
5.97
NVIDIA Nemotron Nano 9B V2
131k
NVIDIANVIDIA
7*
$0.06
158
1.56
4.72
Granite 4.1 8B
131k
IBMIBM
7*
$0.06
121
0.83
4.96
Sarvam 30B (high)
65.5k
SarvamSarvam
7*
$0.03
165
1.90
17.07
Gemma 4 E2B
128k
GoogleGoogle
6*
--
--
--
--
R1 1776
128k
PerplexityPerplexity
6*
--
--
--
--
Llama 3.2 90B (Vision)
128k
MetaMeta
6*
$1.38
58
1.16
9.77
EXAONE 4.0 32B
131k
LG AI ResearchLG AI Research
6*
--
--
--
--
Ministral 3 3B
256k
MistralMistral
6*
$0.10
182
0.56
3.30
Jamba 1.7 Large
256k
AI21 LabsAI21 Labs
5*
$2.60
59
1.65
10.07
Granite 4.0 H Small
128k
IBMIBM
5*
$0.08
415
10.30
11.51
Qwen3 Omni 30B A3B
65.5k
AlibabaAlibaba
5*
$0.32
95
1.92
7.18
Qwen3.5 0.8B
262k
AlibabaAlibaba
5*
$0.01
33
0.80
76.59
LFM2 24B A2B
32.8k
Liquid AILiquid AI
5*
$0.04
130
0.61
4.44
Phi-4
16k
MicrosoftMicrosoft
5*
$0.16
40
1.99
14.53
Nova Micro
130k
AmazonAmazon
5*
$0.03
306
0.98
2.62
NVIDIA Nemotron Nano 12B v2 VL
128k
NVIDIANVIDIA
5*
$0.24
213
0.84
3.18
Phi-4 Multimodal
128k
MicrosoftMicrosoft
5*
$0.00
16
1.78
32.81
Qwen3.5 0.8B
262k
AlibabaAlibaba
4*
$0.01
38
0.87
14.07
MiniCPM-V 4.6 1.3B
262k
OpenBMBOpenBMB
4
--
--
--
--
Jamba Reasoning 3B
262k
AI21 LabsAI21 Labs
4*
--
--
--
--
Reka Flash 3
128k
Reka AIReka AI
4*
$0.26
--
--
--
Ling-mini-2.0
131k
InclusionAIInclusionAI
4*
--
--
--
--
Llama 3.2 11B (Vision)
128k
MetaMeta
3*
$0.25
52
0.75
10.46
Granite 4.1 3B
131k
IBMIBM
3*
--
--
--
--
Phi-4 Mini
128k
MicrosoftMicrosoft
3*
$0.00
45
0.89
11.89
Exaone 4.0 1.2B
64k
LG AI ResearchLG AI Research
3*
--
--
--
--
Exaone 4.0 1.2B
64k
LG AI ResearchLG AI Research
3*
--
--
--
--
LFM2.5-1.2B-Thinking
32k
Liquid AILiquid AI
3*
--
--
--
--
Jamba 1.7 Mini
258k
AI21 LabsAI21 Labs
3*
--
--
--
--
LFM2 2.6B
32.8k
Liquid AILiquid AI
3*
$0.00
337
1.44
2.92
LFM2.5-1.2B-Instruct
32k
Liquid AILiquid AI
3*
$0.00
497
1.41
2.41
Granite 4.0 H 1B
128k
IBMIBM
3*
--
--
--
--
Gemma 3 270M
32k
GoogleGoogle
2*
--
--
--
--
Apertus 70B Instruct
65.5k
Swiss AI InitiativeSwiss AI Initiative
2*
$1.03
--
--
--
Granite 4.0 Micro
128k
IBMIBM
2*
--
--
--
--
Granite 4.0 1B
128k
IBMIBM
2*
--
--
--
--
LFM2 8B A1B
32.8k
Liquid AILiquid AI
2*
$0.00
--
--
--
LFM2.5-VL-1.6B
32k
Liquid AILiquid AI
1*
$0.00
422
1.50
2.69
Granite 4.0 350M
32.8k
IBMIBM
1*
--
--
--
--
Tiny Aya Global
8.19k
CohereCohere
1*
$0.00
--
--
--
Apertus 8B Instruct
65.5k
Swiss AI InitiativeSwiss AI Initiative
1*
$0.11
--
--
--
Granite 4.0 H 350M
32.8k
IBMIBM
1*
--
--
--
--
EXAONE 4.5 33B
262k
LG AI ResearchLG AI Research
--
--
--
--
--
Gemini 3 Deep Think
128k
GoogleGoogle
--
--
--
--
--
Mi:dm K 2.5 Pro Preview
128k
Korea TelecomKorea Telecom
--
--
--
--
--
GPT-5.5 Pro (xhigh)
922k
OpenAIOpenAI
--
--
--
--
--

Key definitions

Maximum number of combined input & output tokens. Output tokens commonly have a significantly lower limit (varied by model).

Tokens per second received while the model is generating tokens (ie. after first chunk has been received from the API for models which support streaming).

Time to first token received, in seconds, after API request sent. For reasoning models which share reasoning tokens, this will be the first reasoning token. For models which do not support streaming, this represents time to receive the completion.

Price per token, shown in USD per million tokens. Price is a blend of cache hit, input, and output token prices using the selected ratio (default 7:2:1 cache-input-output).

Price per token generated by the model (received from the API), represented as USD per million Tokens.

Price per token included in the request/message sent to the API, represented as USD per million Tokens.

Metrics are 'live' and are based on the past 72 hours of measurements, measurements are taken 8 times a day for single requests and 2 times per day for parallel requests.

Frequently Asked Questions

Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) currently ranks #1 on the Artificial Analysis LLM Leaderboard with an Intelligence Index score of 60, out of 47 models ranked.

The top models by Intelligence Index are: 1. Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) (60), 2. Claude Opus 4.8 (Adaptive Reasoning, Max Effort) (56), 3. GPT-5.5 (xhigh) (55), 4. Claude Opus 4.7 (Adaptive Reasoning, Max Effort) (54), 5. GPT-5.5 (high) (53).

Mercury 2 is the fastest at 900.3 tokens per second, followed by LFM2.5-1.2B-Instruct (497.3 t/s) and LFM2 1.2B (438.5 t/s).

Qwen3.5 0.8B (Non-reasoning) is the most affordable at $0.01 per 1M tokens (blended 7:2:1 cache hit/input/output ratio), followed by Qwen3.5 0.8B (Reasoning) ($0.01) and Gemma 3n E4B Instruct ($0.02).

GLM-5.2 (max) is the highest-ranked open weights model with an Intelligence Index score of 51. There are 26 open weights models out of 47 total on the leaderboard.

The top open weights models by Intelligence Index are: 1. GLM-5.2 (max) (51), 2. MiniMax-M3 (44), 3. DeepSeek V4 Pro (Reasoning, Max Effort) (44).

Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) leads among 42 reasoning models with an Intelligence Index score of 60. Reasoning models use extended thinking to solve complex problems before responding.

The leaderboard includes filters to narrow results by model type (reasoning vs non-reasoning), openness (open weights vs proprietary), and other criteria. You can also adjust prompt options to see how performance varies with different input lengths.

Click on any model name in the leaderboard to visit its dedicated comparison page with detailed charts covering intelligence, pricing, speed, latency, and more. You can also compare API providers for each model. View all models