LLM Leaderboard - Comparison of over 100 AI models from OpenAI, Google, DeepSeek & others

Comparison and ranking the performance of over 100 AI models (LLMs) across key metrics including intelligence, price, performance and speed (output speed - tokens per second & latency - TTFT), context window & others. For more details including relating to our methodology, see our FAQs.

Intelligence

Gemini 3.1 Pro Preview and GPT-5.4 (xhigh) are the highest intelligence models, followed by GPT-5.3 Codex (xhigh) and Claude Opus 4.6 (max).

Output Speed

Mercury 2 and Granite 4.0 H Small are the fastest models, followed by Granite 3.3 8B and Gemini 2.5 Flash-Lite.

Latency

Qwen3.5 0.8B and Qwen3.5 2B are the lowest latency models, followed by NVIDIA Nemotron 3 Nano and Ministral 3 3B.

Price

Qwen3.5 0.8B and Qwen3.5 0.8B are the cheapest models, followed by Gemma 3n E4B and Qwen3.5 2B.

Context Window

Llama 4 Scout and Grok 4.1 Fast support the largest context windows, followed by Grok 4.1 Fast and Gemini 1.5 Pro (May).

Further Analysis
Gemini 3.1 Pro Preview
1M
GoogleGoogle
57
$4.50
115
33.27
37.61
GPT-5.4 (xhigh)
1.05M
OpenAIOpenAI
57
$5.63
82
198.43
204.50
GPT-5.3 Codex (xhigh)
400k
OpenAIOpenAI
54
$4.81
68
96.11
103.48
Claude Opus 4.6 (max)
1M
AnthropicAnthropic
53
$10.00
47
19.31
30.02
Muse Spark
262k
MetaMeta
52
--
--
--
--
Claude Sonnet 4.6 (max)
1M
AnthropicAnthropic
52
$6.00
62
94.28
102.36
GLM-5.1
200k
Z AIZ AI
51
$2.15
66
1.73
66.38
Qwen3.6 Plus
1M
AlibabaAlibaba
50
$1.13
50
2.34
124.26
GLM-5
200k
Z AIZ AI
50
$1.55
72
1.70
51.64
MiniMax-M2.7
205k
MiniMaxMiniMax
50
$0.53
42
3.18
73.14
Grok 4.20 0309 v2
2M
xAIxAI
49
$3.00
194
11.08
13.65
MiMo-V2-Pro
1M
XiaomiXiaomi
49
$1.50
69
2.81
52.62
GPT-5.4 mini (xhigh)
400k
OpenAIOpenAI
49
$1.69
167
8.14
11.14
Kimi K2.5
256k
KimiKimi
47
$1.20
37
2.11
95.92
GLM-5-Turbo
200k
Z AIZ AI
47
--
--
--
--
Claude Opus 4.6
1M
AnthropicAnthropic
46
$10.00
40
1.82
14.33
Gemini 3 Flash
1M
GoogleGoogle
46
$1.13
153
8.04
11.31
Qwen3.5 397B A17B
262k
AlibabaAlibaba
45
$1.35
80
2.49
48.41
MiMo-V2-Omni-0327
256k
XiaomiXiaomi
45
--
--
--
--
Claude Sonnet 4.6
1M
AnthropicAnthropic
44
$6.00
43
1.48
13.07
GPT-5.4 nano (xhigh)
400k
OpenAIOpenAI
44
$0.46
188
4.52
7.18
GLM-5.1
200k
Z AIZ AI
44
$2.15
47
1.65
12.40
MiMo-V2-Omni
256k
XiaomiXiaomi
43
--
--
--
--
GLM 5V Turbo
200k
Z AIZ AI
43
--
--
--
--
Claude Sonnet 4.6 (Non-reasoning, Low Effort)
1M
AnthropicAnthropic
43
$6.00
42
1.42
13.25
Qwen3.5 27B
262k
AlibabaAlibaba
42
$0.82
87
5.69
34.38
DeepSeek V3.2
128k
DeepSeekDeepSeek
42
$0.32
46
2.13
56.75
Qwen3.5 122B A10B
262k
AlibabaAlibaba
42
$1.10
121
2.36
23.08
MiMo-V2-Flash (Feb 2026)
256k
XiaomiXiaomi
41
$0.15
130
2.22
21.43
Gemini 3 Pro Preview (low)
1M
GoogleGoogle
41
$4.50
--
--
--
GLM-5
200k
Z AIZ AI
41
$1.55
48
2.33
12.83
Qwen3.5 397B A17B
262k
AlibabaAlibaba
40
$1.35
80
2.72
8.98
Qwen3 Max Thinking
256k
AlibabaAlibaba
40
$2.40
37
3.98
71.21
Gemma 4 31B
256k
GoogleGoogle
39
$0.00
36
1.70
71.58
Qwen3.5 Omni Plus
256k
AlibabaAlibaba
39
$1.50
48
2.54
12.88
Grok 4.1 Fast
2M
xAIxAI
39
$0.28
119
7.16
11.37
o3
200k
OpenAIOpenAI
38
$3.50
88
11.05
16.74
GPT-5.4 nano
400k
OpenAIOpenAI
38
$0.46
187
4.32
7.00
Step 3.5 Flash
256k
StepFunStepFun
38
$0.15
165
2.35
17.49
GPT-5.4 mini (medium)
400k
OpenAIOpenAI
38
$1.69
166
12.87
15.89
Kimi K2.5
256k
KimiKimi
37
$1.20
35
3.12
17.24
Qwen3.5 27B
262k
AlibabaAlibaba
37
$0.82
88
5.62
11.32
Qwen3.5 35B A3B
262k
AlibabaAlibaba
37
$0.69
185
2.07
15.58
Claude 4.5 Haiku
200k
AnthropicAnthropic
37
$2.00
99
14.79
19.87
NVIDIA Nemotron 3 Super
1M
NVIDIANVIDIA
36
$0.41
172
0.96
15.48
Qwen3.5 122B A10B
262k
AlibabaAlibaba
36
$1.10
139
2.43
6.02
Nova 2.0 Pro Preview (medium)
256k
AmazonAmazon
36
$3.44
126
12.03
31.86
GPT-5.4
1.05M
OpenAIOpenAI
35
$5.63
59
0.78
9.31
Gemini 3 Flash
1M
GoogleGoogle
35
$1.13
165
1.50
4.53
Gemini 2.5 Pro
1M
GoogleGoogle
35
$3.44
117
29.01
33.29
Nova 2.0 Lite (high)
1M
AmazonAmazon
35*
$0.85
157
11.99
27.89
Gemini 3.1 Flash-Lite Preview
1M
GoogleGoogle
34
$0.56
186
9.07
11.76
Doubao Seed Code
256k
ByteDance SeedByteDance Seed
34
--
--
--
--
gpt-oss-120B (high)
131k
OpenAIOpenAI
33
$0.26
215
0.86
12.46
Mercury 2
128k
InceptionInception
33
$0.38
848
3.79
4.38
Qwen3.5 9B
262k
AlibabaAlibaba
32
$0.10
105
0.65
24.35
Gemma 4 31B
256k
GoogleGoogle
32
--
--
--
--
K-EXAONE
256k
LG AI ResearchLG AI Research
32
--
--
--
--
DeepSeek V3.2
128k
DeepSeekDeepSeek
32
$0.32
45
2.02
13.11
Grok 3 mini Reasoning (high)
1M
xAIxAI
32
$0.35
189
0.58
13.80
Nova 2.0 Pro Preview (low)
256k
AmazonAmazon
32
$3.44
133
10.04
28.90
Trinity Large Thinking
512k
Arcee AIArcee AI
32
$0.40
101
1.04
25.89
Gemma 4 26B A4B
256k
GoogleGoogle
31
$0.20
--
--
--
Claude 4.5 Haiku
200k
AnthropicAnthropic
31
$2.00
91
0.76
6.26
Qwen3.5 35B A3B
262k
AlibabaAlibaba
31
$0.69
173
2.06
4.94
MiMo-V2-Flash
256k
XiaomiXiaomi
30
$0.15
128
2.34
6.24
Nova 2.0 Lite (medium)
1M
AmazonAmazon
30
$0.85
164
16.04
31.24
DeepSeek V3.2 Speciale
128k
DeepSeekDeepSeek
29
--
--
--
--
ERNIE 5.0 Thinking Preview
128k
BaiduBaidu
29
--
--
--
--
Grok 4.20 0309 v2
2M
xAIxAI
29
$3.00
180
0.55
3.33
Grok Code Fast 1
256k
xAIxAI
29
$0.53
138
3.96
7.58
Qwen3 Coder Next
256k
AlibabaAlibaba
28
$0.60
145
1.86
5.31
Nova 2.0 Omni (medium)
1M
AmazonAmazon
28
$0.85
--
--
--
Nemotron Cascade 2 30B A3B
262k
NVIDIANVIDIA
28
--
--
--
--
Qwen3.5 9B
262k
AlibabaAlibaba
27
$0.08
158
0.64
3.79
Mistral Small 4
256k
MistralMistral
27
$0.26
145
3.30
20.49
Magistral Medium 1.2
128k
MistralMistral
27
$2.75
86
1.68
30.86
Gemma 4 26B A4B
256k
GoogleGoogle
27
--
--
--
--
Qwen3.5 4B
262k
AlibabaAlibaba
27
$0.06
190
0.68
13.82
DeepSeek R1 0528
128k
DeepSeekDeepSeek
27
$2.36
--
--
--
Qwen3 Next 80B A3B
262k
AlibabaAlibaba
27
$1.88
172
2.16
16.71
Solar Pro 3
128k
UpstageUpstage
26
--
--
--
--
Qwen3.5 Omni Flash
256k
AlibabaAlibaba
26
$0.28
164
1.92
4.97
Qwen3 Coder 480B
262k
AlibabaAlibaba
25
$3.00
57
2.96
11.73
Nova 2.0 Lite (low)
1M
AmazonAmazon
25
$0.85
173
10.16
24.63
gpt-oss-120B (low)
131k
OpenAIOpenAI
24
$0.26
213
0.84
12.56
gpt-oss-20B (high)
131k
OpenAIOpenAI
24
$0.09
251
0.70
10.66
GPT-5.4 nano
400k
OpenAIOpenAI
24
$0.46
185
0.61
3.32
NVIDIA Nemotron 3 Nano
1M
NVIDIANVIDIA
24
$0.10
101
1.67
26.51
LongCat Flash Lite
256k
LongCatLongCat
24
$0.00
116
6.78
11.10
Grok 4.1 Fast
2M
xAIxAI
24
$0.28
131
0.57
4.38
K-EXAONE
256k
LG AI ResearchLG AI Research
23
--
--
--
--
GPT-5.4 mini
400k
OpenAIOpenAI
23
$1.69
157
0.72
3.91
Nova 2.0 Omni (low)
1M
AmazonAmazon
23
$0.85
--
--
--
Mi:dm K 2.5 Pro
128k
Korea TelecomKorea Telecom
23
--
--
--
--
Nova 2.0 Pro Preview
256k
AmazonAmazon
23
$3.44
124
1.03
5.07
Mistral Large 3
256k
MistralMistral
23
$0.75
38
1.68
14.91
Ring-1T
128k
InclusionAIInclusionAI
23
--
--
--
--
Qwen3.5 4B
262k
AlibabaAlibaba
23
$0.06
199
0.82
3.34
INTELLECT-3
131k
Prime IntellectPrime Intellect
22
--
--
--
--
Devstral 2
256k
MistralMistral
22
$0.00
71
0.90
7.91
Solar Open 100B
128k
UpstageUpstage
22
--
--
--
--
Gemini 2.5 Flash-Lite (Sep)
1M
GoogleGoogle
22
$0.17
--
--
--
Mistral Medium 3.1
128k
MistralMistral
21
$0.80
100
1.37
6.37
gpt-oss-20B (low)
131k
OpenAIOpenAI
21
$0.09
215
0.73
12.37
Qwen3 Next 80B A3B
262k
AlibabaAlibaba
20
$0.88
163
2.24
5.31
Devstral Small 2
256k
MistralMistral
19
$0.00
72
0.93
7.84
Gemini 2.5 Flash-Lite (Sep)
1M
GoogleGoogle
19
$0.17
--
--
--
Motif-2-12.7B
128k
Motif TechnologiesMotif Technologies
19
--
--
--
--
Ling-1T
128k
InclusionAIInclusionAI
19
--
--
--
--
Nova Premier
1M
AmazonAmazon
19
$5.00
26
3.03
22.47
Gemma 4 E4B
128k
GoogleGoogle
19
--
--
--
--
Llama Nemotron Super 49B v1.5
128k
NVIDIANVIDIA
19
$0.17
55
1.45
46.84
Mistral Small 4
256k
MistralMistral
19
$0.26
117
1.82
6.07
Llama 3.3 Nemotron Super 49B
128k
NVIDIANVIDIA
18*
--
--
--
--
Llama 4 Maverick
1M
MetaMeta
18
$0.49
111
1.03
5.55
Sarvam 105B (high)
128k
SarvamSarvam
18
$0.00
109
2.48
25.32
Magistral Small 1.2
128k
MistralMistral
18
$0.75
161
0.85
16.37
Nova 2.0 Lite
1M
AmazonAmazon
18
$0.85
165
1.29
4.32
Llama 3.1 405B
128k
MetaMeta
17
$3.69
29
2.37
19.38
EXAONE 4.0 32B
131k
LG AI ResearchLG AI Research
17
--
--
--
--
Nova 2.0 Omni
1M
AmazonAmazon
17
$0.85
192
1.09
3.70
DeepSeek R1 0528 Qwen3 8B
32.8k
DeepSeekDeepSeek
16*
--
--
--
--
Qwen3.5 2B
262k
AlibabaAlibaba
16
$0.04
--
--
--
Nanbeige4.1-3B
256k
NanbeigeNanbeige
16
--
--
--
--
Ministral 3 14B
256k
MistralMistral
16
$0.20
108
0.72
5.34
DeepSeek R1 Distill Llama 70B
128k
DeepSeekDeepSeek
16*
$0.88
39
2.96
67.68
Falcon-H1R-7B
256k
TII UAETII UAE
16
--
--
--
--
Ling-flash-2.0
128k
InclusionAIInclusionAI
16
$0.25
62
2.43
10.46
Qwen3 Omni 30B A3B
65.5k
AlibabaAlibaba
16
$0.43
91
1.91
29.32
Step3 VL 10B
65.5k
StepFunStepFun
15
--
--
--
--
Gemma 4 E2B
128k
GoogleGoogle
15
--
--
--
--
Llama Nemotron Ultra
128k
NVIDIANVIDIA
15
$0.90
41
2.52
63.41
ERNIE 4.5 300B A47B
131k
BaiduBaidu
15
$0.48
25
3.53
23.56
Solar Pro 2
65.5k
UpstageUpstage
15
--
--
--
--
NVIDIA Nemotron Nano 12B v2 VL
128k
NVIDIANVIDIA
15
$0.30
139
0.64
18.65
Ministral 3 8B
256k
MistralMistral
15
$0.15
180
0.56
3.34
Gemma 4 E4B
128k
GoogleGoogle
15
--
--
--
--
NVIDIA Nemotron Nano 9B V2
131k
NVIDIANVIDIA
15
$0.07
156
0.64
16.64
NVIDIA Nemotron 3 Nano 4B
262k
NVIDIANVIDIA
15
--
--
--
--
Qwen3.5 2B
262k
AlibabaAlibaba
15
$0.04
270
0.40
2.25
Llama Nemotron Super 49B v1.5
128k
NVIDIANVIDIA
15
$0.17
55
1.50
10.64
Llama 3.3 70B
128k
MetaMeta
14
$0.64
87
1.39
7.15
Llama 3.1 Nemotron Nano 4B v1.1
128k
NVIDIANVIDIA
14*
--
--
--
--
Kimi Linear 48B A3B Instruct
1M
KimiKimi
14*
--
--
--
--
Llama 3.3 Nemotron Super 49B
128k
NVIDIANVIDIA
14*
--
--
--
--
Ring-flash-2.0
128k
InclusionAIInclusionAI
14
$0.25
85
2.67
31.95
Solar Pro 2
65.5k
UpstageUpstage
14
--
--
--
--
Llama 4 Scout
10M
MetaMeta
14
$0.29
143
0.78
4.27
Command A
256k
CohereCohere
13
$4.38
32
2.14
17.82
Llama 3.1 Nemotron 70B
128k
NVIDIANVIDIA
13
$1.20
46
1.89
12.82
NVIDIA Nemotron 3 Nano
1M
NVIDIANVIDIA
13
$0.09
84
0.46
6.38
NVIDIA Nemotron Nano 9B V2
131k
NVIDIANVIDIA
13
$0.09
163
1.08
4.14
Sarvam 30B (high)
65.5k
SarvamSarvam
12
$0.00
142
1.91
19.49
Gemma 4 E2B
128k
GoogleGoogle
12
--
--
--
--
R1 1776
128k
PerplexityPerplexity
12*
--
--
--
--
Llama 3.2 90B (Vision)
128k
MetaMeta
12*
$0.72
54
1.07
10.32
EXAONE 4.0 32B
131k
LG AI ResearchLG AI Research
12
--
--
--
--
Ministral 3 3B
256k
MistralMistral
11
$0.10
278
0.47
2.27
Jamba 1.7 Large
256k
AI21 LabsAI21 Labs
11
$3.50
59
1.41
9.91
Granite 4.0 H Small
128k
IBMIBM
11
$0.11
449
10.25
11.36
Qwen3 Omni 30B A3B
65.5k
AlibabaAlibaba
11
$0.43
95
1.91
7.17
Qwen3.5 0.8B
262k
AlibabaAlibaba
11
$0.02
--
--
--
LFM2 24B A2B
32.8k
Liquid AILiquid AI
10
$0.05
52
0.55
10.13
Phi-4
16k
Microsoft AzureMicrosoft Azure
10
$0.22
34
2.15
16.76
Nova Micro
130k
AmazonAmazon
10
$0.06
285
0.79
2.55
NVIDIA Nemotron Nano 12B v2 VL
128k
NVIDIANVIDIA
10
$0.30
143
1.07
4.56
Phi-4 Multimodal
128k
Microsoft AzureMicrosoft Azure
10*
$0.00
17
0.83
29.93
Qwen3.5 0.8B
262k
AlibabaAlibaba
10
$0.02
305
0.37
2.01
Jamba Reasoning 3B
262k
AI21 LabsAI21 Labs
10
--
--
--
--
Reka Flash 3
128k
Reka AIReka AI
10
$0.35
--
--
--
Ling-mini-2.0
131k
InclusionAIInclusionAI
9
--
--
--
--
Llama 3.2 11B (Vision)
128k
MetaMeta
9
$0.16
52
0.78
10.42
Phi-4 Mini
128k
Microsoft AzureMicrosoft Azure
8
$0.00
43
0.83
12.47
Exaone 4.0 1.2B
64k
LG AI ResearchLG AI Research
8
--
--
--
--
Exaone 4.0 1.2B
64k
LG AI ResearchLG AI Research
8
--
--
--
--
LFM2.5-1.2B-Thinking
32k
Liquid AILiquid AI
8
--
--
--
--
Jamba 1.7 Mini
258k
AI21 LabsAI21 Labs
8
--
--
--
--
LFM2 2.6B
32.8k
Liquid AILiquid AI
8
$0.00
--
--
--
LFM2.5-1.2B-Instruct
32k
Liquid AILiquid AI
8
$0.00
--
--
--
Granite 4.0 H 1B
128k
IBMIBM
8
--
--
--
--
Gemma 3 270M
32k
GoogleGoogle
8
--
--
--
--
Apertus 70B Instruct
65.5k
Swiss AI InitiativeSwiss AI Initiative
8
$1.34
--
--
--
Granite 4.0 Micro
128k
IBMIBM
8
--
--
--
--
Granite 4.0 1B
128k
IBMIBM
7
--
--
--
--
LFM2 8B A1B
32.8k
Liquid AILiquid AI
7
$0.00
--
--
--
LFM2.5-VL-1.6B
32k
Liquid AILiquid AI
6
$0.00
--
--
--
Granite 4.0 350M
32.8k
IBMIBM
6
--
--
--
--
Apertus 8B Instruct
65.5k
Swiss AI InitiativeSwiss AI Initiative
6
$0.13
--
--
--
Granite 4.0 H 350M
32.8k
IBMIBM
5
--
--
--
--
Tiny Aya Global
8.19k
CohereCohere
5
--
--
--
--
GPT-5.4 Pro (xhigh)
1.05M
OpenAIOpenAI
--
$67.50
--
--
--
Gemini 3 Deep Think
128k
GoogleGoogle
--
--
--
--
--
Mi:dm K 2.5 Pro Preview
128k
Korea TelecomKorea Telecom
--
--
--
--
--

Key definitions

Maximum number of combined input & output tokens. Output tokens commonly have a significantly lower limit (varied by model).

Tokens per second received while the model is generating tokens (ie. after first chunk has been received from the API for models which support streaming).

Time to first token received, in seconds, after API request sent. For reasoning models which share reasoning tokens, this will be the first reasoning token. For models which do not support streaming, this represents time to receive the completion.

Price per token, represented as USD per million Tokens. Price is a blend of Input & Output token prices (3:1 ratio).

Price per token generated by the model (received from the API), represented as USD per million Tokens.

Price per token included in the request/message sent to the API, represented as USD per million Tokens.

Metrics are 'live' and are based on the past 72 hours of measurements, measurements are taken 8 times a day for single requests and 2 times per day for parallel requests.

Frequently Asked Questions

Gemini 3.1 Pro Preview currently ranks #1 on the Artificial Analysis LLM Leaderboard with an Intelligence Index score of 57, out of 319 models ranked.

The top models by Intelligence Index are: 1. Gemini 3.1 Pro Preview (57), 2. GPT-5.4 (xhigh) (57), 3. GPT-5.3 Codex (xhigh) (54), 4. Claude Opus 4.6 (Adaptive Reasoning, Max Effort) (53), 5. Muse Spark (52).

Mercury 2 is the fastest at 848.5 tokens per second, followed by Granite 4.0 H Small (449.3 t/s) and Granite 3.3 8B (Non-reasoning) (335.6 t/s).

Qwen3.5 0.8B (Non-reasoning) is the most affordable at $0.02 per 1M tokens (blended 3:1 input-to-output), followed by Qwen3.5 0.8B (Reasoning) ($0.02) and Gemma 3n E4B Instruct ($0.03).

GLM-5.1 (Reasoning) is the highest-ranked open weights model with an Intelligence Index score of 51. There are 197 open weights models out of 319 total on the leaderboard.

The top open weights models by Intelligence Index are: 1. GLM-5.1 (Reasoning) (51), 2. GLM-5 (Reasoning) (50), 3. Kimi K2.5 (Reasoning) (47).

Gemini 3.1 Pro Preview leads among 159 reasoning models with an Intelligence Index score of 57. Reasoning models use extended thinking to solve complex problems before responding.

The leaderboard includes filters to narrow results by model type (reasoning vs non-reasoning), openness (open weights vs proprietary), and other criteria. You can also adjust prompt options to see how performance varies with different input lengths.

Click on any model name in the leaderboard to visit its dedicated comparison page with detailed charts covering intelligence, pricing, speed, latency, and more. You can also compare API providers for each model. View all models