Comparisons of Large Open Source AI Models (>150B)

Open source AI models with over 150B parameters. Models are considered Open Source (also commonly referred to as open weights) where their weights are accessible to download. This allows self-hosting on your own infrastructure and enables customizing the model such as through fine-tuning. Click on any model to see detailed metrics. For more details including relating to our methodology, see our FAQs.
Kimi logoKimi K2.6 and Xiaomi logoMiMo-V2.5-Pro are the highest intelligence Large open source models, defined as those with >150B parameters, followed by DeepSeek logoDeepSeek V4 Pro (Max) & Z AI logoGLM-5.1.

Highlights

Intelligence
Artificial Analysis Intelligence Index; Higher is better
Total Parameters
Trainable parameters in billions

Openness

Artificial Analysis Openness Index: Results

Openness Index assesses model openness on a 0 to 100 normalized scale (higher is more open)

Intelligence

Artificial Analysis Intelligence Index

Artificial Analysis Intelligence Index v4.0 incorporates 10 evaluations: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt
Reasoning models are indicated by a lightbulb icon.

Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Intelligence Evaluations

Intelligence evaluations measured independently by Artificial Analysis; Higher is better
Reasoning models are indicated by a lightbulb icon.

While model intelligence generally translates across use cases, specific evaluations may be more relevant for certain use cases.

Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Size

Model Size: Total and Active Parameters

Comparison between total model parameters and parameters active during inference
Active Parameters
Passive Parameters
Reasoning models are indicated by a lightbulb icon.

The total number of trainable weights and biases in the model, expressed in billions. These parameters are learned during training and determine the model's ability to process and generate responses.

The number of parameters actually executed during each inference forward pass, expressed in billions. For Mixture of Experts (MoE) models, a routing mechanism selects a subset of experts per token, resulting in fewer active than total parameters. Dense models use all parameters, so active equals total.

Intelligence vs. Active Parameters

Active Parameters at Inference Time; Artificial Analysis Intelligence Index
Most attractive quadrant
Alibaba
DeepSeek
Kimi
MiniMax
Tencent
Xiaomi
Z AI
Reasoning models are indicated by a lightbulb icon.

Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

The number of parameters actually executed during each inference forward pass, expressed in billions. For Mixture of Experts (MoE) models, a routing mechanism selects a subset of experts per token, resulting in fewer active than total parameters. Dense models use all parameters, so active equals total.

Intelligence vs. Total Parameters

Artificial Analysis Intelligence Index; Size in Parameters (Billions)
Most attractive quadrant
Alibaba
DeepSeek
Kimi
MiniMax
Tencent
Xiaomi
Z AI
Reasoning models are indicated by a lightbulb icon.

Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

The total number of trainable weights and biases in the model, expressed in billions. These parameters are learned during training and determine the model's ability to process and generate responses.

Context Window

Context Window

Context Window: Tokens Limit; Higher is better
Reasoning models are indicated by a lightbulb icon.

Larger context windows are relevant to RAG (Retrieval Augmented Generation) LLM workflows which typically involve reasoning and information retrieval of large amounts of data.

Maximum number of combined input & output tokens. Output tokens commonly have a significantly lower limit (varied by model).

Further details

WeightsProvider
Benchmarks
Kimi logo
Kimi K2.6
Kimi
54
1.0KB
(32B active at inference time)
256k
$1.7
-
🤗
Together.ai
Parasail
SiliconFlow
+6 more
View
Xiaomi logo
MiMo-V2.5-Pro
Xiaomi
54
1.0KB
(42B active at inference time)
1.00M
$1.5
60
Not available
Xiaomi
View
DeepSeek logo
DeepSeek V4 Pro (Reasoning, Max Effort)
DeepSeek
52
1.6KB
(49B active at inference time)
1.00M
$2.2
36
🤗
Fireworks
DeepInfra
DeepSeek
+3 more
View
Z AI logo
GLM-5.1 (Reasoning)
Z AI
51
744B
(40B active at inference time)
200k
$2.1
48
🤗
Parasail
Novita
DeepInfra
+6 more
View
DeepSeek logo
DeepSeek V4 Pro (Reasoning, High Effort)
DeepSeek
50
1.6KB
(49B active at inference time)
1.00M
$2.2
36
🤗
DeepInfra
Lightning AI
DeepSeek
+4 more
View
Z AI logo
GLM-5 (Reasoning)
Z AI
50
744B
(40B active at inference time)
200k
$1.6
64
🤗
SiliconFlow
Novita
Fireworks
+9 more
View
MiniMax logo
MiniMax-M2.7
MiniMax
50
230B
(10B active at inference time)
205k
$0.5
47
🤗
Novita
MiniMax
Together.ai
+1 more
View
DeepSeek logo
DeepSeek V4 Flash (Reasoning, Max Effort)
DeepSeek
47
284B
(13B active at inference time)
1.00M
$0.2
81
🤗
Novita
DeepSeek
SiliconFlow
View
Alibaba logo
Qwen3.5 397B A17B (Reasoning)
Alibaba
45
397B
(17B active at inference time)
262k
$1.4
53
🤗
Nebius
GMI
Together.ai
+7 more
View
DeepSeek logo
DeepSeek V4 Flash (Reasoning, High Effort)
DeepSeek
45
284B
(13B active at inference time)
1.00M
$0.2
-
🤗
DeepSeek
Novita
DeepInfra
+1 more
View
Z AI logo
GLM-5.1 (Non-reasoning)
Z AI
44
744B
(40B active at inference time)
200k
$2.1
42
🤗
SiliconFlow
Novita
DeepInfra
+2 more
View
Tencent logo
Hy3-preview (Reasoning)
Tencent
42
295B
(21B active at inference time)
256k
-
69
🤗
SiliconFlow
View
DeepSeek logo
DeepSeek V3.2 (Reasoning)
DeepSeek
42
685B
(37B active at inference time)
128k
$0.3
-
🤗
Google
DeepInfra
?
+12 more
View
Xiaomi logo
MiMo-V2-Flash (Feb 2026)
Xiaomi
41
309B
(15B active at inference time)
256k
$0.1
122
🤗
Xiaomi
View
Z AI logo
GLM-5 (Non-reasoning)
Z AI
41
744B
(40B active at inference time)
200k
$1.6
60
🤗
Novita
SiliconFlow
Nebius
+3 more
View
Alibaba logo
Qwen3.5 397B A17B (Non-reasoning)
Alibaba
40
397B
(17B active at inference time)
262k
$1.4
53
🤗
DigitalOcean
DeepInfra
Nebius
+5 more
View
Kimi logo
Kimi K2.5 (Non-reasoning)
Kimi
37
1.0KB
(32B active at inference time)
256k
$1.2
35
🤗
Nebius
GMI
Together.ai
+5 more
View
LG AI Research logo
K-EXAONE (Reasoning)
LG AI Research
32
236B
(23B active at inference time)
256k
-
-
🤗
-
View
DeepSeek logo
DeepSeek V3.2 (Non-reasoning)
DeepSeek
32
685B
(37B active at inference time)
128k
$0.3
-
🤗
Amazon Bedrock
GMI
Eigen AI
+12 more
View
Arcee AI logo
Trinity Large Thinking
Arcee AI
32
399B
(13B active at inference time)
512k
$0.4
129
🤗
Arcee AI
Parasail
View
Xiaomi logo
MiMo-V2-Flash (Non-reasoning)
Xiaomi
30
309B
(15B active at inference time)
256k
$0.1
121
🤗
Xiaomi
View
DeepSeek logo
DeepSeek V3.2 Speciale
DeepSeek
29
685B
(37B active at inference time)
128k
-
-
🤗
-
View
DeepSeek logo
DeepSeek R1 0528 (May '25)
DeepSeek
27
685B
(37B active at inference time)
128k
$2.4
-
🤗
DeepInfra
Together.ai
Google
+3 more
View
LG AI Research logo
K-EXAONE (Non-reasoning)
LG AI Research
23
236B
(23B active at inference time)
256k
-
-
🤗
-
View
Mistral logo
Mistral Large 3
Mistral
23
675B
(41B active at inference time)
256k
$0.8
48
🤗
Microsoft Azure
Amazon Bedrock
Mistral
View
InclusionAI logo
Ring-1T
InclusionAI
23
1.0KB
(50B active at inference time)
128k
-
-
🤗
-
View
InclusionAI logo
Ling-1T
InclusionAI
19
1.0KB
(50B active at inference time)
128k
-
-
🤗
-
View
Nous Research logo
Hermes 4 - Llama-3.1 405B (Reasoning)
Nous Research
19
406B
128k
$1.5
35
🤗
Nebius
View
Meta logo
Llama 4 Maverick
Meta
18
402B
(17B active at inference time)
1.00M
$0.5
108
🤗
Parasail
Microsoft Azure
SambaNova
+6 more
View
Nous Research logo
Hermes 4 - Llama-3.1 405B (Non-reasoning)
Nous Research
18
406B
128k
$1.5
34
🤗
Nebius
View
Meta logo
Llama 3.1 Instruct 405B
Meta
17
405B
128k
$3.7
29
🤗
Amazon Bedrock
Amazon Bedrock
Databricks
+1 more
View
NVIDIA logo
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)
NVIDIA
15
253B
128k
$0.9
42
🤗
Nebius
View
Baidu logo
ERNIE 4.5 300B A47B
Baidu
15
300B
(47B active at inference time)
131k
$0.5
24
🤗
SiliconFlow
Novita
View
Perplexity logo
R1 1776
Perplexity
12
671B
(37B active at inference time)
128k
-
-
🤗
-
View
AI21 Labs logo
Jamba 1.7 Large
AI21 Labs
11
398B
(94B active at inference time)
256k
$3.5
62
🤗
AI21 Labs
View
Deep Cogito logo
Cogito v2.1 (Reasoning)
Deep Cogito
-
671B
(37B active at inference time)
128k
$1.3
38
🤗
Together.ai
View