Comparisons of Large Open Source AI Models (>150B)
Open source AI models with over 150B parameters. Models are considered open source (also commonly referred to as open weights) where their weights are accessible to download. This allows self-hosting on your own infrastructure and enables customizing the model such as through fine-tuning. Click on any model to see detailed metrics. For more details including relating to our methodology, see our FAQs.
Kimi K2.6 and Highlights
Openness
Artificial Analysis Openness Index: Results
Openness Index assesses model openness on a 0 to 100 normalized scale (higher is more open)
Reasoning models are indicated by a lightbulb icon
Intelligence
Artificial Analysis Intelligence Index
Artificial Analysis Intelligence Index v4.0 incorporates 10 evaluations: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt
Reasoning models are indicated by a lightbulb icon
Intelligence Evaluations
Intelligence evaluations measured independently by Artificial Analysis · Higher is better
Results claimed by AI Lab (not yet independently verified)
GDPval-AA
Agentic real-world work tasks, (ELO-500)/2000
Terminal-Bench Hard
Agentic coding & terminal use
𝜏²-Bench Telecom
Agentic tool use
AA-LCR
Long context reasoning
AA-Omniscience Accuracy
Knowledge
AA-Omniscience Non-Hallucination Rate
1 - hallucination rate
Humanity's Last Exam
Reasoning & knowledge
GPQA Diamond
Scientific reasoning
SciCode
Coding
IFBench
Instruction following
CritPt
Physics reasoning
APEX-Agents-AA
Long-horizon agentic tasks
MMMU-Pro
Visual reasoning
Reasoning models are indicated by a lightbulb icon.
Size
Model Size: Total and Active Parameters
Comparison between total model parameters and parameters active during inference
Reasoning models are indicated by a lightbulb icon
Intelligence vs. Active Parameters
Active parameters at inference time · Artificial Analysis Intelligence Index
Most attractive quadrant
Alibaba
DeepSeek
Kimi
MiniMax
Tencent
Xiaomi
Z AI
Reasoning models are indicated by a lightbulb icon.
Intelligence vs. Total Parameters
Artificial Analysis Intelligence Index · Size in parameters (billions)
Most attractive quadrant
Alibaba
DeepSeek
Kimi
MiniMax
Tencent
Xiaomi
Z AI
Reasoning models are indicated by a lightbulb icon.
Context Window
Context Window
Context window: tokens limit · Higher is better
Reasoning models are indicated by a lightbulb icon
Further details
| Weights | Provider Benchmarks | |||||||
|---|---|---|---|---|---|---|---|---|
Kimi K2.6 Kimi | 54 | 1.0KB (32B active at inference time) | 256k | $1.7 | 94 | 🤗 | +9 more | View |
MiMo-V2.5-Pro Xiaomi | 54 | 1.0KB (42B active at inference time) | 1.00M | $1.5 | 57 | 🤗 | +1 more | View |
DeepSeek V4 Pro (Reasoning, Max Effort) DeepSeek | 52 | 1.6KB (49B active at inference time) | 1.00M | $2.2 | 30 | 🤗 | +5 more | View |
GLM-5.1 (Reasoning) Z AI | 51 | 744B (40B active at inference time) | 200k | $2.1 | 58 | 🤗 | +8 more | View |
DeepSeek V4 Pro (Reasoning, High Effort) DeepSeek | 50 | 1.6KB (49B active at inference time) | 1.00M | $2.2 | 29 | 🤗 | +6 more | View |
GLM-5 (Reasoning) Z AI | 50 | 744B (40B active at inference time) | 200k | $1.6 | 68 | 🤗 | +9 more | View |
MiniMax-M2.7 MiniMax | 50 | 230B (10B active at inference time) | 205k | $0.5 | 47 | 🤗 | +3 more | View |
MiMo-V2.5 Xiaomi | 49 | 310B (15B active at inference time) | 1.00M | $0.7 | 93 | 🤗 | View | |
DeepSeek V4 Flash (Reasoning, Max Effort) DeepSeek | 47 | 284B (13B active at inference time) | 1.00M | $0.2 | 97 | 🤗 | +2 more | View |
DeepSeek V4 Flash (Reasoning, High Effort) DeepSeek | 46 | 284B (13B active at inference time) | 1.00M | $0.2 | - | 🤗 | +2 more | View |
Qwen3.5 397B A17B (Reasoning) Alibaba | 45 | 397B (17B active at inference time) | 262k | $1.4 | 53 | 🤗 | +9 more | View |
GLM-5.1 (Non-reasoning) Z AI | 44 | 744B (40B active at inference time) | 200k | $2.1 | 46 | 🤗 | +4 more | View |
Kimi K2.6 (Non-reasoning) Kimi | 43 | 1.0KB (32B active at inference time) | 256k | $1.7 | 78 | 🤗 | +7 more | View |
Hy3-preview (Reasoning) Tencent | 42 | 295B (21B active at inference time) | 256k | $0.2 | 117 | 🤗 | View | |
MiMo-V2-Flash (Feb 2026) Xiaomi | 41 | 309B (15B active at inference time) | 256k | $0.2 | 153 | 🤗 | View | |
GLM-5 (Non-reasoning) Z AI | 41 | 744B (40B active at inference time) | 200k | $1.6 | 54 | 🤗 | +3 more | View |
Qwen3.5 397B A17B (Non-reasoning) Alibaba | 40 | 397B (17B active at inference time) | 262k | $1.4 | 53 | 🤗 | +6 more | View |
DeepSeek V4 Pro (Non-reasoning) DeepSeek | 39 | 1.6KB (49B active at inference time) | 1.00M | $2.2 | 30 | 🤗 | View | |
Ring-2.6-1T InclusionAI | 38 | 1.0KB (63B active at inference time) | 262k | - | - | 🤗 | - | View |
Kimi K2.5 (Non-reasoning) Kimi | 37 | 1.0KB (32B active at inference time) | 256k | $1.2 | 45 | 🤗 | +7 more | View |
DeepSeek V4 Flash (Non-reasoning) DeepSeek | 36 | 284B (13B active at inference time) | 1.00M | $0.2 | 96 | 🤗 | View | |
MiMo-V2.5-Pro (Non-reasoning) Xiaomi | 36 | 1.0KB (41.7B active at inference time) | 1.00M | $1.5 | 61 | 🤗 | +1 more | View |
Hy3-preview (Non-reasoning) Tencent | 34 | 295B (21B active at inference time) | 256k | $0.2 | 120 | 🤗 | View | |
Ling-2.6-1T InclusionAI | 34 | 1.0KB (63B active at inference time) | 262k | $0.8 | - | 🤗 | View | |
K-EXAONE (Reasoning) LG AI Research | 32 | 236B (23B active at inference time) | 256k | - | - | 🤗 | - | View |
Trinity Large Thinking Arcee AI | 32 | 399B (13B active at inference time) | 512k | $0.4 | 135 | 🤗 | View | |
MiMo-V2-Flash (Non-reasoning) Xiaomi | 30 | 309B (15B active at inference time) | 256k | $0.2 | 153 | 🤗 | View | |
DeepSeek R1 0528 (May '25) DeepSeek | 27 | 685B (37B active at inference time) | 128k | $2.1 | - | 🤗 | +3 more | View |
K-EXAONE (Non-reasoning) LG AI Research | 23 | 236B (23B active at inference time) | 256k | - | - | 🤗 | - | View |
Mistral Large 3 Mistral | 23 | 675B (41B active at inference time) | 256k | $0.8 | 49 | 🤗 | View | |
Ring-1T InclusionAI | 23 | 1.0KB (50B active at inference time) | 128k | - | - | 🤗 | - | View |
Ling-1T InclusionAI | 19 | 1.0KB (50B active at inference time) | 128k | - | - | 🤗 | - | View |
Hermes 4 - Llama-3.1 405B (Reasoning) Nous Research | 19 | 406B | 128k | $1.5 | 34 | 🤗 | View | |
Llama 4 Maverick Meta | 18 | 402B (17B active at inference time) | 1.00M | $0.5 | 117 | 🤗 | +6 more | View |
Hermes 4 - Llama-3.1 405B (Non-reasoning) Nous Research | 18 | 406B | 128k | $1.5 | 34 | 🤗 | View | |
Llama 3.1 Instruct 405B Meta | 17 | 405B | 128k | $3.7 | 60 | 🤗 | +1 more | View |
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) NVIDIA | 15 | 253B | 128k | $0.9 | 41 | 🤗 | View | |
ERNIE 4.5 300B A47B Baidu | 15 | 300B (47B active at inference time) | 131k | $0.5 | 23 | 🤗 | View | |
R1 1776 Perplexity | 12 | 671B (37B active at inference time) | 128k | - | - | 🤗 | - | View |
Jamba 1.7 Large AI21 Labs | 11 | 398B (94B active at inference time) | 256k | $3.5 | 61 | 🤗 | View | |
Cogito v2.1 (Reasoning) Deep Cogito | - | 671B (37B active at inference time) | 128k | $1.3 | 61 | 🤗 | View |