Comparisons of Large Open Source AI Models (>150B)
Open source AI models with over 150 billion parameters. Models are considered Open Source (also commonly referred to as open weights) where their weights are accessible to download. This allows self-hosting on your own infrastructure and enables customizing the model such as through fine-tuning. Click on any model to see detailed metrics. For more details including relating to our methodology, see our FAQs.
DeepSeek R1 0528 (May '25) and
MiniMax M1 80k are the highest intelligence Large open source models, defined as those with >150B parameters, followed by
Qwen3 235B (Reasoning) &
MiniMax M1 40k.
Highlights
Intelligence
Artificial Analysis Intelligence Index; Higher is better
Estimate (independent evaluation forthcoming)
Total Parameters
Trainable parameters in billions
Further details
Weights | Provider Benchmarks | |||||||
---|---|---|---|---|---|---|---|---|
DeepSeek R1 0528 (May '25) DeepSeek | 68 | 685B (37B active at inference time) | 128k | $1.0 | 22 | 🤗 | +13 more | View |
![]() MiniMax M1 80k MiniMax | 63 | 456B (45.9B active at inference time) | 1.00M | $0.8 | - | 🤗 | ![]() | View |
Qwen3 235B A22B (Reasoning) Alibaba | 62 | 235B (22B active at inference time) | 128k | $2.6 | 51 | 🤗 | ![]() ![]() +6 more | View |
![]() MiniMax M1 40k MiniMax | 61 | 456B (45.9B active at inference time) | 1.00M | $0.8 | 35 | 🤗 | ![]() | View |
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) NVIDIA | 61 | 253B | 128k | $0.9 | 43 | 🤗 | ![]() | View |
DeepSeek R1 (Jan '25) DeepSeek | 60 | 685B (37B active at inference time) | 128k | $2.4 | - | 🤗 | +12 more | View |
DeepSeek V3 0324 (Mar '25) DeepSeek | 53 | 671B (37B active at inference time) | 128k | $0.5 | 24 | 🤗 | ![]() +12 more | View |
Llama 4 Maverick Meta | 51 | 402B (17B active at inference time) | 1.00M | $0.4 | 162 | 🤗 | ![]() +12 more | View |
Qwen3 235B A22B Alibaba | 47 | 235B (22B active at inference time) | 128k | $1.2 | 62 | 🤗 | View | |
DeepSeek V3 (Dec '24) DeepSeek | 46 | 671B (37B active at inference time) | 128k | $0.5 | - | 🤗 | ![]() ![]() +6 more | View |
Llama 3.1 Instruct 405B Meta | 40 | 405B | 128k | $3.5 | 33 | 🤗 | ![]() +12 more | View |
![]() MiniMax-Text-01 MiniMax | 40 | 456B (45.9B active at inference time) | 4.00M | $0.4 | 27 | 🤗 | ![]() | View |
![]() Llama 3.1 Tulu3 405B Allen Institute for AI | 40 | 405B | 128k | - | - | 🤗 | - | View |
DeepSeek-V2.5 (Dec '24) DeepSeek | 35 | 236B (21B active at inference time) | 128k | $0.2 | - | 🤗 | View | |
DeepSeek-V2.5 DeepSeek | 35 | 236B (21B active at inference time) | 128k | $0.2 | - | 🤗 | View | |
![]() R1 1776 Perplexity | 34 | 671B (37B active at inference time) | 128k | $3.5 | - | 🤗 | ![]() | View |
![]() Jamba 1.5 Large AI21 Labs | 29 | 398B (94B active at inference time) | 256k | $3.5 | - | 🤗 | View | |
DeepSeek-Coder-V2 DeepSeek | 29 | 236B (21B active at inference time) | 128k | $0.2 | - | 🤗 | View | |
![]() Jamba 1.6 Large AI21 Labs | 29 | 398B (94B active at inference time) | 256k | $3.5 | 55 | 🤗 | ![]() | View |
DeepSeek-V2-Chat DeepSeek | 23 | 236B (21B active at inference time) | 128k | $0.2 | - | 🤗 | View | |
Arctic Instruct Snowflake | 22 | 480B (17B active at inference time) | 4.00k | - | - | 🤗 | - | View |