Comparisons of Large Open Source AI Models (>150B)

Open source AI models with over 150 billion parameters. Models are considered Open Source (also commonly referred to as open weights) where their weights are accessible to download. This allows self-hosting on your own infrastructure and enables customizing the model such as through fine-tuning. Click on any model to see detailed metrics. For more details including relating to our methodology, see our FAQs.

DeepSeek logoDeepSeek R1 0528 (May '25) and MiniMax logoMiniMax M1 80k are the highest intelligence Large open source models, defined as those with >150B parameters, followed by Alibaba logoQwen3 235B (Reasoning) & MiniMax logoMiniMax M1 40k.

Highlights

Intelligence
Artificial Analysis Intelligence Index; Higher is better
Estimate (independent evaluation forthcoming)
Total Parameters
Trainable parameters in billions
Further details
WeightsProvider
Benchmarks
DeepSeek logo
DeepSeek R1 0528 (May '25)
DeepSeek
68
685B
(37B active at inference time)
128k
$1.0
22
🤗
Hyperbolic
Google
Lambda
+13 more
View
MiniMax logo
MiniMax M1 80k
MiniMax
63
456B
(45.9B active at inference time)
1.00M
$0.8
-
🤗
MiniMax
View
Alibaba logo
Qwen3 235B A22B (Reasoning)
Alibaba
62
235B
(22B active at inference time)
128k
$2.6
51
🤗
kluster.ai
Novita
Fireworks
+6 more
View
MiniMax logo
MiniMax M1 40k
MiniMax
61
456B
(45.9B active at inference time)
1.00M
$0.8
35
🤗
MiniMax
View
NVIDIA logo
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)
NVIDIA
61
253B
128k
$0.9
43
🤗
Nebius
View
DeepSeek logo
DeepSeek R1 (Jan '25)
DeepSeek
60
685B
(37B active at inference time)
128k
$2.4
-
🤗
Amazon Bedrock
Lambda
Novita
+12 more
View
DeepSeek logo
DeepSeek V3 0324 (Mar '25)
DeepSeek
53
671B
(37B active at inference time)
128k
$0.5
24
🤗
Microsoft Azure
DeepSeek
kluster.ai
+12 more
View
Meta logo
Llama 4 Maverick
Meta
51
402B
(17B active at inference time)
1.00M
$0.4
162
🤗
Fireworks
Google
Parasail
+12 more
View
Alibaba logo
Qwen3 235B A22B
Alibaba
47
235B
(22B active at inference time)
128k
$1.2
62
🤗
Alibaba Cloud
GMI
View
DeepSeek logo
DeepSeek V3 (Dec '24)
DeepSeek
46
671B
(37B active at inference time)
128k
$0.5
-
🤗
Together.ai
Deepinfra
Nebius
+6 more
View
Meta logo
Llama 3.1 Instruct 405B
Meta
40
405B
128k
$3.5
33
🤗
Parasail
Deepinfra
Amazon Bedrock
+12 more
View
MiniMax logo
MiniMax-Text-01
MiniMax
40
456B
(45.9B active at inference time)
4.00M
$0.4
27
🤗
MiniMax
View
Allen Institute for AI logo
Llama 3.1 Tulu3 405B
Allen Institute for AI
40
405B
128k
-
-
🤗
-
View
DeepSeek logo
DeepSeek-V2.5 (Dec '24)
DeepSeek
35
236B
(21B active at inference time)
128k
$0.2
-
🤗
DeepSeek
View
DeepSeek logo
DeepSeek-V2.5
DeepSeek
35
236B
(21B active at inference time)
128k
$0.2
-
🤗
DeepSeek
View
Perplexity logo
R1 1776
Perplexity
34
671B
(37B active at inference time)
128k
$3.5
-
🤗
Perplexity
View
AI21 Labs logo
Jamba 1.5 Large
AI21 Labs
29
398B
(94B active at inference time)
256k
$3.5
-
🤗
Microsoft Azure
View
DeepSeek logo
DeepSeek-Coder-V2
DeepSeek
29
236B
(21B active at inference time)
128k
$0.2
-
🤗
DeepSeek
View
AI21 Labs logo
Jamba 1.6 Large
AI21 Labs
29
398B
(94B active at inference time)
256k
$3.5
55
🤗
AI21 Labs
View
DeepSeek logo
DeepSeek-V2-Chat
DeepSeek
23
236B
(21B active at inference time)
128k
$0.2
-
🤗
DeepSeek
View
Snowflake logo
Arctic Instruct
Snowflake
22
480B
(17B active at inference time)
4.00k
-
-
🤗
-
View