Comparison of Open Source Models

Name: Artificial Analysis Intelligence Index
Creator: Artificial Analysis
License: https://artificialanalysis.ai/docs/legal/Terms-of-Use.pdf

Comparison and analysis of open source AI models across key metrics, including quality, inference speed, context window, parameter count, and licensing details. Models are considered Open Source (also commonly referred to as open weights) when their weights are available to download. This allows self-hosting on your own infrastructure and customization of the model, for example through fine-tuning. For more details on our methodology, see our FAQs.

GLM-5 (Z AI) and Kimi K2.5 (Kimi) are the highest-intelligence open source models, followed by Qwen3.5 397B A17B (Alibaba) and GLM-4.7.

Intelligence

Artificial Analysis Intelligence Index; Higher is better

Estimate (independent evaluation forthcoming)

Total Parameters

Trainable parameters in billions

Openness

Artificial Analysis Openness Index: Results

Openness Index assesses model openness on a 0 to 100 normalized scale (higher is more open)

Open Source Progress

Progress in Open Weights vs. Proprietary Intelligence

Artificial Analysis Intelligence Index v4.0 incorporates 10 evaluations: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt

Open Weights

Proprietary

Reasoning models are indicated by a lightbulb icon.

Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Open Weights: Indicates whether the model weights are available. Models are labelled 'Commercial Use Restricted' if the weights are available but commercial use is limited (typically by requiring a paid license).

Artificial Analysis Intelligence Index

Open Source Language Models Intelligence By Lab Over Time

Labs shown: Alibaba, DeepSeek, Google

Open Source Models Intelligence By Size Over Time

Large Models (>150B)

Medium Models (40B-150B)

Small Models (4B-40B)

Tiny Models (≤4B)

Tiny: Less than or equal to 4B parameters. These are usually the smallest models in terms of resource demand.
Small: More than 4B and less than 40B parameters.
Medium: Between 40B and 150B parameters.
Large: Over 150B parameters.
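
The size classes above can be expressed as a small lookup. The sketch below is illustrative only: the function name `size_class` is hypothetical, and the boundary handling (exactly 40B and exactly 150B both counted as Medium) is an assumption read from the wording of the definitions, not a documented rule.

```python
def size_class(total_params_b: float) -> str:
    """Map a total parameter count (in billions) to a size class.

    Boundary handling is an assumption based on the definitions:
    Tiny is <= 4B, Small is below 40B, Large is strictly over 150B.
    """
    if total_params_b <= 4:
        return "Tiny"
    if total_params_b < 40:
        return "Small"
    if total_params_b <= 150:
        return "Medium"
    return "Large"


# Parameter counts taken from the table on this page.
examples = {
    "Qwen3.5 0.8B": 0.873,
    "Qwen3.5 27B": 27.8,
    "gpt-oss-120B": 117,
    "GLM-5": 744,
}

for name, params in examples.items():
    print(f"{name}: {size_class(params)}")
```

Note that the classes are based on total parameters, so a Mixture of Experts model with few active parameters can still be classed as Large.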

Intelligence Evaluations

Intelligence evaluations measured independently by Artificial Analysis; Higher is better

Results claimed by AI Lab (not yet independently verified)

GDPval-AA (Agentic Real-World Work Tasks, (ELO-500)/2000)

Terminal-Bench Hard (Agentic Coding & Terminal Use)

𝜏²-Bench Telecom (Agentic Tool Use)

AA-LCR (Long Context Reasoning)

AA-Omniscience Accuracy (Knowledge)

AA-Omniscience Non-Hallucination Rate (1 - Hallucination Rate)

Humanity's Last Exam (Reasoning & Knowledge)

GPQA Diamond (Scientific Reasoning)

SciCode (Coding)

IFBench (Instruction Following)

CritPt (Physics Reasoning)

MMMU Pro (Visual Reasoning)
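
Two of the labels above describe a transformation applied to the raw result: GDPval-AA is shown as (ELO-500)/2000, and the AA-Omniscience Non-Hallucination Rate as 1 - Hallucination Rate. A minimal sketch of both, assuming the labels are applied literally; the function names and the input values are hypothetical and are not results from this page.

```python
def normalize_gdpval_elo(elo: float) -> float:
    """Rescale an ELO rating as (ELO - 500) / 2000, per the chart label."""
    return (elo - 500) / 2000


def non_hallucination_rate(hallucination_rate: float) -> float:
    """Convert a hallucination rate in [0, 1] to its complement."""
    if not 0.0 <= hallucination_rate <= 1.0:
        raise ValueError("hallucination_rate must be between 0 and 1")
    return 1.0 - hallucination_rate


# Hypothetical inputs, chosen only to illustrate the arithmetic.
print(normalize_gdpval_elo(1500))
print(non_hallucination_rate(0.25))
```

Under this rescaling, an ELO of 500 maps to 0 and an ELO of 2500 maps to 1, which puts the score on the same higher-is-better footing as the other evaluations.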

While model intelligence generally translates across use cases, specific evaluations may be more relevant for certain use cases.

Size

Intelligence Index By Model Size

Large Models (>150B)

Medium Models (40B-150B)

Small Models (4B-40B)

Model Size: Total and Active Parameters

Comparison between total model parameters and parameters active during inference

Active Parameters

Passive Parameters

Total Parameters: The total number of trainable weights and biases in the model, expressed in billions. These parameters are learned during training and determine the model's ability to process and generate responses.

Active Parameters: The number of parameters actually executed during each inference forward pass, expressed in billions. For Mixture of Experts (MoE) models, a routing mechanism selects a subset of experts per token, resulting in fewer active than total parameters. Dense models use all parameters, so active equals total.
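
The relationship between total and active parameters can be made concrete with a little arithmetic. This is a sketch under one assumption: that the "Passive Parameters" shown in the chart are simply total minus active. The function names are hypothetical; the parameter counts come from the table on this page.

```python
def passive_params(total_b: float, active_b: float) -> float:
    """Parameters (in billions) not executed in a forward pass.

    Assumes passive = total - active, which the page does not state outright.
    """
    if active_b > total_b:
        raise ValueError("active parameters cannot exceed total parameters")
    return total_b - active_b


def active_fraction(total_b: float, active_b: float) -> float:
    """Share of the model's parameters executed per forward pass."""
    if active_b > total_b:
        raise ValueError("active parameters cannot exceed total parameters")
    return active_b / total_b


# GLM-5 is a Mixture of Experts model: 744B total, 40B active.
print(passive_params(744, 40))
print(f"{active_fraction(744, 40):.1%}")

# Qwen3.5 27B is dense, so active equals total.
print(active_fraction(27.8, 27.8))
```

A low active fraction means compute per token is closer to that of a much smaller dense model, while the full set of weights still has to be stored to serve the model.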

Intelligence vs. Active Parameters

Active Parameters at Inference Time; Artificial Analysis Intelligence Index

Most attractive quadrant

Creators shown: Alibaba, DeepSeek, Kimi, Korea Telecom, LG AI Research, MBZUAI Institute of Foundation Models

Intelligence vs. Total Parameters

Artificial Analysis Intelligence Index; Size in Parameters (Billions)

Most attractive quadrant

Creators shown: Alibaba, DeepSeek, Kimi, Korea Telecom, LG AI Research, MBZUAI Institute of Foundation Models

Context Window

Context Window: Token Limit; Higher is better

Larger context windows are relevant to RAG (Retrieval Augmented Generation) LLM workflows, which typically involve retrieving and reasoning over large amounts of data.

Context window: Maximum number of combined input and output tokens. Output tokens commonly have a significantly lower limit, which varies by model.
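
Because the context window covers input and output tokens combined, a request has to budget for both. The sketch below shows that check under stated assumptions: the function names are hypothetical, "200k" is read as 200,000 tokens, and the 8,000-token output reservation is an arbitrary example value. It does not model any separate per-model output limit.

```python
def fits_context(input_tokens: int, max_output_tokens: int,
                 context_window: int) -> bool:
    """True if the prompt plus the reserved output fits in the window."""
    return input_tokens + max_output_tokens <= context_window


def max_input_tokens(context_window: int, reserved_output: int) -> int:
    """Largest prompt that still leaves room for the reserved output."""
    return max(context_window - reserved_output, 0)


CONTEXT_WINDOW = 200_000   # assumption: "200k" read as 200,000 tokens
RESERVED_OUTPUT = 8_000    # hypothetical output budget

print(fits_context(150_000, RESERVED_OUTPUT, CONTEXT_WINDOW))
print(fits_context(198_000, RESERVED_OUTPUT, CONTEXT_WINDOW))
print(max_input_tokens(CONTEXT_WINDOW, RESERVED_OUTPUT))
```

In a RAG workflow, the value returned by `max_input_tokens` is the budget shared by the instructions, the question, and all retrieved passages.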

Further details

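Model (Creator)	Intelligence Index	Parameters	Context Window	Price ($)	Inference Speed	Weights	Provider Benchmarks	Further Details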
GLM-5 (Reasoning) Z AI	50	744B (40B active at inference time)	200k	$1.6	68	🤗	+10 more	View
Kimi K2.5 (Reasoning) Kimi	47	1,000B (32B active at inference time)	256k	$1.2	48	🤗	+14 more	View
Qwen3.5 397B A17B (Reasoning) Alibaba	45	397B (17B active at inference time)	262k	$1.4	84	🤗	+5 more	View
GLM-4.7 (Reasoning) Z AI	42	357B (32B active at inference time)	200k	$1.0	84	🤗	+8 more	View
Qwen3.5 27B (Reasoning) Alibaba	42	27.8B	262k	$0.8	89	🤗		View
MiniMax-M2.5 MiniMax	42	230B (10B active at inference time)	205k	$0.5	49	🤗	+10 more	View
DeepSeek V3.2 (Reasoning) DeepSeek	42	685B (37B active at inference time)	128k	$0.3	34	🤗	+8 more	View
Qwen3.5 122B A10B (Reasoning) Alibaba	42	125B (10B active at inference time)	262k	$1.1	132	🤗		View
MiMo-V2-Flash (Feb 2026) Xiaomi	41	309B (15B active at inference time)	256k	$0.1	128	🤗		View
Kimi K2 Thinking Kimi	41	1,000B (32B active at inference time)	256k	$1.1	88	🤗	+5 more	View
GLM-5 (Non-reasoning) Z AI	41	744B (40B active at inference time)	200k	$1.6	67	🤗	+4 more	View
Qwen3.5 397B A17B (Non-reasoning) Alibaba	40	397B (17B active at inference time)	262k	$1.4	85	🤗	+2 more	View
MiniMax-M2.1 MiniMax	39	230B (10B active at inference time)	205k	$0.5	43	🤗	+4 more	View
MiMo-V2-Flash (Reasoning) Xiaomi	39	309B (15B active at inference time)	256k	$0.1	126	🤗		View
Step 3.5 Flash StepFun	38	196B (11B active at inference time)	256k	$0.1	114	🤗		View
Kimi K2.5 (Non-reasoning) Kimi	37	1,000B (32B active at inference time)	256k	$1.2	46	🤗	+6 more	View
Qwen3.5 27B (Non-reasoning) Alibaba	37	27.8B	262k	$0.8	93	🤗		View
Qwen3.5 35B A3B (Reasoning) Alibaba	37	36B (3B active at inference time)	262k	$0.7	175	🤗		View
MiniMax-M2 MiniMax	36	230B (10B active at inference time)	205k	$0.5	47	🤗	+1 more	View
NVIDIA Nemotron 3 Super 120B A12B (Reasoning) NVIDIA	36	120.6B (12.7B active at inference time)	1.00M	$0.4	458	Not available	+2 more	View
Qwen3.5 122B A10B (Non-reasoning) Alibaba	36	125B (10B active at inference time)	262k	$1.1	126	🤗		View
GLM-4.7 (Non-reasoning) Z AI	34	357B (32B active at inference time)	200k	$0.9	78	🤗	+7 more	View
DeepSeek V3.1 Terminus (Reasoning) DeepSeek	34	685B (37B active at inference time)	128k	$0.8	-	🤗		View
gpt-oss-120B (high) OpenAI	33	117B (5.1B active at inference time)	131k	$0.3	280	🤗	+22 more	View
DeepSeek V3.2 Exp (Reasoning) DeepSeek	33	685B (37B active at inference time)	128k	$0.3	36	🤗		View
GLM-4.6 (Reasoning) Z AI	33	357B (32B active at inference time)	200k	$1.0	99	🤗	+1 more	View
Qwen3.5 9B (Reasoning) Alibaba	32	9.65B	262k	$0.1	59	🤗		View
K-EXAONE (Reasoning) LG AI Research	32	236B (23B active at inference time)	256k	-	-	🤗	-	View
DeepSeek V3.2 (Non-reasoning) DeepSeek	32	685B (37B active at inference time)	128k	$0.3	34	🤗	+11 more	View
Kimi K2 0905 Kimi	31	1,000B (32B active at inference time)	256k	$1.1	37	🤗	+1 more	View
Qwen3.5 35B A3B (Non-reasoning) Alibaba	31	36B (3B active at inference time)	262k	$0.7	157	🤗		View
MiMo-V2-Flash (Non-reasoning) Xiaomi	30	309B (15B active at inference time)	256k	$0.1	131	🤗		View
GLM-4.6 (Non-reasoning) Z AI	30	357B (32B active at inference time)	200k	$1.0	87	🤗		View
GLM-4.7-Flash (Reasoning) Z AI	30	31.2B (3B active at inference time)	200k	$0.2	58	🤗		View
Qwen3 235B A22B 2507 (Reasoning) Alibaba	30	235B (22B active at inference time)	256k	$2.6	42	🤗	+4 more	View
DeepSeek V3.2 Speciale DeepSeek	29	685B (37B active at inference time)	128k	-	-	🤗	-	View
DeepSeek V3.1 Terminus (Non-reasoning) DeepSeek	29	685B (37B active at inference time)	128k	$0.6	-	🤗	+1 more	View
DeepSeek V3.2 Exp (Non-reasoning) DeepSeek	28	685B (37B active at inference time)	128k	$0.3	34	🤗		View
Apriel-v1.5-15B-Thinker ServiceNow	28	15B	128k	-	141	🤗		View
Qwen3 Coder Next Alibaba	28	79.7B (3B active at inference time)	256k	$0.6	137	🤗	+1 more	View
DeepSeek V3.1 (Non-reasoning) DeepSeek	28	685B (37B active at inference time)	128k	$0.8	-	🤗	+8 more	View
DeepSeek V3.1 (Reasoning) DeepSeek	28	685B (37B active at inference time)	128k	$0.9	-	🤗	+2 more	View
Qwen3 VL 235B A22B (Reasoning) Alibaba	28	235B (22B active at inference time)	262k	$2.6	51	🤗		View
Apriel-v1.6-15B-Thinker ServiceNow	28	15B	128k	-	96	🤗		View
Qwen3.5 9B (Non-reasoning) Alibaba	27	9.65B	262k	-	-	🤗	-	View
Qwen3.5 4B (Reasoning) Alibaba	27	4.66B	262k	-	-	🤗	-	View
DeepSeek R1 0528 (May '25) DeepSeek	27	685B (37B active at inference time)	128k	$2.4	-	🤗	+6 more	View
Mistral Small 4 (Reasoning) Mistral	27	119B (6.5B active at inference time)	256k	$0.3	153	🤗		View
Qwen3 Next 80B A3B (Reasoning) Alibaba	27	80B (3B active at inference time)	262k	$1.9	155	🤗	+4 more	View
GLM-4.5 (Reasoning) Z AI	26	355B (32B active at inference time)	128k	$0.8	40	🤗		View
Kimi K2 Kimi	26	1,000B (32B active at inference time)	128k	$1.0	39	🤗	+2 more	View
Seed-OSS-36B-Instruct ByteDance Seed	25	36.2B	512k	$0.3	34	🤗		View
Qwen3 235B A22B 2507 Instruct Alibaba	25	235B (22B active at inference time)	256k	$1.2	60	🤗	+10 more	View
Qwen3 Coder 480B A35B Instruct Alibaba	25	480B (35B active at inference time)	262k	$3.0	57	🤗	+8 more	View
Qwen3 VL 32B (Reasoning) Alibaba	25	33.4B	256k	$2.6	88	🤗		View
gpt-oss-120B (low) OpenAI	24	117B (5.1B active at inference time)	131k	$0.3	288	🤗	+18 more	View
gpt-oss-20B (high) OpenAI	24	21B (3.6B active at inference time)	131k	$0.1	296	🤗	+9 more	View
MiniMax M1 80k MiniMax	24	456B (45.9B active at inference time)	1.00M	$1.0	-	🤗		View
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) NVIDIA	24	31.6B (3.6B active at inference time)	1.00M	$0.1	124	🤗		View
K2 Think V2 MBZUAI Institute of Foundation Models	24	70B	262k	-	-	Not available	-	View
LongCat Flash Lite LongCat	24	68.5B (3B active at inference time)	256k	-	101	🤗		View
HyperCLOVA X SEED Think (32B) Naver	24	32B	128k	-	-	🤗	-	View
GLM-4.6V (Reasoning) Z AI	23	108B	128k	$0.5	31	🤗		View
K-EXAONE (Non-reasoning) LG AI Research	23	236B (23B active at inference time)	256k	-	-	🤗	-	View
GLM-4.5-Air Z AI	23	106B (12B active at inference time)	128k	$0.4	112	🤗	+1 more	View
Mi:dm K 2.5 Pro Korea Telecom	23	32B	128k	-	-	Not available	-	View
Mistral Large 3 Mistral	23	675B (41B active at inference time)	256k	$0.8	49	🤗		View
Ring-1T InclusionAI	23	1,000B (50B active at inference time)	128k	-	-	🤗	-	View
Qwen3.5 4B (Non-reasoning) Alibaba	23	4.66B	262k	-	-	🤗	-	View
Qwen3 30B A3B 2507 (Reasoning) Alibaba	22	30.5B (3.3B active at inference time)	262k	$0.8	150	🤗		View
DeepSeek V3 0324 DeepSeek	22	671B (37B active at inference time)	128k	$1.3	-	🤗	+6 more	View
INTELLECT-3 Prime Intellect	22	107B	131k	-	-	🤗	-	View
GLM-4.7-Flash (Non-reasoning) Z AI	22	31.2B (3B active at inference time)	200k	$0.2	53	🤗		View
Devstral 2 Mistral	22	125B	256k	-	82	🤗		View
MiniMax M1 40k MiniMax	21	456B (45.9B active at inference time)	1.00M	-	-	🤗	-	View
gpt-oss-20B (low) OpenAI	21	21B (3.6B active at inference time)	131k	$0.1	299	🤗	+9 more	View
Qwen3 VL 235B A22B Instruct Alibaba	21	235B (22B active at inference time)	262k	$1.2	58	🤗	+2 more	View
K2-V2 (high) MBZUAI Institute of Foundation Models	21	70B	512k	-	-	🤗	-	View
Qwen3 Next 80B A3B Instruct Alibaba	20	80B (3B active at inference time)	262k	$0.9	149	🤗	+4 more	View
Tri-21B-think Preview Trillion Labs	20	21B	32.0k	-	-	Not available	-	View
Qwen3 Coder 30B A3B Instruct Alibaba	20	30.5B (3.3B active at inference time)	262k	$0.9	26	🤗	+2 more	View
Qwen3 235B A22B (Reasoning) Alibaba	20	235B (22B active at inference time)	32.8k	$2.6	51	🤗		View
QwQ 32B Alibaba	20	32.8B	131k	$0.7	-	🤗		View
Qwen3 VL 30B A3B (Reasoning) Alibaba	20	30B (3B active at inference time)	256k	$0.8	111	🤗	+1 more	View
Devstral Small 2 Mistral	19	24B	256k	-	194	🤗		View
Ling-1T InclusionAI	19	1,000B (50B active at inference time)	128k	-	-	🤗	-	View
DeepSeek R1 (Jan '25) DeepSeek	19	685B (37B active at inference time)	128k	$2.4	-	🤗	+6 more	View
Llama Nemotron Super 49B v1.5 (Reasoning) NVIDIA	19	49B	128k	$0.2	81	🤗		View
K2-V2 (medium) MBZUAI Institute of Foundation Models	19	70B	512k	-	-	🤗	-	View
Mistral Small 4 (Non-reasoning) Mistral	19	119B (6.5B active at inference time)	256k	$0.3	130	🤗		View
Tri-21B-Think Trillion Labs	19	21B	32.0k	-	-	Not available	-	View
Hermes 4 - Llama-3.1 405B (Reasoning) Nous Research	19	406B	128k	$1.5	29	🤗		View
Llama 3.3 Nemotron Super 49B v1 (Reasoning) NVIDIA	18	49B	128k	-	-	🤗	-	View
Llama 4 Maverick Meta	18	402B (17B active at inference time)	1.00M	$0.5	125	🤗	+10 more	View
Qwen3 4B 2507 (Reasoning) Alibaba	18	4.02B	262k	-	-	🤗	-	View
Magistral Small 1.2 Mistral	18	24B	128k	$0.8	96	🤗		View
Sarvam 105B (Reasoning) Sarvam	18	106B (10.3B active at inference time)	65.5k	-	78	🤗		View
Devstral Small (May '25) Mistral	18	23.6B	256k	$0.1	-	🤗		View
Hermes 4 - Llama-3.1 405B (Non-reasoning) Nous Research	18	406B	128k	$1.5	30	🤗		View
Llama 3.1 Instruct 405B Meta	17	405B	128k	$4.4	33	🤗	+2 more	View
Qwen3 VL 32B Instruct Alibaba	17	33.4B	256k	$1.2	73	🤗		View
DeepSeek R1 Distill Qwen 32B DeepSeek	17	32B	128k	$0.3	61	🤗		View
GLM-4.6V (Non-reasoning) Z AI	17	108B	128k	$0.5	21	🤗		View
Qwen3 235B A22B (Non-reasoning) Alibaba	17	235B (22B active at inference time)	32.8k	$1.2	46	🤗		View
Magistral Small 1 Mistral	17	23.6B	40.0k	-	-	🤗	-	View
EXAONE 4.0 32B (Reasoning) LG AI Research	17	32B	131k	-	-	🤗	-	View
Qwen3 VL 8B (Reasoning) Alibaba	17	8.77B	256k	$0.7	114	🤗		View
Qwen3 32B (Reasoning) Alibaba	17	32.8B	32.8k	$2.6	93	🤗	+4 more	View
DeepSeek V3 (Dec '24) DeepSeek	16	671B (37B active at inference time)	128k	$0.6	-	🤗	+2 more	View
DeepSeek R1 0528 Qwen3 8B DeepSeek	16	8.19B	32.8k	-	-	🤗	-	View
Qwen3.5 2B (Reasoning) Alibaba	16	2.27B	262k	-	-	🤗	-	View
Qwen3 14B (Reasoning) Alibaba	16	14.8B	32.8k	$1.3	62	🤗		View
Qwen3 VL 30B A3B Instruct Alibaba	16	30B (3B active at inference time)	256k	$0.3	104	🤗	+2 more	View
Hermes 4 - Llama-3.1 70B (Reasoning) Nous Research	16	70.6B	128k	$0.2	77	🤗		View
Ministral 3 14B Mistral	16	14B	256k	$0.2	116	🤗		View
DeepSeek R1 Distill Llama 70B DeepSeek	16	70B	128k	$0.9	53	🤗		View
DeepSeek R1 Distill Qwen 14B DeepSeek	16	14B	128k	-	-	🤗	-	View
Falcon-H1R-7B TII UAE	16	7B	256k	-	-	Not available	-	View
Ling-flash-2.0 InclusionAI	16	103B (6.1B active at inference time)	128k	$0.2	58	🤗		View
Qwen3 Omni 30B A3B (Reasoning) Alibaba	16	35.3B (3B active at inference time)	65.5k	$0.4	91	🤗		View
Qwen2.5 Instruct 72B Alibaba	16	72B	131k	-	26	🤗		View
Step3 VL 10B StepFun	15	10.2B	65.5k	-	-	🤗	-	View
Qwen3 30B A3B (Reasoning) Alibaba	15	30.5B (3.3B active at inference time)	32.8k	$0.8	59	🤗	+2 more	View
Devstral Small (Jul '25) Mistral	15	24B	256k	$0.1	204	🤗		View
QwQ 32B-Preview Alibaba	15	32.8B	32.8k	$0.1	61	🤗		View
Mistral Large 2 (Nov '24) Mistral	15	123B	128k	$3.0	41	🤗		View
GLM-4.5V (Reasoning) Z AI	15	108B (12B active at inference time)	64.0k	$0.9	50	🤗		View
Mistral Small 3.2 Mistral	15	24B	128k	$0.1	164	🤗		View
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) NVIDIA	15	253B	128k	$0.9	42	🤗		View
Qwen3 30B A3B 2507 Instruct Alibaba	15	30.5B (3.3B active at inference time)	262k	$0.3	71	🤗	+1 more	View
ERNIE 4.5 300B A47B Baidu	15	300B (47B active at inference time)	131k	$0.5	35	🤗		View
NVIDIA Nemotron Nano 12B v2 VL (Reasoning) NVIDIA	15	13.2B	128k	$0.3	130	🤗		View
Ministral 3 8B Mistral	15	8B	256k	$0.1	181	🤗		View
NVIDIA Nemotron Nano 9B V2 (Reasoning) NVIDIA	15	9B	131k	$0.1	120	🤗		View
Qwen3.5 2B (Non-reasoning) Alibaba	15	2.27B	262k	-	-	🤗	-	View
Llama Nemotron Super 49B v1.5 (Non-reasoning) NVIDIA	15	49B	128k	$0.2	81	🤗		View
Qwen3 32B (Non-reasoning) Alibaba	15	32.8B	32.8k	$1.2	93	🤗	+5 more	View
Llama 3.3 Instruct 70B Meta	14	70B	128k	$0.7	81	🤗	+19 more	View
Mistral Small 3.1 Mistral	14	24B	128k	$0.1	134	🤗	+1 more	View
K2-V2 (low) MBZUAI Institute of Foundation Models	14	70B	512k	-	-	🤗	-	View
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) NVIDIA	14	4.51B	128k	-	-	🤗	-	View
Kimi Linear 48B A3B Instruct Kimi	14	49.1B (3B active at inference time)	1.00M	-	-	🤗	-	View
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) NVIDIA	14	49B	128k	-	-	🤗	-	View
Qwen3 VL 8B Instruct Alibaba	14	8.77B	256k	$0.3	116	🤗		View
Qwen3 4B (Reasoning) Alibaba	14	4.02B	32.0k	$0.4	93	🤗		View
Llama 3.1 Tulu3 405B Allen Institute for AI	14	405B	128k	-	-	🤗	-	View
Ring-flash-2.0 InclusionAI	14	103B (6.1B active at inference time)	128k	$0.2	78	🤗		View
Pixtral Large Mistral	14	124B	128k	$3.0	53	🤗		View
Olmo 3.1 32B Think Allen Institute for AI	14	32.2B	65.5k	-	95	🤗		View
Grok 2 (Dec '24) xAI	14	270B	131k	-	-	🤗	-	View
Qwen3 VL 4B (Reasoning) Alibaba	14	4.44B	256k	-	-	🤗	-	View
Llama 4 Scout Meta	14	109B (17B active at inference time)	10.0M	$0.3	127	🤗	+7 more	View
Command A Cohere	13	111B	256k	$4.4	46	🤗		View
Llama 3.1 Nemotron Instruct 70B NVIDIA	13	70B	128k	$1.2	36	🤗		View
Qwen2.5 Instruct 32B Alibaba	13	32B	128k	-	-	🤗	-	View
Qwen3 8B (Reasoning) Alibaba	13	8.19B	131k	$0.7	76	🤗		View
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) NVIDIA	13	31.6B (3.6B active at inference time)	1.00M	$0.1	77	🤗		View
NVIDIA Nemotron Nano 9B V2 (Non-reasoning) NVIDIA	13	9B	131k	$0.1	149	🤗		View
Mistral Large 2 (Jul '24) Mistral	13	123B	128k	$3.0	-	🤗		View
Qwen3 4B 2507 Instruct Alibaba	13	4.02B	262k	-	-	🤗	-	View
Qwen2.5 Coder Instruct 32B Alibaba	13	32B	131k	-	-	🤗	-	View
Qwen3 14B (Non-reasoning) Alibaba	13	14.8B	32.8k	$0.6	62	🤗		View
GLM-4.5V (Non-reasoning) Z AI	13	108B (12B active at inference time)	64.0k	$0.9	50	🤗		View
Mistral Small 3 Mistral	13	24B	32.0k	$0.1	128	🤗		View
Hermes 4 - Llama-3.1 70B (Non-reasoning) Nous Research	13	70.6B	128k	$0.2	79	🤗		View
Qwen3 30B A3B (Non-reasoning) Alibaba	13	30.5B (3.3B active at inference time)	32.8k	$0.3	57	🤗		View
DeepSeek-V2.5 (Dec '24) DeepSeek	13	236B (21B active at inference time)	128k	-	-	🤗	-	View
Qwen3 4B (Non-reasoning) Alibaba	12	4.02B	32.0k	$0.2	94	🤗		View
Llama 3.1 Instruct 70B Meta	12	70B	128k	$0.6	34	🤗	+2 more	View
Sarvam 30B (Reasoning) Sarvam	12	32.2B	65.5k	-	193	🤗		View
DeepSeek-V2.5 DeepSeek	12	236B (21B active at inference time)	128k	-	-	🤗	-	View
Olmo 3.1 32B Instruct Allen Institute for AI	12	32.2B	65.5k	$0.3	53	🤗		View
DeepSeek R1 Distill Llama 8B DeepSeek	12	8B	128k	-	-	🤗	-	View
Olmo 3 32B Think Allen Institute for AI	12	32.2B	65.5k	-	-	🤗	-	View
R1 1776 Perplexity	12	671B (37B active at inference time)	128k	-	-	🤗	-	View
Llama 3.2 Instruct 90B (Vision) Meta	12	90B	128k	$0.7	56	🤗	+1 more	View
Llama 3.1 Instruct 8B Meta	12	8B	128k	$0.1	155	🤗	+15 more	View
Qwen2 Instruct 72B Alibaba	12	72B	131k	-	-	🤗	-	View
EXAONE 4.0 32B (Non-reasoning) LG AI Research	12	32B	131k	-	-	🤗	-	View
Ministral 3 3B Mistral	11	3B	256k	$0.1	253	🤗		View
DeepHermes 3 - Mistral 24B Preview (Non-reasoning) Nous Research	11	24B	32.0k	-	-	🤗	-	View
Jamba 1.7 Large AI21 Labs	11	398B (94B active at inference time)	256k	$3.5	59	🤗		View
Granite 4.0 H Small IBM	11	32B (9B active at inference time)	128k	$0.1	388	🤗		View
Jamba 1.5 Large AI21 Labs	11	398B (94B active at inference time)	256k	$3.5	-	🤗		View
Qwen3 Omni 30B A3B Instruct Alibaba	11	35.3B (3B active at inference time)	65.5k	$0.4	96	🤗		View
Hermes 3 - Llama-3.1 70B Nous Research	11	70.6B	128k	$0.3	41	🤗		View
Qwen3 8B (Non-reasoning) Alibaba	11	8.19B	32.8k	$0.3	78	🤗		View
DeepSeek-Coder-V2 DeepSeek	11	236B (21B active at inference time)	128k	-	-	🤗	-	View
Jamba 1.6 Large AI21 Labs	11	398B (94B active at inference time)	256k	$3.5	59	🤗		View
Qwen3.5 0.8B (Reasoning) Alibaba	11	0.873B	262k	-	-	🤗	-	View
LFM2 24B A2B Liquid AI	10	23.8B (2.3B active at inference time)	32.8k	$0.1	209	🤗		View
Phi-4 Microsoft Azure	10	14B	16.0k	$0.2	34	🤗		View
Gemma 3 27B Instruct Google	10	27.4B	128k	-	27	🤗	+3 more	View
Mistral Small (Sep '24) Mistral	10	22B	32.8k	$0.3	126	🤗		View
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning) NVIDIA	10	13.2B	128k	$0.3	135	🤗		View
Gemma 3n E4B Instruct Preview (May '25) Google	10	8.39B (4B active at inference time)	32.0k	-	-	🤗	-	View
Phi-4 Multimodal Instruct Microsoft Azure	10	5.6B	128k	-	17	🤗		View
Qwen2.5 Coder Instruct 7B Alibaba	10	7.62B	131k	-	-	🤗	-	View
Qwen3.5 0.8B (Non-reasoning) Alibaba	10	0.873B	262k	-	-	🤗	-	View
Mixtral 8x22B Instruct Mistral	10	141B (39B active at inference time)	65.4k	-	-	🤗	-	View
Llama 3.2 Instruct 3B Meta	10	3B	128k	$0.1	51	🤗		View
Jamba Reasoning 3B AI21 Labs	10	3B	262k	-	-	🤗	-	View
Qwen3 VL 4B Instruct Alibaba	10	4.44B	256k	-	-	🤗	-	View
Qwen1.5 Chat 110B Alibaba	10	110B	32.0k	-	-	🤗	-	View
Reka Flash 3 Reka AI	10	21B	128k	$0.3	43	🤗		View
Olmo 3 7B Think Allen Institute for AI	9	7B	65.5k	-	-	🤗	-	View
Ling-mini-2.0 InclusionAI	9	16.3B (1.4B active at inference time)	131k	-	-	🤗	-	View
DeepSeek R1 Distill Qwen 1.5B DeepSeek	9	1.5B	128k	-	-	🤗	-	View
DeepSeek-V2-Chat DeepSeek	9	236B (21B active at inference time)	128k	-	-	🤗	-	View
Qwen Chat 72B Alibaba	9	72B	33.8k	-	-	🤗	-	View
Gemma 3 12B Instruct Google	9	12.2B	128k	-	25	🤗	+2 more	View
Llama 3.2 Instruct 11B (Vision) Meta	9	11B	128k	$0.2	45	🤗		View
DeepSeek Coder V2 Lite Instruct DeepSeek	8	16B (2.4B active at inference time)	128k	-	-	🤗	-	View
Phi-4 Mini Instruct Microsoft Azure	8	3.84B	128k	-	43	🤗		View
Sarvam M (Reasoning) Sarvam	8	23.6B	32.8k	-	-	🤗	-	View
Command-R+ (Apr '24) Cohere	8	104B	128k	$6.0	-	🤗		View
DBRX Instruct Databricks	8	132B (36B active at inference time)	32.8k	-	-	🤗	-	View
Exaone 4.0 1.2B (Reasoning) LG AI Research	8	1.28B	64.0k	-	-	🤗	-	View
Olmo 3 7B Instruct Allen Institute for AI	8	7B	65.5k	$0.1	144	🤗		View
Exaone 4.0 1.2B (Non-reasoning) LG AI Research	8	1.28B	64.0k	-	-	🤗	-	View
LFM2.5-1.2B-Thinking Liquid AI	8	1.17B	32.0k	-	-	🤗	-	View
Jamba 1.7 Mini AI21 Labs	8	52B (12B active at inference time)	258k	-	-	🤗	-	View
LFM2 2.6B Liquid AI	8	2.57B	32.8k	-	-	🤗	-	View
LFM2.5-1.2B-Instruct Liquid AI	8	1.17B	32.0k	-	-	🤗	-	View
Jamba 1.5 Mini AI21 Labs	8	52B (12B active at inference time)	256k	$0.3	-	🤗		View
Granite 4.0 H 1B IBM	8	1.5B	128k	-	-	🤗	-	View
Qwen3 1.7B (Reasoning) Alibaba	8	2.03B	32.0k	$0.4	126	🤗		View
Jamba 1.6 Mini AI21 Labs	8	52B (12B active at inference time)	256k	$0.3	174	🤗		View
Mixtral 8x7B Instruct Mistral	8	46.7B (12.9B active at inference time)	32.8k	$0.5	-	🤗		View
Gemma 3 270M Google	8	0.268B	32.0k	-	-	🤗	-	View
Granite 4.0 Micro IBM	8	3B	128k	-	-	🤗	-	View
DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning) Nous Research	8	8B	128k	-	-	🤗	-	View
Command-R (Mar '24) Cohere	7	35B	128k	$0.8	-	🤗		View
Granite 4.0 1B IBM	7	1.6B	128k	-	-	🤗	-	View
Molmo2-8B Allen Institute for AI	7	8.66B	36.9k	-	105	🤗		View
LFM2 8B A1B Liquid AI	7	8.34B (1.5B active at inference time)	32.8k	-	-	🤗	-	View
Granite 3.3 8B (Non-reasoning) IBM	7	8.17B	128k	$0.1	163	🤗		View
Qwen3 1.7B (Non-reasoning) Alibaba	7	2.03B	32.0k	$0.2	128	🤗		View
Qwen3 0.6B (Reasoning) Alibaba	6	0.752B	32.0k	$0.4	194	🤗		View
Gemma 3n E4B Instruct Google	6	8.39B (4B active at inference time)	32.0k	$0.0	42	🤗		View
LFM2 1.2B Liquid AI	6	1.17B	32.8k	-	-	🤗	-	View
Gemma 3 4B Instruct Google	6	4.3B	128k	-	28	🤗		View
Llama 3.2 Instruct 1B Meta	6	1B	128k	$0.1	95	🤗		View
LFM2.5-VL-1.6B Liquid AI	6	1.6B	32.0k	-	-	🤗	-	View
Granite 4.0 350M IBM	6	0.35B	32.8k	-	-	🤗	-	View
Qwen3 0.6B (Non-reasoning) Alibaba	6	0.752B	32.0k	$0.2	192	🤗		View
Gemma 3 1B Instruct Google	6	1B	32.0k	-	42	🤗		View
Granite 4.0 H 350M IBM	5	0.34B	32.8k	-	-	🤗	-	View
Gemma 3n E2B Instruct Google	5	5.98B (2B active at inference time)	32.0k	-	-	🤗		View
Cogito v2.1 (Reasoning) Deep Cogito	-	671B (37B active at inference time)	128k	$1.3	87	🤗		View
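
For readers who want to work with the rows above programmatically, the parameter and context window cells follow a regular pattern. The following is a rough sketch rather than an official parser: `parse_params` and `parse_context` are hypothetical helper names, and reading "k" and "M" as decimal multipliers (1,000 and 1,000,000) is an assumption, since the displayed values appear to be rounded.

```python
import re
from typing import NamedTuple


class Params(NamedTuple):
    total_b: float   # total parameters, in billions
    active_b: float  # active parameters; equals total_b for dense models


# Matches "744B (40B active at inference time)" as well as plain "27.8B".
_PARAMS_RE = re.compile(
    r"^([\d.,]+)B(?: \(([\d.,]+)B active at inference time\))?$"
)

# Assumption: suffixes are decimal multipliers.
_SUFFIX = {"k": 1_000, "M": 1_000_000}


def _to_float(text: str) -> float:
    return float(text.replace(",", ""))


def parse_params(cell: str) -> Params:
    """Parse a parameters cell into total and active counts (billions)."""
    match = _PARAMS_RE.match(cell.strip())
    if match is None:
        raise ValueError(f"unrecognised parameters cell: {cell!r}")
    total = _to_float(match.group(1))
    active = _to_float(match.group(2)) if match.group(2) else total
    return Params(total, active)


def parse_context(cell: str) -> int:
    """Parse a context window cell such as '200k' or '1.00M' into tokens."""
    cell = cell.strip()
    return round(float(cell[:-1]) * _SUFFIX[cell[-1]])


print(parse_params("744B (40B active at inference time)"))
print(parse_params("27.8B"))
print(parse_context("200k"), parse_context("1.00M"))
```

Treating a missing "active" figure as a dense model follows the definition of active parameters given earlier on this page.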
