Comparisons of Tiny Open Source AI Models (≤4B)
This page compares open source AI models with 4B parameters or fewer, which typically have the lowest resource demands. Models are considered open source (also commonly referred to as open weights) when their weights are available to download. This allows self-hosting on your own infrastructure and customization of the model, for example through fine-tuning. For more details, including our methodology, see our FAQs.
Highlights
Openness
Artificial Analysis Openness Index: Score
Openness Index assesses model openness on a 0 to 100 normalized scale (higher is more open)
Reasoning models are indicated by a lightbulb icon
Intelligence
Artificial Analysis Intelligence Index
Artificial Analysis Intelligence Index v4.0 incorporates 10 evaluations: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt
Intelligence Evaluations
Intelligence evaluations measured independently by Artificial Analysis · Higher is better
GDPval-AA
Agentic real-world work tasks, scored as (Elo - 500) / 2000
Terminal-Bench Hard
Agentic coding & terminal use
𝜏²-Bench Telecom
Agentic tool use
AA-LCR
Long context reasoning
AA-Omniscience Accuracy
Knowledge
AA-Omniscience Non-Hallucination Rate
1 - hallucination rate
Humanity's Last Exam
Reasoning & knowledge
GPQA Diamond
Scientific reasoning
SciCode
Coding
IFBench
Instruction following
CritPt
Physics reasoning
APEX-Agents-AA
Long-horizon agentic tasks
No data available
ITBench-AA
Kubernetes incident root-cause analysis
No data available
MMMU-Pro
Visual reasoning
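Two of the evaluations above are reported as simple transformations: GDPval-AA as (Elo - 500) / 2000 and AA-Omniscience Non-Hallucination Rate as 1 - hallucination rate. The following is a minimal sketch of that arithmetic. The function names are hypothetical, the hallucination rate is assumed to be a fraction between 0 and 1, and the page does not say whether normalized scores are clipped, so no clipping is applied here.

```python
def normalize_gdpval_elo(elo: float) -> float:
    """Map a GDPval-AA Elo rating to the (Elo - 500) / 2000 scale."""
    return (elo - 500.0) / 2000.0


def non_hallucination_rate(hallucination_rate: float) -> float:
    """Return 1 - hallucination rate for a rate given as a fraction in [0, 1]."""
    if not 0.0 <= hallucination_rate <= 1.0:
        raise ValueError("hallucination_rate must be between 0 and 1")
    return 1.0 - hallucination_rate


if __name__ == "__main__":
    # Illustrative inputs only, not measured results.
    print(normalize_gdpval_elo(1500.0))   # 0.5
    print(non_hallucination_rate(0.25))   # 0.75
```

Under this scaling an Elo of 500 maps to 0 and an Elo of 2500 maps to 1, which puts the metric on a range comparable to the other evaluations.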
Size
Model Size: Total and Active Parameters
Comparison between total model parameters and parameters active during inference
Intelligence vs. Active Parameters
Active parameters at inference time · Artificial Analysis Intelligence Index
Most attractive quadrant
Model creators shown: AI21 Labs, Alibaba, IBM, LG AI Research, Liquid AI, Microsoft, Mistral, Nanbeige, NVIDIA, OpenBMB
Intelligence vs. Total Parameters
Artificial Analysis Intelligence Index · Size in parameters (billions)
Context Window
Context window: token limit · Higher is better
Further details
| Model | Intelligence Index | Parameters | Context Window | Provider Benchmarks | | |
|---|---|---|---|---|---|---|
| MiniCPM5-1B (Reasoning) | 18 | 1B | 128k | - | - | - |
| MiniCPM5-1B (Non-reasoning) | 18 | 1B | 128k | - | - | - |
| Qwen3.5 2B (Reasoning) | 16 | 2.27B | 262k | $0.0 | - | |
| Nanbeige4.1-3B | 16 | 3.93B | 256k | - | - | - |
| NVIDIA Nemotron 3 Nano 4B | 15 | 3.97B | 262k | - | - | - |
| Qwen3.5 2B (Non-reasoning) | 15 | 2.27B | 262k | $0.0 | 340 | |
| MiniCPM-V 4.6 1.3B | 13 | 1.3B | 262k | - | - | - |
| Ministral 3 3B | 11 | 3B | 256k | $0.1 | 149 | |
| Qwen3.5 0.8B (Reasoning) | 11 | 0.873B | 262k | $0.0 | - | |
| Qwen3.5 0.8B (Non-reasoning) | 10 | 0.873B | 262k | $0.0 | 89 | |
| Jamba Reasoning 3B | 10 | 3B | 262k | - | - | - |
| Granite 4.1 3B | 9 | 3B | 131k | - | - | - |
| Phi-4 Mini Instruct | 8 | 3.84B | 128k | - | 24 | |
| Exaone 4.0 1.2B (Reasoning) | 8 | 1.28B | 64.0k | - | - | - |
| Exaone 4.0 1.2B (Non-reasoning) | 8 | 1.28B | 64.0k | - | - | - |
| LFM2.5-1.2B-Thinking | 8 | 1.17B | 32.0k | - | - | - |
| LFM2 2.6B | 8 | 2.57B | 32.8k | - | - | ? |
| LFM2.5-1.2B-Instruct | 8 | 1.17B | 32.0k | - | - | ? |
| Granite 4.0 H 1B | 8 | 1.5B | 128k | - | - | - |
| Gemma 3 270M | 8 | 0.268B | 32.0k | - | - | - |
| Granite 4.0 Micro | 8 | 3B | 128k | - | - | - |
| Granite 4.0 1B | 7 | 1.6B | 128k | - | - | - |
| LFM2.5-VL-1.6B | 6 | 1.6B | 32.0k | - | - | ? |
| Granite 4.0 350M | 6 | 0.35B | 32.8k | - | - | - |
| Granite 4.0 H 350M | 5 | 0.34B | 32.8k | - | - | - |
| Tiny Aya Global | 5 | 3.35B | 8.19k | - | - | |
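One way to read the Intelligence vs. Total Parameters comparison is as a ratio of score to model size. The sketch below is illustrative only: it copies a handful of rows from the table above, assumes the first numeric column is the Artificial Analysis Intelligence Index, and uses a hypothetical helper, `score_per_billion`, that is not a metric published on this page.

```python
# A few rows copied from the table: (model, score, parameters in billions).
# Assumption: the score column is the Artificial Analysis Intelligence Index.
rows = [
    ("MiniCPM5-1B (Reasoning)", 18, 1.0),
    ("Qwen3.5 2B (Reasoning)", 16, 2.27),
    ("Nanbeige4.1-3B", 16, 3.93),
    ("NVIDIA Nemotron 3 Nano 4B", 15, 3.97),
    ("Gemma 3 270M", 8, 0.268),
]


def score_per_billion(score: float, params_b: float) -> float:
    """Hypothetical efficiency ratio: index points per billion parameters."""
    return score / params_b


# Sort from most to least score per billion parameters.
ranked = sorted(rows, key=lambda r: score_per_billion(r[1], r[2]), reverse=True)

for model, score, params_b in ranked:
    print(f"{model}: {score_per_billion(score, params_b):.1f}")
```

A ratio like this rewards very small models heavily, so it complements the absolute score ranking in the table instead of replacing it.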