Comparison of Open Source Models
Comparison and analysis of open source AI models across key metrics, including quality, inference speed, context window, parameter count, and licensing details. Models are considered open source (also commonly referred to as open weights) when their weights are available to download. This allows self-hosting on your own infrastructure and customizing the model, for example through fine-tuning. For more details on our methodology, see our FAQs.
GLM-5 and Kimi K2.5 are the highest-intelligence open source models, followed by Qwen3.5 397B A17B and GLM-4.7.
Intelligence
Artificial Analysis Intelligence Index; Higher is better
Estimate (independent evaluation forthcoming)
Total Parameters
Trainable parameters in billions
Openness
Artificial Analysis Openness Index: Results
Openness Index assesses model openness on a 0 to 100 normalized scale (higher is more open)
Open Source Progress
Progress in Open Weights vs. Proprietary Intelligence
Artificial Analysis Intelligence Index v4.0 incorporates 10 evaluations: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt
Open Weights
Proprietary
Reasoning models are indicated by a lightbulb icon.
Artificial Analysis Intelligence Index
Open Source Language Models Intelligence By Lab Over Time
Labs shown: Alibaba, DeepSeek, Google, Meta, Microsoft Azure, Mistral, NVIDIA
Open Source Models Intelligence By Size Over Time
Large Models (>150B)
Medium Models (40B-150B)
Small Models (4B-40B)
Tiny Models (≤4B)
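The size classes above can be expressed as a small lookup. This is a minimal sketch, not the site's own implementation: the function name `size_class` is hypothetical, and because the listed ranges share their endpoints (4B and 40B appear in two classes each), the handling of exactly 40B is an assumption.

```python
def size_class(total_params_b: float) -> str:
    """Map a total parameter count (in billions) to a size class label."""
    if total_params_b > 150:
        return "Large"   # Large Models (>150B)
    if total_params_b > 40:
        return "Medium"  # Medium Models (40B-150B); assumption: exactly 40B counts as Small
    if total_params_b > 4:
        return "Small"   # Small Models (4B-40B)
    return "Tiny"        # Tiny Models (<=4B), so exactly 4B is Tiny


# Total parameter counts taken from the table in this document.
examples = {
    "GLM-5": 744,
    "Qwen3.5 122B A10B": 125,
    "Qwen3.5 27B": 27.8,
    "Gemma 3 1B Instruct": 1,
}

for name, params_b in examples.items():
    print(f"{name}: {size_class(params_b)}")
```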
Intelligence Evaluations
Intelligence evaluations measured independently by Artificial Analysis; Higher is better
Results claimed by AI Lab (not yet independently verified)
GDPval-AA (Agentic Real-World Work Tasks, (ELO-500)/2000)
Terminal-Bench Hard (Agentic Coding & Terminal Use)
𝜏²-Bench Telecom (Agentic Tool Use)
AA-LCR (Long Context Reasoning)
AA-Omniscience Accuracy (Knowledge)
AA-Omniscience Non-Hallucination Rate (1 - Hallucination Rate)
Humanity's Last Exam (Reasoning & Knowledge)
GPQA Diamond (Scientific Reasoning)
SciCode (Coding)
IFBench (Instruction Following)
CritPt (Physics Reasoning)
MMMU Pro (Visual Reasoning)
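The GDPval-AA entry in the list above is labeled with the rescaling (ELO-500)/2000. A minimal sketch of that arithmetic follows; the function name and the sample Elo ratings are hypothetical, and whether the result is clamped to a fixed range is not stated in the source, so no clamping is applied here.

```python
def normalize_gdpval_elo(elo: float) -> float:
    """Rescale an Elo rating with the (ELO - 500) / 2000 formula from the chart label."""
    return (elo - 500) / 2000


# Hypothetical Elo ratings, chosen only to illustrate the arithmetic.
for elo in (500, 1500, 2500):
    print(elo, normalize_gdpval_elo(elo))
```

Under this formula an Elo of 500 maps to 0 and an Elo of 2500 maps to 1, which puts the score on a scale comparable to the accuracy-style evaluations.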
Size
Intelligence Index By Model Size
Size classes shown: Large Models (>150B), Medium Models (40B-150B), Small Models (4B-40B)
Model Size: Total and Active Parameters
Comparison between total model parameters and parameters active during inference
Active Parameters
Passive Parameters
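To make the active versus passive split concrete, the sketch below derives both quantities from the figures listed in the table. It assumes that passive parameters are simply total minus active, and that a model listed without an active figure uses all of its parameters at inference; the `ModelSize` class is a hypothetical helper, not part of any published tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSize:
    name: str
    total_b: float   # total parameters, in billions
    active_b: float  # parameters active at inference time, in billions

    @property
    def passive_b(self) -> float:
        # Assumption: passive = total - active.
        return self.total_b - self.active_b

    @property
    def active_share(self) -> float:
        return self.active_b / self.total_b


models = [
    ModelSize("GLM-5", 744, 40),
    ModelSize("gpt-oss-120B", 117, 5.1),
    ModelSize("Qwen3.5 27B", 27.8, 27.8),  # no active figure listed; assumed fully active
]

for m in models:
    print(f"{m.name}: {m.passive_b:.1f}B passive, {m.active_share:.1%} active")
```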
Intelligence vs. Active Parameters
Active Parameters at Inference Time; Artificial Analysis Intelligence Index
Most attractive quadrant
Labs shown: Alibaba, DeepSeek, Kimi, Korea Telecom, LG AI Research, MBZUAI Institute of Foundation Models, Meta, Mistral, NVIDIA, OpenAI, Xiaomi, Z AI
Intelligence vs. Total Parameters
Artificial Analysis Intelligence Index; Size in Parameters (Billions)
Most attractive quadrant
Labs shown: Alibaba, DeepSeek, Kimi, Korea Telecom, LG AI Research, MBZUAI Institute of Foundation Models, Meta, Mistral, NVIDIA, OpenAI, Xiaomi, Z AI
Context Window
Context Window: Tokens Limit; Higher is better
Further details
| Model (Creator) | Intelligence Index | Parameters | Context Window | Price ($) | Speed | Weights | Provider Benchmarks | Details |
|---|---|---|---|---|---|---|---|---|
GLM-5 (Reasoning) Z AI | 50 | 744B (40B active at inference time) | 200k | $1.6 | 68 | 🤗 | +10 more | View |
Kimi K2.5 (Reasoning) Kimi | 47 | 1,000B (32B active at inference time) | 256k | $1.2 | 48 | 🤗 | +14 more | View |
Qwen3.5 397B A17B (Reasoning) Alibaba | 45 | 397B (17B active at inference time) | 262k | $1.4 | 84 | 🤗 | +5 more | View |
GLM-4.7 (Reasoning) Z AI | 42 | 357B (32B active at inference time) | 200k | $1.0 | 84 | 🤗 | +8 more | View |
Qwen3.5 27B (Reasoning) Alibaba | 42 | 27.8B | 262k | $0.8 | 89 | 🤗 | View | |
MiniMax-M2.5 MiniMax | 42 | 230B (10B active at inference time) | 205k | $0.5 | 49 | 🤗 | +10 more | View |
DeepSeek V3.2 (Reasoning) DeepSeek | 42 | 685B (37B active at inference time) | 128k | $0.3 | 34 | 🤗 | +8 more | View |
Qwen3.5 122B A10B (Reasoning) Alibaba | 42 | 125B (10B active at inference time) | 262k | $1.1 | 132 | 🤗 | View | |
MiMo-V2-Flash (Feb 2026) Xiaomi | 41 | 309B (15B active at inference time) | 256k | $0.1 | 128 | 🤗 | View | |
Kimi K2 Thinking Kimi | 41 | 1,000B (32B active at inference time) | 256k | $1.1 | 88 | 🤗 | +5 more | View |
GLM-5 (Non-reasoning) Z AI | 41 | 744B (40B active at inference time) | 200k | $1.6 | 67 | 🤗 | +4 more | View |
Qwen3.5 397B A17B (Non-reasoning) Alibaba | 40 | 397B (17B active at inference time) | 262k | $1.4 | 85 | 🤗 | +2 more | View |
MiniMax-M2.1 MiniMax | 39 | 230B (10B active at inference time) | 205k | $0.5 | 43 | 🤗 | +4 more | View |
MiMo-V2-Flash (Reasoning) Xiaomi | 39 | 309B (15B active at inference time) | 256k | $0.1 | 126 | 🤗 | View | |
Step 3.5 Flash StepFun | 38 | 196B (11B active at inference time) | 256k | $0.1 | 114 | 🤗 | View | |
Kimi K2.5 (Non-reasoning) Kimi | 37 | 1,000B (32B active at inference time) | 256k | $1.2 | 46 | 🤗 | +6 more | View |
Qwen3.5 27B (Non-reasoning) Alibaba | 37 | 27.8B | 262k | $0.8 | 93 | 🤗 | View | |
Qwen3.5 35B A3B (Reasoning) Alibaba | 37 | 36B (3B active at inference time) | 262k | $0.7 | 175 | 🤗 | View | |
MiniMax-M2 MiniMax | 36 | 230B (10B active at inference time) | 205k | $0.5 | 47 | 🤗 | +1 more | View |
NVIDIA Nemotron 3 Super 120B A12B (Reasoning) NVIDIA | 36 | 120.6B (12.7B active at inference time) | 1.00M | $0.4 | 458 | Not available | +2 more | View |
Qwen3.5 122B A10B (Non-reasoning) Alibaba | 36 | 125B (10B active at inference time) | 262k | $1.1 | 126 | 🤗 | View | |
GLM-4.7 (Non-reasoning) Z AI | 34 | 357B (32B active at inference time) | 200k | $0.9 | 78 | 🤗 | +7 more | View |
DeepSeek V3.1 Terminus (Reasoning) DeepSeek | 34 | 685B (37B active at inference time) | 128k | $0.8 | - | 🤗 | View | |
gpt-oss-120B (high) OpenAI | 33 | 117B (5.1B active at inference time) | 131k | $0.3 | 280 | 🤗 | +22 more | View |
DeepSeek V3.2 Exp (Reasoning) DeepSeek | 33 | 685B (37B active at inference time) | 128k | $0.3 | 36 | 🤗 | View | |
GLM-4.6 (Reasoning) Z AI | 33 | 357B (32B active at inference time) | 200k | $1.0 | 99 | 🤗 | +1 more | View |
Qwen3.5 9B (Reasoning) Alibaba | 32 | 9.65B | 262k | $0.1 | 59 | 🤗 | View | |
K-EXAONE (Reasoning) LG AI Research | 32 | 236B (23B active at inference time) | 256k | - | - | 🤗 | - | View |
DeepSeek V3.2 (Non-reasoning) DeepSeek | 32 | 685B (37B active at inference time) | 128k | $0.3 | 34 | 🤗 | +11 more | View |
Kimi K2 0905 Kimi | 31 | 1,000B (32B active at inference time) | 256k | $1.1 | 37 | 🤗 | +1 more | View |
Qwen3.5 35B A3B (Non-reasoning) Alibaba | 31 | 36B (3B active at inference time) | 262k | $0.7 | 157 | 🤗 | View | |
MiMo-V2-Flash (Non-reasoning) Xiaomi | 30 | 309B (15B active at inference time) | 256k | $0.1 | 131 | 🤗 | View | |
GLM-4.6 (Non-reasoning) Z AI | 30 | 357B (32B active at inference time) | 200k | $1.0 | 87 | 🤗 | View | |
GLM-4.7-Flash (Reasoning) Z AI | 30 | 31.2B (3B active at inference time) | 200k | $0.2 | 58 | 🤗 | View | |
Qwen3 235B A22B 2507 (Reasoning) Alibaba | 30 | 235B (22B active at inference time) | 256k | $2.6 | 42 | 🤗 | +4 more | View |
DeepSeek V3.2 Speciale DeepSeek | 29 | 685B (37B active at inference time) | 128k | - | - | 🤗 | - | View |
DeepSeek V3.1 Terminus (Non-reasoning) DeepSeek | 29 | 685B (37B active at inference time) | 128k | $0.6 | - | 🤗 | +1 more | View |
DeepSeek V3.2 Exp (Non-reasoning) DeepSeek | 28 | 685B (37B active at inference time) | 128k | $0.3 | 34 | 🤗 | View | |
Apriel-v1.5-15B-Thinker ServiceNow | 28 | 15B | 128k | - | 141 | 🤗 | View | |
Qwen3 Coder Next Alibaba | 28 | 79.7B (3B active at inference time) | 256k | $0.6 | 137 | 🤗 | +1 more | View |
DeepSeek V3.1 (Non-reasoning) DeepSeek | 28 | 685B (37B active at inference time) | 128k | $0.8 | - | 🤗 | +8 more | View |
DeepSeek V3.1 (Reasoning) DeepSeek | 28 | 685B (37B active at inference time) | 128k | $0.9 | - | 🤗 | +2 more | View |
Qwen3 VL 235B A22B (Reasoning) Alibaba | 28 | 235B (22B active at inference time) | 262k | $2.6 | 51 | 🤗 | View | |
Apriel-v1.6-15B-Thinker ServiceNow | 28 | 15B | 128k | - | 96 | 🤗 | View | |
Qwen3.5 9B (Non-reasoning) Alibaba | 27 | 9.65B | 262k | - | - | 🤗 | - | View |
Qwen3.5 4B (Reasoning) Alibaba | 27 | 4.66B | 262k | - | - | 🤗 | - | View |
DeepSeek R1 0528 (May '25) DeepSeek | 27 | 685B (37B active at inference time) | 128k | $2.4 | - | 🤗 | +6 more | View |
Mistral Small 4 (Reasoning) Mistral | 27 | 119B (6.5B active at inference time) | 256k | $0.3 | 153 | 🤗 | View | |
Qwen3 Next 80B A3B (Reasoning) Alibaba | 27 | 80B (3B active at inference time) | 262k | $1.9 | 155 | 🤗 | +4 more | View |
GLM-4.5 (Reasoning) Z AI | 26 | 355B (32B active at inference time) | 128k | $0.8 | 40 | 🤗 | View | |
Kimi K2 Kimi | 26 | 1,000B (32B active at inference time) | 128k | $1.0 | 39 | 🤗 | +2 more | View |
Seed-OSS-36B-Instruct ByteDance Seed | 25 | 36.2B | 512k | $0.3 | 34 | 🤗 | View | |
Qwen3 235B A22B 2507 Instruct Alibaba | 25 | 235B (22B active at inference time) | 256k | $1.2 | 60 | 🤗 | +10 more | View |
Qwen3 Coder 480B A35B Instruct Alibaba | 25 | 480B (35B active at inference time) | 262k | $3.0 | 57 | 🤗 | +8 more | View |
Qwen3 VL 32B (Reasoning) Alibaba | 25 | 33.4B | 256k | $2.6 | 88 | 🤗 | View | |
gpt-oss-120B (low) OpenAI | 24 | 117B (5.1B active at inference time) | 131k | $0.3 | 288 | 🤗 | +18 more | View |
gpt-oss-20B (high) OpenAI | 24 | 21B (3.6B active at inference time) | 131k | $0.1 | 296 | 🤗 | +9 more | View |
MiniMax M1 80k MiniMax | 24 | 456B (45.9B active at inference time) | 1.00M | $1.0 | - | 🤗 | View | |
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) NVIDIA | 24 | 31.6B (3.6B active at inference time) | 1.00M | $0.1 | 124 | 🤗 | View | |
K2 Think V2 MBZUAI Institute of Foundation Models | 24 | 70B | 262k | - | - | Not available | - | View |
LongCat Flash Lite LongCat | 24 | 68.5B (3B active at inference time) | 256k | - | 101 | 🤗 | View | |
HyperCLOVA X SEED Think (32B) Naver | 24 | 32B | 128k | - | - | 🤗 | - | View |
GLM-4.6V (Reasoning) Z AI | 23 | 108B | 128k | $0.5 | 31 | 🤗 | View | |
K-EXAONE (Non-reasoning) LG AI Research | 23 | 236B (23B active at inference time) | 256k | - | - | 🤗 | - | View |
GLM-4.5-Air Z AI | 23 | 106B (12B active at inference time) | 128k | $0.4 | 112 | 🤗 | +1 more | View |
Mi:dm K 2.5 Pro Korea Telecom | 23 | 32B | 128k | - | - | Not available | - | View |
Mistral Large 3 Mistral | 23 | 675B (41B active at inference time) | 256k | $0.8 | 49 | 🤗 | View | |
Ring-1T InclusionAI | 23 | 1,000B (50B active at inference time) | 128k | - | - | 🤗 | - | View |
Qwen3.5 4B (Non-reasoning) Alibaba | 23 | 4.66B | 262k | - | - | 🤗 | - | View |
Qwen3 30B A3B 2507 (Reasoning) Alibaba | 22 | 30.5B (3.3B active at inference time) | 262k | $0.8 | 150 | 🤗 | View | |
DeepSeek V3 0324 DeepSeek | 22 | 671B (37B active at inference time) | 128k | $1.3 | - | 🤗 | +6 more | View |
INTELLECT-3 Prime Intellect | 22 | 107B | 131k | - | - | 🤗 | - | View |
GLM-4.7-Flash (Non-reasoning) Z AI | 22 | 31.2B (3B active at inference time) | 200k | $0.2 | 53 | 🤗 | View | |
Devstral 2 Mistral | 22 | 125B | 256k | - | 82 | 🤗 | View | |
MiniMax M1 40k MiniMax | 21 | 456B (45.9B active at inference time) | 1.00M | - | - | 🤗 | - | View |
gpt-oss-20B (low) OpenAI | 21 | 21B (3.6B active at inference time) | 131k | $0.1 | 299 | 🤗 | +9 more | View |
Qwen3 VL 235B A22B Instruct Alibaba | 21 | 235B (22B active at inference time) | 262k | $1.2 | 58 | 🤗 | +2 more | View |
K2-V2 (high) MBZUAI Institute of Foundation Models | 21 | 70B | 512k | - | - | 🤗 | - | View |
Qwen3 Next 80B A3B Instruct Alibaba | 20 | 80B (3B active at inference time) | 262k | $0.9 | 149 | 🤗 | +4 more | View |
Tri-21B-think Preview Trillion Labs | 20 | 21B | 32.0k | - | - | Not available | - | View |
Qwen3 Coder 30B A3B Instruct Alibaba | 20 | 30.5B (3.3B active at inference time) | 262k | $0.9 | 26 | 🤗 | +2 more | View |
Qwen3 235B A22B (Reasoning) Alibaba | 20 | 235B (22B active at inference time) | 32.8k | $2.6 | 51 | 🤗 | View | |
QwQ 32B Alibaba | 20 | 32.8B | 131k | $0.7 | - | 🤗 | View | |
Qwen3 VL 30B A3B (Reasoning) Alibaba | 20 | 30B (3B active at inference time) | 256k | $0.8 | 111 | 🤗 | +1 more | View |
Devstral Small 2 Mistral | 19 | 24B | 256k | - | 194 | 🤗 | View | |
Ling-1T InclusionAI | 19 | 1,000B (50B active at inference time) | 128k | - | - | 🤗 | - | View |
DeepSeek R1 (Jan '25) DeepSeek | 19 | 685B (37B active at inference time) | 128k | $2.4 | - | 🤗 | +6 more | View |
Llama Nemotron Super 49B v1.5 (Reasoning) NVIDIA | 19 | 49B | 128k | $0.2 | 81 | 🤗 | View | |
K2-V2 (medium) MBZUAI Institute of Foundation Models | 19 | 70B | 512k | - | - | 🤗 | - | View |
Mistral Small 4 (Non-reasoning) Mistral | 19 | 119B (6.5B active at inference time) | 256k | $0.3 | 130 | 🤗 | View | |
Tri-21B-Think Trillion Labs | 19 | 21B | 32.0k | - | - | Not available | - | View |
Hermes 4 - Llama-3.1 405B (Reasoning) Nous Research | 19 | 406B | 128k | $1.5 | 29 | 🤗 | View | |
Llama 3.3 Nemotron Super 49B v1 (Reasoning) NVIDIA | 18 | 49B | 128k | - | - | 🤗 | - | View |
Llama 4 Maverick Meta | 18 | 402B (17B active at inference time) | 1.00M | $0.5 | 125 | 🤗 | +10 more | View |
Qwen3 4B 2507 (Reasoning) Alibaba | 18 | 4.02B | 262k | - | - | 🤗 | - | View |
Magistral Small 1.2 Mistral | 18 | 24B | 128k | $0.8 | 96 | 🤗 | View | |
Sarvam 105B (Reasoning) Sarvam | 18 | 106B (10.3B active at inference time) | 65.5k | - | 78 | 🤗 | View | |
Devstral Small (May '25) Mistral | 18 | 23.6B | 256k | $0.1 | - | 🤗 | View | |
Hermes 4 - Llama-3.1 405B (Non-reasoning) Nous Research | 18 | 406B | 128k | $1.5 | 30 | 🤗 | View | |
Llama 3.1 Instruct 405B Meta | 17 | 405B | 128k | $4.4 | 33 | 🤗 | +2 more | View |
Qwen3 VL 32B Instruct Alibaba | 17 | 33.4B | 256k | $1.2 | 73 | 🤗 | View | |
DeepSeek R1 Distill Qwen 32B DeepSeek | 17 | 32B | 128k | $0.3 | 61 | 🤗 | View | |
GLM-4.6V (Non-reasoning) Z AI | 17 | 108B | 128k | $0.5 | 21 | 🤗 | View | |
Qwen3 235B A22B (Non-reasoning) Alibaba | 17 | 235B (22B active at inference time) | 32.8k | $1.2 | 46 | 🤗 | View | |
Magistral Small 1 Mistral | 17 | 23.6B | 40.0k | - | - | 🤗 | - | View |
EXAONE 4.0 32B (Reasoning) LG AI Research | 17 | 32B | 131k | - | - | 🤗 | - | View |
Qwen3 VL 8B (Reasoning) Alibaba | 17 | 8.77B | 256k | $0.7 | 114 | 🤗 | View | |
Qwen3 32B (Reasoning) Alibaba | 17 | 32.8B | 32.8k | $2.6 | 93 | 🤗 | +4 more | View |
DeepSeek V3 (Dec '24) DeepSeek | 16 | 671B (37B active at inference time) | 128k | $0.6 | - | 🤗 | +2 more | View |
DeepSeek R1 0528 Qwen3 8B DeepSeek | 16 | 8.19B | 32.8k | - | - | 🤗 | - | View |
Qwen3.5 2B (Reasoning) Alibaba | 16 | 2.27B | 262k | - | - | 🤗 | - | View |
Qwen3 14B (Reasoning) Alibaba | 16 | 14.8B | 32.8k | $1.3 | 62 | 🤗 | View | |
Qwen3 VL 30B A3B Instruct Alibaba | 16 | 30B (3B active at inference time) | 256k | $0.3 | 104 | 🤗 | +2 more | View |
Hermes 4 - Llama-3.1 70B (Reasoning) Nous Research | 16 | 70.6B | 128k | $0.2 | 77 | 🤗 | View | |
Ministral 3 14B Mistral | 16 | 14B | 256k | $0.2 | 116 | 🤗 | View | |
DeepSeek R1 Distill Llama 70B DeepSeek | 16 | 70B | 128k | $0.9 | 53 | 🤗 | View | |
DeepSeek R1 Distill Qwen 14B DeepSeek | 16 | 14B | 128k | - | - | 🤗 | - | View |
Falcon-H1R-7B TII UAE | 16 | 7B | 256k | - | - | Not available | - | View |
Ling-flash-2.0 InclusionAI | 16 | 103B (6.1B active at inference time) | 128k | $0.2 | 58 | 🤗 | View | |
Qwen3 Omni 30B A3B (Reasoning) Alibaba | 16 | 35.3B (3B active at inference time) | 65.5k | $0.4 | 91 | 🤗 | View | |
Qwen2.5 Instruct 72B Alibaba | 16 | 72B | 131k | - | 26 | 🤗 | View | |
Step3 VL 10B StepFun | 15 | 10.2B | 65.5k | - | - | 🤗 | - | View |
Qwen3 30B A3B (Reasoning) Alibaba | 15 | 30.5B (3.3B active at inference time) | 32.8k | $0.8 | 59 | 🤗 | +2 more | View |
Devstral Small (Jul '25) Mistral | 15 | 24B | 256k | $0.1 | 204 | 🤗 | View | |
QwQ 32B-Preview Alibaba | 15 | 32.8B | 32.8k | $0.1 | 61 | 🤗 | View | |
Mistral Large 2 (Nov '24) Mistral | 15 | 123B | 128k | $3.0 | 41 | 🤗 | View | |
GLM-4.5V (Reasoning) Z AI | 15 | 108B (12B active at inference time) | 64.0k | $0.9 | 50 | 🤗 | View | |
Mistral Small 3.2 Mistral | 15 | 24B | 128k | $0.1 | 164 | 🤗 | View | |
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) NVIDIA | 15 | 253B | 128k | $0.9 | 42 | 🤗 | View | |
Qwen3 30B A3B 2507 Instruct Alibaba | 15 | 30.5B (3.3B active at inference time) | 262k | $0.3 | 71 | 🤗 | +1 more | View |
ERNIE 4.5 300B A47B Baidu | 15 | 300B (47B active at inference time) | 131k | $0.5 | 35 | 🤗 | View | |
NVIDIA Nemotron Nano 12B v2 VL (Reasoning) NVIDIA | 15 | 13.2B | 128k | $0.3 | 130 | 🤗 | View | |
Ministral 3 8B Mistral | 15 | 8B | 256k | $0.1 | 181 | 🤗 | View | |
NVIDIA Nemotron Nano 9B V2 (Reasoning) NVIDIA | 15 | 9B | 131k | $0.1 | 120 | 🤗 | View | |
Qwen3.5 2B (Non-reasoning) Alibaba | 15 | 2.27B | 262k | - | - | 🤗 | - | View |
Llama Nemotron Super 49B v1.5 (Non-reasoning) NVIDIA | 15 | 49B | 128k | $0.2 | 81 | 🤗 | View | |
Qwen3 32B (Non-reasoning) Alibaba | 15 | 32.8B | 32.8k | $1.2 | 93 | 🤗 | +5 more | View |
Llama 3.3 Instruct 70B Meta | 14 | 70B | 128k | $0.7 | 81 | 🤗 | +19 more | View |
Mistral Small 3.1 Mistral | 14 | 24B | 128k | $0.1 | 134 | 🤗 | +1 more | View |
K2-V2 (low) MBZUAI Institute of Foundation Models | 14 | 70B | 512k | - | - | 🤗 | - | View |
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) NVIDIA | 14 | 4.51B | 128k | - | - | 🤗 | - | View |
Kimi Linear 48B A3B Instruct Kimi | 14 | 49.1B (3B active at inference time) | 1.00M | - | - | 🤗 | - | View |
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) NVIDIA | 14 | 49B | 128k | - | - | 🤗 | - | View |
Qwen3 VL 8B Instruct Alibaba | 14 | 8.77B | 256k | $0.3 | 116 | 🤗 | View | |
Qwen3 4B (Reasoning) Alibaba | 14 | 4.02B | 32.0k | $0.4 | 93 | 🤗 | View | |
Llama 3.1 Tulu3 405B Allen Institute for AI | 14 | 405B | 128k | - | - | 🤗 | - | View |
Ring-flash-2.0 InclusionAI | 14 | 103B (6.1B active at inference time) | 128k | $0.2 | 78 | 🤗 | View | |
Pixtral Large Mistral | 14 | 124B | 128k | $3.0 | 53 | 🤗 | View | |
Olmo 3.1 32B Think Allen Institute for AI | 14 | 32.2B | 65.5k | - | 95 | 🤗 | View | |
Grok 2 (Dec '24) xAI | 14 | 270B | 131k | - | - | 🤗 | - | View |
Qwen3 VL 4B (Reasoning) Alibaba | 14 | 4.44B | 256k | - | - | 🤗 | - | View |
Llama 4 Scout Meta | 14 | 109B (17B active at inference time) | 10.0M | $0.3 | 127 | 🤗 | +7 more | View |
Command A Cohere | 13 | 111B | 256k | $4.4 | 46 | 🤗 | View | |
Llama 3.1 Nemotron Instruct 70B NVIDIA | 13 | 70B | 128k | $1.2 | 36 | 🤗 | View | |
Qwen2.5 Instruct 32B Alibaba | 13 | 32B | 128k | - | - | 🤗 | - | View |
Qwen3 8B (Reasoning) Alibaba | 13 | 8.19B | 131k | $0.7 | 76 | 🤗 | View | |
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) NVIDIA | 13 | 31.6B (3.6B active at inference time) | 1.00M | $0.1 | 77 | 🤗 | View | |
NVIDIA Nemotron Nano 9B V2 (Non-reasoning) NVIDIA | 13 | 9B | 131k | $0.1 | 149 | 🤗 | View | |
Mistral Large 2 (Jul '24) Mistral | 13 | 123B | 128k | $3.0 | - | 🤗 | View | |
Qwen3 4B 2507 Instruct Alibaba | 13 | 4.02B | 262k | - | - | 🤗 | - | View |
Qwen2.5 Coder Instruct 32B Alibaba | 13 | 32B | 131k | - | - | 🤗 | - | View |
Qwen3 14B (Non-reasoning) Alibaba | 13 | 14.8B | 32.8k | $0.6 | 62 | 🤗 | View | |
GLM-4.5V (Non-reasoning) Z AI | 13 | 108B (12B active at inference time) | 64.0k | $0.9 | 50 | 🤗 | View | |
Mistral Small 3 Mistral | 13 | 24B | 32.0k | $0.1 | 128 | 🤗 | View | |
Hermes 4 - Llama-3.1 70B (Non-reasoning) Nous Research | 13 | 70.6B | 128k | $0.2 | 79 | 🤗 | View | |
Qwen3 30B A3B (Non-reasoning) Alibaba | 13 | 30.5B (3.3B active at inference time) | 32.8k | $0.3 | 57 | 🤗 | View | |
DeepSeek-V2.5 (Dec '24) DeepSeek | 13 | 236B (21B active at inference time) | 128k | - | - | 🤗 | - | View |
Qwen3 4B (Non-reasoning) Alibaba | 12 | 4.02B | 32.0k | $0.2 | 94 | 🤗 | View | |
Llama 3.1 Instruct 70B Meta | 12 | 70B | 128k | $0.6 | 34 | 🤗 | +2 more | View |
Sarvam 30B (Reasoning) Sarvam | 12 | 32.2B | 65.5k | - | 193 | 🤗 | View | |
DeepSeek-V2.5 DeepSeek | 12 | 236B (21B active at inference time) | 128k | - | - | 🤗 | - | View |
Olmo 3.1 32B Instruct Allen Institute for AI | 12 | 32.2B | 65.5k | $0.3 | 53 | 🤗 | View | |
DeepSeek R1 Distill Llama 8B DeepSeek | 12 | 8B | 128k | - | - | 🤗 | - | View |
Olmo 3 32B Think Allen Institute for AI | 12 | 32.2B | 65.5k | - | - | 🤗 | - | View |
R1 1776 Perplexity | 12 | 671B (37B active at inference time) | 128k | - | - | 🤗 | - | View |
Llama 3.2 Instruct 90B (Vision) Meta | 12 | 90B | 128k | $0.7 | 56 | 🤗 | +1 more | View |
Llama 3.1 Instruct 8B Meta | 12 | 8B | 128k | $0.1 | 155 | 🤗 | +15 more | View |
Qwen2 Instruct 72B Alibaba | 12 | 72B | 131k | - | - | 🤗 | - | View |
EXAONE 4.0 32B (Non-reasoning) LG AI Research | 12 | 32B | 131k | - | - | 🤗 | - | View |
Ministral 3 3B Mistral | 11 | 3B | 256k | $0.1 | 253 | 🤗 | View | |
DeepHermes 3 - Mistral 24B Preview (Non-reasoning) Nous Research | 11 | 24B | 32.0k | - | - | 🤗 | - | View |
Jamba 1.7 Large AI21 Labs | 11 | 398B (94B active at inference time) | 256k | $3.5 | 59 | 🤗 | View | |
Granite 4.0 H Small IBM | 11 | 32B (9B active at inference time) | 128k | $0.1 | 388 | 🤗 | View | |
Jamba 1.5 Large AI21 Labs | 11 | 398B (94B active at inference time) | 256k | $3.5 | - | 🤗 | View | |
Qwen3 Omni 30B A3B Instruct Alibaba | 11 | 35.3B (3B active at inference time) | 65.5k | $0.4 | 96 | 🤗 | View | |
Hermes 3 - Llama-3.1 70B Nous Research | 11 | 70.6B | 128k | $0.3 | 41 | 🤗 | View | |
Qwen3 8B (Non-reasoning) Alibaba | 11 | 8.19B | 32.8k | $0.3 | 78 | 🤗 | View | |
DeepSeek-Coder-V2 DeepSeek | 11 | 236B (21B active at inference time) | 128k | - | - | 🤗 | - | View |
Jamba 1.6 Large AI21 Labs | 11 | 398B (94B active at inference time) | 256k | $3.5 | 59 | 🤗 | View | |
Qwen3.5 0.8B (Reasoning) Alibaba | 11 | 0.873B | 262k | - | - | 🤗 | - | View |
LFM2 24B A2B Liquid AI | 10 | 23.8B (2.3B active at inference time) | 32.8k | $0.1 | 209 | 🤗 | View | |
Phi-4 Microsoft Azure | 10 | 14B | 16.0k | $0.2 | 34 | 🤗 | View | |
Gemma 3 27B Instruct Google | 10 | 27.4B | 128k | - | 27 | 🤗 | +3 more | View |
Mistral Small (Sep '24) Mistral | 10 | 22B | 32.8k | $0.3 | 126 | 🤗 | View | |
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning) NVIDIA | 10 | 13.2B | 128k | $0.3 | 135 | 🤗 | View | |
Gemma 3n E4B Instruct Preview (May '25) Google | 10 | 8.39B (4B active at inference time) | 32.0k | - | - | 🤗 | - | View |
Phi-4 Multimodal Instruct Microsoft Azure | 10 | 5.6B | 128k | - | 17 | 🤗 | View | |
Qwen2.5 Coder Instruct 7B Alibaba | 10 | 7.62B | 131k | - | - | 🤗 | - | View |
Qwen3.5 0.8B (Non-reasoning) Alibaba | 10 | 0.873B | 262k | - | - | 🤗 | - | View |
Mixtral 8x22B Instruct Mistral | 10 | 141B (39B active at inference time) | 65.4k | - | - | 🤗 | - | View |
Llama 3.2 Instruct 3B Meta | 10 | 3B | 128k | $0.1 | 51 | 🤗 | View | |
Jamba Reasoning 3B AI21 Labs | 10 | 3B | 262k | - | - | 🤗 | - | View |
Qwen3 VL 4B Instruct Alibaba | 10 | 4.44B | 256k | - | - | 🤗 | - | View |
Qwen1.5 Chat 110B Alibaba | 10 | 110B | 32.0k | - | - | 🤗 | - | View |
Reka Flash 3 Reka AI | 10 | 21B | 128k | $0.3 | 43 | 🤗 | View | |
Olmo 3 7B Think Allen Institute for AI | 9 | 7B | 65.5k | - | - | 🤗 | - | View |
Ling-mini-2.0 InclusionAI | 9 | 16.3B (1.4B active at inference time) | 131k | - | - | 🤗 | - | View |
DeepSeek R1 Distill Qwen 1.5B DeepSeek | 9 | 1.5B | 128k | - | - | 🤗 | - | View |
DeepSeek-V2-Chat DeepSeek | 9 | 236B (21B active at inference time) | 128k | - | - | 🤗 | - | View |
Qwen Chat 72B Alibaba | 9 | 72B | 33.8k | - | - | 🤗 | - | View |
Gemma 3 12B Instruct Google | 9 | 12.2B | 128k | - | 25 | 🤗 | +2 more | View |
Llama 3.2 Instruct 11B (Vision) Meta | 9 | 11B | 128k | $0.2 | 45 | 🤗 | View | |
DeepSeek Coder V2 Lite Instruct DeepSeek | 8 | 16B (2.4B active at inference time) | 128k | - | - | 🤗 | - | View |
Phi-4 Mini Instruct Microsoft Azure | 8 | 3.84B | 128k | - | 43 | 🤗 | View | |
Sarvam M (Reasoning) Sarvam | 8 | 23.6B | 32.8k | - | - | 🤗 | - | View |
Command-R+ (Apr '24) Cohere | 8 | 104B | 128k | $6.0 | - | 🤗 | View | |
DBRX Instruct Databricks | 8 | 132B (36B active at inference time) | 32.8k | - | - | 🤗 | - | View |
Exaone 4.0 1.2B (Reasoning) LG AI Research | 8 | 1.28B | 64.0k | - | - | 🤗 | - | View |
Olmo 3 7B Instruct Allen Institute for AI | 8 | 7B | 65.5k | $0.1 | 144 | 🤗 | View | |
Exaone 4.0 1.2B (Non-reasoning) LG AI Research | 8 | 1.28B | 64.0k | - | - | 🤗 | - | View |
LFM2.5-1.2B-Thinking Liquid AI | 8 | 1.17B | 32.0k | - | - | 🤗 | - | View |
Jamba 1.7 Mini AI21 Labs | 8 | 52B (12B active at inference time) | 258k | - | - | 🤗 | - | View |
LFM2 2.6B Liquid AI | 8 | 2.57B | 32.8k | - | - | 🤗 | ? | View |
LFM2.5-1.2B-Instruct Liquid AI | 8 | 1.17B | 32.0k | - | - | 🤗 | ? | View |
Jamba 1.5 Mini AI21 Labs | 8 | 52B (12B active at inference time) | 256k | $0.3 | - | 🤗 | View | |
Granite 4.0 H 1B IBM | 8 | 1.5B | 128k | - | - | 🤗 | - | View |
Qwen3 1.7B (Reasoning) Alibaba | 8 | 2.03B | 32.0k | $0.4 | 126 | 🤗 | View | |
Jamba 1.6 Mini AI21 Labs | 8 | 52B (12B active at inference time) | 256k | $0.3 | 174 | 🤗 | View | |
Mixtral 8x7B Instruct Mistral | 8 | 46.7B (12.9B active at inference time) | 32.8k | $0.5 | - | 🤗 | View | |
Gemma 3 270M Google | 8 | 0.268B | 32.0k | - | - | 🤗 | - | View |
Granite 4.0 Micro IBM | 8 | 3B | 128k | - | - | 🤗 | - | View |
DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning) Nous Research | 8 | 8B | 128k | - | - | 🤗 | - | View |
Command-R (Mar '24) Cohere | 7 | 35B | 128k | $0.8 | - | 🤗 | View | |
Granite 4.0 1B IBM | 7 | 1.6B | 128k | - | - | 🤗 | - | View |
Molmo2-8B Allen Institute for AI | 7 | 8.66B | 36.9k | - | 105 | 🤗 | View | |
LFM2 8B A1B Liquid AI | 7 | 8.34B (1.5B active at inference time) | 32.8k | - | - | 🤗 | ? | View |
Granite 3.3 8B (Non-reasoning) IBM | 7 | 8.17B | 128k | $0.1 | 163 | 🤗 | View | |
Qwen3 1.7B (Non-reasoning) Alibaba | 7 | 2.03B | 32.0k | $0.2 | 128 | 🤗 | View | |
Qwen3 0.6B (Reasoning) Alibaba | 6 | 0.752B | 32.0k | $0.4 | 194 | 🤗 | View | |
Gemma 3n E4B Instruct Google | 6 | 8.39B (4B active at inference time) | 32.0k | $0.0 | 42 | 🤗 | View | |
LFM2 1.2B Liquid AI | 6 | 1.17B | 32.8k | - | - | 🤗 | ? | View |
Gemma 3 4B Instruct Google | 6 | 4.3B | 128k | - | 28 | 🤗 | View | |
Llama 3.2 Instruct 1B Meta | 6 | 1B | 128k | $0.1 | 95 | 🤗 | View | |
LFM2.5-VL-1.6B Liquid AI | 6 | 1.6B | 32.0k | - | - | 🤗 | ? | View |
Granite 4.0 350M IBM | 6 | 0.35B | 32.8k | - | - | 🤗 | - | View |
Qwen3 0.6B (Non-reasoning) Alibaba | 6 | 0.752B | 32.0k | $0.2 | 192 | 🤗 | View | |
Gemma 3 1B Instruct Google | 6 | 1B | 32.0k | - | 42 | 🤗 | View | |
Granite 4.0 H 350M IBM | 5 | 0.34B | 32.8k | - | - | 🤗 | - | View |
Gemma 3n E2B Instruct Google | 5 | 5.98B (2B active at inference time) | 32.0k | - | - | 🤗 | View | |
Cogito v2.1 (Reasoning) Deep Cogito | - | 671B (37B active at inference time) | 128k | $1.3 | 87 | 🤗 | View |
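For readers who want to work with the table programmatically, the following is a rough sketch of parsing one row into numeric fields. The function names are hypothetical, and the sketch assumes the cell order seen in the rows above (model label, Intelligence Index, parameters, context window), treats "-" as a missing index, and treats a missing active figure as a fully active model.

```python
import re
from typing import Optional, Tuple

PARAMS_RE = re.compile(r"^([\d.,]+)B(?: \(([\d.]+)B active at inference time\))?$")


def parse_params(cell: str) -> Tuple[float, float]:
    """Return (total, active) parameters in billions from a parameters cell."""
    match = PARAMS_RE.match(cell.strip())
    if match is None:
        raise ValueError(f"unrecognized parameters cell: {cell!r}")
    total = float(match.group(1).replace(",", ""))
    # Assumption: no active figure means every parameter is active.
    active = float(match.group(2)) if match.group(2) else total
    return total, active


def parse_context(cell: str) -> int:
    """Convert a context window such as '200k' or '1.00M' to a token count."""
    cell = cell.strip()
    multiplier = {"k": 1_000, "M": 1_000_000}[cell[-1]]
    return round(float(cell[:-1]) * multiplier)


def parse_row(line: str) -> dict:
    """Parse the first four cells of a pipe-delimited table row."""
    cells = [c.strip() for c in line.strip().strip("|").split("|")]
    label, index, params, context = cells[:4]
    total_b, active_b = parse_params(params)
    score: Optional[int] = None if index == "-" else int(index)
    return {
        "label": label,  # model name and creator are fused in the source rows
        "index": score,
        "total_b": total_b,
        "active_b": active_b,
        "context_tokens": parse_context(context),
    }


row = "GLM-5 (Reasoning) Z AI | 50 | 744B (40B active at inference time) | 200k | $1.6 | 68 | 🤗 | +10 more | View |"
print(parse_row(row))
```

The price, speed, and provider cells are left unparsed because their units are not stated in the table itself.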