Comparison of Open Source Models
Comparison and analysis of open source AI models across key metrics, including quality, inference speed, context window, parameter count, and licensing details. Models are considered open source (also commonly referred to as open weights) when their weights are available to download. This allows self-hosting on your own infrastructure and enables customization of the model, for example through fine-tuning. For more details on our methodology, see our FAQs.
gpt-oss-120B (high) and Qwen3 235B 2507 are the highest intelligence open source models, followed by Qwen3 Next 80B A3B and DeepSeek V3.1.
Highlights
[Chart: Intelligence. Artificial Analysis Intelligence Index; higher is better. Legend: estimate (independent evaluation forthcoming).]
[Chart: Total Parameters. Trainable parameters in billions.]
Further details
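Model and Creator | Intelligence Index | Total Parameters | Context Window | Price (USD per 1M tokens) | Output Speed (tokens/s) | Weights | Provider Benchmarks | Details |
---|---|---|---|---|---|---|---|---|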
gpt-oss-120B OpenAI | 58 | 117B (5.1B active at inference time) | 131k | $0.3 | 245 | 🤗 | +15 more | View |
Qwen3 235B A22B 2507 (Reasoning) Alibaba | 57 | 235B (22B active at inference time) | 256k | $2.6 | 49 | Not available | +4 more | View |
Qwen3 Next 80B A3B (Reasoning) Alibaba | 54 | 80B (3B active at inference time) | 262k | $1.9 | 70 | 🤗 | +2 more | View |
DeepSeek V3.1 (Reasoning) DeepSeek | 54 | 685B (37.0B active at inference time) | 128k | $1.0 | 21 | 🤗 | +1 more | View |
DeepSeek R1 0528 (May '25) DeepSeek | 52 | 685B (37B active at inference time) | 128k | $1.0 | 20 | 🤗 | +13 more | View |
Kimi K2 0905 Moonshot AI | 50 | 1T (32.0B active at inference time) | 256k | $1.4 | 51 | 🤗 | +1 more | View |
GLM-4.5 Z AI | 49 | 355B (32B active at inference time) | 128k | $1.0 | 51 | 🤗 | +4 more | View |
GLM-4.5-Air Z AI | 49 | - | 128k | $0.4 | 98 | 🤗 | +2 more | View |
Kimi K2 Moonshot AI | 48 | 1T (32B active at inference time) | 128k | $1.1 | 53 | 🤗 | +7 more | View |
Qwen3 30B A3B 2507 (Reasoning) Alibaba | 46 | 30.5B (3.3B active at inference time) | 262k | $0.8 | 100 | 🤗 | | View |
MiniMax M1 80k MiniMax | 46 | 456B (45.9B active at inference time) | 1.00M | $0.8 | - | 🤗 | | View |
Qwen3 235B A22B 2507 (Non-reasoning) Alibaba | 45 | 235B (22.0B active at inference time) | 256k | $1.2 | 39 | 🤗 | +8 more | View |
Llama Nemotron Super 49B v1.5 (Reasoning) NVIDIA | 45 | 49B | 128k | - | - | 🤗 | | View |
Qwen3 Next 80B A3B Alibaba | 45 | 80B (3B active at inference time) | 262k | $0.9 | 61 | 🤗 | +2 more | View |
gpt-oss-20B OpenAI | 45 | 21B (3.6B active at inference time) | 131k | $0.1 | 254 | 🤗 | +7 more | View |
DeepSeek V3.1 (Non-reasoning) DeepSeek | 45 | 685B (37B active at inference time) | 128k | $0.5 | 20 | 🤗 | +6 more | View |
DeepSeek R1 (Jan '25) DeepSeek | 44 | 685B (37B active at inference time) | 128k | $2.0 | - | 🤗 | +8 more | View |
Qwen3 4B 2507 (Reasoning) Alibaba | 43 | 4.02B | 262k | - | - | 🤗 | - | View |
EXAONE 4.0 32B (Reasoning) LG AI Research | 43 | 32B | 131k | $0.7 | 56 | 🤗 | | View |
Qwen3 Coder 480B A35B Alibaba | 42 | 480B (35.0B active at inference time) | 262k | $3.0 | 39 | 🤗 | +9 more | View |
Qwen3 235B A22B (Reasoning) Alibaba | 42 | 235B (22B active at inference time) | 32.8k | $2.6 | 47 | 🤗 | +2 more | View |
MiniMax M1 40k MiniMax | 42 | 456B (45.9B active at inference time) | 1.00M | $0.8 | - | 🤗 | | View |
Hermes 4 - Llama-3.1 405B (Reasoning) Nous Research | 42 | 406B | 128k | $1.5 | 37 | 🤗 | | View |
DeepSeek V3 0324 DeepSeek | 41 | 671B (37B active at inference time) | 128k | $0.5 | 21 | 🤗 | +11 more | View |
Hermes 4 - Llama-3.1 70B (Reasoning) Nous Research | 39 | 70.6B | 128k | $0.2 | 87 | 🤗 | | View |
Qwen3 32B (Reasoning) Alibaba | 39 | 32.8B | 32.8k | $2.6 | 57 | 🤗 | +5 more | View |
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) NVIDIA | 38 | 253B | 128k | $0.9 | 37 | 🤗 | | View |
NVIDIA Nemotron Nano 9B V2 NVIDIA | 38 | 9B | 131k | - | - | Not available | - | View |
QwQ 32B Alibaba | 38 | 32.8B | 131k | $0.5 | 37 | 🤗 | +3 more | View |
GLM-4.5V (Reasoning) Z AI | 37 | 108B (12.0B active at inference time) | 64.0k | $0.9 | 45 | 🤗 | | View |
Qwen3 30B A3B 2507 (Non-reasoning) Alibaba | 37 | 30.5B (3.3B active at inference time) | 262k | $0.3 | 86 | 🤗 | | View |
NVIDIA Nemotron Nano 9B V2 (Reasoning) NVIDIA | 37 | 9B | 131k | - | - | Not available | - | View |
Qwen3 30B A3B (Reasoning) Alibaba | 37 | 30.5B (3.3B active at inference time) | 32.8k | $0.8 | 69 | 🤗 | +3 more | View |
Qwen3 14B (Reasoning) Alibaba | 36 | 14.8B | 32.8k | $1.3 | 48 | 🤗 | | View |
Llama 4 Maverick Meta | 36 | 402B (17B active at inference time) | 1.00M | $0.4 | 138 | 🤗 | +12 more | View |
Llama 3.3 Nemotron Super 49B v1 (Reasoning) NVIDIA | 35 | 49B | 128k | - | - | 🤗 | - | View |
DeepSeek R1 0528 Qwen3 8B DeepSeek | 35 | 8.19B | 32.8k | $0.1 | 42 | 🤗 | | View |
EXAONE 4.0 32B LG AI Research | 33 | 32B | 131k | $0.7 | 54 | Not available | | View |
Qwen3 Coder 30B A3B Alibaba | 33 | 30.5B (3.3B active at inference time) | 262k | $0.9 | 92 | 🤗 | | View |
DeepSeek R1 Distill Qwen 32B DeepSeek | 33 | 32B | 128k | $0.3 | 20 | 🤗 | | View |
Hermes 4 - Llama-3.1 405B Nous Research | 33 | 406B | 128k | $1.5 | 34 | 🤗 | | View |
Reka Flash 3 Reka AI | 33 | 21B | 128k | $0.3 | 50 | 🤗 | | View |
DeepSeek V3 (Dec '24) DeepSeek | 32 | 671B (37B active at inference time) | 128k | $0.5 | - | 🤗 | +6 more | View |
Magistral Small Mistral | 32 | 23.6B | 40.0k | $0.8 | 184 | 🤗 | | View |
DeepSeek R1 Distill Llama 70B DeepSeek | 31 | 70B | 128k | $0.8 | 117 | 🤗 | +3 more | View |
Qwen3 235B A22B Alibaba | 30 | 235B (22B active at inference time) | 32.8k | $1.2 | 41 | 🤗 | +2 more | View |
DeepSeek R1 Distill Qwen 14B DeepSeek | 30 | 14B | 128k | $0.9 | 148 | 🤗 | | View |
Qwen3 14B Alibaba | 29 | 14.8B | 32.8k | $0.6 | 46 | 🤗 | | View |
Mistral Small 3.2 Mistral | 29 | 24B | 128k | $0.1 | 118 | 🤗 | | View |
Qwen2.5 Instruct 72B Alibaba | 29 | 72B | 131k | - | 49 | 🤗 | +2 more | View |
Qwen3 8B (Reasoning) Alibaba | 28 | 8.19B | 131k | $0.7 | 90 | 🤗 | | View |
Llama 4 Scout Meta | 28 | 109B (17B active at inference time) | 10.0M | $0.3 | 111 | 🤗 | +10 more | View |
Command A Cohere | 28 | 111B | 256k | $4.4 | 88 | 🤗 | | View |
QwQ 32B-Preview Alibaba | 28 | 32.8B | 32.8k | $0.7 | 69 | 🤗 | | View |
Llama 3.3 Instruct 70B Meta | 28 | 70B | 128k | $0.6 | 72 | 🤗 | +19 more | View |
EXAONE 4.0 1.2B (Reasoning) LG AI Research | 27 | 1.28B | 64.0k | - | - | 🤗 | - | View |
Llama Nemotron Super 49B v1.5 NVIDIA | 27 | 49B | 128k | - | - | 🤗 | | View |
Qwen3 30B A3B Alibaba | 26 | 30.5B (3.3B active at inference time) | 32.8k | $0.3 | 66 | 🤗 | | View |
Qwen3 32B Alibaba | 26 | 32.8B | 32.8k | $1.2 | 54 | 🤗 | +6 more | View |
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) NVIDIA | 26 | 4.51B | 128k | - | - | 🤗 | - | View |
GLM-4.5V Z AI | 26 | 108B (12.0B active at inference time) | 64.0k | $0.9 | 57 | 🤗 | | View |
Llama 3.3 Nemotron Super 49B v1 NVIDIA | 26 | 49B | 128k | - | - | 🤗 | - | View |
MiniMax-Text-01 MiniMax | 26 | 456B (45.9B active at inference time) | 4.00M | $0.4 | - | 🤗 | | View |
Llama 3.1 Instruct 405B Meta | 26 | 405B | 128k | $3.5 | 31 | 🤗 | +8 more | View |
Qwen3 4B (Reasoning) Alibaba | 26 | 4.02B | 32.0k | $0.4 | 93 | 🤗 | | View |
Mistral Large 2 (Nov '24) Mistral | 26 | 123B | 128k | $3.0 | 45 | 🤗 | | View |
Llama 3.1 Tulu3 405B Allen Institute for AI | 25 | 405B | 128k | - | - | 🤗 | - | View |
Pixtral Large Mistral | 25 | 124B | 128k | $3.0 | 44 | 🤗 | | View |
Grok 2 (Dec '24) xAI | 25 | 270B | 131k | - | - | 🤗 | - | View |
Phi-4 Microsoft Azure | 25 | 14B | 16.0k | $0.2 | 24 | 🤗 | | View |
Hermes 4 - Llama-3.1 70B Nous Research | 24 | 70.6B | 128k | $0.2 | 80 | 🤗 | | View |
Llama 3.1 Nemotron Instruct 70B NVIDIA | 24 | 70B | 128k | $0.6 | 30 | 🤗 | | View |
Mistral Small 3.1 Mistral | 23 | 24B | 128k | $0.1 | 171 | 🤗 | +2 more | View |
Qwen3 8B Alibaba | 23 | 8.19B | 32.8k | $0.3 | 81 | 🤗 | | View |
Qwen2.5 Instruct 32B Alibaba | 23 | 32B | 128k | - | - | 🤗 | - | View |
Llama 3.1 Instruct 70B Meta | 23 | 70B | 128k | $0.8 | 61 | 🤗 | +9 more | View |
Qwen3 1.7B (Reasoning) Alibaba | 22 | 2.03B | 32.0k | $0.4 | 124 | 🤗 | | View |
Mistral Large 2 (Jul '24) Mistral | 22 | 123B | 128k | $3.0 | 37 | 🤗 | | View |
Gemma 3 27B Instruct Google | 22 | 27.4B | 128k | - | 48 | 🤗 | | View |
Qwen2.5 Coder Instruct 32B Alibaba | 22 | 32B | 131k | $0.2 | 76 | 🤗 | | View |
Mistral Small 3 Mistral | 21 | 24B | 32.0k | $0.1 | 157 | 🤗 | | View |
Jamba 1.7 Large AI21 Labs | 21 | 398B (94.0B active at inference time) | 256k | $3.5 | 44 | 🤗 | | View |
Gemma 3 12B Instruct Google | 21 | 12.2B | 128k | $0.2 | - | 🤗 | | View |
DeepSeek-V2.5 (Dec '24) DeepSeek | 21 | 236B (21B active at inference time) | 128k | $0.2 | - | 🤗 | | View |
Qwen3 4B Alibaba | 21 | 4.02B | 32.0k | $0.2 | 88 | 🤗 | | View |
EXAONE 4.0 1.2B LG AI Research | 20 | 1.28B | 64.0k | - | - | 🤗 | - | View |
DeepSeek-V2.5 DeepSeek | 20 | 236B (21B active at inference time) | 128k | $0.2 | - | 🤗 | | View |
Devstral Small (May '25) Mistral | 20 | 23.6B | 256k | $0.1 | 115 | 🤗 | | View |
DeepSeek R1 Distill Llama 8B DeepSeek | 19 | 8B | 128k | $0.0 | 49 | 🤗 | | View |
R1 1776 Perplexity | 19 | 671B (37B active at inference time) | 128k | $3.5 | - | 🤗 | | View |
Llama 3.2 Instruct 90B (Vision) Meta | 19 | 90B | 128k | $0.7 | 38 | 🤗 | +1 more | View |
Solar Mini Upstage | 19 | 10.7B | 4.10k | $0.1 | 81 | 🤗 | | View |
Grok-1 xAI | 18 | 314B (78B active at inference time) | 8.19k | - | - | 🤗 | - | View |
Qwen2 Instruct 72B Alibaba | 18 | 72B | 131k | - | - | 🤗 | - | View |
Devstral Small (Jul '25) Mistral | 18 | 24B | 256k | $0.1 | 148 | 🤗 | | View |
Gemma 2 27B Google | 17 | 27.2B | 8.19k | - | - | 🤗 | - | View |
Llama 3.1 Instruct 8B Meta | 17 | 8B | 128k | $0.1 | 168 | 🤗 | +18 more | View |
Phi-4 Mini Instruct Microsoft Azure | 16 | 3.84B | 128k | - | 47 | 🤗 | | View |
Gemma 3n E4B Instruct Google | 16 | 8.39B (4.0B active at inference time) | 32.0k | $0.0 | 68 | 🤗 | | View |
DeepHermes 3 - Mistral 24B Preview Nous Research | 16 | 24B | 32.0k | - | - | 🤗 | - | View |
Granite 3.3 Instruct 8B IBM | 15 | 8.17B | 128k | $0.1 | 436 | 🤗 | | View |
Jamba 1.5 Large AI21 Labs | 15 | 398B (94B active at inference time) | 256k | $3.5 | - | 🤗 | | View |
Gemma 3 4B Instruct Google | 15 | 4.3B | 128k | $0.1 | - | 🤗 | | View |
Hermes 3 - Llama-3.1 70B Nous Research | 15 | 70.6B | 128k | $0.3 | 33 | 🤗 | | View |
Llama 3.2 Instruct 11B (Vision) Meta | 15 | 11B | 128k | $0.2 | 59 | 🤗 | | View |
DeepSeek-Coder-V2 DeepSeek | 15 | 236B (21B active at inference time) | 128k | $0.2 | - | 🤗 | | View |
Qwen3 1.7B Alibaba | 14 | 2.03B | 32.0k | $0.2 | 117 | 🤗 | | View |
Jamba 1.6 Large AI21 Labs | 14 | 398B (94B active at inference time) | 256k | $3.5 | 43 | 🤗 | | View |
Qwen3 0.6B (Reasoning) Alibaba | 14 | 0.752B | 32.0k | $0.4 | 207 | 🤗 | | View |
Phi-3 Mini Instruct 3.8B Microsoft Azure | 13 | 3.8B | 4.10k | $0.2 | 68 | 🤗 | | View |
Llama 3 Instruct 70B Meta | 13 | 70B | 8.19k | $0.9 | 42 | 🤗 | +6 more | View |
Mistral Small (Sep '24) Mistral | 13 | 22B | 32.8k | $0.3 | 92 | 🤗 | | View |
Gemma 3n E4B Instruct Preview (May '25) Google | 13 | 8.39B (4B active at inference time) | 32.0k | - | - | 🤗 | - | View |
Phi-4 Multimodal Instruct Microsoft Azure | 12 | 5.6B | 128k | - | 18 | 🤗 | | View |
Qwen2.5 Coder Instruct 7B Alibaba | 12 | 7.62B | 131k | - | - | 🤗 | - | View |
Mixtral 8x22B Instruct Mistral | 12 | 141B (39B active at inference time) | 65.4k | $3.0 | 51 | 🤗 | | View |
Llama 2 Chat 7B Meta | 11 | 7B | 4.10k | $0.1 | 113 | 🤗 | | View |
Llama 3.2 Instruct 3B Meta | 11 | 3B | 128k | $0.0 | 101 | 🤗 | +3 more | View |
Qwen3 0.6B Alibaba | 11 | 0.752B | 32.0k | $0.2 | 193 | 🤗 | | View |
Qwen1.5 Chat 110B Alibaba | 11 | 110B | 32.0k | - | - | 🤗 | - | View |
Phi-3 Medium Instruct 14B Microsoft Azure | 10 | 14B | 128k | $0.3 | 43 | 🤗 | | View |
Pixtral 12B (2409) Mistral | 9 | 12B | 128k | $0.1 | 107 | 🤗 | | View |
DeepSeek R1 Distill Qwen 1.5B DeepSeek | 9 | 1.5B | 128k | - | - | 🤗 | - | View |
DeepSeek-V2-Chat DeepSeek | 9 | 236B (21B active at inference time) | 128k | $0.2 | - | 🤗 | | View |
LFM2 1.2B Liquid AI | 8 | 1.17B | 32.8k | - | - | 🤗 | | View |
Gemma 3n E2B Instruct Google | 8 | 5.98B (2.0B active at inference time) | 32.0k | - | 47 | 🤗 | | View |
Ministral 8B Mistral | 8 | 8B | 128k | $0.1 | 177 | 🤗 | | View |
Gemma 2 9B Google | 8 | 9B | 8.19k | $0.2 | - | 🤗 | | View |
Arctic Instruct Snowflake | 8 | 480B (17B active at inference time) | 4.00k | - | - | 🤗 | - | View |
Qwen Chat 72B Alibaba | 8 | 72B | 33.8k | $1.0 | - | 🤗 | | View |
Llama 3.2 Instruct 1B Meta | 7 | 1B | 128k | $0.1 | 104 | 🤗 | | View |
Command-R+ (Aug '24) Cohere | 7 | 104B | 128k | $4.4 | 38 | 🤗 | | View |
Llama 3 Instruct 8B Meta | 7 | 8B | 8.19k | $0.1 | 66 | 🤗 | +3 more | View |
Gemma 3 1B Instruct Google | 6 | 1B | 32.0k | - | - | 🤗 | | View |
DeepSeek Coder V2 Lite Instruct DeepSeek | 6 | 16B (2.4B active at inference time) | 128k | - | - | 🤗 | - | View |
Codestral (May '24) Mistral | 6 | 22B | 32.8k | - | - | 🤗 | - | View |
Aya Expanse 32B Cohere | 6 | 32B | 128k | $0.8 | 73 | 🤗 | | View |
Llama 2 Chat 70B Meta | 6 | 70B | 4.10k | - | - | 🤗 | - | View |
DeepSeek LLM 67B Chat (V1) DeepSeek | 6 | 67B | 4.10k | - | - | 🤗 | - | View |
Llama 2 Chat 13B Meta | 6 | 13B | 4.10k | - | - | 🤗 | - | View |
Command-R+ (Apr '24) Cohere | 5 | 104B | 128k | $6.0 | 19 | 🤗 | | View |
OpenChat 3.5 (1210) OpenChat | 5 | 7B | 8.19k | - | - | 🤗 | - | View |
DBRX Instruct Databricks | 5 | 132B (36B active at inference time) | 32.8k | - | - | 🤗 | - | View |
Mistral NeMo Mistral | 5 | 12B | 128k | $0.1 | 124 | 🤗 | +1 more | View |
Jamba 1.5 Mini AI21 Labs | 4 | 52B (12B active at inference time) | 256k | $0.3 | - | 🤗 | | View |
Jamba 1.7 Mini AI21 Labs | 4 | 52B (12.0B active at inference time) | 258k | $0.3 | 140 | 🤗 | | View |
Jamba 1.6 Mini AI21 Labs | 3 | 52B (12B active at inference time) | 256k | $0.3 | 140 | 🤗 | | View |
Mixtral 8x7B Instruct Mistral | 3 | 46.7B (12.9B active at inference time) | 32.8k | $0.7 | 58 | 🤗 | +2 more | View |
DeepHermes 3 - Llama-3.1 8B Preview Nous Research | 2 | 8B | 128k | - | - | 🤗 | - | View |
Aya Expanse 8B Cohere | 2 | 8B | 8.00k | $0.8 | 83 | 🤗 | | View |
Codestral-Mamba Mistral | 1 | 7B | 256k | - | - | 🤗 | - | View |
Mistral 7B Instruct Mistral | 1 | 7B | 8.19k | $0.3 | 109 | 🤗 | +3 more | View |
Command-R (Aug '24) Cohere | 1 | 32B | 128k | $0.3 | 88 | 🤗 | | View |
Command-R (Mar '24) Cohere | 1 | 35B | 128k | $0.8 | 83 | 🤗 | | View |
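
Several rows in the table list both a total parameter count and a smaller count that is active at inference time. As a rough illustration of how to read those cells, the Python sketch below parses a few of them and computes the share of parameters that is active. The function names `parse_parameters` and `active_share` are hypothetical and are not part of any published tooling; the sketch assumes cells in the form `117B (5.1B active at inference time)` or `70B`, and it assumes that a model with no separate active count uses all of its parameters at inference time.

```python
import re

# A few "Total Parameters" cells copied from the table above.
ROWS = [
    ("gpt-oss-120B", "117B (5.1B active at inference time)"),
    ("Qwen3 235B A22B 2507 (Reasoning)", "235B (22B active at inference time)"),
    ("Llama 4 Maverick", "402B (17B active at inference time)"),
    ("Llama 3.3 Instruct 70B", "70B"),
]

# Matches "<total>B" optionally followed by "(<active>B active at inference time)".
_CELL = re.compile(
    r"^\s*([\d.]+)B(?:\s*\(([\d.]+)B active at inference time\))?\s*$"
)


def parse_parameters(cell):
    """Return (total, active) in billions of parameters.

    Assumption: when no active count is listed, the model is treated
    as using all of its parameters at inference time.
    """
    match = _CELL.match(cell)
    if match is None:
        # Cells such as "-" carry no parameter count.
        raise ValueError(f"unrecognized parameter cell: {cell!r}")
    total = float(match.group(1))
    active = float(match.group(2)) if match.group(2) else total
    return total, active


def active_share(cell):
    """Fraction of total parameters that is active at inference time."""
    total, active = parse_parameters(cell)
    return active / total


if __name__ == "__main__":
    for model, cell in ROWS:
        print(f"{model}: {active_share(cell):.1%} of parameters active")
```

A ratio like this only describes how much of the model is exercised at inference time; it says nothing on its own about quality, price, or speed, which the table reports separately.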