Comparison of Open Source Models
Comparison and analysis of open source AI models across key performance metrics including quality, performance, inference speed, context window, parameter count & licensing details. Models are considered Open Source (also commonly referred to as open weights) where their weights are accessible to download. This allows self-hosting on your own infrastructure and enables customizing the model such as through fine-tuning. Click on any model to see detailed metrics. For more details relating to our methodology, see our FAQs.
Qwen3 235B 2507 (Reasoning) and
gpt-oss-120B (high) are the highest intelligence open source models, followed by
DeepSeek R1 0528 &
GLM-4.5.
Highlights
Intelligence
Artificial Analysis Intelligence Index; Higher is better
Loading chart...
Total Parameters
Trainable parameters in billions
Loading chart...
Further details
Weights | Provider Benchmarks | |||||||
---|---|---|---|---|---|---|---|---|
Qwen3 235B A22B 2507 (Reasoning) Alibaba | 64 | 235B (22B active at inference time) | 256k | $2.6 | - | Not available | ![]() +5 more | View |
gpt-oss-120B OpenAI | 61 | 117B (5.1B active at inference time) | 131k | $0.3 | 220 | 🤗 | ![]() ![]() +9 more | View |
DeepSeek R1 0528 (May '25) DeepSeek | 59 | 685B (37B active at inference time) | 128k | $1.0 | 21 | 🤗 | ![]() ![]() +13 more | View |
GLM-4.5 Z AI | 56 | 355B (32B active at inference time) | 128k | $1.0 | 51 | 🤗 | ![]() ![]() +3 more | View |
Qwen3 30B A3B 2507 (Reasoning) Alibaba | 53 | 30.5B (3.3B active at inference time) | 32.8k | $0.8 | - | 🤗 | ![]() | View |
![]() MiniMax M1 80k MiniMax | 53 | 456B (45.9B active at inference time) | 1.00M | $0.8 | 18 | 🤗 | ![]() | View |
Llama Nemotron Super 49B v1.5 (Reasoning) NVIDIA | 52 | 49B | 128k | - | - | 🤗 | ? | View |
![]() MiniMax M1 40k MiniMax | 51 | 456B (45.9B active at inference time) | 1.00M | $0.8 | 22 | 🤗 | ![]() | View |
Qwen3 235B A22B 2507 (Non-reasoning) Alibaba | 51 | 235B (22.0B active at inference time) | 256k | $1.2 | - | 🤗 | ![]() +6 more | View |
![]() EXAONE 4.0 32B (Reasoning) LG AI Research | 51 | 32B | 131k | $0.7 | 98 | 🤗 | View | |
DeepSeek R1 (Jan '25) DeepSeek | 50 | 685B (37B active at inference time) | 128k | $2.0 | - | 🤗 | ![]() +10 more | View |
GLM-4.5-Air Z AI | 49 | - | 128k | $0.4 | 173 | 🤗 | ![]() ![]() +1 more | View |
gpt-oss-20B OpenAI | 49 | 21B (3.6B active at inference time) | 131k | $0.1 | 285 | 🤗 | ![]() +5 more | View |
Kimi K2 Moonshot AI | 49 | 1.0KB (32B active at inference time) | 128k | $1.1 | 44 | 🤗 | ![]() +6 more | View |
Qwen3 235B A22B (Reasoning) Alibaba | 48 | 235B (22B active at inference time) | 32.8k | $2.6 | - | 🤗 | ![]() +4 more | View |
QwQ 32B Alibaba | 48 | 32.8B | 131k | $0.5 | 55 | 🤗 | ![]() ![]() +3 more | View |
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) NVIDIA | 46 | 253B | 128k | $0.9 | 42 | 🤗 | ![]() | View |
Qwen3 30B A3B 2507 (Non-reasoning) Alibaba | 46 | 30.5B (3.3B active at inference time) | 32.8k | $0.3 | - | 🤗 | ![]() | View |
Qwen3 14B (Reasoning) Alibaba | 45 | 14.8B | 32.8k | $1.3 | - | 🤗 | ![]() ![]() | View |
Qwen3 Coder 480B A35B Alibaba | 45 | 480B (35.0B active at inference time) | 262k | $3.0 | - | 🤗 | ![]() ![]() +8 more | View |
Qwen3 32B (Reasoning) Alibaba | 44 | 32.8B | 32.8k | $2.6 | - | 🤗 | ![]() ![]() +6 more | View |
DeepSeek V3 0324 (Mar '25) DeepSeek | 44 | 671B (37B active at inference time) | 128k | $0.5 | 22 | 🤗 | ![]() ![]() +11 more | View |
Qwen3 30B A3B (Reasoning) Alibaba | 42 | 30.5B (3.3B active at inference time) | 32.8k | $0.8 | - | 🤗 | ![]() ![]() +4 more | View |
Llama 4 Maverick Meta | 42 | 402B (17B active at inference time) | 1.00M | $0.4 | 166 | 🤗 | ![]() ![]() +11 more | View |
DeepSeek R1 0528 Qwen3 8B DeepSeek | 42 | 8.19B | 32.8k | $0.1 | 92 | 🤗 | View | |
DeepSeek R1 Distill Qwen 32B DeepSeek | 41 | 32B | 128k | $0.2 | 22 | 🤗 | ![]() | View |
Qwen3 8B (Reasoning) Alibaba | 41 | 8.19B | 32.8k | $0.7 | - | 🤗 | View | |
Llama 3.3 Nemotron Super 49B v1 (Reasoning) NVIDIA | 40 | 49B | 128k | - | - | 🤗 | - | View |
![]() EXAONE 4.0 32B LG AI Research | 40 | 32B | 131k | $0.7 | 87 | Not available | View | |
DeepSeek R1 Distill Qwen 14B DeepSeek | 38 | 14B | 128k | $0.9 | 47 | 🤗 | View | |
DeepSeek R1 Distill Llama 70B DeepSeek | 37 | 70B | 128k | $0.8 | 63 | 🤗 | ![]() ![]() +5 more | View |
Qwen3 4B (Reasoning) Alibaba | 36 | 4.02B | 32.0k | $0.4 | - | 🤗 | ![]() | View |
![]() Reka Flash 3 Reka AI | 36 | 21B | 128k | $0.3 | 56 | 🤗 | ![]() | View |
![]() Magistral Small Mistral | 36 | 23.6B | 40.0k | $0.8 | 204 | 🤗 | ![]() | View |
DeepSeek V3 (Dec '24) DeepSeek | 35 | 671B (37B active at inference time) | 128k | $0.5 | - | 🤗 | ![]() ![]() +6 more | View |
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) NVIDIA | 34 | 4.51B | 128k | - | - | 🤗 | - | View |
Qwen3 Coder 30B A3B Alibaba | 33 | 30.5B (3.3B active at inference time) | 262k | $0.9 | - | 🤗 | ![]() | View |
Qwen3 235B A22B Alibaba | 33 | 235B (22B active at inference time) | 32.8k | $1.2 | - | 🤗 | ![]() ![]() +4 more | View |
Llama 4 Scout Meta | 33 | 109B (17B active at inference time) | 10.0M | $0.2 | 132 | 🤗 | ![]() +10 more | View |
![]() Mistral Small 3.2 Mistral | 32 | 24B | 128k | $0.1 | 176 | 🤗 | ![]() ![]() | View |
![]() Command A Cohere | 32 | 111B | 256k | $4.4 | 166 | 🤗 | ![]() | View |
QwQ 32B-Preview Alibaba | 32 | 32.8B | 32.8k | $0.7 | 50 | 🤗 | ![]() | View |
Llama 3.3 Instruct 70B Meta | 31 | 70B | 128k | $0.6 | 129 | 🤗 | ![]() +17 more | View |
Qwen3 30B A3B Alibaba | 30 | 30.5B (3.3B active at inference time) | 32.8k | $0.3 | - | 🤗 | ![]() | View |
Qwen3 14B Alibaba | 30 | 14.8B | 32.8k | $0.6 | - | 🤗 | ![]() | View |
Qwen3 32B Alibaba | 30 | 32.8B | 32.8k | $1.2 | - | 🤗 | ![]() ![]() +7 more | View |
Llama 3.1 Instruct 405B Meta | 29 | 405B | 128k | $3.3 | 33 | 🤗 | ![]() +11 more | View |
Qwen2.5 Instruct 72B Alibaba | 29 | 72B | 131k | - | - | 🤗 | ![]() +3 more | View |
![]() MiniMax-Text-01 MiniMax | 29 | 456B (45.9B active at inference time) | 4.00M | $0.4 | 39 | 🤗 | ![]() | View |
![]() Llama 3.1 Tulu3 405B Allen Institute for AI | 29 | 405B | 128k | - | - | 🤗 | - | View |
Llama 3.3 Nemotron Super 49B v1 NVIDIA | 28 | 49B | 128k | - | - | 🤗 | - | View |
Phi-4 Microsoft Azure | 28 | 14B | 16.0k | $0.2 | 44 | 🤗 | ![]() ![]() | View |
![]() Mistral Large 2 (Nov '24) Mistral | 27 | 123B | 128k | $3.0 | 50 | 🤗 | ![]() | View |
Llama Nemotron Super 49B v1.5 NVIDIA | 27 | 49B | 128k | - | - | 🤗 | ? | View |
Qwen3 1.7B (Reasoning) Alibaba | 27 | 2.03B | 32.0k | $0.4 | - | 🤗 | View | |
![]() Mistral Small 3.1 Mistral | 26 | 24B | 128k | $0.1 | 150 | 🤗 | ![]() ![]() +2 more | View |
![]() Pixtral Large Mistral | 26 | 124B | 128k | $3.0 | 63 | 🤗 | ![]() | View |
Qwen2.5 Instruct 32B Alibaba | 26 | 32B | 128k | $0.1 | - | 🤗 | ![]() ![]() | View |
Llama 3.1 Nemotron Instruct 70B NVIDIA | 26 | 70B | 128k | $0.2 | 42 | 🤗 | ![]() | View |
Qwen3 8B Alibaba | 25 | 8.19B | 32.8k | $0.3 | - | 🤗 | View | |
![]() Mistral Large 2 (Jul '24) Mistral | 25 | 123B | 128k | $3.0 | 103 | 🤗 | ![]() | View |
Gemma 3 27B Instruct Google | 25 | 27.4B | 128k | - | 62 | 🤗 | ![]() | View |
Qwen2.5 Coder Instruct 32B Alibaba | 25 | 32B | 131k | $0.1 | 53 | 🤗 | ![]() +1 more | View |
Llama 3.1 Instruct 70B Meta | 24 | 70B | 128k | $0.8 | 54 | 🤗 | ? ![]() +10 more | View |
Gemma 3 12B Instruct Google | 24 | 12.2B | 128k | $0.1 | - | 🤗 | ![]() | View |
![]() Mistral Small 3 Mistral | 24 | 24B | 32.0k | $0.1 | 200 | 🤗 | ![]() ![]() | View |
DeepSeek-V2.5 (Dec '24) DeepSeek | 24 | 236B (21B active at inference time) | 128k | $0.2 | - | 🤗 | View | |
Qwen3 4B Alibaba | 24 | 4.02B | 32.0k | $0.2 | - | 🤗 | View | |
DeepSeek-V2.5 DeepSeek | 23 | 236B (21B active at inference time) | 128k | $0.2 | - | 🤗 | View | |
![]() Devstral Small (May '25) Mistral | 23 | 23.6B | 256k | $0.1 | 108 | 🤗 | ![]() ![]() | View |
DeepSeek R1 Distill Llama 8B DeepSeek | 23 | 8B | 128k | $0.0 | 57 | 🤗 | View | |
![]() R1 1776 Perplexity | 22 | 671B (37B active at inference time) | 128k | $3.5 | - | 🤗 | ![]() | View |
Llama 3.2 Instruct 90B (Vision) Meta | 22 | 90B | 128k | $0.7 | 40 | 🤗 | ![]() +1 more | View |
Solar Mini Upstage | 22 | 10.7B | 4.10k | $0.1 | 95 | 🤗 | View | |
Grok-1 xAI | 21 | 314B (78B active at inference time) | 8.19k | - | - | 🤗 | - | View |
Qwen2 Instruct 72B Alibaba | 21 | 72B | 131k | - | - | 🤗 | View | |
![]() Devstral Small (Jul '25) Mistral | 21 | 24B | 256k | $0.1 | 161 | 🤗 | ![]() ![]() ![]() | View |
Gemma 2 27B Google | 20 | 27.2B | 8.19k | $0.8 | - | 🤗 | View | |
Llama 3.1 Instruct 8B Meta | 19 | 8B | 128k | $0.1 | 187 | 🤗 | ? +18 more | View |
Gemma 3n E4B Instruct Google | 18 | 8.39B (4.0B active at inference time) | 32.0k | $0.0 | 88 | 🤗 | View | |
![]() DeepHermes 3 - Mistral 24B Preview Nous Research | 18 | 24B | 32.0k | - | - | 🤗 | - | View |
![]() Jamba 1.7 Large AI21 Labs | 18 | 398B (94.0B active at inference time) | 256k | $3.5 | 48 | 🤗 | ![]() | View |
![]() Jamba 1.5 Large AI21 Labs | 18 | 398B (94B active at inference time) | 256k | $3.5 | - | 🤗 | View | |
Granite 3.3 Instruct 8B IBM | 18 | 8.17B | 128k | $0.1 | 126 | 🤗 | View | |
![]() Hermes 3 - Llama-3.1 70B Nous Research | 17 | 70.6B | 128k | $0.1 | 40 | 🤗 | ![]() | View |
DeepSeek-Coder-V2 DeepSeek | 17 | 236B (21B active at inference time) | 128k | $0.2 | - | 🤗 | View | |
![]() Jamba 1.6 Large AI21 Labs | 17 | 398B (94B active at inference time) | 256k | $3.5 | 48 | 🤗 | ![]() | View |
Llama 3 Instruct 70B Meta | 16 | 70B | 8.19k | $0.8 | 60 | 🤗 | +7 more | View |
![]() Mistral Small (Sep '24) Mistral | 16 | 22B | 32.8k | $0.3 | 99 | 🤗 | ![]() | View |
Gemma 3n E4B Instruct Preview (May '25) Google | 15 | 8.39B (4B active at inference time) | 32.0k | - | - | 🤗 | - | View |
Phi-4 Multimodal Instruct Microsoft Azure | 15 | 5.6B | 128k | - | 23 | 🤗 | View | |
Qwen2.5 Coder Instruct 7B Alibaba | 15 | 7.62B | 131k | - | - | 🤗 | - | View |
![]() Mixtral 8x22B Instruct Mistral | 14 | 141B (39B active at inference time) | 65.4k | $3.0 | 58 | 🤗 | ![]() ![]() | View |
Phi-4 Mini Instruct Microsoft Azure | 14 | 3.84B | 128k | - | 57 | 🤗 | View | |
Llama 2 Chat 7B Meta | 14 | 7B | 4.10k | $0.1 | 130 | 🤗 | View | |
Gemma 3 4B Instruct Google | 14 | 4.3B | 128k | $0.0 | - | 🤗 | ![]() | View |
Llama 3.2 Instruct 11B (Vision) Meta | 13 | 11B | 128k | $0.2 | 89 | 🤗 | ![]() | View |
Qwen3 1.7B Alibaba | 13 | 2.03B | 32.0k | $0.2 | - | 🤗 | View | |
Qwen1.5 Chat 110B Alibaba | 13 | 110B | 32.0k | - | - | 🤗 | View | |
Phi-3 Medium Instruct 14B Microsoft Azure | 13 | 14B | 128k | $0.3 | 52 | 🤗 | View | |
![]() Pixtral 12B (2409) Mistral | 11 | 12B | 128k | $0.1 | 103 | 🤗 | ![]() | View |
Qwen3 0.6B (Reasoning) Alibaba | 11 | 0.752B | 32.0k | $0.4 | - | 🤗 | View | |
DeepSeek-V2-Chat DeepSeek | 11 | 236B (21B active at inference time) | 128k | $0.2 | - | 🤗 | View | |
Gemma 3n E2B Instruct Google | 10 | 5.98B (2.0B active at inference time) | 32.0k | - | 61 | 🤗 | View | |
![]() Ministral 8B Mistral | 10 | 8B | 128k | $0.1 | 188 | 🤗 | ![]() | View |
Gemma 2 9B Google | 10 | 9B | 8.19k | $0.2 | - | 🤗 | ![]() ![]() ![]() | View |
Phi-3 Mini Instruct 3.8B Microsoft Azure | 10 | 3.8B | 4.10k | $0.2 | 83 | 🤗 | View | |
Arctic Instruct Snowflake | 10 | 480B (17B active at inference time) | 4.00k | - | - | 🤗 | - | View |
Qwen Chat 72B Alibaba | 10 | 72B | 33.8k | $1.0 | - | 🤗 | View | |
![]() Command-R+ (Aug '24) Cohere | 9 | 104B | 128k | $4.4 | 48 | 🤗 | ![]() | View |
Llama 3 Instruct 8B Meta | 9 | 8B | 8.19k | $0.1 | 103 | 🤗 | ? ![]() +4 more | View |
DeepSeek Coder V2 Lite Instruct DeepSeek | 8 | 16B (2.4B active at inference time) | 128k | - | - | 🤗 | - | View |
![]() Codestral (May '24) Mistral | 8 | 22B | 32.8k | $0.3 | - | 🤗 | ![]() | View |
![]() Aya Expanse 32B Cohere | 8 | 32B | 128k | $0.8 | 119 | 🤗 | ![]() | View |
Llama 2 Chat 70B Meta | 8 | 70B | 4.10k | - | - | 🤗 | - | View |
DeepSeek LLM 67B Chat (V1) DeepSeek | 8 | 7B | 4.10k | - | - | 🤗 | - | View |
Llama 2 Chat 13B Meta | 8 | 13B | 4.10k | - | - | 🤗 | - | View |
![]() Command-R+ (Apr '24) Cohere | 8 | 104B | 128k | $6.0 | 58 | 🤗 | ![]() | View |
![]() OpenChat 3.5 (1210) OpenChat | 8 | 7B | 8.19k | - | - | 🤗 | - | View |
![]() DBRX Instruct Databricks | 8 | 132B (36B active at inference time) | 32.8k | - | - | 🤗 | - | View |
![]() Mistral NeMo Mistral | 8 | 12B | 128k | $0.1 | 131 | 🤗 | ![]() +2 more | View |
Llama 3.2 Instruct 3B Meta | 7 | 3B | 128k | $0.0 | 154 | 🤗 | +4 more | View |
DeepSeek R1 Distill Qwen 1.5B DeepSeek | 7 | 1.5B | 128k | - | - | 🤗 | - | View |
![]() Jamba 1.5 Mini AI21 Labs | 6 | 52B (12B active at inference time) | 256k | $0.3 | - | 🤗 | View | |
![]() Jamba 1.7 Mini AI21 Labs | 6 | 52B (12.0B active at inference time) | 258k | $0.3 | 166 | 🤗 | ![]() | View |
![]() Jamba 1.6 Mini AI21 Labs | 5 | 52B (12B active at inference time) | 256k | $0.3 | 165 | 🤗 | ![]() | View |
![]() Mixtral 8x7B Instruct Mistral | 5 | 46.7B (12.9B active at inference time) | 32.8k | $0.7 | 65 | 🤗 | ? ![]() +2 more | View |
Qwen3 0.6B Alibaba | 4 | 0.752B | 32.0k | $0.2 | - | 🤗 | View | |
![]() DeepHermes 3 - Llama-3.1 8B Preview Nous Research | 4 | 8B | 128k | - | - | 🤗 | - | View |
![]() Aya Expanse 8B Cohere | 4 | 8B | 8.00k | $0.8 | 167 | 🤗 | ![]() | View |
![]() Command-R (Aug '24) Cohere | 3 | 32B | 128k | $0.3 | 70 | 🤗 | ![]() | View |
![]() Command-R (Mar '24) Cohere | 2 | 35B | 128k | $0.8 | 172 | 🤗 | ![]() | View |
![]() Codestral-Mamba Mistral | 2 | 7B | 256k | - | - | 🤗 | - | View |
Gemma 3 1B Instruct Google | 1 | 1B | 32.0k | - | - | 🤗 | ? | View |
Llama 3.2 Instruct 1B Meta | 1 | 1B | 128k | $0.1 | 131 | 🤗 | ![]() | View |
![]() Mistral 7B Instruct Mistral | 1 | 7B | 8.19k | $0.3 | 124 | 🤗 | ![]() ? +3 more | View |