Comparison of Open Source Models
Comparison and analysis of open source AI models across key performance metrics including quality, performance, inference speed, context window, parameter count & licensing details. Models are considered Open Source (also commonly referred to as open weights) where their weights are accessible to download. This allows self-hosting on your own infrastructure and enables customizing the model such as through fine-tuning. Click on any model to see detailed metrics. For more details relating to our methodology, see our FAQs.
MiniMax-M2 and
gpt-oss-120B (high) are the highest intelligence open source models, followed by
DeepSeek V3.1 Terminus &
Qwen3 235B A22B 2507.
Highlights
Intelligence
Artificial Analysis Intelligence Index; Higher is better
Estimate (independent evaluation forthcoming)
Total Parameters
Trainable parameters in billions
Further details
| Weights | Provider Benchmarks | |||||||
|---|---|---|---|---|---|---|---|---|
MiniMax-M2 MiniMax | 61 | 230B (10B active at inference time) | 205k | $0.5 | 79 | 🤗 | View | |
gpt-oss-120B (high) OpenAI | 58 | 117B (5.1B active at inference time) | 131k | $0.3 | 344 | 🤗 | +20 more | View |
DeepSeek V3.1 Terminus (Reasoning) DeepSeek | 58 | 685B (37B active at inference time) | 128k | $1.9 | - | 🤗 | View | |
Qwen3 235B A22B 2507 (Reasoning) Alibaba | 57 | 235B (22B active at inference time) | 256k | $2.6 | 82 | 🤗 | +6 more | View |
DeepSeek V3.2 Exp (Reasoning) DeepSeek | 57 | 685B (37.0B active at inference time) | 128k | $0.3 | 29 | 🤗 | View | |
GLM-4.6 (Reasoning) Z AI | 56 | 357B (32B active at inference time) | 200k | $1.0 | 94 | 🤗 | +3 more | View |
Qwen3 Next 80B A3B (Reasoning) Alibaba | 54 | 80B (3B active at inference time) | 262k | $1.9 | 170 | 🤗 | +6 more | View |
DeepSeek V3.1 (Reasoning) DeepSeek | 54 | 685B (37.0B active at inference time) | 128k | $0.7 | - | 🤗 | +1 more | View |
gpt-oss-20B (high) OpenAI | 52 | 21B (3.6B active at inference time) | 131k | $0.1 | 256 | 🤗 | +9 more | View |
DeepSeek R1 0528 (May '25) DeepSeek | 52 | 685B (37B active at inference time) | 128k | $2.0 | - | 🤗 | +11 more | View |
Seed-OSS-36B-Instruct ByteDance Seed | 52 | 36.2B | 512k | $0.3 | 49 | 🤗 | View | |
Apriel-v1.5-15B-Thinker ServiceNow | 52 | 15B | 128k | - | - | Not available | - | View |
GLM-4.5 (Reasoning) Z AI | 51 | 355B (32B active at inference time) | 128k | $1.0 | 97 | 🤗 | +3 more | View |
Kimi K2 0905 Moonshot AI | 50 | 1.0KB (32.0B active at inference time) | 256k | $1.2 | 70 | 🤗 | +4 more | View |
GLM-4.5-Air Z AI | 49 | 106B (12B active at inference time) | 128k | $0.4 | 174 | 🤗 | +2 more | View |
Kimi K2 Moonshot AI | 48 | 1.0KB (32B active at inference time) | 128k | $1.1 | 57 | 🤗 | +5 more | View |
Qwen3 30B A3B 2507 (Reasoning) Alibaba | 46 | 30.5B (3.3B active at inference time) | 262k | $0.8 | 102 | 🤗 | View | |
DeepSeek V3.2 Exp (Non-reasoning) DeepSeek | 46 | 685B (37.0B active at inference time) | 128k | $0.3 | 27 | 🤗 | View | |
MiniMax M1 80k MiniMax | 46 | 456B (45.9B active at inference time) | 1.00M | $0.8 | - | 🤗 | View | |
DeepSeek V3.1 Terminus (Non-reasoning) DeepSeek | 46 | 685B (37B active at inference time) | 128k | $0.6 | - | 🤗 | +1 more | View |
Qwen3 235B A22B 2507 Instruct Alibaba | 45 | 235B (22.0B active at inference time) | 256k | $1.2 | 41 | 🤗 | +10 more | View |
Llama Nemotron Super 49B v1.5 (Reasoning) NVIDIA | 45 | 49B | 128k | $0.2 | 76 | 🤗 | View | |
Qwen3 Next 80B A3B Instruct Alibaba | 45 | 80B (3B active at inference time) | 262k | $0.9 | 147 | 🤗 | +4 more | View |
DeepSeek V3.1 (Non-reasoning) DeepSeek | 45 | 685B (37B active at inference time) | 128k | $0.8 | - | 🤗 | +8 more | View |
GLM-4.6 (Non-reasoning) Z AI | 45 | 357B (32B active at inference time) | 200k | $1.0 | 45 | 🤗 | View | |
DeepSeek R1 (Jan '25) DeepSeek | 44 | 685B (37B active at inference time) | 128k | $2.2 | - | 🤗 | +7 more | View |
Qwen3 4B 2507 (Reasoning) Alibaba | 43 | 4.02B | 262k | - | - | 🤗 | - | View |
Magistral Small 1.2 Mistral | 43 | 24B | 128k | $0.8 | 200 | 🤗 | View | |
EXAONE 4.0 32B (Reasoning) LG AI Research | 43 | 32B | 131k | $0.7 | 90 | 🤗 | View | |
Qwen3 Coder 480B A35B Instruct Alibaba | 42 | 480B (35.0B active at inference time) | 262k | $3.0 | 43 | 🤗 | +10 more | View |
Qwen3 235B A22B (Reasoning) Alibaba | 42 | 235B (22B active at inference time) | 32.8k | $2.6 | 54 | 🤗 | +2 more | View |
Hermes 4 - Llama-3.1 405B (Reasoning) Nous Research | 42 | 406B | 128k | $1.5 | 36 | 🤗 | View | |
DeepSeek V3 0324 DeepSeek | 41 | 671B (37B active at inference time) | 128k | $0.9 | - | 🤗 | +10 more | View |
MiniMax M1 40k MiniMax | 40 | 456B (45.9B active at inference time) | 1.00M | $0.8 | 43 | 🤗 | View | |
Qwen3 Omni 30B A3B (Reasoning) Alibaba | 40 | 35.3B (3.0B active at inference time) | 65.5k | $0.4 | 94 | 🤗 | View | |
Hermes 4 - Llama-3.1 70B (Reasoning) Nous Research | 39 | 70.6B | 128k | $0.2 | 85 | 🤗 | View | |
Qwen3 32B (Reasoning) Alibaba | 39 | 32.8B | 32.8k | $2.6 | 55 | 🤗 | +6 more | View |
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) NVIDIA | 38 | 253B | 128k | $0.9 | 38 | 🤗 | View | |
QwQ 32B Alibaba | 38 | 32.8B | 131k | $0.5 | 80 | 🤗 | +3 more | View |
GLM-4.5V (Reasoning) Z AI | 37 | 108B (12.0B active at inference time) | 64.0k | $0.9 | - | 🤗 | View | |
Qwen3 30B A3B 2507 Instruct Alibaba | 37 | 30.5B (3.3B active at inference time) | 262k | $0.3 | 84 | 🤗 | View | |
NVIDIA Nemotron Nano 9B V2 (Reasoning) NVIDIA | 37 | 9B | 131k | $0.1 | 120 | 🤗 | View | |
Qwen3 30B A3B (Reasoning) Alibaba | 37 | 30.5B (3.3B active at inference time) | 32.8k | $0.8 | 69 | 🤗 | +3 more | View |
NVIDIA Nemotron Nano 9B V2 (Non-reasoning) NVIDIA | 36 | 9B | 131k | $0.1 | 119 | 🤗 | View | |
Qwen3 14B (Reasoning) Alibaba | 36 | 14.8B | 32.8k | $1.3 | 56 | 🤗 | View | |
Llama 4 Maverick Meta | 36 | 402B (17B active at inference time) | 1.00M | $0.4 | 131 | 🤗 | +11 more | View |
Llama 3.3 Nemotron Super 49B v1 (Reasoning) NVIDIA | 35 | 49B | 128k | - | - | 🤗 | - | View |
Qwen3 Coder 30B A3B Instruct Alibaba | 33 | 30.5B (3.3B active at inference time) | 262k | $0.9 | 93 | 🤗 | View | |
ERNIE 4.5 300B A47B Baidu | 33 | 300B (47.0B active at inference time) | 131k | $0.5 | 24 | 🤗 | View | |
DeepSeek R1 Distill Qwen 32B DeepSeek | 33 | 32B | 128k | $0.3 | 44 | 🤗 | View | |
Hermes 4 - Llama-3.1 405B (Non-reasoning) Nous Research | 33 | 406B | 128k | $1.5 | 32 | 🤗 | View | |
DeepSeek V3 (Dec '24) DeepSeek | 32 | 671B (37B active at inference time) | 128k | $0.7 | - | 🤗 | +3 more | View |
Qwen3 VL 8B (Reasoning) Alibaba | 32 | 8.77B | 256k | $0.7 | 60 | 🤗 | View | |
Magistral Small 1 Mistral | 32 | 23.6B | 40.0k | $0.8 | 204 | 🤗 | View | |
DeepSeek R1 0528 Qwen3 8B DeepSeek | 31 | 8.19B | 32.8k | $0.1 | 67 | 🤗 | View | |
Qwen3 4B 2507 Instruct Alibaba | 30 | 4.02B | 262k | - | - | 🤗 | - | View |
EXAONE 4.0 32B (Non-reasoning) LG AI Research | 30 | 32B | 131k | $0.7 | 87 | 🤗 | View | |
Qwen3 Omni 30B A3B Instruct Alibaba | 30 | 35.3B (3.0B active at inference time) | 65.5k | $0.4 | 36 | 🤗 | View | |
Qwen3 235B A22B (Non-reasoning) Alibaba | 30 | 235B (22B active at inference time) | 32.8k | $1.2 | 54 | 🤗 | +2 more | View |
DeepSeek R1 Distill Llama 70B DeepSeek | 30 | 70B | 128k | $0.8 | 286 | 🤗 | +2 more | View |
DeepSeek R1 Distill Qwen 14B DeepSeek | 30 | 14B | 128k | $0.9 | 148 | 🤗 | View | |
Qwen3 14B (Non-reasoning) Alibaba | 29 | 14.8B | 32.8k | $0.6 | 50 | 🤗 | View | |
Mistral Small 3.2 Mistral | 29 | 24B | 128k | $0.1 | 109 | 🤗 | View | |
Qwen2.5 Instruct 72B Alibaba | 29 | 72B | 131k | - | 47 | 🤗 | +2 more | View |
MiniMax-Text-01 MiniMax | 28 | 456B (45.9B active at inference time) | 4.00M | $0.4 | 27 | 🤗 | View | |
Qwen3 8B (Reasoning) Alibaba | 28 | 8.19B | 131k | $0.7 | 87 | 🤗 | View | |
Llama 4 Scout Meta | 28 | 109B (17B active at inference time) | 10.0M | $0.2 | 127 | 🤗 | +9 more | View |
Llama 3.1 Instruct 405B Meta | 28 | 405B | 128k | $4.0 | 30 | 🤗 | +6 more | View |
QwQ 32B-Preview Alibaba | 28 | 32.8B | 32.8k | $0.7 | 82 | 🤗 | View | |
Llama 3.3 Instruct 70B Meta | 28 | 70B | 128k | $0.6 | 99 | 🤗 | +20 more | View |
Qwen3 VL 4B (Reasoning) Alibaba | 27 | 4.44B | 256k | $0.3 | 156 | 🤗 | View | |
Devstral Small (Jul '25) Mistral | 27 | 24B | 256k | $0.1 | 135 | 🤗 | View | |
Qwen3 VL 8B Instruct Alibaba | 27 | 8.77B | 256k | $0.3 | 102 | 🤗 | View | |
Command A Cohere | 27 | 111B | 256k | $4.4 | 96 | 🤗 | View | |
Mistral Large 2 (Nov '24) Mistral | 27 | 123B | 128k | $3.0 | 46 | 🤗 | View | |
Exaone 4.0 1.2B (Reasoning) LG AI Research | 27 | 1.28B | 64.0k | - | - | 🤗 | - | View |
Llama Nemotron Super 49B v1.5 (Non-reasoning) NVIDIA | 27 | 49B | 128k | $0.2 | 65 | 🤗 | View | |
Qwen3 30B A3B (Non-reasoning) Alibaba | 26 | 30.5B (3.3B active at inference time) | 32.8k | $0.3 | 57 | 🤗 | View | |
Qwen3 32B (Non-reasoning) Alibaba | 26 | 32.8B | 32.8k | $1.2 | 53 | 🤗 | +7 more | View |
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) NVIDIA | 26 | 4.51B | 128k | - | - | 🤗 | - | View |
GLM-4.5V (Non-reasoning) Z AI | 26 | 108B (12.0B active at inference time) | 64.0k | $0.9 | 54 | 🤗 | View | |
Reka Flash 3 Reka AI | 26 | 21B | 128k | $0.3 | 49 | 🤗 | View | |
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) NVIDIA | 26 | 49B | 128k | - | - | 🤗 | - | View |
Qwen3 4B (Reasoning) Alibaba | 26 | 4.02B | 32.0k | $0.4 | 92 | 🤗 | View | |
Llama 3.1 Tulu3 405B Allen Institute for AI | 25 | 405B | 128k | - | - | 🤗 | - | View |
Qwen3 VL 4B Instruct Alibaba | 25 | 4.44B | 256k | $0.2 | 141 | 🤗 | View | |
Pixtral Large Mistral | 25 | 124B | 128k | $3.0 | 36 | 🤗 | View | |
Grok 2 (Dec '24) xAI | 25 | 270B | 131k | $4.0 | 89 | 🤗 | View | |
Hermes 4 - Llama-3.1 70B (Non-reasoning) Nous Research | 24 | 70.6B | 128k | $0.2 | 77 | 🤗 | View | |
Llama 3.1 Nemotron Instruct 70B NVIDIA | 24 | 70B | 128k | $0.6 | 33 | 🤗 | View | |
Mistral Small 3.1 Mistral | 23 | 24B | 128k | $0.1 | 170 | 🤗 | +2 more | View |
Qwen3 8B (Non-reasoning) Alibaba | 23 | 8.19B | 32.8k | $0.3 | 79 | 🤗 | View | |
Qwen2.5 Instruct 32B Alibaba | 23 | 32B | 128k | - | - | 🤗 | - | View |
Granite 4.0 H Small IBM | 23 | 32B (9B active at inference time) | 128k | $0.1 | - | Not available | View | |
Phi-4 Microsoft Azure | 23 | 14B | 16.0k | $0.2 | 33 | 🤗 | View | |
Llama 3.1 Instruct 70B Meta | 23 | 70B | 128k | $0.6 | 109 | 🤗 | +6 more | View |
Qwen3 1.7B (Reasoning) Alibaba | 22 | 2.03B | 32.0k | $0.4 | 125 | 🤗 | View | |
Mistral Large 2 (Jul '24) Mistral | 22 | 123B | 128k | $3.0 | 36 | 🤗 | View | |
Gemma 3 27B Instruct Google | 22 | 27.4B | 128k | - | 37 | 🤗 | +1 more | View |
Qwen2.5 Coder Instruct 32B Alibaba | 22 | 32B | 131k | $0.2 | 72 | 🤗 | View | |
Mistral Small 3 Mistral | 21 | 24B | 32.0k | $0.1 | 171 | 🤗 | View | |
Jamba Reasoning 3B AI21 Labs | 21 | 3B | 262k | - | - | 🤗 | - | View |
Jamba 1.7 Large AI21 Labs | 21 | 398B (94.0B active at inference time) | 256k | $3.5 | 50 | 🤗 | View | |
DeepSeek-V2.5 (Dec '24) DeepSeek | 21 | 236B (21B active at inference time) | 128k | - | - | 🤗 | - | View |
Qwen3 4B (Non-reasoning) Alibaba | 21 | 4.02B | 32.0k | $0.2 | 84 | 🤗 | View | |
Exaone 4.0 1.2B (Non-reasoning) LG AI Research | 20 | 1.28B | 64.0k | - | - | 🤗 | - | View |
Gemma 3 12B Instruct Google | 20 | 12.2B | 128k | - | 15 | 🤗 | +1 more | View |
DeepSeek-V2.5 DeepSeek | 20 | 236B (21B active at inference time) | 128k | - | - | 🤗 | - | View |
Devstral Small (May '25) Mistral | 20 | 23.6B | 256k | $0.1 | 123 | 🤗 | View | |
DeepSeek R1 Distill Llama 8B DeepSeek | 19 | 8B | 128k | - | - | 🤗 | - | View |
R1 1776 Perplexity | 19 | 671B (37B active at inference time) | 128k | - | - | 🤗 | - | View |
Llama 3.2 Instruct 90B (Vision) Meta | 19 | 90B | 128k | $0.7 | 28 | 🤗 | +1 more | View |
Solar Mini Upstage | 19 | 10.7B | 4.10k | $0.1 | 79 | 🤗 | View | |
Grok-1 xAI | 18 | 314B (78B active at inference time) | 8.19k | - | - | 🤗 | - | View |
Qwen2 Instruct 72B Alibaba | 18 | 72B | 131k | - | - | 🤗 | - | View |
LFM2 8B A1B Liquid AI | 17 | 8.34B (1.5B active at inference time) | 32.8k | - | - | 🤗 | - | View |
Gemma 2 27B Google | 17 | 27.2B | 8.19k | - | - | 🤗 | - | View |
Llama 3.1 Instruct 8B Meta | 17 | 8B | 128k | $0.1 | 201 | 🤗 | +16 more | View |
Granite 4.0 Micro IBM | 16 | 3B | 128k | - | - | Not available | - | View |
Phi-4 Mini Instruct Microsoft Azure | 16 | 3.84B | 128k | - | 47 | 🤗 | View | |
DeepHermes 3 - Mistral 24B Preview (Non-reasoning) Nous Research | 16 | 24B | 32.0k | - | - | 🤗 | - | View |
Llama 3.2 Instruct 11B (Vision) Meta | 16 | 11B | 128k | $0.2 | 70 | 🤗 | View | |
Gemma 3n E4B Instruct Google | 15 | 8.39B (4.0B active at inference time) | 32.0k | $0.0 | 48 | 🤗 | View | |
Granite 3.3 8B (Non-reasoning) IBM | 15 | 8.17B | 128k | $0.1 | 488 | 🤗 | View | |
Jamba 1.5 Large AI21 Labs | 15 | 398B (94B active at inference time) | 256k | $3.5 | - | 🤗 | View | |
Jamba 1.7 Mini AI21 Labs | 15 | 52B (12.0B active at inference time) | 258k | $0.3 | 159 | 🤗 | View | |
Gemma 3 4B Instruct Google | 15 | 4.3B | 128k | - | 29 | 🤗 | View | |
Hermes 3 - Llama-3.1 70B Nous Research | 15 | 70.6B | 128k | $0.3 | 31 | 🤗 | View | |
DeepSeek-Coder-V2 DeepSeek | 15 | 236B (21B active at inference time) | 128k | - | - | 🤗 | - | View |
Qwen3 1.7B (Non-reasoning) Alibaba | 14 | 2.03B | 32.0k | $0.2 | 115 | 🤗 | View | |
Phi-3 Medium Instruct 14B Microsoft Azure | 14 | 14B | 128k | $0.3 | 42 | 🤗 | View | |
Jamba 1.6 Large AI21 Labs | 14 | 398B (94B active at inference time) | 256k | $3.5 | 51 | 🤗 | View | |
Qwen3 0.6B (Reasoning) Alibaba | 14 | 0.752B | 32.0k | $0.4 | 209 | 🤗 | View | |
Aya Expanse 32B Cohere | 14 | 32B | 128k | $0.8 | 46 | 🤗 | View | |
Llama 3 Instruct 70B Meta | 13 | 70B | 8.19k | $0.9 | 84 | 🤗 | +4 more | View |
Mistral Small (Sep '24) Mistral | 13 | 22B | 32.8k | $0.3 | 94 | 🤗 | View | |
Phi-3 Mini Instruct 3.8B Microsoft Azure | 13 | 3.8B | 4.10k | $0.2 | 74 | 🤗 | View | |
Gemma 3n E4B Instruct Preview (May '25) Google | 13 | 8.39B (4B active at inference time) | 32.0k | - | - | 🤗 | - | View |
Phi-4 Multimodal Instruct Microsoft Azure | 12 | 5.6B | 128k | - | 17 | 🤗 | View | |
Ministral 8B Mistral | 12 | 8B | 128k | $0.1 | 165 | 🤗 | View | |
Qwen2.5 Coder Instruct 7B Alibaba | 12 | 7.62B | 131k | - | - | 🤗 | - | View |
LFM2 2.6B Liquid AI | 12 | 2.57B | 32.8k | - | - | 🤗 | - | View |
Mixtral 8x22B Instruct Mistral | 12 | 141B (39B active at inference time) | 65.4k | $3.0 | 62 | 🤗 | View | |
Llama 2 Chat 7B Meta | 11 | 7B | 4.10k | $0.1 | 113 | 🤗 | View | |
Gemma 3n E2B Instruct Google | 11 | 5.98B (2.0B active at inference time) | 32.0k | - | 30 | 🤗 | View | |
Llama 3.2 Instruct 3B Meta | 11 | 3B | 128k | $0.1 | 105 | 🤗 | +2 more | View |
Qwen3 0.6B (Non-reasoning) Alibaba | 11 | 0.752B | 32.0k | $0.2 | 198 | 🤗 | View | |
Qwen1.5 Chat 110B Alibaba | 11 | 110B | 32.0k | - | - | 🤗 | - | View |
Aya Expanse 8B Cohere | 10 | 8B | 8.00k | $0.8 | 83 | 🤗 | View | |
LFM2 1.2B Liquid AI | 10 | 1.17B | 32.8k | - | - | 🤗 | ? | View |
Pixtral 12B (2409) Mistral | 9 | 12B | 128k | $0.1 | 157 | 🤗 | View | |
Llama 3.2 Instruct 1B Meta | 9 | 1B | 128k | $0.1 | 76 | 🤗 | View | |
DeepSeek R1 Distill Qwen 1.5B DeepSeek | 9 | 1.5B | 128k | - | - | 🤗 | - | View |
DeepSeek-V2-Chat DeepSeek | 9 | 236B (21B active at inference time) | 128k | - | - | 🤗 | - | View |
Gemma 2 9B Google | 8 | 9B | 8.19k | $0.0 | - | 🤗 | View | |
Arctic Instruct Snowflake | 8 | 480B (17B active at inference time) | 4.00k | - | - | 🤗 | - | View |
Qwen Chat 72B Alibaba | 8 | 72B | 33.8k | - | - | 🤗 | - | View |
Command-R+ (Aug '24) Cohere | 7 | 104B | 128k | $4.4 | 23 | 🤗 | View | |
Llama 3 Instruct 8B Meta | 7 | 8B | 8.19k | $0.1 | 66 | 🤗 | +1 more | View |
Gemma 3 1B Instruct Google | 7 | 1B | 32.0k | - | 19 | 🤗 | View | |
DeepSeek Coder V2 Lite Instruct DeepSeek | 6 | 16B (2.4B active at inference time) | 128k | - | - | 🤗 | - | View |
Codestral (May '24) Mistral | 6 | 22B | 32.8k | - | - | 🤗 | - | View |
Gemma 3 270M Google | 6 | 0.268B | 32.0k | - | - | 🤗 | - | View |
Llama 2 Chat 70B Meta | 6 | 70B | 4.10k | - | - | 🤗 | - | View |
DeepSeek LLM 67B Chat (V1) DeepSeek | 6 | 7B | 4.10k | - | - | 🤗 | - | View |
Llama 2 Chat 13B Meta | 6 | 13B | 4.10k | - | - | 🤗 | - | View |
Command-R+ (Apr '24) Cohere | 5 | 104B | 128k | $6.0 | - | 🤗 | View | |
OpenChat 3.5 (1210) OpenChat | 5 | 7B | 8.19k | - | - | 🤗 | - | View |
DBRX Instruct Databricks | 5 | 132B (36B active at inference time) | 32.8k | - | - | 🤗 | - | View |
Mistral NeMo Mistral | 5 | 12B | 128k | $0.1 | 133 | 🤗 | +2 more | View |
Jamba 1.5 Mini AI21 Labs | 4 | 52B (12B active at inference time) | 256k | $0.3 | - | 🤗 | View | |
Jamba 1.6 Mini AI21 Labs | 3 | 52B (12B active at inference time) | 256k | $0.3 | 162 | 🤗 | View | |
Mixtral 8x7B Instruct Mistral | 3 | 46.7B (12.9B active at inference time) | 32.8k | $0.7 | 60 | 🤗 | +1 more | View |
DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning) Nous Research | 2 | 8B | 128k | - | - | 🤗 | - | View |
Llama 65B Meta | 1 | 65B | 2.05k | - | - | Not available | - | View |
Qwen Chat 14B Alibaba | 1 | 14B | 8.19k | - | - | Not available | - | View |
Codestral-Mamba Mistral | 1 | 7B | 256k | - | - | 🤗 | - | View |
Mistral 7B Instruct Mistral | 1 | 7B | 8.19k | $0.3 | 114 | 🤗 | +1 more | View |
Command-R (Aug '24) Cohere | 1 | 32B | 128k | $0.3 | 61 | 🤗 | View | |
Command-R (Mar '24) Cohere | 1 | 35B | 128k | $0.8 | - | 🤗 | View | |
Qwen3 VL 30B A3B Instruct Alibaba | - | 30B (3.0B active at inference time) | 256k | $0.3 | 96 | 🤗 | View | |
Qwen3 VL 30B A3B (Reasoning) Alibaba | - | 30B (3.0B active at inference time) | 256k | $0.8 | 104 | 🤗 | View |