Comparison of Open Source Models

Comparison and analysis of open source AI models across key metrics, including quality, inference speed, context window, parameter count, and licensing details. Models are considered open source (also commonly referred to as open weights) when their weights are available to download. This allows self-hosting on your own infrastructure and customizing the model, for example through fine-tuning. For more details on our methodology, see our FAQs.

Kimi K2.5 and GLM-4.7 are the highest-intelligence open source models, followed by DeepSeek V3.2 and Kimi K2 Thinking.

Highlights

Intelligence
Artificial Analysis Intelligence Index; Higher is better
Total Parameters
Trainable parameters in billions

Artificial Analysis Openness Index: Results

Openness Index assesses model openness on a 0 to 100 normalized scale (higher is more open)

Open Source Progress

Progress in Open Weights vs. Proprietary Intelligence

Artificial Analysis Intelligence Index v4.0 incorporates 10 evaluations: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt
Open Weights
Proprietary

Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Indicates whether the model weights are available. Models are labelled as 'Commercial Use Restricted' if the weights are available but commercial use is limited (typically requires obtaining a paid license).

Artificial Analysis Intelligence Index



Open Source Language Models Intelligence By Lab Over Time

Alibaba
DeepSeek
Google
Meta
Microsoft Azure
Mistral
NVIDIA


Open Source Models Intelligence By Size Over Time

Large Models (>150B)
Medium Models (40B-150B)
Small Models (4B-40B)
Tiny Models (≤4B)


  • Tiny: Less than or equal to 4B parameters. These are usually the smallest models in terms of resource demand.
  • Small: More than 4B and up to 40B parameters.
  • Medium: Between 40B and 150B parameters.
  • Large: Over 150B parameters.
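
The size classes above can be sketched as a small lookup on total parameter count. This is an illustrative sketch only: the function name `size_class` is hypothetical, and assigning a model with exactly 40B parameters to "Small" is an assumption, since the chart legends list 40B as an endpoint of both the Small and Medium ranges.

```python
def size_class(total_params_b: float) -> str:
    """Map a total parameter count (in billions) to a size class.

    Thresholds follow the list above. Treating exactly 40B as "Small"
    is an assumption; the source does not say which class owns that boundary.
    """
    if total_params_b <= 0:
        raise ValueError("parameter count must be positive")
    if total_params_b <= 4:
        return "Tiny"      # less than or equal to 4B
    if total_params_b <= 40:
        return "Small"     # more than 4B, up to 40B (boundary assumed)
    if total_params_b <= 150:
        return "Medium"    # between 40B and 150B
    return "Large"         # over 150B


if __name__ == "__main__":
    # Parameter counts taken from the model table on this page.
    for name, params in [("GLM-4.7", 357), ("GLM-4.5-Air", 106),
                         ("Qwen3 4B 2507", 4.02), ("Ministral 3 3B", 3)]:
        print(f"{name}: {size_class(params)}")
```

Note that a model listed at 4.02B falls just above the Tiny threshold under this reading, so rounding the parameter count before classifying would change its class.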

Intelligence Evaluations

Intelligence evaluations measured independently by Artificial Analysis; Higher is better
Results claimed by AI Lab (not yet independently verified)
GDPval-AA (Agentic Real-World Work Tasks, (ELO-500)/2000)
Terminal-Bench Hard (Agentic Coding & Terminal Use)
𝜏²-Bench Telecom (Agentic Tool Use)
AA-LCR (Long Context Reasoning)
AA-Omniscience Accuracy (Knowledge)
AA-Omniscience Non-Hallucination Rate (1 - Hallucination Rate)
Humanity's Last Exam (Reasoning & Knowledge)
GPQA Diamond (Scientific Reasoning)
SciCode (Coding)
IFBench (Instruction Following)
CritPt (Physics Reasoning)
MMMU Pro (Visual Reasoning)
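
Two of the labels above embed a conversion: GDPval-AA is reported as (ELO-500)/2000, and the AA-Omniscience Non-Hallucination Rate is 1 minus the hallucination rate. A minimal sketch of both conversions follows; the function names and the example inputs are hypothetical and are not results from this page.

```python
def normalize_gdpval_elo(elo: float) -> float:
    """Rescale a GDPval-AA ELO rating using (ELO - 500) / 2000, as in the chart label."""
    return (elo - 500) / 2000


def non_hallucination_rate(hallucination_rate: float) -> float:
    """Convert a hallucination rate in [0, 1] to a non-hallucination rate."""
    if not 0.0 <= hallucination_rate <= 1.0:
        raise ValueError("hallucination_rate must be between 0 and 1")
    return 1 - hallucination_rate


if __name__ == "__main__":
    # Illustrative inputs only; these are not measured values.
    print(normalize_gdpval_elo(1500))    # an ELO of 1500 maps to 0.5
    print(non_hallucination_rate(0.25))  # a 25% hallucination rate maps to 0.75
```

Under this rescaling an ELO of 500 maps to 0 and an ELO of 2500 maps to 1, which puts the rating on roughly the same 0 to 1 scale as the accuracy-style evaluations.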

While model intelligence generally translates across use cases, specific evaluations may be more relevant for certain use cases.


Intelligence Index By Model Size

Large Models (>150B)
Medium Models (40B-150B)
Small Models (4B-40B)



Model Size: Total and Active Parameters

Comparison between total model parameters and parameters active during inference
Active Parameters
Passive Parameters

Total parameters: The total number of trainable weights and biases in the model, expressed in billions. These parameters are learned during training and determine the model's ability to process and generate responses.

Active parameters: The number of parameters actually executed during each inference forward pass, expressed in billions. For Mixture of Experts (MoE) models, a routing mechanism selects a subset of experts per token, so fewer parameters are active than the model has in total. Dense models use all parameters, so active equals total.
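
The relationship between total and active parameters can be illustrated with a short calculation. This is a sketch under stated assumptions: the helper name `active_fraction` is hypothetical, and a model with no separate active count is treated as dense, matching the statement that dense models use all parameters.

```python
from typing import Optional


def active_fraction(total_params_b: float,
                    active_params_b: Optional[float] = None) -> float:
    """Return the share of parameters executed per forward pass.

    Both counts are in billions. When no active count is given, the model
    is treated as dense, so active equals total and the fraction is 1.0.
    """
    if total_params_b <= 0:
        raise ValueError("total_params_b must be positive")
    if active_params_b is None:
        active_params_b = total_params_b  # dense model
    if not 0 < active_params_b <= total_params_b:
        raise ValueError("active_params_b must be in (0, total_params_b]")
    return active_params_b / total_params_b


if __name__ == "__main__":
    # Counts taken from the model table on this page.
    print(f"gpt-oss-120B: {active_fraction(117, 5.1):.1%}")         # 4.4%
    print(f"Llama 3.3 Instruct 70B: {active_fraction(70):.1%}")     # 100.0%
```

A low active fraction means per-token compute is much smaller than the total size suggests, although the full set of weights still has to be stored and loaded to serve the model.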

Intelligence vs. Active Parameters

Active Parameters at Inference Time; Artificial Analysis Intelligence Index
Most attractive quadrant
Alibaba
DeepSeek
Kimi
Korea Telecom
LG AI Research
MBZUAI Institute of Foundation Models
MiniMax
Mistral
NVIDIA
OpenAI
Xiaomi
Z AI


Intelligence vs. Total Parameters

Artificial Analysis Intelligence Index; Size in Parameters (Billions)
Most attractive quadrant
Alibaba
DeepSeek
Kimi
Korea Telecom
LG AI Research
MBZUAI Institute of Foundation Models
MiniMax
Mistral
NVIDIA
OpenAI
Xiaomi
Z AI


Context Window

Context Window: Token Limit; Higher is better

Larger context windows are relevant to retrieval-augmented generation (RAG) workflows, which typically involve retrieving and reasoning over large amounts of data.

Context window: The maximum number of combined input and output tokens. Output tokens commonly have a significantly lower limit, which varies by model.
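
Because the context window covers input and output tokens together, a request fits only if both counts sum to no more than the limit. The check below is a minimal sketch: the function name `fits_context`, the optional `output_limit` parameter, and the example token counts are all hypothetical, and real limits should be taken from each model's documentation.

```python
from typing import Optional


def fits_context(input_tokens: int,
                 max_output_tokens: int,
                 context_window: int,
                 output_limit: Optional[int] = None) -> bool:
    """Check whether a request fits within a model's context window.

    The context window is the combined input + output budget. A separate,
    usually lower, output limit can be supplied when the model has one.
    """
    if input_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if output_limit is not None and max_output_tokens > output_limit:
        return False  # requested output exceeds the model's output cap
    return input_tokens + max_output_tokens <= context_window


if __name__ == "__main__":
    # Illustrative numbers, assuming a 128,000-token window.
    print(fits_context(120_000, 8_000, 128_000))  # True: exactly at the limit
    print(fits_context(125_000, 8_000, 128_000))  # False: 133,000 > 128,000
```

In a RAG pipeline this kind of check would typically run after retrieval, so that retrieved passages can be trimmed before the request is sent.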


Model | Creator | Intelligence Index | Total parameters (active at inference time) | Context window | Price (USD/1M tokens) | Speed (tokens/s) | Weights | Provider benchmarks
Kimi K2.5 (Reasoning) | Kimi | 47 | 1.0T (32B active) | 256k | $1.2 | 112 | 🤗 | Novita, Kimi, Fireworks, +5 more
GLM-4.7 (Reasoning) | Z AI | 42 | 357B (32B active) | 200k | $0.9 | 140 | 🤗 | Parasail, Google, Cerebras, +8 more
DeepSeek V3.2 (Reasoning) | DeepSeek | 42 | 685B (37B active) | 128k | $0.3 | 31 | 🤗 | SiliconFlow, DeepSeek, Novita, +5 more
Kimi K2 Thinking | Kimi | 41 | 1.0T (32B active) | 256k | $1.1 | 99 | 🤗 | Kimi, GMI, Novita, +10 more
MiniMax-M2.1 | MiniMax | 40 | 230B (10B active) | 205k | $0.5 | 55 | 🤗 | DeepInfra, GMI, Novita, +4 more
MiMo-V2-Flash (Reasoning) | Xiaomi | 39 | 309B (15B active) | 256k | $0.1 | 151 | 🤗 | Xiaomi
Kimi K2.5 (Non-reasoning) | Kimi | 37 | 1.0T (32B active) | 256k | $1.2 | 104 | 🤗 | GMI, Novita, Fireworks, +2 more
MiniMax-M2 | MiniMax | 36 | 230B (10B active) | 205k | $0.5 | 92 | 🤗 | DeepInfra, Amazon Bedrock, Fireworks, +3 more
GLM-4.7 (Non-reasoning) | Z AI | 34 | 357B (32B active) | 200k | $0.9 | 146 | 🤗 | GMI, Together.ai, Parasail, +7 more
DeepSeek V3.2 Speciale | DeepSeek | 34 | 685B (37B active) | 128k | $0.4 | - | 🤗 | Parasail
DeepSeek V3.1 Terminus (Reasoning) | DeepSeek | 34 | 685B (37B active) | 128k | $0.8 | - | 🤗 | SambaNova, Novita, Eigen AI
gpt-oss-120B (high) | OpenAI | 33 | 117B (5.1B active) | 131k | $0.3 | 320 | 🤗 | ?, Google, Microsoft Azure, +20 more
DeepSeek V3.2 Exp (Reasoning) | DeepSeek | 33 | 685B (37B active) | 128k | $0.3 | 31 | 🤗 | DeepSeek, Novita
GLM-4.6 (Reasoning) | Z AI | 33 | 357B (32B active) | 200k | $1.0 | 72 | 🤗 | Together.ai, Baseten, DeepInfra, +2 more
K-EXAONE (Reasoning) | LG AI Research | 32 | 236B (23B active) | 256k | - | 148 | 🤗 | FriendliAI
DeepSeek V3.2 (Non-reasoning) | DeepSeek | 32 | 685B (37B active) | 128k | $0.3 | 29 | 🤗 | SambaNova, Novita, GMI, +7 more
Kimi K2 0905 | Kimi | 31 | 1.0T (32B active) | 256k | $1.2 | 49 | 🤗 | DeepInfra, Fireworks, Novita, +4 more
MiMo-V2-Flash (Non-reasoning) | Xiaomi | 31 | 309B (15B active) | 256k | $0.1 | 145 | 🤗 | Xiaomi
GLM-4.6 (Non-reasoning) | Z AI | 30 | 357B (32B active) | 200k | $1.0 | 34 | 🤗 | Together.ai, Novita
GLM-4.7-Flash (Reasoning) | Z AI | 30 | 31.2B (3B active) | 200k | $0.2 | 64 | 🤗 | Novita, GMI, DeepInfra
Qwen3 235B A22B 2507 (Reasoning) | Alibaba | 29 | 235B (22B active) | 256k | $2.6 | 45 | 🤗 | Hyperbolic, DeepInfra, Fireworks, +5 more
DeepSeek V3.1 Terminus (Non-reasoning) | DeepSeek | 28 | 685B (37B active) | 128k | $0.8 | - | 🤗 | SambaNova, Fireworks, Eigen AI, +2 more
Apriel-v1.5-15B-Thinker | ServiceNow | 28 | 15B | 128k | - | 140 | 🤗 | Together.ai
DeepSeek V3.2 Exp (Non-reasoning) | DeepSeek | 28 | 685B (37B active) | 128k | $0.3 | 32 | 🤗 | DeepSeek, Novita, DeepInfra
Qwen3-Coder-Next | Alibaba | 28 | 79.7B (3B active) | 256k | $0.5 | 103 | 🤗 | Novita, Together.ai, Parasail
DeepSeek V3.1 (Non-reasoning) | DeepSeek | 28 | 685B (37B active) | 128k | $0.8 | - | 🤗 | DeepInfra, Google, Together.ai, +6 more
DeepSeek V3.1 (Reasoning) | DeepSeek | 28 | 685B (37B active) | 128k | $0.9 | - | 🤗 | Amazon Bedrock, Novita, Google, +1 more
Apriel-v1.6-15B-Thinker | ServiceNow | 28 | 15B | 128k | - | 143 | 🤗 | Together.ai
Qwen3 VL 235B A22B (Reasoning) | Alibaba | 28 | 235B (22B active) | 262k | $2.6 | 54 | 🤗 | Alibaba Cloud, Fireworks, Novita
DeepSeek R1 0528 (May '25) | DeepSeek | 27 | 685B (37B active) | 128k | $2.4 | - | 🤗 | SambaNova, Together.ai, Nebius, +7 more
Qwen3 Next 80B A3B (Reasoning) | Alibaba | 26 | 80B (3B active) | 262k | $1.9 | 171 | 🤗 | Alibaba Cloud, Novita, Hyperbolic, +4 more
GLM-4.5 (Reasoning) | Z AI | 26 | 355B (32B active) | 128k | $1.0 | 47 | 🤗 | Novita, Nebius, DeepInfra
Kimi K2 | Kimi | 26 | 1.0T (32B active) | 128k | $1.1 | 42 | 🤗 | Kimi, Novita, Parasail, +4 more
Seed-OSS-36B-Instruct | ByteDance Seed | 25 | 36.2B | 512k | $0.3 | 32 | 🤗 | SiliconFlow
Qwen3 235B A22B 2507 Instruct | Alibaba | 25 | 235B (22B active) | 256k | $1.2 | 52 | 🤗 | DeepInfra, Alibaba Cloud, Hyperbolic, +9 more
Qwen3 Coder 480B A35B Instruct | Alibaba | 25 | 480B (35B active) | 262k | $3.0 | 46 | 🤗 | Alibaba Cloud, Nebius, Novita, +7 more
K2 Think V2 | MBZUAI Institute of Foundation Models | 25 | 70B | 262k | - | - | Not available | -
Qwen3 VL 32B (Reasoning) | Alibaba | 25 | 33.4B | 256k | $2.6 | 85 | 🤗 | Alibaba Cloud
gpt-oss-20B (high) | OpenAI | 24 | 21B (3.6B active) | 131k | $0.1 | 265 | 🤗 | Novita, Together.ai, Nebius, +8 more
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) | NVIDIA | 24 | 31.6B (3.6B active) | 1.00M | $0.1 | 205 | 🤗 | DeepInfra, Nebius
MiniMax M1 80k | MiniMax | 24 | 456B (45.9B active) | 1.00M | $1.0 | - | 🤗 | Novita
gpt-oss-120B (low) | OpenAI | 24 | 117B (5.1B active) | 131k | $0.3 | 307 | 🤗 | Fireworks, Novita, Eigen AI, +17 more
HyperCLOVA X SEED Think (32B) | Naver | 24 | 32B | 128k | - | - | 🤗 | -
GLM-4.6V (Reasoning) | Z AI | 23 | 108B | 128k | $0.5 | 78 | 🤗 | Novita, SiliconFlow, Parasail, +1 more
GLM-4.5-Air | Z AI | 23 | 106B (12B active) | 128k | $0.4 | 103 | 🤗 | DeepInfra, Together.ai, SiliconFlow, +1 more
K-EXAONE (Non-reasoning) | LG AI Research | 23 | 236B (23B active) | 256k | - | 91 | 🤗 | FriendliAI
Mi:dm K 2.5 Pro | Korea Telecom | 23 | 32B | 128k | - | - | Not available | -
Mistral Large 3 | Mistral | 23 | 675B (41B active) | 256k | $0.8 | 55 | 🤗 | Microsoft Azure, Amazon Bedrock, Mistral
Magistral Small 1.2 | Mistral | 23 | 24B | 128k | $0.8 | 167 | 🤗 | Amazon Bedrock, Mistral
Ring-1T | InclusionAI | 23 | 1.0T (50B active) | 128k | $1.0 | 52 | 🤗 | ZenMux
Qwen3 30B A3B 2507 (Reasoning) | Alibaba | 22 | 30.5B (3.3B active) | 262k | $0.8 | 163 | 🤗 | Nebius, Clarifai, Alibaba Cloud
INTELLECT-3 | Prime Intellect | 22 | 107B | 131k | $0.4 | 86 | 🤗 | Nebius
Devstral 2 | Mistral | 22 | 125B | 256k | - | 65 | 🤗 | Mistral
DeepSeek V3 0324 | DeepSeek | 22 | 671B (37B active) | 128k | $1.3 | - | 🤗 | Together.ai, Nebius, Replicate, +7 more
Hermes 4 - Llama-3.1 405B (Reasoning) | Nous Research | 22 | 406B | 128k | $1.5 | 35 | 🤗 | Nebius
GLM-4.7-Flash (Non-reasoning) | Z AI | 21 | 31.2B (3B active) | 200k | $0.2 | 108 | 🤗 | GMI
MiniMax M1 40k | MiniMax | 21 | 456B (45.9B active) | 1.00M | - | - | 🤗 | -
gpt-oss-20B (low) | OpenAI | 21 | 21B (3.6B active) | 131k | $0.1 | 286 | 🤗 | Nebius, Cloudflare, Lightning AI, +8 more
K2-V2 (high) | MBZUAI Institute of Foundation Models | 21 | 70B | 512k | - | - | 🤗 | -
Ring-flash-2.0 | InclusionAI | 21 | 103B (6.1B active) | 128k | $0.2 | 89 | 🤗 | SiliconFlow
Qwen3 VL 235B A22B Instruct | Alibaba | 21 | 235B (22B active) | 262k | $1.2 | 47 | 🤗 | GMI, Fireworks, Eigen AI, +4 more
Hermes 4 - Llama-3.1 70B (Reasoning) | Nous Research | 20 | 70.6B | 128k | $0.2 | 81 | 🤗 | Nebius
Qwen3 Next 80B A3B Instruct | Alibaba | 20 | 80B (3B active) | 262k | $0.9 | 157 | 🤗 | Novita, Alibaba Cloud, Google, +4 more
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) | NVIDIA | 20 | 253B | 128k | $0.9 | 37 | 🤗 | Nebius
Qwen3 Coder 30B A3B Instruct | Alibaba | 20 | 30.5B (3.3B active) | 262k | $0.9 | 22 | 🤗 | Alibaba Cloud, DeepInfra, Scaleway, +2 more
Qwen3 235B A22B (Reasoning) | Alibaba | 20 | 235B (22B active) | 32.8k | $2.6 | 60 | 🤗 | Fireworks, Together.ai, Alibaba Cloud
QwQ 32B | Alibaba | 20 | 32.8B | 131k | $0.5 | 27 | 🤗 | Hyperbolic, Cloudflare
Qwen3 VL 30B A3B (Reasoning) | Alibaba | 20 | 30B (3B active) | 256k | $0.8 | 116 | 🤗 | Alibaba Cloud, Fireworks, Novita
Devstral Small 2 | Mistral | 19 | 24B | 256k | - | 206 | 🤗 | Mistral
GLM-4.5V (Reasoning) | Z AI | 19 | 108B (12B active) | 64.0k | $0.9 | 43 | 🤗 | Novita
Ling-1T | InclusionAI | 19 | 1.0T (50B active) | 128k | - | - | 🤗 | -
Olmo 3 32B Think | Allen Institute for AI | 19 | 32.2B | 65.5k | - | - | 🤗 | -
DeepSeek R1 (Jan '25) | DeepSeek | 19 | 685B (37B active) | 128k | $2.4 | - | 🤗 | Amazon Bedrock, Novita, Hyperbolic, +6 more
K2-V2 (medium) | MBZUAI Institute of Foundation Models | 19 | 70B | 512k | - | - | 🤗 | -
Llama Nemotron Super 49B v1.5 (Reasoning) | NVIDIA | 19 | 49B | 128k | $0.2 | 75 | 🤗 | DeepInfra
Qwen3 4B 2507 (Reasoning) | Alibaba | 19 | 4.02B | 262k | - | - | 🤗 | -
Llama 3.3 Nemotron Super 49B v1 (Reasoning) | NVIDIA | 18 | 49B | 128k | - | - | 🤗 | -
Llama 4 Maverick | Meta | 18 | 402B (17B active) | 1.00M | $0.5 | 124 | 🤗 | Groq, Microsoft Azure, Snowflake, +9 more
Devstral Small (May '25) | Mistral | 18 | 23.6B | 256k | $0.1 | - | 🤗 | Mistral, DeepInfra
ERNIE 4.5 300B A47B | Baidu | 17 | 300B (47B active) | 131k | $0.5 | 27 | 🤗 | Novita, SiliconFlow
Qwen3 VL 32B Instruct | Alibaba | 17 | 33.4B | 256k | $1.2 | 64 | 🤗 | Alibaba Cloud, Together.ai
DeepSeek R1 Distill Qwen 32B | DeepSeek | 17 | 32B | 128k | $0.3 | 38 | 🤗 | DeepInfra
Hermes 4 - Llama-3.1 405B (Non-reasoning) | Nous Research | 17 | 406B | 128k | $1.5 | 32 | 🤗 | Nebius
GLM-4.6V (Non-reasoning) | Z AI | 17 | 108B | 128k | $0.5 | 55 | 🤗 | SiliconFlow, Parasail, Novita
Qwen3 235B A22B (Non-reasoning) | Alibaba | 17 | 235B (22B active) | 32.8k | $1.2 | 45 | 🤗 | Novita, Alibaba Cloud, Together.ai, +2 more
Olmo 3 7B Think | Allen Institute for AI | 17 | 7B | 65.5k | $0.1 | 153 | 🤗 | Parasail
Magistral Small 1 | Mistral | 17 | 23.6B | 40.0k | $0.8 | - | 🤗 | Mistral
EXAONE 4.0 32B (Reasoning) | LG AI Research | 17 | 32B | 131k | $0.7 | 98 | 🤗 | FriendliAI
Qwen3 VL 8B (Reasoning) | Alibaba | 17 | 8.77B | 256k | $0.7 | 123 | 🤗 | Alibaba Cloud
Qwen3 32B (Reasoning) | Alibaba | 17 | 32.8B | 32.8k | $2.6 | 89 | 🤗 | Nebius, Groq, Nebius, +5 more
DeepSeek R1 0528 Qwen3 8B | DeepSeek | 16 | 8.19B | 32.8k | - | - | 🤗 | -
DeepSeek V3 (Dec '24) | DeepSeek | 16 | 671B (37B active) | 128k | $0.6 | - | 🤗 | Together.ai, Novita, Novita, +2 more
Qwen3 14B (Reasoning) | Alibaba | 16 | 14.8B | 32.8k | $1.3 | 58 | 🤗 | DeepInfra, Alibaba Cloud
Qwen3 VL 30B A3B Instruct | Alibaba | 16 | 30B (3B active) | 256k | $0.3 | 110 | 🤗 | Fireworks, Novita, DeepInfra, +1 more
Ministral 3 14B | Mistral | 16 | 14B | 256k | $0.2 | 137 | 🤗 | Amazon Bedrock, Mistral, Together.ai
DeepSeek R1 Distill Llama 70B | DeepSeek | 16 | 70B | 128k | $0.9 | 41 | 🤗 | DeepInfra, SambaNova, Scaleway
DeepSeek R1 Distill Qwen 14B | DeepSeek | 16 | 14B | 128k | - | - | 🤗 | -
Falcon-H1R-7B | TII UAE | 16 | 7B | 256k | - | - | Not available | -
Qwen3 Omni 30B A3B (Reasoning) | Alibaba | 16 | 35.3B (3B active) | 65.5k | $0.4 | 97 | 🤗 | Alibaba Cloud
Qwen2.5 Instruct 72B | Alibaba | 16 | 72B | 131k | - | 47 | 🤗 | Alibaba Cloud, Together.ai, DeepInfra, +1 more
Ling-flash-2.0 | InclusionAI | 15 | 103B (6.1B active) | 128k | $0.2 | 58 | 🤗 | SiliconFlow
Step3 VL 10B | StepFun | 15 | 10.2B | 65.5k | - | - | 🤗 | -
Qwen3 30B A3B (Reasoning) | Alibaba | 15 | 30.5B (3.3B active) | 32.8k | $0.8 | 70 | 🤗 | Alibaba Cloud, DeepInfra, Fireworks, +1 more
Devstral Small (Jul '25) | Mistral | 15 | 24B | 256k | $0.1 | 228 | 🤗 | Mistral, DeepInfra
QwQ 32B-Preview | Alibaba | 15 | 32.8B | 32.8k | $0.1 | 39 | 🤗 | DeepInfra
Ling-mini-2.0 | InclusionAI | 15 | 16.3B (1.4B active) | 131k | $0.1 | 149 | 🤗 | SiliconFlow
Mistral Large 2 (Nov '24) | Mistral | 15 | 123B | 128k | $3.0 | 42 | 🤗 | Microsoft Azure, Mistral
Mistral Small 3.2 | Mistral | 15 | 24B | 128k | $0.1 | 126 | 🤗 | DeepInfra, Mistral
Qwen3 30B A3B 2507 Instruct | Alibaba | 15 | 30.5B (3.3B active) | 262k | $0.3 | 58 | 🤗 | Nebius, Alibaba Cloud, Clarifai
Qwen3 VL 4B (Reasoning) | Alibaba | 15 | 4.44B | 256k | - | - | 🤗 | -
NVIDIA Nemotron Nano 12B v2 VL (Reasoning) | NVIDIA | 15 | 13.2B | 128k | $0.3 | 128 | 🤗 | DeepInfra
NVIDIA Nemotron Nano 9B V2 (Reasoning) | NVIDIA | 15 | 9B | 131k | $0.1 | 108 | 🤗 | DeepInfra
Ministral 3 8B | Mistral | 15 | 8B | 256k | $0.1 | 86 | 🤗 | Mistral, Amazon Bedrock
Qwen3 32B (Non-reasoning) | Alibaba | 15 | 32.8B | 32.8k | $1.2 | 85 | 🤗 | SambaNova, Alibaba Cloud, Nebius, +6 more
Llama Nemotron Super 49B v1.5 (Non-reasoning) | NVIDIA | 15 | 49B | 128k | $0.2 | 71 | 🤗 | DeepInfra
K2-V2 (low) | MBZUAI Institute of Foundation Models | 14 | 70B | 512k | - | - | 🤗 | -
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) | NVIDIA | 14 | 4.51B | 128k | - | - | 🤗 | -
Kimi Linear 48B A3B Instruct | Kimi | 14 | 49.1B (3B active) | 1.00M | - | - | 🤗 | -
Reka Flash 3 | Reka AI | 14 | 21B | 128k | $0.3 | 49 | 🤗 | Reka AI
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) | NVIDIA | 14 | 49B | 128k | - | - | 🤗 | -
Qwen3 VL 8B Instruct | Alibaba | 14 | 8.77B | 256k | $0.3 | 118 | 🤗 | Together.ai, Alibaba Cloud
Olmo 3.1 32B Think | Allen Institute for AI | 14 | 32.2B | 65.5k | - | 102 | 🤗 | Parasail
Llama 3.3 Instruct 70B | Meta | 14 | 70B | 128k | $0.7 | 104 | 🤗 | Hyperbolic, CompactifAI, Databricks, +19 more
Qwen3 4B (Reasoning) | Alibaba | 14 | 4.02B | 32.0k | $0.4 | 78 | 🤗 | Alibaba Cloud
Llama 3.1 Instruct 405B | Meta | 14 | 405B | 128k | $4.2 | 25 | 🤗 | Amazon Bedrock, Replicate, Hyperbolic, +5 more
Llama 3.1 Tulu3 405B | Allen Institute for AI | 14 | 405B | 128k | - | - | 🤗 | -
Pixtral Large | Mistral | 14 | 124B | 128k | $3.0 | 52 | 🤗 | Mistral
Mistral Small 3.1 | Mistral | 14 | 24B | 128k | $0.1 | 124 | 🤗 | CompactifAI, Mistral, Google, +1 more
Grok 2 (Dec '24) | xAI | 14 | 270B | 131k | - | - | 🤗 | -
Hermes 4 - Llama-3.1 70B (Non-reasoning) | Nous Research | 14 | 70.6B | 128k | $0.2 | 75 | 🤗 | Nebius
Llama 4 Scout | Meta | 13 | 109B (17B active) | 10.0M | $0.3 | 124 | 🤗 | Amazon Bedrock, Google, CompactifAI, +7 more
Command A | Cohere | 13 | 111B | 256k | $4.4 | 58 | 🤗 | Microsoft Azure, Cohere
Llama 3.1 Nemotron Instruct 70B | NVIDIA | 13 | 70B | 128k | $1.2 | 39 | 🤗 | DeepInfra
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) | NVIDIA | 13 | 31.6B (3.6B active) | 1.00M | $0.1 | 175 | 🤗 | DeepInfra, Nebius
Qwen2.5 Instruct 32B | Alibaba | 13 | 32B | 128k | - | - | 🤗 | -
Qwen3 4B 2507 Instruct | Alibaba | 13 | 4.02B | 262k | - | - | 🤗 | -
Phi-4 | Microsoft Azure | 13 | 14B | 16.0k | $0.2 | 10 | 🤗 | Microsoft Azure, DeepInfra
Llama 3.1 Instruct 70B | Meta | 13 | 70B | 128k | $0.6 | 61 | 🤗 | Together.ai, Amazon Bedrock, DeepInfra, +4 more
NVIDIA Nemotron Nano 9B V2 (Non-reasoning) | NVIDIA | 13 | 9B | 131k | $0.1 | 116 | 🤗 | Together.ai, DeepInfra, Amazon Bedrock
Qwen3 8B (Reasoning) | Alibaba | 13 | 8.19B | 131k | $0.7 | 60 | 🤗 | Alibaba Cloud
Mistral Large 2 (Jul '24) | Mistral | 13 | 123B | 128k | $3.0 | - | 🤗 | Amazon Bedrock
Qwen2.5 Coder Instruct 32B | Alibaba | 13 | 32B | 131k | $0.1 | 32 | 🤗 | Hyperbolic, DeepInfra
Qwen3 14B (Non-reasoning) | Alibaba | 13 | 14.8B | 32.8k | $0.6 | 53 | 🤗 | DeepInfra, Alibaba Cloud
Mistral Small 3 | Mistral | 13 | 24B | 32.0k | $0.1 | 237 | 🤗 | DeepInfra, Mistral, Together.ai
GLM-4.5V (Non-reasoning) | Z AI | 13 | 108B (12B active) | 64.0k | $0.9 | 44 | 🤗 | Novita
DeepSeek-V2.5 (Dec '24) | DeepSeek | 13 | 236B (21B active) | 128k | - | - | 🤗 | -
Qwen3 4B (Non-reasoning) | Alibaba | 12 | 4.02B | 32.0k | $0.2 | 65 | 🤗 | Alibaba Cloud
Qwen3 30B A3B (Non-reasoning) | Alibaba | 12 | 30.5B (3.3B active) | 32.8k | $0.3 | 68 | 🤗 | Alibaba Cloud, DeepInfra
DeepSeek-V2.5 | DeepSeek | 12 | 236B (21B active) | 128k | - | - | 🤗 | -
DeepSeek R1 Distill Llama 8B | DeepSeek | 12 | 8B | 128k | - | - | 🤗 | -
Olmo 3.1 32B Instruct | Allen Institute for AI | 12 | 32.2B | 65.5k | $0.3 | 47 | 🤗 | DeepInfra
R1 1776 | Perplexity | 12 | 671B (37B active) | 128k | - | - | 🤗 | -
Llama 3.2 Instruct 90B (Vision) | Meta | 12 | 90B | 128k | $0.7 | 36 | 🤗 | DeepInfra, Microsoft Azure, Amazon Bedrock, +1 more
Solar Mini | Upstage | 12 | 10.7B | 4.10k | $0.1 | 79 | 🤗 | Upstage
Grok-1 | xAI | 12 | 314B (78B active) | 8.19k | - | - | 🤗 | -
Llama 3.1 Instruct 8B | Meta | 12 | 8B | 128k | $0.1 | 169 | 🤗 | Eigen AI, Nebius, SambaNova, +16 more
Qwen2 Instruct 72B | Alibaba | 12 | 72B | 131k | - | - | 🤗 | -
EXAONE 4.0 32B (Non-reasoning) | LG AI Research | 12 | 32B | 131k | $0.7 | 88 | 🤗 | FriendliAI
Ministral 3 3B | Mistral | 11 | 3B | 256k | $0.1 | 293 | 🤗 | Amazon Bedrock, Mistral
Phi-4 Mini Instruct | Microsoft Azure | 11 | 3.84B | 128k | - | 45 | 🤗 | Microsoft Azure
DeepHermes 3 - Mistral 24B Preview (Non-reasoning) | Nous Research | 11 | 24B | 32.0k | - | - | 🤗 | -
Llama 3.2 Instruct 11B (Vision) | Meta | 11 | 11B | 128k | $0.2 | 52 | 🤗 | Amazon Bedrock, Microsoft Azure, DeepInfra
Granite 4.0 H Small | IBM | 11 | 32B (9B active) | 128k | $0.1 | 304 | 🤗 | Replicate
Granite 3.3 8B (Non-reasoning) | IBM | 11 | 8.17B | 128k | $0.1 | 457 | 🤗 | Replicate
Jamba 1.5 Large | AI21 Labs | 11 | 398B (94B active) | 256k | $3.5 | - | 🤗 | Google, Amazon Bedrock
Qwen3 Omni 30B A3B Instruct | Alibaba | 11 | 35.3B (3B active) | 65.5k | $0.4 | 87 | 🤗 | Alibaba Cloud
Hermes 3 - Llama-3.1 70B | Nous Research | 11 | 70.6B | 128k | $0.3 | 45 | 🤗 | DeepInfra
DeepSeek-Coder-V2 | DeepSeek | 11 | 236B (21B active) | 128k | - | - | 🤗 | -
OLMo 2 32B | Allen Institute for AI | 11 | 32.2B | 4.10k | - | - | 🤗 | -
Jamba 1.6 Large | AI21 Labs | 11 | 398B (94B active) | 256k | $3.5 | 45 | 🤗 | AI21 Labs
Qwen3 8B (Non-reasoning) | Alibaba | 11 | 8.19B | 32.8k | $0.3 | 64 | 🤗 | Fireworks, Alibaba Cloud
Jamba Reasoning 3B | AI21 Labs | 10 | 3B | 262k | - | - | 🤗 | -
Llama 3 Instruct 70B | Meta | 10 | 70B | 8.19k | $0.9 | 36 | 🤗 | Replicate, Novita, Amazon Bedrock, +1 more
Gemma 3 27B Instruct | Google | 10 | 27.4B | 128k | - | 36 | 🤗 | Google, Novita, DeepInfra, +3 more
Mistral Small (Sep '24) | Mistral | 10 | 22B | 32.8k | $0.3 | 125 | 🤗 | Mistral
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning) | NVIDIA | 10 | 13.2B | 128k | $0.3 | 134 | 🤗 | Amazon Bedrock, Nebius, DeepInfra
Phi-3 Mini Instruct 3.8B | Microsoft Azure | 10 | 3.8B | 4.10k | $0.2 | - | 🤗 | Microsoft Azure
Gemma 3n E4B Instruct Preview (May '25) | Google | 10 | 8.39B (4B active) | 32.0k | - | - | 🤗 | -
Phi-4 Multimodal Instruct | Microsoft Azure | 10 | 5.6B | 128k | - | 17 | 🤗 | Microsoft Azure
Qwen2.5 Coder Instruct 7B | Alibaba | 10 | 7.62B | 131k | - | - | 🤗 | -
Mixtral 8x22B Instruct | Mistral | 10 | 141B (39B active) | 65.4k | - | - | 🤗 | -
Llama 2 Chat 7B | Meta | 10 | 7B | 4.10k | $0.1 | 112 | 🤗 | Replicate
Gemma 3n E2B Instruct | Google | 10 | 5.98B (2B active) | 32.0k | - | 35 | 🤗 | Google
Llama 3.2 Instruct 3B | Meta | 10 | 3B | 128k | $0.1 | 72 | 🤗 | Together.ai, DeepInfra, Hyperbolic, +1 more
Qwen1.5 Chat 110B | Alibaba | 10 | 110B | 32.0k | - | - | 🤗 | -
Qwen3 VL 4B Instruct | Alibaba | 10 | 4.44B | 256k | - | - | 🤗 | -
OLMo 2 7B | Allen Institute for AI | 9 | 7.3B | 4.10k | - | - | 🤗 | -
Jamba 1.7 Large | AI21 Labs | 9 | 398B (94B active) | 256k | $3.5 | 40 | 🤗 | AI21 Labs
Molmo 7B-D | Allen Institute for AI | 9 | 8.02B | 4.10k | - | - | 🤗 | -
Llama 3.2 Instruct 1B | Meta | 9 | 1B | 128k | $0.1 | 73 | 🤗 | DeepInfra, Novita, Amazon Bedrock
DeepSeek R1 Distill Qwen 1.5B | DeepSeek | 9 | 1.5B | 128k | - | - | 🤗 | -
DeepSeek-V2-Chat | DeepSeek | 9 | 236B (21B active) | 128k | - | - | 🤗 | -
Arctic Instruct | Snowflake | 9 | 480B (17B active) | 4.00k | - | - | 🤗 | -
Qwen Chat 72B | Alibaba | 9 | 72B | 33.8k | - | - | 🤗 | -
Gemma 3 12B Instruct | Google | 9 | 12.2B | 128k | - | 36 | 🤗 | Cloudflare, Google, DeepInfra, +2 more
Llama 3 Instruct 8B | Meta | 9 | 8B | 8.19k | $0.1 | 67 | 🤗 | DeepInfra, Novita, Amazon Bedrock, +1 more
Gemma 3 1B Instruct | Google | 9 | 1B | 32.0k | - | 32 | 🤗 | Google
DeepSeek Coder V2 Lite Instruct | DeepSeek | 8 | 16B (2.4B active) | 128k | - | - | 🤗 | -
Gemma 3 270M | Google | 8 | 0.268B | 32.0k | - | - | 🤗 | -
Llama 2 Chat 70B | Meta | 8 | 70B | 4.10k | - | - | 🤗 | -
DeepSeek LLM 67B Chat (V1) | DeepSeek | 8 | 67B | 4.10k | - | - | 🤗 | -
Llama 2 Chat 13B | Meta | 8 | 13B | 4.10k | - | - | 🤗 | -
Command-R+ (Apr '24) | Cohere | 8 | 104B | 128k | $6.0 | - | 🤗 | Amazon Bedrock
OpenChat 3.5 (1210) | OpenChat | 8 | 7B | 8.19k | - | - | 🤗 | -
DBRX Instruct | Databricks | 8 | 132B (36B active) | 32.8k | - | - | 🤗 | -
Exaone 4.0 1.2B (Reasoning) | LG AI Research | 8 | 1.28B | 64.0k | - | - | 🤗 | -
Olmo 3 7B Instruct | Allen Institute for AI | 8 | 7B | 65.5k | $0.1 | 37 | 🤗 | Parasail
LFM2.5-1.2B-Thinking | Liquid AI | 8 | 1.17B | 32.0k | - | - | 🤗 | -
Exaone 4.0 1.2B (Non-reasoning) | LG AI Research | 8 | 1.28B | 64.0k | - | - | 🤗 | -
Jamba 1.5 Mini | AI21 Labs | 8 | 52B (12B active) | 256k | $0.3 | - | 🤗 | Google, Amazon Bedrock
Granite 4.0 H 1B | IBM | 8 | 1.5B | 128k | - | - | 🤗 | -
LFM2.5-1.2B-Instruct | Liquid AI | 8 | 1.17B | 32.0k | - | - | 🤗 | ?
Qwen3 1.7B (Reasoning) | Alibaba | 8 | 2.03B | 32.0k | $0.4 | 126 | 🤗 | Alibaba Cloud
Jamba 1.6 Mini | AI21 Labs | 8 | 52B (12B active) | 256k | $0.3 | 114 | 🤗 | AI21 Labs
LFM2 2.6B | Liquid AI | 8 | 2.57B | 32.8k | - | - | 🤗 | ?
Mixtral 8x7B Instruct | Mistral | 8 | 46.7B (12.9B active) | 32.8k | $0.5 | - | 🤗 | Together.ai, DeepInfra, Amazon Bedrock
Granite 4.0 Micro | IBM | 8 | 3B | 128k | - | - | 🤗 | -
DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning) | Nous Research | 8 | 8B | 128k | - | - | 🤗 | -
Llama 65B | Meta | 7 | 65B | 2.05k | - | - | Not available | -
Qwen Chat 14B | Alibaba | 7 | 14B | 8.19k | - | - | Not available | -
Mistral 7B Instruct | Mistral | 7 | 7B | 8.19k | $0.3 | 71 | 🤗 | Mistral, Together.ai, Amazon Bedrock
Command-R (Mar '24) | Cohere | 7 | 35B | 128k | $0.8 | - | 🤗 | Amazon Bedrock
Jamba 1.7 Mini | AI21 Labs | 7 | 52B (12B active) | 258k | $0.3 | - | 🤗 | AI21 Labs
Granite 4.0 1B | IBM | 7 | 1.6B | 128k | - | - | 🤗 | -
LFM2 8B A1B | Liquid AI | 7 | 8.34B (1.5B active) | 32.8k | - | - | 🤗 | ?
Qwen3 1.7B (Non-reasoning) | Alibaba | 7 | 2.03B | 32.0k | $0.2 | 119 | 🤗 | Alibaba Cloud
Granite 4.0 350M | IBM | 7 | 0.35B | 32.8k | - | - | 🤗 | -
Qwen3 0.6B (Reasoning) | Alibaba | 6 | 0.752B | 32.0k | $0.4 | 204 | 🤗 | Alibaba Cloud
LFM2 1.2B | Liquid AI | 6 | 1.17B | 32.8k | - | - | 🤗 | ?
Gemma 3 4B Instruct | Google | 6 | 4.3B | 128k | - | 33 | 🤗 | Google, Amazon Bedrock, DeepInfra
Gemma 3n E4B Instruct | Google | 6 | 8.39B (4B active) | 32.0k | $0.0 | 42 | 🤗 | Together.ai
LFM2.5-VL-1.6B | Liquid AI | 6 | 1.6B | 32.0k | - | - | 🤗 | -
Qwen3 0.6B (Non-reasoning) | Alibaba | 6 | 0.752B | 32.0k | $0.2 | 192 | 🤗 | Alibaba Cloud
Granite 4.0 H 350M | IBM | 5 | 0.34B | 32.8k | - | - | 🤗 | -
DeepSeek-OCR | DeepSeek | - | 3.34B | 8.19k | $0.0 | 310 | 🤗 | DeepInfra, Novita, Google
Molmo2-8B | Allen Institute for AI | - | 8.66B | 36.9k | - | 69 | 🤗 | Parasail
Cogito v2.1 (Reasoning) | Deep Cogito | - | 671B (37B active) | 128k | $1.3 | 74 | 🤗 | Together.ai