Comparison of Open Source Models

Comparison and analysis of open source AI models across key metrics including quality, inference speed, context window, parameter count & licensing details. Models are considered open source (also commonly referred to as open weights) where their weights are available to download. This allows self-hosting on your own infrastructure and customizing the model, for example through fine-tuning. For more details on our methodology, see our FAQs.
Kimi K2.6 and MiMo-V2.5-Pro are the highest intelligence open source models, followed by DeepSeek V4 Pro (Max) & GLM-5.1.

Highlights

Artificial Analysis Openness Index · Higher is better
Artificial Analysis Intelligence Index · Higher is better
Trainable parameters in billions

Openness

Artificial Analysis Openness Index: Results

Openness Index assesses model openness on a 0 to 100 normalized scale (higher is more open)
Reasoning models are indicated by a lightbulb icon

Open Source Progress

Progress in Open Weights vs. Proprietary Intelligence


Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Indicates whether the model weights are available. Models are labelled as 'Commercial Use Restricted' if the weights are available but commercial use is limited (typically requires obtaining a paid license).

Open Source Language Models Intelligence By Lab Over Time


Open Source Models Intelligence By Size Over Time


  • Tiny: 4B parameters or fewer. These models have the lowest resource demands.
  • Small: More than 4B and fewer than 40B parameters.
  • Medium: 40B to 150B parameters.
  • Large: More than 150B parameters.
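
The size classes above can be expressed as a small lookup. This is a minimal sketch rather than the site's own logic: the function name `size_class` is hypothetical, and treating exactly 40B and exactly 150B as Medium is an assumption drawn from the "40B-150B" range. The sample parameter counts are taken from the table further down this page.

```python
def size_class(total_params_b: float) -> str:
    """Map a total parameter count (in billions) to a size class."""
    if total_params_b <= 4:
        return "Tiny"
    if total_params_b < 40:
        return "Small"
    if total_params_b <= 150:  # assumption: both range ends count as Medium
        return "Medium"
    return "Large"


# Total parameter counts (billions) as listed in the table on this page.
examples = {
    "Qwen3.5 2B": 2.27,
    "Qwen3.6 27B": 27.8,
    "gpt-oss-120b": 117,
    "GLM-5.1": 744,
}
for name, params in examples.items():
    print(f"{name}: {size_class(params)}")
```

The classification uses total parameters, not active parameters, so a Mixture of Experts model with a small active count can still be classed as Large.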

Intelligence

Artificial Analysis Intelligence Index

Estimate (independent evaluation forthcoming)

Intelligence Evaluations

Intelligence evaluations measured independently by Artificial Analysis · Higher is better
  • GDPval-AA: Agentic real-world work tasks, (Elo-500)/2000
  • Terminal-Bench Hard: Agentic coding & terminal use
  • 𝜏²-Bench Telecom: Agentic tool use
  • AA-LCR: Long context reasoning
  • Humanity's Last Exam: Reasoning & knowledge
  • GPQA Diamond: Scientific reasoning
  • SciCode: Coding
  • IFBench: Instruction following
  • CritPt: Physics reasoning
  • APEX-Agents-AA: Long-horizon agentic tasks
  • ITBench-AA: Kubernetes incident root-cause analysis
  • MMMU-Pro: Visual reasoning
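
The GDPval-AA entry is reported as (Elo-500)/2000, which rescales an Elo rating into a smaller range. The sketch below only applies that arithmetic. The function name `gdpval_score` and the sample Elo values are hypothetical, and the page does not say whether the result is clamped or scaled further.

```python
def gdpval_score(elo: float) -> float:
    """Rescale an Elo rating using (Elo - 500) / 2000."""
    return (elo - 500) / 2000


# Hypothetical Elo ratings, chosen only to show the mapping.
for elo in (500, 1500, 2500):
    print(elo, gdpval_score(elo))
```

Under this formula an Elo of 500 maps to 0 and an Elo of 2500 maps to 1, so ratings in that band land on a 0 to 1 scale.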


While model intelligence generally translates across use cases, specific evaluations may be more relevant for certain use cases.


Size

Intelligence Index By Model Size

Estimate (independent evaluation forthcoming)
Large Models (>150B)
Medium Models (40B-150B)
Small Models (4B-40B)

Model Size: Total and Active Parameters

Comparison between total model parameters and parameters active during inference

Total parameters: the total number of trainable weights and biases in the model, expressed in billions. These parameters are learned during training and determine the model's ability to process and generate responses.

Active parameters: the number of parameters actually executed during each inference forward pass, expressed in billions. For Mixture of Experts (MoE) models, a routing mechanism selects a subset of experts per token, resulting in fewer active than total parameters. Dense models use all parameters, so active equals total.
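
To make the relationship between total and active parameters concrete, the sketch below computes the share of parameters executed per forward pass. The helper name `active_fraction` is hypothetical. The figures come from the table on this page, and a model listed without an active count is treated as dense here, which is an assumption.

```python
from typing import Optional


def active_fraction(total_b: float, active_b: Optional[float] = None) -> float:
    """Share of parameters executed per forward pass (1.0 for dense models)."""
    if active_b is None:
        # Dense model: every parameter is used, so active equals total.
        active_b = total_b
    if not 0 < active_b <= total_b:
        raise ValueError("active parameters must be positive and at most total")
    return active_b / total_b


# (name, total parameters in billions, active parameters in billions or None)
models = [
    ("GLM-5.1", 744, 40),
    ("MiniMax-M2.7", 230, 10),
    ("Qwen3.6 27B", 27.8, None),  # no active count listed; assumed dense
]
for name, total, active in models:
    print(f"{name}: {active_fraction(total, active):.1%} of parameters active")
```

A low fraction means per-token compute scales with the smaller active count, although all of the weights still have to be stored.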

Intelligence vs. Active Parameters

Active parameters at inference time · Artificial Analysis Intelligence Index
Legend: most attractive quadrant highlighted; labs shown: Alibaba, DeepSeek, Google, Kimi, MBZUAI Institute of Foundation Models, MiniMax, Mistral, NVIDIA, OpenAI, Xiaomi, Z AI.


Intelligence vs. Total Parameters

Artificial Analysis Intelligence Index · Size in parameters (billions)
Legend: most attractive quadrant highlighted; labs shown: Alibaba, DeepSeek, Google, Kimi, MBZUAI Institute of Foundation Models, MiniMax, Mistral, NVIDIA, OpenAI, Xiaomi, Z AI.


Context Window


Context window: token limit · Higher is better

Larger context windows matter for RAG (Retrieval-Augmented Generation) workflows, which typically involve retrieving and reasoning over large amounts of data.

Context window: maximum number of combined input & output tokens. Output tokens commonly have a significantly lower limit (varies by model).
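
Because the context window covers input and output together, the room left for a response shrinks as the prompt grows. The following is a minimal sketch of that budget check. The function name `max_output_tokens` and the output cap value are hypothetical, and reading "200k" as exactly 200,000 tokens is an assumption.

```python
from typing import Optional


def max_output_tokens(context_window: int, input_tokens: int,
                      output_cap: Optional[int] = None) -> int:
    """Tokens left for the response once the prompt is counted."""
    remaining = max(0, context_window - input_tokens)
    if output_cap is not None:
        # Many models also enforce a separate, lower limit on output tokens.
        remaining = min(remaining, output_cap)
    return remaining


# Assumption: "200k" is read as 200,000 tokens; the exact limit may differ.
print(max_output_tokens(200_000, 180_000))
print(max_output_tokens(200_000, 180_000, output_cap=16_384))  # hypothetical cap
print(max_output_tokens(200_000, 250_000))
```

A result of zero means the prompt alone already fills or exceeds the window, so the input would need to be shortened before any output can be generated.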

Further details

Weights
Provider Benchmarks
Kimi K2.6
Kimi logoKimi
54
1.0T
32B active at inference time
256k
$0.7
44
NovitaMakoraClarifai
+12
MiMo-V2.5-Pro
Xiaomi logoXiaomi
54
1.0T
42B active at inference time
1.00M
$0.2
45
NovitaDeepInfraXiaomiGMI
DeepSeek V4 Pro (Reasoning, Max Effort)
DeepSeek logoDeepSeek
52
1.6T
49B active at inference time
1.00M
$0.2
58
DeepInfraNovitaMicrosoft Azure
+8
GLM-5.1 (Reasoning)
Z AI logoZ AI
51
744B
40B active at inference time
200k
$0.9
77
DeepInfraFireworksSiliconFlow
+9
DeepSeek V4 Pro (Reasoning, High Effort)
DeepSeek logoDeepSeek
50
1.6T
49B active at inference time
1.00M
$0.2
58
DeepSeekBasetenMakora
+8
GLM-5 (Reasoning)
Z AI logoZ AI
50
744B
40B active at inference time
200k
$0.7
72
FriendliAIGoogleParasail
+9
MiniMax-M2.7
MiniMax logoMiniMax
50
230B
10B active at inference time
205k
$0.2
120
Together.aiNovitaGMI
+3
MiMo-V2.5
Xiaomi logoXiaomi
49
310B
15B active at inference time
1.00M
$0.1
76
NovitaParasailDeepInfra
+2
Nemotron 3 Ultra 550B A55B (Reasoning)
NVIDIA logoNVIDIA
48
550B
55B active at inference time
262k
$0.5
142
Not available
NebiusCoreWeaveDeepInfra
+4
Kimi K2.5 (Reasoning)
Kimi logoKimi
47
1.0T
32B active at inference time
256k
$0.6
48
DeepInfraNovitaSiliconFlow
+12
DeepSeek V4 Flash (Reasoning, Max Effort)
DeepSeek logoDeepSeek
47
284B
13B active at inference time
1.00M
$0.1
124
DeepInfraSiliconFlowNovita
+4
DeepSeek V4 Flash (Reasoning, High Effort)
DeepSeek logoDeepSeek
46
284B
13B active at inference time
1.00M
$0.1
-
SiliconFlowDeepSeekParasail
+4
Qwen3.6 27B (Reasoning)
Alibaba logoAlibaba
46
27.8B
262k
$0.9
63
MakoraSiliconFlowNovita
+2
Qwen3.5 397B A17B (Reasoning)
Alibaba logoAlibaba
45
397B
17B active at inference time
262k
$0.9
53
GMIDeepInfraDigitalOcean
+9
GLM-5.1 (Non-reasoning)
Z AI logoZ AI
44
744B
40B active at inference time
200k
$0.9
80
SiliconFlowWaferParasail
+5
Qwen3.6 35B A3B (Reasoning)
Alibaba logoAlibaba
43
36B
3B active at inference time
262k
$0.4
189
NovitaMakoraGMI
+6
Kimi K2.6 (Non-reasoning)
Kimi logoKimi
43
1.0T
32B active at inference time
256k
$0.7
37
ParasailMakoraCoreWeave
+9
Step 3.7 Flash
StepFun logoStepFun
43
198B
11B active at inference time
256k
$0.2
174
StepFun
GLM-4.7 (Reasoning)
Z AI logoZ AI
42
357B
32B active at inference time
200k
$0.7
84
NovitaClarifaiAmazon Bedrock
+7
Qwen3.5 27B (Reasoning)
Alibaba logoAlibaba
42
27.8B
262k
$0.5
82
GMICoreWeaveAlibaba Cloud
+3
MiniMax-M2.5
MiniMax logoMiniMax
42
230B
10B active at inference time
205k
$0.3
215
FireworksSiliconFlowGMI
+13
Hy3-preview (Reasoning)
Tencent logoTencent
42
295B
21B active at inference time
256k
$0.1
101
SiliconFlowGMI
DeepSeek V3.2 (Reasoning)
DeepSeek logoDeepSeek
42
685B
37B active at inference time
128k
$0.2
-
SambaNova
+12
Qwen3.5 122B A10B (Reasoning)
Alibaba logoAlibaba
42
125B
10B active at inference time
262k
$0.7
140
DeepInfraSiliconFlowAlibaba Cloud
+2
MiMo-V2-Flash (Feb 2026)
Xiaomi logoXiaomi
41
309B
15B active at inference time
256k
$0.1
125
Xiaomi
Kimi K2 Thinking
Kimi logoKimi
41
1.0T
32B active at inference time
256k
$0.8
123
Microsoft AzureNovitaKimi
+3
GLM-5 (Non-reasoning)
Z AI logoZ AI
41
744B
40B active at inference time
200k
$0.7
67
FireworksSiliconFlowDeepInfra
+3
Qwen3.5 397B A17B (Non-reasoning)
Alibaba logoAlibaba
40
397B
17B active at inference time
262k
$0.9
54
Alibaba CloudDeepInfraEigen AI
+6
MiniMax-M2.1
MiniMax logoMiniMax
39
230B
10B active at inference time
205k
$0.4
212
MiniMaxFriendliAINovita
DeepSeek V4 Pro (Non-reasoning)
DeepSeek logoDeepSeek
39
1.6T
49B active at inference time
1.00M
$0.2
61
Microsoft AzureLightning AINebius
+2
MiMo-V2-Flash (Reasoning)
Xiaomi logoXiaomi
39
309B
15B active at inference time
256k
$0.1
131
Xiaomi
Mistral Medium 3.5
Mistral logoMistral
39
128B
256k
$2.1
122
Mistral
Gemma 4 31B (Reasoning)
Google logoGoogle
39
30.7B
256k
-
36
NovitaGMIGoogle
+8
Ring-2.6-1T
InclusionAI logoInclusionAI
38
1.0T
63B active at inference time
262k
$0.5
127
InclusionAI
Step 3.5 Flash
StepFun logoStepFun
38
196B
11B active at inference time
256k
$0.1
179
SiliconFlowStepFun
Kimi K2.5 (Non-reasoning)
Kimi logoKimi
37
1.0T
32B active at inference time
256k
$0.8
42
GMIBasetenNovita
+6
Qwen3.5 27B (Non-reasoning)
Alibaba logoAlibaba
37
27.8B
262k
$0.5
91
Alibaba CloudDeepInfraCoreWeave
Command A+
Cohere logoCohere
37
218B
25B active at inference time
192k
-
196
Cohere
Qwen3.6 27B (Non-reasoning)
Alibaba logoAlibaba
37
27.8B
262k
$0.9
64
NovitaMakoraDeepInfraAlibaba Cloud
Qwen3.5 35B A3B (Reasoning)
Alibaba logoAlibaba
37
36B
3B active at inference time
262k
$0.4
153
SiliconFlowAlibaba CloudGMI
+2
DeepSeek V4 Flash (Non-reasoning)
DeepSeek logoDeepSeek
36
284B
13B active at inference time
1.00M
$0.1
117
DeepSeekGMIMakoraCoreWeave
MiniMax-M2
MiniMax logoMiniMax
36
230B
10B active at inference time
205k
$0.4
127
GoogleNovitaMiniMaxAmazon Bedrock
NVIDIA Nemotron 3 Super 120B A12B (Reasoning)
NVIDIA logoNVIDIA
36
120.6B
12.7B active at inference time
1.00M
$0.3
159
DeepInfraLightning AINebius
+2
Qwen3.5 122B A10B (Non-reasoning)
Alibaba logoAlibaba
36
125B
10B active at inference time
262k
$0.7
163
DeepInfraAlibaba Cloud
MiMo-V2.5-Pro (Non-reasoning)
Xiaomi logoXiaomi
36
1.0T
41.7B active at inference time
1.00M
$0.6
54
GMIDeepInfraXiaomiNovita
GLM-4.7 (Non-reasoning)
Z AI logoZ AI
34
357B
32B active at inference time
200k
$0.7
76
ParasailCerebrasNovita
+6
DeepSeek V3.1 Terminus (Reasoning)
DeepSeek logoDeepSeek
34
685B
37B active at inference time
128k
$1.7
-
NovitaSambaNova
Hy3-preview (Non-reasoning)
Tencent logoTencent
34
295B
21B active at inference time
256k
$0.1
94
SiliconFlowGMI
Ling-2.6-1T
InclusionAI logoInclusionAI
34
1.0T
63B active at inference time
262k
$0.5
-
InclusionAI
gpt-oss-120b (high)
OpenAI logoOpenAI
33
117B
5.1B active at inference time
131k
$0.2
341
ClarifaiCompactifAIAmazon Bedrock
+23
DeepSeek V3.2 Exp (Reasoning)
DeepSeek logoDeepSeek
33
685B
37B active at inference time
128k
$0.2
-
DeepSeekNovita
GLM-4.6 (Reasoning)
Z AI logoZ AI
33
357B
32B active at inference time
200k
$0.7
51
NovitaDeepInfraTogether.ai
Qwen3.5 9B (Reasoning)
Alibaba logoAlibaba
32
9.65B
262k
$0.1
86
Together.aiSiliconFlow
Gemma 4 31B (Non-reasoning)
Google logoGoogle
32
30.7B
256k
$0.2
45
Together.aiFriendliAINovita
+4
K-EXAONE (Reasoning)
LG AI Research logoLG AI Research
32
236B
23B active at inference time
256k
-
-
-
DeepSeek V3.2 (Non-reasoning)
DeepSeek logoDeepSeek
32
685B
37B active at inference time
128k
$0.5
-
FriendliAIAmazon BedrockSiliconFlow
+12
Trinity Large Thinking
Arcee AI logoArcee AI
32
399B
13B active at inference time
512k
$0.2
169
Arcee AIParasail
Qwen3.6 35B A3B (Non-reasoning)
Alibaba logoAlibaba
32
36B
3B active at inference time
262k
$0.6
220
ScalewayGMIMakora
+5
Gemma 4 26B A4B (Reasoning)
Google logoGoogle
31
25.2B
3.8B active at inference time
256k
$0.1
-
CloudflareNovitaDeepInfra
+4
Kimi K2 0905
Kimi logoKimi
31
1.0T
32B active at inference time
256k
$0.8
25
Novita
Qwen3.5 35B A3B (Non-reasoning)
Alibaba logoAlibaba
31
36B
3B active at inference time
262k
$0.4
178
Alibaba CloudDeepInfra
MiMo-V2-Flash (Non-reasoning)
Xiaomi logoXiaomi
30
309B
15B active at inference time
256k
$0.1
122
Xiaomi
GLM-4.6 (Non-reasoning)
Z AI logoZ AI
30
357B
32B active at inference time
200k
$0.8
55
NovitaTogether.ai
EXAONE 4.5 33B
LG AI Research logoLG AI Research
30
34.4B
262k
-
-
-
GLM-4.7-Flash (Reasoning)
Z AI logoZ AI
30
31.2B
3B active at inference time
200k
$0.1
79
DeepInfraAmazon BedrockNovita
Qwen3 235B A22B 2507 (Reasoning)
Alibaba logoAlibaba
30
235B
22B active at inference time
256k
$0.6
52
NovitaNebiusCoreWeave
+3
DeepSeek V3.2 Speciale
DeepSeek logoDeepSeek
29
685B
37B active at inference time
128k
-
-
-
DeepSeek V3.1 Terminus (Non-reasoning)
DeepSeek logoDeepSeek
29
685B
37B active at inference time
128k
$0.3
-
NovitaSambaNovaDeepInfra
DeepSeek V3.2 Exp (Non-reasoning)
DeepSeek logoDeepSeek
28
685B
37B active at inference time
128k
$0.2
-
NovitaDeepSeek
Nemotron Cascade 2 30B A3B
NVIDIA logoNVIDIA
28
31.6B
3B active at inference time
1.00M
-
-
-
Apriel-v1.5-15B-Thinker
ServiceNow logoServiceNow
28
15B
128k
-
-
Together.ai
Qwen3 Coder Next
Alibaba logoAlibaba
28
79.7B
3B active at inference time
256k
$0.4
103
ParasailTogether.aiNovitaAmazon Bedrock
DeepSeek V3.1 (Non-reasoning)
DeepSeek logoDeepSeek
28
685B
37B active at inference time
128k
$0.7
-
SambaNovaAmazon BedrockBaseten
+7
Mistral Small 4 (Reasoning)
Mistral logoMistral
28
119B
6.5B active at inference time
256k
$0.2
177
Mistral
DeepSeek V3.1 (Reasoning)
DeepSeek logoDeepSeek
28
685B
37B active at inference time
128k
$0.7
-
SambaNovaGoogleAmazon BedrockNovita
Qwen3 VL 235B A22B (Reasoning)
Alibaba logoAlibaba
28
235B
22B active at inference time
262k
$1.4
36
Alibaba CloudNovita
Apriel-v1.6-15B-Thinker
ServiceNow logoServiceNow
28
15B
128k
-
-
Together.ai
Qwen3.5 9B (Non-reasoning)
Alibaba logoAlibaba
27
9.65B
262k
-
-
-
Gemma 4 26B A4B (Non-reasoning)
Google logoGoogle
27
25.2B
3.8B active at inference time
256k
$0.2
72
ParasailNovitaClarifai
+4
Qwen3.5 4B (Reasoning)
Alibaba logoAlibaba
27
4.66B
262k
$0.0
205
DeepInfra
DeepSeek R1 0528 (May '25)
DeepSeek logoDeepSeek
27
685B
37B active at inference time
128k
$1.6
-
DeepInfraMicrosoft AzureTogether.ai
+3
Qwen3 Next 80B A3B (Reasoning)
Alibaba logoAlibaba
27
80B
3B active at inference time
262k
$1.1
158
HyperbolicEigen AINebius
+5
GLM-4.5 (Reasoning)
Z AI logoZ AI
26
355B
32B active at inference time
128k
$0.8
51
Novita
Kimi K2
Kimi logoKimi
26
1.0T
32B active at inference time
128k
$0.6
24
KimiNovita
Ling 2.6 Flash
InclusionAI logoInclusionAI
26
107B
7.4B active at inference time
262k
$0.1
-
Novita
Seed-OSS-36B-Instruct
ByteDance Seed logoByteDance Seed
25
36.2B
512k
$0.2
36
SiliconFlow
Qwen3 235B A22B 2507 Instruct
Alibaba logoAlibaba
25
235B
22B active at inference time
256k
$0.3
50
DeepInfraAmazon BedrockNebius
+9
Qwen3 Coder 480B A35B Instruct
Alibaba logoAlibaba
25
480B
35B active at inference time
262k
$0.5
57
CoreWeaveEigen AIHyperbolic
+6
Qwen3 VL 32B (Reasoning)
Alibaba logoAlibaba
25
33.4B
256k
$1.5
88
Alibaba Cloud
gpt-oss-20B (high)
OpenAI logoOpenAI
24
21B
3.6B active at inference time
131k
$0.1
239
Together.aiLightning AIClarifai
+10
gpt-oss-120b (low)
OpenAI logoOpenAI
24
117B
5.1B active at inference time
131k
$0.2
343
GoogleFireworksNovita
+19
MiniMax M1 80k
MiniMax logoMiniMax
24
456B
45.9B active at inference time
1.00M
$0.7
-
Novita
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)
NVIDIA logoNVIDIA
24
31.6B
3.6B active at inference time
1.00M
$0.1
181
DeepInfraNebius
K2 Think V2
MBZUAI Institute of Foundation Models logoMBZUAI Institute of Foundation Models
24
70B
262k
-
-
-
LongCat Flash Lite
LongCat logoLongCat
24
68.5B
3B active at inference time
256k
-
-
LongCat
HyperCLOVA X SEED Think (32B)
Naver logoNaver
24
32B
128k
-
-
-
GLM-4.6V (Reasoning)
Z AI logoZ AI
23
108B
12B active at inference time
128k
$0.4
85
SiliconFlowNovita
K-EXAONE (Non-reasoning)
LG AI Research logoLG AI Research
23
236B
23B active at inference time
256k
-
-
-
GLM-4.5-Air
Z AI logoZ AI
23
106B
12B active at inference time
128k
$0.3
79
SiliconFlowTogether.ai
Mistral Large 3
Mistral logoMistral
23
675B
41B active at inference time
256k
$0.6
52
MistralMicrosoft AzureAmazon Bedrock
Ring-1T
InclusionAI logoInclusionAI
23
1.0T
50B active at inference time
128k
-
-
-
Qwen3.5 4B (Non-reasoning)
Alibaba logoAlibaba
23
4.66B
262k
$0.0
215
DeepInfra
Qwen3 30B A3B 2507 (Reasoning)
Alibaba logoAlibaba
22
30.5B
3.3B active at inference time
262k
$0.4
123
ClarifaiAlibaba Cloud
DeepSeek V3 0324
DeepSeek logoDeepSeek
22
671B
37B active at inference time
128k
$1.2
-
HyperbolicReplicateNovita
+3
INTELLECT-3
Prime Intellect logoPrime Intellect
22
107B
12B active at inference time
131k
-
-
-
GLM-4.7-Flash (Non-reasoning)
Z AI logoZ AI
22
31.2B
3B active at inference time
200k
$0.1
107
Amazon BedrockNovita
Devstral 2
Mistral logoMistral
22
125B
256k
-
65
Mistral
Solar Open 100B (Reasoning)
Upstage logoUpstage
22
102B
12B active at inference time
128k
-
-
-
Nemotron 3 Nano Omni 30B A3B Reasoning
NVIDIA logoNVIDIA
21
30B
3B active at inference time
256k
$0.1
301
NebiusClarifai
MiniMax M1 40k
MiniMax logoMiniMax
21
456B
45.9B active at inference time
1.00M
-
-
-
gpt-oss-20B (low)
OpenAI logoOpenAI
21
21B
3.6B active at inference time
131k
$0.1
242
CompactifAIHyperbolicAmazon Bedrock
+9
Qwen3 VL 235B A22B Instruct
Alibaba logoAlibaba
21
235B
22B active at inference time
262k
$0.5
49
ParasailAlibaba CloudEigen AI
+2
K2-V2 (high)
MBZUAI Institute of Foundation Models logoMBZUAI Institute of Foundation Models
21
70B
512k
-
-
-
Qwen3 Next 80B A3B Instruct
Alibaba logoAlibaba
20
80B
3B active at inference time
262k
$0.7
150
GMIHyperbolicParasail
+4
Tri-21B-think Preview
Trillion Labs logoTrillion Labs
20
21B
32.0k
-
-
-
Qwen3 Coder 30B A3B Instruct
Alibaba logoAlibaba
20
30.5B
3.3B active at inference time
262k
$0.3
94
ClarifaiScalewayAmazon BedrockAlibaba Cloud
Qwen3 235B A22B (Reasoning)
Alibaba logoAlibaba
20
235B
22B active at inference time
32.8k
$1.5
51
Alibaba Cloud
QwQ 32B
Alibaba logoAlibaba
20
32.8B
131k
$0.7
29
Cloudflare
Qwen3 VL 30B A3B (Reasoning)
Alibaba logoAlibaba
20
30B
3B active at inference time
256k
$0.3
100
FireworksEigen AIAlibaba CloudNovita
Devstral Small 2
Mistral logoMistral
19
24B
256k
-
57
Mistral
Ling-1T
InclusionAI logoInclusionAI
19
1.0T
50B active at inference time
128k
-
-
-
DeepSeek R1 (Jan '25)
DeepSeek logoDeepSeek
19
685B
37B active at inference time
128k
$2.0
-
HyperbolicMicrosoft AzureAmazon Bedrock
+3
Gemma 4 E4B (Reasoning)
Google logoGoogle
19
8B
4.5B active at inference time
128k
-
-
-
K2-V2 (medium)
MBZUAI Institute of Foundation Models logoMBZUAI Institute of Foundation Models
19
70B
512k
-
-
-
Llama Nemotron Super 49B v1.5 (Reasoning)
NVIDIA logoNVIDIA
19
49B
128k
$0.1
45
DeepInfra
Mistral Small 4 (Non-reasoning)
Mistral logoMistral
19
119B
6.5B active at inference time
256k
$0.2
169
Mistral
Tri-21B-Think
Trillion Labs logoTrillion Labs
19
21B
32.0k
-
-
-
Hermes 4 - Llama-3.1 405B (Reasoning)
Nous Research logoNous Research
19
406B
128k
$1.2
38
Nebius
Llama 3.3 Nemotron Super 49B v1 (Reasoning)
NVIDIA logoNVIDIA
18
49B
128k
-
-
-
Llama 4 Maverick
Meta logoMeta
18
402B
17B active at inference time
1.00M
$0.3
110
NovitaTogether.aiSnowflake
+6
Qwen3 4B 2507 (Reasoning)
Alibaba logoAlibaba
18
4.02B
262k
-
-
-
MiniCPM5-1B (Reasoning)
OpenBMB logoOpenBMB
18
1B
128k
-
-
-
Magistral Small 1.2
Mistral logoMistral
18
24B
128k
$0.6
110
Amazon BedrockMistral
Sarvam 105B (high)
Sarvam logoSarvam
18
106B
10.3B active at inference time
128k
$0.0
114
Sarvam
Devstral Small (May '25)
Mistral logoMistral
18
23.6B
256k
-
-
-
MiniCPM5-1B (Non-reasoning)
OpenBMB logoOpenBMB
18
1B
128k
-
-
-
Hermes 4 - Llama-3.1 405B (Non-reasoning)
Nous Research logoNous Research
18
406B
128k
$1.2
40
Nebius
Llama 3.1 Instruct 405B
Meta logoMeta
17
405B
128k
$3.1
47
Amazon BedrockDatabricksAmazon BedrockMicrosoft Azure
Qwen3 VL 32B Instruct
Alibaba logoAlibaba
17
33.4B
256k
$0.9
68
Alibaba Cloud
DeepSeek R1 Distill Qwen 32B
DeepSeek logoDeepSeek
17
32B
128k
-
-
-
GLM-4.6V (Non-reasoning)
Z AI logoZ AI
17
108B
12B active at inference time
128k
$0.4
94
SiliconFlowNovita
Qwen3 235B A22B (Non-reasoning)
Alibaba logoAlibaba
17
235B
22B active at inference time
32.8k
$0.6
46
NovitaAlibaba Cloud
Magistral Small 1
Mistral logoMistral
17
23.6B
40.0k
-
-
-
EXAONE 4.0 32B (Reasoning)
LG AI Research logoLG AI Research
17
32B
131k
-
-
-
Qwen3 VL 8B (Reasoning)
Alibaba logoAlibaba
17
8.77B
256k
$0.4
108
Alibaba Cloud
Qwen3 32B (Reasoning)
Alibaba logoAlibaba
17
32.8B
32.8k
$0.2
81
NebiusDeepInfraAlibaba Cloud
+3
DeepSeek V3 (Dec '24)
DeepSeek logoDeepSeek
16
671B
37B active at inference time
128k
$0.4
-
Together.aiDeepInfraHyperbolic
+2
DeepSeek R1 0528 Qwen3 8B
DeepSeek logoDeepSeek
16
8.19B
32.8k
-
-
-
Qwen3.5 2B (Reasoning)
Alibaba logoAlibaba
16
2.27B
262k
$0.0
-
DeepInfra
Qwen3 14B (Reasoning)
Alibaba logoAlibaba
16
14.8B
32.8k
$0.4
60
Alibaba CloudDeepInfra
Nanbeige4.1-3B
Nanbeige logoNanbeige
16
3.93B
256k
-
-
-
Qwen3 VL 30B A3B Instruct
Alibaba logoAlibaba
16
30B
3B active at inference time
256k
$0.2
107
FireworksAlibaba CloudEigen AINovita
Hermes 4 - Llama-3.1 70B (Reasoning)
Nous Research logoNous Research
16
70.6B
128k
$0.2
82
Nebius
Ministral 3 14B
Mistral logoMistral
16
14B
256k
$0.2
87
Amazon BedrockMistral
DeepSeek R1 Distill Llama 70B
DeepSeek logoDeepSeek
16
70B
128k
$0.7
42
SambaNovaScalewayDeepInfra
DeepSeek R1 Distill Qwen 14B
DeepSeek logoDeepSeek
16
14B
128k
-
-
-
Falcon-H1R-7B
TII UAE logoTII UAE
16
7B
256k
-
-
-
Ling-flash-2.0
InclusionAI logoInclusionAI
16
103B
6.1B active at inference time
128k
$0.2
78
SiliconFlow
Qwen3 Omni 30B A3B (Reasoning)
Alibaba logoAlibaba
16
35.3B
3B active at inference time
65.5k
$0.3
75
Alibaba Cloud
Qwen2.5 Instruct 72B
Alibaba logoAlibaba
16
72B
131k
$0.2
-
Alibaba CloudDeepInfraSiliconFlow
Step3 VL 10B
StepFun logoStepFun
15
10.2B
65.5k
-
-
-
Qwen3 30B A3B (Reasoning)
Alibaba logoAlibaba
15
30.5B
3.3B active at inference time
32.8k
$0.1
65
Eigen AIAlibaba CloudDeepInfra
+2
Devstral Small (Jul '25)
Mistral logoMistral
15
24B
256k
$0.1
56
Mistral
Gemma 4 E2B (Reasoning)
Google logoGoogle
15
5.1B
2.3B active at inference time
128k
-
-
-
QwQ 32B-Preview
Alibaba logoAlibaba
15
32.8B
32.8k
-
-
-
GLM-4.5V (Reasoning)
Z AI logoZ AI
15
108B
12B active at inference time
64.0k
$0.7
26
Novita
Mistral Large 2 (Nov '24)
Mistral logoMistral
15
123B
128k
$2.4
50
Mistral
Mistral Small 3.2
Mistral logoMistral
15
24B
128k
$0.1
141
DeepInfraMistral
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)
NVIDIA logoNVIDIA
15
253B
128k
$0.7
52
Nebius
Qwen3 30B A3B 2507 Instruct
Alibaba logoAlibaba
15
30.5B
3.3B active at inference time
262k
$0.2
97
ClarifaiCoreWeaveAlibaba CloudNebius
ERNIE 4.5 300B A47B
Baidu logoBaidu
15
300B
47B active at inference time
131k
$0.4
-
SiliconFlowNovita
NVIDIA Nemotron Nano 12B v2 VL (Reasoning)
NVIDIA logoNVIDIA
15
13.2B
128k
$0.2
296
DeepInfra
Ministral 3 8B
Mistral logoMistral
15
8B
256k
$0.1
107
MistralAmazon Bedrock
Gemma 4 E4B (Non-reasoning)
Google logoGoogle
15
8B
4.5B active at inference time
128k
-
-
-
NVIDIA Nemotron Nano 9B V2 (Reasoning)
NVIDIA logoNVIDIA
15
9B
131k
$0.1
119
DeepInfra
Granite 4.1 30B
IBM logoIBM
15
30B
131k
-
-
-
NVIDIA Nemotron 3 Nano 4B
NVIDIA logoNVIDIA
15
3.97B
262k
-
-
-
Qwen3.5 2B (Non-reasoning)
Alibaba logoAlibaba
15
2.27B
262k
$0.0
366
DeepInfra
Llama Nemotron Super 49B v1.5 (Non-reasoning)
NVIDIA logoNVIDIA
15
49B
128k
$0.1
46
DeepInfra
Qwen3 32B (Non-reasoning)
Alibaba logoAlibaba
15
32.8B
32.8k
$0.2
85
DeepInfraNebiusAlibaba Cloud
+4
Llama 3.3 Instruct 70B
Meta logoMeta
14
70B
128k
$0.6
79
ScalewayNovitaDeepInfra
+18
Mistral Small 3.1
Mistral logoMistral
14
24B
128k
$0.1
170
MistralDeepInfraCloudflareCompactifAI
K2-V2 (low)
MBZUAI Institute of Foundation Models logoMBZUAI Institute of Foundation Models
14
70B
512k
-
-
-
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)
NVIDIA logoNVIDIA
14
4.51B
128k
-
-
-
Kimi Linear 48B A3B Instruct
Kimi logoKimi
14
49.1B
3B active at inference time
1.00M
-
-
-
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)
NVIDIA logoNVIDIA
14
49B
128k
-
-
-
Qwen3 VL 8B Instruct
Alibaba logoAlibaba
14
8.77B
256k
$0.2
119
Alibaba Cloud
Qwen3 4B (Reasoning)
Alibaba logoAlibaba
14
4.02B
32.0k
$0.2
-
Alibaba Cloud
Llama 3.1 Tulu3 405B
Allen Institute for AI logoAllen Institute for AI
14
405B
128k
-
-
-
Ring-flash-2.0
InclusionAI logoInclusionAI
14
103B
6.1B active at inference time
128k
$0.2
-
SiliconFlow
Pixtral Large
Mistral logoMistral
14
124B
128k
$2.4
53
Mistral
Olmo 3.1 32B Think
Allen Institute for AI logoAllen Institute for AI
14
32.2B
65.5k
-
-
Parasail
Grok 2 (Dec '24)
xAI logoxAI
14
270B
131k
-
-
-
Qwen3 VL 4B (Reasoning)
Alibaba logoAlibaba
14
4.44B
256k
-
-
-
Llama 4 Scout
Meta logoMeta
14
109B
17B active at inference time
10.0M
$0.2
106
GroqAmazon BedrockCloudflare
+6
Command A
Cohere logoCohere
13
111B
256k
$3.3
69
CohereMicrosoft Azure
Llama 3.1 Nemotron Instruct 70B
NVIDIA logoNVIDIA
13
70B
128k
$1.2
303
DeepInfra
Qwen2.5 Instruct 32B
Alibaba logoAlibaba
13
32B
128k
-
-
-
Qwen3 8B (Reasoning)
Alibaba logoAlibaba
13
8.19B
131k
$0.2
36
Alibaba CloudEigen AI
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)
NVIDIA logoNVIDIA
13
31.6B
3.6B active at inference time
1.00M
$0.1
91
DeepInfra
NVIDIA Nemotron Nano 9B V2 (Non-reasoning)
NVIDIA logoNVIDIA
13
9B
131k
$0.1
140
Amazon BedrockDeepInfra
Mistral Large 2 (Jul '24)
Mistral logoMistral
13
123B
128k
$2.4
-
Amazon Bedrock
Qwen3 4B 2507 Instruct
Alibaba logoAlibaba
13
4.02B
262k
-
-
-
Qwen2.5 Coder Instruct 32B
Alibaba logoAlibaba
13
32B
131k
-
-
-
Qwen3 14B (Non-reasoning)
Alibaba logoAlibaba
13
14.8B
32.8k
$0.3
59
DeepInfraAlibaba Cloud
GLM-4.5V (Non-reasoning)
Z AI logoZ AI
13
108B
12B active at inference time
64.0k
$0.7
34
Novita
Mistral Small 3
Mistral logoMistral
13
24B
32.0k
$0.1
164
MistralDeepInfra
MiniCPM-V 4.6 1.3B
OpenBMB logoOpenBMB
13
1.3B
262k
-
-
-
Hermes 4 - Llama-3.1 70B (Non-reasoning)
Nous Research logoNous Research
13
70.6B
128k
$0.2
89
Nebius
Qwen3 30B A3B (Non-reasoning)
Alibaba logoAlibaba
13
30.5B
3.3B active at inference time
32.8k
$0.1
66
DeepInfraAlibaba CloudEigen AI
DeepSeek-V2.5 (Dec '24)
DeepSeek logoDeepSeek
13
236B
21B active at inference time
128k
-
-
-
Qwen3 4B (Non-reasoning)
Alibaba logoAlibaba
12
4.02B
32.0k
$0.1
-
Alibaba Cloud
Llama 3.1 Instruct 70B
Meta logoMeta
12
70B
128k
$0.6
36
DeepInfraDeepInfraAmazon BedrockAmazon Bedrock
Granite 4.1 8B
IBM logoIBM
12
8B
131k
$0.1
120
CoreWeave
Sarvam 30B (high)
Sarvam logoSarvam
12
32.2B
2.4B active at inference time
65.5k
$0.0
171
Sarvam
DeepSeek-V2.5
DeepSeek logoDeepSeek
12
236B
21B active at inference time
128k
-
-
-
Olmo 3.1 32B Instruct
Allen Institute for AI logoAllen Institute for AI
12
32.2B
65.5k
-
-
-
DeepSeek R1 Distill Llama 8B
DeepSeek logoDeepSeek
12
8B
128k
-
-
-
Gemma 4 E2B (Non-reasoning)
Google logoGoogle
12
5.1B
2.3B active at inference time
128k
-
-
-
Olmo 3 32B Think
Allen Institute for AI logoAllen Institute for AI
12
32.2B
65.5k
-
-
-
R1 1776
Perplexity logoPerplexity
12
671B
37B active at inference time
128k
-
-
-
Llama 3.2 Instruct 90B (Vision)
Meta logoMeta
12
90B
128k
$1.4
58
Microsoft AzureAmazon Bedrock
Solar Mini
Upstage logoUpstage
12
10.7B
4.10k
$0.1
-
Upstage
Llama 3.1 Instruct 8B
Meta logoMeta
12
8B
128k
$0.1
154
Amazon BedrockMicrosoft AzureDatabricks
+12
Grok-1
xAI logoxAI
12
314B
78B active at inference time
8.19k
-
-
-
Qwen2 Instruct 72B
Alibaba logoAlibaba
12
72B
131k
-
-
-
EXAONE 4.0 32B (Non-reasoning)
LG AI Research logoLG AI Research
12
32B
131k
-
-
-
Ministral 3 3B
Mistral logoMistral
11
3B
256k
$0.1
181
Amazon BedrockMistral
DeepHermes 3 - Mistral 24B Preview (Non-reasoning)
Nous Research logoNous Research
11
24B
32.0k
-
-
-
Jamba 1.7 Large
AI21 Labs logoAI21 Labs
11
398B
94B active at inference time
256k
$2.6
60
AI21 Labs
Granite 4.0 H Small
IBM logoIBM
11
32B
9B active at inference time
128k
$0.1
417
Replicate
Jamba 1.5 Large
AI21 Labs logoAI21 Labs
11
398B
94B active at inference time
256k
$2.6
-
Amazon Bedrock
Qwen3 Omni 30B A3B Instruct
Alibaba logoAlibaba
11
35.3B
3B active at inference time
65.5k
$0.3
92
Alibaba Cloud
Hermes 3 - Llama-3.1 70B
Nous Research logoNous Research
11
70.6B
128k
$0.3
26
DeepInfra
Qwen3 8B (Non-reasoning)
Alibaba logoAlibaba
11
8.19B
32.8k
$0.2
38
Eigen AI, Alibaba Cloud, Fireworks
DeepSeek-Coder-V2
DeepSeek logoDeepSeek
11
236B
21B active at inference time
128k
-
-
-
OLMo 2 32B
Allen Institute for AI logoAllen Institute for AI
11
32.2B
4.10k
-
-
-
Jamba 1.6 Large
AI21 Labs logoAI21 Labs
11
398B
94B active at inference time
256k
$2.6
61
AI21 Labs
Qwen3.5 0.8B (Reasoning)
Alibaba logoAlibaba
11
0.873B
262k
$0.0
-
DeepInfra
LFM2 24B A2B
Liquid AI logoLiquid AI
10
23.8B
2.3B active at inference time
32.8k
$0.0
118
Together.ai
Phi-4
Microsoft logoMicrosoft
10
14B
16.0k
$0.2
31
DeepInfra, Microsoft Azure
Gemma 3 27B Instruct
Google logoGoogle
10
27.4B
128k
$0.1
-
DeepInfra, Novita, Amazon Bedrock, +3 more
Mistral Small (Sep '24)
Mistral logoMistral
10
22B
32.8k
$0.2
170
Mistral
Phi-3 Mini Instruct 3.8B
Microsoft logoMicrosoft
10
3.8B
4.10k
-
-
-
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)
NVIDIA logoNVIDIA
10
13.2B
128k
$0.2
230
DeepInfra, Amazon Bedrock
Gemma 3n E4B Instruct Preview (May '25)
Google logoGoogle
10
8.39B
4B active at inference time
32.0k
-
-
-
Phi-4 Multimodal Instruct
Microsoft logoMicrosoft
10
5.6B
128k
-
17
Microsoft Azure
Qwen2.5 Coder Instruct 7B
Alibaba logoAlibaba
10
7.62B
131k
-
-
-
Qwen3.5 0.8B (Non-reasoning)
Alibaba logoAlibaba
10
0.873B
262k
$0.0
85
DeepInfra
Mixtral 8x22B Instruct
Mistral logoMistral
10
141B
39B active at inference time
65.4k
-
-
-
Llama 2 Chat 7B
Meta logoMeta
10
7B
4.10k
$0.1
-
Replicate
Llama 3.2 Instruct 3B
Meta logoMeta
10
3B
128k
$0.1
51
Amazon Bedrock
Jamba Reasoning 3B
AI21 Labs logoAI21 Labs
10
3B
262k
-
-
-
Qwen3 VL 4B Instruct
Alibaba logoAlibaba
10
4.44B
256k
-
-
-
Qwen1.5 Chat 110B
Alibaba logoAlibaba
10
110B
32.0k
-
-
-
Reka Flash 3
Reka AI logoReka AI
10
21B
128k
$0.3
-
Reka AI
Olmo 3 7B Think
Allen Institute for AI logoAllen Institute for AI
9
7B
65.5k
-
-
-
OLMo 2 7B
Allen Institute for AI logoAllen Institute for AI
9
7.3B
4.10k
-
-
-
Molmo 7B-D
Allen Institute for AI logoAllen Institute for AI
9
8.02B
4.10k
-
-
-
Ling-mini-2.0
InclusionAI logoInclusionAI
9
16.3B
1.4B active at inference time
131k
-
-
-
DeepSeek R1 Distill Qwen 1.5B
DeepSeek logoDeepSeek
9
1.5B
128k
-
-
-
DeepSeek-V2-Chat
DeepSeek logoDeepSeek
9
236B
21B active at inference time
128k
-
-
-
Llama 3 Instruct 70B
Meta logoMeta
9
70B
8.19k
$0.9
-
Amazon Bedrock, Novita, Replicate
Arctic Instruct
Snowflake logoSnowflake
9
480B
17B active at inference time
4.00k
-
-
-
Qwen Chat 72B
Alibaba logoAlibaba
9
72B
33.8k
-
-
-
Gemma 3 12B Instruct
Google logoGoogle
9
12.2B
128k
$0.1
-
Cloudflare, Google, DeepInfra, +2 more
Llama 3.2 Instruct 11B (Vision)
Meta logoMeta
9
11B
128k
$0.2
51
Amazon Bedrock, DeepInfra, Microsoft Azure
Granite 4.1 3B
IBM logoIBM
9
3B
131k
-
-
-
DeepSeek Coder V2 Lite Instruct
DeepSeek logoDeepSeek
8
16B
2.4B active at inference time
128k
-
-
-
Sarvam M (Reasoning)
Sarvam logoSarvam
8
23.6B
32.8k
-
-
Sarvam
Phi-4 Mini Instruct
Microsoft logoMicrosoft
8
3.84B
128k
-
21
CoreWeave, Microsoft Azure
Llama 2 Chat 70B
Meta logoMeta
8
70B
4.10k
-
-
-
DeepSeek LLM 67B Chat (V1)
DeepSeek logoDeepSeek
8
67B
4.10k
-
-
-
Llama 2 Chat 13B
Meta logoMeta
8
13B
4.10k
-
-
-
Command-R+ (Apr '24)
Cohere logoCohere
8
104B
128k
$4.2
-
Amazon Bedrock
OpenChat 3.5 (1210)
OpenChat logoOpenChat
8
7B
8.19k
-
-
-
DBRX Instruct
Databricks logoDatabricks
8
132B
36B active at inference time
32.8k
-
-
-
Exaone 4.0 1.2B (Reasoning)
LG AI Research logoLG AI Research
8
1.28B
64.0k
-
-
-
Olmo 3 7B Instruct
Allen Institute for AI logoAllen Institute for AI
8
7B
65.5k
$0.1
-
Parasail
Exaone 4.0 1.2B (Non-reasoning)
LG AI Research logoLG AI Research
8
1.28B
64.0k
-
-
-
LFM2.5-1.2B-Thinking
Liquid AI logoLiquid AI
8
1.17B
32.0k
-
-
-
Jamba 1.7 Mini
AI21 Labs logoAI21 Labs
8
52B
12B active at inference time
258k
-
-
-
LFM2 2.6B
Liquid AI logoLiquid AI
8
2.57B
32.8k
-
-
?
LFM2.5-1.2B-Instruct
Liquid AI logoLiquid AI
8
1.17B
32.0k
-
-
?
Jamba 1.5 Mini
AI21 Labs logoAI21 Labs
8
52B
12B active at inference time
256k
$0.2
-
Amazon Bedrock
Granite 4.0 H 1B
IBM logoIBM
8
1.5B
128k
-
-
-
Qwen3 1.7B (Reasoning)
Alibaba logoAlibaba
8
2.03B
32.0k
$0.2
-
Alibaba Cloud
Jamba 1.6 Mini
AI21 Labs logoAI21 Labs
8
52B
12B active at inference time
256k
$0.2
187
AI21 Labs
Mixtral 8x7B Instruct
Mistral logoMistral
8
46.7B
12.9B active at inference time
32.8k
$0.5
-
Amazon Bedrock
Gemma 3 270M
Google logoGoogle
8
0.268B
32.0k
-
-
-
Apertus 70B Instruct
Swiss AI Initiative logoSwiss AI Initiative
8
70B
65.5k
$1.0
-
Public AI
Granite 4.0 Micro
IBM logoIBM
8
3B
128k
-
-
-
DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)
Nous Research logoNous Research
8
8B
128k
-
-
-
Llama 65B
Meta logoMeta
7
65B
2.05k
-
-
-
Qwen Chat 14B
Alibaba logoAlibaba
7
14B
8.19k
-
-
-
Mistral 7B Instruct
Mistral logoMistral
7
7B
8.19k
$0.2
104
Mistral, Amazon Bedrock
Command-R (Mar '24)
Cohere logoCohere
7
35B
128k
$0.6
-
Amazon Bedrock
Granite 4.0 1B
IBM logoIBM
7
1.6B
128k
-
-
-
Molmo2-8B
Allen Institute for AI logoAllen Institute for AI
7
8.66B
36.9k
-
-
-
LFM2 8B A1B
Liquid AI logoLiquid AI
7
8.34B
1.5B active at inference time
32.8k
-
-
?
Granite 3.3 8B (Non-reasoning)
IBM logoIBM
7
8.17B
128k
$0.1
338
Replicate
Qwen3 1.7B (Non-reasoning)
Alibaba logoAlibaba
7
2.03B
32.0k
$0.1
-
Alibaba Cloud
Qwen3 0.6B (Reasoning)
Alibaba logoAlibaba
6
0.752B
32.0k
$0.2
-
Alibaba Cloud
Llama 3 Instruct 8B
Meta logoMeta
6
8B
8.19k
$0.1
-
Novita, DeepInfra, Amazon Bedrock, Replicate
Gemma 3n E4B Instruct
Google logoGoogle
6
8.39B
4B active at inference time
32.0k
$0.0
53
Together.ai
LFM2 1.2B
Liquid AI logoLiquid AI
6
1.17B
32.8k
-
-
?
Gemma 3 4B Instruct
Google logoGoogle
6
4.3B
128k
$0.0
-
Amazon Bedrock, Google, DeepInfra
Llama 3.2 Instruct 1B
Meta logoMeta
6
1B
128k
$0.1
84
Novita, Amazon Bedrock
LFM2.5-VL-1.6B
Liquid AI logoLiquid AI
6
1.6B
32.0k
-
-
?
Granite 4.0 350M
IBM logoIBM
6
0.35B
32.8k
-
-
-
Apertus 8B Instruct
Swiss AI Initiative logoSwiss AI Initiative
6
8B
65.5k
$0.1
-
Public AI
Qwen3 0.6B (Non-reasoning)
Alibaba logoAlibaba
6
0.752B
32.0k
$0.1
-
Alibaba Cloud
Gemma 3 1B Instruct
Google logoGoogle
6
1B
32.0k
-
-
Google
Granite 4.0 H 350M
IBM logoIBM
5
0.34B
32.8k
-
-
-
Gemma 3n E2B Instruct
Google logoGoogle
5
5.98B
2B active at inference time
32.0k
-
-
Google
Tiny Aya Global
Cohere logoCohere
5
3.35B
8.19k
-
-
Cohere
EXAONE 4.5 33B (Non-reasoning)
LG AI Research logoLG AI Research
-
34.4B
262k
-
-
-
Cogito v2.1 (Reasoning)
Deep Cogito logoDeep Cogito
-
671B
37B active at inference time
128k
$1.3
67
Together.ai