Comparison of Open Source Models

Comparison and analysis of open source AI models across key metrics including quality, inference speed, context window, parameter count, and licensing details. Models are considered open source (also commonly referred to as open weights) when their weights are available to download. This allows self-hosting on your own infrastructure and customizing the model, for example through fine-tuning. For more details on our methodology, see our FAQs.

GLM-5 (Z AI) and Kimi K2.5 (Kimi) are the highest intelligence open source models, followed by Qwen3.5 397B A17B (Alibaba) and GLM-4.7 (Z AI).

Intelligence
Artificial Analysis Intelligence Index; Higher is better
Estimate (independent evaluation forthcoming)
Total Parameters
Trainable parameters in billions

Openness

Artificial Analysis Openness Index: Results

The Openness Index assesses model openness on a normalized scale from 0 to 100 (higher is more open).

Open Source Progress

Progress in Open Weights vs. Proprietary Intelligence

Artificial Analysis Intelligence Index v4.0 incorporates 10 evaluations: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt
Open Weights
Proprietary
Reasoning models are indicated by a lightbulb icon.

See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

The Open Weights label indicates whether the model weights are available. Models are labelled 'Commercial Use Restricted' if the weights are available but commercial use is limited (typically requiring a paid license).

Artificial Analysis Intelligence Index


Open Source Language Models Intelligence By Lab Over Time

Alibaba
DeepSeek
Google
Meta
Microsoft Azure
Mistral
NVIDIA

Open Source Models Intelligence By Size Over Time

Large Models (>150B)
Medium Models (40B-150B)
Small Models (4B-40B)
Tiny Models (≤4B)

  • Tiny: Less than or equal to 4B parameters. These models usually have the lowest resource demands.
  • Small: More than 4B and less than 40B parameters.
  • Medium: Between 40B and 150B parameters.
  • Large: Over 150B parameters.
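
The size buckets above can be expressed as a small lookup. The following is a minimal sketch, not the site's own implementation: the function name `size_category` is hypothetical, and treating exactly 40B and exactly 150B as Medium is an assumption read off the wording of the list.

```python
def size_category(total_params_b: float) -> str:
    """Return the size bucket for a model, given total parameters in billions.

    Boundary handling is an assumption: exactly 40B and exactly 150B
    are treated as Medium, following the wording of the list above.
    """
    if total_params_b <= 0:
        raise ValueError("parameter count must be positive")
    if total_params_b <= 4:
        return "Tiny"
    if total_params_b < 40:
        return "Small"
    if total_params_b <= 150:
        return "Medium"
    return "Large"


# Total parameter counts (in billions) as listed in the model table on this page.
examples = [
    ("Qwen3.5 0.8B", 0.873),
    ("Qwen3.5 27B", 27.8),
    ("Qwen3.5 122B A10B", 125),
    ("GLM-5", 744),
]
for name, params in examples:
    print(f"{name}: {size_category(params)}")
```

Note that a model marketed as "4B" can still fall in the Small bucket if its exact count is slightly above 4B, as with the 4.02B entries in the table.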

Intelligence Evaluations

Intelligence evaluations measured independently by Artificial Analysis; Higher is better
Results claimed by AI Lab (not yet independently verified)
GDPval-AA
Terminal-Bench Hard
𝜏²-Bench Telecom
AA-LCR
AA-Omniscience Accuracy
AA-Omniscience Non-Hallucination Rate
Humanity's Last Exam
GPQA Diamond
SciCode
IFBench
CritPt
MMMU-Pro

While model intelligence generally translates across use cases, specific evaluations may be more relevant for certain use cases.

Size

Intelligence Index By Model Size

Large Models (>150B)
Small Models (4B-40B)
Medium Models (40B-150B)

Model Size: Total and Active Parameters

Comparison between total model parameters and parameters active during inference
Active Parameters
Passive Parameters

Total parameters: the total number of trainable weights and biases in the model, expressed in billions. These parameters are learned during training and determine the model's ability to process and generate responses.

Active parameters: the number of parameters actually executed during each inference forward pass, expressed in billions. For Mixture of Experts (MoE) models, a routing mechanism selects a subset of experts per token, so fewer parameters are active than the model has in total. Dense models use all parameters, so active equals total.
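
To make the relationship between total, active, and passive parameters concrete, here is a minimal sketch. The function name `parameter_split` and the returned field names are hypothetical; "passive" is taken to mean total minus active, matching the chart legend above.

```python
from typing import Optional


def parameter_split(total_b: float, active_b: Optional[float] = None) -> dict:
    """Split a model's parameters (in billions) into active and passive parts.

    For dense models, pass only total_b: all parameters are active.
    For MoE models, pass the active count reported for inference.
    """
    if active_b is None:
        active_b = total_b  # dense model: active equals total
    if not 0 < active_b <= total_b:
        raise ValueError("active parameters must be positive and at most the total")
    return {
        "active_b": active_b,
        "passive_b": total_b - active_b,
        "active_share": active_b / total_b,
    }


# GLM-5 is listed with 744B total and 40B active parameters.
moe = parameter_split(744, 40)
print(f"MoE: {moe['active_share']:.1%} of parameters active per token")

# Qwen3.5 27B is listed without a separate active count, so it is treated as dense here.
dense = parameter_split(27.8)
print(f"Dense: {dense['active_share']:.0%} of parameters active per token")
```

A low active share is what lets a large MoE model run each token through far fewer parameters than its total size suggests, although all parameters still need to be stored.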

Intelligence vs. Active Parameters

Active Parameters at Inference Time; Artificial Analysis Intelligence Index
Most attractive quadrant
Alibaba
DeepSeek
Google
Kimi
LG AI Research
MBZUAI Institute of Foundation Models
Meta
Mistral
NVIDIA
OpenAI
Xiaomi
Z AI

Intelligence vs. Total Parameters

Artificial Analysis Intelligence Index; Size in Parameters (Billions)
Most attractive quadrant
Alibaba
DeepSeek
Google
Kimi
LG AI Research
MBZUAI Institute of Foundation Models
Meta
Mistral
NVIDIA
OpenAI
Xiaomi
Z AI

Context Window

Context Window: Token limit; higher is better

Larger context windows matter for Retrieval Augmented Generation (RAG) workflows, which typically involve reasoning over and retrieving information from large amounts of data.

Context window: the maximum number of combined input and output tokens. Output tokens commonly have a significantly lower limit, which varies by model.
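
Because the context window covers input and output together, a request has to be checked against both the combined limit and any separate output limit. The sketch below illustrates that check; the function name `fits_context`, its parameters, and the example token counts are hypothetical, and the 200,000 figure assumes the "200k" shown in the table means exactly 200,000 tokens.

```python
from typing import Optional


def fits_context(
    input_tokens: int,
    requested_output_tokens: int,
    context_window: int,
    output_limit: Optional[int] = None,
) -> bool:
    """Check whether a request fits a model's token limits.

    context_window is the combined input + output limit.
    output_limit is the separate (usually lower) cap on output tokens, if known.
    """
    if input_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if output_limit is not None and requested_output_tokens > output_limit:
        return False
    return input_tokens + requested_output_tokens <= context_window


# Assumed 200,000-token window; the input and output sizes are made up.
print(fits_context(180_000, 16_000, context_window=200_000))
print(fits_context(190_000, 16_000, context_window=200_000))
print(fits_context(10_000, 16_000, context_window=200_000, output_limit=8_000))
```

In a RAG workflow, the same check can be used to decide how many retrieved passages to include before the prompt overflows the window.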

Further details
Weights
Provider
Benchmarks
Z AI logo
GLM-5 (Reasoning)
Z AI
50
744B
(40B active at inference time)
200k
$1.6
75
🤗
SiliconFlow
DeepInfra
Baseten
+11 more
View
Kimi logo
Kimi K2.5 (Reasoning)
Kimi
47
1T
(32B active at inference time)
256k
$1.2
39
🤗
SiliconFlow
Kimi
Together.ai
+14 more
View
Alibaba logo
Qwen3.5 397B A17B (Reasoning)
Alibaba
45
397B
(17B active at inference time)
262k
$1.4
88
🤗
Clarifai
Alibaba Cloud
Parasail
+6 more
View
Z AI logo
GLM-4.7 (Reasoning)
Z AI
42
357B
(32B active at inference time)
200k
$1.0
77
🤗
Amazon Bedrock
Baseten
Together.ai
+8 more
View
Alibaba logo
Qwen3.5 27B (Reasoning)
Alibaba
42
27.8B
262k
$0.8
83
🤗
Novita
Alibaba Cloud
GMI
+1 more
View
MiniMax logo
MiniMax-M2.5
MiniMax
42
230B
(10B active at inference time)
205k
$0.5
65
🤗
SambaNova
FriendliAI
SiliconFlow
+10 more
View
DeepSeek logo
DeepSeek V3.2 (Reasoning)
DeepSeek
42
685B
(37B active at inference time)
128k
$0.3
45
🤗
Parasail
Fireworks
SiliconFlow
+8 more
View
Alibaba logo
Qwen3.5 122B A10B (Reasoning)
Alibaba
42
125B
(10B active at inference time)
262k
$1.1
139
🤗
Alibaba Cloud
GMI
Novita
+1 more
View
Xiaomi logo
MiMo-V2-Flash (Feb 2026)
Xiaomi
41
309B
(15B active at inference time)
256k
$0.1
127
🤗
Xiaomi
View
Kimi logo
Kimi K2 Thinking
Kimi
41
1T
(32B active at inference time)
256k
$1.1
97
🤗
Google
Nebius
Amazon Bedrock
+5 more
View
Z AI logo
GLM-5 (Non-reasoning)
Z AI
41
744B
(40B active at inference time)
200k
$1.6
59
🤗
Fireworks
DeepInfra
SiliconFlow
+4 more
View
Alibaba logo
Qwen3.5 397B A17B (Non-reasoning)
Alibaba
40
397B
(17B active at inference time)
262k
$1.4
83
🤗
Alibaba Cloud
Nebius
Eigen AI
+3 more
View
MiniMax logo
MiniMax-M2.1
MiniMax
39
230B
(10B active at inference time)
205k
$0.5
70
🤗
MiniMax
DeepInfra
Fireworks
+3 more
View
Xiaomi logo
MiMo-V2-Flash (Reasoning)
Xiaomi
39
309B
(15B active at inference time)
256k
$0.1
128
🤗
Xiaomi
View
Google logo
Gemma 4 31B (Reasoning)
Google
39
31B
256k
-
36
🤗
Parasail
Google
Novita
+1 more
View
StepFun logo
Step 3.5 Flash
StepFun
38
196B
(11B active at inference time)
256k
$0.1
89
🤗
SiliconFlow
StepFun
View
Kimi logo
Kimi K2.5 (Non-reasoning)
Kimi
37
1T
(32B active at inference time)
256k
$1.2
41
🤗
Fireworks
Novita
Baseten
+6 more
View
Alibaba logo
Qwen3.5 27B (Non-reasoning)
Alibaba
37
27.8B
262k
$0.8
82
🤗
DeepInfra
Alibaba Cloud
View
Alibaba logo
Qwen3.5 35B A3B (Reasoning)
Alibaba
37
36B
(3B active at inference time)
262k
$0.7
197
🤗
Alibaba Cloud
DeepInfra
GMI
+1 more
View
MiniMax logo
MiniMax-M2
MiniMax
36
230B
(10B active at inference time)
205k
$0.5
65
🤗
Novita
Amazon Bedrock
Google
+1 more
View
NVIDIA logo
NVIDIA Nemotron 3 Super 120B A12B (Reasoning)
NVIDIA
36
120.6B
(12.7B active at inference time)
1.00M
$0.4
155
🤗
Lightning AI
Weights & Biases
DeepInfra
+2 more
View
Alibaba logo
Qwen3.5 122B A10B (Non-reasoning)
Alibaba
36
125B
(10B active at inference time)
262k
$1.1
137
🤗
DeepInfra
Alibaba Cloud
View
Z AI logo
GLM-4.7 (Non-reasoning)
Z AI
34
357B
(32B active at inference time)
200k
$0.9
76
🤗
Amazon Bedrock
Baseten
Google
+7 more
View
DeepSeek logo
DeepSeek V3.1 Terminus (Reasoning)
DeepSeek
34
685B
(37B active at inference time)
128k
$0.8
-
🤗
Novita
Eigen AI
SambaNova
View
OpenAI logo
gpt-oss-120B (high)
OpenAI
33
117B
(5.1B active at inference time)
131k
$0.3
240
🤗
Parasail
Nebius
Databricks
+21 more
View
DeepSeek logo
DeepSeek V3.2 Exp (Reasoning)
DeepSeek
33
685B
(37B active at inference time)
128k
$0.3
44
🤗
DeepSeek
Novita
View
Z AI logo
GLM-4.6 (Reasoning)
Z AI
33
357B
(32B active at inference time)
200k
$1.0
70
🤗
DeepInfra
Baseten
Together.ai
+1 more
View
Alibaba logo
Qwen3.5 9B (Reasoning)
Alibaba
32
9.65B
262k
$0.1
157
🤗
Together.ai
DeepInfra
View
LG AI Research logo
K-EXAONE (Reasoning)
LG AI Research
32
236B
(23B active at inference time)
256k
-
-
🤗
-
View
DeepSeek logo
DeepSeek V3.2 (Non-reasoning)
DeepSeek
32
685B
(37B active at inference time)
128k
$0.3
40
🤗
Nebius
Novita
Microsoft Azure
+11 more
View
Google logo
Gemma 4 26B A4B (Reasoning)
Google
31
27B
(4B active at inference time)
256k
$0.2
-
🤗
Parasail
Google
Novita
View
Kimi logo
Kimi K2 0905
Kimi
31
1T
(32B active at inference time)
256k
$1.1
50
🤗
Novita
Fireworks
DeepInfra
+1 more
View
Alibaba logo
Qwen3.5 35B A3B (Non-reasoning)
Alibaba
31
36B
(3B active at inference time)
262k
$0.7
197
🤗
Alibaba Cloud
DeepInfra
View
Xiaomi logo
MiMo-V2-Flash (Non-reasoning)
Xiaomi
30
309B
(15B active at inference time)
256k
$0.1
123
🤗
Xiaomi
View
Z AI logo
GLM-4.6 (Non-reasoning)
Z AI
30
357B
(32B active at inference time)
200k
$1.0
78
🤗
Novita
Together.ai
View
Z AI logo
GLM-4.7-Flash (Reasoning)
Z AI
30
31.2B
(3B active at inference time)
200k
$0.2
87
🤗
Novita
DeepInfra
Amazon Bedrock
View
Alibaba logo
Qwen3 235B A22B 2507 (Reasoning)
Alibaba
30
235B
(22B active at inference time)
256k
$2.6
41
🤗
Eigen AI
Weights & Biases
Novita
+4 more
View
DeepSeek logo
DeepSeek V3.2 Speciale
DeepSeek
29
685B
(37B active at inference time)
128k
-
-
🤗
-
View
DeepSeek logo
DeepSeek V3.1 Terminus (Non-reasoning)
DeepSeek
29
685B
(37B active at inference time)
128k
$0.6
-
🤗
Novita
Eigen AI
DeepInfra
+1 more
View
DeepSeek logo
DeepSeek V3.2 Exp (Non-reasoning)
DeepSeek
28
685B
(37B active at inference time)
128k
$0.3
40
🤗
DeepInfra
Novita
DeepSeek
View
ServiceNow logo
Apriel-v1.5-15B-Thinker
ServiceNow
28
15B
128k
-
145
🤗
Together.ai
View
Alibaba logo
Qwen3 Coder Next
Alibaba
28
79.7B
(3B active at inference time)
256k
$0.6
151
🤗
Novita
Amazon Bedrock
Together.ai
+1 more
View
DeepSeek logo
DeepSeek V3.1 (Non-reasoning)
DeepSeek
28
685B
(37B active at inference time)
128k
$0.8
-
🤗
Novita
Weights & Biases
Eigen AI
+8 more
View
NVIDIA logo
Nemotron Cascade 2 30B A3B
NVIDIA
28
31.6B
(3B active at inference time)
262k
-
-
🤗
-
View
DeepSeek logo
DeepSeek V3.1 (Reasoning)
DeepSeek
28
685B
(37B active at inference time)
128k
$0.9
-
🤗
Eigen AI
Amazon Bedrock
Google
+2 more
View
Alibaba logo
Qwen3 VL 235B A22B (Reasoning)
Alibaba
28
235B
(22B active at inference time)
262k
$2.6
53
🤗
Alibaba Cloud
Novita
View
ServiceNow logo
Apriel-v1.6-15B-Thinker
ServiceNow
28
15B
128k
-
79
🤗
Together.ai
View
Alibaba logo
Qwen3.5 9B (Non-reasoning)
Alibaba
27
9.65B
262k
$0.1
183
🤗
DeepInfra
View
Mistral logo
Mistral Small 4 (Reasoning)
Mistral
27
119B
(6.5B active at inference time)
256k
$0.3
146
🤗
Mistral
View
Alibaba logo
Qwen3.5 4B (Reasoning)
Alibaba
27
4.66B
262k
$0.1
223
🤗
DeepInfra
View
DeepSeek logo
DeepSeek R1 0528 (May '25)
DeepSeek
27
685B
(37B active at inference time)
128k
$2.4
-
🤗
Microsoft Azure
Nebius
+6 more
View
Alibaba logo
Qwen3 Next 80B A3B (Reasoning)
Alibaba
27
80B
(3B active at inference time)
262k
$1.9
172
🤗
Novita
Alibaba Cloud
Eigen AI
+4 more
View
Z AI logo
GLM-4.5 (Reasoning)
Z AI
26
355B
(32B active at inference time)
128k
$0.8
48
🤗
Novita
DeepInfra
View
Kimi logo
Kimi K2
Kimi
26
1T
(32B active at inference time)
128k
$1.0
37
🤗
Groq
Kimi
DeepInfra
+2 more
View
ByteDance Seed logo
Seed-OSS-36B-Instruct
ByteDance Seed
25
36.2B
512k
$0.3
35
🤗
SiliconFlow
View
Alibaba logo
Qwen3 235B A22B 2507 Instruct
Alibaba
25
235B
(22B active at inference time)
256k
$1.2
48
🤗
Together.ai
DeepInfra
Parasail
+10 more
View
Alibaba logo
Qwen3 Coder 480B A35B Instruct
Alibaba
25
480B
(35B active at inference time)
262k
$3.0
55
🤗
Amazon Bedrock
DeepInfra
Together.ai
+8 more
View
Alibaba logo
Qwen3 VL 32B (Reasoning)
Alibaba
25
33.4B
256k
$2.6
88
🤗
Alibaba Cloud
View
OpenAI logo
gpt-oss-120B (low)
OpenAI
24
117B
(5.1B active at inference time)
131k
$0.3
228
🤗
Novita
Together.ai
Amazon Bedrock
+18 more
View
OpenAI logo
gpt-oss-20B (high)
OpenAI
24
21B
(3.6B active at inference time)
131k
$0.1
271
🤗
DeepInfra
Novita
Nebius
+9 more
View
MiniMax logo
MiniMax M1 80k
MiniMax
24
456B
(45.9B active at inference time)
1.00M
$1.0
-
🤗
Novita
View
NVIDIA logo
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)
NVIDIA
24
31.6B
(3.6B active at inference time)
1.00M
$0.1
142
🤗
Nebius
DeepInfra
View
MBZUAI Institute of Foundation Models logo
K2 Think V2
MBZUAI Institute of Foundation Models
24
70B
262k
-
-
🤗
-
View
LongCat logo
LongCat Flash Lite
LongCat
24
68.5B
(3B active at inference time)
256k
-
115
🤗
LongCat
View
Naver logo
HyperCLOVA X SEED Think (32B)
Naver
24
32B
128k
-
-
🤗
-
View
Z AI logo
GLM-4.6V (Reasoning)
Z AI
23
108B
128k
$0.5
28
🤗
Novita
DeepInfra
SiliconFlow
View
LG AI Research logo
K-EXAONE (Non-reasoning)
LG AI Research
23
236B
(23B active at inference time)
256k
-
-
🤗
-
View
Z AI logo
GLM-4.5-Air
Z AI
23
106B
(12B active at inference time)
128k
$0.4
98
🤗
Together.ai
Nebius
SiliconFlow
+1 more
View
Mistral logo
Mistral Large 3
Mistral
23
675B
(41B active at inference time)
256k
$0.8
43
🤗
Amazon Bedrock
Mistral
Microsoft Azure
View
InclusionAI logo
Ring-1T
InclusionAI
23
1T
(50B active at inference time)
128k
-
-
🤗
-
View
Alibaba logo
Qwen3.5 4B (Non-reasoning)
Alibaba
23
4.66B
262k
$0.1
226
🤗
DeepInfra
View
Alibaba logo
Qwen3 30B A3B 2507 (Reasoning)
Alibaba
22
30.5B
(3.3B active at inference time)
262k
$0.8
164
🤗
Clarifai
Alibaba Cloud
Nebius
View
DeepSeek logo
DeepSeek V3 0324
DeepSeek
22
671B
(37B active at inference time)
128k
$1.3
-
🤗
Replicate
Nebius
Microsoft Azure
+6 more
View
Prime Intellect logo
INTELLECT-3
Prime Intellect
22
107B
131k
-
-
🤗
-
View
Z AI logo
GLM-4.7-Flash (Non-reasoning)
Z AI
22
31.2B
(3B active at inference time)
200k
$0.2
87
🤗
Novita
Amazon Bedrock
View
Mistral logo
Devstral 2
Mistral
22
125B
256k
-
74
🤗
Mistral
View
MiniMax logo
MiniMax M1 40k
MiniMax
21
456B
(45.9B active at inference time)
1.00M
-
-
🤗
-
View
OpenAI logo
gpt-oss-20B (low)
OpenAI
21
21B
(3.6B active at inference time)
131k
$0.1
237
🤗
Hyperbolic
Databricks
DeepInfra
+9 more
View
Alibaba logo
Qwen3 VL 235B A22B Instruct
Alibaba
21
235B
(22B active at inference time)
262k
$1.2
55
🤗
Alibaba Cloud
Parasail
Novita
+3 more
View
MBZUAI Institute of Foundation Models logo
K2-V2 (high)
MBZUAI Institute of Foundation Models
21
70B
512k
-
-
🤗
-
View
Alibaba logo
Qwen3 Next 80B A3B Instruct
Alibaba
20
80B
(3B active at inference time)
262k
$0.9
174
🤗
GMI
Hyperbolic
Alibaba Cloud
+4 more
View
Trillion Labs logo
Tri-21B-think Preview
Trillion Labs
20
21B
32.0k
-
-
🤗
-
View
Alibaba logo
Qwen3 Coder 30B A3B Instruct
Alibaba
20
30.5B
(3.3B active at inference time)
262k
$0.9
28
🤗
Alibaba Cloud
Scaleway
Amazon Bedrock
+2 more
View
Alibaba logo
Qwen3 235B A22B (Reasoning)
Alibaba
20
235B
(22B active at inference time)
32.8k
$2.6
48
🤗
Alibaba Cloud
View
Alibaba logo
QwQ 32B
Alibaba
20
32.8B
131k
$0.7
-
🤗
Cloudflare
View
Alibaba logo
Qwen3 VL 30B A3B (Reasoning)
Alibaba
20
30B
(3B active at inference time)
256k
$0.8
110
🤗
Eigen AI
Alibaba Cloud
Fireworks
+1 more
View
Mistral logo
Devstral Small 2
Mistral
19
24B
256k
-
74
🤗
Mistral
View
InclusionAI logo
Ling-1T
InclusionAI
19
1T
(50B active at inference time)
128k
-
-
🤗
-
View
DeepSeek logo
DeepSeek R1 (Jan '25)
DeepSeek
19
685B
(37B active at inference time)
128k
$2.4
-
🤗
Novita
Hyperbolic
Together.ai
+6 more
View
Google logo
Gemma 4 E4B
Google
19
8B
(4.5B active at inference time)
128k
-
-
🤗
-
View
NVIDIA logo
Llama Nemotron Super 49B v1.5 (Reasoning)
NVIDIA
19
49B
128k
$0.2
65
🤗
DeepInfra
View
MBZUAI Institute of Foundation Models logo
K2-V2 (medium)
MBZUAI Institute of Foundation Models
19
70B
512k
-
-
🤗
-
View
Mistral logo
Mistral Small 4 (Non-reasoning)
Mistral
19
119B
(6.5B active at inference time)
256k
$0.3
125
🤗
Mistral
View
Trillion Labs logo
Tri-21B-Think
Trillion Labs
19
21B
32.0k
-
-
🤗
-
View
Nous Research logo
Hermes 4 - Llama-3.1 405B (Reasoning)
Nous Research
19
406B
128k
$1.5
34
🤗
Nebius
View
NVIDIA logo
Llama 3.3 Nemotron Super 49B v1 (Reasoning)
NVIDIA
18
49B
128k
-
-
🤗
-
View
Meta logo
Llama 4 Maverick
Meta
18
402B
(17B active at inference time)
1.00M
$0.5
119
🤗
Together.ai
Eigen AI
Novita
+9 more
View
Alibaba logo
Qwen3 4B 2507 (Reasoning)
Alibaba
18
4.02B
262k
-
-
🤗
-
View
Mistral logo
Magistral Small 1.2
Mistral
18
24B
128k
$0.8
155
🤗
Amazon Bedrock
Mistral
View
Sarvam logo
Sarvam 105B (high)
Sarvam
18
106B
(10.3B active at inference time)
128k
-
97
🤗
Sarvam
View
Mistral logo
Devstral Small (May '25)
Mistral
18
23.6B
256k
$0.1
-
🤗
DeepInfra
View
Nous Research logo
Hermes 4 - Llama-3.1 405B (Non-reasoning)
Nous Research
18
406B
128k
$1.5
34
🤗
Nebius
View
Meta logo
Llama 3.1 Instruct 405B
Meta
17
405B
128k
$3.7
29
🤗
Amazon Bedrock
Databricks
Microsoft Azure
+1 more
View
Alibaba logo
Qwen3 VL 32B Instruct
Alibaba
17
33.4B
256k
$1.2
72
🤗
Alibaba Cloud
View
DeepSeek logo
DeepSeek R1 Distill Qwen 32B
DeepSeek
17
32B
128k
$0.3
46
🤗
DeepInfra
View
Z AI logo
GLM-4.6V (Non-reasoning)
Z AI
17
108B
128k
$0.5
23
🤗
Novita
SiliconFlow
View
Alibaba logo
Qwen3 235B A22B (Non-reasoning)
Alibaba
17
235B
(22B active at inference time)
32.8k
$1.2
44
🤗
Novita
DeepInfra
Alibaba Cloud
View
Mistral logo
Magistral Small 1
Mistral
17
23.6B
40.0k
-
-
🤗
-
View
LG AI Research logo
EXAONE 4.0 32B (Reasoning)
LG AI Research
17
32B
131k
-
-
🤗
-
View
Alibaba logo
Qwen3 VL 8B (Reasoning)
Alibaba
17
8.77B
256k
$0.7
114
🤗
Alibaba Cloud
View
Alibaba logo
Qwen3 32B (Reasoning)
Alibaba
17
32.8B
32.8k
$2.6
98
🤗
DeepInfra
Novita
Alibaba Cloud
+4 more
View
DeepSeek logo
DeepSeek V3 (Dec '24)
DeepSeek
16
671B
(37B active at inference time)
128k
$0.6
-
🤗
DeepInfra
Hyperbolic
Novita
+2 more
View
DeepSeek logo
DeepSeek R1 0528 Qwen3 8B
DeepSeek
16
8.19B
32.8k
-
-
🤗
-
View
Alibaba logo
Qwen3.5 2B (Reasoning)
Alibaba
16
2.27B
262k
$0.0
354
🤗
DeepInfra
View
Alibaba logo
Qwen3 14B (Reasoning)
Alibaba
16
14.8B
32.8k
$1.3
60
🤗
DeepInfra
Alibaba Cloud
View
Nanbeige logo
Nanbeige4.1-3B
Nanbeige
16
3.93B
256k
-
-
🤗
-
View
Alibaba logo
Qwen3 VL 30B A3B Instruct
Alibaba
16
30B
(3B active at inference time)
256k
$0.3
109
🤗
DeepInfra
Novita
Eigen AI
+1 more
View
Nous Research logo
Hermes 4 - Llama-3.1 70B (Reasoning)
Nous Research
16
70.6B
128k
$0.2
78
🤗
Nebius
View
Mistral logo
Ministral 3 14B
Mistral
16
14B
256k
$0.2
116
🤗
Mistral
Amazon Bedrock
View
DeepSeek logo
DeepSeek R1 Distill Llama 70B
DeepSeek
16
70B
128k
$0.9
39
🤗
Scaleway
DeepInfra
SambaNova
View
DeepSeek logo
DeepSeek R1 Distill Qwen 14B
DeepSeek
16
14B
128k
-
-
🤗
-
View
TII UAE logo
Falcon-H1R-7B
TII UAE
16
7B
256k
-
-
🤗
-
View
InclusionAI logo
Ling-flash-2.0
InclusionAI
16
103B
(6.1B active at inference time)
128k
$0.2
61
🤗
SiliconFlow
View
Alibaba logo
Qwen3 Omni 30B A3B (Reasoning)
Alibaba
16
35.3B
(3B active at inference time)
65.5k
$0.4
93
🤗
Alibaba Cloud
View
Alibaba logo
Qwen2.5 Instruct 72B
Alibaba
16
72B
131k
-
28
🤗
Alibaba Cloud
SiliconFlow
DeepInfra
View
StepFun logo
Step3 VL 10B
StepFun
15
10.2B
65.5k
-
-
🤗
-
View
Alibaba logo
Qwen3 30B A3B (Reasoning)
Alibaba
15
30.5B
(3.3B active at inference time)
32.8k
$0.8
65
🤗
Novita
Alibaba Cloud
Eigen AI
+2 more
View
Google logo
Gemma 4 E2B
Google
15
5.1B
(2.3B active at inference time)
128k
-
-
🤗
-
View
Mistral logo
Devstral Small (Jul '25)
Mistral
15
24B
256k
$0.1
197
🤗
DeepInfra
Mistral
View
Alibaba logo
QwQ 32B-Preview
Alibaba
15
32.8B
32.8k
$0.1
46
🤗
DeepInfra
View
Mistral logo
Mistral Large 2 (Nov '24)
Mistral
15
123B
128k
$3.0
40
🤗
Mistral
View
Z AI logo
GLM-4.5V (Reasoning)
Z AI
15
108B
(12B active at inference time)
64.0k
$0.9
50
🤗
Novita
View
Mistral logo
Mistral Small 3.2
Mistral
15
24B
128k
$0.1
115
🤗
Mistral
DeepInfra
View
NVIDIA logo
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)
NVIDIA
15
253B
128k
$0.9
42
🤗
Nebius
View
Alibaba logo
Qwen3 30B A3B 2507 Instruct
Alibaba
15
30.5B
(3.3B active at inference time)
262k
$0.3
70
🤗
Nebius
Alibaba Cloud
Clarifai
+1 more
View
Baidu logo
ERNIE 4.5 300B A47B
Baidu
15
300B
(47B active at inference time)
131k
$0.5
25
🤗
SiliconFlow
Novita
View
NVIDIA logo
NVIDIA Nemotron Nano 12B v2 VL (Reasoning)
NVIDIA
15
13.2B
128k
$0.3
130
🤗
DeepInfra
View
Mistral logo
Ministral 3 8B
Mistral
15
8B
256k
$0.1
172
🤗
Mistral
Amazon Bedrock
View
NVIDIA logo
NVIDIA Nemotron Nano 9B V2 (Reasoning)
NVIDIA
15
9B
131k
$0.1
158
🤗
DeepInfra
View
NVIDIA logo
NVIDIA Nemotron 3 Nano 4B
NVIDIA
15
3.97B
262k
-
-
🤗
-
View
Alibaba logo
Qwen3.5 2B (Non-reasoning)
Alibaba
15
2.27B
262k
$0.0
273
🤗
DeepInfra
View
NVIDIA logo
Llama Nemotron Super 49B v1.5 (Non-reasoning)
NVIDIA
15
49B
128k
$0.2
66
🤗
DeepInfra
View
Alibaba logo
Qwen3 32B (Non-reasoning)
Alibaba
15
32.8B
32.8k
$1.2
98
🤗
SambaNova
Nebius
DeepInfra
+5 more
View
Meta logo
Llama 3.3 Instruct 70B
Meta
14
70B
128k
$0.6
84
🤗
Scaleway
Groq
Fireworks
+18 more
View
Mistral logo
Mistral Small 3.1
Mistral
14
24B
128k
$0.1
129
🤗
Cloudflare
Mistral
CompactifAI
View
MBZUAI Institute of Foundation Models logo
K2-V2 (low)
MBZUAI Institute of Foundation Models
14
70B
512k
-
-
🤗
-
View
NVIDIA logo
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)
NVIDIA
14
4.51B
128k
-
-
🤗
-
View
Kimi logo
Kimi Linear 48B A3B Instruct
Kimi
14
49.1B
(3B active at inference time)
1.00M
-
-
🤗
-
View
NVIDIA logo
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)
NVIDIA
14
49B
128k
-
-
🤗
-
View
Alibaba logo
Qwen3 VL 8B Instruct
Alibaba
14
8.77B
256k
$0.3
118
🤗
Together.ai
Alibaba Cloud
View
Alibaba logo
Qwen3 4B (Reasoning)
Alibaba
14
4.02B
32.0k
$0.4
90
🤗
Alibaba Cloud
View
Allen Institute for AI logo
Llama 3.1 Tulu3 405B
Allen Institute for AI
14
405B
128k
-
-
🤗
-
View
InclusionAI logo
Ring-flash-2.0
InclusionAI
14
103B
(6.1B active at inference time)
128k
$0.2
86
🤗
SiliconFlow
View
Mistral logo
Pixtral Large
Mistral
14
124B
128k
$3.0
52
🤗
Mistral
View
Allen Institute for AI logo
Olmo 3.1 32B Think
Allen Institute for AI
14
32.2B
65.5k
-
95
🤗
Parasail
View
xAI logo
Grok 2 (Dec '24)
xAI
14
270B
131k
-
-
🤗
-
View
Alibaba logo
Qwen3 VL 4B (Reasoning)
Alibaba
14
4.44B
256k
-
-
🤗
-
View
Meta logo
Llama 4 Scout
Meta
14
109B
(17B active at inference time)
10.0M
$0.3
144
🤗
Google
CompactifAI
Novita
+6 more
View
Cohere logo
Command A
Cohere
13
111B
256k
$4.4
41
🤗
Microsoft Azure
Cohere
View
NVIDIA logo
Llama 3.1 Nemotron Instruct 70B
NVIDIA
13
70B
128k
$1.2
46
🤗
DeepInfra
View
Alibaba logo
Qwen2.5 Instruct 32B
Alibaba
13
32B
128k
-
-
🤗
-
View
Alibaba logo
Qwen3 8B (Reasoning)
Alibaba
13
8.19B
131k
$0.7
81
🤗
Eigen AI
Alibaba Cloud
View
NVIDIA logo
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)
NVIDIA
13
31.6B
(3.6B active at inference time)
1.00M
$0.1
97
🤗
DeepInfra
View
NVIDIA logo
NVIDIA Nemotron Nano 9B V2 (Non-reasoning)
NVIDIA
13
9B
131k
$0.1
166
🤗
DeepInfra
Amazon Bedrock
View
Mistral logo
Mistral Large 2 (Jul '24)
Mistral
13
123B
128k
$3.0
-
🤗
Amazon Bedrock
View
Alibaba logo
Qwen3 4B 2507 Instruct
Alibaba
13
4.02B
262k
-
-
🤗
-
View
Alibaba logo
Qwen2.5 Coder Instruct 32B
Alibaba
13
32B
131k
-
-
🤗
-
View
Alibaba logo
Qwen3 14B (Non-reasoning)
Alibaba
13
14.8B
32.8k
$0.6
60
🤗
Alibaba Cloud
DeepInfra
View
Z AI logo
GLM-4.5V (Non-reasoning)
Z AI
13
108B
(12B active at inference time)
64.0k
$0.9
53
🤗
Novita
View
Mistral logo
Mistral Small 3
Mistral
13
24B
32.0k
$0.1
127
🤗
Together.ai
DeepInfra
Mistral
View
Nous Research logo
Hermes 4 - Llama-3.1 70B (Non-reasoning)
Nous Research
13
70.6B
128k
$0.2
78
🤗
Nebius
View
Alibaba logo
Qwen3 30B A3B (Non-reasoning)
Alibaba
13
30.5B
(3.3B active at inference time)
32.8k
$0.3
63
🤗
Alibaba Cloud
DeepInfra
Eigen AI
View
DeepSeek logo
DeepSeek-V2.5 (Dec '24)
DeepSeek
13
236B
(21B active at inference time)
128k
-
-
🤗
-
View
Alibaba logo
Qwen3 4B (Non-reasoning)
Alibaba
12
4.02B
32.0k
$0.2
87
🤗
Alibaba Cloud
View
Meta logo
Llama 3.1 Instruct 70B
Meta
12
70B
128k
$0.6
20
🤗
DeepInfra
Amazon Bedrock
+1 more
View
Sarvam logo
Sarvam 30B (high)
Sarvam
12
32.2B
(2.4B active at inference time)
65.5k
-
150
🤗
Sarvam
View
DeepSeek logo
DeepSeek-V2.5
DeepSeek
12
236B
(21B active at inference time)
128k
-
-
🤗
-
View
Allen Institute for AI logo
Olmo 3.1 32B Instruct
Allen Institute for AI
12
32.2B
65.5k
$0.3
54
🤗
DeepInfra
View
DeepSeek logo
DeepSeek R1 Distill Llama 8B
DeepSeek
12
8B
128k
-
-
🤗
-
View
Allen Institute for AI logo
Olmo 3 32B Think
Allen Institute for AI
12
32.2B
65.5k
-
-
🤗
-
View
Perplexity logo
R1 1776
Perplexity
12
671B
(37B active at inference time)
128k
-
-
🤗
-
View
Meta logo
Llama 3.2 Instruct 90B (Vision)
Meta
12
90B
128k
$0.7
42
🤗
Microsoft Azure
Amazon Bedrock
DeepInfra
View
Upstage logo
Solar Mini
Upstage
12
10.7B
4.10k
$0.1
-
🤗
Upstage
View
Meta logo
Llama 3.1 Instruct 8B
Meta
12
8B
128k
$0.1
155
🤗
Microsoft Azure
Eigen AI
Nebius
+14 more
View
xAI logo
Grok-1
xAI
12
314B
(78B active at inference time)
8.19k
-
-
🤗
-
View
Alibaba logo
Qwen2 Instruct 72B
Alibaba
12
72B
131k
-
-
🤗
-
View
LG AI Research logo
EXAONE 4.0 32B (Non-reasoning)
LG AI Research
12
32B
131k
-
-
🤗
-
View
Mistral logo
Ministral 3 3B
Mistral
11
3B
256k
$0.1
263
🤗
Amazon Bedrock
Mistral
View
Nous Research logo
DeepHermes 3 - Mistral 24B Preview (Non-reasoning)
Nous Research
11
24B
32.0k
-
-
🤗
-
View
AI21 Labs logo
Jamba 1.7 Large
AI21 Labs
11
398B
(94B active at inference time)
256k
$3.5
60
🤗
AI21 Labs
View
IBM logo
Granite 4.0 H Small
IBM
11
32B
(9B active at inference time)
128k
$0.1
405
🤗
Replicate
View
AI21 Labs logo
Jamba 1.5 Large
AI21 Labs
11
398B
(94B active at inference time)
256k
$3.5
-
🤗
Amazon Bedrock
View
Alibaba logo
Qwen3 Omni 30B A3B Instruct
Alibaba
11
35.3B
(3B active at inference time)
65.5k
$0.4
91
🤗
Alibaba Cloud
View
Nous Research logo
Hermes 3 - Llama-3.1 70B
Nous Research
11
70.6B
128k
$0.3
30
🤗
DeepInfra
View
Alibaba logo
Qwen3 8B (Non-reasoning)
Alibaba
11
8.19B
32.8k
$0.3
82
🤗
Fireworks
Alibaba Cloud
Eigen AI
View
DeepSeek logo
DeepSeek-Coder-V2
DeepSeek
11
236B
(21B active at inference time)
128k
-
-
🤗
-
View
Allen Institute for AI logo
OLMo 2 32B
Allen Institute for AI
11
32.2B
4.10k
-
-
🤗
-
View
AI21 Labs logo
Jamba 1.6 Large
AI21 Labs
11
398B
(94B active at inference time)
256k
$3.5
61
🤗
AI21 Labs
View
Alibaba logo
Qwen3.5 0.8B (Reasoning)
Alibaba
11
0.873B
262k
$0.0
418
🤗
DeepInfra
View
Liquid AI logo
LFM2 24B A2B
Liquid AI
10
23.8B
(2.3B active at inference time)
32.8k
$0.1
225
🤗
Together.ai
View
Microsoft Azure logo
Phi-4
Microsoft Azure
10
14B
16.0k
$0.2
34
🤗
Microsoft Azure
DeepInfra
View
Google logo
Gemma 3 27B Instruct
Google
10
27.4B
128k
-
29
🤗
DeepInfra
Novita
Google
+3 more
View
Mistral logo
Mistral Small (Sep '24)
Mistral
10
22B
32.8k
$0.3
127
🤗
Mistral
View
Microsoft Azure logo
Phi-3 Mini Instruct 3.8B
Microsoft Azure
10
3.8B
4.10k
-
-
🤗
-
View
NVIDIA logo
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)
NVIDIA
10
13.2B
128k
$0.3
139
🤗
Nebius
Amazon Bedrock
DeepInfra
View
Google logo
Gemma 3n E4B Instruct Preview (May '25)
Google
10
8.39B
(4B active at inference time)
32.0k
-
-
🤗
-
View
Microsoft Azure logo
Phi-4 Multimodal Instruct
Microsoft Azure
10
5.6B
128k
-
18
🤗
Microsoft Azure
View
Alibaba logo
Qwen2.5 Coder Instruct 7B
Alibaba
10
7.62B
131k
-
-
🤗
-
View
Alibaba logo
Qwen3.5 0.8B (Non-reasoning)
Alibaba
10
0.873B
262k
$0.0
303
🤗
DeepInfra
View
Mistral logo
Mixtral 8x22B Instruct
Mistral
10
141B
(39B active at inference time)
65.4k
-
-
🤗
-
View
Meta logo
Llama 2 Chat 7B
Meta
10
7B
4.10k
$0.1
-
🤗
Replicate
View
Meta logo
Llama 3.2 Instruct 3B
Meta
10
3B
128k
$0.1
51
🤗
Amazon Bedrock
DeepInfra
View
AI21 Labs logo
Jamba Reasoning 3B
AI21 Labs
10
3B
262k
-
-
🤗
-
View
Alibaba logo
Qwen3 VL 4B Instruct
Alibaba
10
4.44B
256k
-
-
🤗
-
View
Alibaba logo
Qwen1.5 Chat 110B
Alibaba
10
110B
32.0k
-
-
🤗
-
View
Reka AI logo
Reka Flash 3
Reka AI
10
21B
128k
$0.3
-
🤗
Reka AI
View
Allen Institute for AI logo
Olmo 3 7B Think
Allen Institute for AI
9
7B
65.5k
-
-
🤗
-
View
Allen Institute for AI logo
OLMo 2 7B
Allen Institute for AI
9
7.3B
4.10k
-
-
🤗
-
View
Allen Institute for AI logo
Molmo 7B-D
Allen Institute for AI
9
8.02B
4.10k
-
-
🤗
-
View
InclusionAI logo
Ling-mini-2.0
InclusionAI
9
16.3B
(1.4B active at inference time)
131k
-
-
🤗
-
View
DeepSeek logo
DeepSeek R1 Distill Qwen 1.5B
DeepSeek
9
1.5B
128k
-
-
🤗
-
View
DeepSeek logo
DeepSeek-V2-Chat
DeepSeek
9
236B
(21B active at inference time)
128k
-
-
🤗
-
View
Meta logo
Llama 3 Instruct 70B
Meta
9
70B
8.19k
$0.9
-
🤗
Novita
Replicate
DeepInfra
+1 more
View
Snowflake logo
Arctic Instruct
Snowflake
9
480B
(17B active at inference time)
4.00k
-
-
🤗
-
View
Alibaba logo
Qwen Chat 72B
Alibaba
9
72B
33.8k
-
-
🤗
-
View
Google logo
Gemma 3 12B Instruct
Google
9
12.2B
128k
-
29
🤗
Google
Cloudflare
DeepInfra
+2 more
View
Meta logo
Llama 3.2 Instruct 11B (Vision)
Meta
9
11B
128k
$0.2
52
🤗
Microsoft Azure
DeepInfra
Amazon Bedrock
View
DeepSeek logo
DeepSeek Coder V2 Lite Instruct
DeepSeek
8
16B
(2.4B active at inference time)
128k
-
-
🤗
-
View
Microsoft Azure logo
Phi-4 Mini Instruct
Microsoft Azure
8
3.84B
128k
-
43
🤗
Weights & Biases
Microsoft Azure
View
Sarvam logo
Sarvam M (Reasoning)
Sarvam
8
23.6B
32.8k
-
-
🤗
-
View
Meta logo
Llama 2 Chat 70B
Meta
8
70B
4.10k
-
-
🤗
-
View
DeepSeek logo
DeepSeek LLM 67B Chat (V1)
DeepSeek
8
67B
4.10k
-
-
🤗
-
View
Meta logo
Llama 2 Chat 13B
Meta
8
13B
4.10k
-
-
🤗
-
View
Cohere logo
Command-R+ (Apr '24)
Cohere
8
104B
128k
$6.0
-
🤗
Amazon Bedrock
View
OpenChat logo
OpenChat 3.5 (1210)
OpenChat
8
7B
8.19k
-
-
🤗
-
View
Databricks logo
DBRX Instruct
Databricks
8
132B
(36B active at inference time)
32.8k
-
-
🤗
-
View
LG AI Research logo
Exaone 4.0 1.2B (Reasoning)
LG AI Research
8
1.28B
64.0k
-
-
🤗
-
View
Allen Institute for AI logo
Olmo 3 7B Instruct
Allen Institute for AI
8
7B
65.5k
$0.1
-
🤗
Parasail
View
LG AI Research logo
Exaone 4.0 1.2B (Non-reasoning)
LG AI Research
8
1.28B
64.0k
-
-
🤗
-
View
Liquid AI logo
LFM2.5-1.2B-Thinking
Liquid AI
8
1.17B
32.0k
-
-
🤗
-
View
AI21 Labs logo
Jamba 1.7 Mini
AI21 Labs
8
52B
(12B active at inference time)
258k
-
-
🤗
-
View
Liquid AI logo
LFM2.5-1.2B-Instruct
Liquid AI
8
1.17B
32.0k
-
-
🤗
?
View
Liquid AI logo
LFM2 2.6B
Liquid AI
8
2.57B
32.8k
-
-
🤗
?
View
AI21 Labs logo
Jamba 1.5 Mini
AI21 Labs
8
52B
(12B active at inference time)
256k
$0.3
-
🤗
Amazon Bedrock
View
IBM logo
Granite 4.0 H 1B
IBM
8
1.5B
128k
-
-
🤗
-
View
Alibaba logo
Qwen3 1.7B (Reasoning)
Alibaba
8
2.03B
32.0k
$0.4
127
🤗
Alibaba Cloud
View
AI21 Labs logo
Jamba 1.6 Mini
AI21 Labs
8
52B
(12B active at inference time)
256k
$0.3
184
🤗
AI21 Labs
View
Mistral logo
Mixtral 8x7B Instruct
Mistral
8
46.7B
(12.9B active at inference time)
32.8k
$0.5
-
🤗
Together.ai
Amazon Bedrock
DeepInfra
View
Google logo
Gemma 3 270M
Google
8
0.268B
32.0k
-
-
🤗
-
View
Swiss AI Initiative logo
Apertus 70B Instruct
Swiss AI Initiative
8
70B
65.5k
$1.3
-
🤗
Public AI
View
IBM logo
Granite 4.0 Micro
IBM
8
3B
128k
-
-
🤗
-
View
Nous Research logo
DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)
Nous Research
8
8B
128k
-
-
🤗
-
View
Meta logo
Llama 65B
Meta
7
65B
2.05k
-
-
Not available
-
View
Alibaba logo
Qwen Chat 14B
Alibaba
7
14B
8.19k
-
-
🤗
-
View
Mistral logo
Mistral 7B Instruct
Mistral
7
7B
8.19k
$0.3
172
🤗
Amazon Bedrock
Mistral
View
Cohere logo
Command-R (Mar '24)
Cohere
7
35B
128k
$0.8
-
🤗
Amazon Bedrock
View
IBM logo
Granite 4.0 1B
IBM
7
1.6B
128k
-
-
🤗
-
View
Allen Institute for AI logo
Molmo2-8B
Allen Institute for AI
7
8.66B
36.9k
-
-
🤗
Parasail
View
Liquid AI logo
LFM2 8B A1B
Liquid AI
7
8.34B
(1.5B active at inference time)
32.8k
-
-
🤗
?
View
IBM logo
Granite 3.3 8B (Non-reasoning)
IBM
7
8.17B
128k
$0.1
391
🤗
Replicate
View
Alibaba logo
Qwen3 1.7B (Non-reasoning)
Alibaba
7
2.03B
32.0k
$0.2
124
🤗
Alibaba Cloud
View
Alibaba logo
Qwen3 0.6B (Reasoning)
Alibaba
6
0.752B
32.0k
$0.4
161
🤗
Alibaba Cloud
View
Google logo
Gemma 3n E4B Instruct
Google
6
8.39B
(4B active at inference time)
32.0k
$0.0
29
🤗
Together.ai
View
Meta logo
Llama 3 Instruct 8B
Meta
6
8B
8.19k
$0.1
-
🤗
Amazon Bedrock
Replicate
Novita
+1 more
View
Liquid AI logo
LFM2 1.2B
Liquid AI
6
1.17B
32.8k
-
-
🤗
?
View
Google logo
Gemma 3 4B Instruct
Google
6
4.3B
128k
-
31
🤗
DeepInfra
Google
Amazon Bedrock
View
Meta logo
Llama 3.2 Instruct 1B
Meta
6
1B
128k
$0.1
87
🤗
Amazon Bedrock
Novita
View
Liquid AI logo
LFM2.5-VL-1.6B
Liquid AI
6
1.6B
32.0k
-
-
🤗
?
View
IBM logo
Granite 4.0 350M
IBM
6
0.35B
32.8k
-
-
🤗
-
View
Swiss AI Initiative logo
Apertus 8B Instruct
Swiss AI Initiative
6
8B
65.5k
$0.1
-
🤗
Public AI
View
Alibaba logo
Qwen3 0.6B (Non-reasoning)
Alibaba
6
0.752B
32.0k
$0.2
163
🤗
Alibaba Cloud
View
Google logo
Gemma 3 1B Instruct
Google
6
1B
32.0k
-
46
🤗
Google
View
IBM logo
Granite 4.0 H 350M
IBM
5
0.34B
32.8k
-
-
🤗
-
View
Google logo
Gemma 3n E2B Instruct
Google
5
5.98B
(2B active at inference time)
32.0k
-
-
🤗
Google
View
Cohere logo
Tiny Aya Global
Cohere
5
3.35B
8.19k
-
-
🤗
-
View
Deep Cogito logo
Cogito v2.1 (Reasoning)
Deep Cogito
-
671B
(37B active at inference time)
128k
$1.3
89
🤗
Together.ai
View