Stay connected with us on X, Discord, and LinkedIn to stay up to date with future analysis

Comparison of Open Source Models

Comparison and analysis of open source AI models across key performance metrics including quality, performance, inference speed, context window, parameter count & licensing details. Models are considered Open Source (also commonly referred to as open weights) where their weights are accessible to download. This allows self-hosting on your own infrastructure and enables customizing the model such as through fine-tuning. Click on any model to see detailed metrics. For more details relating to our methodology, see our FAQs.

Z AI logoGLM-5 and Kimi logoKimi K2.5 are the highest intelligence open source models, followed by Alibaba logoQwen3.5 397B A17B & Z AI logoGLM-4.7.

Intelligence
Artificial Analysis Intelligence Index; Higher is better
Estimate (independent evaluation forthcoming)
Total Parameters
Trainable parameters in billions

Openness

Artificial Analysis Openness Index: Results

Openness Index assesses model openness on a 0 to 100 normalized scale (higher is more open)

Open Source Progress

Progress in Open Weights vs. Proprietary Intelligence

Artificial Analysis Intelligence Index v4.0 incorporates 10 evaluations: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt
Open Weights
Proprietary

Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Indicates whether the model weights are available. Models are labelled as 'Commercial Use Restricted' if the weights are available but commercial use is limited (typically requires obtaining a paid license).

Artificial Analysis Intelligence Index

Artificial Analysis Intelligence Index v4.0 incorporates 10 evaluations: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt
Estimate (independent evaluation forthcoming)

Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

{"@context":"https://schema.org","@type":"Dataset","name":"Artificial Analysis Intelligence Index","creator":{"@type":"Organization","name":"Artificial Analysis","url":"https://artificialanalysis.ai"},"description":"Artificial Analysis Intelligence Index: Includes GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt evaluations spanning reasoning, knowledge, math & coding; Evaluation results measured independently by Artificial Analysis","measurementTechnique":"Independent test run by Artificial Analysis on dedicated hardware.","spatialCoverage":"Worldwide","keywords":["analytics","llm","AI","benchmark","model","gpt","claude"],"license":"https://artificialanalysis.ai/docs/legal/Terms-of-Use.pdf","isAccessibleForFree":true,"citation":"Artificial Analysis (2025). LLM benchmarks dataset. https://artificialanalysis.ai","data":""}

Open Source Language Models Intelligence By Lab Over Time

Artificial Analysis Intelligence Index v4.0 incorporates 10 evaluations: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt
Alibaba
DeepSeek
Google
Meta
Microsoft Azure
Mistral
NVIDIA

Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Open Source Models Intelligence By Size Over Time

Artificial Analysis Intelligence Index v4.0 incorporates 10 evaluations: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt
Large Models (>150B)
Medium Models (40B-150B)
Small Models (4B-40B)
Tiny Models (≤4B)

Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

  • Tiny: Less than or equal to 4B parameters. These are usually the smallest models in terms of resource demand.
  • Small: Less than 40B parameters.
  • Medium: Between 40B-150B parameters.
  • Large: Over 150B parameters.

Intelligence Evaluations

Intelligence evaluations measured independently by Artificial Analysis; Higher is better
Results claimed by AI Lab (not yet independently verified)
GDPval-AA (Agentic Real-World Work Tasks, (ELO-500)/2000)
Terminal-Bench Hard (Agentic Coding & Terminal Use)
𝜏²-Bench Telecom (Agentic Tool Use)
AA-LCR (Long Context Reasoning)
AA-Omniscience Accuracy (Knowledge)
AA-Omniscience Non-Hallucination Rate (1 - Hallucination Rate)
Humanity's Last Exam (Reasoning & Knowledge)
GPQA Diamond (Scientific Reasoning)
SciCode (Coding)
IFBench (Instruction Following)
CritPt (Physics Reasoning)
MMMU Pro (Visual Reasoning)

While model intelligence generally translates across use cases, specific evaluations may be more relevant for certain use cases.

Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Size

Intelligence Index By Model Size

Artificial Analysis Intelligence Index v4.0 incorporates 10 evaluations: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt
Estimate (independent evaluation forthcoming)
Large Models (>150B)
Medium Models (40B-150B)
Small Models (4B-40B)

Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Indicates whether the model weights are available. Models are labelled as 'Commercial Use Restricted' if the weights are available but commercial use is limited (typically requires obtaining a paid license).

  • Tiny: Less than or equal to 4B parameters. These are usually the smallest models in terms of resource demand.
  • Small: Less than 40B parameters.
  • Medium: Between 40B-150B parameters.
  • Large: Over 150B parameters.

Model Size: Total and Active Parameters

Comparison between total model parameters and parameters active during inference
Active Parameters
Passive Parameters

The total number of trainable weights and biases in the model, expressed in billions. These parameters are learned during training and determine the model's ability to process and generate responses.

The number of parameters actually executed during each inference forward pass, expressed in billions. For Mixture of Experts (MoE) models, a routing mechanism selects a subset of experts per token, resulting in fewer active than total parameters. Dense models use all parameters, so active equals total.

Intelligence vs. Active Parameters

Active Parameters at Inference Time; Artificial Analysis Intelligence Index
Most attractive quadrant
Alibaba
DeepSeek
Kimi
LG AI Research
MBZUAI Institute of Foundation Models
Meta
MiniMax
Mistral
NVIDIA
OpenAI
Xiaomi
Z AI

Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

The number of parameters actually executed during each inference forward pass, expressed in billions. For Mixture of Experts (MoE) models, a routing mechanism selects a subset of experts per token, resulting in fewer active than total parameters. Dense models use all parameters, so active equals total.

Intelligence vs. Total Parameters

Artificial Analysis Intelligence Index; Size in Parameters (Billions)
Most attractive quadrant
Alibaba
DeepSeek
Kimi
LG AI Research
MBZUAI Institute of Foundation Models
Meta
MiniMax
Mistral
NVIDIA
OpenAI
Xiaomi
Z AI

Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

The total number of trainable weights and biases in the model, expressed in billions. These parameters are learned during training and determine the model's ability to process and generate responses.

Context Window

Context Window

Context Window: Tokens Limit; Higher is better

Larger context windows are relevant to RAG (Retrieval Augmented Generation) LLM workflows which typically involve reasoning and information retrieval of large amounts of data.

Maximum number of combined input & output tokens. Output tokens commonly have a significantly lower limit (varied by model).

{"@context":"https://schema.org","@type":"Dataset","name":"Context Window","creator":{"@type":"Organization","name":"Artificial Analysis","url":"https://artificialanalysis.ai"},"description":"Context window is the maximum number of tokens a model can accept in a single request. Higher limits allow longer prompts, documents, and more complex instructions.","measurementTechnique":"Independent test run by Artificial Analysis on dedicated hardware.","spatialCoverage":"Worldwide","keywords":["analytics","llm","AI","benchmark","model","gpt","claude"],"license":"https://artificialanalysis.ai/docs/legal/Terms-of-Use.pdf","isAccessibleForFree":true,"citation":"Artificial Analysis (2025). LLM benchmarks dataset. https://artificialanalysis.ai","data":""}

Further details
WeightsProvider
Benchmarks
Z AI logo
GLM-5 (Reasoning)
Z AI
50
744B
(40B active at inference time)
200k
$1.6
67
🤗
Novita
DeepInfra
Together.ai
+5 more
View
Kimi logo
Kimi K2.5 (Reasoning)
Kimi
47
1.0KB
(32B active at inference time)
256k
$1.2
41
🤗
Novita
DeepInfra
Baseten
+6 more
View
Alibaba logo
Qwen3.5 397B A17B (Reasoning)
Alibaba
45
397B
(17B active at inference time)
262k
$1.4
87
🤗
Together.ai
Alibaba Cloud
Novita
+1 more
View
Z AI logo
GLM-4.7 (Reasoning)
Z AI
42
357B
(32B active at inference time)
200k
$0.9
107
🤗
Google
Baseten
Novita
+7 more
View
Alibaba logo
Qwen3.5 27B (Reasoning)
Alibaba
42
27.8B
262k
$0.8
100
🤗
Novita
Alibaba Cloud
View
MiniMax logo
MiniMax-M2.5
MiniMax
42
230B
(10B active at inference time)
205k
$0.5
57
🤗
SambaNova
Parasail
Together.ai
+6 more
View
DeepSeek logo
DeepSeek V3.2 (Reasoning)
DeepSeek
42
685B
(37B active at inference time)
128k
$0.3
46
🤗
Nebius
SiliconFlow
Fireworks
+5 more
View
Alibaba logo
Qwen3.5 122B A10B (Reasoning)
Alibaba
42
125B
(10B active at inference time)
262k
$1.1
116
🤗
Alibaba Cloud
Novita
View
Xiaomi logo
MiMo-V2-Flash (Feb 2026)
Xiaomi
41
309B
(15B active at inference time)
256k
$0.1
154
🤗
Xiaomi
View
Kimi logo
Kimi K2 Thinking
Kimi
41
1.0KB
(32B active at inference time)
256k
$1.1
66
🤗
DeepInfra
Kimi
Microsoft Azure
+5 more
View
Z AI logo
GLM-5 (Non-reasoning)
Z AI
41
744B
(40B active at inference time)
200k
$1.6
45
🤗
Fireworks
DeepInfra
Novita
+1 more
View
Alibaba logo
Qwen3.5 397B A17B (Non-reasoning)
Alibaba
40
397B
(17B active at inference time)
262k
$1.4
88
🤗
Novita
Alibaba Cloud
View
MiniMax logo
MiniMax-M2.1
MiniMax
39
230B
(10B active at inference time)
205k
$0.5
53
🤗
Fireworks
MiniMax
Novita
+3 more
View
Xiaomi logo
MiMo-V2-Flash (Reasoning)
Xiaomi
39
309B
(15B active at inference time)
256k
$0.1
167
🤗
Xiaomi
View
Kimi logo
Kimi K2.5 (Non-reasoning)
Kimi
37
1.0KB
(32B active at inference time)
256k
$1.2
39
🤗
Fireworks
Together.ai
Novita
+3 more
View
Alibaba logo
Qwen3.5 35B A3B (Reasoning)
Alibaba
37
36B
262k
$0.7
166
🤗
Alibaba Cloud
Novita
View
MiniMax logo
MiniMax-M2
MiniMax
36
230B
(10B active at inference time)
205k
$0.5
52
🤗
MiniMax
Amazon Bedrock
Novita
+1 more
View
Z AI logo
GLM-4.7 (Non-reasoning)
Z AI
34
357B
(32B active at inference time)
200k
$0.9
104
🤗
Parasail
SiliconFlow
Baseten
+5 more
View
DeepSeek logo
DeepSeek V3.2 Speciale
DeepSeek
34
685B
(37B active at inference time)
128k
-
-
🤗
-
View
DeepSeek logo
DeepSeek V3.1 Terminus (Reasoning)
DeepSeek
34
685B
(37B active at inference time)
128k
$0.8
-
🤗
Eigen AI
Novita
SambaNova
View
OpenAI logo
gpt-oss-120B (high)
OpenAI
33
117B
(5.1B active at inference time)
131k
$0.3
302
🤗
SambaNova
Baseten
Microsoft Azure
+19 more
View
DeepSeek logo
DeepSeek V3.2 Exp (Reasoning)
DeepSeek
33
685B
(37B active at inference time)
128k
$0.3
44
🤗
DeepSeek
Novita
View
Z AI logo
GLM-4.6 (Reasoning)
Z AI
33
357B
(32B active at inference time)
200k
$1.0
97
🤗
DeepInfra
Baseten
Novita
+1 more
View
LG AI Research logo
K-EXAONE (Reasoning)
LG AI Research
32
236B
(23B active at inference time)
256k
-
-
🤗
-
View
DeepSeek logo
DeepSeek V3.2 (Non-reasoning)
DeepSeek
32
685B
(37B active at inference time)
128k
$0.3
45
🤗
DeepSeek
SiliconFlow
SambaNova
+7 more
View
Kimi logo
Kimi K2 0905
Kimi
31
1.0KB
(32B active at inference time)
256k
$1.2
65
🤗
Fireworks
Parasail
Groq
+2 more
View
Xiaomi logo
MiMo-V2-Flash (Non-reasoning)
Xiaomi
30
309B
(15B active at inference time)
256k
$0.1
141
🤗
Xiaomi
View
Z AI logo
GLM-4.6 (Non-reasoning)
Z AI
30
357B
(32B active at inference time)
200k
$1.0
77
🤗
Novita
Together.ai
View
Z AI logo
GLM-4.7-Flash (Reasoning)
Z AI
30
31.2B
(3B active at inference time)
200k
$0.1
61
🤗
Novita
DeepInfra
View
Alibaba logo
Qwen3 235B A22B 2507 (Reasoning)
Alibaba
30
235B
(22B active at inference time)
256k
$2.6
39
🤗
Novita
Alibaba Cloud
Hyperbolic
+2 more
View
DeepSeek logo
DeepSeek V3.1 Terminus (Non-reasoning)
DeepSeek
29
685B
(37B active at inference time)
128k
$0.6
-
🤗
DeepInfra
Eigen AI
Novita
+1 more
View
DeepSeek logo
DeepSeek V3.2 Exp (Non-reasoning)
DeepSeek
28
685B
(37B active at inference time)
128k
$0.3
46
🤗
DeepInfra
DeepSeek
Novita
View
ServiceNow logo
Apriel-v1.5-15B-Thinker
ServiceNow
28
15B
128k
-
144
🤗
Together.ai
View
Alibaba logo
Qwen3 Coder Next
Alibaba
28
79.7B
(3B active at inference time)
256k
$0.5
120
🤗
Novita
Together.ai
Parasail
View
DeepSeek logo
DeepSeek V3.1 (Non-reasoning)
DeepSeek
28
685B
(37B active at inference time)
128k
$0.8
-
🤗
DeepInfra
Fireworks
Google
+6 more
View
DeepSeek logo
DeepSeek V3.1 (Reasoning)
DeepSeek
28
685B
(37B active at inference time)
128k
$0.9
-
🤗
SambaNova
Amazon Bedrock
Novita
+1 more
View
Alibaba logo
Qwen3 VL 235B A22B (Reasoning)
Alibaba
28
235B
(22B active at inference time)
262k
$2.6
31
🤗
Alibaba Cloud
Novita
View
ServiceNow logo
Apriel-v1.6-15B-Thinker
ServiceNow
28
15B
128k
-
141
🤗
Together.ai
View
DeepSeek logo
DeepSeek R1 0528 (May '25)
DeepSeek
27
685B
(37B active at inference time)
128k
$2.4
-
🤗
Nebius
Novita
SambaNova
+6 more
View
Alibaba logo
Qwen3 Next 80B A3B (Reasoning)
Alibaba
27
80B
(3B active at inference time)
262k
$1.9
129
🤗
Google
Hyperbolic
Eigen AI
+4 more
View
Z AI logo
GLM-4.5 (Reasoning)
Z AI
26
355B
(32B active at inference time)
128k
$0.8
38
🤗
Novita
DeepInfra
View
Kimi logo
Kimi K2
Kimi
26
1.0KB
(32B active at inference time)
128k
$1.1
44
🤗
Novita
Groq
DeepInfra
+4 more
View
Z AI logo
GLM-4.5-Air
Z AI
26
106B
(12B active at inference time)
128k
$0.4
121
🤗
DeepInfra
Together.ai
Nebius
+1 more
View
ByteDance Seed logo
Seed-OSS-36B-Instruct
ByteDance Seed
25
36.2B
512k
$0.3
32
🤗
SiliconFlow
View
Alibaba logo
Qwen3 235B A22B 2507 Instruct
Alibaba
25
235B
(22B active at inference time)
256k
$1.2
43
🤗
Parasail
Amazon Bedrock
Hyperbolic
+7 more
View
Alibaba logo
Qwen3 Coder 480B A35B Instruct
Alibaba
25
480B
(35B active at inference time)
262k
$3.0
39
🤗
DeepInfra
Hyperbolic
Google
+7 more
View
Alibaba logo
Qwen3 VL 32B (Reasoning)
Alibaba
25
33.4B
256k
$2.6
85
🤗
Alibaba Cloud
View
Alibaba logo
Qwen3 30B A3B 2507 (Reasoning)
Alibaba
25
30.5B
(3.3B active at inference time)
262k
$0.8
150
🤗
Alibaba Cloud
Nebius
Clarifai
View
MBZUAI Institute of Foundation Models logo
K2-V2 (high)
MBZUAI Institute of Foundation Models
25
70B
512k
-
-
🤗
-
View
OpenAI logo
gpt-oss-120B (low)
OpenAI
24
117B
(5.1B active at inference time)
131k
$0.3
294
🤗
Clarifai
SambaNova
Nebius
+17 more
View
OpenAI logo
gpt-oss-20B (high)
OpenAI
24
21B
(3.6B active at inference time)
131k
$0.1
293
🤗
Cloudflare
Lightning AI
Novita
+7 more
View
MiniMax logo
MiniMax M1 80k
MiniMax
24
456B
(45.9B active at inference time)
1.00M
$1.0
-
🤗
Novita
View
NVIDIA logo
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)
NVIDIA
24
31.6B
(3.6B active at inference time)
1.00M
$0.1
174
🤗
DeepInfra
Nebius
View
MBZUAI Institute of Foundation Models logo
K2 Think V2
MBZUAI Institute of Foundation Models
24
70B
262k
-
-
Not available
-
View
NVIDIA logo
Llama Nemotron Super 49B v1.5 (Reasoning)
NVIDIA
24
49B
128k
$0.2
76
🤗
DeepInfra
View
Prime Intellect logo
INTELLECT-3
Prime Intellect
24
107B
131k
-
-
🤗
-
View
Naver logo
HyperCLOVA X SEED Think (32B)
Naver
24
32B
128k
-
-
🤗
-
View
Alibaba logo
Qwen3 Next 80B A3B Instruct
Alibaba
24
80B
(3B active at inference time)
262k
$0.9
131
🤗
Hyperbolic
Google
DeepInfra
+4 more
View
InclusionAI logo
Ling-1T
InclusionAI
24
1.0KB
(50B active at inference time)
128k
-
-
🤗
-
View
LG AI Research logo
K-EXAONE (Non-reasoning)
LG AI Research
23
236B
(23B active at inference time)
256k
-
-
🤗
-
View
Alibaba logo
Qwen3 VL 235B A22B Instruct
Alibaba
23
235B
(22B active at inference time)
262k
$1.2
44
🤗
Parasail
Alibaba Cloud
Novita
+2 more
View
DeepSeek logo
DeepSeek R1 (Jan '25)
DeepSeek
23
685B
(37B active at inference time)
128k
$2.4
-
🤗
Amazon Bedrock
DeepInfra
SambaNova
+6 more
View
Korea Telecom logo
Mi:dm K 2.5 Pro
Korea Telecom
23
32B
128k
-
-
Not available
-
View
Mistral logo
Mistral Large 3
Mistral
23
675B
(41B active at inference time)
256k
$0.8
50
🤗
Microsoft Azure
Mistral
Amazon Bedrock
View
Alibaba logo
Qwen3 4B 2507 (Reasoning)
Alibaba
23
4.02B
262k
-
-
🤗
-
View
Mistral logo
Magistral Small 1.2
Mistral
23
24B
128k
$0.8
210
🤗
Mistral
Amazon Bedrock
View
LG AI Research logo
EXAONE 4.0 32B (Reasoning)
LG AI Research
22
32B
131k
$0.7
99
🤗
FriendliAI
View
DeepSeek logo
DeepSeek V3 0324
DeepSeek
22
671B
(37B active at inference time)
128k
$1.3
-
🤗
Together.ai
Nebius
Microsoft Azure
+6 more
View
Z AI logo
GLM-4.7-Flash (Non-reasoning)
Z AI
22
31.2B
(3B active at inference time)
200k
$0.2
69
🤗
Novita
View
InclusionAI logo
Ring-1T
InclusionAI
22
1.0KB
(50B active at inference time)
128k
-
-
🤗
-
View
Alibaba logo
Qwen3 235B A22B (Reasoning)
Alibaba
22
235B
(22B active at inference time)
32.8k
$2.6
45
🤗
Alibaba Cloud
View
Nous Research logo
Hermes 4 - Llama-3.1 405B (Reasoning)
Nous Research
22
406B
128k
$1.5
33
🤗
Nebius
View
Alibaba logo
Qwen3 VL 32B Instruct
Alibaba
21
33.4B
256k
$1.2
63
🤗
Together.ai
Alibaba Cloud
View
Z AI logo
GLM-4.6V (Reasoning)
Z AI
21
108B
128k
$0.5
98
🤗
Parasail
DeepInfra
Novita
+1 more
View
NVIDIA logo
NVIDIA Nemotron Nano 12B v2 VL (Reasoning)
NVIDIA
21
13.2B
128k
$0.3
126
🤗
DeepInfra
View
MiniMax logo
MiniMax M1 40k
MiniMax
21
456B
(45.9B active at inference time)
1.00M
-
-
🤗
-
View
MBZUAI Institute of Foundation Models logo
K2-V2 (medium)
MBZUAI Institute of Foundation Models
21
70B
512k
-
-
🤗
-
View
Alibaba logo
Qwen3 Omni 30B A3B (Reasoning)
Alibaba
21
35.3B
(3B active at inference time)
65.5k
$0.4
87
🤗
Alibaba Cloud
View
OpenAI logo
gpt-oss-20B (low)
OpenAI
21
21B
(3.6B active at inference time)
131k
$0.1
248
🤗
Groq
Novita
Amazon Bedrock
+8 more
View
InclusionAI logo
Ring-flash-2.0
InclusionAI
21
103B
(6.1B active at inference time)
128k
$0.2
81
🤗
SiliconFlow
View
Nous Research logo
Hermes 4 - Llama-3.1 70B (Reasoning)
Nous Research
20
70.6B
128k
$0.2
80
🤗
Nebius
View
Alibaba logo
Qwen3 32B (Reasoning)
Alibaba
20
32.8B
32.8k
$2.6
89
🤗
SambaNova
Novita
Nebius
+4 more
View
NVIDIA logo
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)
NVIDIA
20
253B
128k
$0.9
38
🤗
Nebius
View
Alibaba logo
Qwen3 VL 30B A3B Instruct
Alibaba
20
30B
(3B active at inference time)
256k
$0.3
100
🤗
Fireworks
DeepInfra
Alibaba Cloud
+1 more
View
InclusionAI logo
Ling-flash-2.0
InclusionAI
20
103B
(6.1B active at inference time)
128k
$0.2
60
🤗
SiliconFlow
View
Alibaba logo
QwQ 32B
Alibaba
20
32.8B
131k
$0.5
29
🤗
Cloudflare
Hyperbolic
View
Alibaba logo
Qwen3 VL 30B A3B (Reasoning)
Alibaba
20
30B
(3B active at inference time)
256k
$0.8
79
🤗
Fireworks
Novita
Alibaba Cloud
View
Z AI logo
GLM-4.5V (Reasoning)
Z AI
19
108B
(12B active at inference time)
64.0k
$0.9
40
🤗
Novita
View
Alibaba logo
Qwen3 30B A3B 2507 Instruct
Alibaba
19
30.5B
(3.3B active at inference time)
262k
$0.3
76
🤗
Nebius
Alibaba Cloud
Clarifai
View
Alibaba logo
Qwen3 30B A3B (Reasoning)
Alibaba
19
30.5B
(3.3B active at inference time)
32.8k
$0.8
71
🤗
Fireworks
DeepInfra
Novita
+1 more
View
Mistral logo
Devstral 2
Mistral
19
125B
256k
-
77
🤗
Mistral
View
Allen Institute for AI logo
Olmo 3 32B Think
Allen Institute for AI
19
32.2B
65.5k
-
-
🤗
-
View
NVIDIA logo
NVIDIA Nemotron Nano 9B V2 (Non-reasoning)
NVIDIA
19
9B
131k
$0.1
118
🤗
Together.ai
DeepInfra
Amazon Bedrock
View
Alibaba logo
Qwen3 14B (Reasoning)
Alibaba
19
14.8B
32.8k
$1.3
56
🤗
DeepInfra
Alibaba Cloud
View
NVIDIA logo
Llama 3.3 Nemotron Super 49B v1 (Reasoning)
NVIDIA
18
49B
128k
-
-
🤗
-
View
Meta logo
Llama 4 Maverick
Meta
18
402B
(17B active at inference time)
1.00M
$0.5
115
🤗
Parasail
Databricks
Google
+9 more
View
Alibaba logo
Qwen3 Coder 30B A3B Instruct
Alibaba
17
30.5B
(3.3B active at inference time)
262k
$0.9
20
🤗
Scaleway
Clarifai
Amazon Bedrock
+2 more
View
Baidu logo
ERNIE 4.5 300B A47B
Baidu
17
300B
(47B active at inference time)
131k
$0.5
20
🤗
SiliconFlow
Novita
View
DeepSeek logo
DeepSeek R1 Distill Qwen 32B
DeepSeek
17
32B
128k
$0.3
56
🤗
DeepInfra
View
Nous Research logo
Hermes 4 - Llama-3.1 405B (Non-reasoning)
Nous Research
17
406B
128k
$1.5
31
🤗
Nebius
View
DeepSeek logo
DeepSeek V3 (Dec '24)
DeepSeek
17
671B
(37B active at inference time)
128k
$0.6
-
🤗
Hyperbolic
Novita
Novita
+2 more
View
Allen Institute for AI logo
Olmo 3 7B Think
Allen Institute for AI
17
7B
65.5k
$0.1
68
🤗
Parasail
View
Mistral logo
Magistral Small 1
Mistral
17
23.6B
40.0k
-
-
🤗
-
View
Mistral logo
Devstral Small 2
Mistral
17
24B
256k
-
200
🤗
Mistral
View
Alibaba logo
Qwen3 VL 8B (Reasoning)
Alibaba
17
8.77B
256k
$0.7
118
🤗
Alibaba Cloud
View
MBZUAI Institute of Foundation Models logo
K2-V2 (low)
MBZUAI Institute of Foundation Models
16
70B
512k
-
-
🤗
-
View
DeepSeek logo
DeepSeek R1 0528 Qwen3 8B
DeepSeek
16
8.19B
32.8k
-
-
🤗
-
View
Mistral logo
Ministral 3 14B
Mistral
16
14B
256k
$0.2
142
🤗
Amazon Bedrock
Together.ai
Mistral
View
Z AI logo
GLM-4.6V (Non-reasoning)
Z AI
16
108B
128k
$0.5
27
🤗
Novita
SiliconFlow
Parasail
View
Alibaba logo
Qwen3 4B 2507 Instruct
Alibaba
16
4.02B
262k
-
-
🤗
-
View
LG AI Research logo
EXAONE 4.0 32B (Non-reasoning)
LG AI Research
16
32B
131k
$0.7
90
🤗
FriendliAI
View
Alibaba logo
Qwen3 Omni 30B A3B Instruct
Alibaba
16
35.3B
(3B active at inference time)
65.5k
$0.4
89
🤗
Alibaba Cloud
View
Alibaba logo
Qwen3 235B A22B (Non-reasoning)
Alibaba
16
235B
(22B active at inference time)
32.8k
$1.2
35
🤗
Novita
Alibaba Cloud
DeepInfra
View
DeepSeek logo
DeepSeek R1 Distill Llama 70B
DeepSeek
16
70B
128k
$0.9
58
🤗
SambaNova
DeepInfra
Scaleway
View
DeepSeek logo
DeepSeek R1 Distill Qwen 14B
DeepSeek
16
14B
128k
-
-
🤗
-
View
Alibaba logo
Qwen3 14B (Non-reasoning)
Alibaba
16
14.8B
32.8k
$0.6
53
🤗
DeepInfra
Alibaba Cloud
View
Alibaba logo
Qwen2.5 Instruct 72B
Alibaba
16
72B
131k
-
46
🤗
Hyperbolic
DeepInfra
Alibaba Cloud
View
Alibaba logo
Qwen3 8B (Reasoning)
Alibaba
15
8.19B
131k
$0.7
62
🤗
Alibaba Cloud
View
Mistral logo
Ministral 3 8B
Mistral
15
8B
256k
$0.1
166
🤗
Mistral
Amazon Bedrock
View
Meta logo
Llama 3.1 Instruct 405B
Meta
15
405B
128k
$4.4
25
🤗
Amazon Bedrock
Google
Amazon Bedrock
+4 more
View
Alibaba logo
QwQ 32B-Preview
Alibaba
15
32.8B
32.8k
$0.1
55
🤗
DeepInfra
View
InclusionAI logo
Ling-mini-2.0
InclusionAI
15
16.3B
(1.4B active at inference time)
131k
$0.1
148
🤗
SiliconFlow
View
Mistral logo
Mistral Small 3.2
Mistral
15
24B
128k
$0.1
147
🤗
Mistral
DeepInfra
View
Mistral logo
Devstral Small (Jul '25)
Mistral
15
24B
256k
$0.1
233
🤗
DeepInfra
Mistral
View
Alibaba logo
Qwen3 VL 8B Instruct
Alibaba
15
8.77B
256k
$0.3
116
🤗
Together.ai
Alibaba Cloud
View
NVIDIA logo
NVIDIA Nemotron Nano 9B V2 (Reasoning)
NVIDIA
15
9B
131k
$0.1
93
🤗
DeepInfra
View
Cohere logo
Command A
Cohere
15
111B
256k
$4.4
44
🤗
Microsoft Azure
Cohere
View
Mistral logo
Mistral Large 2 (Nov '24)
Mistral
15
123B
128k
$3.0
35
🤗
Microsoft Azure
Mistral
View
LG AI Research logo
Exaone 4.0 1.2B (Reasoning)
LG AI Research
15
1.28B
64.0k
-
-
🤗
-
View
NVIDIA logo
Llama Nemotron Super 49B v1.5 (Non-reasoning)
NVIDIA
15
49B
128k
$0.2
68
🤗
DeepInfra
View
Alibaba logo
Qwen3 30B A3B (Non-reasoning)
Alibaba
15
30.5B
(3.3B active at inference time)
32.8k
$0.3
64
🤗
DeepInfra
Alibaba Cloud
View
Alibaba logo
Qwen3 32B (Non-reasoning)
Alibaba
15
32.8B
32.8k
$1.2
78
🤗
Amazon Bedrock
Nebius
Alibaba Cloud
+5 more
View
Meta logo
Llama 3.3 Instruct 70B
Meta
14
70B
128k
$0.6
83
🤗
Nebius
Fireworks
Cloudflare
+18 more
View
NVIDIA logo
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)
NVIDIA
14
4.51B
128k
-
-
🤗
-
View
Kimi logo
Kimi Linear 48B A3B Instruct
Kimi
14
49.1B
(3B active at inference time)
1.00M
-
-
🤗
-
View
Z AI logo
GLM-4.5V (Non-reasoning)
Z AI
14
108B
(12B active at inference time)
64.0k
$0.9
38
🤗
Novita
View
Reka AI logo
Reka Flash 3
Reka AI
14
21B
128k
$0.3
49
🤗
Reka AI
View
NVIDIA logo
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)
NVIDIA
14
49B
128k
-
-
🤗
-
View
Alibaba logo
Qwen3 4B (Reasoning)
Alibaba
14
4.02B
32.0k
$0.4
92
🤗
Alibaba Cloud
View
NVIDIA logo
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)
NVIDIA
14
31.6B
(3.6B active at inference time)
1.00M
$0.1
112
🤗
DeepInfra
View
NVIDIA logo
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)
NVIDIA
14
13.2B
128k
$0.3
134
🤗
DeepInfra
Nebius
Amazon Bedrock
View
Allen Institute for AI logo
Llama 3.1 Tulu3 405B
Allen Institute for AI
14
405B
128k
-
-
🤗
-
View
Alibaba logo
Qwen3 VL 4B Instruct
Alibaba
14
4.44B
256k
-
-
🤗
-
View
Mistral logo
Pixtral Large
Mistral
14
124B
128k
$3.0
50
🤗
Mistral
View
Mistral logo
Mistral Small 3.1
Mistral
14
24B
128k
$0.1
142
🤗
Cloudflare
Google
Mistral
+1 more
View
xAI logo
Grok 2 (Dec '24)
xAI
14
270B
131k
-
-
🤗
-
View
Alibaba logo
Qwen3 VL 4B (Reasoning)
Alibaba
14
4.44B
256k
-
-
🤗
-
View
Nous Research logo
Hermes 4 - Llama-3.1 70B (Non-reasoning)
Nous Research
14
70.6B
128k
$0.2
70
🤗
Nebius
View
Meta logo
Llama 4 Scout
Meta
14
109B
(17B active at inference time)
10.0M
$0.3
146
🤗
Microsoft Azure
DeepInfra
Google
+6 more
View
NVIDIA logo
Llama 3.1 Nemotron Instruct 70B
NVIDIA
14
70B
128k
$1.2
31
🤗
DeepInfra
View
Alibaba logo
Qwen3 8B (Non-reasoning)
Alibaba
13
8.19B
32.8k
$0.3
52
🤗
Alibaba Cloud
Fireworks
View
Alibaba logo
Qwen2.5 Instruct 32B
Alibaba
13
32B
128k
-
-
🤗
-
View
IBM logo
Granite 4.0 H Small
IBM
13
32B
(9B active at inference time)
128k
$0.1
519
🤗
Replicate
View
Microsoft Azure logo
Phi-4
Microsoft Azure
13
14B
16.0k
$0.2
8
🤗
DeepInfra
Microsoft Azure
View
Meta logo
Llama 3.1 Instruct 70B
Meta
13
70B
128k
$0.6
44
🤗
Amazon Bedrock
DeepInfra
Hyperbolic
+4 more
View
Alibaba logo
Qwen3 1.7B (Reasoning)
Alibaba
13
2.03B
32.0k
$0.4
124
🤗
Alibaba Cloud
View
Mistral logo
Mistral Large 2 (Jul '24)
Mistral
13
123B
128k
$3.0
-
🤗
Amazon Bedrock
View
Allen Institute for AI logo
Olmo 3 7B Instruct
Allen Institute for AI
13
7B
65.5k
$0.1
38
🤗
Parasail
View
Alibaba logo
Qwen2.5 Coder Instruct 32B
Alibaba
13
32B
131k
$0.2
-
🤗
Hyperbolic
View
Mistral logo
Ministral 3 3B
Mistral
13
3B
256k
$0.1
292
🤗
Mistral
Amazon Bedrock
View
Mistral logo
Mistral Small 3
Mistral
13
24B
32.0k
$0.1
241
🤗
Together.ai
DeepInfra
Mistral
View
AI21 Labs logo
Jamba Reasoning 3B
AI21 Labs
13
3B
262k
-
-
🤗
-
View
AI21 Labs logo
Jamba 1.7 Large
AI21 Labs
13
398B
(94B active at inference time)
256k
$3.5
51
🤗
AI21 Labs
View
DeepSeek logo
DeepSeek-V2.5 (Dec '24)
DeepSeek
13
236B
(21B active at inference time)
128k
-
-
🤗
-
View
Alibaba logo
Qwen3 4B (Non-reasoning)
Alibaba
12
4.02B
32.0k
$0.2
84
🤗
Alibaba Cloud
View
LG AI Research logo
Exaone 4.0 1.2B (Non-reasoning)
LG AI Research
12
1.28B
64.0k
-
-
🤗
-
View
Google logo
Gemma 3 12B Instruct
Google
12
12.2B
128k
-
32
🤗
DeepInfra
Google
Amazon Bedrock
+2 more
View
DeepSeek logo
DeepSeek-V2.5
DeepSeek
12
236B
(21B active at inference time)
128k
-
-
🤗
-
View
Mistral logo
Devstral Small (May '25)
Mistral
12
23.6B
256k
$0.1
-
🤗
DeepInfra
View
DeepSeek logo
DeepSeek R1 Distill Llama 8B
DeepSeek
12
8B
128k
-
-
🤗
-
View
Perplexity logo
R1 1776
Perplexity
12
671B
(37B active at inference time)
128k
-
-
🤗
-
View
Meta logo
Llama 3.2 Instruct 90B (Vision)
Meta
12
90B
128k
$0.7
42
🤗
Google
Amazon Bedrock
Microsoft Azure
+1 more
View
Upstage logo
Solar Mini
Upstage
12
10.7B
4.10k
$0.1
82
🤗
Upstage
View
xAI logo
Grok-1
xAI
12
314B
(78B active at inference time)
8.19k
-
-
🤗
-
View
Alibaba logo
Qwen2 Instruct 72B
Alibaba
12
72B
131k
-
-
🤗
-
View
Liquid AI logo
LFM2 8B A1B
Liquid AI
11
8.34B
(1.5B active at inference time)
32.8k
-
-
🤗
?
View
Meta logo
Llama 3.1 Instruct 8B
Meta
11
8B
128k
$0.1
173
🤗
Eigen AI
DeepInfra
Microsoft Azure
+15 more
View
IBM logo
Granite 4.0 Micro
IBM
11
3B
128k
-
-
🤗
-
View
Microsoft Azure logo
Phi-4 Mini Instruct
Microsoft Azure
11
3.84B
128k
-
45
🤗
Microsoft Azure
View
Nous Research logo
DeepHermes 3 - Mistral 24B Preview (Non-reasoning)
Nous Research
11
24B
32.0k
-
-
🤗
-
View
Meta logo
Llama 3.2 Instruct 11B (Vision)
Meta
11
11B
128k
$0.2
63
🤗
Microsoft Azure
DeepInfra
Amazon Bedrock
View
Google logo
Gemma 3n E4B Instruct
Google
11
8.39B
(4B active at inference time)
32.0k
$0.0
44
🤗
Together.ai
View
IBM logo
Granite 3.3 8B (Non-reasoning)
IBM
11
8.17B
128k
$0.1
486
🤗
Replicate
View
AI21 Labs logo
Jamba 1.5 Large
AI21 Labs
11
398B
(94B active at inference time)
256k
$3.5
-
🤗
Google
Amazon Bedrock
View
AI21 Labs logo
Jamba 1.7 Mini
AI21 Labs
11
52B
(12B active at inference time)
258k
-
-
🤗
-
View
Google logo
Gemma 3 4B Instruct
Google
11
4.3B
128k
-
32
🤗
DeepInfra
Google
Amazon Bedrock
View
Nous Research logo
Hermes 3 - Llama-3.1 70B
Nous Research
11
70.6B
128k
$0.3
32
🤗
DeepInfra
View
DeepSeek logo
DeepSeek-Coder-V2
DeepSeek
11
236B
(21B active at inference time)
128k
-
-
🤗
-
View
Alibaba logo
Qwen3 1.7B (Non-reasoning)
Alibaba
11
2.03B
32.0k
$0.2
117
🤗
Alibaba Cloud
View
Allen Institute for AI logo
OLMo 2 32B
Allen Institute for AI
11
32.2B
4.10k
-
-
🤗
-
View
AI21 Labs logo
Jamba 1.6 Large
AI21 Labs
11
398B
(94B active at inference time)
256k
$3.5
55
🤗
AI21 Labs
View
Alibaba logo
Qwen3 0.6B (Reasoning)
Alibaba
11
0.752B
32.0k
$0.4
201
🤗
Alibaba Cloud
View
Liquid AI logo
LFM2 24B A2B
Liquid AI
10
23.8B
(2.3B active at inference time)
32.8k
$0.1
86
🤗
Together.ai
View
IBM logo
Granite 4.0 H 1B
IBM
10
1.5B
128k
-
-
🤗
-
View
Google logo
Gemma 3 27B Instruct
Google
10
27.4B
128k
-
34
🤗
Parasail
Amazon Bedrock
Novita
+2 more
View
IBM logo
Granite 4.0 1B
IBM
10
1.6B
128k
-
-
🤗
-
View
Meta logo
Llama 3 Instruct 70B
Meta
10
70B
8.19k
$0.9
38
🤗
DeepInfra
Replicate
Amazon Bedrock
+1 more
View
Mistral logo
Mistral Small (Sep '24)
Mistral
10
22B
32.8k
$0.3
147
🤗
Mistral
View
Microsoft Azure logo
Phi-3 Mini Instruct 3.8B
Microsoft Azure
10
3.8B
4.10k
$0.2
-
🤗
Microsoft Azure
View
Google logo
Gemma 3n E4B Instruct Preview (May '25)
Google
10
8.39B
(4B active at inference time)
32.0k
-
-
🤗
-
View
Microsoft Azure logo
Phi-4 Multimodal Instruct
Microsoft Azure
10
5.6B
128k
-
17
🤗
Microsoft Azure
View
Alibaba logo
Qwen2.5 Coder Instruct 7B
Alibaba
10
7.62B
131k
-
-
🤗
-
View
Liquid AI logo
LFM2 2.6B
Liquid AI
10
2.57B
32.8k
-
-
🤗
?
View
Mistral logo
Mixtral 8x22B Instruct
Mistral
10
141B
(39B active at inference time)
65.4k
-
-
🤗
-
View
Meta logo
Llama 2 Chat 7B
Meta
10
7B
4.10k
$0.1
117
🤗
Replicate
View
Google logo
Gemma 3n E2B Instruct
Google
10
5.98B
(2B active at inference time)
32.0k
-
47
🤗
Google
View
Meta logo
Llama 3.2 Instruct 3B
Meta
10
3B
128k
$0.1
61
🤗
Amazon Bedrock
Together.ai
DeepInfra
+1 more
View
Alibaba logo
Qwen3 0.6B (Non-reasoning)
Alibaba
10
0.752B
32.0k
$0.2
193
🤗
Alibaba Cloud
View
Alibaba logo
Qwen1.5 Chat 110B
Alibaba
10
110B
32.0k
-
-
🤗
-
View
Liquid AI logo
LFM2 1.2B
Liquid AI
9
1.17B
32.8k
-
-
🤗
?
View
Allen Institute for AI logo
OLMo 2 7B
Allen Institute for AI
9
7.3B
4.10k
-
-
🤗
-
View
Allen Institute for AI logo
Molmo 7B-D
Allen Institute for AI
9
8.02B
4.10k
-
-
🤗
-
View
Meta logo
Llama 3.2 Instruct 1B
Meta
9
1B
128k
$0.1
139
🤗
Novita
Amazon Bedrock
View
DeepSeek logo
DeepSeek R1 Distill Qwen 1.5B
DeepSeek
9
1.5B
128k
-
-
🤗
-
View
DeepSeek logo
DeepSeek-V2-Chat
DeepSeek
9
236B
(21B active at inference time)
128k
-
-
🤗
-
View
IBM logo
Granite 4.0 H 350M
IBM
9
0.34B
32.8k
-
-
🤗
-
View
IBM logo
Granite 4.0 350M
IBM
9
0.35B
32.8k
-
-
🤗
-
View
Snowflake logo
Arctic Instruct
Snowflake
9
480B
(17B active at inference time)
4.00k
-
-
🤗
-
View
Alibaba logo
Qwen Chat 72B
Alibaba
9
72B
33.8k
-
-
🤗
-
View
Meta logo
Llama 3 Instruct 8B
Meta
9
8B
8.19k
$0.1
72
🤗
DeepInfra
Replicate
Amazon Bedrock
+1 more
View
Google logo
Gemma 3 1B Instruct
Google
9
1B
32.0k
-
43
🤗
Google
View
DeepSeek logo
DeepSeek Coder V2 Lite Instruct
DeepSeek
8
16B
(2.4B active at inference time)
128k
-
-
🤗
-
View
Google logo
Gemma 3 270M
Google
8
0.268B
32.0k
-
-
🤗
-
View
Meta logo
Llama 2 Chat 70B
Meta
8
70B
4.10k
-
-
🤗
-
View
DeepSeek logo
DeepSeek LLM 67B Chat (V1)
DeepSeek
8
7B
4.10k
-
-
🤗
-
View
Meta logo
Llama 2 Chat 13B
Meta
8
13B
4.10k
-
-
🤗
-
View
Cohere logo
Command-R+ (Apr '24)
Cohere
8
104B
128k
$6.0
-
🤗
Amazon Bedrock
View
OpenChat logo
OpenChat 3.5 (1210)
OpenChat
8
7B
8.19k
-
-
🤗
-
View
Databricks logo
DBRX Instruct
Databricks
8
132B
(36B active at inference time)
32.8k
-
-
🤗
-
View
AI21 Labs logo
Jamba 1.5 Mini
AI21 Labs
8
52B
(12B active at inference time)
256k
$0.3
-
🤗
Google
Amazon Bedrock
View
AI21 Labs logo
Jamba 1.6 Mini
AI21 Labs
8
52B
(12B active at inference time)
256k
$0.3
151
🤗
AI21 Labs
View
Mistral logo
Mixtral 8x7B Instruct
Mistral
8
46.7B
(12.9B active at inference time)
32.8k
$0.5
-
🤗
DeepInfra
Together.ai
Amazon Bedrock
View
Nous Research logo
DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)
Nous Research
8
8B
128k
-
-
🤗
-
View
Meta logo
Llama 65B
Meta
7
65B
2.05k
-
-
Not available
-
View
Alibaba logo
Qwen Chat 14B
Alibaba
7
14B
8.19k
-
-
Not available
-
View
Mistral logo
Mistral 7B Instruct
Mistral
7
7B
8.19k
$0.3
148
🤗
Amazon Bedrock
Together.ai
Mistral
View
Cohere logo
Command-R (Mar '24)
Cohere
7
35B
128k
$0.8
-
🤗
Amazon Bedrock
View
TII UAE logo
Falcon-H1R-7B
TII UAE
-
7B
256k
-
-
Not available
-
View
Liquid AI logo
LFM2.5-VL-1.6B
Liquid AI
-
1.6B
32.0k
-
-
🤗
-
View
Liquid AI logo
LFM2.5-1.2B-Thinking
Liquid AI
-
1.17B
32.0k
-
-
🤗
-
View
Liquid AI logo
LFM2.5-1.2B-Instruct
Liquid AI
-
1.17B
32.0k
-
-
🤗
?
View
StepFun logo
Step3 VL 10B
StepFun
-
10.2B
65.5k
-
-
🤗
-
View
Allen Institute for AI logo
Molmo2-8B
Allen Institute for AI
-
8.66B
36.9k
-
113
🤗
Parasail
View
Allen Institute for AI logo
Olmo 3.1 32B Instruct
Allen Institute for AI
-
32.2B
65.5k
$0.3
44
🤗
DeepInfra
View
Allen Institute for AI logo
Olmo 3.1 32B Think
Allen Institute for AI
-
32.2B
65.5k
-
74
🤗
Parasail
View
Deep Cogito logo
Cogito v2.1 (Reasoning)
Deep Cogito
-
671B
(37B active at inference time)
128k
$1.3
73
🤗
Together.ai
View
Trillion Labs logo
Tri-21B-Think
Trillion Labs
-
21B
32.0k
-
-
Not available
-
View
Trillion Labs logo
Tri-21B-think Preview
Trillion Labs
-
21B
32.0k
-
-
Not available
-
View
Cohere logo
Tiny Aya Global
Cohere
-
3.35B
8.19k
-
-
🤗
-
View