Anthropic
36.53
N/A
Amazon Bedrock
32.07
N/A
Google
39.43
N/A
Amazon Bedrock
56.68
N/A
OpenAI
27.07
N/A
Google
25.00
N/A
Microsoft Azure
30.29
N/A
Amazon Bedrock
8.32
N/A
Anthropic
24.50
N/A
Amazon Bedrock
28.44
N/A
OpenAI
21.49
N/A
Makora (FP8)
15.68
12.07
Wafer
26.55
20.57
Fireworks
28.51
22.12
Together AI
18.90
14.61
Baseten
9.98
7.17
FriendliAI
16.48
12.94
Novita (FP8)
46.09
35.30
GMI (FP8)
18.60
13.43
Databricks
12.78
9.15
Parasail (FP8)
14.68
11.04
CoreWeave
13.67
10.51
DeepInfra (FP8)
54.64
43.05
SiliconFlow (FP8)
48.35
36.43
Amazon Bedrock
13.41
N/A
OpenAI
12.18
N/A
Google AI Studio
Gemini 3.5 Flash AI Studio 17.08
N/A
Anthropic
81.60
N/A
Microsoft Azure
50.84
N/A
Amazon Bedrock
70.49
N/A
Google
39.84
N/A
Google (AI Studio)
Gemini 3.1 Pro Preview (AI Studio) 33.70
N/A
Google (Vertex)
Gemini 3.1 Pro Preview (Vertex) 28.58
N/A
Alibaba Cloud
15.97
11.87
GMI (FP8)
58.18
46.45
Novita
56.54
45.74
Google
Gemini 3.5 Flash (medium) 14.90
N/A
Parasail (MXFP8)
104.41
82.29
Together AI
35.88
27.96
Novita
35.34
26.73
SiliconFlow
41.19
31.34
MiniMax
40.39
30.71
Makora (MXFP8)
26.40
20.74
GMI
36.65
26.97
DeepInfra (FP4)
DeepSeek V4 Pro (Max) (FP4) 122.50
109.38
Microsoft Azure
69.12
61.06
SiliconFlow
62.34
54.68
Novita
59.45
52.34
Lightning AI
29.96
26.48
GMI
52.99
45.99
Nebius
115.11
102.38
Together AI
25.31
21.91
Fireworks
40.02
35.35
DeepSeek
47.49
41.61
OpenAI
63.13
N/A
Amazon Bedrock
9.26
N/A
OpenAI
9.69
N/A
Databricks
20.47
17.88
Crusoe
11.58
9.85
CoreWeave
21.04
18.08
Novita
77.65
69.01
GMI FP8
125.73
110.97
Together AI (FP4)
13.90
11.76
Fireworks
12.76
10.96
SiliconFlow (FP8)
153.13
135.67
DeepInfra (FP4)
87.81
78.15
Parasail
96.21
85.98
Nebius
69.27
61.12
Eigen AI (FP4)
22.98
20.11
Cloudflare
77.61
67.53
Microsoft Azure
28.42
24.86
Kimi
146.39
130.36
Anthropic
Claude Opus 4.7 (Non-reasoning, high)
10.29
N/A
Amazon Bedrock
Claude Opus 4.7 (Non-reasoning, high)
8.42
N/A
Novita
51.77
40.16
GMI
115.30
41.96
DeepInfra
43.65
34.62
Xiaomi
51.67
39.69
GMI (FP8)
53.46
41.90
CoreWeave
10.06
7.46
Parasail
57.24
46.24
Together AI
18.63
14.90
DeepInfra
52.68
42.51
Novita
44.93
35.65
Crusoe
6.86
5.13
Databricks
20.16
15.97
Makora
13.92
10.94
Kimi
57.53
46.06
DeepInfra (FP4)
DeepSeek V4 Pro (High) (FP4) 55.11
43.09
Fireworks
21.13
16.38
Microsoft Azure
33.20
25.68
DeepSeek
25.02
19.17
SiliconFlow (FP8)
DeepSeek V4 Pro (High) (FP8) 37.69
28.99
Nebius
48.88
37.96
Together AI
13.63
10.24
Makora
15.05
11.56
Lightning AI
15.55
12.06
Novita
31.34
24.10
Baseten
19.77
15.24
GMI
58.41
51.74
Makora
20.77
18.70
DeepInfra FP4
DeepSeek V4 Flash (Max) FP4 398.31
365.08
SiliconFlow (FP8)
DeepSeek V4 Flash (Max) (FP8) 53.03
47.51
Parasail (FP8)
DeepSeek V4 Flash (Max) (FP8) 182.08
166.49
Novita
52.65
47.25
DeepSeek
47.56
42.71
Wafer
67.88
59.24
DeepInfra (FP4)
45.29
39.41
Fireworks
22.55
19.43
Parasail (FP8)
59.17
51.58
Novita (FP8)
73.71
63.13
CoreWeave
22.45
18.97
Nebius (FP8, Base)
128.83
113.12
Together AI
18.41
15.85
SiliconFlow
88.14
74.90
FriendliAI
28.02
24.46
Novita
28.87
22.03
Parasail
20.22
15.75
DeepInfra
13.27
10.30
Xiaomi
41.02
31.01
Microsoft Azure
119.63
N/A
OpenAI
6.67
N/A
xAI
36.08
28.54
Alibaba Cloud
116.96
105.81
Alibaba Cloud
50.59
39.11
OpenAI
5.96
N/A
Together AI
6.38
4.71
GMI (FP8)
64.26
51.42
Fireworks
28.71
23.28
MiniMax
62.15
50.31
Novita (FP8)
51.07
41.31
SambaNova
7.23
5.38
CoreWeave
10.94
8.24
Together AI
14.00
11.02
DeepInfra
40.06
29.13
Nebius
24.44
18.31
Lightning AI
33.63
27.21
Blackbox AI
6.34
4.62
xAI
24.74
N/A
Amazon Bedrock
19.12
N/A
Microsoft Azure
37.43
N/A
CoreWeave
32.47
22.45
DeepInfra (FP4)
DeepSeek V4 Flash (High) (FP4) 112.10
79.26
GMI
17.15
10.76
Makora
6.18
4.12
Parasail (FP8)
DeepSeek V4 Flash (High) (FP8) 48.33
33.97
SiliconFlow (FP8)
DeepSeek V4 Flash (High) (FP8) 16.36
10.80
Novita
16.07
10.69
DeepSeek
15.74
10.51
Groq
14.93
13.16
SiliconFlow (FP8)
154.52
140.49
DeepInfra FP8
79.19
72.42
Makora FP4
53.92
49.11
Novita
99.28
90.29
Alibaba Cloud
97.97
88.80
Xiaomi
38.28
28.59
Amazon Bedrock
11.95
N/A
Microsoft Azure
22.81
N/A
xAI
16.08
N/A
Amazon Bedrock
Claude Sonnet 4.6 (Non-reasoning)
8.20
N/A
Microsoft Azure
Claude Sonnet 4.6 (Non-reasoning)
9.64
N/A
Google
Claude Sonnet 4.6 (Non-reasoning)
8.84
N/A
Anthropic
Claude Sonnet 4.6 (Non-reasoning)
9.31
N/A
Amazon Bedrock
5.86
N/A
Microsoft Azure
11.54
N/A
xAI
8.17
N/A
StreamLake
KAT-Coder-Pro V2
5.16
N/A
OpenAI
GPT-5.5 (Non-reasoning)
10.65
N/A
Amazon Bedrock
GPT-5.5 (Non-reasoning)
7.66
N/A
Wafer
GLM-5.1
8.90
N/A
Parasail
GLM-5.1
7.29
N/A
DeepInfra (FP4)
GLM-5.1 (FP4)
7.06
N/A
Novita (FP8)
GLM-5.1 (FP8)
11.18
N/A
FriendliAI
GLM-5.1
3.56
N/A
SiliconFlow (FP8)
GLM-5.1 (FP8)
13.38
N/A
Nebius (FP8, Base)
GLM-5.1 (FP8, Base)
15.19
N/A
Xiaomi
40.48
30.11
Google AI Studio
Gemini 3.5 Flash (minimal) AI Studio
3.01
N/A
Databricks
Kimi K2.6
2.86
N/A
Together AI (FP4)
Kimi K2.6 (FP4)
2.21
N/A
SiliconFlow (FP8)
Kimi K2.6 (FP8)
16.62
N/A
Fireworks
Kimi K2.6
1.81
N/A
Parasail (INT4)
Kimi K2.6 (INT4)
10.26
N/A
DeepInfra (FP4)
Kimi K2.6 (FP4)
10.71
N/A
CoreWeave
Kimi K2.6
3.30
N/A
Microsoft Azure
Kimi K2.6
3.36
N/A
Crusoe
Kimi K2.6
2.55
N/A
Kimi
Kimi K2.6
13.95
N/A
Nebius
Kimi K2.6
5.19
N/A
Novita
Kimi K2.6
9.27
N/A
Anthropic
Claude Sonnet 4.6 (Non-reasoning, Low Effort)
9.23
N/A
Wafer
34.01
26.90
Together AI
39.67
13.03
Parasail
48.84
41.67
SiliconFlow (FP8)
34.78
28.71
DeepInfra (FP8)
80.76
69.43
DigitalOcean
136.44
117.43
GMI (FP8)
73.52
61.58
Nebius (Base, FP4)
Qwen3.5 397B A17B (Base, FP4) 30.35
25.25
Eigen AI
13.14
10.82
Alibaba Cloud
72.48
61.22
Novita
72.55
61.68
GMI
42.64
31.83
SiliconFlow
20.41
14.69
Xiaomi
38.31
28.58
SiliconFlow (FP8)
53.40
41.62
DeepInfra (FP8)
58.97
46.80
Alibaba Cloud
17.90
13.46
GMI (FP8)
18.48
13.36
Novita
26.09
20.22
Wafer
Qwen3.5 397B A17B
6.25
N/A
Together AI
Qwen3.5 397B A17B
2.91
N/A
DeepInfra (FP8)
Qwen3.5 397B A17B (FP8)
9.62
N/A
Eigen AI
Qwen3.5 397B A17B
2.31
N/A
Alibaba Cloud
Qwen3.5 397B A17B
11.28
N/A
Nebius Fast
Qwen3.5 397B A17B Fast
2.95
N/A
Nebius (Base, FP4)
Qwen3.5 397B A17B (Base, FP4)
4.87
N/A
DigitalOcean
Qwen3.5 397B A17B
80.00
N/A
Novita
Qwen3.5 397B A17B
10.80
N/A
Alibaba Cloud
34.83
30.64
SiliconFlow (FP8)
84.79
76.28
Scaleway
32.19
28.79
DeepInfra (FP8)
48.52
44.17
Makora FP4
113.56
103.52
Parasail
69.48
63.05
Novita
35.62
31.85
Microsoft Azure
DeepSeek V4 Pro
6.83
N/A
Nebius
DeepSeek V4 Pro
11.17
N/A
Makora
DeepSeek V4 Pro
3.11
N/A
Lightning AI
DeepSeek V4 Pro
3.24
N/A
DeepSeek
DeepSeek V4 Pro
5.83
N/A
Alibaba Cloud
Qwen3.5 Omni Plus
10.96
N/A
InclusionAI
19.99
14.47
Microsoft Azure
20.30
N/A
OpenAI
8.17
N/A
OpenAI
5.73
N/A
Mistral
19.49
15.00
Microsoft Azure
8.53
N/A
OpenAI
5.62
N/A
StepFun
6.83
4.82
Amazon Bedrock
12.91
N/A
Google Vertex
14.13
N/A
Anthropic
16.19
N/A
Microsoft Azure
16.10
N/A
CoreWeave
44.54
34.15
GMI (FP8)
111.06
83.12
Novita
99.13
75.78
Parasail
40.12
30.61
DeepInfra
85.83
66.32
SiliconFlow (FP8)
37.70
28.07
SambaNova
12.94
8.75
FriendliAI
30.24
23.00
Lightning AI
8.60
6.43
Together AI
32.36
24.75
Google (AI Studio)
64.72
49.49
Cohere
12.73
10.05
Groq
Qwen3.6 27B
1.72
N/A
Makora FP4
Qwen3.6 27B FP4
4.60
N/A
DeepInfra FP8
Qwen3.6 27B FP8
6.56
N/A
Alibaba Cloud
Qwen3.6 27B
9.19
N/A
Novita
Qwen3.6 27B
59.84
N/A
DeepSeek
DeepSeek V4 Flash
5.19
N/A
GMI
DeepSeek V4 Flash
7.94
N/A
Makora
DeepSeek V4 Flash
1.94
N/A
CoreWeave
DeepSeek V4 Flash
9.88
N/A
Novita
KAT-Coder-Pro V1
5.19
N/A
DeepInfra FP8
Qwen3.5 122B A10B FP8
10.60
N/A
Alibaba Cloud
Qwen3.5 122B A10B
3.92
N/A
GMI
MiMo-V2.5-Pro
16.01
N/A
DeepInfra
MiMo-V2.5-Pro
10.43
N/A
Xiaomi
MiMo-V2.5-Pro
10.48
N/A
Novita
MiMo-V2.5-Pro
11.41
N/A
Google Vertex
28.31
N/A
Google (AI Studio)
Gemini 2.5 Pro (AI Studio) 25.30
N/A
GMI
Hy3-preview
9.55
N/A
SiliconFlow
Hy3-preview
5.33
N/A
StepFun
13.06
9.77
Cloudflare
23.19
18.15
DeepInfra
82.07
65.33
Parasail
53.93
42.73
Novita
73.52
57.85
Google AI Studio
Gemma 4 26B A4B AI Studio 54.27
42.59
GMI (FP8)
75.64
57.54
CoreWeave
18.28
13.90
Nebius
6.74
4.51
Inception
3.69
N/A
Google (AI Studio)
Gemini 3.1 Flash-Lite (AI Studio) 6.18
N/A
SiliconFlow (FP8)
50.87
39.60
Together AI (FP8)
39.83
31.57
Parasail
Gemma 4 31B
9.28
N/A
FriendliAI
Gemma 4 31B
6.73
N/A
Together AI (FP8)
Gemma 4 31B (FP8)
6.77
N/A
SiliconFlow (FP8)
Gemma 4 31B (FP8)
8.68
N/A
DeepInfra (FP8)
Gemma 4 31B (FP8)
17.26
N/A
SambaNova
Gemma 4 31B
4.18
N/A
Novita
Gemma 4 31B
20.08
N/A
Amazon Bedrock
Grok 4.3 (Non-reasoning)
2.88
N/A
xAI
Grok 4.3 (Non-reasoning)
4.07
N/A
Parasail (FP8)
Trinity Large Thinking (FP8) 16.49
12.76
Arcee AI
9.19
6.73
Scaleway
Qwen3.6 35B A3B
3.17
N/A
Makora FP4
Qwen3.6 35B A3B FP4
7.76
N/A
DeepInfra (FP8)
Qwen3.6 35B A3B (FP8)
6.55
N/A
Novita
Qwen3.6 35B A3B
3.34
N/A
Parasail (FP8)
Qwen3.6 35B A3B (FP8)
5.91
N/A
Alibaba Cloud
Qwen3.6 35B A3B
3.91
N/A
Groq
5.48
4.20
Scaleway
14.35
10.92
Cloudflare
19.29
15.10
DeepInfra (Turbo)
gpt-oss-120b (high) (Turbo) 10.89
8.38
Cerebras
1.32
0.83
Google Vertex
gpt-oss-120b (high) Vertex 5.34
4.02
Eigen AI
3.50
2.46
Baseten
8.74
6.86
SambaNova
3.90
2.70
Parasail
11.96
9.21
Microsoft Azure
5.64
4.12
CoreWeave
25.92
20.07
Fireworks
5.30
2.19
Nebius Fast
4.28
2.57
Nebius Base
7.17
5.21
DeepInfra
48.38
38.32
Databricks
7.69
5.62
Amazon Bedrock
15.52
11.77
Together AI
4.15
2.95
Novita
16.99
13.17
Amazon Bedrock
Claude 4.5 Haiku
4.84
N/A
Anthropic
Claude 4.5 Haiku
4.31
N/A
Microsoft Azure
Claude 4.5 Haiku
4.83
N/A
Google Vertex
Claude 4.5 Haiku Vertex
4.06
N/A
Alibaba Cloud
Qwen3.5 35B A3B
3.77
N/A
DeepInfra FP8
Qwen3.5 35B A3B FP8
3.64
N/A
Xiaomi
MiMo-V2-Flash
9.85
N/A
CompactifAI
6.36
4.66
SiliconFlow
16.44
12.04
Amazon Bedrock
Nova 2.0 Pro Preview (medium) 32.30
14.28
Parasail (FP8)
Qwen3 Coder Next (FP8)
6.04
N/A
Novita (FP8)
Qwen3 Coder Next (FP8)
3.79
N/A
Amazon Bedrock
Qwen3 Coder Next
12.76
N/A
Mistral
14.30
10.97
Cohere
20.84
16.50
Amazon Bedrock
24.59
9.93
Mistral
67.46
53.49
Parasail
Gemma 4 26B A4B
9.68
N/A
GMI (FP8)
Gemma 4 26B A4B (FP8)
17.38
N/A
SiliconFlow (FP8)
Gemma 4 26B A4B (FP8)
7.10
N/A
Scaleway
Gemma 4 26B A4B
3.44
N/A
DeepInfra (FP8)
Gemma 4 26B A4B (FP8)
19.86
N/A
Novita
Gemma 4 26B A4B
15.84
N/A
DeepInfra (FP8)
69.44
55.17
Alibaba Cloud
14.30
10.56
Nebius (FP8)
24.96
19.43
Eigen AI
7.38
5.43
Nebius Fast
7.91
5.33
Google Vertex
Qwen3 Next 80B A3B Vertex 18.00
14.05
Novita
13.17
9.68
Amazon Bedrock
Nova 2.0 Pro Preview (low) 20.81
12.98
Amazon Bedrock
32.66
10.30
Alibaba Cloud
Qwen3.5 Omni Flash
2.83
N/A
Amazon Bedrock
17.94
10.13
Nebius Base
8.77
6.50
SambaNova
3.86
2.65
Google Vertex
gpt-oss-120b (low) Vertex 5.32
4.01
Fireworks
6.42
4.43
CoreWeave
22.06
16.99
Cerebras
1.62
1.08
Microsoft Azure
5.20
3.71
Cloudflare
22.99
17.75
Baseten
8.86
6.77
Amazon Bedrock
24.30
18.75
Parasail
13.19
10.19
Databricks
7.56
5.53
Together AI
3.98
2.85
Groq
5.48
4.20
Eigen AI
3.95
2.78
Novita
15.33
11.76
OpenAI
GPT-5.4 nano
3.55
N/A
Nebius
26.59
20.61
DeepInfra
30.47
20.37
Microsoft Azure
GPT-5.4 mini
3.14
N/A
OpenAI
GPT-5.4 mini
3.08
N/A
Amazon Bedrock
Nova 2.0 Pro Preview
4.01
N/A
DeepInfra FP8
Qwen3.5 4B FP8
17.27
N/A
Amazon Bedrock
Mistral Large 3
3.46
N/A
Microsoft Azure
Mistral Large 3
6.49
N/A
Mistral
Mistral Large 3
8.83
N/A
Mistral
Devstral 2
10.72
N/A
Nebius (FP8)
Nemotron 3 Nano Omni 30B A3B Reasoning (FP8) 9.06
6.77
DeepInfra
20.75
15.81
Together AI
3.40
2.31
Lightning AI
9.45
7.26
CoreWeave
12.75
9.60
Cloudflare
15.07
11.78
Novita
18.53
14.21
Google Vertex
gpt-oss-20B (high) Vertex 21.22
16.74
Amazon Bedrock
45.35
11.41
Databricks
9.39
7.03
Groq
2.93
2.15
Together AI
3.75
2.75
Lightning AI
9.46
7.28
CoreWeave
11.73
8.78
Cloudflare
14.38
11.23
Amazon Bedrock
30.54
17.51
Novita
17.08
13.07
Databricks
9.12
6.90
Groq
3.01
2.22
Google Vertex
14.94
11.73
Microsoft Azure (FP8)
Llama 4 Maverick (FP8)
2.78
N/A
Amazon Bedrock
Llama 4 Maverick
3.05
N/A
Snowflake
Llama 4 Maverick
4.63
N/A
DeepInfra (FP8)
Llama 4 Maverick (FP8)
13.29
N/A
Databricks
Llama 4 Maverick
6.74
N/A
Novita (FP8)
Llama 4 Maverick (FP8)
14.32
N/A
Parasail (FP8)
Llama 4 Maverick (FP8)
4.45
N/A
DeepInfra
Qwen3 Next 80B A3B
3.87
N/A
Parasail
Qwen3 Next 80B A3B
5.48
N/A
Novita
Qwen3 Next 80B A3B
5.26
N/A
Alibaba Cloud
Qwen3 Next 80B A3B
3.66
N/A
Google Vertex
Qwen3 Next 80B A3B Vertex
2.86
N/A
SiliconFlow
Gemma 4 12B (Non-reasoning)
4.59
N/A
Mistral
Devstral Small 2
11.37
N/A
Amazon Bedrock
Nova Premier
7.87
N/A
DeepInfra
Llama Nemotron Super 49B v1.5 48.98
38.98
Mistral
Mistral Small 4
3.69
N/A
Amazon Bedrock
33.44
26.04
Mistral
23.17
18.18
Sarvam
24.04
18.26
Amazon Bedrock
Nova 2.0 Lite
3.20
N/A
DeepInfra (FP8)
78.31
62.27
CompactifAI
Llama 4 Scout
5.85
N/A
Microsoft Azure
Llama 4 Scout
4.81
N/A
Cloudflare
Llama 4 Scout
9.06
N/A
Amazon Bedrock
Llama 4 Scout
2.88
N/A
Novita
Llama 4 Scout
8.13
N/A
Google Vertex
Llama 4 Scout Vertex
3.71
N/A
Groq
Llama 4 Scout
1.60
N/A
DeepInfra
Llama 4 Scout
6.91
N/A
Nebius (FP8)
31.09
24.35
Mistral
Ministral 3 14B
6.60
N/A
Amazon Bedrock
Ministral 3 14B
2.86
N/A
Alibaba Cloud
27.12
20.96
Nebius Base
Llama Nemotron Ultra Base 48.54
38.27
Nebius (FP8)
61.65
48.68
DeepInfra (FP8)
NVIDIA Nemotron Nano 12B v2 VL (FP8) 8.94
6.98
Amazon Bedrock
Ministral 3 8B
2.64
N/A
Mistral
Ministral 3 8B
5.65
N/A
DeepInfra
NVIDIA Nemotron Nano 9B V2 36.64
22.46
Nebius (FP8)
Hermes 4 405B (FP8)
12.38
N/A
DeepInfra FP8
Qwen3.5 2B FP8
17.90
N/A
DeepInfra
Llama Nemotron Super 49B v1.5
9.91
N/A
Parasail (FP8)
Llama 3.3 70B (FP8)
6.32
N/A
SambaNova
Llama 3.3 70B
1.97
N/A
Snowflake Snowflake
Llama 3.3 70B Snowflake
3.80
N/A
Nebius Base
Llama 3.3 70B Base
19.00
N/A
FriendliAI
Llama 3.3 70B
3.82
N/A
Hyperbolic
Llama 3.3 70B
9.24
N/A
Makora FP8
Llama 3.3 70B FP8
1.72
N/A
CoreWeave
Llama 3.3 70B
6.13
N/A
CompactifAI
Llama 3.3 70B
2.51
N/A
Together AI Turbo
Llama 3.3 70B Turbo
8.64
N/A
Microsoft Azure
Llama 3.3 70B
4.40
N/A
DeepInfra (Turbo, FP8)
Llama 3.3 70B (Turbo, FP8)
31.30
N/A
Cloudflare
Llama 3.3 70B
7.63
N/A
Amazon Bedrock
Llama 3.3 70B
4.16
N/A
Groq
Llama 3.3 70B
1.54
N/A
Google Vertex
Llama 3.3 70B Vertex
3.13
N/A
Databricks
Llama 3.3 70B
7.17
N/A
Scaleway
Llama 3.3 70B
6.29
N/A
Novita
Llama 3.3 70B
12.43
N/A
Amazon Bedrock Standard
Llama 3.1 405B Standard
27.71
N/A
Amazon Bedrock Latency Optimized
Llama 3.1 405B Latency Optimized
6.41
N/A
Microsoft Azure
Llama 3.1 405B
17.64
N/A
Liquid AI
12.66
8.62
Microsoft Azure
Command A
12.60
N/A
Cohere
Command A
6.88
N/A
DeepInfra
Llama 3.1 Nemotron 70B
6.01
N/A
DeepInfra
NVIDIA Nemotron 3 Nano
5.66
N/A
DeepInfra
NVIDIA Nemotron Nano 9B V2
13.80
N/A
Amazon Bedrock
NVIDIA Nemotron Nano 9B V2
3.79
N/A
Nebius (FP8)
Hermes 4 70B (FP8)
6.34
N/A
CoreWeave
Granite 4.1 8B
4.26
N/A
Sarvam
13.09
9.54
Amazon Bedrock
Llama 3.2 90B (Vision)
8.87
N/A
Microsoft Azure
Llama 3.2 90B (Vision)
12.68
N/A
Mistral
Ministral 3 3B
3.11
N/A
Amazon Bedrock
Ministral 3 3B
1.70
N/A
AI21 Labs
Jamba 1.7 Large
9.52
N/A
Replicate
Granite 4.0 H Small
10.27
N/A
Alibaba Cloud
Qwen3 Omni 30B A3B
5.73
N/A
DeepInfra (FP8)
61.29
48.65
Together AI
LFM2 24B A2B
4.31
N/A
Microsoft Azure
Phi-4
13.02
N/A
DeepInfra
Phi-4
7.36
N/A
Amazon Bedrock
Nova Micro
2.59
N/A
Amazon Bedrock
NVIDIA Nemotron Nano 12B v2 VL
3.22
N/A
DeepInfra (FP8)
NVIDIA Nemotron Nano 12B v2 VL (FP8)
1.89
N/A
Microsoft Azure
Phi-4 Multimodal
32.43
N/A
DeepInfra FP8
Qwen3.5 0.8B FP8
12.41
N/A
Reka AI
27.75
20.82
DeepInfra
Llama 3.2 11B (Vision)
9.85
N/A
Amazon Bedrock
Llama 3.2 11B (Vision)
3.40
N/A
Microsoft Azure
Llama 3.2 11B (Vision)
7.26
N/A
CoreWeave
Phi-4 Mini
2.65
N/A
Microsoft Azure
Phi-4 Mini
12.39
N/A
Liquid AI
LFM2 2.6B
2.71
N/A
Liquid AI
LFM2.5-1.2B-Instruct
2.09
N/A
Liquid AI
LFM2.5-VL-1.6B
2.12
N/A
Cohere
Tiny Aya Global
4.03
N/A
Together AI
26.71
20.98