Anthropic
35.55
N/A
Amazon Bedrock
34.66
N/A
Google
40.13
N/A
Amazon Bedrock
55.20
N/A
OpenAI
25.77
N/A
Google
24.83
N/A
Microsoft Azure
31.08
N/A
Amazon Bedrock
8.32
N/A
Anthropic
24.50
N/A
Amazon Bedrock
26.73
N/A
OpenAI
21.37
N/A
Makora (FP8)
15.22
11.70
Wafer
26.55
20.57
Fireworks
28.78
22.34
Together AI
18.76
14.50
Baseten
9.98
7.17
FriendliAI
16.44
12.89
Novita (FP8)
45.77
34.94
GMI (FP8)
24.53
18.16
Databricks
12.78
9.15
Parasail (FP8)
14.18
10.65
CoreWeave
13.67
10.51
DeepInfra (FP8)
54.64
43.05
SiliconFlow (FP8)
48.52
36.41
Amazon Bedrock
12.31
N/A
OpenAI
11.50
N/A
Google AI Studio
Gemini 3.5 Flash AI Studio
17.41
N/A
Anthropic
81.60
N/A
Microsoft Azure
49.77
N/A
Amazon Bedrock
70.26
N/A
Google
41.13
N/A
Google (AI Studio)
Gemini 3.1 Pro Preview (AI Studio)
37.44
N/A
Google (Vertex)
Gemini 3.1 Pro Preview (Vertex)
28.59
N/A
Alibaba Cloud
15.89
11.81
GMI (FP8)
58.14
46.43
Novita
56.51
45.74
Google
Gemini 3.5 Flash (medium)
15.36
N/A
Parasail (MXFP8)
103.58
81.66
Together AI
35.88
27.96
Novita
35.34
26.73
SiliconFlow
39.07
29.62
MiniMax
43.08
32.87
Makora (MXFP8)
26.46
20.78
GMI
36.65
26.97
DeepInfra (FP4)
DeepSeek V4 Pro (Max) (FP4)
127.07
113.47
Microsoft Azure
72.96
64.49
SiliconFlow
64.96
57.00
Novita
59.45
52.34
Lightning AI
30.02
26.54
GMI
52.99
45.99
Nebius
115.38
102.61
Together AI
25.31
21.91
Fireworks
40.02
35.35
DeepSeek
48.68
42.64
OpenAI
63.13
N/A
Amazon Bedrock
8.56
N/A
OpenAI
9.81
N/A
Databricks
20.43
17.83
Crusoe
11.50
9.79
CoreWeave
21.10
18.13
Novita
82.51
73.11
GMI FP8
126.46
111.55
Together AI (FP4)
13.77
11.63
Fireworks
12.76
10.96
SiliconFlow (FP8)
153.40
135.74
DeepInfra (FP4)
87.81
78.15
Parasail
97.18
86.84
Nebius
71.54
63.17
Eigen AI (FP4)
22.98
20.11
Cloudflare
77.61
67.53
Microsoft Azure
25.11
21.89
Kimi
147.74
131.54
Anthropic
Claude Opus 4.7 (Non-reasoning, high)
10.29
N/A
Amazon Bedrock
Claude Opus 4.7 (Non-reasoning, high)
8.34
N/A
Novita
51.77
40.16
GMI
114.14
41.05
DeepInfra
43.65
34.62
Xiaomi
52.13
40.03
GMI (FP8)
53.46
41.90
CoreWeave
10.06
7.46
Parasail
55.85
45.09
Together AI
18.63
14.90
DeepInfra
53.28
42.99
Novita
46.27
36.75
Crusoe
6.86
5.13
Databricks
20.16
15.97
Makora
13.93
10.94
Kimi
60.09
48.14
DeepInfra (FP4)
DeepSeek V4 Pro (High) (FP4)
55.11
43.09
Fireworks
21.20
16.44
Microsoft Azure
33.20
25.68
DeepSeek
26.08
19.99
SiliconFlow (FP8)
DeepSeek V4 Pro (High) (FP8)
31.93
24.40
Nebius
50.01
38.84
Together AI
13.57
10.19
Makora
15.05
11.56
Lightning AI
15.42
11.95
Novita
31.34
24.10
Baseten
19.73
15.24
GMI
58.41
51.74
Makora
20.67
18.60
DeepInfra FP4
DeepSeek V4 Flash (Max) FP4
402.09
368.47
SiliconFlow (FP8)
DeepSeek V4 Flash (Max) (FP8)
53.16
47.63
Parasail (FP8)
DeepSeek V4 Flash (Max) (FP8)
182.08
166.49
Novita
52.60
47.17
DeepSeek
51.60
46.42
Wafer
66.40
57.93
DeepInfra (FP4)
45.88
39.93
Fireworks
22.38
19.28
Parasail (FP8)
57.15
49.79
Novita (FP8)
75.47
64.70
CoreWeave
22.72
19.21
Nebius (FP8, Base)
130.22
114.35
Together AI
18.41
15.85
SiliconFlow
88.14
74.90
FriendliAI
28.02
24.46
Novita
29.15
22.21
Parasail
20.22
15.75
DeepInfra
13.17
10.21
Xiaomi
40.98
31.01
Microsoft Azure
119.63
N/A
OpenAI
6.61
N/A
xAI
35.99
28.47
Alibaba Cloud
116.69
105.58
Alibaba Cloud
50.30
38.86
OpenAI
5.96
N/A
Together AI
6.50
4.82
GMI (FP8)
63.42
50.87
Fireworks
28.80
23.36
MiniMax
63.86
51.69
Novita (FP8)
50.99
41.31
SambaNova
7.23
5.37
CoreWeave
11.02
8.30
Together AI
14.00
11.02
DeepInfra
40.89
29.13
Nebius
24.33
18.22
Lightning AI
33.70
27.26
Blackbox AI
6.34
4.62
xAI
23.51
N/A
Amazon Bedrock
19.28
N/A
Microsoft Azure
38.62
N/A
CoreWeave
38.83
26.97
DeepInfra (FP4)
DeepSeek V4 Flash (High) (FP4)
112.05
79.26
GMI
16.21
10.10
Makora
6.18
4.12
Parasail (FP8)
DeepSeek V4 Flash (High) (FP8)
48.34
34.00
SiliconFlow (FP8)
DeepSeek V4 Flash (High) (FP8)
16.36
10.80
Novita
16.54
11.03
DeepSeek
15.74
10.51
Groq
15.04
13.22
SiliconFlow (FP8)
158.72
144.34
DeepInfra FP8
79.19
72.42
Makora FP4
53.92
49.11
Novita
99.46
90.47
Alibaba Cloud
97.97
88.80
Xiaomi
38.28
28.59
Amazon Bedrock
10.88
N/A
Microsoft Azure
22.95
N/A
xAI
16.34
N/A
Amazon Bedrock
Claude Sonnet 4.6 (Non-reasoning)
8.20
N/A
Microsoft Azure
Claude Sonnet 4.6 (Non-reasoning)
9.69
N/A
Google
Claude Sonnet 4.6 (Non-reasoning)
8.90
N/A
Anthropic
Claude Sonnet 4.6 (Non-reasoning)
9.39
N/A
Amazon Bedrock
5.92
N/A
Microsoft Azure
12.21
N/A
xAI
8.58
N/A
StreamLake
KAT-Coder-Pro V2
5.14
N/A
OpenAI
GPT-5.5 (Non-reasoning)
10.65
N/A
Amazon Bedrock
GPT-5.5 (Non-reasoning)
7.54
N/A
Wafer
GLM-5.1
8.90
N/A
Parasail
GLM-5.1
7.26
N/A
DeepInfra (FP4)
GLM-5.1 (FP4)
6.36
N/A
Novita (FP8)
GLM-5.1 (FP8)
11.02
N/A
FriendliAI
GLM-5.1
3.58
N/A
SiliconFlow (FP8)
GLM-5.1 (FP8)
13.69
N/A
Nebius (FP8, Base)
GLM-5.1 (FP8, Base)
14.98
N/A
Xiaomi
42.23
31.55
Google AI Studio
Gemini 3.5 Flash (minimal) AI Studio
3.01
N/A
Databricks
Kimi K2.6
2.85
N/A
Together AI (FP4)
Kimi K2.6 (FP4)
2.21
N/A
SiliconFlow (FP8)
Kimi K2.6 (FP8)
16.65
N/A
Fireworks
Kimi K2.6
1.69
N/A
Parasail (INT4)
Kimi K2.6 (INT4)
9.63
N/A
DeepInfra (FP4)
Kimi K2.6 (FP4)
10.54
N/A
CoreWeave
Kimi K2.6
3.53
N/A
Microsoft Azure
Kimi K2.6
3.41
N/A
Crusoe
Kimi K2.6
2.55
N/A
Kimi
Kimi K2.6
14.04
N/A
Nebius
Kimi K2.6
5.27
N/A
Novita
Kimi K2.6
9.36
N/A
Anthropic
Claude Sonnet 4.6 (Non-reasoning, Low Effort)
9.14
N/A
Wafer
34.01
26.90
Together AI
40.10
13.04
Parasail
49.63
42.35
SiliconFlow (FP8)
34.77
28.71
DeepInfra (FP8)
82.11
70.61
DigitalOcean
136.44
117.43
GMI (FP8)
73.51
61.58
Nebius (Base, FP4)
Qwen3.5 397B A17B (Base, FP4)
30.38
25.26
Eigen AI
13.24
10.90
Alibaba Cloud
72.45
61.20
Novita
72.32
61.47
GMI
42.68
31.85
SiliconFlow
19.84
14.22
Xiaomi
37.67
28.04
SiliconFlow (FP8)
49.82
38.75
DeepInfra (FP8)
58.97
46.80
Alibaba Cloud
17.89
13.46
GMI (FP8)
18.50
13.35
Novita
27.00
20.95
Wafer
Qwen3.5 397B A17B
6.25
N/A
Together AI
Qwen3.5 397B A17B
2.91
N/A
DeepInfra (FP8)
Qwen3.5 397B A17B (FP8)
10.01
N/A
Eigen AI
Qwen3.5 397B A17B
2.31
N/A
Alibaba Cloud
Qwen3.5 397B A17B
11.31
N/A
Nebius Fast
Qwen3.5 397B A17B Fast
2.93
N/A
Nebius (Base, FP4)
Qwen3.5 397B A17B (Base, FP4)
4.99
N/A
DigitalOcean
Qwen3.5 397B A17B
80.00
N/A
Novita
Qwen3.5 397B A17B
10.75
N/A
Alibaba Cloud
35.87
31.60
SiliconFlow (FP8)
90.68
81.68
Scaleway
32.20
28.80
DeepInfra (FP8)
46.56
42.38
Makora FP4
113.65
103.61
Parasail
68.35
62.02
Novita
36.09
32.25
Microsoft Azure
DeepSeek V4 Pro
6.89
N/A
Nebius
DeepSeek V4 Pro
11.59
N/A
Makora
DeepSeek V4 Pro
3.12
N/A
Lightning AI
DeepSeek V4 Pro
3.24
N/A
DeepSeek
DeepSeek V4 Pro
6.13
N/A
Alibaba Cloud
Qwen3.5 Omni Plus
10.92
N/A
InclusionAI
19.81
14.33
Microsoft Azure
20.60
N/A
OpenAI
8.73
N/A
OpenAI
5.76
N/A
Mistral
22.53
17.43
Microsoft Azure
6.75
N/A
OpenAI
5.86
N/A
StepFun
6.83
4.82
Amazon Bedrock
13.05
N/A
Google Vertex
14.36
N/A
Anthropic
16.19
N/A
Microsoft Azure
16.48
N/A
CoreWeave
44.75
34.32
GMI (FP8)
111.06
83.12
Novita
101.15
77.25
Parasail
39.77
30.33
DeepInfra
83.19
64.27
SiliconFlow (FP8)
38.36
28.60
SambaNova
12.93
8.75
FriendliAI
32.14
24.47
Lightning AI
8.60
6.43
Together AI
32.36
24.75
Google (AI Studio)
64.72
49.50
Cohere
12.78
10.08
Groq
Qwen3.6 27B
1.69
N/A
Makora FP4
Qwen3.6 27B FP4
4.60
N/A
DeepInfra FP8
Qwen3.6 27B FP8
6.56
N/A
Alibaba Cloud
Qwen3.6 27B
9.17
N/A
Novita
Qwen3.6 27B
60.47
N/A
DeepSeek
DeepSeek V4 Flash
5.15
N/A
GMI
DeepSeek V4 Flash
8.21
N/A
Makora
DeepSeek V4 Flash
1.94
N/A
CoreWeave
DeepSeek V4 Flash
10.17
N/A
Novita
KAT-Coder-Pro V1
5.21
N/A
DeepInfra FP8
Qwen3.5 122B A10B FP8
10.60
N/A
Alibaba Cloud
Qwen3.5 122B A10B
3.89
N/A
GMI
MiMo-V2.5-Pro
14.92
N/A
DeepInfra
MiMo-V2.5-Pro
10.27
N/A
Xiaomi
MiMo-V2.5-Pro
10.76
N/A
Novita
MiMo-V2.5-Pro
11.50
N/A
Google Vertex
28.53
N/A
Google (AI Studio)
Gemini 2.5 Pro (AI Studio)
22.81
N/A
GMI
Hy3-preview
9.48
N/A
SiliconFlow
Hy3-preview
5.46
N/A
StepFun
12.96
9.69
Cloudflare
23.12
18.15
DeepInfra
83.82
66.73
Parasail
53.93
42.73
Novita
74.41
58.54
Google AI Studio
Gemma 4 26B A4B AI Studio
54.29
42.57
GMI (FP8)
74.85
57.29
CoreWeave
18.28
13.90
Nebius
6.81
4.56
Inception
3.58
N/A
Google (AI Studio)
Gemini 3.1 Flash-Lite (AI Studio)
6.22
N/A
SiliconFlow (FP8)
55.89
43.62
Together AI (FP8)
40.06
31.75
Parasail
Gemma 4 31B
7.46
N/A
FriendliAI
Gemma 4 31B
6.74
N/A
Together AI (FP8)
Gemma 4 31B (FP8)
6.77
N/A
SiliconFlow (FP8)
Gemma 4 31B (FP8)
8.40
N/A
DeepInfra (FP8)
Gemma 4 31B (FP8)
17.47
N/A
SambaNova
Gemma 4 31B
4.18
N/A
Novita
Gemma 4 31B
20.03
N/A
Amazon Bedrock
Grok 4.3 (Non-reasoning)
2.88
N/A
xAI
Grok 4.3 (Non-reasoning)
4.04
N/A
Parasail (FP8)
Trinity Large Thinking (FP8)
16.49
12.76
Arcee AI
9.04
6.61
Scaleway
Qwen3.6 35B A3B
3.17
N/A
Makora FP4
Qwen3.6 35B A3B FP4
8.02
N/A
DeepInfra (FP8)
Qwen3.6 35B A3B (FP8)
6.52
N/A
Novita
Qwen3.6 35B A3B
3.32
N/A
Parasail (FP8)
Qwen3.6 35B A3B (FP8)
5.92
N/A
Alibaba Cloud
Qwen3.6 35B A3B
3.97
N/A
Groq
5.48
4.20
Scaleway
14.49
11.04
Cloudflare
19.83
15.53
DeepInfra (Turbo)
gpt-oss-120b (high) (Turbo)
11.19
8.62
Cerebras
1.29
0.80
Google Vertex
gpt-oss-120b (high) Vertex
5.39
4.06
Eigen AI
3.50
2.45
Baseten
8.98
7.05
SambaNova
3.94
2.70
Parasail
11.97
9.21
Microsoft Azure
5.69
4.15
CoreWeave
25.92
20.07
Fireworks
5.13
2.19
Nebius Fast
4.27
2.57
Nebius Base
7.22
5.24
DeepInfra
48.38
38.32
Databricks
7.67
5.61
Amazon Bedrock
15.94
12.10
Together AI
4.16
2.95
Novita
16.99
13.17
Amazon Bedrock
Claude 4.5 Haiku
4.84
N/A
Anthropic
Claude 4.5 Haiku
4.31
N/A
Microsoft Azure
Claude 4.5 Haiku
4.83
N/A
Google Vertex
Claude 4.5 Haiku Vertex
4.06
N/A
Alibaba Cloud
Qwen3.5 35B A3B
3.78
N/A
DeepInfra FP8
Qwen3.5 35B A3B FP8
3.49
N/A
Xiaomi
MiMo-V2-Flash
9.60
N/A
CompactifAI
6.36
4.66
SiliconFlow
16.44
12.04
Amazon Bedrock
Nova 2.0 Pro Preview (medium)
33.84
13.82
Parasail (FP8)
Qwen3 Coder Next (FP8)
5.75
N/A
Novita (FP8)
Qwen3 Coder Next (FP8)
3.87
N/A
Amazon Bedrock
Qwen3 Coder Next
12.98
N/A
Mistral
14.30
10.96
Cohere
20.81
16.48
Amazon Bedrock
25.64
9.93
Mistral
67.46
53.49
Parasail
Gemma 4 26B A4B
9.11
N/A
GMI (FP8)
Gemma 4 26B A4B (FP8)
17.58
N/A
SiliconFlow (FP8)
Gemma 4 26B A4B (FP8)
7.10
N/A
Scaleway
Gemma 4 26B A4B
3.51
N/A
DeepInfra (FP8)
Gemma 4 26B A4B (FP8)
19.54
N/A
Novita
Gemma 4 26B A4B
16.26
N/A
DeepInfra (FP8)
69.40
55.17
Alibaba Cloud
14.19
10.49
Nebius (FP8)
24.96
19.43
Eigen AI
7.37
5.43
Nebius Fast
7.91
5.33
Google Vertex
Qwen3 Next 80B A3B Vertex
17.86
13.94
Novita
13.17
9.68
Amazon Bedrock
Nova 2.0 Pro Preview (low)
20.72
12.92
Amazon Bedrock
32.20
9.93
Alibaba Cloud
Qwen3.5 Omni Flash
2.84
N/A
Amazon Bedrock
17.94
10.13
Nebius Base
8.77
6.50
SambaNova
3.84
2.65
Google Vertex
gpt-oss-120b (low) Vertex
5.32
4.01
Fireworks
6.39
4.40
CoreWeave
22.06
16.99
Cerebras
1.62
1.08
Microsoft Azure
5.27
3.78
Cloudflare
22.99
17.75
Baseten
8.72
6.67
Amazon Bedrock
24.30
18.75
Parasail
14.47
11.21
Databricks
7.55
5.52
Together AI
4.00
2.85
Groq
5.47
4.20
Eigen AI
3.95
2.78
Novita
16.04
12.25
OpenAI
GPT-5.4 nano
3.55
N/A
Nebius
26.59
20.61
DeepInfra
30.72
20.01
Microsoft Azure
GPT-5.4 mini
3.17
N/A
OpenAI
GPT-5.4 mini
3.13
N/A
DeepInfra FP8
Qwen3.5 4B FP8
17.27
N/A
Amazon Bedrock
Mistral Large 3
3.45
N/A
Microsoft Azure
Mistral Large 3
6.49
N/A
Mistral
Mistral Large 3
8.82
N/A
Mistral
Devstral 2
10.72
N/A
Nebius (FP8)
Nemotron 3 Nano Omni 30B A3B Reasoning (FP8)
9.06
6.77
DeepInfra
20.80
15.87
Together AI
3.40
2.31
Lightning AI
9.45
7.26
CoreWeave
12.28
9.23
Cloudflare
15.11
11.81
Novita
17.71
13.55
Google Vertex
gpt-oss-20B (high) Vertex
15.71
12.27
Amazon Bedrock
40.53
11.38
Databricks
9.83
7.38
Groq
2.95
2.16
Amazon Bedrock
Nova 2.0 Pro Preview
3.86
N/A
Together AI
3.54
2.60
Lightning AI
9.46
7.28
CoreWeave
11.16
8.32
Cloudflare
14.69
11.48
Amazon Bedrock
30.54
17.51
Novita
16.55
12.64
Databricks
9.51
7.21
Groq
3.01
2.22
Google Vertex
15.07
11.84
Microsoft Azure (FP8)
Llama 4 Maverick (FP8)
2.99
N/A
Amazon Bedrock
Llama 4 Maverick
3.05
N/A
Snowflake
Llama 4 Maverick
4.63
N/A
DeepInfra (FP8)
Llama 4 Maverick (FP8)
13.35
N/A
Databricks
Llama 4 Maverick
6.71
N/A
Novita (FP8)
Llama 4 Maverick (FP8)
15.21
N/A
Parasail (FP8)
Llama 4 Maverick (FP8)
4.47
N/A
DeepInfra
Qwen3 Next 80B A3B
3.78
N/A
Parasail
Qwen3 Next 80B A3B
5.44
N/A
Novita
Qwen3 Next 80B A3B
5.28
N/A
Alibaba Cloud
Qwen3 Next 80B A3B
3.63
N/A
Google Vertex
Qwen3 Next 80B A3B Vertex
2.84
N/A
SiliconFlow
Gemma 4 12B (Non-reasoning)
4.60
N/A
Mistral
Devstral Small 2
11.42
N/A
Amazon Bedrock
Nova Premier
7.73
N/A
DeepInfra
Llama Nemotron Super 49B v1.5
48.89
38.91
Mistral
Mistral Small 4
3.74
N/A
Amazon Bedrock
33.44
26.04
Mistral
23.19
18.19
Sarvam
24.04
18.26
Amazon Bedrock
Nova 2.0 Lite
3.20
N/A
DeepInfra (FP8)
83.36
66.30
CompactifAI
Llama 4 Scout
5.91
N/A
Microsoft Azure
Llama 4 Scout
4.76
N/A
Cloudflare
Llama 4 Scout
9.05
N/A
Amazon Bedrock
Llama 4 Scout
2.88
N/A
Novita
Llama 4 Scout
8.28
N/A
Google Vertex
Llama 4 Scout Vertex
3.69
N/A
Groq
Llama 4 Scout
1.61
N/A
DeepInfra
Llama 4 Scout
7.30
N/A
Nebius (FP8)
31.66
24.81
Mistral
Ministral 3 14B
6.60
N/A
Amazon Bedrock
Ministral 3 14B
2.86
N/A
Alibaba Cloud
27.04
20.90
Nebius Base
Llama Nemotron Ultra Base
48.54
38.27
Nebius (FP8)
61.65
48.68
DeepInfra (FP8)
NVIDIA Nemotron Nano 12B v2 VL (FP8)
9.08
7.09
Amazon Bedrock
Ministral 3 8B
2.64
N/A
Mistral
Ministral 3 8B
5.66
N/A
DeepInfra
NVIDIA Nemotron Nano 9B V2
36.69
22.63
Nebius (FP8)
Hermes 4 405B (FP8)
12.51
N/A
DeepInfra FP8
Qwen3.5 2B FP8
17.90
N/A
DeepInfra
Llama Nemotron Super 49B v1.5
9.88
N/A
Parasail (FP8)
Llama 3.3 70B (FP8)
6.09
N/A
SambaNova
Llama 3.3 70B
1.97
N/A
Snowflake
Llama 3.3 70B
3.75
N/A
Nebius Base
Llama 3.3 70B Base
19.00
N/A
FriendliAI
Llama 3.3 70B
3.82
N/A
Hyperbolic
Llama 3.3 70B
8.20
N/A
Makora FP8
Llama 3.3 70B FP8
1.73
N/A
CoreWeave
Llama 3.3 70B
6.13
N/A
CompactifAI
Llama 3.3 70B
2.51
N/A
Together AI Turbo
Llama 3.3 70B Turbo
8.64
N/A
Microsoft Azure
Llama 3.3 70B
4.40
N/A
DeepInfra (Turbo, FP8)
Llama 3.3 70B (Turbo, FP8)
30.97
N/A
Cloudflare
Llama 3.3 70B
7.63
N/A
Amazon Bedrock
Llama 3.3 70B
4.15
N/A
Groq
Llama 3.3 70B
1.54
N/A
Google Vertex
Llama 3.3 70B Vertex
3.12
N/A
Databricks
Llama 3.3 70B
7.17
N/A
Scaleway
Llama 3.3 70B
6.31
N/A
Novita
Llama 3.3 70B
12.43
N/A
Amazon Bedrock Standard
Llama 3.1 405B Standard
27.71
N/A
Amazon Bedrock Latency Optimized
Llama 3.1 405B Latency Optimized
6.36
N/A
Microsoft Azure
Llama 3.1 405B
17.64
N/A
Liquid AI
12.73
8.66
Microsoft Azure
Command A
12.57
N/A
Cohere
Command A
6.96
N/A
DeepInfra
Llama 3.1 Nemotron 70B
6.02
N/A
DeepInfra
NVIDIA Nemotron 3 Nano
5.66
N/A
DeepInfra
NVIDIA Nemotron Nano 9B V2
13.80
N/A
Amazon Bedrock
NVIDIA Nemotron Nano 9B V2
3.78
N/A
Nebius (FP8)
Hermes 4 70B (FP8)
6.34
N/A
CoreWeave
Granite 4.1 8B
4.27
N/A
Sarvam
14.47
10.64
Amazon Bedrock
Llama 3.2 90B (Vision)
8.87
N/A
Microsoft Azure
Llama 3.2 90B (Vision)
12.85
N/A
Mistral
Ministral 3 3B
3.14
N/A
Amazon Bedrock
Ministral 3 3B
1.69
N/A
AI21 Labs
Jamba 1.7 Large
9.52
N/A
Replicate
Granite 4.0 H Small
10.27
N/A
Alibaba Cloud
Qwen3 Omni 30B A3B
5.73
N/A
DeepInfra (FP8)
60.40
47.96
Together AI
LFM2 24B A2B
4.26
N/A
Microsoft Azure
Phi-4
13.28
N/A
DeepInfra
Phi-4
7.24
N/A
Amazon Bedrock
Nova Micro
2.65
N/A
Amazon Bedrock
NVIDIA Nemotron Nano 12B v2 VL
3.22
N/A
DeepInfra (FP8)
NVIDIA Nemotron Nano 12B v2 VL (FP8)
1.92
N/A
Microsoft Azure
Phi-4 Multimodal
32.45
N/A
DeepInfra FP8
Qwen3.5 0.8B FP8
12.02
N/A
Reka AI
27.75
20.82
DeepInfra
Llama 3.2 11B (Vision)
9.85
N/A
Amazon Bedrock
Llama 3.2 11B (Vision)
3.40
N/A
Microsoft Azure
Llama 3.2 11B (Vision)
7.26
N/A
CoreWeave
Phi-4 Mini
2.65
N/A
Microsoft Azure
Phi-4 Mini
12.39
N/A
Liquid AI
LFM2 2.6B
2.74
N/A
Liquid AI
LFM2.5-1.2B-Instruct
2.16
N/A
Liquid AI
LFM2.5-VL-1.6B
2.00
N/A
Cohere
Tiny Aya Global
4.03
N/A
Together AI
26.74
21.00