Anthropic
32.76
N/A
Amazon Bedrock
53.64
N/A
Google
42.54
N/A
Amazon Bedrock
64.30
N/A
OpenAI
85.08
N/A
Google
34.07
N/A
Microsoft Azure
46.24
N/A
Amazon Bedrock
8.96
N/A
Anthropic
66.52
N/A
Amazon Bedrock
43.88
N/A
OpenAI
42.58
N/A
Amazon Bedrock
250.63
N/A
OpenAI
82.40
N/A
Microsoft Azure
113.98
N/A
Makora (FP8)
14.86
11.45
Wafer
33.67
24.32
Fireworks
39.52
29.78
Together AI
9.24
7.03
FriendliAI
24.31
16.54
Novita (FP8)
51.07
39.46
GMI (FP8)
21.27
11.53
Parasail (FP8)
23.78
18.44
DeepInfra (FP8)
72.82
56.94
SiliconFlow (FP8)
37.49
27.34
Google AI Studio
Gemini 3.5 Flash AI Studio 16.57
N/A
Anthropic
68.80
N/A
Microsoft Azure
70.89
N/A
Amazon Bedrock
51.09
N/A
Google
57.93
N/A
Amazon Bedrock
17.56
N/A
OpenAI
17.56
N/A
Google (AI Studio)
Gemini 3.1 Pro Preview (AI Studio) 26.81
N/A
Google (Vertex)
Gemini 3.1 Pro Preview (Vertex) 44.54
N/A
Alibaba Cloud
28.51
22.17
GMI (FP8)
68.52
52.41
Novita
57.69
46.68
Google
Gemini 3.5 Flash (medium) 14.85
N/A
Parasail (MXFP8)
86.33
68.08
Together AI
40.76
31.86
Novita
35.80
27.58
SiliconFlow
34.89
26.88
MiniMax
36.49
27.92
Makora (MXFP8)
16.59
12.94
GMI
38.90
28.10
DeepInfra (FP4)
DeepSeek V4 Pro (Max) (FP4) 185.69
165.66
Microsoft Azure
62.96
55.56
SiliconFlow
76.55
67.41
Novita
62.91
55.48
Lightning AI
31.65
27.95
GMI
56.67
49.36
Nebius
97.67
86.63
Together AI
27.65
24.04
Fireworks
43.22
38.27
DeepSeek
52.21
45.93
OpenAI
48.15
N/A
Microsoft Azure
22.16
N/A
Anthropic
23.22
N/A
Amazon Bedrock
17.80
N/A
Google
20.89
N/A
Databricks
19.66
17.18
CoreWeave
30.57
26.67
Novita
77.59
68.66
GMI FP8
120.90
106.67
Together AI (FP4)
14.06
11.90
Makora (FP4)
30.26
26.76
Fireworks
12.68
10.97
SiliconFlow (FP8)
142.65
127.11
DeepInfra (FP4)
162.21
145.27
Parasail
88.78
79.20
Nebius
53.26
46.77
Eigen AI (FP4)
22.19
19.41
Cloudflare
116.62
101.71
Microsoft Azure
28.73
25.11
Kimi
156.88
139.91
Anthropic
Claude Opus 4.7 (Non-reasoning, high)
8.53
N/A
Amazon Bedrock
Claude Opus 4.7 (Non-reasoning, high)
8.43
N/A
Novita
40.10
30.89
GMI
40.64
29.00
DeepInfra
53.82
34.21
Xiaomi
39.15
30.08
OpenAI
61.74
N/A
Databricks
89.00
N/A
Microsoft Azure
49.56
N/A
GMI (FP8)
55.46
42.64
CoreWeave
11.24
8.47
Parasail
51.58
41.72
Together AI
19.73
15.80
DeepInfra
82.85
67.13
Novita
43.46
34.61
Databricks
19.71
15.69
Makora
13.12
10.35
Kimi
53.50
42.86
Amazon Bedrock
7.57
N/A
OpenAI
6.50
N/A
DeepInfra (FP4)
DeepSeek V4 Pro (High) (FP4) 94.13
74.60
Fireworks
22.68
17.50
Microsoft Azure
34.21
26.51
DeepSeek
30.70
23.71
SiliconFlow (FP8)
DeepSeek V4 Pro (High) (FP8) 33.51
25.62
Nebius
58.97
46.08
Together AI
14.22
10.68
Makora
17.92
13.81
Lightning AI
16.44
12.80
Novita
33.33
25.80
Baseten
23.37
17.99
Google Vertex
28.70
N/A
Anthropic
34.76
N/A
Amazon Bedrock
24.96
N/A
Microsoft Azure
29.69
N/A
Xiaomi
53.24
43.71
GMI
56.36
49.30
Makora
24.08
21.62
DeepInfra FP4
DeepSeek V4 Flash (Max) FP4 314.42
287.88
SiliconFlow (FP8)
DeepSeek V4 Flash (Max) (FP8) 61.68
55.71
Parasail (FP8)
DeepSeek V4 Flash (Max) (FP8) 194.01
177.60
Novita
62.26
56.09
DeepSeek
62.57
56.56
Wafer
32.84
28.49
Baseten
27.85
22.33
DeepInfra (FP4)
58.64
51.23
Fireworks
22.04
18.87
Parasail (FP8)
61.00
53.21
Novita (FP8)
62.01
52.90
CoreWeave
22.36
18.90
Nebius (FP8, Base)
121.28
106.46
Together AI
17.59
15.08
SiliconFlow
72.43
61.11
FriendliAI
34.63
30.23
OpenAI
12.54
N/A
Microsoft Azure
15.35
N/A
Novita
25.67
19.59
Parasail
17.58
13.73
DeepInfra
13.34
10.32
Xiaomi
30.05
22.50
Alibaba Cloud
60.92
47.21
Microsoft Azure
114.31
N/A
OpenAI
8.09
N/A
Alibaba Cloud
117.39
106.27
Databricks
Gemini 3 Pro Preview (high) 32.34
N/A
Parasail (FP8)
120.35
103.12
DeepInfra FP8
114.95
98.17
Google
19.07
15.81
Nebius (FP4)
45.91
38.53
Together AI (FP4)
22.74
19.07
Novita FP8
59.30
49.79
FriendliAI
27.26
23.19
Amazon Bedrock
12.99
N/A
OpenAI
3.93
N/A
Alibaba Cloud
50.20
38.81
OpenAI
30.55
N/A
Microsoft Azure
26.81
N/A
Databricks
31.93
N/A
OpenAI
6.35
N/A
Together AI
5.19
3.77
GMI (FP8)
61.32
47.63
Fireworks
27.11
21.99
MiniMax
46.40
37.32
Novita (FP8)
53.79
43.49
SambaNova
7.46
5.51
Clarifai
7.82
6.30
CoreWeave
25.60
21.15
DeepInfra
125.41
106.77
SiliconFlow
99.01
83.73
Fireworks
9.48
7.59
Microsoft Azure
25.12
20.95
Nebius Fast
8.43
6.05
Amazon Bedrock
32.90
27.45
Nebius
11.78
9.10
Kimi
81.94
68.98
Novita
88.61
74.94
OpenAI
14.14
N/A
Microsoft Azure
10.55
N/A
Microsoft Azure
Claude Opus 4.6 (high)
9.14
N/A
Amazon Bedrock
Claude Opus 4.6 (high)
7.89
N/A
Anthropic
Claude Opus 4.6 (high)
10.00
N/A
Google
Claude Opus 4.6 (high)
8.69
N/A
CoreWeave
10.80
8.11
Together AI
16.07
12.76
DeepInfra
26.83
19.16
DeepInfra BF16
11.33
8.87
Nebius
24.73
18.80
Lightning AI
33.47
27.08
Blackbox AI
5.86
4.30
Google (AI Studio)
Gemini 3 Flash (AI Studio) 9.37
N/A
xAI
14.23
N/A
Amazon Bedrock
9.30
N/A
Microsoft Azure
28.01
N/A
DeepInfra (FP4)
DeepSeek V4 Flash (High) (FP4) 86.97
61.34
GMI
18.79
11.21
Makora
7.40
4.93
Parasail (FP8)
DeepSeek V4 Flash (High) (FP8) 56.58
39.86
SiliconFlow (FP8)
DeepSeek V4 Flash (High) (FP8) 17.79
11.76
CoreWeave
29.45
20.31
Novita
17.64
11.74
DeepSeek
18.84
12.69
Groq
13.04
11.70
SiliconFlow (FP8)
177.63
161.87
DeepInfra FP8
89.26
81.75
Makora FP4
31.04
27.98
Novita
98.52
89.61
Alibaba Cloud
96.69
87.68
xAI
16.42
N/A
Microsoft Azure
13.93
N/A
xAI
16.81
N/A
Xiaomi
30.94
23.46
OpenAI
11.24
N/A
Microsoft Azure
14.05
N/A
Databricks
50.06
N/A
Microsoft Azure
42.67
N/A
OpenAI
53.47
N/A
Amazon Bedrock
9.72
N/A
Microsoft Azure
17.03
N/A
xAI
11.21
N/A
Amazon Bedrock
Claude Sonnet 4.6 (Non-reasoning)
7.03
N/A
Microsoft Azure
Claude Sonnet 4.6 (Non-reasoning)
8.58
N/A
Google
Claude Sonnet 4.6 (Non-reasoning)
7.44
N/A
Anthropic
Claude Sonnet 4.6 (Non-reasoning)
8.11
N/A
Amazon Bedrock
5.25
N/A
Microsoft Azure
11.80
N/A
xAI
7.63
N/A
StreamLake
KAT-Coder-Pro V2
5.15
N/A
Wafer
GLM-5.1
4.91
N/A
Baseten
GLM-5.1
4.55
N/A
Parasail
GLM-5.1
8.05
N/A
DeepInfra (FP4)
GLM-5.1 (FP4)
11.92
N/A
Novita (FP8)
GLM-5.1 (FP8)
11.36
N/A
FriendliAI
GLM-5.1
5.77
N/A
SiliconFlow (FP8)
GLM-5.1 (FP8)
11.70
N/A
Nebius (FP8, Base)
GLM-5.1 (FP8, Base)
14.91
N/A
Xiaomi
29.99
22.56
Google AI Studio
Gemini 3.5 Flash (minimal) AI Studio
3.41
N/A
OpenAI
11.92
N/A
Microsoft Azure
12.79
N/A
Databricks
Claude Opus 4.5
8.09
N/A
Google Vertex
Claude Opus 4.5 Vertex
7.61
N/A
Anthropic
Claude Opus 4.5
7.86
N/A
Amazon Bedrock
Claude Opus 4.5
7.96
N/A
Microsoft Azure
Claude Opus 4.5
7.88
N/A
Anthropic
37.42
N/A
Google Vertex
31.16
N/A
Amazon Bedrock
25.75
N/A
Microsoft Azure
35.18
N/A
Databricks
Kimi K2.6
2.08
N/A
Together AI (FP4)
Kimi K2.6 (FP4)
1.92
N/A
SiliconFlow (FP8)
Kimi K2.6 (FP8)
14.61
N/A
Fireworks
Kimi K2.6
1.48
N/A
Parasail (INT4)
Kimi K2.6 (INT4)
8.18
N/A
DeepInfra (FP4)
Kimi K2.6 (FP4)
18.31
N/A
CoreWeave
Kimi K2.6
9.27
N/A
Microsoft Azure
Kimi K2.6
2.68
N/A
Kimi
Kimi K2.6
15.95
N/A
Makora FP4
Kimi K2.6 FP4
2.99
N/A
Nebius
Kimi K2.6
5.55
N/A
Novita
Kimi K2.6
12.69
N/A
Anthropic
Claude Sonnet 4.6 (Non-reasoning, Low Effort)
8.28
N/A
Baseten
11.62
9.04
DeepInfra (FP4)
69.83
55.38
Amazon Bedrock
24.01
18.41
Google
13.74
10.55
Novita
47.59
36.79
Cerebras
1.93
1.38
CoreWeave
23.94
18.51
SiliconFlow (FP8)
85.38
67.10
Alibaba Cloud
28.94
22.08
GMI (FP8)
30.40
22.24
DeepInfra (FP8)
55.44
44.15
Novita
35.88
27.59
Microsoft Azure
30.16
N/A
OpenAI
30.51
N/A
Anthropic
23.57
N/A
Google Vertex
25.38
N/A
Amazon Bedrock
45.70
N/A
Microsoft Azure
21.88
N/A
Wafer
71.32
60.41
Together AI
36.60
13.79
Parasail
42.20
35.89
SiliconFlow (FP8)
37.67
31.33
DeepInfra (FP8)
67.55
57.95
DigitalOcean
216.65
186.74
GMI (FP8)
103.68
79.41
Nebius (Base, FP4)
Qwen3.5 397B A17B (Base, FP4) 30.30
25.19
Eigen AI
14.10
11.57
Alibaba Cloud
73.69
62.26
Novita
71.15
60.38
Novita
20.28
15.17
Fireworks
20.03
15.62
SiliconFlow (FP8)
47.25
36.82
CoreWeave
29.57
23.03
MiniMax
15.18
8.26
Parasail (FP8)
18.24
14.24
Eigen AI
11.53
8.70
GMI FP8
21.86
11.13
Lightning AI
18.71
14.72
DeepInfra (FP8)
50.66
40.11
Nebius (FP4)
28.06
21.61
DigitalOcean
67.12
53.32
FriendliAI
16.12
12.47
GMI
27.05
18.97
SiliconFlow
16.25
11.38
Nebius
28.47
21.85
Amazon Bedrock
43.96
34.43
Nebius Fast
18.16
13.25
DigitalOcean
30.44
23.47
Novita
82.44
64.58
Eigen AI
16.84
12.95
SiliconFlow (FP8)
123.25
97.06
Microsoft Azure
122.23
N/A
Xiaomi
27.69
20.59
OpenAI
58.62
N/A
Microsoft Azure
48.01
N/A
Databricks
53.06
N/A
OpenAI
GPT-5.5 (Non-reasoning)
6.32
N/A
Amazon Bedrock
GPT-5.5 (Non-reasoning)
6.18
N/A
Amazon Bedrock
19.88
15.20
Microsoft Azure
19.48
15.09
Google Vertex
9.19
7.03
Novita
39.76
30.76
OpenAI
110.22
N/A
Microsoft Azure
187.40
N/A
Novita (FP8)
GLM-5 (FP8)
11.09
N/A
Nebius (FP4)
GLM-5 (FP4)
7.37
N/A
DeepInfra (FP8)
GLM-5 (FP8)
20.79
N/A
Baseten
GLM-5
5.54
N/A
SiliconFlow (FP8)
43.38
33.60
DeepInfra (FP8)
36.59
28.86
Alibaba Cloud
17.67
13.29
GMI (FP8)
27.30
19.83
Novita
25.18
19.48
Wafer
Qwen3.5 397B A17B
62.78
N/A
Together AI
Qwen3.5 397B A17B
2.95
N/A
DeepInfra (FP8)
Qwen3.5 397B A17B (FP8)
9.79
N/A
Eigen AI
Qwen3.5 397B A17B
2.38
N/A
Alibaba Cloud
Qwen3.5 397B A17B
11.49
N/A
Nebius Fast
Qwen3.5 397B A17B Fast
2.97
N/A
Nebius (Base, FP4)
Qwen3.5 397B A17B (Base, FP4)
4.77
N/A
DigitalOcean
Qwen3.5 397B A17B
167.99
N/A
Novita
Qwen3.5 397B A17B
11.00
N/A
Alibaba Cloud
34.26
30.25
SiliconFlow (FP8)
75.64
67.91
Scaleway
29.28
26.19
DeepInfra (FP8)
81.34
74.19
Makora FP4
14.76
13.10
Parasail
69.40
63.02
Novita
37.39
33.44
MiniMax
13.58
7.17
Novita
20.96
15.92
Microsoft Azure
DeepSeek V4 Pro
5.99
N/A
Nebius
DeepSeek V4 Pro
9.64
N/A
Makora
DeepSeek V4 Pro
3.34
N/A
Lightning AI
DeepSeek V4 Pro
3.20
N/A
DeepSeek
DeepSeek V4 Pro
5.30
N/A
Xiaomi
28.59
21.50
OpenAI
14.45
N/A
Microsoft Azure
16.97
N/A
Google Vertex
17.44
N/A
Microsoft Azure
17.57
N/A
OpenAI
17.49
N/A
Google Vertex
25.52
N/A
Alibaba Cloud
Qwen3.5 Omni Plus
10.91
N/A
OpenAI
GPT-5.1 Codex mini (high) 6.27
N/A
Microsoft Azure
GPT-5.1 Codex mini (high) 16.90
N/A
InclusionAI
19.16
13.90
Microsoft Azure
18.27
N/A
OpenAI
6.27
N/A
OpenAI
6.29
N/A
Mistral
15.98
12.30
Microsoft Azure
6.60
N/A
OpenAI
7.33
N/A
StepFun
7.18
5.15
Amazon Bedrock
27.43
N/A
Google Vertex
22.13
N/A
Anthropic
22.92
N/A
Microsoft Azure
27.48
N/A
Novita
Kimi K2.5
14.42
N/A
Fireworks
Kimi K2.5
5.48
N/A
Nebius
Kimi K2.5
2.57
N/A
GMI
Kimi K2.5
17.26
N/A
Kimi
Kimi K2.5
12.70
N/A
Microsoft Azure
Kimi K2.5
4.43
N/A
CoreWeave
47.71
36.64
Parasail
66.38
50.94
DeepInfra
76.33
58.96
SiliconFlow (FP8)
31.50
23.35
SambaNova
13.84
9.53
FriendliAI
38.42
29.34
GMI (FP8)
109.63
81.65
Lightning AI
8.77
6.54
Together AI
37.98
29.12
Novita
55.92
42.55
Google (AI Studio)
69.81
53.47
CoreWeave
Qwen3.5 27B
5.31
N/A
Alibaba Cloud
Qwen3.5 27B
6.40
N/A
DeepInfra FP8
Qwen3.5 27B FP8
11.66
N/A
Cohere
12.09
9.56
Groq
Qwen3.6 27B
1.37
N/A
Makora FP4
Qwen3.6 27B FP4
2.82
N/A
DeepInfra FP8
Qwen3.6 27B FP8
7.39
N/A
Alibaba Cloud
Qwen3.6 27B
8.54
N/A
Novita
Qwen3.6 27B
79.96
N/A
Anthropic
Claude 4.5 Sonnet
7.96
N/A
Google Vertex
Claude 4.5 Sonnet Vertex
7.87
N/A
Microsoft Azure
Claude 4.5 Sonnet
8.38
N/A
Databricks
Claude 4.5 Sonnet
7.87
N/A
Amazon Bedrock
Claude 4.5 Sonnet
7.35
N/A
SiliconFlow (FP8)
19.37
14.44
Alibaba Cloud
15.22
11.32
GMI (FP8)
18.15
12.21
DeepInfra (FP8)
24.28
19.14
Novita
19.54
14.86
DeepSeek
DeepSeek V4 Flash
4.94
N/A
GMI
DeepSeek V4 Flash
9.76
N/A
Makora
DeepSeek V4 Flash
2.26
N/A
CoreWeave
DeepSeek V4 Flash
7.11
N/A
Amazon Bedrock
20.17
15.40
MiniMax
21.01
15.83
Google Vertex
14.29
11.20
Novita
20.52
15.53
Novita
KAT-Coder-Pro V1
5.31
N/A
Amazon Bedrock
Claude 4.1 Opus
28.02
N/A
Anthropic
Claude 4.1 Opus
12.04
N/A
Google Vertex
Claude 4.1 Opus Vertex
11.46
N/A
Microsoft Azure
Claude 4.1 Opus
12.30
N/A
Databricks
Claude 4.1 Opus
28.05
N/A
DeepInfra FP8
Qwen3.5 122B A10B FP8
8.49
N/A
Alibaba Cloud
Qwen3.5 122B A10B
4.33
N/A
GMI
MiMo-V2.5-Pro
13.62
N/A
DeepInfra
MiMo-V2.5-Pro
24.00
N/A
Xiaomi
MiMo-V2.5-Pro
9.33
N/A
Novita
MiMo-V2.5-Pro
8.67
N/A
Amazon Bedrock
GPT-5.4 (Non-reasoning)
11.81
N/A
OpenAI
GPT-5.4 (Non-reasoning)
3.95
N/A
Google (AI Studio)
Gemini 3 Flash (AI Studio)
33.48
N/A
Google Vertex
37.63
N/A
Google (AI Studio)
Gemini 2.5 Pro (AI Studio) 32.23
N/A
Baseten
GLM-4.7
2.80
N/A
Amazon Bedrock
GLM-4.7
5.05
N/A
DeepInfra (FP4)
GLM-4.7 (FP4)
12.49
N/A
Novita
GLM-4.7
78.10
N/A
Cerebras
GLM-4.7
0.59
N/A
Google
GLM-4.7
2.84
N/A
Novita (FP8)
DeepSeek V3.1 Terminus (FP8) 86.63
67.94
GMI
Hy3-preview
8.38
N/A
SiliconFlow
Hy3-preview
5.44
N/A
OpenAI
GPT-5.2
7.35
N/A
Microsoft Azure
GPT-5.2
6.28
N/A
StepFun
13.83
10.41
Cloudflare
29.28
23.10
DeepInfra
120.77
96.30
Parasail
38.75
30.54
Novita
80.76
63.19
Google AI Studio
Gemma 4 26B A4B AI Studio 59.52
46.30
GMI (FP8)
142.72
107.33
Microsoft Azure
14.76
N/A
OpenAI
13.52
N/A
StepFun
13.84
10.37
SiliconFlow (FP8)
47.81
36.53
Google Vertex
Claude 4 Opus Vertex
11.44
N/A
Amazon Bedrock
Claude 4 Sonnet
6.21
N/A
Google Vertex
Claude 4 Sonnet Vertex
7.98
N/A
Databricks
Claude 4 Sonnet
7.91
N/A
Novita (FP8)
83.42
65.59
CoreWeave
18.15
13.85
Nebius
7.24
4.93
Inception
2.41
N/A
DeepInfra (FP4)
39.67
31.41
Novita
47.33
36.54
Google (AI Studio)
Gemini 3.1 Flash-Lite (AI Studio) 7.77
N/A
Alibaba Cloud
Qwen3 Max Thinking (Preview) 47.49
36.50
SiliconFlow (FP8)
64.29
50.24
Together AI (FP8)
38.76
30.73
Parasail
Gemma 4 31B
13.34
N/A
FriendliAI
Gemma 4 31B
5.62
N/A
Together AI (FP8)
Gemma 4 31B (FP8)
11.49
N/A
SiliconFlow (FP8)
Gemma 4 31B (FP8)
10.02
N/A
DeepInfra (FP8)
Gemma 4 31B (FP8)
17.56
N/A
SambaNova
Gemma 4 31B
4.22
N/A
Novita
Gemma 4 31B
20.27
N/A
Amazon Bedrock
Grok 4.3 (Non-reasoning)
2.82
N/A
xAI
Grok 4.3 (Non-reasoning)
3.53
N/A
GMI
DeepSeek V3.2
12.51
N/A
Nebius Fast
DeepSeek V3.2 Fast
4.42
N/A
SambaNova
DeepSeek V3.2
2.53
N/A
Eigen AI
DeepSeek V3.2
3.68
N/A
Microsoft Azure
DeepSeek V3.2
2.68
N/A
Nebius
DeepSeek V3.2
6.73
N/A
Amazon Bedrock
DeepSeek V3.2
10.73
N/A
DeepInfra
DeepSeek V3.2
38.33
N/A
DigitalOcean
DeepSeek V3.2
7.43
N/A
Novita
DeepSeek V3.2
12.83
N/A
FriendliAI
DeepSeek V3.2
6.15
N/A
SiliconFlow (FP8)
DeepSeek V3.2 (FP8)
27.14
N/A
Parasail (FP8)
Trinity Large Thinking (FP8) 18.53
14.38
Arcee AI
8.88
6.53
Scaleway
Qwen3.6 35B A3B
3.06
N/A
Makora FP4
Qwen3.6 35B A3B FP4
1.55
N/A
DeepInfra (FP8)
Qwen3.6 35B A3B (FP8)
9.67
N/A
Novita
Qwen3.6 35B A3B
3.82
N/A
Parasail (FP8)
Qwen3.6 35B A3B (FP8)
5.71
N/A
Alibaba Cloud
Qwen3.6 35B A3B
4.15
N/A
Alibaba Cloud
Qwen3 Max
9.35
N/A
Novita
Qwen3 Max
179.39
N/A
Groq
5.57
4.22
Scaleway
14.21
10.95
Cloudflare
20.14
15.76
DeepInfra (Turbo)
gpt-oss-120b (high) (Turbo) 10.32
8.02
Makora
5.66
4.22
Cerebras
1.46
0.94
Google Vertex
gpt-oss-120b (high) Vertex 5.23
3.93
Eigen AI
3.57
2.52
Baseten
10.45
7.99
SambaNova
3.83
2.70
Parasail
12.67
9.77
Microsoft Azure
4.34
3.09
CoreWeave
27.31
21.19
Fireworks
5.63
2.24
Nebius Fast
4.48
2.52
Nebius Base
8.14
6.07
DeepInfra
54.57
43.14
Databricks
7.59
5.62
Amazon Bedrock
13.52
10.11
Together AI
3.91
2.62
Novita
19.94
15.53
Amazon Bedrock
Claude 4.5 Haiku
4.19
N/A
Anthropic
Claude 4.5 Haiku
4.16
N/A
Microsoft Azure
Claude 4.5 Haiku
4.16
N/A
Google Vertex
Claude 4.5 Haiku Vertex
3.68
N/A
Novita
Kimi K2 0905
20.98
N/A
OpenAI
22.40
N/A
Microsoft Azure
28.16
N/A
Alibaba Cloud
Qwen3.5 35B A3B
3.64
N/A
DeepInfra FP8
Qwen3.5 35B A3B FP8
3.79
N/A
Xiaomi
MiMo-V2-Flash
6.84
N/A
Novita
GLM-4.6
9.99
N/A
DeepInfra
45.59
36.09
Amazon Bedrock
10.24
7.48
Novita
31.80
24.25
xAI
Grok 3 mini Reasoning (high) 39.24
30.95
xAI Fast
Grok 3 mini Reasoning (high) Fast 47.35
37.38
xAI
Grok 4.20 0309
2.70
N/A
CoreWeave
38.30
30.00
Eigen AI (FP8)
Qwen3 235B A22B 2507 (FP8) 15.33
11.82
DeepInfra (FP8)
Qwen3 235B A22B 2507 (FP8) 56.73
45.15
Nebius Fast
Qwen3 235B A22B 2507 Fast 15.86
11.65
Novita
40.01
31.08
Alibaba Cloud
37.44
28.94
CompactifAI
7.00
5.15
SiliconFlow
16.78
12.33
xAI
Grok 4.20 0309 v2
2.77
N/A
Amazon Bedrock
Nova 2.0 Pro Preview (medium) 33.66
13.67
DeepInfra (FP4)
DeepSeek V3.1 Terminus (FP4)
5.90
N/A
Novita (FP8)
DeepSeek V3.1 Terminus (FP8)
16.31
N/A
Novita (FP8)
DeepSeek V3.2 Exp (FP8)
17.21
N/A
Parasail (FP8)
Qwen3 Coder Next (FP8)
10.97
N/A
Novita (FP8)
Qwen3 Coder Next (FP8)
3.38
N/A
Amazon Bedrock
Qwen3 Coder Next
12.51
N/A
Amazon Bedrock
DeepSeek V3.1
3.34
N/A
CoreWeave
DeepSeek V3.1
10.71
N/A
Novita
DeepSeek V3.1
18.50
N/A
DeepInfra (FP4)
DeepSeek V3.1 (FP4)
48.87
N/A
Google Vertex
DeepSeek V3.1 Vertex
2.58
N/A
SambaNova
DeepSeek V3.1
2.17
N/A
Baseten (FP8)
DeepSeek V3.1 (FP8)
3.95
N/A
Mistral
13.85
10.63
Amazon Bedrock
13.70
10.19
Novita
80.53
62.84
Google Vertex
13.99
10.65
Alibaba Cloud
42.83
33.23
Novita
55.61
43.58
Cohere
15.00
11.85
Amazon Bedrock
53.60
15.21
OpenAI
GPT-5.1
5.02
N/A
Microsoft Azure
GPT-5.1
4.06
N/A
Mistral
64.30
50.99
Parasail
Gemma 4 26B A4B
9.25
N/A
GMI (FP8)
Gemma 4 26B A4B (FP8)
20.26
N/A
SiliconFlow (FP8)
Gemma 4 26B A4B (FP8)
7.14
N/A
Scaleway
Gemma 4 26B A4B
3.41
N/A
DeepInfra (FP8)
Gemma 4 26B A4B (FP8)
23.88
N/A
Novita
Gemma 4 26B A4B
19.74
N/A
DeepInfra (FP8)
60.27
47.93
Microsoft Azure
25.89
20.14
Google Vertex
14.65
11.45
Novita
84.25
66.03
DeepInfra
66.55
52.87
Google (AI Studio)
Gemini 2.5 Flash (AI Studio) 14.32
N/A
Google (Vertex)
Gemini 2.5 Flash (Vertex) 25.59
N/A
Databricks
51.93
N/A
OpenAI
58.48
N/A
Microsoft Azure
57.25
N/A
Alibaba Cloud
14.45
10.73
Nebius (FP8)
23.82
18.49
Eigen AI
7.88
5.87
Nebius Fast
8.68
5.94
Google Vertex
Qwen3 Next 80B A3B Vertex 17.77
13.79
Novita
13.80
10.22
Amazon Bedrock
Nova 2.0 Pro Preview (low) 20.39
13.59
Novita
43.75
34.10
Novita
Kimi K2
19.78
N/A
Microsoft Azure
GPT-4.1
4.18
N/A
OpenAI
GPT-4.1
5.35
N/A
Alibaba Cloud
Qwen3 Max (Preview)
9.45
N/A
Amazon Bedrock
68.25
14.21
OpenAI
25.73
N/A
Microsoft Azure
27.34
N/A
Alibaba Cloud
Qwen3.5 Omni Flash
2.79
N/A
OpenAI
7.54
N/A
Microsoft Azure
8.76
N/A
OpenAI
21.43
N/A
Microsoft Azure
21.63
N/A
xAI Fast
Grok 3 Fast
7.43
N/A
SiliconFlow
69.32
54.28
Nebius
Qwen3 235B 2507
4.25
N/A
CoreWeave
Qwen3 235B 2507
6.02
N/A
Amazon Bedrock
Qwen3 235B 2507
7.18
N/A
Together AI (FP8)
Qwen3 235B 2507 (FP8)
16.10
N/A
Parasail
Qwen3 235B 2507
14.26
N/A
DeepInfra
Qwen3 235B 2507
22.43
N/A
Scaleway
Qwen3 235B 2507
8.17
N/A
Alibaba Cloud
Qwen3 235B 2507
6.71
N/A
Google Vertex
Qwen3 235B 2507 Vertex
4.02
N/A
Novita
Qwen3 235B 2507
8.22
N/A
FriendliAI
Qwen3 235B 2507
10.18
N/A
DeepInfra (Turbo, FP4)
Qwen3 Coder 480B (Turbo, FP4)
16.24
N/A
Amazon Bedrock
Qwen3 Coder 480B
3.26
N/A
CoreWeave
Qwen3 Coder 480B
10.78
N/A
Google Vertex
Qwen3 Coder 480B Vertex
3.15
N/A
Novita
Qwen3 Coder 480B
5.40
N/A
Alibaba Cloud
Qwen3 Coder 480B
6.30
N/A
Alibaba Cloud
26.80
20.54
Amazon Bedrock
27.24
11.82
Nebius Base
7.66
5.63
Makora
5.49
4.05
SambaNova
3.78
2.69
Google Vertex
gpt-oss-120b (low) Vertex 4.80
3.57
Fireworks
4.66
3.01
CoreWeave
27.12
21.02
Cerebras
1.65
1.07
Microsoft Azure
4.23
3.01
Cloudflare
19.49
15.27
Baseten
8.95
7.05
Amazon Bedrock
13.19
9.90
Parasail
22.11
17.36
Databricks
7.82
5.78
Together AI
3.65
2.52
Groq
5.54
4.23
Eigen AI
3.66
2.57
Novita
38.77
30.54
Novita
19.30
14.40
OpenAI
GPT-5.4 nano
3.64
N/A
Nebius
26.37
20.55
DeepInfra
35.42
23.70
OpenAI
GPT-5 (minimal)
6.44
N/A
Microsoft Azure
GPT-5 (minimal)
7.08
N/A
Microsoft Azure
15.82
N/A
Novita
37.30
29.17
Microsoft Azure
GPT-5.4 mini
3.27
N/A
OpenAI
GPT-5.4 mini
3.30
N/A
SiliconFlow
30.08
22.84
Amazon Bedrock
Nova 2.0 Pro Preview
3.32
N/A
OpenAI
GPT-4.1 mini
6.05
N/A
Microsoft Azure
GPT-4.1 mini
6.90
N/A
Amazon Bedrock
Mistral Large 3
3.31
N/A
Microsoft Azure
Mistral Large 3
6.27
N/A
Mistral
Mistral Large 3
6.85
N/A
DeepInfra FP8
Qwen3.5 4B FP8
19.91
N/A
Alibaba Cloud
17.75
13.27
DeepInfra
DeepSeek V3 0324
31.74
N/A
Microsoft Azure
DeepSeek V3 0324
5.30
N/A
Replicate
DeepSeek V3 0324
18.84
N/A
Novita
DeepSeek V3 0324
13.72
N/A
Amazon Bedrock
GLM-4.7-Flash
2.61
N/A
Novita
GLM-4.7-Flash
6.75
N/A
Mistral
Devstral 2
12.35
N/A
OpenAI
GPT-5 (ChatGPT)
2.81
N/A
Nebius (FP8)
Nemotron 3 Nano Omni 30B A3B Reasoning (FP8) 8.73
6.51
DeepInfra
46.54
36.92
Together AI
3.31
2.33
Lightning AI
9.33
7.25
CoreWeave
13.72
10.40
Cloudflare
14.60
11.42
Novita
18.32
14.17
Google Vertex
gpt-oss-20B (high) Vertex 33.01
26.10
Amazon Bedrock
21.69
6.65
Databricks
8.20
6.17
Groq
3.13
2.33
Clarifai
11.24
8.76
Mistral
Mistral Medium 3.1
4.74
N/A
Clarifai
11.03
8.63
Together AI
3.10
2.25
Lightning AI
9.39
7.27
CoreWeave
12.44
9.35
Cloudflare
14.20
11.16
Amazon Bedrock
25.26
13.15
Novita
19.02
14.66
Databricks
9.40
7.13
Groq
3.07
2.26
Google Vertex
27.17
21.52
Alibaba Cloud
Qwen3 VL 235B A22B
10.30
N/A
Parasail (FP8)
Qwen3 VL 235B A22B (FP8)
19.40
N/A
Novita
Qwen3 VL 235B A22B
14.28
N/A
Microsoft Azure (FP8)
Llama 4 Maverick (FP8)
2.62
N/A
Amazon Bedrock
Llama 4 Maverick
2.23
N/A
Snowflake
Llama 4 Maverick
4.59
N/A
DeepInfra (FP8)
Llama 4 Maverick (FP8)
10.32
N/A
Databricks
Llama 4 Maverick
6.45
N/A
Novita (FP8)
Llama 4 Maverick (FP8)
13.83
N/A
Parasail (FP8)
Llama 4 Maverick (FP8)
5.52
N/A
OpenAI
GPT-5 mini (minimal)
5.73
N/A
Google (Vertex)
Gemini 2.5 Flash (Vertex)
3.20
N/A
Google (AI Studio)
Gemini 2.5 Flash (AI Studio)
2.86
N/A
DeepInfra
Qwen3 Next 80B A3B
2.90
N/A
Parasail
Qwen3 Next 80B A3B
6.18
N/A
Novita
Qwen3 Next 80B A3B
4.80
N/A
Alibaba Cloud
Qwen3 Next 80B A3B
3.25
N/A
Google Vertex
Qwen3 Next 80B A3B Vertex
2.64
N/A
Alibaba Cloud
Qwen3 Coder 30B A3B
5.81
N/A
Amazon Bedrock
Qwen3 Coder 30B A3B
2.11
N/A
Scaleway
Qwen3 Coder 30B A3B
3.33
N/A
Alibaba Cloud
34.83
26.96
Cloudflare
94.46
78.32
Alibaba Cloud
21.42
16.22
Novita
23.32
17.73
SiliconFlow
Gemma 4 12B (Non-reasoning)
4.42
N/A
Mistral
Devstral Small 2
9.14
N/A
Amazon Bedrock
Nova Premier
7.07
N/A
Microsoft Azure
28.00
22.37
Amazon Bedrock
16.70
13.19
Novita Turbo
95.31
77.20
Novita
95.63
77.46
Mistral
Mistral Medium 3
10.72
N/A
Microsoft Azure
Mistral Medium 3
10.90
N/A
DeepInfra
Llama Nemotron Super 49B v1.5 49.30
39.25
Google Vertex
Claude 3.5 Haiku Vertex
9.15
N/A
Mistral
Devstral Medium
9.62
N/A
Mistral
Mistral Small 4
2.99
N/A
Amazon Bedrock
33.16
25.84
Mistral
22.92
17.99
Sarvam
27.36
20.92
Amazon Bedrock
Nova 2.0 Lite
2.62
N/A
Google (AI Studio)
Gemini 2.5 Flash-Lite (AI Studio) 60.75
N/A
OpenAI
GPT-4o (Nov)
2.83
N/A
Microsoft Azure
GPT-4o (Nov)
8.57
N/A
Alibaba Cloud
Qwen3 VL 32B
7.55
N/A
Novita
GLM-4.6V
9.16
N/A
Alibaba Cloud
Qwen3 235B
7.06
N/A
Novita (FP8)
Qwen3 235B (FP8)
30.35
N/A
Alibaba Cloud
19.82
14.96
Nebius Base
65.42
51.83
DeepInfra (FP8)
44.02
35.01
Alibaba Cloud
40.40
31.50
Groq
5.55
4.20
Novita
DeepSeek V3 (Dec)
13.80
N/A
DeepInfra
DeepSeek V3 (Dec)
20.47
N/A
Novita Turbo
DeepSeek V3 (Dec) Turbo
13.31
N/A
DeepInfra (FP8)
96.48
76.84
DeepInfra (FP8)
41.39
32.94
Alibaba Cloud
38.65
30.14
CompactifAI
Llama 4 Scout
5.91
N/A
Microsoft Azure
Llama 4 Scout
4.60
N/A
Cloudflare
Llama 4 Scout
11.32
N/A
Amazon Bedrock
Llama 4 Scout
2.46
N/A
Novita
Llama 4 Scout
10.38
N/A
Google Vertex
Llama 4 Scout Vertex
3.66
N/A
Groq
Llama 4 Scout
1.79
N/A
DeepInfra
Llama 4 Scout
10.22
N/A
Novita
Qwen3 VL 30B A3B
5.46
N/A
Alibaba Cloud
Qwen3 VL 30B A3B
4.98
N/A
Nebius (FP8)
31.32
24.58
Mistral
Ministral 3 14B
5.47
N/A
Amazon Bedrock
Ministral 3 14B
2.81
N/A
DeepInfra
DeepSeek R1 Distill Llama 70B 54.26
43.00
SiliconFlow
Ling-flash-2.0
9.14
N/A
Alibaba Cloud
25.40
19.58
Microsoft Azure
GPT-4o (Aug)
3.56
N/A
OpenAI
GPT-4o (Aug)
4.90
N/A
SiliconFlow (FP8)
Qwen2.5 72B (FP8)
19.07
N/A
DeepInfra
Qwen2.5 72B
19.25
N/A
DeepInfra (FP8)
23.80
18.87
Alibaba Cloud
22.96
17.56
Mistral
Devstral Small
8.79
N/A
Novita
99.98
78.97
Mistral
Mistral Large 2 (Nov)
6.99
N/A
DeepInfra (FP8)
Mistral Small 3.2 (FP8)
15.43
N/A
Mistral
Mistral Small 3.2
3.96
N/A
Nebius Base
Llama Nemotron Ultra Base 48.02
37.91
Nebius
Qwen3 30B A3B 2507
7.85
N/A
Alibaba Cloud
Qwen3 30B A3B 2507
3.62
N/A
CoreWeave
Qwen3 30B A3B 2507
5.39
N/A
Nebius (FP8)
61.81
48.87
DeepInfra (FP8)
NVIDIA Nemotron Nano 12B v2 VL (FP8) 9.50
7.43
Amazon Bedrock
Ministral 3 8B
2.58
N/A
Mistral
Ministral 3 8B
5.98
N/A
DeepInfra
NVIDIA Nemotron Nano 9B V2 38.63
27.07
Nebius (FP8)
Hermes 4 405B (FP8)
11.96
N/A
DeepInfra FP8
Qwen3.5 2B FP8
15.00
N/A
DeepInfra
Llama Nemotron Super 49B v1.5
9.76
N/A
Nebius Base
Qwen3 32B Base
13.22
N/A
Alibaba Cloud
Qwen3 32B
8.73
N/A
Groq
Qwen3 32B
1.18
N/A
Amazon Bedrock
Qwen3 32B
2.29
N/A
DeepInfra (FP8)
Qwen3 32B (FP8)
8.57
N/A
Microsoft Azure
GPT-4o (May)
3.80
N/A
OpenAI
GPT-4o (May)
4.25
N/A
Parasail (FP8)
Llama 3.3 70B (FP8)
5.40
N/A
SambaNova
Llama 3.3 70B
1.78
N/A
Snowflake Snowflake
Llama 3.3 70B Snowflake
3.03
N/A
Nebius Base
Llama 3.3 70B Base
18.24
N/A
FriendliAI
Llama 3.3 70B
3.20
N/A
Hyperbolic
Llama 3.3 70B
5.01
N/A
Makora FP8
Llama 3.3 70B FP8
1.59
N/A
CoreWeave
Llama 3.3 70B
5.88
N/A
CompactifAI
Llama 3.3 70B
2.21
N/A
Together AI Turbo
Llama 3.3 70B Turbo
5.31
N/A
Microsoft Azure
Llama 3.3 70B
3.76
N/A
DeepInfra (Turbo, FP8)
Llama 3.3 70B (Turbo, FP8)
35.77
N/A
Cloudflare
Llama 3.3 70B
6.83
N/A
Amazon Bedrock
Llama 3.3 70B
3.55
N/A
Groq
Llama 3.3 70B
1.44
N/A
Google Vertex
Llama 3.3 70B Vertex
2.68
N/A
Databricks
Llama 3.3 70B
7.47
N/A
Scaleway
Llama 3.3 70B
6.51
N/A
Novita
Llama 3.3 70B
12.19
N/A
Cloudflare
Mistral Small 3.1
12.23
N/A
DeepInfra
Mistral Small 3.1
8.76
N/A
CompactifAI
Mistral Small 3.1
7.11
N/A
Mistral
Mistral Small 3.1
2.95
N/A
Amazon Bedrock Latency Optimized
Llama 3.1 405B Latency Optimized
5.66
N/A
Microsoft Azure
Llama 3.1 405B
17.09
N/A
Alibaba Cloud
Qwen3 VL 8B
4.29
N/A
Liquid AI
12.12
8.61
Mistral
Pixtral Large
7.02
N/A
OpenAI
GPT-5 nano (minimal)
4.44
N/A
Microsoft Azure
GPT-4 Turbo
3.63
N/A
OpenAI
GPT-4 Turbo
14.63
N/A
Amazon Bedrock
Nova Pro
2.95
N/A
Microsoft Azure
Command A
14.21
N/A
Cohere
Command A
6.53
N/A
DeepInfra
Llama 3.1 Nemotron 70B
4.00
N/A
Eigen AI
8.77
6.69
Alibaba Cloud
42.75
33.08
DeepInfra
NVIDIA Nemotron 3 Nano
5.92
N/A
DeepInfra
NVIDIA Nemotron Nano 9B V2
11.16
N/A
Amazon Bedrock
NVIDIA Nemotron Nano 9B V2
3.67
N/A
OpenAI
GPT-4.1 nano
4.03
N/A
Microsoft Azure
GPT-4.1 nano
3.04
N/A
Amazon Bedrock
Mistral Large 2 (Jul)
16.81
N/A
Alibaba Cloud
Qwen3 14B
8.54
N/A
DeepInfra (FP8)
Qwen3 14B (FP8)
8.80
N/A
OpenAI
GPT-4
14.51
N/A
Novita
GLM-4.5V
33.16
N/A
DeepInfra
Mistral Small 3
10.95
N/A
Mistral
Mistral Small 3
2.99
N/A
Google (AI Studio)
Gemini 2.5 Flash-Lite (AI Studio)
2.66
N/A
Amazon Bedrock
Nova Lite
2.81
N/A
OpenAI
GPT-4o mini
7.00
N/A
Microsoft Azure
GPT-4o mini
6.14
N/A
Nebius (FP8)
Hermes 4 70B (FP8)
6.51
N/A
DeepInfra (FP8)
Qwen3 30B (FP8)
5.68
N/A
Alibaba Cloud
Qwen3 30B
5.44
N/A
Amazon Bedrock Standard
Llama 3.1 70B Standard
16.45
N/A
Amazon Bedrock Latency Optimized
Llama 3.1 70B Latency Optimized
3.29
N/A
DeepInfra (Turbo, FP8)
Llama 3.1 70B (Turbo, FP8)
16.17
N/A
DeepInfra
Llama 3.1 70B
14.76
N/A
CoreWeave
Granite 4.1 8B
4.12
N/A
Sarvam
12.60
9.16
Alibaba Cloud
Qwen2.5 Turbo
5.30
N/A
Reka AI
Reka Flash
7.35
N/A
Amazon Bedrock
Llama 3.2 90B (Vision)
8.66
N/A
Microsoft Azure
Llama 3.2 90B (Vision)
12.58
N/A
Upstage
Solar Mini
7.79
N/A
CoreWeave
Llama 3.1 8B
4.04
N/A
Microsoft Azure
Llama 3.1 8B
3.48
N/A
Amazon Bedrock
Llama 3.1 8B
1.73
N/A
Cloudflare
Llama 3.1 8B
2.45
N/A
FriendliAI
Llama 3.1 8B
1.63
N/A
CompactifAI
Llama 3.1 8B
1.67
N/A
DeepInfra (Turbo, FP8)
Llama 3.1 8B (Turbo, FP8)
11.41
N/A
DeepInfra
Llama 3.1 8B
12.30
N/A
Databricks
Llama 3.1 8B
2.69
N/A
Groq
Llama 3.1 8B
0.92
N/A
Scaleway
Llama 3.1 8B
3.92
N/A
Novita
Llama 3.1 8B
3.99
N/A
Mistral
Ministral 3 3B
3.16
N/A
Amazon Bedrock
Ministral 3 3B
1.73
N/A
AI21 Labs
Jamba 1.7 Large
9.75
N/A
Replicate
Granite 4.0 H Small
10.21
N/A
Amazon Bedrock
Jamba 1.5 Large
12.38
N/A
Alibaba Cloud
Qwen3 Omni 30B A3B
5.52
N/A
DeepInfra
Hermes 3 - Llama-3.1 70B
15.53
N/A
Alibaba Cloud
Qwen3 8B
8.65
N/A
Eigen AI
Qwen3 8B
1.97
N/A
AI21 Labs
Jamba 1.6 Large
9.57
N/A
DeepInfra (FP8)
51.79
40.97
Together AI
LFM2 24B A2B
4.81
N/A
Microsoft Azure
Phi-4
12.34
N/A
DeepInfra
Phi-4
6.98
N/A
DeepInfra
Gemma 3 27B
33.57
N/A
Amazon Bedrock
Gemma 3 27B
7.98
N/A
Nebius (FP8)
Gemma 3 27B (FP8)
29.80
N/A
Parasail
Gemma 3 27B
8.29
N/A
Novita
Gemma 3 27B
27.39
N/A
Amazon Bedrock
Nova Micro
2.13
N/A
Mistral
Mistral Small (Sep)
3.08
N/A
Amazon Bedrock
NVIDIA Nemotron Nano 12B v2 VL
3.09
N/A
DeepInfra (FP8)
NVIDIA Nemotron Nano 12B v2 VL (FP8)
1.81
N/A
Microsoft Azure
Phi-4 Multimodal
32.21
N/A
DeepInfra FP8
Qwen3.5 0.8B FP8
13.26
N/A
Amazon Bedrock
Mistral Large (Feb)
12.69
N/A
Replicate
Llama 2 Chat 7B
11.48
N/A
Amazon Bedrock
Llama 3.2 3B
10.08
N/A
Reka AI
30.06
22.53
Mistral
Mistral Small (Feb)
2.95
N/A
Mistral
Mistral Medium
3.82
N/A
OpenAI
GPT-3.5 Turbo
3.88
N/A
Replicate
Llama 3 70B
11.17
N/A
Novita
Llama 3 70B
25.45
N/A
Amazon Bedrock
Llama 3 70B
11.12
N/A
DeepInfra
Gemma 3 12B
14.35
N/A
Amazon Bedrock
Gemma 3 12B
5.52
N/A
Databricks
Gemma 3 12B
6.68
N/A
DeepInfra
Llama 3.2 11B (Vision)
10.06
N/A
Amazon Bedrock
Llama 3.2 11B (Vision)
3.42
N/A
Microsoft Azure
Llama 3.2 11B (Vision)
6.11
N/A
CoreWeave
Phi-4 Mini
2.61
N/A
Microsoft Azure
Phi-4 Mini
11.33
N/A
Amazon Bedrock
Command-R+ (Apr)
14.62
N/A
Liquid AI
LFM2 2.6B
2.53
N/A
Liquid AI
LFM2.5-1.2B-Instruct
1.99
N/A
Amazon Bedrock
Jamba 1.5 Mini
10.66
N/A
AI21 Labs
Jamba 1.6 Mini
3.69
N/A
Amazon Bedrock
Mixtral 8x7B
7.39
N/A
Amazon Bedrock
Mistral 7B
6.39
N/A
Mistral
Mistral 7B
4.87
N/A
Amazon Bedrock
Command-R (Mar)
6.71
N/A
Replicate
Granite 3.3 8B
22.68
N/A
Amazon Bedrock
Llama 3 8B
5.39
N/A
Replicate
Llama 3 8B
6.45
N/A
DeepInfra
Llama 3 8B
9.20
N/A
Together AI
Gemma 3n E4B
12.13
N/A
Liquid AI
LFM2 1.2B
2.41
N/A
Amazon Bedrock
Gemma 3 4B
2.90
N/A
DeepInfra
Gemma 3 4B
13.30
N/A
Amazon Bedrock
Llama 3.2 1B
6.28
N/A
Liquid AI
LFM2.5-VL-1.6B
12.09
N/A
Cohere
Tiny Aya Global
4.09
N/A
Together AI
27.24
21.44