LLM API Providers Leaderboard - Comparison of over 100 LLM endpoints

Comparison and ranking of API provider performance for over 100 LLM endpoints across key metrics, including price, performance (throughput and latency), context window, and others. For more details, including our methodology, see our FAQs.

API providers compared: OpenAI, Mistral, Microsoft Azure, Amazon Bedrock, Groq, Together.ai, Anthropic, Perplexity, Google, Fireworks, Baseten, Cohere, Lepton AI, Speechmatics, Deepinfra, Replicate, NVIDIA NGC (Demo), Runpod, Rev AI, DeepSeek, AssemblyAI, fal.ai, Deepgram, Gladia, Stability.ai, Midjourney, Databricks, and OctoAI.

| API provider | Model creator | Model | Context window | Quality index | Price (USD per 1M tokens) | Throughput (tokens/s) | Latency (s) |
|---|---|---|---|---|---|---|---|
| OpenAI | OpenAI | GPT-4o | 128k | 100 | $7.50 | 85.5 | 0.39 |
| OpenAI | OpenAI | GPT-4 Turbo | 128k | 94 | $15.00 | 25.5 | 0.66 |
| Microsoft Azure | OpenAI | GPT-4 Turbo | 128k | 94 | $15.00 | 16.9 | 0.46 |
| OpenAI | OpenAI | GPT-4 | 8k | 83 | $37.50 | 20.5 | 0.64 |
| Microsoft Azure | OpenAI | GPT-4 | 8k | 83 | $37.50 | 14.6 | 0.48 |
| OpenAI | OpenAI | GPT-3.5 Turbo | 16k | 65 | $0.75 | 59.2 | 0.44 |
| Microsoft Azure | OpenAI | GPT-3.5 Turbo | 16k | 65 | $0.75 | 44.3 | 0.31 |
| OpenAI | OpenAI | GPT-3.5 Turbo Instruct | 4k | 60 | $1.63 | 75.3 | 0.34 |
| Microsoft Azure | OpenAI | GPT-3.5 Turbo Instruct | 4k | 60 | $1.63 | 139.2 | 0.64 |
| Google | Google | Gemini 1.5 Flash | 1m | 76 | $0.79 | 149.2 | 0.51 |
| Google | Google | Gemini 1.5 Pro | 1m | 88 | $10.50 | 51.8 | 1.07 |
| Google | Google | Gemini 1.0 Pro | 33k | 62 | $0.75 | 79.8 | 1.37 |
| Fireworks | Google | Gemma 7B | 8k | 57 | $0.20 | 206.7 | 0.27 |
| Deepinfra | Google | Gemma 7B | 8k | 57 | $0.13 | 53.9 | 0.24 |
| Groq | Google | Gemma 7B | 8k | 57 | $0.10 | 772.6 | 0.29 |
| Together.ai | Google | Gemma 7B | 8k | 57 | $0.20 | 146.3 | 0.39 |
| Replicate | Meta | Llama 3 (70B) | 8k | 88 | $1.18 | 24.4 | 0.22 |
| Amazon Bedrock | Meta | Llama 3 (70B) | 8k | 88 | $2.86 | 35.8 | 0.36 |
| OctoAI | Meta | Llama 3 (70B) | 8k | 88 | $0.93 | 36.0 | 0.37 |
| Microsoft Azure | Meta | Llama 3 (70B) | 8k | 88 | $5.67 | 19.4 | 2.29 |
| Fireworks | Meta | Llama 3 (70B) | 8k | 88 | $0.90 | 151.8 | 0.23 |
| Deepinfra | Meta | Llama 3 (70B) | 8k | 88 | $0.64 | 28.6 | 0.32 |
| Groq | Meta | Llama 3 (70B) | 8k | 88 | $0.64 | 304.2 | 0.37 |
| Perplexity | Meta | Llama 3 (70B) | 8k | 88 | $1.00 | 49.6 | 0.25 |
| Together.ai | Meta | Llama 3 (70B) | 8k | 88 | $0.90 | 100.5 | 0.45 |
| Replicate | Meta | Llama 3 (8B) | 8k | 65 | $0.10 | 77.1 | 0.22 |
| Amazon Bedrock | Meta | Llama 3 (8B) | 8k | 65 | $0.45 | 80.3 | 0.30 |
| OctoAI | Meta | Llama 3 (8B) | 8k | 65 | $0.14 | 96.2 | 0.30 |
| Microsoft Azure | Meta | Llama 3 (8B) | 8k | 65 | $0.55 | 80.4 | 0.84 |
| Fireworks | Meta | Llama 3 (8B) | 8k | 65 | $0.20 | 206.5 | 0.29 |
| Deepinfra | Meta | Llama 3 (8B) | 8k | 65 | $0.08 | 104.8 | 0.19 |
| Groq | Meta | Llama 3 (8B) | 8k | 65 | $0.06 | 838.8 | 0.27 |
| Perplexity | Meta | Llama 3 (8B) | 8k | 65 | $0.20 | 121.8 | 0.20 |
| Together.ai | Meta | Llama 3 (8B) | 8k | 65 | $0.20 | 163.7 | 0.44 |
| Fireworks | Meta | Code Llama (70B) | 4k | 58 | $0.90 | 53.5 | 0.29 |
| Deepinfra | Meta | Code Llama (70B) | 4k | 58 | $0.75 | 31.4 | 0.27 |
| Perplexity | Meta | Code Llama (70B) | 16k | 58 | $1.00 | 51.8 | 0.23 |
| Together.ai | Meta | Code Llama (70B) | 4k | 58 | $0.90 | 29.0 | 0.37 |
| Replicate | Meta | Llama 2 Chat (70B) | 4k | 50 | $1.18 | 42.8 | 0.21 |
| Amazon Bedrock | Meta | Llama 2 Chat (70B) | 4k | 50 | $2.10 | 41.3 | 0.40 |
| OctoAI | Meta | Llama 2 Chat (70B) | 4k | 50 | $0.93 | 24.8 | 0.29 |
| Microsoft Azure | Meta | Llama 2 Chat (70B) | 4k | 50 | $1.60 | 16.8 | 2.76 |
| Fireworks | Meta | Llama 2 Chat (70B) | 4k | 50 | $0.90 | 73.0 | 0.32 |
| Deepinfra | Meta | Llama 2 Chat (70B) | 4k | 50 | $0.76 | 70.8 | 0.22 |
| Perplexity | Meta | Llama 2 Chat (70B) | 4k | 50 | $1.00 |  |  |
| Together.ai | Meta | Llama 2 Chat (70B) | 4k | 50 | $0.90 | 35.6 | 0.49 |
| Replicate | Meta | Llama 2 Chat (13B) | 4k | 36 | $0.20 | 74.2 | 0.22 |
| Amazon Bedrock | Meta | Llama 2 Chat (13B) | 4k | 36 | $0.81 | 52.9 | 0.31 |
| OctoAI | Meta | Llama 2 Chat (13B) | 4k | 36 | $0.28 | 50.1 | 0.21 |
| Microsoft Azure | Meta | Llama 2 Chat (13B) | 4k | 36 | $0.84 | 41.8 | 1.22 |
| Fireworks | Meta | Llama 2 Chat (13B) | 4k | 36 | $0.20 | 138.8 | 0.27 |
| Deepinfra | Meta | Llama 2 Chat (13B) | 4k | 36 | $0.35 | 42.3 | 0.23 |
| Together.ai | Meta | Llama 2 Chat (13B) | 4k | 36 | $0.23 | 50.9 | 0.31 |
| Replicate | Meta | Llama 2 Chat (7B) | 4k | 27 | $0.10 | 119.6 | 0.21 |
| Microsoft Azure | Meta | Llama 2 Chat (7B) | 4k | 27 | $0.56 | 68.7 | 0.85 |
| Fireworks | Meta | Llama 2 Chat (7B) | 4k | 27 | $0.20 | 192.4 | 0.26 |
| Deepinfra | Meta | Llama 2 Chat (7B) | 4k | 27 | $0.20 | 37.0 | 0.21 |
| Together.ai | Meta | Llama 2 Chat (7B) | 4k | 27 | $0.20 | 95.6 | 0.32 |
| Mistral | Mistral | Mixtral 8x22B | 65k | 81 | $3.00 | 76.4 | 0.23 |
| OctoAI | Mistral | Mixtral 8x22B | 65k | 81 | $1.20 | 42.4 | 0.32 |
| Fireworks | Mistral | Mixtral 8x22B | 65k | 81 | $1.20 | 82.2 | 0.25 |
| Deepinfra | Mistral | Mixtral 8x22B | 65k | 81 | $0.65 | 39.8 | 0.25 |
| Perplexity | Mistral | Mixtral 8x22B | 16k | 81 | $1.00 | 62.1 | 0.23 |
| Together.ai | Mistral | Mixtral 8x22B | 65k | 81 | $1.20 | 67.4 | 0.67 |
| Mistral | Mistral | Mistral Large | 33k | 75 | $12.00 | 29.9 | 0.24 |
| Amazon Bedrock | Mistral | Mistral Large | 33k | 75 | $12.00 | 31.4 | 0.36 |
| Microsoft Azure | Mistral | Mistral Large | 33k | 75 | $6.00 | 29.3 | 1.87 |
| Mistral | Mistral | Mistral Medium | 33k | 73 | $4.05 | 21.8 | 0.23 |
| Mistral | Mistral | Mistral Small | 33k | 71 | $3.00 | 55.5 | 0.22 |
| Microsoft Azure | Mistral | Mistral Small | 33k | 71 | $1.50 | 67.7 | 1.27 |
| Mistral | Mistral | Mixtral 8x7B | 33k | 65 | $0.70 | 92.8 | 0.22 |
| Replicate | Mistral | Mixtral 8x7B | 33k | 65 | $0.47 | 84.4 | 0.22 |
| Amazon Bedrock | Mistral | Mixtral 8x7B | 33k | 65 | $0.51 | 61.8 | 0.32 |
| OctoAI | Mistral | Mixtral 8x7B | 33k | 65 | $0.35 | 60.2 | 0.26 |
| Lepton AI | Mistral | Mixtral 8x7B | 33k | 65 | $0.50 | 134.5 | 0.32 |
| Fireworks | Mistral | Mixtral 8x7B | 33k | 65 | $0.50 | 237.1 | 0.22 |
| Deepinfra | Mistral | Mixtral 8x7B | 33k | 65 | $0.24 | 58.9 | 0.22 |
| Groq | Mistral | Mixtral 8x7B | 33k | 65 | $0.24 | 476.8 | 0.24 |
| Perplexity | Mistral | Mixtral 8x7B | 16k | 65 | $0.60 | 117.6 | 0.20 |
| Together.ai | Mistral | Mixtral 8x7B | 33k | 65 | $0.60 | 114.1 | 0.43 |
| Mistral | Mistral | Mistral 7B | 33k | 39 | $0.25 | 63.3 | 0.22 |
| Replicate | Mistral | Mistral 7B | 33k | 39 | $0.10 | 71.0 | 0.23 |
| Amazon Bedrock | Mistral | Mistral 7B | 33k | 39 | $0.16 | 71.5 | 0.30 |
| OctoAI | Mistral | Mistral 7B | 33k | 39 | $0.14 | 78.4 | 0.20 |
| Fireworks | Mistral | Mistral 7B | 33k | 39 | $0.20 | 245.2 | 0.18 |
| Deepinfra | Mistral | Mistral 7B | 33k | 39 | $0.07 | 54.0 | 0.23 |
| Perplexity | Mistral | Mistral 7B | 16k | 39 | $0.20 | 104.6 | 0.21 |
| Together.ai | Mistral | Mistral 7B | 8k | 39 | $0.20 | 78.9 | 0.33 |
| Baseten | Mistral | Mistral 7B | 4k | 39 | $0.20 | 219.0 | 0.15 |
| Amazon Bedrock | Anthropic | Claude 3 Opus | 200k | 94 | $30.00 | 28.1 | 0.87 |
| Anthropic | Anthropic | Claude 3 Opus | 200k | 94 | $30.00 | 29.7 | 1.20 |
| Amazon Bedrock | Anthropic | Claude 3 Sonnet | 200k | 78 | $6.00 | 54.3 | 0.55 |
| Google | Anthropic | Claude 3 Sonnet | 200k | 78 | $6.00 |  |  |
| Anthropic | Anthropic | Claude 3 Sonnet | 200k | 78 | $6.00 | 62.4 | 0.64 |
| Amazon Bedrock | Anthropic | Claude 3 Haiku | 200k | 72 | $0.50 | 81.1 | 0.47 |
| Google | Anthropic | Claude 3 Haiku | 200k | 72 | $0.50 |  |  |
| Anthropic | Anthropic | Claude 3 Haiku | 200k | 72 | $0.50 | 149.8 | 0.43 |
| Anthropic | Anthropic | Claude 2.0 | 100k | 69 | $12.00 | 41.4 | 0.43 |
| Amazon Bedrock | Anthropic | Claude 2.1 | 200k | 63 | $12.00 | 40.2 | 0.51 |
| Anthropic | Anthropic | Claude 2.1 | 200k | 63 | $12.00 | 43.9 | 0.36 |
| Amazon Bedrock | Anthropic | Claude Instant | 100k | 63 | $1.20 | 86.2 | 0.34 |
| Anthropic | Anthropic | Claude Instant | 100k | 63 | $1.20 | 95.2 | 0.40 |
| Amazon Bedrock | Cohere | Command Light | 4k |  | $0.38 | 46.4 | 0.33 |
| Cohere | Cohere | Command Light | 4k |  | $0.38 | 81.6 | 0.16 |
| Amazon Bedrock | Cohere | Command | 4k |  | $1.63 | 28.4 | 0.35 |
| Cohere | Cohere | Command | 4k |  | $1.25 | 28.9 | 0.38 |
| Cohere | Cohere | Command-R+ | 128k | 74 | $6.00 | 41.6 | 0.19 |
| Microsoft Azure | Cohere | Command-R+ | 128k | 74 | $6.00 | 61.1 | 1.31 |
| Cohere | Cohere | Command-R | 128k | 62 | $0.75 | 96.8 | 0.16 |
| Microsoft Azure | Cohere | Command-R | 128k | 62 | $0.75 | 61.3 | 1.52 |
| Perplexity | Perplexity | PPLX-70B Online | 4k | 42 | $1.00 | 38.8 | 1.18 |
| Perplexity | Perplexity | PPLX-7B-Online | 4k | 33 | $0.20 | 90.4 | 0.97 |
| Deepinfra | OpenChat | OpenChat 3.5 | 8k | 54 | $0.13 | 62.8 | 0.26 |
| Together.ai | OpenChat | OpenChat 3.5 | 8k | 54 | $0.20 | 89.7 | 0.47 |
| Lepton AI | Databricks | DBRX | 33k | 74 | $0.90 |  |  |
| Fireworks | Databricks | DBRX | 33k | 74 | $1.60 | 57.1 | 0.33 |
| Databricks | Databricks | DBRX | 33k | 74 | $3.38 | 118.5 | 0.59 |
| Together.ai | Databricks | DBRX | 33k | 74 | $1.20 | 81.0 | 0.42 |
| DeepSeek | DeepSeek | DeepSeek-V2 | 128k | 82 | $0.17 | 15.5 | 1.51 |
| Together.ai | Snowflake | Arctic | 4k | 63 | $2.40 | 107.2 | 1.38 |

Key definitions

Quality: Index representing normalized average relative performance across Chatbot Arena, MMLU & MT-Bench.
Context window: Maximum number of combined input & output tokens. Output tokens commonly have a significantly lower limit (varies by model).
Throughput: Tokens per second received while the model is generating tokens (i.e., after the first chunk has been received from the API).
Latency: Time to first token received, in seconds, after the API request is sent.
Price: Price per token, represented as USD per million Tokens. Price is a blend of Input & Output token prices (3:1 ratio).
Output price: Price per token generated by the model (received from the API), represented as USD per million Tokens.
Input price: Price per token included in the request/message sent to the API, represented as USD per million Tokens.
Time period: Metrics are 'live' and are based on the past 14 days of measurements. Measurements are taken 8 times a day for single requests and 2 times per day for parallel requests.
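
The Price, Latency, and Throughput definitions above can be illustrated with a small Python sketch. It is not the leaderboard's own measurement code. It assumes the 3:1 ratio means three parts input price to one part output price, and that throughput counts only tokens arriving after the first chunk, divided by the time elapsed since that first chunk. The names `blended_price`, `stream_metrics`, and `StreamMetrics`, and all the sample numbers, are hypothetical.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices (USD per 1M tokens).

    Assumption: the 3:1 ratio is read as input:output.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


@dataclass
class StreamMetrics:
    latency_s: float        # time to first chunk after the request was sent
    throughput_tps: float   # tokens per second after the first chunk


def stream_metrics(request_sent: float,
                   chunks: Sequence[Tuple[float, int]]) -> StreamMetrics:
    """Compute latency and throughput from one streamed response.

    chunks holds (arrival_time_seconds, token_count) pairs in arrival order.
    """
    if len(chunks) < 2:
        raise ValueError("need at least two chunks to measure throughput")
    first_time = chunks[0][0]
    last_time = chunks[-1][0]
    if last_time <= first_time:
        raise ValueError("chunk timestamps must increase")
    # Tokens in the first chunk are excluded: generation speed is measured
    # only once the first chunk has arrived.
    tokens_after_first = sum(count for _, count in chunks[1:])
    return StreamMetrics(
        latency_s=first_time - request_sent,
        throughput_tps=tokens_after_first / (last_time - first_time),
    )


# Illustrative values only, not taken from the table above.
price = blended_price(input_price=5.00, output_price=15.00)
metrics = stream_metrics(0.0, [(0.5, 1), (1.0, 50), (1.5, 50)])
print(price, metrics.latency_s, metrics.throughput_tps)
```

With these sample inputs the blended price is (3 × 5.00 + 15.00) / 4 = 7.50 USD per million tokens, the latency is 0.5 seconds, and the throughput is 100 tokens per second.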