logo

Llama 3.1 Instruct 405B: Quality, Performance & Price Analysis

Analysis of Meta's Llama 3.1 Instruct 405B and comparison to other AI models across key metrics including quality, price, performance (tokens per second & time to first token), context window & more. Click on any model to compare API providers for that model. For more details including relating to our methodology, see our FAQs.
For analysis of API providers see
Creator:
Meta
License:
Open
Context window:
128k
Link:

Comparison Summary

Quality:
Llama 3.1 405B is of higher quality compared to average, with a MMLU score of 0.886 and a Quality Index across evaluations of 72.
Price:
Llama 3.1 405B is more expensive compared to average with a price of $4.75 per 1M Tokens (blended 3:1).
Llama 3.1 405B Input token price: $4.50, Output token price: $7.00 per 1M Tokens.
Speed:
Llama 3.1 405B is slower compared to average, with a output speed of 28.4 tokens per second.
Latency:
Llama 3.1 405B has a higher latency compared to average, taking 0.65s to receive the first token (TTFT).
Context Window:
Llama 3.1 405B has a smaller context windows than average, with a context window of 130k tokens.

Highlights

Quality
Artificial Analysis Quality Index; Higher is better
Speed
Output Tokens per Second; Higher is better
Price
USD per 1M Tokens; Lower is better
Parallel Queries:
Prompt Length:
Further details
Model NameFurther analysis
OpenAI logo
OpenAI logoGPT-4o (2024-08-06)
OpenAI logoGPT-4o
OpenAI logoGPT-4o mini
OpenAI logoGPT-4 Turbo
OpenAI logoGPT-3.5 Turbo
OpenAI logoGPT-4
OpenAI logoGPT-3.5 Turbo Instruct
Meta logo
Meta logoLlama 3.1 Instruct 405B
Meta logoLlama 3.1 Instruct 70B
Meta logoLlama 3.1 Instruct 8B
Meta logoLlama 3 Instruct 70B
Meta logoLlama 3 Instruct 8B
Meta logoLlama 2 Chat 70B
Meta logoLlama 2 Chat 13B
Meta logoLlama 2 Chat 7B
Google logo
Google logoGemini 1.5 Pro
Google logoGemini 1.5 Flash
Google logoGemma 2 27B
Google logoGemma 2 9B
Google logoGemma 7B Instruct
Google logoGemini 1.0 Pro
Anthropic logo
Anthropic logoClaude 3.5 Sonnet
Anthropic logoClaude 3 Opus
Anthropic logoClaude 3 Haiku
Anthropic logoClaude 3 Sonnet
Anthropic logoClaude Instant
Anthropic logoClaude 2.0
Anthropic logoClaude 2.1
Mistral logo
Mistral logoMistral Large 2
Mistral logoMixtral 8x22B Instruct
Mistral logoMistral NeMo
Mistral logoMistral Small
Mistral logoMixtral 8x7B Instruct
Mistral logoCodestral-Mamba
Mistral logoMistral Large
Mistral logoMistral 7B Instruct
Mistral logoMistral Medium
Mistral logoCodestral
Cohere logo
Cohere logoCommand-R (03-2024)
Cohere logoCommand-R+ (04-2024)
Cohere logoCommand-R (08-2024)
Cohere logoCommand-R+ (08-2024)
Cohere logoCommand
Cohere logoCommand Light
Perplexity logo
Perplexity logoSonar Large
Perplexity logoSonar Small
Perplexity logoSonar 3.1 Large
Perplexity logoSonar 3.1 Small
Microsoft Azure logo
Microsoft Azure logoPhi-3 Medium Instruct 14B
Databricks logo
Databricks logoDBRX Instruct
Reka AI logo
Reka AI logoReka Core
Reka AI logoReka Flash
Reka AI logoReka Edge
AI21 Labs logo
AI21 Labs logoJamba 1.5 Large
AI21 Labs logoJamba 1.5 Mini
AI21 Labs logoJamba Instruct
DeepSeek logo
DeepSeek logoDeepSeek-V2.5
DeepSeek logoDeepSeek-Coder-V2
DeepSeek logoDeepSeek-V2-Chat
Alibaba logo
Alibaba logoQwen2 Instruct 72B
01.AI logo
01.AI logoYi-Large
OpenChat logo
OpenChat logoOpenChat 3.5 (1210)
Glaive logo
Glaive logoReflection Llama 3.1 70B