German: Comparison of Leading AI Models (Multilingual Reasoning Comparison)

German language comparison and analysis of AI models across key performance metrics, including German reasoning capabilities, cost, output speed, latency, context window, and more. To compare other languages, see our full Multilingual Reasoning Comparison.

Model Comparison Summary

Intelligence: Claude 3.5 Sonnet (Oct) and GPT-4o (Aug '24) are the highest quality models, followed by DeepSeek V3 (Dec '24) and Llama 3.3 70B.
Output Speed (tokens/s): o1 (160 t/s) and Llama 3.3 70B (108 t/s) are the fastest models, followed by Gemini 1.5 Pro (Sep) and GPT-4o (Aug '24).
Latency (seconds): Nova Pro (0.00s) and DeepSeek V3 (Dec '24) (0.00s) are the lowest latency models, followed by Llama 3.3 70B and Gemini 1.5 Pro (Sep).
Price ($ per M tokens): Qwen2.5 72B ($0.00) and GPT-4o mini ($0.26) are the cheapest models, followed by DeepSeek V3 (Dec '24) and Llama 3.3 70B.
Context Window: Gemini 1.5 Pro (Sep) (2M) and Nova Pro (300k) are the largest context window models, followed by o1 and Claude 3.5 Sonnet (Oct).

Highlights

Intelligence (German): Multilingual Index (German); higher is better
Speed: Output Tokens per Second; higher is better
Price: USD per 1M Tokens; lower is better
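
Since prices are quoted in USD per 1M tokens, a rough workload cost can be estimated by scaling the token count. The sketch below is only an illustration: the function name estimate_cost_usd and the 5M-token workload are assumptions, not part of the comparison, and it treats the listed price as a single rate without distinguishing input from output tokens.

```python
def estimate_cost_usd(total_tokens: int, price_per_million_usd: float) -> float:
    """Cost in USD for a token count at a USD-per-1M-tokens price."""
    if total_tokens < 0 or price_per_million_usd < 0:
        raise ValueError("token count and price must be non-negative")
    return total_tokens / 1_000_000 * price_per_million_usd


# GPT-4o mini is listed at $0.26 per 1M tokens in the summary above.
# The 5M-token workload is a made-up figure for illustration.
cost = estimate_cost_usd(5_000_000, 0.26)
print(f"${cost:.2f}")  # prints $1.30
```

The same scaling applies to any model in the comparison; only the per-1M-token price changes.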