Articles
GPT-5 Benchmarks and Analysis
Independent benchmarks of OpenAI's new GPT-5 models
7 Aug 2025
Analysis of OpenAI's gpt-oss models
Analysis of OpenAI's new open-weights reasoning models, gpt-oss-120b and gpt-oss-20b
6 Aug 2025
Announcing Artificial Analysis Long Context Reasoning (AA-LCR)
Artificial Analysis' new long context reasoning benchmark
5 Aug 2025
MiniMax M2 Benchmarks & Analysis
Analysis of MiniMax's new 200B parameter reasoning model, M2, and comparisons to other leading models
27 Oct 2025
Independent Performance Analysis of Leading GPUs
Benchmarking of AMD MI300X, NVIDIA H100 and NVIDIA H200
9 Jun 2025
DeepSeek R1 Update
Benchmarking of DeepSeek's updated R1 model released in May 2025
29 May 2025
Overview of Google I/O Benchmarking Results
Independent benchmarking results covering launches across LLMs, image, video, music and more
21 May 2025