Artificial Analysis

human eval

Prompt

evaluate whether they sound human or like a bot

Qwen3 235B A22B (Non-reasoning)

DeepSeek V3 0324

Llama 4 Maverick

DeepSeek R1 0528 (May '25)

Claude 4 Sonnet (Non-reasoning)

GPT-5 (ChatGPT)
