Artificial Analysis Economics Index
Measures performance on capabilities that matter most for economics work, including economics knowledge, agentic knowledge work, reasoning, and long-context reading. Weights reflect how often each capability appears across common economics tasks.
See representative workflowsThe Artificial Analysis Economics Index combines performance across benchmarks chosen for economics work, spanning economics knowledge, agentic execution, reasoning, and long-context reading.
This composite metric prevents narrow specialization and provides a single score for tracking model performance across economics tasks.
Each capability sub-score is normalized to a 0-100 scale, then combined using the weights below. All underlying benchmarks are run independently by Artificial Analysis. See our Intelligence Benchmarking Methodology for how evaluations are conducted.
| Capability | Weight | Evaluations |
|---|---|---|
| Economics Knowledge | 35% | AA-Omniscience Business Accuracy |
| Reasoning | 35% | HLE |
| Agentic Knowledge Work | 15% | GDPval-AA v2 |
| Long-Context | 15% | LCR |
Score
Artificial Analysis Economics Index
Economics Index: Capability Breakdown
Capability Breakdown
Economics Index: Economics Knowledge
Representative Workflows
Real-world workflows that exercise the capabilities the Economics Index weights most heavily.
Release Date
Economics Index vs. Release Date
Cost
Economics Index: Cost per Task
Economics Index: Total Cost
Speed
Economics Index: Time per Task
Output Tokens
Economics Index: Output Tokens per Task
Frequently Asked Questions
The Economics Index is a composite benchmark from Artificial Analysis that measures performance on capabilities that matter most for economics work, including economics knowledge, agentic knowledge work, reasoning, and long-context reading. Weights reflect how often each capability appears across common economics tasks.
The Economics Index is calculated as a weighted average of capability sub-scores, each normalized to a 0–100 scale. The sub-scores and their weights are: Economics Knowledge (35%), Reasoning (35%), Agentic Knowledge Work (15%), and Long-Context (15%).
The Economics Index includes AA-Omniscience Business Accuracy, HLE, GDPval-AA v2, and LCR.
Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) currently has the highest Economics Index score, with a score of 62 among models with published results. View model
A higher Economics Index score indicates stronger overall performance across the benchmarks that make up the index. For a specific use case, individual benchmark results may be more informative than the composite score.