ZenMux: Models Intelligence, Performance & Price
Analysis of ZenMux's models across key metrics including quality, price, output speed, latency, context window & more. This analysis is intended to support you in choosing the best model provided by ZenMux for your use-case.
Intelligence
Artificial Analysis Intelligence Index; Higher is better
No data available
Speed
Output Tokens per Second; Higher is better
No data available
Price
USD per 1M Tokens; Lower is better
No data available
Intelligence Evaluations
Artificial Analysis Intelligence Index
Artificial Analysis Intelligence Index; Higher is better
No data available
Intelligence Evaluations
Intelligence evaluations measured independently by Artificial Analysis; Higher is better
Results claimed by AI Lab (not yet independently verified)
GDPval-AA (Agentic Real-World Work Tasks, (ELO-500)/2000)
No data available
Terminal-Bench Hard (Agentic Coding & Terminal Use)
No data available
𝜏²-Bench Telecom (Agentic Tool Use)
No data available
AA-LCR (Long Context Reasoning)
No data available
AA-Omniscience Accuracy (Knowledge)
No data available
AA-Omniscience Non-Hallucination Rate (1 - Hallucination Rate)
No data available
Humanity's Last Exam (Reasoning & Knowledge)
No data available
GPQA Diamond (Scientific Reasoning)
No data available
SciCode (Coding)
No data available
IFBench (Instruction Following)
No data available
CritPt (Physics Reasoning)
No data available
MMMU Pro (Visual Reasoning)
No data available
Intelligence vs. Price
Artificial Analysis Intelligence Index; Price: USD per 1M Tokens
Most attractive quadrant
Context Window
Context Window
Context Window: Tokens Limit; Higher is better
No data available
JSON Mode & Function Calling
Function (Tool) Calling & JSON Mode
No comments available
Please check back later or adjust your filters.
Pricing
Intelligence vs. Price
Artificial Analysis Intelligence Index; Price: USD per 1M Tokens
Most attractive quadrant
Performance Summary
Output Speed vs. Price
Output Speed: Output Tokens per Second; Price: USD per 1M Tokens; 10,000 Input Tokens
Most attractive quadrant
Speed
Measured by Output Speed (tokens per second)
Output Speed
Output Tokens per Second; Higher is better
No data available
Latency
Measured by Time (seconds) to First Token
Time to First Token
Seconds to First Token Received; Lower is better
No data available
End-to-End Response Time
Seconds to output 500 Tokens, calculated based on time to first token, 'thinking' time for reasoning models, and output speed
End-to-End Response Time vs. Price
End-to-End Response Time: End-to-End Seconds to Output 500 Tokens; Price: USD per 1M Tokens
Most attractive quadrant
Key definitions
Frequently Asked Questions
Common questions about ZenMux
We are not currently tracking any models from ZenMux. Check back later for updates.