Qwen3 235B A22B 2507 (Reasoning) vs. Mixtral 8x22B Instruct
Comparison between Qwen3 235B A22B 2507 (Reasoning) and Mixtral 8x22B Instruct across intelligence, price, speed, context window and more.
For details relating to our methodology, see our Methodology page.
Model Comparison
| Metric | Qwen3 235B A22B 2507 (Reasoning) | Mixtral 8x22B Instruct | Analysis |
|---|---|---|---|
| Creator | | | |
| Context Window | 256k tokens (~384 A4 pages of size 12 Arial font) | 65k tokens (~98 A4 pages of size 12 Arial font) | Qwen3 235B A22B 2507 (Reasoning) has a larger context window than Mixtral 8x22B Instruct |
| Release Date | July 2025 | April 2024 | Qwen3 235B A22B 2507 (Reasoning) has a more recent release date than Mixtral 8x22B Instruct |
| Parameters | 235B, 22B active at inference time | 141B, 39B active at inference time | Qwen3 235B A22B 2507 (Reasoning) has more total parameters than Mixtral 8x22B Instruct, but fewer active parameters at inference time |
| Image Input Support | No | No | Neither Qwen3 235B A22B 2507 (Reasoning) nor Mixtral 8x22B Instruct supports image input |
| Open Source (Weights) | Yes | Yes | Both Qwen3 235B A22B 2507 (Reasoning) and Mixtral 8x22B Instruct are open source |
| License | | | |
| License Supports Commercial Use Without Restrictions | Yes | Yes | Both Qwen3 235B A22B 2507 (Reasoning) and Mixtral 8x22B Instruct have licenses that support commercial use without restrictions |
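The page estimates in the Context Window row appear to follow a fixed ratio of about 1.5 A4 pages per 1,000 tokens (256k tokens to ~384 pages). The sketch below reproduces both figures under that assumption. The ratio is inferred from the table rather than stated on this page, and the function name `estimated_a4_pages` is hypothetical.

```python
def estimated_a4_pages(context_tokens_k: int) -> int:
    """Estimate A4 pages for a context window given in thousands of tokens.

    Assumption: 1.5 pages per 1,000 tokens, inferred from the table
    (256k tokens -> ~384 pages). Results are rounded half up.
    """
    # Integer arithmetic for round-half-up of context_tokens_k * 3 / 2.
    return (context_tokens_k * 3 + 1) // 2


context_windows_k = {
    "Qwen3 235B A22B 2507 (Reasoning)": 256,
    "Mixtral 8x22B Instruct": 65,
}

for model, tokens_k in context_windows_k.items():
    print(f"{model}: {tokens_k}k tokens, ~{estimated_a4_pages(tokens_k)} A4 pages")
```

Under this ratio the Qwen3 context window is roughly four times the size of Mixtral's, which matches the table's comparison.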
Intelligence
[Charts: Artificial Analysis Intelligence Index; Artificial Analysis Intelligence Index by Open Weights / Proprietary; Intelligence Evaluations]
Openness
[Chart: Artificial Analysis Openness Index: Results]
Intelligence Index Comparisons
[Chart: Intelligence vs. Price]
Intelligence Index Token Use & Cost
[Charts: Output Tokens Used to Run Artificial Analysis Intelligence Index; Cost to Run Artificial Analysis Intelligence Index]
Context Window
[Chart: Context Window]
Pricing
[Charts: Pricing: Input and Output Prices; Intelligence vs. Price (Log Scale)]
Speed
Measured by Output Speed (tokens per second)
[Charts: Output Speed; Output Speed vs. Price]
Latency
Measured by Time (seconds) to First Token
[Chart: Latency: Time To First Answer Token]
End-to-End Response Time
Seconds to output 500 tokens, calculated from time to first token, 'thinking' time for reasoning models, and output speed
[Chart: End-to-End Response Time]
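The description above names three inputs to the end-to-end figure: time to first token, 'thinking' time, and output speed. A minimal sketch of how they might combine is shown below. The additive formula is an assumption (the page does not give the exact calculation), the function name `end_to_end_seconds` is hypothetical, and the input values are illustrative rather than measurements of either model.

```python
def end_to_end_seconds(
    time_to_first_token_s: float,
    output_tokens_per_s: float,
    thinking_s: float = 0.0,
    answer_tokens: int = 500,
) -> float:
    """Approximate seconds needed to receive `answer_tokens` output tokens.

    Assumed formula: first-token latency + thinking time + generation time.
    `thinking_s` stays at 0.0 for non-reasoning models.
    """
    if output_tokens_per_s <= 0:
        raise ValueError("output_tokens_per_s must be positive")
    generation_s = answer_tokens / output_tokens_per_s
    return time_to_first_token_s + thinking_s + generation_s


# Illustrative inputs only, not measured values for any model.
reasoning = end_to_end_seconds(0.5, 100.0, thinking_s=10.0)
non_reasoning = end_to_end_seconds(0.5, 100.0)
print(f"reasoning: {reasoning:.1f}s, non-reasoning: {non_reasoning:.1f}s")
```

With these inputs the thinking phase dominates the total, which is why a reasoning model can have a high output speed and still a long end-to-end response time.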
Model Size (Open Weights Models Only)
[Chart: Model Size: Total and Active Parameters]
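Both models are mixture-of-experts designs in which only part of the network runs per token, so total and active parameter counts tell different stories. The sketch below computes the active share from the counts in the comparison table (235B total with 22B active, and 141B total with 39B active). The helper name `active_share` is hypothetical.

```python
def active_share(total_params_b: float, active_params_b: float) -> float:
    """Return the fraction of parameters active at inference time."""
    if total_params_b <= 0:
        raise ValueError("total_params_b must be positive")
    return active_params_b / total_params_b


# (total, active) parameter counts in billions, taken from the comparison table.
models = {
    "Qwen3 235B A22B 2507 (Reasoning)": (235, 22),
    "Mixtral 8x22B Instruct": (141, 39),
}

for name, (total_b, active_b) in models.items():
    share = active_share(total_b, active_b)
    print(f"{name}: {active_b}B of {total_b}B active ({share:.1%})")
```

Qwen3 is the larger model overall but activates a smaller share of its parameters per token than Mixtral does.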
Comparisons to Qwen3 235B A22B 2507
Qwen3 235B A22B 2507
gpt-oss-20B (high)
gpt-oss-120B (high)
GPT-5.2 (xhigh)
GPT-5.4 (xhigh)
GPT-5.4 Pro (xhigh)
GPT-5.3 Codex (xhigh)
Llama 4 Maverick
Gemini 3.1 Flash-Lite Preview
Gemini 3.1 Pro Preview
Gemini 3 Flash
Claude 4.5 Haiku
Claude Sonnet 4.6 (max)
Claude Opus 4.6 (max)
Mistral Large 3
DeepSeek V3.2
Grok 4.20 Beta 0309
Grok 4.1 Fast
Nova 2.0 Pro Preview (medium)
MiniMax-M2.5
NVIDIA Nemotron 3 Super
NVIDIA Nemotron 3 Nano
Kimi K2.5
K-EXAONE
MiMo-V2-Flash (Feb 2026)
K2 Think V2
Mi:dm K 2.5 Pro
GLM-5
Qwen3.5 397B A17B
Comparisons to Mixtral 8x22B
Mixtral 8x22B
gpt-oss-20B (high)
gpt-oss-120B (high)
GPT-5.2 (xhigh)
GPT-5.4 (xhigh)
GPT-5.4 Pro (xhigh)
GPT-5.3 Codex (xhigh)
Llama 4 Maverick
Gemini 3.1 Flash-Lite Preview
Gemini 3.1 Pro Preview
Gemini 3 Flash
Claude 4.5 Haiku
Claude Sonnet 4.6 (max)
Claude Opus 4.6 (max)
Mistral Large 3
DeepSeek V3.2
Grok 4.20 Beta 0309
Grok 4.1 Fast
Nova 2.0 Pro Preview (medium)
MiniMax-M2.5
NVIDIA Nemotron 3 Super
NVIDIA Nemotron 3 Nano
Kimi K2.5
K-EXAONE
MiMo-V2-Flash (Feb 2026)
K2 Think V2
Mi:dm K 2.5 Pro
GLM-5
Qwen3.5 397B A17B
Frequently Asked Questions
Gemini 3.1 Pro Preview currently leads the Artificial Analysis Intelligence Index with a score of 57, out of 295 models evaluated.
The top AI models by Intelligence Index are: 1. Gemini 3.1 Pro Preview (57), 2. GPT-5.4 (xhigh) (57), 3. GPT-5.3 Codex (xhigh) (54), 4. Claude Opus 4.6 (Adaptive Reasoning, Max Effort) (53), 5. Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) (52).
Mercury 2 is the fastest at 897.9 tokens per second, followed by Granite 4.0 H Small (445.4 t/s) and NVIDIA Nemotron 3 Super 120B A12B (Reasoning) (436.2 t/s).
Gemma 3n E4B Instruct is the most affordable at $0.03 per 1M tokens (blended), followed by LFM2 24B A2B ($0.05) and Nova Micro ($0.06).
Apriel-v1.5-15B-Thinker has the lowest time to first token at 0.37s, followed by Ministral 3 3B (0.42s) and Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) (0.43s).
GLM-5 (Reasoning) is the highest-ranked open weights model with an Intelligence Index score of 50. There are 193 open weights models out of 295 total evaluated.
The top open weights AI models by Intelligence Index are: 1. GLM-5 (Reasoning) (50), 2. Kimi K2.5 (Reasoning) (47), 3. Qwen3.5 397B A17B (Reasoning) (45).
Gemini 3.1 Pro Preview leads among 146 reasoning models with an Intelligence Index score of 57. Reasoning models use extended thinking to work through complex problems before providing answers.
Models are compared across multiple dimensions including intelligence (quality), pricing, output speed (tokens per second), latency (time to first token), end-to-end response time, and context window size. Performance metrics are measured directly using standardized prompts across 410 models.
Click on any model name or row in the charts to view its dedicated page with detailed metrics and direct comparisons against similar models. You can also use the model selector to customize which models appear in each chart.
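The affordability figures above are quoted as a single "blended" price per 1M tokens, while the pricing charts list input and output prices separately. One way to collapse the two into a single number is a weighted average, sketched below. The 3:1 input-to-output weighting is an assumption and is not stated on this page, the function name `blended_price_per_1m` is hypothetical, and the example prices are illustrative rather than real prices for any model.

```python
def blended_price_per_1m(
    input_price: float,
    output_price: float,
    input_weight: float = 3.0,
    output_weight: float = 1.0,
) -> float:
    """Weighted average of input and output prices (USD per 1M tokens).

    Assumption: a 3:1 input-to-output token ratio by default.
    """
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Illustrative prices only (USD per 1M tokens).
price = blended_price_per_1m(input_price=0.20, output_price=0.60)
print(f"blended: ${price:.2f} per 1M tokens")
```

Because input tokens carry more weight under this assumption, a model with cheap input and expensive output can still rank as affordable on a blended basis.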