Ring-flash-2.0 Intelligence, Performance & Price Analysis
Model summary
IntelligenceUpdated
Speed
Input Price
Output Price
Verbosity
Ring-flash-2.0 is below average in intelligence, but reasonably priced when comparing to other open weight models of similar size. The model supports text input, outputs text, and has a 128k tokens context window.
Ring-flash-2.0 scores 8 on the Artificial Analysis Intelligence Index, placing it below average among comparable models (averaging 9).
Pricing for Ring-flash-2.0 is $0.14 per 1M input tokens (moderately priced, average: $0.15) and $0.57 per 1M output tokens (somewhat expensive, average: $0.56).
| Reasoning | Yes This page shows the reasoning version of this model. A non-reasoning variant may also exist. |
|---|---|
| Input modality | Supports: text |
| Output modality | Supports: text |
| Context window | 128k ~192 A4 pages of size 12 Arial font |
| Total parameters | 103B |
| Active parameters | 6.1B Number of parameters active per token during inference |
| License | MIT |
| Model weights | Hugging Face |
Metrics are compared against models of the same class:
- Non-reasoning models → compared only with other non-reasoning models
- Reasoning models → compared across both reasoning and non-reasoning
- Open weights models → compared only with other open weights models of the same size class:
- Tiny: ≤4B parameters
- Small: 4B–40B parameters
- Medium: 40B–150B parameters
- Large: >150B parameters
- Proprietary models → compared across proprietary and open weights models of the same price range, using a blended 3:1 input/output price ratio:
- <$0.15 per 1M tokens
- $0.15–$1 per 1M tokens
- >$1 per 1M tokens
Highlights
Speed
IntelligenceUpdated
Artificial Analysis Intelligence Index
Artificial Analysis Intelligence Index by Open Weights / Proprietary
Intelligence Evaluations
Agentic real-world work tasks, (Elo-500)/2000
Agentic tool use
Agentic coding & terminal use
Coding
Reasoning & knowledge
Scientific reasoning
Physics reasoning
Knowledge
1 - hallucination rate
Long context reasoning
Agentic knowledge work, Elo
Instruction following
Long-horizon agentic tasks
Kubernetes incident root-cause analysis
Visual reasoning
Openness
Artificial Analysis Openness Index: Score
Intelligence Index Comparisons
Intelligence vs. Cost per Intelligence Index Task
Price and Cost
Cost per Intelligence Index Task
Cost to Run Artificial Analysis Intelligence Index
Pricing: Cache Hit, Input, and Output
Context Window
Context Window
Model Size (Open Weights Models Only)
Model Size: Total and Active Parameters
Frequently Asked Questions
Common questions about Ring-flash-2.0
Ring-flash-2.0 was released on September 19, 2025.
Ring-flash-2.0 was created by InclusionAI.
Ring-flash-2.0 scores 8 (estimated) on the Artificial Analysis Intelligence Index, placing it below average among other open weight models of similar size (median: 9).
Ring-flash-2.0 costs $0.14 per 1M input tokens (very competitive, median: $0.32) and $0.57 per 1M output tokens (better than average, median: $0.84), based on the median across providers serving the model.
Ring-flash-2.0 costs $0.14 per 1M input tokens and $0.57 per 1M output tokens (based on the median across providers serving the model). For a blended rate (7:2:1 cache hit/input/output ratio), this is $0.18 per 1M tokens. Pricing may vary by provider. Compare provider pricing
Yes, Ring-flash-2.0 is a reasoning model. It uses extended thinking or chain-of-thought reasoning to work through complex problems before providing an answer.
Ring-flash-2.0 supports text input.
Ring-flash-2.0 supports text output.
No, Ring-flash-2.0 does not support image input. It can only process text.
No, Ring-flash-2.0 is not multimodal. It only supports text input.
Ring-flash-2.0 has a context window of 130k tokens. This determines how much text and conversation history the model can process in a single request.
Yes, Ring-flash-2.0 is open weights. The model weights are publicly available and can be downloaded for self-hosting.
Ring-flash-2.0 has 103 billion parameters (6.1 billion active).
Ring-flash-2.0 is a Mixture of Experts (MoE) model with 103 billion total parameters, but only 6.1 billion active parameters are used during inference.
Ring-flash-2.0 is released under the MIT license. This license allows commercial use. View license
Ring-flash-2.0 achieves a score of 8 on the Artificial Analysis Intelligence Index. This composite benchmark evaluates models across reasoning, knowledge, mathematics, and coding.
Ring-flash-2.0 is an open weights model that can be self-hosted. View providers
Ring-flash-2.0 is an open weights model that can be downloaded and self-hosted. Compare providers
