Phi-4 Intelligence, Performance & Price Analysis
Model summary
Intelligence
Speed
Input PriceUpdated
USD per 1M tokens
Cache: $0.125 (-0%)
Output Price
Verbosity
Phi-4 is below average in intelligence and somewhat expensive when comparing to other open weight non-reasoning models of similar size. The model has a 16k tokens context window with knowledge up to June 2024.
Phi-4 scores 10 on the Artificial Analysis Intelligence Index, placing it below average among comparable models (averaging 12).
Pricing for Phi-4 is $0.13 per 1M input tokens (somewhat expensive, average: $0.05) and $0.50 per 1M output tokens (expensive, average: $0.19).
At 36 tokens per second, Phi-4 is notably slow (95).
| Reasoning | No This page shows the non-reasoning version of this model. A reasoning variant may also exist. |
|---|---|
| Input modality | Supports: This information is still being updated |
| Output modality | Supports: This information is still being updated |
| Knowledge cutoff | Jun 1, 2024 |
| Context window | 16k ~24 A4 pages of size 12 Arial font |
| Total parameters | 14B |
| License | MIT |
| Model weights | Hugging Face |
Metrics are compared against models of the same class:
- Non-reasoning models → compared only with other non-reasoning models
- Reasoning models → compared across both reasoning and non-reasoning
- Open weights models → compared only with other open weights models of the same size class:
- Tiny: ≤4B parameters
- Small: 4B–40B parameters
- Medium: 40B–150B parameters
- Large: >150B parameters
- Proprietary models → compared across proprietary and open weights models of the same price range, using a blended 3:1 input/output price ratio:
- <$0.15 per 1M tokens
- $0.15–$1 per 1M tokens
- >$1 per 1M tokens
Highlights
Intelligence
Artificial Analysis Intelligence Index
Artificial Analysis Intelligence Index by Open Weights / Proprietary
Intelligence Evaluations
Agentic real-world work tasks, (Elo-500)/2000
Agentic coding & terminal use
Agentic tool use
Long context reasoning
Knowledge
1 - hallucination rate
Reasoning & knowledge
Scientific reasoning
Coding
Instruction following
Physics reasoning
Long-horizon agentic tasks
Kubernetes incident root-cause analysis
Visual reasoning
Openness
Artificial Analysis Openness Index: Score
Intelligence Index Comparisons
Intelligence vs. Price
Intelligence Index Token Use & Cost
Output Tokens Used to Run Artificial Analysis Intelligence Index
Cost to Run Artificial Analysis Intelligence Index
Context Window
Context Window
PricingUpdated
Pricing: Cache Hit, Input, and Output
Speed
Measured by Output Speed (tokens per second)
Output Speed
Output Speed vs. Price
Latency
Measured by Time (seconds) to First Token
Latency: Time To First Answer Token
End-to-End Response Time
Seconds to output 500 tokens, calculated based on time to first token, 'thinking' time for reasoning models, and output speed
End-to-End Response Time
Model Size (Open Weights Models Only)
Model Size: Total and Active Parameters
Frequently Asked Questions
Common questions about Phi-4
Phi-4 was released on December 12, 2024.
Phi-4 was created by Microsoft.
Phi-4 scores 10 on the Artificial Analysis Intelligence Index, placing it below average among other open weight non-reasoning models of similar size (median: 12).
Phi-4 generates output at 35.7 tokens per second (based on Microsoft's API), which is at the lower end compared to other open weight non-reasoning models of similar size (median: 95.3 t/s).
Phi-4 has a time to first token (TTFT) of 2.13s (based on Microsoft's API), which is somewhat higher than average compared to other open weight non-reasoning models of similar size (median: 1.63s).
Phi-4 costs $0.13 per 1M input tokens (better than average, median: $0.13) and $0.50 per 1M output tokens (somewhat higher than average, median: $0.25), based on Microsoft's API.
Phi-4 costs $0.13 per 1M input tokens and $0.50 per 1M output tokens (based on Microsoft's API). For a blended rate (7:2:1 cache hit/input/output ratio), this is $0.16 per 1M tokens. Pricing may vary by provider. Compare provider pricing
No, Phi-4 is not a reasoning model. It provides direct responses without extended chain-of-thought reasoning.
Phi-4 supports text only input.
Phi-4 supports text only output.
No, Phi-4 does not support image input. It can only process text.
No, Phi-4 is not multimodal. It only supports text only input.
Phi-4 has a context window of 16k tokens. This determines how much text and conversation history the model can process in a single request.
Yes, Phi-4 is open weights. The model weights are publicly available and can be downloaded for self-hosting.
Phi-4 has 14 billion parameters.
Phi-4 is released under the MIT license. This license allows commercial use. View license
Phi-4 achieves a score of 10 on the Artificial Analysis Intelligence Index. This composite benchmark evaluates models across reasoning, knowledge, mathematics, and coding.
Phi-4 has a knowledge cutoff of June 2024. The model's training data includes information up to this date.
Phi-4 is an open weights model that can be self-hosted. View providers
Phi-4 is an open weights model that can be downloaded and self-hosted. Compare providers