Comparison Summary
Features | Model Intelligence | Price | Output tokens/s | Latency | End-to-End Response Time | |||
---|---|---|---|---|---|---|---|---|
![]() | DeepSeek R1 0528 Qwen3 8B | 131k | 39 | $0.06 | 92.3 | 0.38 | 27.46 | 21.66 |
![]() | DeepSeek R1 0528 Qwen3 8B | 128k | 39 | $0.07 | 79.5 | 0.82 | 32.25 | 25.14 |
Measured by Output Speed (tokens per second)
Measured by Time (seconds) to First Token
Seconds to output 500 Tokens, calculated based on time to first token, 'thinking' time for reasoning models, and output speed