Comparison Summary
Features | Model Intelligence | Price | Output tokens/s | Latency | End-to-End Response Time | |||
---|---|---|---|---|---|---|---|---|
![]() | Claude 3.5 Sonnet (June) | 200k | $6.00 | 43.9 | 0.84 | 12.22 | N/A | |
Claude 3.5 Sonnet (June) Vertex | 200k | $6.00 | 65.8 | 0.91 | 8.50 | N/A | ||
Claude 3.5 Sonnet (June) | 200k | $6.00 | 65.3 | 0.48 | 8.14 | N/A |
Measured by Output Speed (tokens per second)
Measured by Time (seconds) to First Token
Seconds to output 500 Tokens, calculated based on time to first token, 'thinking' time for reasoning models, and output speed