June 1, 2026
Nemotron 3 Ultra launch announced: high-speed, leading US open weights intelligence
NVIDIA just announced the release of Nemotron 3 Ultra in Jensen Huang's Computex keynote: at 550B parameters (55B active), this is the largest Nemotron 3 model to date, and it is the most intelligent US open weights model
We partnered with NVIDIA to evaluate this model for intelligence and speed - these figures use the model’s BF16 weights, but as with Nemotron 3 Super the model will be made available in NVFP4 quantization as well for higher inference performance.
➤ New leader for US open weights intelligence: Nemotron 3 Ultra scores 48 on the Artificial Analysis Intelligence Index. This is well ahead of the next strongest US open weights models, Gemma 4 31B (39), Nemotron 3 Super (36) and gpt-oss-120b (33), but behind the Chinese-led open weights frontier (Kimi K2.6 at 54).
➤ Leading speed for its intelligence: on a pre-release DeepInfra endpoint, Nemotron 3 Ultra served over 300 tokens per second. Peer models in its size class from China-based labs such as DeepSeek and Moonshot (Kimi) are generally served at speeds of 50-100 tokens per second in the market today. gpt-oss-120b is served at speeds similar to this level, but with significantly lower intelligence.
➤ Largest Nemotron 3 model so far: at approximately 550 billion total parameters and 90% sparsity, Nemotron 3 Ultra is significantly larger than its siblings and is the largest recent US open weights model release
We’ll be sharing additional analysis and full benchmarks at release.

Read the latest

Claude Opus 4.8 - The new #1 AI model
Anthropic retakes #1 on GDPval-AA and advances in terminal use and scientific reasoning
May 28, 2026

MiniCPM5-1B: The leading 1B open weights model
OpenBMB has released MiniCPM5-1B (Non-reasoning), the leading 1B open weights model scoring 17.9 on the Artificial Analysis Intelligence Index
May 26, 2026

Cursor’s Composer 2.5: third on the Coding Agent Index and ~10-60x lower cost than rivals
This release puts Composer among the leading coding agent models, something that wasn’t clear for past releases
May 21, 2026