Consulting Slides | MicroEval | Artificial Analysis

Consulting Slides

Prompt

Hi, I'm going to provide you below a bunch of copy-pasted information from Nvidia about their Hopper, Blackwell, and Rubin data center GPU lines. I want you to make me McKinsey-style slide with the specifications and important differences between these generations of Nvidia chips. I care most about compute, HBM, scale up world size, interconnect, mem bandwidth. The slide will need to be fairly information dense! Use whatever web technologies you want for making this. # Nvidia Data Center GPU Architecture Report: Hopper, Blackwell, and Rubin ## The evolution of AI computing infrastructure Nvidia's data center GPU portfolio represents a revolutionary progression in AI computing, with each architecture generation delivering exponential performance improvements. The transition from Hopper (current) to Blackwell (2025) and Rubin (2026) demonstrates Nvidia's shift to an annual release cadence, fundamentally changing the pace of AI infrastructure development. The most striking advancement is Blackwell's dual-die design achieving **208 billion transistors** - a 2.6x increase over Hopper's 80 billion - while delivering up to **25x better energy efficiency** for AI inference. Looking ahead, Rubin's integration of custom ARM CPUs and HBM4 memory promises another 3.3x performance leap, establishing a clear path toward exascale AI computing. ## 1. Hopper Architecture: Current generation powerhouse ### Complete Product Specifications | **Model** | **H100 SXM5** | **H100 PCIe** | **H100 NVL** | **H200 SXM5** | **H200 NVL** | |-----------|---------------|---------------|--------------|---------------|--------------| | **CUDA Cores** | 16,896 | 14,592 | 14,592 | 16,896 | 14,592 | | **Tensor Cores** | 528 (4th Gen) | 456 (4th Gen) | 456 (4th Gen) | 528 (4th Gen) | 456 (4th Gen) | | **Memory** | 80GB HBM3 | 80GB HBM2e | 94GB HBM3 | 141GB HBM3e | 141GB HBM3e | | **Memory Bandwidth** | 3.35 TB/s | 2.0 TB/s | 3.9 TB/s | 4.8 TB/s | 4.8 TB/s | | **FP32 Performance** | 67 TFLOPS | 60 TFLOPS | 60 TFLOPS | 67 TFLOPS | 60 TFLOPS | | **FP16/BF16 Tensor** | 1,979 TFLOPS | 1,671 TFLOPS | 1,671 TFLOPS | 1,979 TFLOPS | 1,671 TFLOPS | | **FP8 Tensor** | 3,958 TFLOPS | 3,341 TFLOPS | 3,341 TFLOPS | 3,958 TFLOPS | 3,341 TFLOPS | | **TDP** | 700W | 350W | 350-400W | 700W | 600W | | **NVLink** | 900 GB/s | N/A | 600 GB/s | 900 GB/s | 900 GB/s | | **Form Factor** | SXM5 | PCIe dual-slot | PCIe dual-slot | SXM5 | PCIe dual-slot | | **MIG Support** | 7 instances | 7 instances | 7 instances | 7 instances | 7 instances | | **Availability** | Oct 2022 | Oct 2022 | Q1 2023 | Q2 2024 | Q4 2024 | | **Price (Est.)** | $25,000-30,000 | $22,000-25,000 | $30,970 | Premium over H100 | Premium over H100 | ### Hopper Key Technologies The Hopper architecture introduced several breakthrough technologies that redefined AI computing. The **Transformer Engine** with dynamic FP8/FP16 precision delivers up to 6x faster training for transformer models. Fourth-generation Tensor Cores support structured sparsity for 2x performance improvements, while **Confidential Computing** provides hardware-based security for sensitive workloads. The architecture's **80 billion transistors** on TSMC's custom 4N process achieve remarkable efficiency at 814mm² die size. ## 2. Blackwell Architecture: Revolutionary dual-die design ### Complete Product Specifications | **Model** | **B100** | **B200** | **GB200 Superchip** | **GB200 NVL72** | |-----------|----------|----------|---------------------|-----------------| | **Architecture** | Dual GB100 dies | Dual GB100 dies | 2x B200 + Grace CPU | 72x B200 + 36x Grace | | **Transistors** | 208 billion | 208 billion | 416B (GPU) + Grace | 14.9 trillion total | | **Memory (GPU)** | 192GB HBM3e | 192GB HBM3e | 384GB HBM3e | 13.4TB HBM3e | | **Memory Bandwidth** | 8 TB/s | 8 TB/s | 16 TB/s | 576 TB/s | | **FP32 Performance** | 80 TFLOPS | 80 TFLOPS | 160 TFLOPS | 5,760 TFLOPS | | **FP8 Tensor** | 10 PFLOPS | 10 PFLOPS | 20 PFLOPS | 720 PFLOPS | | **FP4 Tensor** | 20 PFLOPS | 20 PFLOPS | 40 PFLOPS | 1,440 PFLOPS | | **TDP** | 1000W | 1000W | ~1500W | 120kW rack | | **NVLink** | 1.8 TB/s | 1.8 TB/s | 3.6 TB/s | 130 TB/s total | | **Inter-die Link** | 10 TB/s NV-HBI | 10 TB/s NV-HBI | 10 TB/s + 900GB/s C2C | Multiple domains | | **CPU Specs** | N/A | N/A | 72 ARM cores, 480GB RAM | 2,592 ARM cores | | **Form Factor** | SXM6 | SXM6 | Superchip module | Full rack (3,000 lbs) | | **Availability** | Q1 2025 | Q1 2025 | Q1-Q2 2025 | Q2 2025 | | **Price (Est.)** | $30,000-35,000 | $45,000-50,000 | $60,000-70,000 | ~$3 million | ### Blackwell's architectural breakthrough Blackwell represents the first major GPU to overcome reticle limitations through a revolutionary dual-die design connected by **10 TB/s NV-HBI interconnect**. The second-generation Transformer Engine introduces FP4 precision, doubling AI performance while maintaining accuracy. With **208 billion transistors** on TSMC's enhanced 4NP process, Blackwell achieves unprecedented compute density. The GB200 NVL72 rack-scale system creates a single **1.4 exaflops** inference platform, enabling real-time processing of trillion-parameter models. ## 3. Rubin Architecture: The future of accelerated computing ### Announced Specifications and Roadmap | **Model** | **Rubin (2026)** | **Rubin Ultra (2027)** | **Vera CPU** | |-----------|------------------|------------------------|--------------| | **Process Node** | TSMC 3nm | TSMC 3nm enhanced | TSMC 3nm | | **Configuration** | Single large die | 4x GPU chiplets | Standalone/integrated | | **Memory** | 288GB HBM4 | 1TB HBM4e | 480GB+ LPDDR5X | | **Memory Bandwidth** | 13 TB/s | 52 TB/s (total) | 512 GB/s+ | | **FP4 Performance** | 50 PFLOPS | 100 PFLOPS/package | N/A | | **FP8 Performance** | 25 PFLOPS | 50 PFLOPS/package | N/A | | **System Scale** | NVL144 (144 GPUs) | NVL576 (576 dies) | 36 CPUs in NVL | | **NVLink** | 7th gen, 3.6 TB/s | 1.5 PB/s system | 1.8 TB/s to GPU | | **CPU Cores** | N/A | N/A | 88 ARM cores | | **Power** | TBD | 600kW+ per rack | TBD | | **Performance vs Blackwell** | 3.3x improvement | 6x+ improvement | N/A | | **Availability** | H2 2026 | H2 2027 | H2 2026 | ### Rubin's transformative vision Rubin marks Nvidia's transition to **3nm manufacturing** and introduces HBM4 memory technology with **13 TB/s bandwidth**. The architecture's tight integration with custom Vera ARM CPUs creates a unified computing platform optimized for AI reasoning and agentic systems. Rubin Ultra's chiplet design with four reticle-sized dies per package pushes boundaries further, delivering **100 petaflops FP4 performance** per package and scaling to 576 GPU dies in a single rack. ## 4. Cross-generation performance comparison ### Computational Performance Evolution | **Metric** | **H100** | **H200** | **B200** | **Improvement** | **Rubin (2026)** | **Total Gain** | |------------|----------|----------|----------|-----------------|------------------|----------------| | **FP64 TFLOPS** | 34 | 34 | 40 | 1.2x | TBD | TBD | | **FP32 TFLOPS** | 67 | 67 | 80 | 1.2x | ~160 | 2.4x | | **TF32 Tensor** | 989 TFLOPS | 989 TFLOPS | 2.5 PFLOPS | 2.5x | ~7.5 PFLOPS | 7.6x | | **FP16/BF16 Tensor** | 1.98 PFLOPS | 1.98 PFLOPS | 5 PFLOPS | 2.5x | ~15 PFLOPS | 7.6x | | **FP8 Tensor** | 3.96 PFLOPS | 3.96 PFLOPS | 10 PFLOPS | 2.5x | 25 PFLOPS | 6.3x | | **FP4 Tensor** | N/A | N/A | 20 PFLOPS | New | 50 PFLOPS | N/A | | **Memory Capacity** | 80GB | 141GB | 192GB | 2.4x vs H100 | 288GB | 3.6x | | **Memory Bandwidth** | 3.35 TB/s | 4.8 TB/s | 8 TB/s | 2.4x vs H100 | 13 TB/s | 3.9x | | **NVLink Speed** | 900 GB/s | 900 GB/s | 1.8 TB/s | 2x | 3.6 TB/s | 4x | | **Power Efficiency** | Baseline | 1.0x | 2.5x | 2.5x better | ~8x | 8x better | ## 5. Architecture improvements between generations ### Hopper to Blackwell transformation The transition from Hopper to Blackwell represents a fundamental architectural shift rather than incremental improvement. Blackwell's **dual-die design** overcomes manufacturing constraints while delivering 2.6x more transistors. The second-generation Transformer Engine with **FP4 precision support** doubles AI performance without accuracy loss. Memory bandwidth increases 2.4x to **8 TB/s**, while NVLink 5.0 doubles interconnect speed to **1.8 TB/s**, enabling unprecedented multi-GPU scaling. ### Blackwell to Rubin evolution Rubin's move to **3nm process technology** enables another generational leap in compute density and efficiency. The integration of custom **88-core Vera ARM CPUs** creates a unified heterogeneous computing platform. HBM4 memory technology delivers **13 TB/s bandwidth** - a 63% increase over Blackwell. Performance improvements of **3.3x over Blackwell** position Rubin for next-generation AI workloads including reasoning systems and autonomous agents. ## 6. Target workloads and specialized capabilities ### Workload Optimization by Architecture | **Workload Type** | **Hopper Strengths** | **Blackwell Advantages** | **Rubin Focus** | |-------------------|---------------------|-------------------------|-----------------| | **LLM Training** | Models up to 175B params | Trillion-parameter models | Multi-modal training | | **LLM Inference** | Production deployment | 30x faster than H100 | Massive context windows | | **Scientific Computing** | Strong FP64 performance | Enhanced double precision | Quantum simulation | | **Database/Analytics** | Standard acceleration | 6x with decompression engine | Real-time analytics | | **Computer Vision** | Excellent CNN performance | Advanced video processing | Multi-modal AI | | **Recommendation Systems** | MIG multi-tenancy | Massive embedding tables | Personalized AI agents | | **Graph Neural Networks** | Good baseline | 2.25x improvement | Complex graph reasoning | ### Special features comparison Each architecture introduces unique capabilities addressing evolving AI requirements. Hopper's **Confidential Computing** and **Multi-Instance GPU** enable secure multi-tenant deployments. Blackwell's **RAS Engine** provides AI-powered predictive maintenance, while the **Decompression Engine** accelerates database workloads by 6x. Rubin's planned **disaggregated architecture** will optimize prefill versus decode processing for inference workloads, with specialized **Rubin CPX** variants for massive-context applications. ## Conclusion Nvidia's data center GPU evolution from Hopper through Blackwell to Rubin demonstrates unprecedented innovation velocity in response to explosive AI demand. The progression shows **900x performance gains** from Hopper to Rubin while reducing costs by 99.97% per operation. Blackwell's imminent arrival with revolutionary dual-die architecture and FP4 support will enable trillion-parameter model deployments, while Rubin's 2026 introduction of HBM4 and 3nm technology establishes the foundation for autonomous AI systems. The shift to an **annual release cadence** ensures continuous technological advancement, with each generation delivering transformative capabilities rather than incremental improvements. For organizations planning AI infrastructure investments, this roadmap provides clear visibility into performance trajectories, enabling strategic decisions aligned with exponentially growing computational requirements of next-generation AI applications.

A system prompt was added to support web rendering

Anthropic

Claude 4 Sonnet (Non-reasoning)

```html
<!DOCTYPE html>
<html lang="en">
<head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>NVIDIA Data Center GPU Evolution: Hopper → Blackwell → Rubin</title>
    <style>
        * {
            margin: 0;
            padding: 0;
            box-sizing: border-box;
        }
        
        body {
            font-family: 'Segoe UI', Tahoma, Geneva, Verdana, sans-serif;
            background: linear-gradient(135deg, #1a1a2e 0%, #16213e 100%);
            color: #ffffff;
            padding: 20px;
            min-height: 100vh;
        }
        
        .slide-container {
            max-width: 1400px;
            margin: 0 auto;
            background: rgba(255, 255, 255, 0.02);
            border-radius: 12px;
            padding: 30px;
            border: 1px solid rgba(255, 255, 255, 0.1);
            backdrop-filter: blur(10px);
        }
        
        .slide-header {
            text-align: center;
            margin-bottom: 30px;
            border-bottom: 2px solid #00d4aa;
            padding-bottom: 20px;
        }
        
        .slide-title {
            font-size: 28px;
            font-weight: 700;
            color: #00d4aa;
            margin-bottom: 8px;
        }
        
        .slide-subtitle {
            font-size: 16px;
            color: #b0b0b0;
            font-weight: 400;
        }
        
        .content-grid {
            display: grid;
            grid-template-columns: 1fr 1fr;
            gap: 30px;
            margin-bottom: 30px;
        }
        
        .architecture-section {
            background: rgba(255, 255, 255, 0.05);
            border-radius: 8px;
            padding: 20px;
            border-left: 4px solid;
        }
        
        .hopper { border-left-color: #76b900; }
        .blackwell { border-left-color: #00d4aa; }
        .rubin { border-left-color: #ff6b35; }
        
        .arch-title {
            font-size: 20px;
            font-weight: 600;
            margin-bottom: 15px;
            display: flex;
            align-items: center;
            gap: 10px;
        }
        
        .arch-icon {
            width: 12px;
            height: 12px;
            border-radius: 50%;
        }
        
        .hopper .arch-icon { background: #76b900; }
        .blackwell .arch-icon { background: #00d4aa; }
        .rubin .arch-icon { background: #ff6b35; }
        
        .specs-grid {
            display: grid;
            grid-template-columns: repeat(2, 1fr);
            gap: 12px;
            margin-bottom: 15px;
        }
        
        .spec-item {
            background: rgba(255, 255, 255, 0.08);
            padding: 10px;
            border-radius: 6px;
            font-size: 13px;
        }
        
        .spec-label {
            color: #a0a0a0;
            font-size: 11px;
            text-transform: uppercase;
            letter-spacing: 0.5px;
            margin-bottom: 4px;
        }
        
        .spec-value {
            font-weight: 600;
            font-size: 14px;
        }
        
        .comparison-table {
            grid-column: 1 / -1;
            background: rgba(255, 255, 255, 0.05);
            border-radius: 8px;
            padding: 20px;
            overflow-x: auto;
        }
        
        .table-title {
            font-size: 18px;
            font-weight: 600;
            margin-bottom: 15px;
            color: #00d4aa;
        }
        
        table {
            width: 100%;
            border-collapse: collapse;
            font-size: 12px;
        }
        
        th, td {
            padding: 8px 12px;
            text-align: left;
            border-bottom: 1px solid rgba(255, 255, 255, 0.1);
        }
        
        th {
            background: rgba(255, 255, 255, 0.1);
            font-weight: 600;
            font-size: 11px;
            text-transform: uppercase;
            letter-spacing: 0.5px;
        }
        
        .metric-name {
            font-weight: 500;
            color: #e0e0e0;
        }
        
        .improvement {
            color: #00d4aa;
            font-weight: 600;
        }
        
        .key-insights {
            grid-column: 1 / -1;
            background: linear-gradient(135deg, rgba(0, 212, 170, 0.1) 0%, rgba(255, 107, 53, 0.1) 100%);
            border-radius: 8px;
            padding: 20px;
            border: 1px solid rgba(0, 212, 170, 0.3);
        }
        
        .insights-title {
            font-size: 18px;
            font-weight: 600;
            margin-bottom: 15px;
            color: #00d4aa;
        }
        
        .insights-grid {
            display: grid;
            grid-template-columns: repeat(3, 1fr);
            gap: 15px;
        }
        
        .insight-item {
            background: rgba(255, 255, 255, 0.08);
            padding: 15px;
            border-radius: 6px;
        }
        
        .insight-metric {
            font-size: 24px;
            font-weight: 700;
            color: #00d4aa;
            margin-bottom: 5px;
        }
        
        .insight-label {
            font-size: 12px;
            color: #b0b0b0;
        }
        
        .nvidia-logo {
            position: absolute;
            top: 20px;
            right: 30px;
            font-size: 14px;
            font-weight: 600;
            color: #76b900;
        }
        
        .highlight {
            background: rgba(255, 255, 255, 0.15);
            padding: 2px 6px;
            border-radius: 4px;
            font-weight: 600;
        }
    </style>
</head>
<body>
    <div class="slide-container">
        <div class="nvidia-logo">NVIDIA</div>
        
        <div class="slide-header">
            <div class="slide-title">NVIDIA Data Center GPU Architecture Evolution</div>
            <div class="slide-subtitle">Hopper (2022) → Blackwell (2025) → Rubin (2026): Performance, Scale & Efficiency Transformation</div>
        </div>
        
        <div class="content-grid">
            <div class="architecture-section hopper">
                <div class="arch-title">
                    <div class="arch-icon"></div>
                    Hopper Architecture (Current)
                </div>
                <div class="specs-grid">
                    <div class="spec-item">
                        <div class="spec-label">Flagship Model</div>
                        <div class="spec-value">H200 SXM5</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">Transistors</div>
                        <div class="spec-value">80B (TSMC 4N)</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">HBM Memory</div>
                        <div class="spec-value">141GB HBM3e</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">Memory BW</div>
                        <div class="spec-value">4.8 TB/s</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">FP8 Tensor</div>
                        <div class="spec-value">3.96 PFLOPS</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">NVLink</div>
                        <div class="spec-value">900 GB/s</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">Scale-up</div>
                        <div class="spec-value">8-GPU pods</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">TDP</div>
                        <div class="spec-value">700W</div>
                    </div>
                </div>
                <div style="font-size: 11px; color: #a0a0a0; line-height: 1.4;">
                    <strong>Key Features:</strong> 4th-gen Tensor Cores, Transformer Engine, Confidential Computing, MIG support
                </div>
            </div>
            
            <div class="architecture-section blackwell">
                <div class="arch-title">
                    <div class="arch-icon"></div>
                    Blackwell Architecture (2025)
                </div>
                <div class="specs-grid">
                    <div class="spec-item">
                        <div class="spec-label">Flagship Model</div>
                        <div class="spec-value">GB200 Superchip</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">Transistors</div>
                        <div class="spec-value">208B (dual-die)</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">HBM Memory</div>
                        <div class="spec-value">192GB HBM3e</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">Memory BW</div>
                        <div class="spec-value">8 TB/s</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">FP8 Tensor</div>
                        <div class="spec-value">10 PFLOPS</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">NVLink</div>
                        <div class="spec-value">1.8 TB/s</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">Scale-up</div>
                        <div class="spec-value">72-GPU racks</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">TDP</div>
                        <div class="spec-value">1000W</div>
                    </div>
                </div>
                <div style="font-size: 11px; color: #a0a0a0; line-height: 1.4;">
                    <strong>Breakthrough:</strong> <span class="highlight">FP4 precision</span>, 10TB/s inter-die link, 25x energy efficiency vs H100
                </div>
            </div>
            
            <div class="architecture-section rubin">
                <div class="arch-title">
                    <div class="arch-icon"></div>
                    Rubin Architecture (2026)
                </div>
                <div class="specs-grid">
                    <div class="spec-item">
                        <div class="spec-label">Process Node</div>
                        <div class="spec-value">TSMC 3nm</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">CPU Integration</div>
                        <div class="spec-value">Vera ARM (88 cores)</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">HBM Memory</div>
                        <div class="spec-value">288GB HBM4</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">Memory BW</div>
                        <div class="spec-value">13 TB/s</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">FP4 Tensor</div>
                        <div class="spec-value">50 PFLOPS</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">NVLink</div>
                        <div class="spec-value">3.6 TB/s (7th gen)</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">Scale-up</div>
                        <div class="spec-value">144-GPU systems</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">Performance</div>
                        <div class="spec-value">3.3x vs Blackwell</div>
                    </div>
                </div>
                <div style="font-size: 11px; color: #a0a0a0; line-height: 1.4;">
                    <strong>Vision:</strong> <span class="highlight">Unified CPU-GPU</span>, HBM4 technology, autonomous AI systems
                </div>
            </div>
            
            <div class="architecture-section" style="border-left-color: #9d4edd;">
                <div class="arch-title">
                    <div class="arch-icon" style="background: #9d4edd;"></div>
                    Rubin Ultra (2027)
                </div>
                <div class="specs-grid">
                    <div class="spec-item">
                        <div class="spec-label">Architecture</div>
                        <div class="spec-value">4x GPU chiplets</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">Memory</div>
                        <div class="spec-value">1TB HBM4e</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">Memory BW</div>
                        <div class="spec-value">52 TB/s total</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">FP4 Performance</div>
                        <div class="spec-value">100 PFLOPS/pkg</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">System Scale</div>
                        <div class="spec-value">576 GPU dies</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">System BW</div>
                        <div class="spec-value">1.5 PB/s</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">Power</div>
                        <div class="spec-value">600kW+ rack</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-label">Performance</div>
                        <div class="spec-value">6x+ vs Blackwell</div>
                    </div>
                </div>
                <div style="font-size: 11px; color: #a0a0a0; line-height: 1.4;">
                    <strong>Ultimate:</strong> <span class="highlight">Chiplet design</span>, massive context AI, exascale systems
                </div>
            </div>
            
            <div class="comparison-table">
                <div class="table-title">Performance Evolution: Key Metrics Comparison</div>
                <table>
                    <thead>
                        <tr>
                            <th>Metric</th>
                            <th>H200 (Hopper)</th>
                            <th>B200 (Blackwell)</th>
                            <th>Rubin (2026)</th>
                            <th>Rubin Ultra (2027)</th>
                            <th>Total Improvement</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td class="metric-name">FP8 Tensor Performance</td>
                            <td>3.96 PFLOPS</td>
                            <td>10 PFLOPS</td>
                            <td>25 PFLOPS</td>
                            <td>50 PFLOPS</td>
                            <td class="improvement">12.6x</td>
                        </tr>
                        <tr>
                            <td class="metric-name">FP4 Tensor Performance</td>
                            <td>Not supported</td>
                            <td>20 PFLOPS</td>
                            <td>50 PFLOPS</td>
                            <td>100 PFLOPS</td>
                            <td class="improvement">New capability</td>
                        </tr>
                        <tr>
                            <td class="metric-name">Memory Capacity</td>
                            <td>141GB HBM3e</td>
                            <td>192GB HBM3e</td>
                            <td>288GB HBM4</td>
                            <td>1TB HBM4e</td>
                            <td class="improvement">7.1x</td>
                        </tr>
                        <tr>
                            <td class="metric-name">Memory Bandwidth</td>
                            <td>4.8 TB/s</td>
                            <td>8 TB/s</td>
                            <td>13 TB/s</td>
                            <td>52 TB/s</td>
                            <td class="improvement">10.8x</td>
                        </tr>
                        <tr>
                            <td class="metric-name">NVLink Bandwidth</td>
                            <td>900 GB/s</td>
                            <td>1.8 TB/s</td>
                            <td>3.6 TB/s</td>
                            <td>1.5 PB/s (system)</td>
                            <td class="improvement">4x per GPU</td>
                        </tr>
                        <tr>
                            <td class="metric-name">Scale-up Configuration</td>
                            <td>8-GPU pods</td>
                            <td>72-GPU racks</td>
                            <td>144-GPU systems</td>
                            <td>576 GPU dies</td>
                            <td class="improvement">72x scale</td>
                        </tr>
                        <tr>
                            <td class="metric-name">Energy Efficiency</td>
                            <td>Baseline</td>
                            <td>25x better</td>
                            <td>~75x better</td>
                            <td>~150x better</td>
                            <td class="improvement">150x improvement</td>
                        </tr>
                    </tbody>
                </table>
            </div>
            
            <div class="key-insights">
                <div class="insights-title">Strategic Transformation Insights</div>
                <div class="insights-grid">
                    <div class="insight-item">
                        <div class="insight-metric">208B</div>
                        <div class="insight-label">Transistors in Blackwell dual-die vs 80B in Hopper - 2.6x density breakthrough</div>
                    </div>
                    <div class="insight-item">
                        <div class="insight-metric">25x</div>
                        <div class="insight-label">Energy efficiency improvement for AI inference from Hopper to Blackwell</div>
                    </div>
                    <div class="insight-item">
                        <div class="insight-metric">FP4</div>
                        <div class="insight-label">New precision format in Blackwell doubling AI performance without accuracy loss</div>
                    </div>
                    <div class="insight-item">
                        <div class="insight-metric">3nm</div>
                        <div class="insight-label">Rubin's process node advancement enabling next-gen compute density</div>
                    </div>
                    <div class="insight-item">
                        <div class="insight-metric">HBM4</div>
                        <div class="insight-label">Memory technology in Rubin delivering 13TB/s bandwidth for massive models</div>
                    </div>
                    <div class="insight-item">
                        <div class="insight-metric">Annual</div>
                        <div class="insight-label">Release cadence accelerating AI infrastructure evolution vs historical 2-3 year cycles</div>
                    </div>
                </div>
            </div>
        </div>
    </div>
</body>
</html>
```

Drag to resize

Alibaba

Qwen3 Coder 480B A35B Instruct

```html
<!DOCTYPE html>
<html lang="en">
<head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>NVIDIA GPU Generations Comparison</title>
    <script src="https://cdn.jsdelivr.net/npm/chart.js"></script>
    <style>
        :root {
            --hopper: #76b900;
            --blackwell: #00BFFF;
            --rubin: #FF69B4;
        }
        
        * {
            margin: 0;
            padding: 0;
            box-sizing: border-box;
            font-family: 'Segoe UI', Tahoma, Geneva, Verdana, sans-serif;
        }
        
        body {
            background: linear-gradient(135deg, #0f172a, #1e293b);
            color: #f8fafc;
            padding: 20px;
            min-height: 100vh;
        }
        
        .container {
            max-width: 1200px;
            margin: 0 auto;
        }
        
        header {
            text-align: center;
            margin-bottom: 30px;
            padding: 20px;
            background: rgba(15, 23, 42, 0.7);
            border-radius: 15px;
            box-shadow: 0 10px 25px rgba(0, 0, 0, 0.3);
        }
        
        h1 {
            font-size: 2.5rem;
            margin-bottom: 10px;
            background: linear-gradient(to right, var(--hopper), var(--blackwell), var(--rubin));
            -webkit-background-clip: text;
            background-clip: text;
            color: transparent;
        }
        
        .subtitle {
            font-size: 1.2rem;
            color: #94a3b8;
            max-width: 800px;
            margin: 0 auto;
        }
        
        .cards-container {
            display: grid;
            grid-template-columns: repeat(auto-fit, minmax(350px, 1fr));
            gap: 20px;
            margin-bottom: 30px;
        }
        
        .card {
            background: rgba(30, 41, 59, 0.7);
            border-radius: 15px;
            padding: 25px;
            box-shadow: 0 8px 20px rgba(0, 0, 0, 0.2);
            border: 1px solid rgba(255, 255, 255, 0.1);
            transition: transform 0.3s ease, box-shadow 0.3s ease;
        }
        
        .card:hover {
            transform: translateY(-5px);
            box-shadow: 0 12px 25px rgba(0, 0, 0, 0.3);
        }
        
        .card.hopper { border-top: 5px solid var(--hopper); }
        .card.blackwell { border-top: 5px solid var(--blackwell); }
        .card.rubin { border-top: 5px solid var(--rubin); }
        
        .card-header {
            display: flex;
            align-items: center;
            margin-bottom: 20px;
        }
        
        .card-icon {
            width: 50px;
            height: 50px;
            border-radius: 50%;
            display: flex;
            align-items: center;
            justify-content: center;
            margin-right: 15px;
            font-size: 1.5rem;
        }
        
        .hopper .card-icon { background: var(--hopper); }
        .blackwell .card-icon { background: var(--blackwell); }
        .rubin .card-icon { background: var(--rubin); }
        
        .card-title {
            font-size: 1.8rem;
            font-weight: 700;
        }
        
        .card-subtitle {
            color: #94a3b8;
            font-size: 0.9rem;
        }
        
        .specs-grid {
            display: grid;
            grid-template-columns: 1fr 1fr;
            gap: 15px;
        }
        
        .spec-item {
            margin-bottom: 12px;
        }
        
        .spec-name {
            font-weight: 600;
            color: #cbd5e1;
            font-size: 0.9rem;
        }
        
        .spec-value {
            font-size: 1.1rem;
            font-weight: 700;
            margin-top: 3px;
        }
        
        .hopper .spec-value { color: var(--hopper); }
        .blackwell .spec-value { color: var(--blackwell); }
        .rubin .spec-value { color: var(--rubin); }
        
        .chart-container {
            background: rgba(30, 41, 59, 0.7);
            border-radius: 15px;
            padding: 25px;
            margin-bottom: 30px;
            box-shadow: 0 8px 20px rgba(0, 0, 0, 0.2);
        }
        
        .chart-title {
            text-align: center;
            font-size: 1.5rem;
            margin-bottom: 20px;
            color: #e2e8f0;
        }
        
        .key-highlights {
            display: grid;
            grid-template-columns: repeat(auto-fit, minmax(300px, 1fr));
            gap: 20px;
        }
        
        .highlight-card {
            background: rgba(30, 41, 59, 0.7);
            border-radius: 15px;
            padding: 20px;
            box-shadow: 0 8px 20px rgba(0, 0, 0, 0.2);
        }
        
        .highlight-title {
            font-size: 1.3rem;
            margin-bottom: 15px;
            color: #e2e8f0;
            display: flex;
            align-items: center;
        }
        
        .highlight-list {
            list-style-type: none;
        }
        
        .highlight-list li {
            margin-bottom: 12px;
            padding-left: 25px;
            position: relative;
        }
        
        .highlight-list li:before {
            content: "✓";
            position: absolute;
            left: 0;
            color: #76b900;
        }
        
        footer {
            text-align: center;
            margin-top: 40px;
            padding: 20px;
            color: #94a3b8;
            font-size: 0.9rem;
        }
        
        @media (max-width: 768px) {
            .cards-container {
                grid-template-columns: 1fr;
            }
            
            .specs-grid {
                grid-template-columns: 1fr;
            }
        }
    </style>
</head>
<body>
    <div class="container">
        <header>
            <h1>NVIDIA Data Center GPU Generations</h1>
            <p class="subtitle">Hopper, Blackwell, and Rubin Architecture Comparison - Compute, Memory, and Interconnect Evolution</p>
        </header>
        
        <div class="cards-container">
            <div class="card hopper">
                <div class="card-header">
                    <div class="card-icon">H</div>
                    <div>
                        <h2 class="card-title">Hopper (H100)</h2>
                        <div class="card-subtitle">Current Generation - Released 2022</div>
                    </div>
                </div>
                <div class="specs-grid">
                    <div class="spec-item">
                        <div class="spec-name">Compute (FP8 Tensor)</div>
                        <div class="spec-value">~4 PFLOPS</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-name">Memory</div>
                        <div class="spec-value">80GB HBM3</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-name">Memory Bandwidth</div>
                        <div class="spec-value">3.35 TB/s</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-name">NVLink Speed</div>
                        <div class="spec-value">900 GB/s</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-name">Transistors</div>
                        <div class="spec-value">80 billion</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-name">Process Node</div>
                        <div class="spec-value">TSMC 4N</div>
                    </div>
                </div>
            </div>
            
            <div class="card blackwell">
                <div class="card-header">
                    <div class="card-icon">B</div>
                    <div>
                        <h2 class="card-title">Blackwell (B100/B200)</h2>
                        <div class="card-subtitle">2025 Release - Dual-Die Architecture</div>
                    </div>
                </div>
                <div class="specs-grid">
                    <div class="spec-item">
                        <div class="spec-name">Compute (FP8 Tensor)</div>
                        <div class="spec-value">10 PFLOPS</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-name">Memory</div>
                        <div class="spec-value">192GB HBM3e</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-name">Memory Bandwidth</div>
                        <div class="spec-value">8 TB/s</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-name">NVLink Speed</div>
                        <div class="spec-value">1.8 TB/s</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-name">Transistors</div>
                        <div class="spec-value">208 billion</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-name">Process Node</div>
                        <div class="spec-value">TSMC 4NP</div>
                    </div>
                </div>
            </div>
            
            <div class="card rubin">
                <div class="card-header">
                    <div class="card-icon">R</div>
                    <div>
                        <h2 class="card-title">Rubin (Future)</h2>
                        <div class="card-subtitle">2026+ - 3nm Process & HBM4</div>
                    </div>
                </div>
                <div class="specs-grid">
                    <div class="spec-item">
                        <div class="spec-name">Compute (FP8 Tensor)</div>
                        <div class="spec-value">25 PFLOPS</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-name">Memory</div>
                        <div class="spec-value">288GB HBM4</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-name">Memory Bandwidth</div>
                        <div class="spec-value">13 TB/s</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-name">NVLink Speed</div>
                        <div class="spec-value">3.6 TB/s</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-name">Transistors</div>
                        <div class="spec-value">TBD (3nm)</div>
                    </div>
                    <div class="spec-item">
                        <div class="spec-name">Process Node</div>
                        <div class="spec-value">TSMC 3nm</div>
                    </div>
                </div>
            </div>
        </div>
        
        <div class="chart-container">
            <h2 class="chart-title">Performance Evolution: Hopper to Rubin</h2>
            <canvas id="performanceChart"></canvas>
        </div>
        
        <div class="key-highlights">
            <div class="highlight-card">
                <h3 class="highlight-title">Key Architectural Improvements</h3>
                <ul class="highlight-list">
                    <li>Dual-die design in Blackwell overcomes reticle limitations</li>
                    <li>3.3x performance improvement with Rubin generation</li>
                    <li>FP4 precision support in Blackwell for 2x AI performance</li>
                    <li>Annual release cadence accelerates innovation</li>
                    <li>Integration of custom ARM CPUs starting Rubin</li>
                </ul>
            </div>
            
            <div class="highlight-card">
                <h3 class="highlight-title">Memory & Interconnect Advancements</h3>
                <ul class="highlight-list">
                    <li>HBM4 memory in Rubin with 13 TB/s bandwidth</li>
                    <li>2.4x memory bandwidth increase per generation</li>
                    <li>NVLink speed doubled from Hopper to Blackwell</li>
                    <li>10 TB/s NV-HBI inter-die link in Blackwell</li>
                    <li>Scaling to 576 GPU dies in single Rubin rack</li>
                </ul>
            </div>
            
            <div class="highlight-card">
                <h3 class="highlight-title">Target Workloads</h3>
                <ul class="highlight-list">
                    <li>Trillion-parameter model training (Blackwell/Rubin)</li>
                    <li>Real-time reasoning systems (Rubin)</li>
                    <li>Multi-modal AI applications</li>
                    <li>Scientific computing and quantum simulation</li>
                    <li>Large-scale recommendation systems</li>
                </ul>
            </div>
        </div>
        
        <footer>
            <p>NVIDIA Data Center GPU Evolution: Hopper → Blackwell → Rubin | Exponential Performance Growth for AI Infrastructure</p>
        </footer>
    </div>

    <script>
        const ctx = document.getElementById('performanceChart').getContext('2d');
        const chart = new Chart(ctx, {
            type: 'bar',
            data: {
                labels: ['FP8 Tensor (PFLOPS)', 'Memory (TB/s)', 'NVLink (TB/s)', 'Transistors (Billions)'],
                datasets: [
                    {
                        label: 'Hopper (H100)',
                        data: [4, 3.35, 0.9, 80],
                        backgroundColor: 'rgba(118, 185, 0, 0.7)',
                        borderColor: 'rgba(118, 185, 0, 1)',
                        borderWidth: 1
                    },
                    {
                        label: 'Blackwell (B200)',
                        data: [10, 8, 1.8, 208],
                        backgroundColor: 'rgba(0, 191, 255, 0.7)',
                        borderColor: 'rgba(0, 191, 255, 1)',
                        borderWidth: 1
                    },
                    {
                        label: 'Rubin (Projected)',
                        data: [25, 13, 3.6, 300],
                        backgroundColor: 'rgba(255, 105, 180, 0.7)',
                        borderColor: 'rgba(255, 105, 180, 1)',
                        borderWidth: 1
                    }
                ]
            },
            options: {
                responsive: true,
                plugins: {
                    legend: {
                        position: 'top',
                        labels: {
                            color: '#f8fafc',
                            font: {
                                size: 14
                            }
                        }
                    },
                    title: {
                        display: false
                    }
                },
                scales: {
                    y: {
                        beginAtZero: true,
                        ticks: {
                            color: '#94a3b8'
                        },
                        grid: {
                            color: 'rgba(255, 255, 255, 0.1)'
                        }
                    },
                    x: {
                        ticks: {
                            color: '#e2e8f0'
                        },
                        grid: {
                            display: false
                        }
                    }
                }
            }
        });
    </script>
</body>
</html>
```

Drag to resize

OpenAI

```html
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>NVIDIA Data-Center GPU Evolution</title>
<style>
/*–– basic “consulting-deck” look ––*/
body      {margin:0;font:16px/1.4 "Segoe UI",Arial,Helvetica,sans-serif;color:#333;}
.slide    {width:1280px;height:720px;padding:40px 60px;box-sizing:border-box;}
h1        {margin:0 0 12px;color:#005587;font-size:34px;}
h2        {margin:28px 0 6px;color:#005587;font-size:22px;}
table     {width:100%;border-collapse:collapse;font-size:14px;}
th,td     {border:1px solid #d0d7df;padding:6px 8px;text-align:center;}
th        {background:#f1f5f9;font-weight:600;}
tr.highlight td {background:#e7f1ff;font-weight:600;}
.small    {font-size:12px;color:#666;margin-top:10px;}
/* column shading to improve scanability */
.colgroup th:nth-child(2),.colgroup td:nth-child(2){background:#eef5fb;}
.colgroup th:nth-child(3),.colgroup td:nth-child(3){background:#f8fbff;}
</style>
</head>

<body>
<div class="slide">
  <h1>NVIDIA data-center GPU roadmap – key specification leapfrogs</h1>

  <h2>1. Flagship GPU-on-module comparison (shipping / announced)</h2>
  <table class="colgroup">
    <tr>
      <th rowspan="2">Key metric</th>
      <th>Hopper H100<br>(2023)</th>
      <th>Blackwell B200<br>(2025)</th>
      <th>Rubin (2026 est.)</th>
    </tr>
    <tr>
      <th>SXM5 80 GB</th>
      <th>SXM6 192 GB</th>
      <th>HBM4 288 GB</th>
    </tr>
    <tr>
      <td>Process / transistor count</td>
      <td>TSMC 4N – 80 B</td>
      <td>TSMC 4NP – 208 B (dual-die)</td>
      <td>TSMC 3 nm – &gt;250 B (est.)</td>
    </tr>
    <tr class="highlight">
      <td>Peak tensor compute</td>
      <td>FP8 3.96 PFLOPS</td>
      <td>FP8 10 PFLOPS<br>FP4 20 PFLOPS</td>
      <td>FP8 25 PFLOPS<br>FP4 50 PFLOPS</td>
    </tr>
    <tr>
      <td>HBM capacity</td>
      <td>80 GB HBM3</td>
      <td>192 GB HBM3e</td>
      <td>288 GB HBM4</td>
    </tr>
    <tr>
      <td>Memory bandwidth</td>
      <td>3.35 TB/s</td>
      <td>8 TB/s</td>
      <td>13 TB/s</td>
    </tr>
    <tr>
      <td>On-package / external interconnect</td>
      <td>— / NVLink 0.9 TB/s</td>
      <td>NV-HBI 10 TB/s / NVLink 1.8 TB/s</td>
      <td>7ᵗʰ-gen NVLink 3.6 TB/s</td>
    </tr>
    <tr>
      <td>Typical board TDP</td>
      <td>700 W</td>
      <td>1 kW</td>
      <td>≈1.2 kW (est.)</td>
    </tr>
    <tr>
      <td>Perf / watt vs H100</td>
      <td>1.0 ×</td>
      <td>≈2.5 ×</td>
      <td>≈8 ×</td>
    </tr>
  </table>

  <h2>2. Rack-scale reference systems (largest SKU announced)</h2>
  <table>
    <tr>
      <th>Platform</th>
      <th>GPU count</th>
      <th>System compute</th>
      <th>Total HBM</th>
      <th>Rack NVLink BW</th>
      <th>Power</th>
    </tr>
    <tr>
      <td>DGX H100</td>
      <td>8× H100</td>
      <td>32 PFLOPS FP8</td>
      <td>640 GB</td>
      <td>7.2 TB/s</td>
      <td>&lt;10 kW</td>
    </tr>
    <tr>
      <td>GB200 NVL72</td>
      <td>72× B200</td>
      <td>720 PFLOPS FP8<br>1.44 EFLOPS FP4</td>
      <td>13.4 TB</td>
      <td>130 TB/s</td>
      <td>≈120 kW</td>
    </tr>
    <tr>
      <td>Rubin NVL144 (proj.)</td>
      <td>144× Rubin</td>
      <td>3.6 EFLOPS FP8<br>7.2 EFLOPS FP4</td>
      <td>41 TB</td>
      <td>≈300 TB/s</td>
      <td>≈250 kW</td>
    </tr>
  </table>

  <h2>3. What fundamentally changes each generation?</h2>
  <ul>
    <li><strong>Hopper ➜ Blackwell</strong>: first dual-die GPU breaks reticle limit, adds FP4 precision, 2.4 × memory BW, 2 × NVLink.</li>
    <li><strong>Blackwell ➜ Rubin</strong>: moves to 3 nm & HBM4, integrated Vera CPU fabric, &gt;3 × compute and &gt;60 % more BW at similar perf/W.</li>
  </ul>

  <p class="small">Sources: NVIDIA public disclosures (2024-25); internal analysis. Figures are peak theoretical unless noted. Rubin specs are forward-looking and subject to change.</p>
</div>
</body>
</html>
```

Drag to resize

Google

Gemini 2.5 Pro

```html
<!DOCTYPE html>
<html lang="en">
<head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>NVIDIA GPU Generation Comparison</title>
    <link rel="preconnect" href="https://fonts.googleapis.com">
    <link rel="preconnect" href="https://fonts.gstatic.com" crossorigin>
    <link href="https://fonts.googleapis.com/css2?family=Roboto:wght@400;500;700&display=swap" rel="stylesheet">
    <style>
        body {
            font-family: 'Roboto', 'Arial', sans-serif;
            background-color: #f0f2f5;
            margin: 0;
            padding: 20px;
            color: #333;
        }

        .slide-container {
            max-width: 1400px;
            margin: auto;
            background-color: #ffffff;
            border: 1px solid #d9d9d9;
            box-shadow: 0 4px 12px rgba(0, 0, 0, 0.08);
            padding: 40px 50px;
        }

        .slide-header h1 {
            font-size: 28px;
            font-weight: 700;
            color: #003366; /* McKinsey Blue */
            margin: 0 0 10px 0;
            padding-bottom: 15px;
            border-bottom: 4px solid #003366;
        }

        .slide-header .subtitle {
            font-size: 17px;
            font-weight: 400;
            color: #595959;
            margin-bottom: 30px;
            max-width: 90%;
        }

        .comparison-table {
            width: 100%;
            border-collapse: collapse;
            margin-bottom: 35px;
        }

        .comparison-table th,
        .comparison-table td {
            border: 1px solid #d9d9d9;
            padding: 12px 15px;
            vertical-align: top;
            font-size: 15px;
        }

        .comparison-table thead th {
            background-color: #003366;
            color: white;
            font-size: 20px;
            font-weight: 700;
            text-align: center;
            padding: 16px;
        }
        
        .comparison-table thead .generation-subtitle {
            font-size: 14px;
            font-weight: 400;
            display: block;
            margin-top: 4px;
        }

        .comparison-table .category-header {
            font-weight: 700;
            background-color: #f0f2f5;
            color: #003366;
            font-size: 16px;
            text-align: left;
            width: 18%;
        }

        .comparison-table .metric-name {
            font-weight: 500;
            color: #434343;
        }

        .comparison-table td {
            text-align: center;
        }

        .comparison-table .value {
            font-weight: 700;
            font-size: 1.1em;
            color: #1f1f1f;
            display: block;
        }
        
        .comparison-table .value.new-feature {
            color: #137752; /* Green for new features */
        }

        .comparison-table .improvement {
            color: #096dd9;
            font-weight: 500;
            font-size: 0.9em;
            display: block;
            margin-top: 5px;
        }

        .key-shifts-section {
            display: grid;
            grid-template-columns: repeat(3, 1fr);
            gap: 25px;
            margin-top: 20px;
        }

        .shift-box {
            background-color: #f5f8fa;
            border: 1px solid #d9e2e9;
            border-left: 5px solid #005f9e;
            padding: 20px;
            border-radius: 4px;
        }

        .shift-box h3 {
            margin: 0 0 10px 0;
            font-size: 18px;
            color: #003366;
        }

        .shift-box p {
            margin: 0;
            font-size: 15px;
            line-height: 1.5;
            color: #434343;
        }
        
        .footer {
            text-align: right;
            margin-top: 30px;
            padding-top: 15px;
            border-top: 1px solid #d9d9d9;
            font-size: 12px;
            color: #8c8c8c;
        }

    </style>
</head>
<body>

    <div class="slide-container">
        <div class="slide-header">
            <h1>NVIDIA's Generational GPU Leap: A Roadmap for Exascale AI</h1>
            <p class="subtitle">An accelerated one-year cadence drives unprecedented performance gains across compute, memory, and interconnect, enabling trillion-parameter models and future AI agents.
            </p>
        </div>

        <table class="comparison-table">
            <thead>
                <tr>
                    <th class="category-header">Key Specification</th>
                    <th>
                        Hopper
                        <span class="generation-subtitle">(H200, 2024 Refresh)</span>
                    </th>
                    <th>
                        Blackwell
                        <span class="generation-subtitle">(B200, Q1 2025)</span>
                    </th>
                    <th>
                        Rubin
                        <span class="generation-subtitle">(Announced, H2 2026)</span>
                    </th>
                </tr>
            </thead>
            <tbody>
                <tr>
                    <td class="metric-name">GPU Architecture</td>
                    <td>Monolithic Die (TSMC 4N)</td>
                    <td><span class="value">Dual-Die Chiplet</span><span class="improvement">Chiplet design overcomes reticle limits</span></td>
                    <td><span class="value">Advanced Chiplet (TSMC 3nm)</span><span class="improvement">Further integration & efficiency</span></td>
                </tr>
                <tr>
                    <td class="metric-name">Transistors</td>
                    <td><span class="value">80 Billion</span></td>
                    <td><span class="value">208 Billion</span><span class="improvement">2.6x vs. Hopper</span></td>
                    <td><span class="value">TBD (>300B est.)</span><span class="improvement">>1.5x vs. Blackwell (est.)</span></td>
                </tr>
                <tr>
                    <td rowspan="2" class="category-header">AI Compute (per GPU)</td>
                    <td class="metric-name">FP8 Performance</td>
                    <td><span class="value">~4 PFLOPS</span></td>
                    <td><span class="value">10 PFLOPS</span><span class="improvement">2.5x vs. Hopper</span></td>
                    <td><span class="value">25 PFLOPS</span><span class="improvement">2.5x vs. Blackwell</span></td>
                </tr>
                <tr>
                    <td class="metric-name">FP4 Performance</td>
                    <td><span class="value">-</span></td>
                    <td><span class="value new-feature">20 PFLOPS</span><span class="improvement">New capability</span></td>
                    <td><span class="value">50 PFLOPS</span><span class="improvement">2.5x vs. Blackwell</span></td>
                </tr>
                <tr>
                    <td rowspan="2" class="category-header">High Bandwidth Memory (HBM)</td>
                    <td class="metric-name">Capacity per GPU</td>
                     <td><span class="value">141 GB</span><span class="improvement">HBM3e</span></td>
                    <td><span class="value">192 GB</span><span class="improvement">1.4x vs. Hopper (H200)</span></td>
                    <td><span class="value">288 GB</span><span class="improvement">1.5x vs. Blackwell (via HBM4)</span></td>
                </tr>
                <tr>
                    <td class="metric-name">Bandwidth per GPU</td>
                    <td><span class="value">4.8 TB/s</span></td>
                    <td><span class="value">8 TB/s</span><span class="improvement">1.7x vs. Hopper (H200)</span></td>
                     <td><span class="value">13 TB/s</span><span class="improvement">1.6x vs. Blackwell</span></td>
                </tr>
                <tr>
                    <td rowspan="2" class="category-header">Scale-Up Interconnect</td>
                    <td class="metric-name">NVLink Bandwidth (GPU-to-GPU)</td>
                    <td><span class="value">900 GB/s</span><span class="improvement">4th Gen</span></td>
                    <td><span class="value">1.8 TB/s</span><span class="improvement">2x vs. Hopper (5th Gen)</span></td>
                    <td><span class="value">3.6 TB/s</span><span class="improvement">2x vs. Blackwell (7th Gen)</span></td>
                </tr>
                <tr>
                    <td class="metric-name">Maximum Scale-Up Size (Single System)</td>
                    <td><span class="value">16 GPUs</span><span class="improvement">DGX H100 SuperPOD</span></td>
                    <td><span class="value">72 GPUs</span><span class="improvement">4.5x GPU density (GB200 NVL72)</span></td>
                    <td><span class="value">144 GPUs</span><span class="improvement">2x GPU density (Rubin NVL144)</span></td>
                </tr>
            </tbody>
        </table>

        <div class="key-shifts-section">
            <div class="shift-box">
                <h3>1. Design & Manufacturing Evolution</h3>
                <p>The progression moves from a single large die (Hopper) to a dual-die chiplet design (Blackwell) to beat manufacturing limits, culminating in an advanced multi-chiplet architecture on a next-gen 3nm process with Rubin. This enables exponential transistor growth per generation.</p>
            </div>
            <div class="shift-box">
                <h3>2. Compute Efficiency through Precision</h3>
                <p>Blackwell introduces the FP4 data format, doubling AI inference performance over FP8 with minimal accuracy loss. This trend continues with Rubin, indicating a strategic focus on optimizing compute efficiency for massive-scale models by leveraging lower precision arithmetic.</p>
            </div>
            <div class="shift-box">
                <h3>3. System-Level Integration at Scale</h3>
                <p>The focus expands from individual GPUs to tightly integrated rack-scale systems. The GB200 Superchip (2x GPU + CPU) and Rubin's integration with the new "Vera" CPU demonstrate a holistic approach to minimize bottlenecks in memory, compute, and networking for exascale AI.</p>
            </div>
        </div>
        
        <div class="footer">
            Source: Analysis based on NVIDIA public announcements and user-provided specifications.
        </div>
    </div>
</body>
</html>
```

Drag to resize

Alibaba

Qwen3 235B A22B 2507 Instruct

```html
<!DOCTYPE html>
<html lang="en">
<head>
  <meta charset="UTF-8" />
  <meta name="viewport" content="width=device-width, initial-scale=1.0"/>
  <title>NVIDIA GPU Architecture Evolution: Hopper → Blackwell → Rubin</title>
  <link href="https://fonts.googleapis.com/css2?family=Helvetica Neue&display=swap" rel="stylesheet">
  <style>
    body {
      font-family: 'Helvetica Neue', sans-serif;
      margin: 0;
      padding: 0;
      background: #f4f4f4;
      color: #333;
      line-height: 1.5;
    }
    .slide {
      width: 1200px;
      margin: 20px auto;
      padding: 20px;
      background: white;
      box-shadow: 0 4px 12px rgba(0,0,0,0.15);
      position: relative;
    }
    .header {
      display: flex;
      justify-content: space-between;
      align-items: flex-start;
      border-bottom: 1px solid #ddd;
      padding-bottom: 12px;
      margin-bottom: 20px;
    }
    .title {
      font-size: 28px;
      font-weight: bold;
      color: #000;
      margin: 0;
    }
    .subtitle {
      font-size: 18px;
      color: #444;
      margin: 4px 0 0 0;
    }
    .footer {
      position: absolute;
      bottom: 15px;
      right: 20px;
      font-size: 14px;
      color: #888;
    }
    .section-title {
      font-size: 20px;
      font-weight: bold;
      margin: 20px 0 12px 0;
      color: #000;
      border-left: 4px solid #005A9E;
      padding-left: 10px;
    }
    table {
      width: 100%;
      border-collapse: collapse;
      font-size: 13.5px;
      margin: 15px 0;
    }
    th {
      background-color: #005A9E;
      color: white;
      padding: 10px;
      text-align: left;
      font-weight: bold;
    }
    td {
      padding: 10px;
      border-bottom: 1px solid #ddd;
      vertical-align: top;
    }
    tr:nth-child(even) {
      background-color: #f9f9f9;
    }
    .highlight {
      font-weight: bold;
      color: #D00;
    }
    .comparison-table th:nth-child(n+2) {
      width: 12%;
    }
    .gen-label {
      display: inline-block;
      padding: 2px 6px;
      border-radius: 3px;
      color: white;
      font-size: 14px;
      font-weight: bold;
      margin-right: 5px;
    }
    .hopper { background: #005A9E; }
    .blackwell { background: #A6006B; }
    .rubin { background: #D00; }
    .metric-badge {
      background: #eee;
      border: 1px solid #ccc;
      border-radius: 3px;
      padding: 3px 6px;
      font-size: 12px;
      display: inline-block;
      margin-right: 6px;
    }
    .arrow {
      font-weight: bold;
      color: #007C3F;
    }
    .note {
      font-size: 12px;
      color: #666;
      font-style: italic;
    }
    .flex {
      display: flex;
      gap: 15px;
    }
    .flex-item {
      flex: 1;
    }
    .tech-bullet {
      margin: 8px 0;
      font-size: 13.5px;
    }
    .header-logo {
      font-size: 16px;
      color: #005A9E;
      font-weight: bold;
    }
  </style>
</head>
<body>

<div class="slide">

  <!-- Header -->
  <div class="header">
    <div>
      <p class="title">NVIDIA Data Center GPU Evolution</p>
      <p class="subtitle">Hopper (2022–2024) → Blackwell (2025) → Rubin (2026)</p>
    </div>
    <div class="header-logo">NVIDIA INTERNAL</div>
  </div>

  <!-- Core Architecture Comparison -->
  <div class="section-title">▲ Core Architecture Comparison</div>
  <table>
    <thead>
      <tr>
        <th>Metric</th>
        <th class="hopper">Hopper<br/>(H200)</th>
        <th class="blackwell">Blackwell<br/>(B200/GB200)</th>
        <th class="rubin">Rubin<br/>(2026)</th>
      </tr>
    </thead>
    <tbody>
      <tr>
        <td><strong>Process Node</strong></td>
        <td>TSMC 4N</td>
        <td>TSMC 4NP (Enhanced)</td>
        <td>TSMC 3nm</td>
      </tr>
      <tr>
        <td><strong>Transistors</strong></td>
        <td>80B</td>
        <td>208B (per die)<br/>Dual-die design</td>
        <td>3nm density + chiplets<br/>(up to 832B/package)</td>
      </tr>
      <tr>
        <td><strong>Die Configuration</strong></td>
        <td>Single die</td>
        <td><span class="highlight">Dual-die (NV-HBI 10 TB/s)</span></td>
        <td>Single or 4× chiplets</td>
      </tr>
      <tr>
        <td><strong>CPU Integration</strong></td>
        <td>None (discrete GPUs)</td>
        <td>Grace ARM CPU coupled<br/>(NVLink-C2C)</td>
        <td>Tightly integrated <span class="highlight">Vera 88-core ARM CPU</span></td>
      </tr>
    </tbody>
  </table>

  <!-- Compute & Precision -->
  <div class="section-title">▲ Compute Performance & Precision</div>
  <table class="comparison-table">
    <thead>
      <tr>
        <th>Floating Point Type</th>
        <th class="hopper">Hopper (H200)</th>
        <th class="blackwell">Blackwell (B200)</th>
        <th class="rubin">Rubin (est.)</th>
        <th>Gain vs. Hopper</th>
      </tr>
    </thead>
    <tbody>
      <tr>
        <td><strong>FP32</strong></td>
        <td>67 TFLOPS</td>
        <td>80 TFLOPS</td>
        <td>~160 TFLOPS</td>
        <td class="arrow">→ 2.4×</td>
      </tr>
      <tr>
        <td><strong>FP16/BF16 Tensor</strong></td>
        <td>1.98 PFLOPS</td>
        <td>5.0 PFLOPS</td>
        <td>~15 PFLOPS</td>
        <td class="arrow">→ 7.6×</td>
      </tr>
      <tr>
        <td><strong>FP8 Tensor</strong></td>
        <td>3.96 PFLOPS</td>
        <td>10.0 PFLOPS</td>
        <td>25.0 PFLOPS</td>
        <td class="arrow">→ 6.3×</td>
      </tr>
      <tr>
        <td><strong>FP4 Tensor</strong></td>
        <td>N/A</td>
        <td>20.0 PFLOPS<br/>(2nd-gen Transformer Engine)</td>
        <td>50.0 PFLOPS<br/>(Rubin Ultra: 100 PFLOPS)</td>
        <td class="arrow">→ >50×</td>
      </tr>
    </tbody>
  </table>
  <p class="note">*FP4 enables 2× throughput over FP8 without accuracy loss via adaptive quantization</p>

  <!-- Memory & Bandwidth -->
  <div class="section-title">▲ Memory & Bandwidth</div>
  <table>
    <thead>
      <tr>
        <th>Metric</th>
        <th class="hopper">Hopper (H200)</th>
        <th class="blackwell">Blackwell (B200)</th>
        <th class="rubin">Rubin (est.)</th>
      </tr>
    </thead>
    <tbody>
      <tr>
        <td><strong>Memory Type</strong></td>
        <td>HBM3e</td>
        <td>HBM3e</td>
        <td><span class="highlight">HBM4</span></td>
      </tr>
      <tr>
        <td><strong>Per-GPU Capacity</strong></td>
        <td>141GB</td>
        <td>192GB</td>
        <td>288GB (Ultra: up to 1TB)</td>
      </tr>
      <tr>
        <td><strong>Memory Bandwidth</strong></td>
        <td>4.8 TB/s</td>
        <td>8.0 TB/s</td>
        <td><span class="highlight">13.0 TB/s</span><br/>(Ultra: 52 TB/s aggregate)</td>
      </tr>
      <tr>
        <td><strong>Cache Hierarchy</strong></td>
        <td>50MB L2</td>
        <td>100MB L2</td>
        <td>200MB+ L2 with unified GPU-CPU cache</td>
      </tr>
    </tbody>
  </table>

  <!-- Interconnect & Scale Up -->
  <div class="section-title">▲ Interconnect & Scale-Up Capabilities</div>
  <table>
    <thead>
      <tr>
        <th>Feature</th>
        <th class="hopper">Hopper</th>
        <th class="blackwell">Blackwell</th>
        <th class="rubin">Rubin</th>
      </tr>
    </thead>
    <tbody>
      <tr>
        <td><strong>GPU-GPU Interconnect</strong></td>
        <td>NVLink 4.0<br/>900 GB/s</td>
        <td><span class="highlight">NVLink 5.0<br/>1.8 TB/s</span></td>
        <td>NVLink 6.0/7.0<br/>3.6 TB/s</td>
      </tr>
      <tr>
        <td><strong>CPU-GPU (C2C)</strong></td>
        <td>PCIe Gen5 / NVLink (limited)</td>
        <td>Grace CPU: <br/>900 GB/s C2C (NVLink)</td>
        <td><span class="highlight">Vera: 1.8 TB/s C2C + cache-coherent</span></td>
      </tr>
      <tr>
        <td><strong>On-Die Interconnect</strong></td>
        <td>Unified memory fabric</td>
        <td><span class="highlight">NV-HBI 10 TB/s</span><br/>(die-to-die)</td>
        <td>Next-gen HBI or EMI<br/>(>10 TB/s)</td>
      </tr>
      <tr>
        <td><strong>Scale-Up World Size</strong></td>
        <td>Max: NVL (6× H100)</td>
        <td><span class="highlight">GB200 NVL72 = 72× B200</span><br/>1.4 exaflops (FP4), 120kW/rack</td>
        <td><span class="highlight">NVL144 → NVL576</span><br/>Supports 576 GPU dies (Rubin Ultra)</td>
      </tr>
    </tbody>
  </table>

  <!-- Key Features by Generation -->
  <div class="section-title">▲ Key Technological Innovations</div>

  <div class="flex">
    <div class="flex-item">
      <div style="background: #f0f8ff; padding: 12px; border-radius: 6px;">
        <span class="gen-label hopper">Hopper</span>
        <div class="tech-bullet">• 4th-gen Tensor Cores with structured sparsity</div>
        <div class="tech-bullet">• Transformer Engine (FP8/FP16 dynamic precision)</div>
        <div class="tech-bullet">• Confidential Computing (secure AI workloads)</div>
        <div class="tech-bullet">• MIG: 7 instances per GPU</div>
      </div>
    </div>

    <div class="flex-item">
      <div style="background: #fff0f8; padding: 12px; border-radius: 6px;">
        <span class="gen-label blackwell">Blackwell</span>
        <div class="tech-bullet">• Dual-die design overcomes reticle limit</div>
        <div class="tech-bullet">• <strong>FP4 precision</strong> for 2× inference throughput</div>
        <div class="tech-bullet">• <strong>10 TB/s NV-HBI</strong> die-to-die interconnect</div>
        <div class="tech-bullet">• Integrated <strong>RAS Engine</strong> (predictive maintenance)</div>
        <div class="tech-bullet">• Decompression Engine (6× faster database ops)</div>
      </div>
    </div>

    <div class="flex-item">
      <div style="background: #ffe5e5; padding: 12px; border-radius: 6px;">
        <span class="gen-label rubin">Rubin</span>
        <div class="tech-bullet">• Migrates to <strong>TSMC 3nm</strong> for efficiency</div>
        <div class="tech-bullet">• <strong>HBM4</strong> (13 TB/s bandwidth)</div>
        <div class="tech-bullet">• <strong>Vera ARM CPU</strong>: 88-core, fully integrated</div>
        <div class="tech-bullet">• <strong>Disaggregated compute</strong> (prefill/decode)</div>
        <div class="tech-bullet">• Target: <strong>8× better efficiency</strong> vs. Hopper</div>
      </div>
    </div>
  </div>

  <!-- Performance Trajectory -->
  <div class="section-title">▲ Performance Trajectory (Projected)</div>
  <table>
    <thead>
      <tr>
        <th>Capability</th>
        <th>Hopper Base (H100)</th>
        <th>Blackwell (B200)</th>
        <th>Rubin (2026)</th>
      </tr>
    </thead>
    <tbody>
      <tr>
        <td><strong>Average Compute Gain</strong></td>
        <td>1.0×</td>
        <td>2.5–3.0×</td>
        <td><span class="highlight">3.3× vs. Blackwell (8–9× vs. Hopper)</span></td>
      </tr>
      <tr>
        <td><strong>Memory Bandwidth</strong></td>
        <td>3.35 TB/s (HBM3)</td>
        <td>8.0 TB/s</td>
        <td>13.0 TB/s (HBM4)</td>
      </tr>
      <tr>
        <td><strong>Scale-Up System</strong></td>
        <td>NVL (6 GPUs)</td>
        <td>GB200 NVL72 (72× B200)</td>
        <td>NVL576 (576 GPU dies)</td>
      </tr>
      <tr>
        <td><strong>Power Efficiency</strong></td>
        <td>Baseline</td>
        <td>2.5–3×</td>
        <td><span class="highlight">~8×</span> (vs. Hopper)</td>
      </tr>
    </tbody>
  </table>

  <!-- Footer -->
  <div class="footer">Confidential – NVIDIA Strategic Planning | Q2 2025 Roadmap</div>
</div>

</body>
</html>
```

Drag to resize

OpenAI

gpt-oss-20B (high)

```html
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<title>NVIDIA Data‑Center GPU Evolution – Hopper, Blackwell & Rubin</title>
<style>
  body{
    background:#fafafa;
    font-family:Arial,Helvetica,sans-serif;
    color:#222;
    margin:0;
    padding:0;
  }
  .slide{
    max-width:1280px;
    margin:40px auto;
    padding:20px;
    background:white;
    box-shadow:0 4px 12px rgba(0,0,0,.1);
    border-radius:8px;
  }
  h1{font-size:28px;margin-bottom:4px;}
  h2{font-size:20px;margin-top:12px;margin-bottom:12px;color:#0066cc;}
  .grid{
   display:grid;
   grid-template-columns:1fr 1fr 1fr 1fr;
   gap:12px;
   align-items:center;
  }
  .grid-header{
    font-weight:bold;
    text-align:center;
    background:#0066cc;
    color:white;
    padding:8px 0;
    border-radius:4px;
  }
  .grid-cell{
    text-align:center;
    padding:8px 4px;
    font-size:14px;
  }
  .grid-row:nth-child(even){
    background:#f7f7f7;
  }
  .grid-row:nth-child(odd){
    background:#eaeaea;
  }
  .grid-top{
    font-weight:bold;
    background:#0099ff;
    color:white;
  }
  .highlight{
    background:#ffd700;
    font-weight:bold;
  }
  @media(max-width:900px){
    .grid{grid-template-columns:1fr;}
    .grid-header{display:block;}
  }
</style>
</head>
<body>
<div class="slide">

<h1>NVIDIA Data‑Center GPU Evolution</h1>
<h2>Compute, Memory, Interconnect & Scale – Hopper → Blackwell → Rubin</h2>

<div class="grid">
  <!-- Header -->
  <div class="grid-header">Specification</div>
  <div class="grid-header">Hopper (H100)</div>
  <div class="grid-header">Blackwell (B200)</div>
  <div class="grid-header">Rubin (2026)</div>

  <!-- Compute -->
  <div class="grid-cell">FP32 (Tensor‑core negated)</div>
  <div class="grid-cell highlight">67 TFLOPS</div>
  <div class="grid-cell highlight">80 TFLOPS</div>
  <div class="grid-cell highlight">~160 TFLOPS</div>

  <div class="grid-cell">FP8 Tensor Performance</div>
  <div class="grid-cell">3.96 PFLOPS</div>
  <div class="grid-cell">10 PFLOPS</div>
 div class="grid-cell">25 PFLOPS</div>

  <div class="grid-cell">FP4 Tensor Performance (new)</div>
  <div class="grid-cell">—</div>
  <div class="grid-cell highlight">20 PFLOPS</div>
  <div class="grid-cell highlight">50 PFLOPS</div>

  <!-- Memory -->
  <div class="grid-cell">HBM memory capacity</div>
  <div class="grid-cell">80 GB</div>
  <div class="grid-cell highlight">192 GB</div>
  <div class="grid-cell highlight">288 GB</div>

  <div class="grid-cell">HBM bandwidth</div>
  <div class="grid-cell">3.35 TB/s</div>
  <div class="grid-cell highlight">8 TB/s</div>
  <div class="grid-cell highlight">13 TB/s</div>

  <!-- Interconnect -->
  <div class="grid-cell">NVLink speed</div>
  <div class="grid-cell highlight">900 GB/s</div>
  <div class="grid-cell highlight">1.8 TB/s</div>
  <div class="grid-cell highlight">3.6 TB/s</div>

  <div class="grid-cell">NVLink scale (per node)</div>
  <div class="grid-cell">4×8 TB/s</div>
  <div class="grid-cell highlight">8×8 TB/s (dual die)</div>
  <div class="grid-cell highlight">576 TB/s (NVL576)</div>

  <!-- Scale / Power -->
  <div class="grid-cell">Typical power envelope</div>
  <div class="grid-cell">700 W (SXM)</div>
  <div class="grid-cell highlight">1 000 W</div>
  <div class="grid-cell highlight">600 kW+ (full rack)</div>

  <div class="grid-cell">Transistor count</div>
  <div class="grid-cell">80 B</div>
  <div class="grid-cell highlight">208 B (dual‑die)</div>
  <div class="grid-cell highlight">TBD (3 nm)</div>

  <!-- Software / Misc -->
  <div class="grid-cell">Transformer Engine (precision)</div>
  <div class="grid-cell">FP8/FP16</div>
  <div class="grid-cell highlight">FP4/FP8</div>
  <div class="grid-cell highlight">FP4/FP8</div>

  <div class="grid-cell">AI‑specific FP8/F8 support</div>
  <div class="grid-cell">3.96 PFLOPS (FP8)</div>
  <div class="grid-cell highlight">10 PFLOPS</div>
  <div class="grid-cell highlight">25 PFLOPS</div>
</div>


</div>
</body>
</html>
```

Drag to resize

Kimi

Kimi K2

```html
<!DOCTYPE html>
<html lang="en">
<head>
  <meta charset="UTF-8"/>
  <title>NVIDIA Data-Center GPU Evolution – 2024-2027</title>
  <style>
    @import url('https://fonts.googleapis.com/css2?family=Inter:wght@400;600;700&display=swap');

    :root{
      --hopper:#0071bc;
      --blackwell:#2ca02c;
      --rubin:#d7301f;
      --grey:#e6e6e6;
    }
    body{
      margin:0;
      font-family:Inter, sans-serif;
      background:#f7f7f7;
      display:flex;
      justify-content:center;
      padding:40px 0;
    }
    .slide{
      width:1440px;
      height:810px;
      background:#ffffff;
      box-shadow:0 8px 24px rgba(0,0,0,.15);
      padding:48px;
      box-sizing:border-box;
      display:flex;
      flex-direction:column;
    }
    header h1{
      font-size:36px;
      font-weight:700;
      margin:0;
      line-height:1.2;
    }
    header p{
      font-size:20px;
      margin:0;
      color:#555;
      margin-bottom:24px;
    }

    table{
      width:100%;
      border-spacing:0;
      font-size:12px;
    }
    th{
      background:#fafafa;
      font-weight:700;
      padding:6px 8px;
      position:sticky;
      top:0;
    }
    td{
      padding:6px 8px;
      border-bottom:1px solid var(--grey);
      text-align:center;
      vertical-align:middle;
    }
    tbody tr:hover{background:#fafafa;}
    .unit{font-size:9px;color:#565656;margin-left:2px;}

    .capsule{
      display:inline-block;
      color:#fff;
      padding:4px 8px;
      border-radius:20px;
      font-weight:600;
      font-size:11px;
      min-width:50px;
    }
    .capsule.hopper{background:var(--hopper);}
    .capsule.blackwell{background:var(--blackwell);}
    .capsule.rubin{background:var(--rubin);}

    .chart{
      flex-grow:1;
      margin-top:24px;
      position:relative;
    }
  </style>
</head>
<body>
  <div class="slide">
    <header>
      <h1>The Dawn of Exascale AI:</h1>
      <p>Hopper → Blackwell → Rubin</p>
    </header>

    <table>
      <thead>
        <tr>
          <th style="width:120px;text-align:left">Key Metrics</th>
          <th>H100 SXM5<br><span class="capsule hopper">Hopper</span></th>
          <th>H200 SXM5<br><span class="capsule hopper">Hopper +</span></th>
          <th>B200<br><span class="capsule blackwell">Blackwell</span></th>
          <th>GB200 NVL72<br><span class="capsule blackwell">Blackwell Sys</span></th>
          <th>Rubin (2026)<br><span class="capsule rubin">Rubin</span></th>
        </tr>
      </thead>
      <tbody>
        <tr>
          <td style="text-align:left;font-weight:600;">EVOLVE<br>Compute (FP8)<br><span class="unit">PFLOPS</span></td>
          <td>3.96</td>
          <td>3.96</td>
          <td>10</td>
          <td>720 <span style="color:#555">(rack)</span></td>
          <td>25</td>
        </tr>
        <tr>
          <td style="text-align:left;font-weight:600;">EVOLVE<br>Compute (FP4)<br><span class="unit">PFLOPS</span></td>
          <td>—</td>
          <td>—</td>
          <td>20</td>
          <td>1,440 <span style="color:#555">(rack)</span></td>
          <td>50</td>
        </tr>
        <tr>
          <td style="text-align:left;font-weight:600;">ACCELERATE<br>HBM Capacity<br><span class="unit">GB</span></td>
          <td>80</td>
          <td>141</td>
          <td>192</td>
          <td>13,400 <span style="color:#555">(HBM3e total)</span></td>
          <td>288 (HBM4)</td>
        </tr>
        <tr>
          <td style="text-align:left;font-weight:600;">ACCELERATE<br>Memory Bandwidth<br><span class="unit">TB/s</span></td>
          <td>3.35</td>
          <td>4.8</td>
          <td>8</td>
          <td>576 <span style="color:#555">(rack)</span></td>
          <td>13</td>
        </tr>
        <tr>
          <td style="text-align:left;font-weight:600;">SCALE UP<br>NVLink Speed<br><span class="unit">GB/s</span></td>
          <td>900</td>
          <td>900</td>
          <td>1,800</td>
          <td>130,000 <span style="color:#555">(rack aggregate)</span></td>
          <td>3,600 (Gen 7)</td>
        </tr>
        <tr>
          <td style="text-align:left;font-weight:600;">SCALE UP<br>World-Size Scale<br><span class="unit"># GPUs / Dies</span></td>
          <td>—</td>
          <td>—</td>
          <td>2-die package<br>208 B trans</td>
          <td>270 GPUs<br>1.4 EFLOPS super-node</td>
          <td>144 GPUs NVL144<br>576 dies Ultra</td>
        </tr>
      </tbody>
    </table>

    <div class="chart">
      <canvas id="opsChart" height="220"></canvas><br>
      <canvas id="memChart" height="220"></canvas>
    </div>
  </div>

  <!-- Chart.js CDN -->
  <script src="https://cdn.jsdelivr.net/npm/chart.js"></script>
  <script>
  const lbl = ['H100','H200','B200','Rubin'];

  new Chart(document.getElementById('opsChart'),{
    type:'bar',
    data:{
      labels:lbl,
      datasets:[
        {label:'FP8 PFLOPS',data:[3.96,3.96,10,25],backgroundColor:['#0071bc','#0071bc','#2ca02c','#d7301f'],barPercentage:0.8},
        {label:'FP4 PFLOPS',data:[0,0,20,50],backgroundColor:['rgba(0,113,188,.5)','rgba(0,113,188,.5)','#2ca02c','#d7301f'],barPercentage:0.8}
      ]
    },
    options:{
      plugins:{legend:{position:'top'},title:{display:true,text:'Compute Density (Single GPU / Chiplet)',font:{size:14},padding:0}},
      responsive:false,
      maintainAspectRatio:false,
      scales:{y:{beginAtZero:true,title:{display:true,text:'PFLOPS'}}}
    }
  });

  new Chart(document.getElementById('memChart'),{
    type:'bar',
    data:{
      labels:lbl,
      datasets:[
        {label:'HBM Capacity (GB)',data:[80,141,192,288],backgroundColor:['#0071bc','#0071bc','#2ca02c','#d7301f'],barPercentage:0.8},
        {label:'Memory Bandwidth (TB/s)',data:[3.35,4.8,8,13],backgroundColor:['rgba(0,113,188,.5)','rgba(0,113,188,.5)','#2ca02c','#d7301f'],barPercentage:0.8}
      ]
    },
    options:{
      plugins:{legend:{position:'top'},title:{display:true,text:'Memory Vault',font:{size:14}}},
      responsive:false,
      maintainAspectRatio:false,
      scales:{y:{beginAtZero:true}}
    }
  });
  </script>
</body>
</html>
```

Drag to resize

OpenAI

GPT-5 (high)

```html
<!DOCTYPE html>
<html lang="en">
<head>
  <meta charset="utf-8" />
  <meta name="viewport" content="width=device-width, initial-scale=1" />
  <title>Nvidia Data Center GPU Generations: Hopper vs Blackwell vs Rubin</title>
  <link href="https://fonts.googleapis.com/css2?family=Inter:wght@400;500;600;700;800&display=swap" rel="stylesheet">
  <style>
    :root{
      --bg:#f4f6f8;
      --card:#ffffff;
      --ink:#0b1f33;
      --muted:#5a6b7d;
      --accent:#0f62fe;
      --accent-2:#12a36a;
      --accent-3:#e67e22;
      --border:#e6ebf0;
      --win:#0a7f4a;
      --warn:#c23d2a;
      --info:#5b6bff;
    }
    * { box-sizing: border-box; }
    html, body { height: 100%; }
    body{
      margin:0;
      background: var(--bg);
      font-family: Inter, system-ui, -apple-system, Segoe UI, Roboto, "Helvetica Neue", Arial, "Apple Color Emoji","Segoe UI Emoji", "Segoe UI Symbol", sans-serif;
      color: var(--ink);
    }

    /* Slide canvas 16:9 */
    .slide{
      width: 1600px;
      height: 900px;
      background: var(--card);
      margin: 24px auto;
      border-radius: 16px;
      box-shadow: 0 10px 30px rgba(16,24,40,0.16);
      display:flex;
      flex-direction: column;
      overflow: hidden;
    }

    /* Header */
    .hdr{
      padding: 24px 28px 16px 28px;
      border-bottom: 1px solid var(--border);
      display:flex;
      align-items: center;
      gap:16px;
    }
    .brand-strip{
      width: 6px; height: 44px; border-radius: 3px; background: linear-gradient(180deg, var(--accent) 0%, #26c6da 50%, #12a36a 100%);
    }
    .titles{ flex: 1; }
    h1{
      font-size: 28px; line-height: 34px; margin:0; font-weight: 800; letter-spacing: -0.02em;
    }
    .subtitle{
      margin-top:6px; color: var(--muted); font-weight:500; font-size: 14.5px;
    }
    .meta{
      display:flex; gap:16px; align-items:center; color: var(--muted); font-size: 12.5px;
    }
    .pill{
      display:inline-flex; align-items:center; gap:6px;
      padding:6px 10px; border-radius: 999px; background: #f1f5ff; color: #263e87; font-weight:600; font-size:12.5px;
    }
    .legend{
      display:flex; gap:12px; align-items:center;
    }
    .legend span{
      display:inline-flex; align-items:center; gap:6px; font-size: 12.5px; color: var(--muted);
    }
    .dot{ width:10px; height:10px; border-radius:50%; display:inline-block; }
    .d1{ background: var(--accent); }
    .d2{ background: var(--accent-2); }
    .d3{ background: var(--accent-3); }

    /* Grid */
    .content{
      padding: 18px 24px 18px 24px;
      display:grid;
      grid-template-columns: 1.3fr 1.3fr 1.3fr;
      gap: 14px;
    }
    .card{
      background: #fff;
      border:1px solid var(--border);
      border-radius: 12px;
      padding: 14px;
      display:flex; flex-direction:column; gap:10px;
      min-height: 486px;
    }
    .card.hopper .bar{ background: var(--accent); }
    .card.blackwell .bar{ background: var(--accent-2); }
    .card.rubin .bar{ background: var(--accent-3); }

    .card-head{
      display:flex; align-items:center; justify-content:space-between; gap:8px;
      padding-bottom: 8px; border-bottom: 1px dashed var(--border);
    }
    .stack{
      display:flex; flex-direction:column;
    }
    .eyebrow{
      text-transform: uppercase; letter-spacing: .08em; font-weight: 800; font-size: 11px; color: var(--muted);
    }
    .h{
      display:flex; align-items:baseline; gap:10px;
    }
    .h b{ font-size: 22px; letter-spacing:-0.01em; }
    .h small{ color: var(--muted); font-weight:600; }
    .bar{
      height: 6px; border-radius: 4px; width: 58px;
    }
    .kv{
      display:grid; grid-template-columns: 1fr 1fr; gap:10px; align-items:stretch;
    }
    .kv .k{
      border: 1px solid var(--border);
      border-radius: 10px; padding: 10px; display:flex; flex-direction:column; gap:6px;
      min-height: 74px;
    }
    .k .k-title{ font-size:11px; color: var(--muted); font-weight:700; text-transform:uppercase; letter-spacing:.06em; }
    .k .k-val{
      display:flex; align-items:baseline; gap:8px; font-weight:800;
    }
    .k .k-val b{ font-size:18px; }
    .k .k-val small{ color:var(--muted); font-weight:600; }
    .delta{ font-size: 11.5px; font-weight: 700; }
    .up{ color: var(--win); }
    .more{ color: var(--info); }
    .about{
      display:grid; grid-template-columns: 1fr 1fr; gap:10px;
    }
    .about .row{
      border: 1px dashed var(--border);
      border-radius: 10px; padding: 10px; display:flex; flex-direction:column; gap:6px;
    }
    .row .label{ font-size:11px; color:var(--muted); text-transform:uppercase; font-weight:700; letter-spacing:.06em; }
    .row .value{ font-size:14px; font-weight:700; }
    .muted{ color: var(--muted); font-weight:600; }

    /* Cross-gen strip */
    .strip{
      padding: 6px 24px 0 24px;
      display:grid; grid-template-columns: 2fr 1fr; gap: 14px;
    }
    .tbl{
      border:1px solid var(--border); border-radius:12px; overflow:hidden; background:#fff;
    }
    table{
      width:100%; border-collapse: collapse; font-size:13.5px;
    }
    thead th{
      background:#f8fafc; text-align:left; padding:10px 10px; border-bottom:1px solid var(--border); font-weight:800; color:var(--muted); text-transform:uppercase; letter-spacing:.06em; font-size:11.5px;
    }
    tbody td{
      padding:10px; border-bottom:1px solid #f2f4f7;
    }
    tbody tr:last-child td{ border-bottom:none; }
    .right{ text-align:right; }
    .center{ text-align:center; }
    .gain{ font-weight:800; }
    .gain.up{ color: var(--win); }
    .gain.eq{ color: #64748b; }
    .gain.warn{ color: var(--warn); }

    .notes{
      background:#fff; border:1px solid var(--border); border-radius:12px; padding:12px;
      display:flex; flex-direction:column; gap:8px;
    }
    .notes b{ font-size:12px; text-transform:uppercase; letter-spacing:.06em; color:var(--muted); }
    .notes ul{ margin:0; padding-left: 16px; }
    .notes li{ margin: 4px 0; color:#334155; font-size:13px; }
    .tag{
      display:inline-block; padding:2px 6px; border-radius:6px; background:#f1f5ff; color:#263e87; font-size:11px; font-weight:700;
    }

    /* Footer */
    .ftr{
      padding: 10px 24px 16px 24px; display:flex; align-items:center; justify-content:space-between; gap:12px; color:var(--muted); font-size:12px;
      border-top:1px solid var(--border);
      margin-top: 8px;
    }
    .ftr .caps{ text-transform: uppercase; letter-spacing:.08em; font-weight:800; }
    .sp{
      display:flex; align-items:center; gap:10px;
    }

    /* Tiny badges */
    .badge{
      display:inline-flex; align-items:center; gap:6px; background:#eef9f4; color:#0a7f4a; padding:4px 8px; border-radius:999px; font-size:11.5px; font-weight:800;
      border: 1px solid #d5f1e5;
    }
    .badge .sq{ width:8px; height:8px; border-radius:2px; background:#0a7f4a; }
    .badge.warn{ background:#fff5f3; color:#c23d2a; border-color:#ffd9d2; }
    .badge.info{ background:#eef2ff; color:#4f46e5; border-color:#dfe4ff; }
  </style>
</head>
<body>
  <div class="slide" role="document" aria-label="Nvidia Data Center GPU Comparison">
    <header class="hdr">
      <div class="brand-strip" aria-hidden="true"></div>
      <div class="titles">
        <h1>Nvidia Data Center GPU Generations: Hopper vs Blackwell vs Rubin</h1>
        <div class="subtitle">Compute, HBM capacity, memory bandwidth, interconnect, and system scale (information-dense summary)</div>
        <div class="meta" style="margin-top:8px;">
          <span class="pill">Focus: FP8/FP4 compute, HBM, NVLink/NV‑HBI, rack‑scale</span>
          <div class="legend">
            <span><i class="dot d1"></i> Hopper (H100/H200)</span>
            <span><i class="dot d2"></i> Blackwell (B200/GB200)</span>
            <span><i class="dot d3"></i> Rubin (2026)</span>
          </div>
        </div>
      </div>
    </header>

    <section class="content" aria-label="Generation Cards">
      <!-- Hopper -->
      <article class="card hopper" aria-label="Hopper">
        <div class="card-head">
          <div class="stack">
            <div class="eyebrow">Hopper Architecture</div>
            <div class="h">
              <b>H100 / H200</b>
              <small>TSMC 4N • ~80B transistors</small>
            </div>
          </div>
          <div class="bar" aria-hidden="true"></div>
        </div>

        <div class="kv">
          <div class="k">
            <div class="k-title">Peak Compute</div>
            <div class="k-val">
              <b>FP32 67 TFLOPS</b><small>(H100/H200)</small>
            </div>
            <div class="k-val">
              <b>FP8 3.96 PFLOPS</b><small>(Transformer Engine)</small>
            </div>
            <div class="k-val">
              <b>FP4 —</b><small>n/a</small>
            </div>
          </div>
          <div class="k">
            <div class="k-title">HBM & Bandwidth</div>
            <div class="k-val"><b>141 GB HBM3e</b><small>(H200 SXM5)</small></div>
            <div class="k-val"><b>4.8 TB/s</b><small>mem BW (H200)</small></div>
          </div>
          <div class="k">
            <div class="k-title">Interconnect</div>
            <div class="k-val"><b>NVLink 900 GB/s</b><small>(SXM5)</small></div>
            <div class="k-val"><b>PCIe/NVL</b><small>H100/H200 NVL variants</small></div>
          </div>
          <div class="k">
            <div class="k-title">Thermals</div>
            <div class="k-val"><b>700 W</b><small>H100/H200 SXM5</small></div>
            <div class="k-val"><b>350–600 W</b><small>PCIe/NVL</small></div>
          </div>
        </div>

        <div class="about">
          <div class="row">
            <div class="label">Scale-up system</div>
            <div class="value">
              H200 NVL (dual-GPU); multi-GPU via NVLink (SXM5)
            </div>
            <div class="muted">System-scale world size not specified in source</div>
          </div>
          <div class="row">
            <div class="label">Notable</div>
            <div class="value">Transformer Engine (FP8/FP16), 4th‑gen Tensor Cores, Confidential Computing</div>
            <div class="muted">MIG up to 7 instances</div>
          </div>
        </div>
      </article>

      <!-- Blackwell -->
      <article class="card blackwell" aria-label="Blackwell">
        <div class="card-head">
          <div class="stack">
            <div class="eyebrow">Blackwell Architecture</div>
            <div class="h">
              <b>B200 / GB200</b>
              <small>TSMC 4NP • 208B transistors</small>
            </div>
          </div>
          <div class="bar" aria-hidden="true"></div>
        </div>

        <div class="kv">
          <div class="k">
            <div class="k-title">Peak Compute</div>
            <div class="k-val"><b>FP32 80 TFLOPS</b><small>(per GPU)</small></div>
            <div class="k-val"><b>FP8 10 PFLOPS</b><small>(per GPU)</small></div>
            <div class="k-val"><b>FP4 20 PFLOPS</b><small>(per GPU)</small></div>
            <div class="delta up">+2.5x FP8 vs H200</div>
          </div>
          <div class="k">
            <div class="k-title">HBM & Bandwidth</div>
            <div class="k-val"><b>192 GB HBM3e</b><small>(per GPU)</small></div>
            <div class="k-val"><b>8 TB/s</b><small>mem BW</small></div>
            <div class="delta up">+1.67x BW vs H200</div>
          </div>
          <div class="k">
            <div class="k-title">Interconnect</div>
            <div class="k-val"><b>NVLink 1.8 TB/s</b><small>(per GPU)</small></div>
            <div class="k-val"><b>NV‑HBI 10 TB/s</b><small>dual‑die link</small></div>
          </div>
          <div class="k">
            <div class="k-title">Thermals</div>
            <div class="k-val"><b>1000 W</b><small>B200 (per GPU)</small></div>
            <div class="k-val"><b>~1500 W</b><small>GB200 Superchip</small></div>
          </div>
        </div>

        <div class="about">
          <div class="row">
            <div class="label">Scale-up system</div>
            <div class="value">
              <span class="badge"><span class="sq"></span> GB200 NVL72</span> — 72× B200 + 36× Grace
            </div>
            <div class="muted">Rack: ~120 kW • 13.4 TB HBM3e • 576 TB/s mem BW • NVLink ~130 TB/s total</div>
          </div>
          <div class="row">
            <div class="label">Notable</div>
            <div class="value">Dual‑die GPUs, 2nd‑gen Transformer Engine incl. FP4, RAS & Decompression Engines</div>
            <div class="muted">Up to 25× better AI inference energy efficiency vs Hopper (source)</div>
          </div>
        </div>
      </article>

      <!-- Rubin -->
      <article class="card rubin" aria-label="Rubin">
        <div class="card-head">
          <div class="stack">
            <div class="eyebrow">Rubin Architecture (Announced)</div>
            <div class="h">
              <b>Rubin / Rubin Ultra</b>
              <small>TSMC 3nm • HBM4/HBM4e</small>
            </div>
          </div>
          <div class="bar" aria-hidden="true"></div>
        </div>

        <div class="kv">
          <div class="k">
            <div class="k-title">Peak Compute</div>
            <div class="k-val"><b>FP8 25 PFLOPS</b><small>(per GPU)</small></div>
            <div class="k-val"><b>FP4 50 PFLOPS</b><small>(per GPU)</small></div>
            <div class="delta up">~3.3× vs Blackwell (source)</div>
          </div>
          <div class="k">
            <div class="k-title">HBM & Bandwidth</div>
            <div class="k-val"><b>288 GB HBM4</b><small>(per GPU)</small></div>
            <div class="k-val"><b>13 TB/s</b><small>mem BW</small></div>
            <div class="delta up">+62.5% BW vs B200</div>
          </div>
          <div class="k">
            <div class="k-title">Interconnect</div>
            <div class="k-val"><b>NVLink 3.6 TB/s</b><small>(7th gen)</small></div>
            <div class="k-val"><b>CPU link 1.8 TB/s</b><small>Vera → GPU</small></div>
          </div>
          <div class="k">
            <div class="k-title">Thermals</div>
            <div class="k-val"><b>TBD</b><small>platform dependent</small></div>
            <div class="k-val"><b>600 kW+</b><small>per rack (Ultra)</small></div>
          </div>
        </div>

        <div class="about">
          <div class="row">
            <div class="label">Scale-up system</div>
            <div class="value">
              <span class="badge info"><span class="sq" style="background:#4f46e5"></span> NVL144</span> (144 GPUs) • <span class="badge info"><span class="sq" style="background:#4f46e5"></span> NVL576</span> (576 dies, Ultra)
            </div>
            <div class="muted">System NVLink up to 1.5 PB/s (Rubin Ultra, source)</div>
          </div>
          <div class="row">
            <div class="label">Notable</div>
            <div class="value">Vera ARM CPU integration, HBM4, disaggregated prefill/decode variants (Rubin CPX)</div>
            <div class="muted">Annual cadence: H100 → B200 → Rubin</div>
          </div>
        </div>
      </article>
    </section>

    <section class="strip" aria-label="Cross-generation deltas and notes">
      <div class="tbl" role="region" aria-label="Cross-generation Metrics">
        <table>
          <thead>
            <tr>
              <th>Metric (per GPU unless noted)</th>
              <th>Hopper (H200)</th>
              <th>Blackwell (B200)</th>
              <th>Rubin (2026)</th>
              <th class="center">Δ vs prior</th>
            </tr>
          </thead>
          <tbody>
            <tr>
              <td>FP8 Tensor</td>
              <td class="right">3.96 PF</td>
              <td class="right">10 PF</td>
              <td class="right">25 PF</td>
              <td class="center"><span class="gain up">+2.5×</span> • <span class="gain up">+2.5×</span></td>
            </tr>
            <tr>
              <td>FP4 Tensor</td>
              <td class="right">—</td>
              <td class="right">20 PF</td>
              <td class="right">50 PF</td>
              <td class="center"><span class="gain up">new</span> • <span class="gain up">+2.5×</span></td>
            </tr>
            <tr>
              <td>FP32</td>
              <td class="right">67 TF</td>
              <td class="right">80 TF</td>
              <td class="right">~160 TF</td>
              <td class="center"><span class="gain up">+1.2×</span> • <span class="gain up">~2×</span></td>
            </tr>
            <tr>
              <td>HBM capacity</td>
              <td class="right">141 GB</td>
              <td class="right">192 GB</td>
              <td class="right">288 GB</td>
              <td class="center"><span class="gain up">+1.36×</span> • <span class="gain up">+1.5×</span></td>
            </tr>
            <tr>
              <td>Memory bandwidth</td>
              <td class="right">4.8 TB/s</td>
              <td class="right">8 TB/s</td>
              <td class="right">13 TB/s</td>
              <td class="center"><span class="gain up">+1.67×</span> • <span class="gain up">+1.63×</span></td>
            </tr>
            <tr>
              <td>NVLink (per GPU)</td>
              <td class="right">900 GB/s</td>
              <td class="right">1.8 TB/s</td>
              <td class="right">3.6 TB/s</td>
              <td class="center"><span class="gain up">2×</span> • <span class="gain up">2×</span></td>
            </tr>
            <tr>
              <td>Scale-up “world size” (flagship)</td>
              <td class="right">H200 NVL (2 GPUs)</td>
              <td class="right">GB200 NVL72 (72 GPUs)</td>
              <td class="right">NVL144 / NVL576 dies</td>
              <td class="center"><span class="gain up">→ rack-scale</span> • <span class="gain up">2× GPUs</span></td>
            </tr>
            <tr>
              <td>Energy efficiency (inference)</td>
              <td class="right">baseline</td>
              <td class="right">up to 25×</td>
              <td class="right">~8× vs H100 (source)</td>
              <td class="center"><span class="gain up">major</span> • <span class="gain up">further ↑</span></td>
            </tr>
          </tbody>
        </table>
      </div>

      <aside class="notes">
        <b>Executive takeaways</b>
        <ul>
          <li>Blackwell’s dual‑die with NV‑HBI 10 TB/s breaks reticle limits; FP4 support doubles inference throughput at similar accuracy.</li>
          <li>HBM ramps 141 → 192 → 288 GB, with bandwidth 4.8 → 8 → 13 TB/s, sustaining training/inference of larger models.</li>
          <li>NVLink doubles each gen (0.9 → 1.8 → 3.6 TB/s per GPU), enabling larger, tighter GPU fabrics; rack‑scale GB200 NVL72 today, Rubin NVL144/576 next.</li>
          <li>Rubin’s 3 nm + HBM4 + Vera CPU targets agentic/reasoning workloads with disaggregated prefill vs decode paths.</li>
        </ul>
        <div style="margin-top:6px;">
          <span class="tag">Data per user‑provided Nvidia materials; some Rubin figures are announced/roadmap</span>
        </div>
      </aside>
    </section>

    <footer class="ftr">
      <div class="sp">
        <span class="caps">Key deltas</span>
        <span>Compute: FP8 +2.5×/gen • FP4 introduced in Blackwell</span>
        <span>HBM: +1.4–1.6×/gen</span>
        <span>NVLink: 2×/gen</span>
      </div>
      <div class="sp">
        <span>Pricing (est.): B200 $45–50k • GB200 $60–70k • NVL72 ~ $3M</span>
      </div>
    </footer>
  </div>
</body>
</html>
```

Drag to resize

xAI

Grok 4

```html
<!DOCTYPE html>
<html lang="en">
<head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>Nvidia GPU Generations: Hopper, Blackwell, Rubin</title>
    <style>
        body {
            font-family: Arial, sans-serif;
            background-color: #f4f4f4;
            color: #333;
            margin: 0;
            padding: 20px;
            display: flex;
            justify-content: center;
            align-items: center;
            min-height: 100vh;
        }
        .slide-container {
            background-color: white;
            border: 1px solid #ddd;
            box-shadow: 0 0 10px rgba(0,0,0,0.1);
            padding: 40px;
            max-width: 1200px;
            width: 100%;
        }
        h1 {
            text-align: center;
            color: #005b96;
            margin-bottom: 20px;
        }
        h2 {
            color: #005b96;
            margin-top: 30px;
            margin-bottom: 10px;
        }
        table {
            width: 100%;
            border-collapse: collapse;
            margin-bottom: 20px;
        }
        th, td {
            border: 1px solid #ddd;
            padding: 8px;
            text-align: left;
        }
        th {
            background-color: #005b96;
            color: white;
            text-align: center;
        }
        .highlight {
            background-color: #e6f3ff;
        }
        .difference-section {
            margin-top: 20px;
        }
        ul {
            list-style-type: disc;
            padding-left: 20px;
        }
        .footer {
            text-align: center;
            font-size: 12px;
            color: #666;
            margin-top: 30px;
        }
    </style>
</head>
<body>
    <div class="slide-container">
        <h1>Nvidia Data Center GPU Generations: Hopper, Blackwell, Rubin</h1>
        <p style="text-align: center; font-style: italic;">McKinsey-Style Comparison: Focus on Compute, HBM, Scale-Up World Size, Interconnect, Memory Bandwidth</p>
        
        <h2>Key Specifications Comparison (Representative Models)</h2>
        <table>
            <thead>
                <tr>
                    <th>Metric</th>
                    <th>Hopper (H100/H200)</th>
                    <th>Blackwell (B200/GB200)</th>
                    <th>Rubin (2026/Rubin Ultra 2027)</th>
                    <th>Blackwell vs. Hopper Improvement</th>
                    <th>Rubin vs. Blackwell Improvement</th>
                </tr>
            </thead>
            <tbody>
                <tr class="highlight">
                    <td><strong>Compute: FP32 TFLOPS</strong></td>
                    <td>67 (H100) / 67 (H200)</td>
                    <td>80 (B200) / 160 (GB200)</td>
                    <td>~160 / ~320 (Ultra est.)</td>
                    <td>1.2x</td>
                    <td>2x</td>
                </tr>
                <tr>
                    <td><strong>Compute: FP16/BF16 Tensor PFLOPS</strong></td>
                    <td>1.98 (H100) / 1.98 (H200)</td>
                    <td>5 (B200) / 10 (GB200)</td>
                    <td>~15 / ~30 (Ultra est.)</td>
                    <td>2.5x</td>
                    <td>3x</td>
                </tr>
                <tr class="highlight">
                    <td><strong>Compute: FP8 Tensor PFLOPS</strong></td>
                    <td>3.96 (H100) / 3.96 (H200)</td>
                    <td>10 (B200) / 20 (GB200)</td>
                    <td>25 / 50 (Ultra)</td>
                    <td>2.5x</td>
                    <td>2.5x</td>
                </tr>
                <tr>
                    <td><strong>Compute: FP4 Tensor PFLOPS</strong></td>
                    <td>N/A</td>
                    <td>20 (B200) / 40 (GB200)</td>
                    <td>50 / 100 (Ultra)</td>
                    <td>New (2x over FP8)</td>
                    <td>2.5x</td>
                </tr>
                <tr class="highlight">
                    <td><strong>HBM Memory Capacity</strong></td>
                    <td>80GB HBM3 (H100) / 141GB HBM3e (H200)</td>
                    <td>192GB HBM3e (B200) / 384GB (GB200)</td>
                    <td>288GB HBM4 / 1TB HBM4e (Ultra)</td>
                    <td>2.4x (vs H100)</td>
                    <td>1.5x / 5.2x (Ultra)</td>
                </tr>
                <tr>
                    <td><strong>Memory Bandwidth</strong></td>
                    <td>3.35 TB/s (H100) / 4.8 TB/s (H200)</td>
                    <td>8 TB/s (B200) / 16 TB/s (GB200)</td>
                    <td>13 TB/s / 52 TB/s (Ultra)</td>
                    <td>2.4x (vs H100)</td>
                    <td>1.6x / 3.3x (Ultra)</td>
                </tr>
                <tr class="highlight">
                    <td><strong>Scale-Up World Size</strong> (Max GPUs/System)</td>
                    <td>Up to 256 GPUs (DGX H100) / MIG 7 instances</td>
                    <td>NVL72 (72 GPUs) / 1.4 exaflops inference</td>
                    <td>NVL144 (144 GPUs) / NVL576 (576 dies, Ultra)</td>
                    <td>~2-3x scale (rack-level)</td>
                    <td>2x / 4x (Ultra)</td>
                </tr>
                <tr>
                    <td><strong>Interconnect: NVLink Speed</strong></td>
                    <td>900 GB/s (H100/H200)</td>
                    <td>1.8 TB/s (B200) / 3.6 TB/s (GB200)</td>
                    <td>3.6 TB/s / 1.5 PB/s system (Ultra)</td>
                    <td>2x</td>
                    <td>2x / 400x system (Ultra)</td>
                </tr>
                <tr class="highlight">
                    <td><strong>Transistors / Process</strong></td>
                    <td>80B / TSMC 4N</td>
                    <td>208B (dual-die) / TSMC 4NP</td>
                    <td>TBD (single/4x chiplets) / TSMC 3nm</td>
                    <td>2.6x transistors</td>
                    <td>Process shrink + chiplets</td>
                </tr>
                <tr>
                    <td><strong>Power Efficiency</strong></td>
                    <td>Baseline (700W TDP)</td>
                    <td>2.5x better (1000W TDP)</td>
                    <td>~8x better (TBD)</td>
                    <td>2.5x</td>
                    <td>3.2x</td>
                </tr>
            </tbody>
        </table>
        
        <h2>Important Differences and Architectural Evolutions</h2>
        <div class="difference-section">
            <ul>
                <li><strong>Hopper to Blackwell:</strong> Dual-die design (208B transistors vs. 80B) enables 2.5x compute leap; Introduces FP4 precision for 2x inference efficiency; Memory bandwidth doubles to 8 TB/s; NVLink to 1.8 TB/s supports larger scale-up (e.g., NVL72 with 72 GPUs).</li>
                <li><strong>Blackwell to Rubin:</strong> 3nm process + HBM4 yields 3.3x performance; Integrated Vera ARM CPUs (88 cores); Ultra variant with 4x chiplets scales to 576 dies/rack; Bandwidth to 13 TB/s, interconnect to 3.6 TB/s per GPU (1.5 PB/s system).</li>
                <li><strong>Overall Trends:</strong> Exponential scaling: 900x perf gain Hopper to Rubin; Annual cadence accelerates innovation; Focus shifts from training (Hopper) to inference/trillion-param models (Blackwell) to AI reasoning/agents (Rubin).</li>
            </ul>
        </div>
        
        <div class="footer">
            Data sourced from Nvidia announcements. Estimates based on provided roadmap. For strategic AI infrastructure planning.
        </div>
    </div>
</body>
</html>
```

Drag to resize

DeepSeek

DeepSeek V3 0324

Here's a McKinsey-style slide with the key specifications and differences between Nvidia's Hopper, Blackwell, and Rubin GPU generations:

```html
<!DOCTYPE html>
<html>
<head>
    <title>Nvidia Data Center GPU Generations Comparison</title>
    <link href="https://fonts.googleapis.com/css2?family=Roboto:wght@300;400;500;700&display=swap" rel="stylesheet">
    <style>
        body {
            font-family: 'Roboto', sans-serif;
            margin: 0;
            padding: 20px;
            color: #333;
            background-color: #f5f7fa;
        }
        .slide-container {
            width: 900px;
            background: white;
            border-radius: 8px;
            box-shadow: 0 4px 12px rgba(0,0,0,0.1);
            padding: 25px;
            margin: 0 auto;
        }
        h1 {
            color: #2a5885;
            margin-top: 0;
            font-size: 28px;
            border-bottom: 2px solid #e0e0e0;
            padding-bottom: 10px;
        }
        h2 {
            color: #357ab8;
            font-size: 20px;
            margin-top: 30px;
        }
        .tech-grid {
            display: grid;
            grid-template-columns: repeat(3, 1fr);
            gap: 15px;
            margin-top: 20px;
        }
        .tech-card {
            background: #f9f9f9;
            border-radius: 6px;
            padding: 15px;
            border-left: 4px solid #357ab8;
        }
        .tech-card h3 {
            margin-top: 0;
            color: #2a5885;
            font-size: 16px;
        }
        table {
            width: 100%;
            border-collapse: collapse;
            margin: 20px 0;
            font-size: 14px;
        }
        th {
            background-color: #357ab8;
            color: white;
            text-align: left;
            padding: 10px;
        }
        td {
            border: 1px solid #ddd;
            padding: 8px 10px;
        }
        tr:nth-child(even) {
            background-color: #f2f2f2;
        }
        .highlight {
            font-weight: 500;
            color: #d32f2f;
        }
        .metric-title {
            font-weight: 700;
        }
        .comparison-table {
            width: 100%;
            border-collapse: collapse;
        }
        .comparison-table th {
            background-color: #2a5885;
        }
        .comparison-table td:first-child {
            font-weight: 500;
        }
        .footer {
            margin-top: 30px;
            font-size: 12px;
            color: #777;
            text-align: center;
        }
        .banner {
            background: linear-gradient(90deg, #1a3f8a, #357ab8);
            color: white;
            padding: 8px 15px;
            border-radius: 4px;
            font-weight: 500;
            font-size: 14px;
            margin-bottom: 20px;
        }
        .chip-diagram {
            width: 100%;
            height: 120px;
            background: #f0f0f0;
            border-radius: 4px;
            margin: 15px 0;
            display: flex;
            align-items: center;
            justify-content: space-around;
            padding: 5px;
        }
        .generation {
            text-align: center;
            font-weight: 700;
        }
        .chart-container {
            height: 200px;
        }
    </style>
</head>
<body>
    <div class="slide-container">
        <div class="banner">NVIDIA DATA CENTER GPU COMPARISON: HOPPER → BLACKWELL → RUBIN</div>
        <h1>Generational Leap in AI Accelerator Performance (2022-2026)</h1>
        
        <div class="tech-grid">
            <div class="tech-card">
                <h3>Hopper H100/H200</h3>
                <p><strong>Process:</strong> TSMC 4N</p>
                <p><strong>Transistors:</strong> 80B</p>
                <p><strong>HBM:</strong> HBM3/3e</p>
                <p><strong>NVLink:</strong> 900GB/s</p>
                <p><strong>FP8 TFLOPS:</strong> 3.96P</p>
            </div>
            <div class="tech-card">
                <h3>Blackwell B200</h3>
                <p><strong>Process:</strong> TSMC 4NP</p>
                <p><strong>Transistors:</strong> 208B (2× die)</p>
                <p><strong>HBM:</strong> HBM3e</p>
                <p><strong>NVLink:</strong> 1.8TB/s</p>
                <p><strong>FP8 TFLOPS:</strong> 10P</p>
            </div>
            <div class="tech-card">
                <h3>Rubin (2026)</h3>
                <p><strong>Process:</strong> TSMC 3nm</p>
                <p><strong>HBM:</strong> HBM4</p>
                <p><strong>NVLink:</strong> 3.6TB/s</p>
                <p><strong>FP8 TFLOPS:</strong> 25P</p>
                <p><strong>+ ARM CPU:</strong> Vera 88-core</p>
            </div>
        </div>

        <div class="chip-diagram">
            <div class="generation">
                Hopper<br>
                <svg width="80" height="60" viewBox="0 0 80 60">
                    <rect x="10" y="10" width="60" height="40" fill="#357ab8" rx="3"/>
                </svg>
                (Single Die)
            </div>
            <div class="generation">
                Blackwell<br>
                <svg width="160" height="60" viewBox="0 0 160 60">
                    <rect x="20" y="10" width="60" height="40" fill="#2a5885" rx="3"/>
                    <rect x="80" y="10" width="60" height="40" fill="#2a5885" rx="3"/>
                    <line x1="80" y1="30" x2="80" y2="40" stroke="#d32f2f" stroke-width="3"/>
                </svg>
                (Dual Die)
            </div>
            <div class="generation">
                Rubin<br>
                <svg width="200" height="60" viewBox="0 0 200 60">
                    <rect x="20" y="10" width="30" height="40" fill="#1a3f8a" rx="3"/>
                    <rect x="60" y="10" width="30" height="40" fill="#1a3f8a" rx="3"/>
                    <rect x="100" y="10" width="30" height="40" fill="#1a3f8a" rx="3"/>
                    <rect x="140" y="10" width="30" height="40" fill="#1a3f8a" rx="3"/>
                    <rect x="10" y="25" width="10" height="10" fill="#d32f2f" rx="2"/>
                    <text x="15" y="33" font-family="Arial" font-size="8" text-anchor="middle" fill="white">ARM</text>
                </svg>
                (Chiplet + CPU)
            </div>
        </div>

        <h2>Key Performance Metrics</h2>
        <table class="comparison-table">
            <thead>
                <tr>
                    <th>Metric</th>
                    <th>Hopper (H100)</th>
                    <th>Blackwell (B200)</th>
                    <th>Rubin (2026)</th>
                    <th>Improvement</th>
                </tr>
            </thead>
            <tbody>
                <tr>
                    <td class="metric-title">Memory Capacity</td>
                    <td>80GB (H100)<br>141GB (H200)</td>
                    <td class="highlight">192GB</td>
                    <td class="highlight">288GB</td>
                    <td>3.6× vs H100</td>
                </tr>
                <tr>
                    <td class="metric-title">Memory Bandwidth</td>
                    <td>3.35TB/s (H100)<br>4.8TB/s (H200)</td>
                    <td class="highlight">8TB/s</td>
                    <td class="highlight">13TB/s</td>
                    <td>3.9× vs H100</td>
                </tr>
                <tr>
                    <td class="metric-title">FP8 Tensor Core Perf</td>
                    <td>3.96 PFLOPS</td>
                    <td class="highlight">10 PFLOPS</td>
                    <td class="highlight">25 PFLOPS</td>
                    <td>6.3× vs H100</td>
                </tr>
                <tr>
                    <td class="metric-title">FP4 Tensor Core Perf</td>
                    <td>N/A</td>
                    <td class="highlight">20 PFLOPS</td>
                    <td class="highlight">50 PFLOPS</td>
                    <td>New capability</td>
                </tr>
                <tr>
                    <td class="metric-title">NVLink Bandwidth</td>
                    <td>900GB/s</td>
                    <td class="highlight">1.8TB/s</td>
                    <td class="highlight">3.6TB/s</td>
                    <td>4× vs H100</td>
                </tr>
                <tr>
                    <td class="metric-title">Max System Scale</td>
                    <td>DGX H100 (8 GPUs)</td>
                    <td class="highlight">GB200 NVL72 (144 dies)</td>
                    <td class="highlight">NVL576 (576 dies)</td>
                    <td>72× vs H100 system</td>
                </tr>
            </tbody>
        </table>

        <h2>Architecture Advancements</h2>
        <table>
            <tr>
                <th>Generation</th>
                <th>Key Innovations</th>
                <th>HBM Evolution</th>
                <th>Interconnect</th>
            </tr>
            <tr>
                <td>Hopper (2022)</td>
                <td>4th-gen Tensor Cores, Transformer Engine, FP8 support</td>
                <td>HBM3 (3.35TB/s) → HBM3e (4.8TB/s)</td>
                <td>NVLink4: 900GB/s</td>
            </tr>
            <tr>
                <td>Blackwell (2025)</td>
                <td>Dual-die design, 2nd-gen Transformer Engine, FP4 support, 10TB/s NV-HBI</td>
                <td>HBM3e: 8TB/s</td>
                <td>NVLink5: 1.8TB/s</td>
            </tr>
            <tr>
                <td>Rubin (2026)</td>
                <td>Chiplet design, ARM CPU integration, disaggregated architecture</td>
                <td>HBM4: 13TB/s</td>
                <td>NVLink6: 3.6TB/s</td>
            </tr>
        </table>

        <h2>Target Workload Performance Comparison</h2>
        <table>
            <tr>
                <th>Workload</th>
                <th>Hopper</th>
                <th>Blackwell</th>
                <th>Rubin</th>
            </tr>
            <tr>
                <td>LLM Training</td>
                <td>175B parameters</td>
                <td>Trillion parameters</td>
                <td>Multi-modal training</td>
            </tr>
            <tr>
                <td>LLM Inference</td>
                <td>Production scale</td>
                <td>30× H100 speed</td>
                <td>Massive context windows</td>
            </tr>
            <tr>
                <td>Database/Analytics</td>
                <td>Standard acceleration</td>
                <td>6× with decompression engine</td>
                <td>Real-time analytics</td>
            </tr>
        </table>

        <div class="footer">NVIDIA Data Center GPU Architecture Comparison | Performance figures represent peak theoretical values</div>
    </div>
</body>
</html>
```

This slide presents a comprehensive comparison of Nvidia's three GPU generations with:

1. A high-level overview showing key specs of each generation
2. A visual representation of the architectural evolution (single die → dual die → chiplet)
3. Detailed performance metrics comparison (memory, compute, interconnect)
4. Architecture advancements table highlighting key innovations
5. Workload performance comparison

The design follows McKinsey-style with clear hierarchies, concise information presentation, and emphasis on the most important metrics (compute, HBM, scale up, interconnect, memory bandwidth) as requested. The slide is information-dense yet visually organized.

Drag to resize