NVIDIA B200 is their Blackwell-generation GPU. Each GPU has:

  • 10 GPCs1
  • ? TPCs
  • 192 GB HBM3 (8 stacks)
    • 8 TB/s (max)
  • 2x 900 GB/s NVLink v5 (D2D)2
  • 2x 256 GB/s PCIe Gen6 (H2D)2
  • 1000 W maximum

B100 GPUs are a lower-power variant of B200 (700W) that is meant to be a “drop-in replacement” for HGX H100 platforms.3 That is, you can take a server platform built for 8-way H100 baseboards, swap in B100 baseboards, and sell them without having to re-engineer power or thermals.

Performance

The following are theoretical maximum performance in TFLOPS:3

Data TypeVFMAMatrixSparse
FP649090
FP32180
TF3212502500
FP16500010000
BF1625005000
FP8500010000
FP6500010000
FP41000020000
INT8500010000

Footnotes

  1. open-gpu-kernel-modules/kernel-open/nvidia-uvm/uvm_blackwell_fault_buffer.h at main · NVIDIA/open-gpu-kernel-modules (github.com)

  2. GB200 NVL2 | NVIDIA 2

  3. https://resources.nvidia.com/en-us-blackwell-architecture 2