← All hardware

hardware Data Center AI Accelerators

NVIDIA GH200 Grace Hopper Superchip

by NVIDIA

Grace CPU + Hopper GPU fused over NVLink-C2C

Pros

Huge coherent memory pool
Excellent for recommender/graph AI
Tight CPU-GPU coupling
Available on several clouds

Cons

Niche vs pure GPU racks
Hopper-gen GPU
Limited cloud availability
Superseded by Grace Blackwell

✓ Where it shines / best for

Large-memory AI inference and recommender systems
HPC and scientific computing with big working sets
Graph analytics and memory-bound workloads

✕ Not the best fit for

Pure max-throughput training where B200/H200 SXM clusters fit better
Edge or on-device deployment
Small teams without datacenter infrastructure

Features

✓ AI inference
✓ Data-center scale
✓ HBM3E
✓ Grace CPU
✓ Arm CPU
✓ Coherent Memory
✓ Hpc
✓ NVLink C2c

Pricing

Plan	Price	Billing	Notes
GH200 superchip (street price)	$34,000-$40,000	one-time	Per-superchip price via OEM systems; not sold at public retail
Cloud rental (per GH200)	~$1.99-$6.50	per hour	On-demand neocloud pricing (e.g., Lambda); ~$1,433-$4,680/mo at 720 hrs

Pricing verified from the official source. Prices change often — confirm on the vendor's site before buying.

Specifications

use	Memory-bound AI and HPC
power	~1,000W
memory	96GB HBM3 + up to 480GB LPDDR5X
performance	~4 PFLOPS FP8 (GPU)
architecture	Grace (Arm) + Hopper

A full review is being generated for this product and will appear here shortly.

Compare with

NVIDIA GB200 NVL72

A rack-scale exaflop AI supercomputer that acts as one giant GPU.

9.6/10 hardware From $10.50/per hour

NVIDIA GB300 NVL72

Rack-scale Blackwell Ultra: 72 GPUs + 36 Grace CPUs as one giant accelerator

9.5/10 hardware From $12/per hour

NVIDIA GB300 NVL72 (Blackwell Ultra)

Blackwell Ultra rack-scale system tuned for the age of AI reasoning.

9.5/10 hardware From $12/per hour

NVIDIA B300 (Blackwell Ultra)

Blackwell Ultra single-GPU module for AI reasoning at scale

9.4/10 hardware From $30000/one-time

Compare

Compare