← All hardware
Grace CPU + Hopper GPU fused over NVLink-C2C
Pros
- Huge coherent memory pool
- Excellent for recommender/graph AI
- Tight CPU-GPU coupling
- Available on several clouds
Cons
- Niche vs pure GPU racks
- Hopper-gen GPU
- Limited cloud availability
- Superseded by Grace Blackwell
✓ Where it shines / best for
- Large-memory AI inference and recommender systems
- HPC and scientific computing with big working sets
- Graph analytics and memory-bound workloads
✕ Not the best fit for
- Pure max-throughput training where B200/H200 SXM clusters fit better
- Edge or on-device deployment
- Small teams without datacenter infrastructure
Features
- ✓ AI inference
- ✓ Data-center scale
- ✓ HBM3E
- ✓ Grace CPU
- ✓ Arm CPU
- ✓ Coherent Memory
- ✓ Hpc
- ✓ NVLink C2c
Pricing
| Plan | Price | Billing | Notes |
|---|---|---|---|
| GH200 superchip (street price) | $34,000-$40,000 | one-time | Per-superchip price via OEM systems; not sold at public retail |
| Cloud rental (per GH200) | ~$1.99-$6.50 | per hour | On-demand neocloud pricing (e.g., Lambda); ~$1,433-$4,680/mo at 720 hrs |
Pricing verified from the official source. Prices change often — confirm on the vendor's site before buying.
Specifications
| use | Memory-bound AI and HPC |
| power | ~1,000W |
| memory | 96GB HBM3 + up to 480GB LPDDR5X |
| performance | ~4 PFLOPS FP8 (GPU) |
| architecture | Grace (Arm) + Hopper |
Sponsored
A full review is being generated for this product and will appear here shortly.