← All hardware
Google TPU v6e (Trillium) logo
hardware Data Center AI Accelerators

Google TPU v6e (Trillium)

by Google

Google's widely available 6th-gen TPU for efficient training and inference.

Pros

  • Generally available and widely proven
  • Cost-efficient training/inference via cloud
  • Tight Google Cloud and JAX integration

Cons

  • Lower per-chip memory than v7 Ironwood
  • Cloud-only, Google-ecosystem-locked
  • Superseded at the high end by Ironwood

✓ Where it shines / best for

  • Training and serving large language and generative models on Google Cloud
  • Teams already on JAX or PyTorch/XLA wanting cost-efficient throughput
  • High-volume batch inference at scale

✕ Not the best fit for

  • On-premises deployments (cloud-rental only, not sold as hardware)
  • Small experiments where a single GPU is cheaper
  • CUDA-locked workloads that can't port to XLA

Features

  • ✓ LLM
  • ✓ API access
  • ✓ Inference
  • ✓ Training
  • ✓ High Throughput
  • ✓ Hbm
  • ✓ Jax
  • ✓ Cloud TPU
  • ✓ Free trial
  • ✓ Pytorch Xla

Pricing

PlanPriceBillingNotes
On-demand (us-central)~$2.70per chip-hourCloud TPU v6e (Trillium) on-demand list price; varies by region. No purchase of hardware — rented via Google Cloud.
1-year commitment~$1.89per chip-hourApprox. 30% discount vs on-demand with committed-use.
3-year commitment~$1.22per chip-hourApprox. 55% discount vs on-demand with committed-use.
Spot/Preemptiblevariesper chip-hourDiscounted preemptible pricing available; subject to reclamation.

Pricing verified from the official source. Prices change often — confirm on the vendor's site before buying.

Specifications

memory32 GB HBM per chip, ~1,600 GB/s
coolingcloud-only
pod_scale256 chips per pod
performance4x training / up to 3x inference vs prior gen
architectureTrillium (TPU v6e), Google custom ASIC
Sponsored

A full review is being generated for this product and will appear here shortly.

Compare with

Compare
Compare