Independent · 20 providers · re-verified weekly · no vendor influence
The Live GPU Price Index
The cheapest place to rent every major NVIDIA GPU, from the H100 to the B200, ranked across 20 clouds, with on-demand and spot rates re-verified weekly. The same card costs up to 8.6× more depending on where you rent it. We do the reading so you stop overpaying.
Cheapest cloud GPU, by model
Lowest tracked on-demand rate per GPU.
| GPU | VRAM | Cheapest $/hr | Best provider | Providers | Spread |
|---|---|---|---|---|---|
| B200 | 192GB | $4.99 | Lambda | 5 | 2.9× |
| H200 | 141GB | $2.60 | GMI Cloud | 2 | 1.7× |
| H100 | 80GB | $1.65 | Vast.ai | 18 | 8.6× |
| A100 80GB | 80GB | $0.78 | Thunder Compute | 6 | 7.4× |
| L40S | 48GB | $0.72 | Spheron | 2 | 1.2× |
| RTX 4090 | 24GB | $0.35 | Vast.ai | 3 | 2.0× |
| RTX 5090 | 32GB | $0.76 | Spheron | 1 | 1.0× |
37 price points across 20 providers · standard on-demand, per-GPU, USD · verified 2026-06-05.
Compare every provider
NVIDIA H100
80GB · 18 providers tracked · verified 2026-06-05
Cheapest on-demand
$1.65/hr
| Provider | Type | On-demand $/hr | Spot $/hr |
|---|---|---|---|
| Vast.ai (marketplace, cheapest) | PCIe | $1.65 | $0.91 |
| UpCloud | SXM | $2.08 | — |
| Sesterce | SXM | $2.09 | — |
| FluidStack | SXM | $2.10 | — |
| CUDO Compute | SXM | $2.25 | — |
| Lambda | SXM | $2.49 | — |
| Spheron | SXM | $2.50 | $1.03 |
| Novita AI | SXM | $2.59 | — |
| RunPod | SXM | $2.69 | — |
| Nebius | SXM | $2.95 | — |
| OVHcloud | PCIe | $2.99 | — |
| Vultr | SXM | $2.99 | — |
| Gcore | SXM | $3.21 | — |
| DigitalOcean | SXM | $3.39 | — |
| Paperspace | SXM | $5.95 | — |
| AWS | SXM | $6.88 | — |
| Microsoft Azure | SXM | $6.98 | — |
| Google Cloud | SXM | $14.19 | — |
Standard published on-demand pricing, USD per single GPU per hour, re-verified weekly (last 2026-06-05). Spot/marketplace and committed-use rates run lower. Hyperscaler rates are per-GPU from multi-GPU instance list prices. Spread on H100: 8.6× between cheapest and dearest tracked rate.
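To make the spread figure reproducible, here is a minimal Python sketch that recomputes it from the on-demand column of the H100 table. The rates are copied from that table; the names `H100_ON_DEMAND` and `spread` are illustrative and not part of any published tooling.

```python
# On-demand H100 rates (USD per GPU-hour), copied from the H100 table.
H100_ON_DEMAND = {
    "Vast.ai": 1.65, "UpCloud": 2.08, "Sesterce": 2.09, "FluidStack": 2.10,
    "CUDO Compute": 2.25, "Lambda": 2.49, "Spheron": 2.50, "Novita AI": 2.59,
    "RunPod": 2.69, "Nebius": 2.95, "OVHcloud": 2.99, "Vultr": 2.99,
    "Gcore": 3.21, "DigitalOcean": 3.39, "Paperspace": 5.95, "AWS": 6.88,
    "Microsoft Azure": 6.98, "Google Cloud": 14.19,
}


def spread(rates):
    """Return (cheapest provider, dearest provider, dearest / cheapest ratio)."""
    cheapest = min(rates, key=rates.get)
    dearest = max(rates, key=rates.get)
    return cheapest, dearest, rates[dearest] / rates[cheapest]


cheapest, dearest, ratio = spread(H100_ON_DEMAND)
print(f"{cheapest} vs {dearest}: {ratio:.1f}x across {len(H100_ON_DEMAND)} providers")
```

Spread here is simply the dearest tracked rate divided by the cheapest, so a single outlier at either end moves it a lot; it says nothing about where most providers cluster.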
Why the same GPU costs up to 8.6× more elsewhere
An H100 is the same silicon whether you rent it from a specialist cloud or a hyperscaler, but the hourly price isn't. Specialist and marketplace providers compete on raw price; hyperscalers bundle the GPU with their platform, support, and networking and charge several times more: in the H100 table above, AWS, Microsoft Azure, and Google Cloud list $6.88 to $14.19 per GPU-hour, against $1.65 to $2.69 at the nine cheapest providers. For a training run or a busy inference fleet, that gap is the difference between a healthy and a ruinous compute bill.
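To put the gap in bill terms, the sketch below prices one job at four of the on-demand H100 rates from the table. The job itself (8 GPUs for 72 hours) is a hypothetical workload chosen only for illustration, and the calculation ignores storage, egress, and any minimum-commitment terms a provider may apply.

```python
# On-demand H100 rates (USD per GPU-hour), taken from the H100 table.
RATES = {"Vast.ai": 1.65, "Lambda": 2.49, "AWS": 6.88, "Google Cloud": 14.19}


def run_cost(rate_per_gpu_hour, gpus, hours):
    """Total rental cost: hourly rate times GPU count times wall-clock hours."""
    return rate_per_gpu_hour * gpus * hours


# Hypothetical job, not from the source: 8 GPUs for 72 hours (576 GPU-hours).
GPUS, HOURS = 8, 72

costs = {provider: run_cost(rate, GPUS, HOURS) for provider, rate in RATES.items()}
for provider, cost in costs.items():
    print(f"{provider:<13} ${cost:>9,.2f}")
```

Because cost scales linearly with GPU-hours, the ratio between any two providers stays the same at every job size; only the absolute difference grows.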
Renting vs. paying per token
Renting a GPU only beats a managed LLM API above a certain volume — and the line moves once you count the hours the card sits idle. Before you commit to a GPU, run your numbers through the self-host vs API breakeven.
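A rough way to see where that line sits is sketched below. It is a simplified model, not the site's breakeven calculator: it assumes one GPU rented around the clock, a fixed sustained throughput while busy, and a flat API price per million tokens. The throughput (1,500 tokens/s) and API price ($0.60 per million tokens) are hypothetical inputs; only the $2.49 hourly rate comes from the H100 table.

```python
HOURS_PER_MONTH = 730  # average month length, assumed for illustration


def self_host_cost_per_million(rate_per_hour, tokens_per_second, utilization):
    """USD per million tokens when the card is busy `utilization` of rented hours."""
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be in (0, 1]")
    useful_tokens_per_hour = tokens_per_second * 3600 * utilization
    return rate_per_hour / useful_tokens_per_hour * 1_000_000


def breakeven_monthly_tokens(rate_per_hour, api_price_per_million):
    """Monthly tokens at which an always-on rental costs the same as the API."""
    monthly_rent = rate_per_hour * HOURS_PER_MONTH
    return monthly_rent / api_price_per_million * 1_000_000


RATE = 2.49          # USD per GPU-hour (Lambda H100 on-demand, from the table)
THROUGHPUT = 1500    # tokens per second while busy (hypothetical)
API_PRICE = 0.60     # USD per million tokens (hypothetical)

for utilization in (1.0, 0.5, 0.25):
    cost = self_host_cost_per_million(RATE, THROUGHPUT, utilization)
    print(f"utilization {utilization:.0%}: ${cost:.2f} per million tokens")

volume = breakeven_monthly_tokens(RATE, API_PRICE)
print(f"breakeven: {volume / 1e9:.2f}B tokens per month")
```

Halving utilization doubles the effective cost per token, which is why idle hours move the breakeven. The model leaves out engineering time, redundancy, and throughput differences between models, all of which push the real breakeven higher.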
Methodology
Rates are standard published on-demand pricing, per single GPU, in USD, re-verified weekly (last 2026-06-05). Spot and marketplace rates are shown where tracked and run lower with variable availability. Hyperscaler per-GPU figures are derived from multi-GPU instance list prices. Published under CC BY 4.0 — cite freely with a link. Spotted a stale rate? Tell us and we correct within 48 hours.
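The per-GPU derivation for hyperscalers can be sketched as a simple division of the instance list price by the number of GPUs in the instance. The example instance below (8 GPUs at $55.04/hr) is a hypothetical figure chosen so that the result matches the $6.88 AWS rate in the H100 table; it is not a quoted list price, and the function name is illustrative.

```python
def per_gpu_rate(instance_price_per_hour, gpus_per_instance):
    """Per-GPU hourly rate from a multi-GPU instance price, rounded to cents."""
    if gpus_per_instance < 1:
        raise ValueError("gpus_per_instance must be at least 1")
    return round(instance_price_per_hour / gpus_per_instance, 2)


# Hypothetical 8-GPU instance price, back-calculated for illustration only.
rate = per_gpu_rate(55.04, 8)
print(f"${rate:.2f} per GPU-hour")
```

This treats the whole instance price as GPU cost, so the bundled CPU, memory, and networking are folded into the per-GPU figure.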