Independent · 20 providers · re-verified weekly · no vendor influence

The Live GPU Price Index

The cheapest place to rent every major NVIDIA GPU — H100 to B200 — ranked across 20 clouds, with on-demand and spot rates re-verified weekly. The same card costs up to 10× more depending on where you rent it. We do the reading so you stop overpaying.

Cheapest cloud GPU, by model

Lowest tracked on-demand rate per GPU. Tap a model for the full provider breakdown.

GPUVRAMCheapest $/hrBest providerProvidersSpread
B200192GB$4.99Lambda52.9×
H200141GB$2.60GMI Cloud21.7×
H10080GB$1.65Vast.ai188.6×
A100 80GB80GB$0.78Thunder Compute67.4×
L40S48GB$0.72Spheron21.2×
RTX 409024GB$0.35Vast.ai32.0×
RTX 509032GB$0.76Spheron11.0×

37 price points across 20 providers · standard on-demand, per-GPU, USD · verified 2026-06-05.

Compare every provider

NVIDIA H100

80GB · 18 providers tracked · verified 2026-06-05

Cheapest on-demand

$1.65/hr

ProviderTypeOn-demand $/hrSpot $/hr
Vast.aimarketplacecheapest
PCIe
$1.65
$0.91Rent →
UpCloud
SXM
$2.08
Rent →
Sesterce
SXM
$2.09
Rent →
FluidStack
SXM
$2.10
Rent →
CUDO Compute
SXM
$2.25
Rent →
Lambda
SXM
$2.49
Rent →
Spheron
SXM
$2.50
$1.03Rent →
Novita AI
SXM
$2.59
Rent →
RunPod
SXM
$2.69
Rent →
Nebius
SXM
$2.95
Rent →
OVHcloud
PCIe
$2.99
Rent →
Vultr
SXM
$2.99
Rent →
Gcore
SXM
$3.21
Rent →
DigitalOcean
SXM
$3.39
Rent →
Paperspace
SXM
$5.95
Rent →
AWS
SXM
$6.88
Rent →
Microsoft Azure
SXM
$6.98
Rent →
Google Cloud
SXM
$14.19
Rent →

Standard published on-demand pricing, USD per single GPU per hour, re-verified weekly (last 2026-06-05). Spot/marketplace and committed-use rates run lower. Hyperscaler rates are per-GPU from multi-GPU instance list prices. Spread on H100: 8.6× between cheapest and dearest tracked rate.

Why the same GPU costs 10× more elsewhere

An H100 is the same silicon whether you rent it from a specialist cloud or a hyperscaler — but the hourly price isn't. Specialist and marketplace providers compete on raw price; hyperscalers bundle the GPU with their platform, support, and networking and charge several times more. For a training run or a busy inference fleet, that gap is the difference between a healthy and a ruinous compute bill.

Renting vs. paying per token

Renting a GPU only beats a managed LLM API above a certain volume — and the line moves once you count the hours the card sits idle. Before you commit to a GPU, run your numbers through the self-host vs API breakeven.

Methodology

Rates are standard published on-demand pricing, per single GPU, in USD, re-verified weekly (last 2026-06-05). Spot and marketplace rates are shown where tracked and run lower with variable availability. Hyperscaler per-GPU figures are derived from multi-GPU instance list prices. Published under CC BY 4.0 — cite freely with a link. Spotted a stale rate? Tell us and we correct within 48 hours.