Independent · 20 providers · re-verified weekly · no vendor influence

The Live GPU Price Index

The cheapest place to rent every major NVIDIA GPU, from the RTX 4090 to the B200, ranked across 20 clouds, with on-demand and spot rates re-verified weekly. The same card can cost up to 8.6× more depending on where you rent it (the current H100 spread). We do the reading so you stop overpaying.

Cheapest cloud GPU, by model

Lowest tracked on-demand rate per GPU. Spread is the dearest tracked rate divided by the cheapest.

GPU	VRAM	Cheapest $/hr	Best provider	Providers	Spread
B200	192GB	$4.99	Lambda	5	2.9×
H200	141GB	$2.60	GMI Cloud	2	1.7×
H100	80GB	$1.65	Vast.ai	18	8.6×
A100 80GB	80GB	$0.78	Thunder Compute	6	7.4×
L40S	48GB	$0.72	Spheron	2	1.2×
RTX 4090	24GB	$0.35	Vast.ai	3	2.0×
RTX 5090	32GB	$0.76	Spheron	1	1.0×

37 price points across 20 providers · standard on-demand, per-GPU, USD · verified 2026-07-14.

Compare every provider

NVIDIA H100

80GB · 18 providers tracked · verified 2026-07-14

Cheapest on-demand

$1.65/hr

Provider	Type	On-demand $/hr	Spot $/hr
Vast.ai (marketplace, cheapest)	PCIe	$1.65	$0.91
UpCloud	SXM	$2.08	—
Sesterce	SXM	$2.09	—
FluidStack	SXM	$2.10	—
CUDO Compute	SXM	$2.25	—
Lambda	SXM	$2.49	—
Spheron	SXM	$2.50	$1.03
Novita AI	SXM	$2.59	—
OVHcloud	PCIe	$2.99	—
Vultr	SXM	$2.99	—
Gcore	SXM	$3.21	—
RunPod	SXM	$3.29	—
DigitalOcean	SXM	$3.39	—
Nebius	SXM	$3.85	—
Paperspace	SXM	$5.95	—
AWS	SXM	$6.88	—
Microsoft Azure	SXM	$6.98	—
Google Cloud	SXM	$14.19	—

Standard published on-demand pricing, USD per single GPU per hour, re-verified weekly (last 2026-07-14). Spot/marketplace and committed-use rates run lower. Hyperscaler rates are per-GPU from multi-GPU instance list prices. Spread on H100: 8.6× between cheapest and dearest tracked rate.
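For readers who want to check the spread figure themselves, here is a minimal Python sketch that recomputes it from the H100 on-demand rates in the table. The `H100_RATES` list and the `spread` helper are names made up for this illustration, not part of any published tooling. The SXM-only figure is an extra like-for-like comparison, since the cheapest tracked listing is a PCIe card.

```python
# H100 on-demand rates (USD per GPU-hour), copied from the table above.
# Each row is (provider, form factor, rate).
H100_RATES = [
    ("Vast.ai", "PCIe", 1.65), ("UpCloud", "SXM", 2.08),
    ("Sesterce", "SXM", 2.09), ("FluidStack", "SXM", 2.10),
    ("CUDO Compute", "SXM", 2.25), ("Lambda", "SXM", 2.49),
    ("Spheron", "SXM", 2.50), ("Novita AI", "SXM", 2.59),
    ("OVHcloud", "PCIe", 2.99), ("Vultr", "SXM", 2.99),
    ("Gcore", "SXM", 3.21), ("RunPod", "SXM", 3.29),
    ("DigitalOcean", "SXM", 3.39), ("Nebius", "SXM", 3.85),
    ("Paperspace", "SXM", 5.95), ("AWS", "SXM", 6.88),
    ("Microsoft Azure", "SXM", 6.98), ("Google Cloud", "SXM", 14.19),
]


def spread(rows):
    """Return the dearest rate divided by the cheapest rate."""
    prices = [price for _, _, price in rows]
    return max(prices) / min(prices)


# Keep only SXM listings for a like-for-like comparison.
sxm_only = [row for row in H100_RATES if row[1] == "SXM"]

print(f"All H100 listings: {spread(H100_RATES):.1f}x")  # 8.6x
print(f"SXM listings only: {spread(sxm_only):.1f}x")    # 6.8x
```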

Why the same GPU costs up to 8.6× more elsewhere

An H100 is the same silicon whether you rent it from a specialist cloud or a hyperscaler, but the hourly price isn't. (The cheapest tracked rate is for a PCIe card; among SXM listings alone, the dearest rate is still 6.8× the cheapest.) Specialist and marketplace providers compete on raw price; hyperscalers bundle the GPU with their platform, support, and networking and charge several times more. For a training run or a busy inference fleet, that gap is the difference between a healthy compute bill and a ruinous one.

Renting vs. paying per token

Renting a GPU only beats a managed LLM API above a certain volume — and the line moves once you count the hours the card sits idle. Before you commit to a GPU, run your numbers through the self-host vs API breakeven.
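As a rough illustration of how idle hours move that line, the sketch below compares a rented card's effective cost per million tokens with a per-token API price. Only the $1.65/hr rate comes from the table. The throughput and API price are placeholder assumptions, and the function names are invented for this example, so substitute your own measurements before drawing conclusions.

```python
def self_host_usd_per_mtok(gpu_usd_per_hr, tokens_per_sec, utilization):
    """Effective USD per million tokens when the card does useful work
    for only `utilization` (0 to 1) of the hours you pay for."""
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be in (0, 1]")
    useful_tokens_per_hr = tokens_per_sec * 3600 * utilization
    return gpu_usd_per_hr / useful_tokens_per_hr * 1_000_000


def breakeven_utilization(gpu_usd_per_hr, tokens_per_sec, api_usd_per_mtok):
    """Busy fraction at which renting costs the same per token as the API.
    A result above 1.0 means renting never wins at these inputs."""
    return gpu_usd_per_hr * 1_000_000 / (tokens_per_sec * 3600 * api_usd_per_mtok)


GPU_USD_PER_HR = 1.65     # cheapest tracked H100 on-demand rate (from the table)
TOKENS_PER_SEC = 1500     # ASSUMPTION: placeholder throughput, not a benchmark
API_USD_PER_MTOK = 0.60   # ASSUMPTION: placeholder API price per million tokens

for utilization in (1.0, 0.5, 0.25):
    cost = self_host_usd_per_mtok(GPU_USD_PER_HR, TOKENS_PER_SEC, utilization)
    print(f"{utilization:.0%} busy: ${cost:.2f} per million tokens")

needed = breakeven_utilization(GPU_USD_PER_HR, TOKENS_PER_SEC, API_USD_PER_MTOK)
print(f"Break-even at {needed:.0%} utilization")
```

The sketch ignores engineering time, storage, egress, and batching effects, all of which shift the real break-even point.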

Methodology

Rates are standard published on-demand pricing, per single GPU, in USD, re-verified weekly (last 2026-07-14). Spot and marketplace rates are shown where tracked; they run lower but availability varies. Hyperscaler per-GPU figures are derived from multi-GPU instance list prices. Published under CC BY 4.0: cite freely with a link. Spotted a stale rate? Tell us and we will correct it within 48 hours.
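The per-GPU derivation for multi-GPU instances presumably amounts to dividing the instance's hourly list price by its GPU count. A minimal sketch under that assumption follows; the `per_gpu_rate` helper and the $40.00/hr 8-GPU instance are hypothetical and do not correspond to any provider's quoted price.

```python
def per_gpu_rate(instance_usd_per_hr, gpus_per_instance):
    """Split an instance's hourly list price evenly across its GPUs,
    rounded to cents to match the tables above."""
    if gpus_per_instance < 1:
        raise ValueError("gpus_per_instance must be at least 1")
    return round(instance_usd_per_hr / gpus_per_instance, 2)


# HYPOTHETICAL input: an 8-GPU instance listed at $40.00/hr.
print(per_gpu_rate(40.00, 8))  # 5.0
```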