Cheapest H200 cloud price

Hopper refresh with 141GB HBM3e — more memory headroom than the H100 for long-context and large models. Below: every cloud we track for the NVIDIA H200, ranked by price and re-verified weekly — so you rent 141GB of compute at the best rate, not the first one you find.

Cheapest
$2.60
Best provider
GMI Cloud
Providers
2
Price spread
1.7×

NVIDIA H200

141GB · 2 providers tracked · verified 2026-06-05

Cheapest on-demand

$2.60/hr

ProviderTypeOn-demand $/hrSpot $/hr
GMI Cloudcheapest
SXM
$2.60
Rent →
Spheron
SXM
$4.54
Rent →

Standard published on-demand pricing, USD per single GPU per hour, re-verified weekly (last 2026-06-05). Spot/marketplace and committed-use rates run lower. Hyperscaler rates are per-GPU from multi-GPU instance list prices. Spread on H200: 1.7× between cheapest and dearest tracked rate.

What drives the H200 price gap

The NVIDIA H200 is identical silicon everywhere — the 1.7× gap between GMI Cloud at the bottom and the hyperscalers at the top is about packaging, not performance. Specialist and marketplace clouds compete on raw $/hr; AWS, Azure and Google bundle the card with their platform, networking and support and price accordingly. For a sustained training run or a busy inference fleet, settling on the cheapest reliable provider is one of the largest single levers on your compute bill.

On-demand vs spot

On-demand guarantees the GPU is yours; spot and marketplace supply is cheaper but can be reclaimed, so it suits fault-tolerant or checkpointed workloads. Toggle "Best (incl. spot)" in the table above to see the cheapest available rate including interruptible supply.

Should you rent, or use an API?

If your goal is running an LLM rather than training one, renting a H200only beats a managed API above a breakeven volume — and the maths flips once you count idle hours. Check it first with the self-host vs API breakeven calculator.

Frequently asked questions

What is the cheapest cloud H200 price?

The lowest on-demand rate we track for the NVIDIA H200 is $2.60/hr at GMI Cloud, across 2 providers (verified 2026-06-05). Spot and marketplace rates run lower with variable availability.

How much does an H200 cost per hour?

On-demand H200 pricing ranges from $2.60/hr to $4.54/hr per GPU depending on provider — about a 1.7× spread for the same card. Specialist clouds are cheapest; hyperscalers (AWS/Azure/GCP) sit at the top.

Is it cheaper to rent an H200 or use an LLM API?

Renting only wins above a breakeven volume, because a GPU bills every hour it exists while an API bills per token. Model your own crossover with the self-host vs API breakeven calculator before committing.

Independent comparison, no vendor influence. Standard on-demand pricing per single GPU, re-verified weekly (last 2026-06-05); negotiated and committed-use rates differ. Published under CC BY 4.0.

Compare other GPUs