What drives the RTX 4090 price gap
The NVIDIA RTX 4090 is identical silicon everywhere — the 2.0× gap between Vast.ai at the bottom and the hyperscalers at the top is about packaging, not performance. Specialist and marketplace clouds compete on raw $/hr; AWS, Azure and Google bundle the card with their platform, networking and support and price accordingly. For a sustained training run or a busy inference fleet, settling on the cheapest reliable provider is one of the largest single levers on your compute bill.
On-demand vs spot
On-demand guarantees the GPU is yours; spot and marketplace supply is cheaper but can be reclaimed, so it suits fault-tolerant or checkpointed workloads. Toggle "Best (incl. spot)" in the table above to see the cheapest available rate including interruptible supply.
Should you rent, or use an API?
If your goal is running an LLM rather than training one, renting a RTX 4090only beats a managed API above a breakeven volume — and the maths flips once you count idle hours. Check it first with the self-host vs API breakeven calculator.
Frequently asked questions
What is the cheapest cloud RTX 4090 price?
The lowest on-demand rate we track for the NVIDIA RTX 4090 is $0.35/hr at Vast.ai, across 3 providers (verified 2026-06-05). Spot and marketplace rates run lower with variable availability.
How much does an RTX 4090 cost per hour?
On-demand RTX 4090 pricing ranges from $0.35/hr to $0.69/hr per GPU depending on provider — about a 2.0× spread for the same card. Specialist clouds are cheapest; hyperscalers (AWS/Azure/GCP) sit at the top.
Is it cheaper to rent an RTX 4090 or use an LLM API?
Renting only wins above a breakeven volume, because a GPU bills every hour it exists while an API bills per token. Model your own crossover with the self-host vs API breakeven calculator before committing.
Independent comparison, no vendor influence. Standard on-demand pricing per single GPU, re-verified weekly (last 2026-06-05); negotiated and committed-use rates differ. Published under CC BY 4.0.