Companies are hoarding expensive AI GPUs and leaving most of that computing power unused while bills skyrocket.


  • Most AI GPUs run at very low utilization in production systems
  • Businesses pay for twenty times more GPU capacity than they need
  • Overprovisioning is rising sharply year over year instead of improving

Tech companies are rushing to buy massive amounts of AI infrastructure, but most of that hardware is doing virtually no useful work.

A report from Cast AI, based on tens of thousands of Kubernetes clusters across AWS, Azure, and GCP, found that average GPU utilization sits at just 5%.
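
The 5% utilization figure and the "twenty times" claim are two views of the same arithmetic. The short sketch below shows that relationship, assuming that spend scales linearly with provisioned capacity and that utilization is a simple used-to-provisioned ratio. The function names are illustrative and are not taken from the Cast AI report.

```python
def overprovisioning_factor(utilization: float) -> float:
    """Return how many times more capacity is provisioned than is used.

    Assumption: cost scales linearly with provisioned capacity, so a
    utilization of u implies paying for 1/u times what is actually used.
    """
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be in the range (0, 1]")
    return 1 / utilization


def idle_share(utilization: float) -> float:
    """Return the fraction of provisioned capacity that sits idle."""
    if not 0 <= utilization <= 1:
        raise ValueError("utilization must be in the range [0, 1]")
    return 1 - utilization


# Average GPU utilization reported in the article.
reported_utilization = 0.05

factor = overprovisioning_factor(reported_utilization)
idle = idle_share(reported_utilization)

print(f"Provisioned capacity is about {factor:.0f}x what is used")
print(f"About {idle:.0%} of paid-for capacity is idle")
```

Under those assumptions, 5% utilization corresponds to roughly twenty times more capacity than is used, with about 95% of it idle. Real bills will differ where pricing is not linear, for example with reserved or spot instances.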
