Amazon Web Services (AWS) has announced a significant AWS GPU price cut, reducing the cost of running machine learning and high-performance computing (HPC) workloads by 33% to 45%. This move is part of AWS’s ongoing strategy to deliver more value to developers, startups, and enterprises building AI models at scale.
The new pricing applies to the latest EC2 P5, P5e, and P5en instances, powered by cutting-edge NVIDIA H100 and H200 GPUs. These instances offer faster performance and higher memory bandwidth, which helps lower the total time and cost required to train large language models and generative AI applications. Compared to previous-generation P4 instances, the P5 family now offers as much as 40% cost savings while significantly improving throughput.
In addition to upgrading its NVIDIA-based infrastructure, AWS is also promoting its Trainium AI chips as a powerful and cheaper alternative to GPUs. Trainium-based instances are tailored for deep learning training and promise up to 40% better price-performance than comparable GPU options. In some use cases, AWS Trainium can run AI workloads at just 25% of the cost of traditional GPU training, which could drastically cut cloud bills for startups and researchers alike.
The AWS GPU price cut is a direct response to growing demand for compute-heavy AI workloads, which are fueling new innovations across sectors like healthcare, finance, gaming, and autonomous systems. With these price reductions, AWS aims to stay competitive against other cloud providers like Microsoft Azure and Google Cloud, which are also expanding their AI-focused compute offerings.
Amazon’s new pricing strategy also reflects a broader shift toward custom silicon. By offering more cost-effective alternatives like Trainium and Inferentia chips, AWS is giving customers more flexibility to optimize their performance and budgets based on specific workloads. Whether a company needs raw GPU power or a budget-friendly solution for training mid-size models, AWS now provides more tailored options at competitive prices.
These pricing changes are effective immediately and apply across various regions, depending on instance availability. Developers and data scientists can begin leveraging the new AWS GPU price cut by selecting updated EC2 instances via the AWS Management Console, API, or CLI.
As AI adoption continues to grow, lower GPU prices could be the catalyst that allows more companies to experiment, scale, and innovate faster in the cloud.