Rapt AI and AMD to enhance AI workload management and inference performance

1 min read

Rapt AI, a provider of AI-powered AI-workload automation for a range of GPUs and AI accelerators, has signed a strategic collaboration agreement with AMD.

Agreement to redefine AI infrastructure management and AI inference Credit: jeffrey - adobe.stock.com

The agreement looks to redefine AI infrastructure management and to improve AI inference and training workload management and performance on AMD’s Instinct GPUs, providing customers with a more scalable and cost-effective solution for deploying AI applications.

AI adoption is forcing organisations to address issues of resource allocation, performance bottlenecks, and complex GPU management and by integrating Rapt’s intelligent workload automation platform with AMD’s Instinct MI300X, MI325X and upcoming MI350 series GPUs, it will now be possible to deliver a scalable, high-performance, and cost-effective solution that enables customers to maximise AI inference and training efficiency across on-premises and multi-cloud infrastructures. 

According to AMD and Rapt, the collaboration will reduce costs and maximise GPU utilisation, while Rapt’s platform will streamline GPU management, eliminating the need for data scientists to spend valuable time on trial-and-error infrastructure configurations.

By automatically optimising resource allocation for their specific workloads, the platform enables users to focus on innovation rather than infrastructure and it seamlessly supports diverse GPU environments (AMD and others, whether in the cloud, on-premises or both) through a single instance, helping ensure maximum infrastructure flexibility.

The combined solution intelligently optimises job density and resource allocation on AMD Instinct GPUs, resulting in better inference performance and scalability for production AI deployments. Rapt’s auto-scaling capabilities also help ensure efficient resource use based on demand, reducing latency and maximising cost efficiency

Rapt’s platform has been designed to work out-of-the-box with AMD Instinct GPUs, helping ensure immediate benefits to performance.

Ongoing collaboration between Rapt and AMD will drive further optimisations across GPU scheduling, memory utilisation and more, helping ensure customers are equipped with a future ready AI infrastructure.

Commenting on the collaboration, Negin Oliver, corporate vice president of business development, Data Center GPU Business at AMD, said, "Our collaboration with Rapt AI combines the cutting-edge capabilities of AMD Instinct GPUs with Rapt’s intelligent workload automation, enabling customers to achieve greater efficiency, flexibility, and cost savings across their AI infrastructure.”

“Collaboration with AMD allows us to further enhance our platform, optimising it for the powerful AMD Instinct GPUs,” said Charlie Leeming, CEO of Rapt. “This joint solution is set to transform AI infrastructure management, driving better performance, cost efficiency, and faster time to value for our mutual customers. We are excited about the impact this will have on accelerating AI innovation across industries.”