Rafay Debuts New Platform Capabilities to Speed Up GPU Infrastructure Consumption and Monetization

November 20, 2024

New Capabilities Enable Self-service Consumption of Accelerated Computing Infrastructure in Addition to AI and ML Tooling for Cloud Providers and Enterprises

SUNNYVALE, Calif. – Nov. 19, 2024 – (BUSINESS WIRE)--Rafay Systems, the leading provider of Platform-as-a-Service (PaaS) capabilities for cloud-native and GPU and AI consumption, today announced new platform advancements that help enterprises and GPU cloud providers deliver developer-friendly consumption workflows for GPU infrastructure. The new Rafay Platform capabilities include enterprise-grade controls, SKU definition, customer-specific policy enforcement and granular chargeback data.

Enterprises investing in GPU-based infrastructure in data centers can leverage the Rafay Platform to roll out feature-rich enterprise-wide GPU clouds that developers and data scientists can consume on demand — complete with workbenches for model training, fine-tuning and inferencing. GPU cloud providers deploying GPUs for consumption by downstream customers can leverage the Rafay Platform to operate a full-featured, multi-tenant GPU PaaS that delivers both accelerated computing resources along with AI and ML tooling for training, tuning and serving large language models (LLMs).

GPU Investments Outpace Platform Team Bandwidth, Delaying AI Projects and Increasing Costs

Demand for accelerated computing infrastructure is at an all-time high. A majority of enterprises and service providers are investing in GPU hardware to meet generative AI application development demand. Whether they are buying hardware and deploying it in a data center, or committing to long-term leases with GPU cloud providers, there is urgency to provide developers and data scientists with this expensive hardware. Unfortunately, building a platform to enable self-service consumption of accelerated computing hardware and AI and ML workbenches can be a one to two year project. As a result of these platform development delays, expensive hardware is underutilized — nearly a third of enterprises are utilizing less than 15% of GPU capacity.

“Our work with customers across high-stakes industries over the last two quarters has revealed that enterprises and GPU cloud providers are running into similar challenges. Both are looking for ways to speed up the delivery of accelerated computing hardware to developers and data scientists,” said Haseeb Budhani, CEO and co-founder of Rafay Systems. “The new Rafay Platform capabilities address this need, helping enterprises and GPU cloud providers speed the delivery of a PaaS experience in order to monetize their significant investments in accelerated computing infrastructure.”

Rafay Accelerates GPU Monetization With Standardized Platform Building Blocks

With Rafay, GPU cloud providers and enterprises can quickly launch production-ready AI services. Platform teams can now deliver much-needed services to developers and data scientists through a PaaS offering that enables self-service consumption of compute as well as AI and ML workbenches for fast experimentation and productization of AI-based applications.Newly added Rafay Platform capabilities include:

The new platform capabilities are now generally available to customers in the Rafay Platform.

Additional Resources

About Rafay Systems

Rafay builds infrastructure orchestration and workflow automation software that powers self-service compute consumption for Sovereign AI Clouds, Cloud Service Providers & large Enterprises. Customers leverage the Rafay Platform to orchestrate multi-tenant consumption of AI infrastructure along with AI platforms and applications such as AI-Models-as-a-Service, Accenture's AI Refinery, and other 3rd-party applications. The Rafay Platform provides the automation and governance capabilities that platform teams need to standardize Kubernetes toolsets and workflows. With Rafay, platform teams at MoneyGram, GuardantHealth, Verizon and many other companies are operating Kubernetes environments across data centers, public cloud and Edge environments with centralized visibility and access control, environment standardization, and guardrail enforcement. As a result, platform teams are able to deliver self-service and automation capabilities that delight developer and operations teams. For more information, please visit www.rafay.co.