Product updates

Introducing Ori Serverless Kubernetes

Meet Ori Serverless Kubernetes, an AI infrastructure service that brings you the best of Serverless and Kubernetes by blending powerful scalability, simple management and affordability.

José Domingos

Jul 15, 2024

Product updates

The emergence of generative AI, especially large language models (LLMs) have made AI accessible to a wide range of startups and businesses. However, to truly capture the value of this opportunity, AI-focused businesses need to find a balance between scalable infrastructure and cost-effectiveness. Kubernetes has traditionally been a powerful framework to run scalable and reliable applications while also helping right-size computing infrastructure. But Kubernetes is often perceived as complex by decision makers and developers alike, and many of them often prefer the simplicity of Serverless tools such as Platform-as-a-Service(PaaS) and Function-as-a-Service(FaaS).

Ori Serverless Kubernetes, best of both worlds

Introducing Ori Serverless Kubernetes, an AI infrastructure service that combines powerful scalability, simple management and affordability to help AI-focused businesses train, deploy and scale world-changing AI/ML models. Serverless Kubernetes blends the scalability, reliability and flexibility of Kubernetes with the ease-of-use of serverless platforms.

The following capabilities are included in Ori Serverless Kubernetes out of the box:

Powerful GPUs and ML frameworks on-demand: Choice of NVIDIA^® H100, L4, L40S, and V100S. Leverage pre-configured ML frameworks or bring your Helm charts to build with the ML tools of your choice.
Autoscaling that adapts your AI infrastructure to user demand, while optimizing costs.
Full command-line access to the control plane via kubectl provides developers enhanced flexibility. Users leverage existing app catalogs via Helm charts and can access multiple namespaces within a cluster.
Simplicity of Serverless with control plane isolation. Developers can relax knowing that Ori fully manages and balances the load of the clusters, while complete isolation via a separate control plane keeps your data secure.
Familiarity of Vanilla Kubernetes, no refactoring or learning curve for Kubernetes users.

Transform your AI infrastructure

Power of Kubernetes, minus the complexity: Experience the benefits of a full-scale control plane, enhanced security via complete control isolation, and access pre-package app catalogs, but with a serverless implementation that is designed to simplify your MLOps.
Get your AI/ML models to market faster: No waiting for GPUs and no approvals needed. Pick from a range of high-performance GPU models and create a cluster in less than a minute. Leverage Helm charts and tools of your choice without needing to adapt them to our platform.
Scale your infrastructure, not your costs: Autoscaling of GPU clusters helps you pay only for what you use. Scale up or down based on demand and make the most of your GPU budgets.

Ori customers, such as Snowcrash who tried out the new Serverless Kubernetes love the simplicity it infuses into their AI/ML infrastructure.

Finally, no more worrying about GPU slices, or figuring out if your pod is visible by other clients, or wondering if you can run sidecars with your ML apps! Simple and straightforward!

See Serverless Kubernetes in action

Check out how easy it is to spin up a serverless cluster and deploy your model.

Our transparent, pay-per-use pricing means you only pay for the resources you consume, helping you optimize GPU costs. Learn all about Ori Serverless Kubernetes in our documentation.

How to know if Serverless Kubernetes is right for you

Manage infrastructure manually: If your team is spending a significant portion of their time on scaling your infrastructure up/down or updating virtual machines with essential packages and drivers, Serverless Kubernetes simplifies infrastructure management so you can focus on building and serving AI/ML models.
Experience underutilization of GPUs: If you are overprovisioning GPUs to handle unexpected inference traffic, large-scale data events or batch-processing jobs, Serverless Kubernetes is a better fit for your AI/ML workloads than virtual machines. Also, when you need to run large-scale training and inference workloads virtual machines are slower to scale compared to Kubernetes.
If you’re already using Kubernetes: Managing Kubernetes infrastructure for your AI/ML workloads needs considerable DevOps support and Kubernetes-trained ML engineers. Ori Serverless Kubernetes, on the other hand, takes care of cluster management and load balancing to make it easier for you, especially if you’re running a lean AI/ML team.

Build, deploy, and scale AI on Ori Serverless Kubernetes

Spin up a serverless GPU cluster on Ori today! If you’d like to have a conversation about using Ori for your business, contact our sales team.

Introducing Ori Serverless Kubernetes

Ori Serverless Kubernetes, best of both worlds

Transform your AI infrastructure

See Serverless Kubernetes in action

How to know if Serverless Kubernetes is right for you

Build, deploy, and scale AI on Ori Serverless Kubernetes

Subscribe for more news and insights

Similar posts

AI Meets Rugby: Framesports automates match analysis on Ori Serverless Kubernetes

nCompass (YC W24) leverages Ori Serverless Kubernetes to make AI inference 2x cost-effective

Deploy and scale LLMs on Ori Serverless Kubernetes with Ollama and Open WebUI

Introducing Ori Serverless Kubernetes

Ori Serverless Kubernetes, best of both worlds

Transform your AI infrastructure

See Serverless Kubernetes in action

How to know if Serverless Kubernetes is right for you

Build, deploy, and scale AI on Ori Serverless Kubernetes

Subscribe for more news and insights

Similar posts

AI Meets Rugby: Framesports automates match analysis on Ori Serverless Kubernetes

nCompass (YC W24) leverages Ori Serverless Kubernetes to make AI inference 2x cost-effective

Deploy and scale LLMs on Ori Serverless Kubernetes with Ollama and Open WebUI

Join the new class of AI infrastructure!