Scaling AI workloads with Kubernetes


w/ Tsavo Knott, Shivay Lamba & Kunal Kushwaha

Recorded: 3rd July 2024

As AI models, particularly Large Language Models (LLMs), grow in size and complexity, deploying them becomes increasingly challenging. We explored the challenges involved and effective strategies for managing LLM/AI deployments on Kubernetes, with a focus on cost-efficiency and scalability.

Given the overlap between AI and DevOps, which spans ML models, deployment scripts, and configurations, we also explored how to streamline these processes effectively. By leveraging live context from various documents and terminal screens via Pieces Copilot, we demonstrated real-time assistance and troubleshooting tips for deployment challenges.

What we discussed:

  • Best practices for configuring Kubernetes for AI workloads.
  • Solutions to facilitate smoother AI deployments.
  • Handling common issues and pitfalls in AI model deployment on Kubernetes.
  • Live demo showing how to deploy an AI model on Civo using GPU nodes.
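As a rough illustration of the GPU scheduling discussed above, here is a minimal sketch of a Kubernetes Deployment that requests a GPU for an inference container. It assumes the cluster (e.g. a Civo cluster with GPU nodes) has the NVIDIA device plugin installed so that `nvidia.com/gpu` is an allocatable resource; the deployment name and container image are hypothetical placeholders.

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: llm-inference            # hypothetical name
spec:
  replicas: 1
  selector:
    matchLabels:
      app: llm-inference
  template:
    metadata:
      labels:
        app: llm-inference
    spec:
      containers:
        - name: model-server
          image: example/llm-server:latest   # placeholder image
          resources:
            limits:
              nvidia.com/gpu: 1  # schedules the pod onto a GPU node
```

Because `nvidia.com/gpu` is an extended resource, it only needs to appear in `limits`; the scheduler will place the pod on a node advertising at least one free GPU.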