Loading…
Attending this event?
June 19-20, 2024
Paris, France
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for AI_dev Europe to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in CEST (Central European Summer Time) UTC/GMT +2 hours. To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Thursday, June 20 • 15:00 - 16:15
Workshop: Optimizing Kubernetes Cluster Scaling for Advanced Generative Models - Shivay Lamba, Couchbase & Shivanshu Raj Shrivastava, Adyen

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

As more organisations are seeking automation, the demand for scalable infrastructure capable of supporting complex generative models is at an all-time high. Kubernetes has emerged as a leading solution for orchestrating and managing containerized applications, including machine learning workloads. Some recent enhancements in Kubernetes have provided a way to effectively use GPU resources, and build drives for GPU to dynamically share the resources within a Kubernetes node. This talk will delve into the intricacies of scaling Kubernetes clusters to accommodate the computational demands of cutting-edge generative models.
We will explore how we have evaluated and adopted various tools and frameworks such as Flyte, MLflow, Kubeflow, Kserve, vLLM, and Argo to seamlessly integrate into our hybrid Kubernetes clusters to streamline the development, deployment, and management of generative models. Attendees will gain insights into the unique features and capabilities of each tool and understand how they contribute to the scalability, efficiency and maintainability of our hybrid Kubernetes clusters.

Speakers
avatar for Shivay Lamba

Shivay Lamba

Developer, Couchbase
Shivay Lamba is a software developer specializing in DevOps, Machine Learning and Full Stack Development. He is an Open Source Enthusiast and has been part of various programs like Google Code In and Google Summer of Code as a Mentor and is currently a MLH Fellow. He has also worked... Read More →
avatar for Shivanshu Raj Shrivastava

Shivanshu Raj Shrivastava

Software Engineer, SigNoz
Shivanshu is a Software Engineer at SigNoz, working on building an open telemetry native observability solution. He has a keen interest in open-source communities and building long-term sustainable technologies. He is a CNCF ambassador and a member of open-source projects like OpenTelemetry... Read More →


Thursday June 20, 2024 15:00 - 16:15 CEST
Sorbonne Descartes & Lutèce (Level 5)
  AI Systems & Performance
  • Audience Experience Level Any
Feedback form isn't open yet.