Loading…
Tuesday November 12, 2024 3:20pm - 3:45pm MST
This talk dives into how microservices architectures and Istio service mesh in Kubernetes empower developers to build scalable, resilient, and future-proof GenAI applications. We'll explore the challenges of running GenAI Application such as selecting the most suitable Large Language Model (LLM), leveraging effective embedding models, and deploying a robust vector database (DB), etc. We'll demonstrate how to overcome them using advanced Kubernetes strategies.

Key topics include:
• Discover how breaking down GenAI Application into microservices enhances flexibility, scalability, and maintainability.
• Learn how service mesh facilitates dynamic updates and changes to GenAI components without impacting user traffic.
• Showcase practical strategies for integrating these technologies in Kubernetes, supported by real-world examples like how to dynamically compose a GenAI application using different microservices, and how to change models or pipelines dynamically on Kubernetes.
Speakers
avatar for Iris Ding

Iris Ding

Cloud software architect, Intel
Iris Ding is a cloud software architect at Intel and has a rich background in open source development, cloud computing, Generative AI(GenAI), middleware development and design. Her current focus is intersection of GenAI and cloud computing and is leading development for Open Platform... Read More →
avatar for Lin Sun

Lin Sun

Head of Open-Source; CNCF TOC member, Solo.io
Lin is the Head of Open Source at Solo.io, contributing to open source full time. She is a CNCF TOC member and ambassador, an Istio core maintainer and leader. She is an international speaker in various tech conferences and blogs frequently about her perspective of service mesh and... Read More →
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Hyatt Regency | Level 2 | Salt Lake Ballroom C
Feedback form is now closed.

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link