Loading…
or to bookmark your favorites and sync them to your phone or calendar.
strong>Intermediate [clear filter]
Tuesday, November 12
 

9:10am MST

The Power of Apache Pulsar. Harnessing Dapr to Build High Scale Messaging at FICO® - Hugo Smitter, FICO
Tuesday November 12, 2024 9:10am - 9:35am MST
Learn about FICO’s experience migrating from Apache Kafka to Apache Pulsar with the help of Dapr to build high scale messaging services into our platform. We cover how we harnessed Dapr for seamless, efficient, and scalable Pulsar integration.

We’ll explore:
  • An overview of the FICO® Platform and strategic business objectives. 
  • Key contributions to the Dapr community, enhancing its support for Apache Pulsar.
  • How Dapr abstracts the Pulsar client API, making it easier to build, deploy, and manage event-driven applications.
  • FICO’s experience, including benchmarks, performance metrics, and practical migration tips.

Join us to unlock the potential of Apache Pulsar with Dapr and transform your event management and processing capabilities. This session provides practical knowledge, real-world examples, and technical details on building high scale event driven services.
Speakers
avatar for Hugo Smitter

Hugo Smitter

Platform Architect at FICO, FICO
Senior Enterprise and Solutions Architect with international experience in multiple industries. Track record performing leadership roles as: chief architect, solution architect, platform architect, systems integrator, information architect, delivery excellence auditor, consultant... Read More →
Tuesday November 12, 2024 9:10am - 9:35am MST
Salt Palace | Level 1 | 151 G

9:10am MST

How to Grow Your Internal Backstage Community with Some Open-Sauce (Source) 🍝 - Djamaile Rahamat & Mitchell Hentges, Spotify
Tuesday November 12, 2024 9:10am - 9:35am MST
You have your Backstage instance set up, the latest plugins installed, and Backstage is the hottest thing in your company. It's so popular that other teams are now eager to contribute and even develop their own plugins. Before you know it, you have a Backstage instance being worked on by multiple teams. While this is great, it also brings its own set of challenges. How do you define ownership? How do you review every pull request that comes in? How do you clean up unused dependencies, and more? Don’t worry! There are many tools in the Backstage open-source ecosystem that can help you address these issues. In this talk, we will cover additional tools from the open-source community that you can use to improve the developer experience for Backstage itself. We’ll discuss how to integrate these tools and also share how we are utilizing them at Spotify.
Speakers
avatar for Mitchell Hentges

Mitchell Hentges

Senior Engineer, Spotify
As the previous Build Team Lead of of Mozilla Firefox, I have a history of focusing on developer experience and system performance. This applies smoothly to the recent improvements my team has been applying to Backstage! I'm always happy to talk about software tools, system design... Read More →
avatar for Djamaile Rahamat

Djamaile Rahamat

Software engineer, Spotify
Djamaile is a Software Engineer at Spotify, focused on Backstage.
Tuesday November 12, 2024 9:10am - 9:35am MST
Salt Palace | Level 1 | Grand Ballroom H

9:10am MST

Confluent's Multi-Cloud Journey to Cilium: Pitfalls and Lessons Learned - Nimisha Mehta & Alvaro Aleman, Confluent
Tuesday November 12, 2024 9:10am - 9:35am MST
Confluent Cloud is a data streaming platform built on thousands of Kubernetes clusters across AWS, Azure & GCP. Confluent migrated clusters to use Cilium for its advanced security features like transparent encryption and DNS name-based network policies, along with performance, scalability & observability improvements. The main challenge was executing a live migration without disrupting stateful workloads, complicated by the risks of replacing a low-level component like the CNI. The process required meticulous planning to ensure intra-cluster connectivity during migration, while accommodating each cloud provider's unique network config. This talk shares the journey of migrating to Cilium, highlighting obstacles and lessons learned. We will explore uninstalling pre-existing CNIs, setting up Cilium & addressing cloud-specific issues to maintain connectivity. Benefits like transparent encryption, policies, and Hubble observability, along with the challenges faced, will also be discussed.
Speakers
avatar for Alvaro Aleman

Alvaro Aleman

Software Engineer, Confluent
Alvaro is a software engineer with a deep passion for infrastructure and open source. He has been working with Kubernetes since 2017 and is a maintainer of the popular controller-runtime library.
avatar for Nimisha Mehta

Nimisha Mehta

Software Engineer, Confluent
Nimisha is a Software Engineer working on Confluent's Kubernetes Platform team. She has been in the cloud infra space for over 5 years, and has been an end-user of Kubernetes and many other open source technologies. Apart from learning about distributed systems and infrastructure... Read More →
Tuesday November 12, 2024 9:10am - 9:35am MST
Salt Palace | Level 1 | Grand Ballroom B
  Cilium + eBPF Day, Use Cases

9:10am MST

SkyRay: Seamlessly Extending KubeRay to Multi-Cluster Multi-Cloud Operation - Anne Holler, Elotl
Tuesday November 12, 2024 9:10am - 9:35am MST
Ray is a unified framework for scaling AI applications from a laptop to a cluster. KubeRay supports the creation, deletion, and scaling of Ray clusters on K8s, along with managing Ray jobs and services on the Ray clusters. This talk introduces SkyRay, in which KubeRay is extended towards the Sky computing model via interoperation with a multi-cluster fleet manager. With SkyRay, each Ray cluster is seamlessly scheduled onto a cloud K8s cluster suited to the Ray cluster's resource needs and policy requirements. The policies can capture a variety of cluster characteristics, e.g., desired cloud provider, region, K8s version, service quality, and GPU type availability. Fleet manager policy updates can be used to trigger automatic migration of Ray clusters between K8s workload clusters. The talk presents several example use cases for SkyRay, including cluster selection for resource needs, service availability, development vs production cluster configuration, and K8s version upgrade.
Speakers
avatar for Anne Holler

Anne Holler

Chief Scientist, Elotl
Anne is Chief Scientist at Elotl. She is interested in resource efficiency. She worked on Uber's Michelangelo Machine Learning platform, on Velocloud's SD-WAN management, on VMware's Distributed Resource Schedulers for servers and storage, on performance analysis for VMware, on Transmeta's... Read More →
Tuesday November 12, 2024 9:10am - 9:35am MST
Salt Palace | Level 1 | Grand Ballroom A

9:10am MST

Bridging the DevOps to Data Divide with a Common Cloud Native Stack - Elad Hirsch, Terasky & Mey Beisaron, Forter
Tuesday November 12, 2024 9:10am - 9:35am MST
Despite DevOps being the norm and best practice for software development today, many organizations still treat their data differently from code, neglecting these well-established best practices over many years. This session will challenge this outdated approach and advocate for defining better “Data as Code” strategies, emphasizing the importance of applying the same rigor and methodologies used in software development to data operations. This session will teach how to run database services as part of Kubernetes infrastructure, just like current microservices, and apply the same SDLC methodologies to data operations as we do to code management with a common cloud native stack. In this session attendees will learn how to bridge the gap with a set of powerful CNCF Tools, including managing databases in K8s, how to embrace data as code, good GitOps practices and more.
Speakers
avatar for Mey Beisaron

Mey Beisaron

Senior Platform Engineer, Forter
Mey is a Senior Platform Engineer and a public speaker who brings Star Wars geekery to everything she does. As a Backend Developer, Mey has developed in multiple programming languages, including Nodejs, Python, Groovy, and her favorite, Clojure. Today Mey is a Senior Platform Engineer... Read More →
avatar for Elad Hirsch

Elad Hirsch

Tech Lead, Terasky
Elad is a Tech Lead at TeraSky CTO Office, a global provider of multi-cloud, cloud-native, and innovative IT solutions. With an experience in principal engineering positions at Agmatix, Jfrog, IDI, and Finjan Security, his primary areas of expertise revolve around software architecture... Read More →
Tuesday November 12, 2024 9:10am - 9:35am MST
Salt Palace | Level 2 | 250 A

9:25am MST

Navigating the No-Code to Full-Code Spectrum - a Platform Engineering Journey - Jared Watts, Upbound & Maximilian Blatt, Accenture
Tuesday November 12, 2024 9:25am - 9:50am MST
YAML is prolific in the Kubernetes ecosystem as it is the language of choice to express intent in a declarative manner. As more companies build internal developer platforms (IDPs) on Kubernetes, many have found that building for enterprise needs can reach a level of complexity of hundreds of thousands of “declarative” lines, resulting in a platform that is challenging to maintain. But we don’t have to rely solely on a no-code approach to build our platforms - there is an entire spectrum from no-code to full-code available to us. In this talk, we will explore this spectrum in depth through the lens of a platform team’s journey to build an enterprise grade infrastructure control plane on Kubernetes with Crossplane. We will share all the lessons learned starting from declarative no-code and then evolving over time to a full-code approach using Golang, as well as how this journey had a major impact on their developer experience, testing, operations, stability, and more.
Speakers
avatar for Maximilian Blatt

Maximilian Blatt

Cloud Advisory Consultant, Accenture
Maximilian Blatt is a Crossplane expert, platform engineer and consultant at Accenture Germany. He has mutliple years of experience working with Crossplane, Kubernetes and is maintainer of several Crossplane-related open-source projects.
avatar for Jared Watts

Jared Watts

Founding Engineer, Upbound
Jared Watts is a Founding Engineer at Upbound, where he is working on advancing cloud-native computing by enabling anyone to build their own cloud platform. He is also a co-creator of the open source Crossplane (https://crossplane.io) and Rook (https://rook.io) projects. Prior to... Read More →
Tuesday November 12, 2024 9:25am - 9:50am MST
Salt Palace | Level 1 | Grand Ballroom G

9:45am MST

Tales from the Crypt: Application Packaging and Delivery Nightmares - Scott Rigby, Helm Maintainer & Sarah Christoff, Defense Unicorns
Tuesday November 12, 2024 9:45am - 10:10am MST
Join Sarah Christoff and Scott Rigby, maintainers of widely used CNCF tools Porter and Helm, for a cringe-worthy yet enlightening journey through the dark alleys of application packaging and delivery anti-patterns. In this session, we'll delve into the spooky stories of real-world application and configuration delivery nightmares that left teams haunted by unnecessary complexity, perplexing custom code, and security skeletons in the closet. With a blend of humor and hard-learned lessons, we'll expose the ghosts of past mistakes and share the exorcism techniques that could have saved us all some sleepless nights. Whether you're a seasoned Kubernetes operator or new to the cloud-native realm, come prepared to shiver, laugh, and learn how to avoid the traps that have ensnared so many before.
Speakers
avatar for Sarah Christoff

Sarah Christoff

Software Engineer, Defense Unicorns
Sarah is a software engineer at Defense Unicorns who loves making complex code more digestible. She is the self-proclaimed founder of the Leslie Lamport fan club. When she's not bugbusting, she is running her animal rescue and competing in triathlons. She believes code should be like... Read More →
avatar for Scott Rigby

Scott Rigby

Senior Cloud Solutions Architect, NASA / Navteca
Scott is an artist, engineer & dad, collaborating on a different kind of world. Into collective art, activism, therapy & open source nerdy stuff. Scott is a Cloud Native Ambassador, speaker, organizer of CNCF community events including the New York Kubernetes Meetup, and international... Read More →
Tuesday November 12, 2024 9:45am - 10:10am MST
Salt Palace | Level 1 | 151 G

9:45am MST

Building Zero Trust with Envoy - Florin Coras & Pradheep Shrinivasan, Cisco
Tuesday November 12, 2024 9:45am - 10:10am MST
In this talk, we will guide you through our journey of developing an enterprise-grade, multi-tenant Clientless Zero Trust Network Access (ZTNA) proxy using Envoy. Our proxy operates across more than 30 data centers and can handle up to 4.5 Gbps per client connection. Join us as we reflect on various topics, including the decision to choose Envoy, the advantages of running Envoy on top of VPP, the challenges of supporting over 10k upstream destinations, and the decision to move away from WASM. Finally, we will conclude the talk by discussing new use cases we are planning to implement using Envoy.
Speakers
avatar for Florin Coras

Florin Coras

Principal Engineer, Cisco
Florin Coras is a Principal Engineer at Cisco where he focuses on user space host stacks, network virtualization and programmable overlays. He has contributed to a number of open source projects including FD.io, EnvoyProxy and OpenDaylight. He is a VPP maintainer, co-developer of... Read More →
avatar for Pradheep Shrinivasan

Pradheep Shrinivasan

Technical Lead, Cisco
Pradheep Shrinivasan is a technical lead in Cisco currently leading the development of Zero trust network. Current interests are Zero Trust networks, Security and distributed systems.
Tuesday November 12, 2024 9:45am - 10:10am MST
Hyatt Regency | Level 2 | Salt Lake Ballroom C
  EnvoyCon, Envoy in production case studies

10:30am MST

Panel: Dynamic Configuration and Scaling of VPN Concentrator and Envoy SASE Proxy in Multi-Tenant Edge - Srinivasa Addepalli & Ritu Sood, Aryaka; Mrittika Ganguli & Jeff Shaw, Intel Corporation
Tuesday November 12, 2024 10:30am - 11:05am MST
This discussion shows a framework that integrates a VPN Concentrator with Envoy-based Secure Access Service Edge (SASE) proxy, leveraging APIs for configuration and management of network functions within containers. This is designed to dynamically scale. The VPN Concentrator (VPNC) establishes secure IPSec tunnels that encapsulate data traffic, providing privacy and protection against threats. As no. of tenants or volume of traffic increases, the need for additional VPNCs, IPSec tunnels and proxies arise. The SASE proxy is a network filter at the edge, enforcing security policies, optimizing traffic flow, providing a zero-trust network access to cloud based services. Number of proxies is changed as a ratio-based scaling approach to IPSec tunnels or tenants based on metrics like : • Throughput • Latency • Error rates • Active, denied connections • Security breaches • No. of active user sessions. • No. of route changes for loadbalancing • Envoy utilization with/without optimizations
Speakers
avatar for Mrittika Ganguli

Mrittika Ganguli

Architect, Principal Engineer, Intel Corporation
Mrittika Ganguli is a Principal Engineer and Director, Cloud Native Pathfinding in Intel’s Network and Edge Architecture team. She has 25+ years of experience in cloud hardware and software management, network processing control and data plane, cloud orchestration, telemetry and... Read More →
avatar for Srinivasa Addepalli

Srinivasa Addepalli

CTO, Aryaka
Srini Addepalli is Aryaka's CTO with a strong background in edge computing, network security. He was instrumental in driving open-source initiatives at Intel, leading projects such as Service Mesh, cloud-native SASE framework, and Distributed HSM. With experience as a Fellow at Freescale... Read More →
avatar for Ritu Sood

Ritu Sood

Distinguished Engineer, Aryaka
Ritu Sood is a Distinguished Engineer working at Aryaka. She has over 10 years of experience in cloud-related technologies. During this time she worked and contributed on open source projects like Openstack, ODL, ONAP, Nodus and EMCO.
JS

Jeff Shaw

Cloud Software Architect, Intel
Jeff Shaw works on packet processing at Intel.
Tuesday November 12, 2024 10:30am - 11:05am MST
Hyatt Regency | Level 2 | Salt Lake Ballroom C

10:40am MST

Applying CloudEvents to Level up Application Communications - Vyom Yadav, Canonical & Evan Anderson, Stacklok
Tuesday November 12, 2024 10:40am - 11:05am MST
Minder is an open-source project to secure software supply chains, starting with repository security. While Minder provides a gRPC API for user interactions, it also operates continuously in the background, detecting and reacting to supply chain events. In this talk, we’ll describe how this background operation evolved from ad-hoc webhooks into an asynchronous event-driven platform with multiple services. Our journey began with asynchronous flows, starting with Go channels and then using Watermill with a SQL database for events. This worked great for a single binary but had issues when sharing the database between two applications – each attempted to enforce its own schema on the shared database. The problem gets worse when trying to interoperate between languages – the Watermill library we used doesn’t support Python or NodeJS. The next step we’re undertaking is to migrate to using CloudEvents – a common schema which can be evolved independently of the specific components.
Speakers
avatar for Evan Anderson

Evan Anderson

Software Engineer, Stacklok
Co-founder and maintainer on Knative project. Member of sigstore-oncall. Previously worked on Google Compute Engine and Serverless (App Engine, Functions) and in SRE. Principal engineer at Stacklok. Ex-Google, ex-VMware. Author of Building Serverless Applications on Knative by O'Reilly... Read More →
avatar for Vyom Yadav

Vyom Yadav

CNCF Ambassador, Canonical
Vyom is a recent graduate and CNCF Ambassador working on Canonical's Security Team. His focus areas include vulnerability management, patching, and software supply chain security. Vyom has been involved in open source since his freshman year, participating in programs like Google... Read More →
Tuesday November 12, 2024 10:40am - 11:05am MST
Salt Palace | Level 1 | 151 G

10:40am MST

Dog Food Delight: How Argo Workflows Eats Its Own CI - Denise Schannon, Loft Labs & Tim Collins, Pipekit
Tuesday November 12, 2024 10:40am - 11:05am MST
Upstream, Argo Workflow’s CI uses Github Actions. In a true "eating our own dog food" moment, we embarked on a journey to migrate Argo Workflows CI to Argo Workflows itself on our local fork.

Our goals were:
- Boost Efficiency & Reliability: Streamline CI processes for faster development cycles.
- Fork Power: Enable quicker patch releases on our Argo Workflows fork.
- Real-World Inspiration: Provide a practical CI example for Argo Workflows users.

Dive deeper with us:
- Uncover the journey involved in migrating from GitHub Actions to Argo Workflows.
- Explore the challenges and solutions of running Kubernetes inside Kubernetes.
- Learn how we tackled various technical hurdles encountered during the migration.
- Discover the performance, cost, and reliability gains achieved through this self-hosted CI solution.
Speakers
avatar for Tim Collins

Tim Collins

Staff Infrastructure Engineer, Pipekit
Tim is a Staff Infrastructure Engineer at Pipekit, a control plane for Argo Workflows that enables massive data pipelines in minutes, saving engineering time and cloud spend. He has a keen interest in open source technologies and is an active member of the Argo community, often found... Read More →
avatar for Denise Schannon

Denise Schannon

VP of Engineering, Loft Labs
Denise Schannon, VP of Engineering at Loft Labs, is a seasoned engineering and product management leader, specializing in developing open-source software for start-up environments. She excels in scaling engineering teams, delivering superior products, and making Kubernetes more accessible... Read More →
Tuesday November 12, 2024 10:40am - 11:05am MST
Salt Palace | Level 2 | 254 B
  ArgoCon, Software Delivery

10:40am MST

Backstage in the Board Room: What Makes a Digital Portal Helpful for C-Level Executives? - Olivier Liechti, Avalia Systems
Tuesday November 12, 2024 10:40am - 11:05am MST
Backstage is typically adopted by platform teams to solve engineering problems. Securing extended funding can be challenging if its broader business value is not clearly articulated. We present 2 case studies from the energy and insurance sectors where we have used Backstage to address questions from business stakeholders: Is our tech organization aligned with strategic objectives? Is our information system enabling key initiatives? Can we explain and measure how our tech teams deliver value? By driving the initiative with executive-level sponsors and extending Backstage with "business-oriented" features, we have demonstrated its value beyond an IDP platform. We share learnings from two projects, and demonstrate a set of new features: 1) the augmentation of the (static) System Model with (dynamic) concepts such as initiatives, 2) the design of a flexible measurement framework (for KPIs), and 3) the integration of Enterprise Architecture concepts and their visual representation.
Speakers
avatar for Olivier Liechti

Olivier Liechti

CTO, Avalia Systems
Olivier is CTO at Avalia Systems, which he co-founded in 2016. His background is in applied research in software engineering, with a particular interest in human factors. Previously, Olivier was full professor at the University of Applied Sciences Western Switzerland. Olivier holds... Read More →
Tuesday November 12, 2024 10:40am - 11:05am MST
Salt Palace | Level 1 | Grand Ballroom H

10:40am MST

Multitenancy and Fairness at Scale with Kueue: A Case Study - Aldo Culquicondor, Google & Rajat Phull, Apple
Tuesday November 12, 2024 10:40am - 11:05am MST
Developed by the Kubernetes community in collaboration with the ecosystem, Kueue augments k8s and ClusterAutoscaler to provide an E2E batch system. Kueue implements job queueing, deciding when jobs should wait and when they should start or be preempted, based on quotas and a hierarchy for sharing resources among teams. An exciting addition in the v0.7 release is fair sharing, designed to support large ML platforms serving multiple teams. Kueue allows platforms to model their teams and achieve a high utilization of resources, while sharing cost and providing equitative access to unused resources. Teams can always reclaim their guaranteed quotas via preemption. The Kueue v0.7 and the Kubernetes v1.31 releases also include performance optimizations to achieve high throughput. In this talk, you will learn about the challenges faced during design and implementation of fair sharing and preemption, about this system running in production, and the plans to support complex hierarchies.
Speakers
avatar for Aldo Culquicondor

Aldo Culquicondor

Sr. Software Engineer, Google
Aldo is a Senior Software Engineer at Google. He works on Kubernetes and Google Kubernetes Engine, where he contributes to kube-scheduler, the Job API and other features to support batch, AI/ML and HPC workloads. He is currently a TL at SIG Scheduling and an active member of WG Batch... Read More →
avatar for Rajat Phull

Rajat Phull

Engineering Manager, Apple
Rajat Phull is an Engineering Manager at Apple. He works in Machine Learning Platform team with a focus on GPU resource management, and ML training orchestration at scale using Kubernetes.
Tuesday November 12, 2024 10:40am - 11:05am MST
Salt Palace | Level 1 | Grand Ballroom A

10:40am MST

Cloud Native PostgreSQL - Running PostgreSQL on Kubernetes - Peter Zaitsev, Coroot, FerretDB, Percona
Tuesday November 12, 2024 10:40am - 11:05am MST
Running PostgreSQL on Kubernetes is becoming increasingly popular, offering various methods for implementation. This talk will examine different operators that facilitate running PostgreSQL on Kubernetes. We'll also take a look at Neon PostgreSQL, specifically designed for cloud-native environments, and cover the most important best practices for effectively running PostgreSQL on Kubernetes.
Speakers
avatar for Peter Zaitsev

Peter Zaitsev

Founder, Coroot, FerretDB, Percona
Peter Zaitsev is an entrepreneur and co-founder of Coroot, Percona, FerretDB, and other tech companies. As one of the leading experts in Open Source strategy and database optimization, Peter has applied his technical knowledge and entrepreneurial drive to contribute as a board member... Read More →
Tuesday November 12, 2024 10:40am - 11:05am MST
Salt Palace | Level 2 | 250 A

10:40am MST

When Things Go Sideways: Troubleshooting the OTel Operator - Adriana Villela, Dynatrace & Reese Lee, New Relic
Tuesday November 12, 2024 10:40am - 11:05am MST
The OpenTelemetry (OTel) Operator is a great tool that helps make your life a little easier by managing OTel for you in your Kubernetes cluster, by: Managing the deployment of the OpenTelemetry Collector Managing the configuration of a fleet of OpenTelemetry Collectors via OpAMP integration Injecting and configuring auto-instrumentation into your pods But what happens when THINGS. DON’T. WORK??

In this talk, Adriana and Reese will cover:
  • An overview of the OTel Operator 
  • Common installation issues
  • Common auto-instrumentation issues
  • Common OTel Collector deployment issues
  • * …and how to tackle them all
Attendees will walk away from this session with a better understanding of how they can leverage the Operator, and be empowered to use it with confidence.
Speakers
avatar for Reese Lee

Reese Lee

Senior Developer Relations Engineer, New Relic
Reese Lee is a Senior Developer Relations Engineer at New Relic, where she is focused on enabling customers and colleagues on OSS via workshops, blog posts, and documentation. She enjoys figuring out solutions to technical problems, learning about interesting user stories and use... Read More →
avatar for Adriana Villela

Adriana Villela

Principal Developer Advocate, Dynatrace
Adriana Villela is a Principal Developer Advocate, helping companies achieve reliability greatness through Observability, SRE, & DevOps practices. Previously, she managed a Platform Engineering team & an Observability Practices team at Tucows. Adriana has worked at various large-scale... Read More →
Tuesday November 12, 2024 10:40am - 11:05am MST
Salt Palace | Level 2 | 255 B

10:40am MST

Migrating to Open Feature at Scale: 0 to Billions - Chetan Kapoor & Justin Abrahms, eBay
Tuesday November 12, 2024 10:40am - 11:05am MST
eBay, a seasoned 20+ year big-tech with ~4000 engineers adopted OpenFeature on its large scale decision making platform. This success story is more than simply a feature flag enablement, it's about simplifying a complex onboarding process, enabling flags and offering a 5-star developer experience! This 2023 highlight took two partners with complimenting skill-sets, a lot of trust and continuous iterations. We'll walk through the state of the world before the introduction of Open Feature, how we built an internal community around the mission and practical techniques that were used to establish leadership buy-in and accelerate momentum in such a large complex organization.
Speakers
avatar for Chetan Kapoor

Chetan Kapoor

Product Leader @ eBay, eBay
Chetan is a change-maker, OpenFeature evangelist and technical product leader at eBay, responsible for establishing a culture of data-informed decision making and building platform for experimentation and feature flags. Previous to eBay, he grew a B2C Chicago FinTech from 100M to... Read More →
avatar for Justin Abrahms

Justin Abrahms

Principal Engineer, Thrive Market
In recent years, I’ve spent the bulk of my work-related time helping large enterprises deliver software more successfully. I care a lot about open source, software delivery and solarpunk ethos.In my spare time, You’ll find me motorcycling, working in the wood & metal shops or... Read More →
Tuesday November 12, 2024 10:40am - 11:05am MST
Salt Palace | Level 2 | 250 D

10:40am MST

Portals and Platforms, Two Ps in a Pod? How Good Interfaces Make for Good Operability - Jorge Lainfiesta, Rootly & Abby Bangser, Syntasso
Tuesday November 12, 2024 10:40am - 11:05am MST
Platforms aren’t new, but intentional platform engineering is getting more widespread adoption. One area where organisations are investing heavily is the relationship between the “frontend” user interfaces that developers get value from (CLIs, portals, APIs) and the "backend" platform orchestration components that ops manage. What should you, as a platform engineer, expect from your investments on each side? Over time, maintaining a portal and your platform orchestration becomes an important cost. But don't fret! Your platform can (and should) manage this ongoing maintenance for you, allowing you to focus on what you care about: improving developer experience and extending platform capabilities. In this talk, Abby and Jorge will provide an overview of the challenges and solutions, and map this topic to the CNCF Platform Engineering Maturity Model. You will learn what you, as a platform engineer, should be demanding of your platform orchestration tooling and your portal.
Speakers
avatar for Abby Bangser

Abby Bangser

Principal Engineer, Syntasso
Abby is a Principal Engineer at Syntasso delivering Kratix, an open-source cloud-native framework for building internal platforms on Kubernetes. Her keen interest in supporting internal development comes from over a decade of experience in consulting and product delivery roles across... Read More →
avatar for Jorge Lainfiesta

Jorge Lainfiesta

Reliability Advocate, Rootly
Jorge is the author of the Linux Foundation Introduction to Backstage (LFS142) course and reliability advocate. He has a background in software engineering (ex-PayPal) and digital communication (UCLA). He's also a certified sommelier (CETT Barcelona).
Tuesday November 12, 2024 10:40am - 11:05am MST
Salt Palace | Level 1 | Grand Ballroom G

11:15am MST

⚡ Lightning Talk: Are You Really Ready to Adopt a Platform? - Atulpriya Sharma, InfraCloud Technologies
Tuesday November 12, 2024 11:15am - 11:25am MST
Everyone is talking about platforms and why not, they bring in a lot of benefits when it comes to agility and flexibility to your software development process. But are you really ready to adopt a platform? Implementing a platform isn't just about the tools, tech and integrations. There are people, processes and culture involved too. This talk will focus on organization readiness and present you with a checklist that'll help you check your preparedness to adopt a platform. Throughout the checklist, we'll focus on all the critical aspects right from identifying key stakeholders, investments, implementation and other critical processes that are linked to the successful adoption of your platform. The checklist will lead you to the Platform maturity model that will further help you evolve and mature your platform over time. So, are you really ready to adopt a platform? Join in to find the answer.
Speakers
avatar for Atulpriya Sharma

Atulpriya Sharma

Co-Chair Platforms-WG, TAG - App Delivery | CNCF Ambassador, InfraCloud Technologies
Manual tester turned developer advocate focusing on open source, cloud native technologies. I help organizations and their developers adopt related technologies by creating helpful and impactful content. My current interest areas at Platform Engineering and AIOps. I'm also a CNCF... Read More →
Tuesday November 12, 2024 11:15am - 11:25am MST
Salt Palace | Level 1 | Grand Ballroom G

11:15am MST

Empowering LLMs with Backstage: Broader Insights Driven by the Developer Portal - Niall Thomson, Amazon Web Service
Tuesday November 12, 2024 11:15am - 11:40am MST
Unlocking the potential of Generative AI for software development and operations requires access to diverse systems across an organization. In this session, we'll explore how Backstage, with its extensive ecosystem of plugins, can serve as a central hub for powering generative AI initiatives. We'll dive into how Backstage's ability to connect with aspects such as cloud cost management, security posture analysis, and incident management tools can provide valuable data to generative AI models. We'll present an architecture designed to leverage these existing community integrations to enrich AI models with contextual data and enable them to perform actions on behalf of the user in their existing single pane of glass.
Speakers
avatar for Niall Thomson

Niall Thomson

Principal Container Specialist Solutions Architect, Amazon Web Service
Niall is an Amazon Web Services solution architect specializing in container technology and Kubernetes. With a passion for building developer platforms that enhance efficiency and security he helps guide AWS customers on platform strategy and implementation, sharing his expertise... Read More →
Tuesday November 12, 2024 11:15am - 11:40am MST
Salt Palace | Level 1 | Grand Ballroom H

11:15am MST

LLM Powered Agents with Kubernetes - Hema Veeradhi & Shrey Anand, Red Hat
Tuesday November 12, 2024 11:15am - 11:40am MST
How would you build an LLM system to modify a Kubernetes deployment based on its live telemetry data stream? A vanilla LLM is not enough to solve this problem as it is limited to outdated training data and is prone to hallucinations. In this talk, we will explore the concept of Agents—a powerful framework for solving complex multi-level tasks using a LLM as its reasoning engine, supported by a suite of tools. These tools can be advanced calculators, real time web scrapers, domain knowledge extractors, etc. They include executable functions, RAG pipelines, APIs or other services that allow the agents to complete their tasks effectively. We will walk-through a demo that leverages Kubernetes services and Podman containerization techniques that enable the agent workflow. Attendees will learn how a Kubernetes based agent framework enhances the performance capabilities of LLMs, offering a scalable and autonomous solution for next-generation intelligent systems.
Speakers
avatar for Shrey Anand

Shrey Anand

Mr., Red Hat
Shrey Anand is a data scientist with over five years of experience in the field of AI / ML. He collaborates with the emerging technologies at Red Hat where he develops cutting-edge data science solutions to solve open source and business problems. As a strong advocate of open source... Read More →
avatar for Hema Veeradhi

Hema Veeradhi

Principal Data Scientist, Red Hat
Hema Veeradhi is a Principal Data Scientist working in the Emerging Technologies team part of the office of the CTO at Red Hat. Her work primarily focuses on implementing innovative open AI and machine learning solutions to help solve business and engineering problems. Hema is a staunch... Read More →
Tuesday November 12, 2024 11:15am - 11:40am MST
Salt Palace | Level 1 | Grand Ballroom A

11:15am MST

Perfect Match: Correlating Continuous Profiling with Distributed Tracing for Stronger Observability - Jonas Kunz & Christos Kalkanis, Elastic
Tuesday November 12, 2024 11:15am - 11:40am MST
Continuous profiling is a technique to collect stack trace granularity insight into production resource usage. It is something SREs and other engineers can enable without changes to the app, or knowing how it was compiled. This year, this powerful signal and a polyglot eBPF profiling agent were added to OpenTelemetry. Our talk explores how an existing OpenTelemetry system is better with profiling, specifically how distributed tracing fits into the picture. You'll see both tools in action on Kubernetes, including cross-service requests and how correlation of distributed traces and profiles let you answer more questions, specifically code level causality. We'll show how to leverage this data for resource utilization and even monitoring your carbon footprint. You'll leave with a concrete understanding of continuous profiling, how it relates to OpenTelemetry and how these tools combine to reduce time while adding more understanding of your Kubernetes workloads, from kernel to code.
Speakers
avatar for Christos Kalkanis

Christos Kalkanis

Elastic
Christos is a principal engineer at Elastic, a maintainer for the OpenTelemetry Profiling SIG and a co-author of the donated OpenTelemetry profiling agent previously known as the Elastic Universal Profiling agent. After more than a decade of focusing on cybersecurity offense he moved... Read More →
avatar for Jonas Kunz

Jonas Kunz

Jonas Kunz, Elastic
I work as (primarily) Java Developer at Elastic, focusing on the Elastic APM Java-agent and our Java OpenTelemetry Distribution. While I love the safety of managed languages, I also enjoy occasionally visiting the native side of things. I'm an active contributor to the OpenTelemetry... Read More →
Tuesday November 12, 2024 11:15am - 11:40am MST
Salt Palace | Level 2 | 255 B

11:15am MST

Experimentation Programs at Scale: Lessons Learned from Top Companies - Graham McNicoll, GrowthBook
Tuesday November 12, 2024 11:15am - 11:40am MST
Running a few experiments is easy, especially when using feature flags. However, scaling up experimentation and adopting an experimentation-driven approach across an entire organization is much more challenging. How can the frequency of experiments be increased while maintaining quality and trustworthy results? This talk will address this question by exploring the methods top companies use to run their experimentation programs at massive scales. It will start with an introduction to experimentation-driven product development and discuss the advantages of these techniques. Then, the discussion will cover how experimentation programs can go wrong and, finally, how top companies structure their programs to solve these issues and unlock experimentation at a large scale.
Speakers
avatar for Graham McNicoll

Graham McNicoll

Co-founder, GrowthBook
Graham is the co-founder and CEO of GrowthBook, an open-source feature flagging and AB testing platform that is part of the open feature project and used by thousands of companies. Before starting GrowthBook, Graham was the CTO of Education.com for six years. Graham is a three-time... Read More →
Tuesday November 12, 2024 11:15am - 11:40am MST
Salt Palace | Level 2 | 250 D

11:45am MST

⚡ Lightning Talk: The Hidden Costs of Feature Flags: Understanding and Managing Adoption Challenges - Shreya Shivratriwar, Software Engineer
Tuesday November 12, 2024 11:45am - 11:55am MST
Feature flags offer powerful control over software releases, but they introduce hidden complexities in version management and user adoption that can catch development teams off guard. As application states multiply and the definition of a "version" blurs, teams struggle with maintaining compatibility and ensuring smooth updates across diverse platforms and user segments. To address these challenges, we propose a strategy of treating feature flag rollouts as code changes, favoring smaller, more frequent updates over large-scale alterations. This talk will unveil the often-overlooked costs of feature flags, explore their impact on versioning and user experience, and provide attendees with practical strategies for mitigating these issues. By the end, you'll have a clearer understanding of how to navigate the complex landscape of feature flag implementation, balancing the benefits of flexibility with the need for manageable, user-friendly software evolution.
Speakers
avatar for Shreya Shivratriwar

Shreya Shivratriwar

Software Engineer, N/A
Shreya is an engineering student and AI researcher from India, deeply involved in AI and its applications. Passionate about technology, she enjoys coding, exploring new AI advancements, and contributing to open-source projects. Shreya is also focused on AI ethics and aims to create... Read More →
Tuesday November 12, 2024 11:45am - 11:55am MST
Salt Palace | Level 2 | 250 D

11:50am MST

Building Real-World Applications with Dapr: A Startup Developer's Journey - Whit Waldo, Innovian
Tuesday November 12, 2024 11:50am - 12:15pm MST
In today’s developer ecosystem, numerous tools and frameworks exist, but often, only a few core capabilities are needed. This is where Dapr excels. As a polyglot framework and CNCF incubating project since 2021, Dapr provides essential building blocks to abstract dependencies, enabling rapid design and iteration of complex full-stack applications. As the founder and sole developer at Innovian, I’ll present a detailed architectural study of my product, showcasing how I leverage Dapr's building blocks - cryptography, state management, pub/sub messaging, service discovery/invocation, actors, workflows, job scheduling, and more. My goal is to demonstrate why Dapr is the foundation of my platform and why it could be valuable for others.
Speakers
avatar for Whit Waldo

Whit Waldo

CEO, Innovian
Whit Waldo is the founder and CEO of Innovian where he manages the development of innovative technical solutions for diverse businesses. With over 14 years of experience, Whit has excelled in roles such as CTO for various startups and most recently as Director of Product Development... Read More →
Tuesday November 12, 2024 11:50am - 12:15pm MST
Salt Palace | Level 1 | 151 G

11:50am MST

From Supercomputing to Serving: A Case Study Delivering Cloud Native Foundation Models - Autumn Moulder, Cohere
Tuesday November 12, 2024 11:50am - 12:15pm MST
Cloud native takes on new meaning in the AI and HPC domains. What does cloud native mean when your software is tightly coupled to hardware? When capacity is fixed, which assumptions start to break down? How can you flex GPUs batch training workloads and inference? Join us for a case study, demonstrating how a small team scaled ML infrastructure from a single cloud to multiple clusters across 4 cloud providers - in under 6 months. We’ll share unique multi-cloud challenges we uncovered around supercomputing infrastructure, cross cloud networking, capacity & quota management, batch workloads, FinOps, and observability. We will particularly highlight our experience using Kueue to manage fixed capacity across clouds & where Kubernetes still falls short for HPC workloads. Leave with a solid understanding of what it takes for an infrastructure team to support the lifecycle of a cloud native foundation model.
Speakers
avatar for Autumn  Moulder

Autumn Moulder

Director of Infrastructure & Security, Cohere
Autumn is the Director of Infrastructure & Security at Cohere. She’s been with the company since September 2022 scaling teams & tools. Prior to buying into the startup life, she spent 3 years in financial services and 14 years at a large non-profit. Her passion is helping innovative... Read More →
Tuesday November 12, 2024 11:50am - 12:15pm MST
Salt Palace | Level 1 | Grand Ballroom A

11:50am MST

Panel: Exploring eBPF Use Cases in Cloud-Native Security - Oshrat Nir, ARMO; Anna Kapuścińska, Isovalent, now part of Cisco; Whitney Lee, CNCF Ambassador; Maya Singh, Microsoft; Cortney Nickerson, Kubeshop
Tuesday November 12, 2024 11:50am - 12:25pm MST
Cloud-native security requires a shift in mindset. Workloads are ephemeral, the attack surface has grown and with it, the complexities. eBPF has emerged as a powerful technology, enabling deep visibility and dynamic security capabilities within the Linux kernel. This panel will explore use cases in which eBPF enhances cloud-native security. We will explore how eBPF can be leveraged to perform real-time monitoring, threat detection, and mitigation across containerized applications and microservices. Our expert panelists will share insights on using eBPF for network security, application profiling, anomaly detection, and enforcing security policies at the kernel level. Additionally, we will discuss the integration of eBPF with popular cloud-native tools and platforms, showcasing practical implementations.
Speakers
avatar for Whitney Lee

Whitney Lee

CNCF Ambassador
Whitney is a lovable goofball and a CNCF Ambassador who enjoys understanding and using tools in the cloud native landscape. Creative and driven, Whitney recently pivoted from an art-related career to one in tech. You can catch her lightboard streaming show ⚡️ Enlightning on her... Read More →
avatar for Anna Kapuscinska

Anna Kapuscinska

Software Engineer, Isovalent at Cisco
Anna is a software engineer at Isovalent, focusing on eBPF-based observability and security. Her previous roles span the industry: she wore both developer and SRE hats, and worked in AdTech, FinTech, public healthcare, end-user SaaS company and a hosting provider. On good weather... Read More →
avatar for Oshrat Nir

Oshrat Nir

Developer Advocate, ARMO
Oshrat Nir is the Developer Advocate at ARMO, where she helps customers adopt Kubernetes security. She has over 20 years of IT experience, including roles at Amdocs and Giant Swarm. She is a big believer in transparency and community, and she loves telling stories. She excels at bridging... Read More →
avatar for Maya Singh

Maya Singh

Product Manager, Microsoft
Maya is a Product Manager at Microsoft who is passionate about data driven product development. With experience in financial services and Ed-tech she is excited to now delve into all things open source. Maya holds a Bachelor's degree in Biomedical Engineering and an MBA, both from... Read More →
avatar for Cortney Nickerson

Cortney Nickerson

Developer Advocate at Kubeshop, Kubeshop
Cortney is a Developer Advocate at Kubeshop and a co-organizer of the CNCF Bilbao Community. Initially, a non-techie turned tech lover, she began her career as employee number 7 at a DevSecOps startup (acquired by DataDog) and wrote the newsletter and other content for the Data on... Read More →
Tuesday November 12, 2024 11:50am - 12:25pm MST
Salt Palace | Level 1 | Grand Ballroom B
  Cilium + eBPF Day, Benefits of eBPF

12:00pm MST

⚡ Lightning Talk: Building a Scalable Multi-Protocol API Gateway with Envoy - Matt Poegel, Bloomberg LP
Tuesday November 12, 2024 12:00pm - 12:10pm MST
How far can Envoy be taken as an edge proxy? What if your downstream clients are not using HTTP? As a case study for Envoy’s use as a layer four edge proxy, this talk presents how Envoy is being used at Bloomberg to provide connectivity for enterprise clients across a range of protocols using TCP including FIX, MQ, and SFTP. It discusses the challenges of managing decades-old connectivity endpoints with an aggressive SLA.
Do you operate in an environment where the security of the system is paramount? Of course you do. The combination of Envoy with SPIRE creates a strong security posture from day one. With flexible deployment options from containers on the cloud to virtual machines on-prem, this talk demonstrates how it is all possible.
Speakers
avatar for Matt Poegel

Matt Poegel

Senior Software Engineer, Bloomberg
Matt Poegel is a Senior Software Engineer at Bloomberg. For the past seven years, he has worked in the firm's Connectivity and Integration Engineering group, where he is using C++ and Golang to build resilient and secure connectivity solutions for different enterprise systems and... Read More →
Tuesday November 12, 2024 12:00pm - 12:10pm MST
Hyatt Regency | Level 2 | Salt Lake Ballroom C
  EnvoyCon, Envoy in production case studies

12:00pm MST

⚡ Lightning Talk: Would a Gradual Rollout by Any Other Hashing Algorithm Still Smell as Sweet? - Chris Griffing, GitKraken
Tuesday November 12, 2024 12:00pm - 12:10pm MST
Gradual rollouts are an important use-case for feature flags. But not all hashing algorithms have the same goals. In this analysis, we will compare/contrast hashing algorithms and why your standard password hashing algorithm might not be ideal compared to some others. Some are also more readily available in various languages which can influence your decisions when building SDKs. We will outline which metrics we care about and why while graphing the results.
Speakers
avatar for Chris Griffing

Chris Griffing

Developer Advocate, GitKraken
I like to make things and share them with people.
Tuesday November 12, 2024 12:00pm - 12:10pm MST
Salt Palace | Level 2 | 250 D

12:25pm MST

⚡ Lightning Talk: Data Science Environments in Seconds: Scaling Jupyter Notebooks in Kubernetes - Jialin Zhang, Apple
Tuesday November 12, 2024 12:25pm - 12:35pm MST
The interactive nature of Jupyter notebooks has made them indispensable tools for data scientists and AI researchers, facilitating exploratory data analysis, prototyping, and model development. Setting up remote computational environments with multiple data or model sources on cloud is slow, repetitive and complex. However, warm pools of preconfigured environments with adjustable runtime configurations can alleviate these concerns enabling data scientists to focus more on actual data science and less on infrastructure. In this presentation, speakers will present a need for a system to manage integrated environments to meet the challenges of running remote Jupyter kernels at scale. With emphasis on productivity and experience, a solution for maintaining warm pools of such environments is presented while highlighting its key prediction algorithms. The session will showcase a brief and simplified collaboration experience on JupyterLab with the warm-pool system of kernels.
Speakers
avatar for Jialin Zhang

Jialin Zhang

Jialin Zhang, Software Engineer, Apple, Apple
Jialin Zhang is a software engineer with Notebooks team offering Jupyterlab as a service to data scientists engineers at Apple. Her previous experience includes stint at Microsoft and Expedia.
Tuesday November 12, 2024 12:25pm - 12:35pm MST
Salt Palace | Level 1 | 151 G

12:30pm MST

⚡ Lightning Talk: Charm++ on Kubernetes Cloud - Aditya Bhosale, University of Illinois at Urbana-Champaign
Tuesday November 12, 2024 12:30pm - 12:40pm MST
In this talk, we will detail the use of Kubernetes operators to run HPC applications using Charm++ runtime system on Kubernetes cluster on cloud. Charm++ is an adaptive intelligent runtime system that provides capabilities such as dynamic load balancing, energy optimizations, and communication optimizations, in addition to support for resource elasticity. It is a well-established system in the HPC world, and supports highly scalable applications such as NAMD for biomolecular simulations. We will talk about capabilities added to the setup like job malleability utilizing the shrink and expand feature in Charm++ jobs and by changing the number of pods assigned to a job at run-time. We will demonstrate effectiveness of shrink/expand operations for different scheduling policies and quantify the associated overhead. Charm++ has recently added support for python-based framework, Charm4py, for python codes for HPC. We will also talk about running Charm4Py applications on Kubernetes.
Speakers
avatar for Aditya Bhosale

Aditya Bhosale

Graduate Student, University of Illinois at Urbana-Champaign
Tuesday November 12, 2024 12:30pm - 12:40pm MST
Salt Palace | Level 1 | Grand Ballroom A

12:45pm MST

Panel: OpenTelemetry: Bridging Platform and Enablement - Daniel Gomez Blanco, Skyscanner; Ariel Valentin, GitHub; Hazel Weakly, Hachyderm; Suman Karumuri, Airbnb; Vijay Samuel, eBay
Tuesday November 12, 2024 12:45pm - 1:20pm MST
OpenTelemetry is everywhere, used by engineers in all roles. For telemetry data to provide effective observability it must permeate all areas of a software system, all the way up the domain-specific aspects that matter the most to end users. As a cross-cutting concern, it should be used within business logic to describe application internals. However, engineers in charge of developing new features are not always empowered with the modern observability practices supported by OpenTelemetry and, in a distributed environment, this may damage the overall observability of the system. In this panel, leaders from organizations at the forefront of this field take us through their experiences building platforms, tooling, enablement materials, and team topologies that allow them to scale adoption of OpenTelemetry best practices with minimal friction, and ensure that the telemetry data produced by their systems is of the highest quality, provides value, and maximizes return-on-investment.
Speakers
avatar for Daniel Gomez Blanco

Daniel Gomez Blanco

Principal Software Engineer at Skyscanner, OpenTelemetry Governance Committee Member, Skyscanner
Observability lead at Skyscanner, member of the OpenTelemetry Governance Committee, and author of "Practical OpenTelemetry: Adopting Open Observability Standards Across Your Organization". Throughout my career, my main focus has been reducing the cognitive load required to operate... Read More →
avatar for Suman Karumuri

Suman Karumuri

Principal Engineer, Airbnb
Suman Karumuri is a Sr. Staff Software Engineer and the tech lead for Observability at Slack. Suman Karumuri is an expert in distributed tracing and was a tech lead of Zipkin and a co-author of OpenTracing standard, a Linux Foundation project via the CNCF. Previously, Suman Karumuri... Read More →
avatar for Vijay Samuel

Vijay Samuel

Principal MTS, Architect, eBay
Vijay Samuel works with eBay's observability platform as its architect. During his time at eBay Vijay has transformed eBay's observability platform into a cloud native offering that is primarily built on top of open source technologies. He loves to code in Go and play video games... Read More →
avatar for Ariel Valentin

Ariel Valentin

Staff Software Engineer, GitHub
Staff Software Engineer on the Observability Infrastructure Team at GitHub and OpenTelemetry Ruby Contrib maintainer. Ariel has been a champion for Open Standards his entire career and is leading the effort to adopt OpenTelemetry at GitHub since 2020.
avatar for Hazel Weakly

Hazel Weakly

Fellow, Nivenly Foundation
Hazel spends her days working on building out teams of humans as well as the infrastructure, systems, and tooling to make life better for others. She’s worked at a variety of companies and knows that the hardest problems to solve are the social ones. One of her favorite things is... Read More →
Tuesday November 12, 2024 12:45pm - 1:20pm MST
Salt Palace | Level 2 | 255 E

12:55pm MST

⚡ Lightning Talk: On-Premise and SaaS CI/CD Large-Scale Production Automation with Argo Services - Edgar Magana, Splunk
Tuesday November 12, 2024 12:55pm - 1:05pm MST
In this presentation we will share our Continuous Integrations (CI) pipeline extensions via Argo Workflows in a pure cloud-native way seamlessly integrated with our code versioning system. We will also describe a large-scale asynchronous event-based bus communication that has accelerated our CI pipelines but also increased their resiliency. Finally, we will cover some Continuous Delivery enhancements via ArgoCD and Argo Rollouts. Actually, even our own infrastructure is operated via GitOps model. This collection of open-source projects and internal tools provides a full cloud-native orchestration and gitops experience for Dev, QA and Performance teams. Our solution extends the existing GitLab CI/CD capabilities by providing the ability to orchestrate, build and manage service interdependencies and integrations with other systems and enhanced metrics-based service rollouts in a multi-region cloud architecture.
Speakers
avatar for Edgar Magana

Edgar Magana

Sr. Principal Engineer, Splunk
Edgar has specialized in SaaS Architectures, Micro-services, Software-defined Networking (SDN) and CI/CD processes. Edgar has strong experience in fully automated systems with hands-on experience in technologies such as Docker, Terraform, Kubernetes, Argo, OpenStack and Spinnaker... Read More →
Tuesday November 12, 2024 12:55pm - 1:05pm MST
Salt Palace | Level 2 | 254 B
  ArgoCon, Progressive Delivery

12:55pm MST

⚡ Lightning Talk: Progressive Infrastructure Delivery Using Kargo and Argo CD - Engin Diri, Pulumi
Tuesday November 12, 2024 12:55pm - 1:05pm MST
Since the day Kargo was released, I have been exploring the idea of using it not only to deliver and promote applications but also to deliver infrastructure through its progressive delivery capabilities. Using Kubernetes-based tools like Crossplane or Pulumi, we can define infrastructure as code and deliver it progressively to our management clusters and then promote this infrastructure through different stages without the need for extra CD script magic. Let me show you how Kargo helps platform engineering streamline and automate the progressive rollout of infrastructure changes to all stages. This talk will cover the basics of Kargo and how to use it with Infrastructure as Code tools.
Speakers
avatar for Engin Diri

Engin Diri

Senior Solutions Architect, Pulumi
Engin is a Senior Solutions Architect at Pulumi and has been in the IT industry for over 15 years.He started as a Java backend developer and later migrated to the fronted development.This is where he found his passion for CI/CD, Cloud technologies and in particular Kubernetes.Engin... Read More →
Tuesday November 12, 2024 12:55pm - 1:05pm MST
Salt Palace | Level 2 | 251 AD
  ArgoCon, Progressive Delivery

1:10pm MST

Space Age GitOps: Lifting off with Argo Promotions (Live Demo!) - Michael Crenshaw & Zach Aller, Intuit
Tuesday November 12, 2024 1:10pm - 1:35pm MST
GitOps is an industry standard best practice in the Kubernetes space. But there are still gaps in the developer experience. Change previews and environment promotion have been two of the main pain points. Upcoming Argo CD features fill this gap by providing automated change previews as PR comments and by managing environment promotion by automatically opening and merging pull requests. This talk presents a live demo of these new Argo CD features. We’ll show you how change previews will help your developers merge changes faster and with more confidence; we’ll show how you can design a change promotion strategy tailored to your organization’s needs; and we’ll show how your developers can monitor change promotions via the Argo CD UI. Promotion-by-PR is one of several options for managing GitOps promotions. Besides showing a demo, we’ll compare and contrast this strategy with other open source and vendor-based solutions so that the audience can select the system that matches their needs.
Speakers
avatar for Zach Aller

Zach Aller

Staff Software Engineer, Intuit
Zach Aller is a software engineer at Intuit and a lead maintainer of Argo Rollouts. He has 15+ years of software development experience with a strong focus on SRE/Platform tooling. He has a strong background in Kubernetes and has managed large scale Kubernetes clusters for multiple... Read More →
avatar for Michael Crenshaw

Michael Crenshaw

Staff Software Engineer, Intuit
Michael Crenshaw is a Staff Software Engineer on the Argo CD team at Intuit. He is the most active contributor to the Argo project, focusing on security and performance improvements in Argo CD. He helps maintain Intuit’s ~50 Argo CD instances and ~20k Argo CD applications.
Tuesday November 12, 2024 1:10pm - 1:35pm MST
Salt Palace | Level 2 | 251 AD
  ArgoCon, Progressive Delivery

1:30pm MST

Scaling Network Policy Enforcement Beyond the Cluster Boundary with Cilium - Hemanth Malla & Maxime Visonneau, Datadog
Tuesday November 12, 2024 1:30pm - 1:55pm MST
To keep up with infrastructure growth, companies around the world are managing an increasing number of kubernetes clusters. Enforcing kubernetes native network policy at scale is already hard enough within a single cluster. Extending this to multiple clusters is even more challenging. Depending on the shape of your infrastructure, your cross-cluster policy requirements may be unique, and there’s no one-size-fits-all configuration. In this talk, we’ll dive deep into how different solutions work in cilium to understand sources of potential bottlenecks. We’ll discuss Clustermesh, KVstoremesh, DNS-based FQDN policy and a custom variant of KVstoremesh Datadog leverages while meshing at scale. Specifically, we’ll discuss how factors like the number of pods, identities and pod churn will impact scalability and time to policy enforcement. Join us if you’re curious about understanding the latest in cross-cluster policy and leave with actionable insights you can apply to your infrastructure.
Speakers
avatar for Hemanth Malla

Hemanth Malla

Senior Software Engineer, Datadog
Hemanth Malla is a Senior Software Engineer working on Kubernetes and container networking at Datadog. He is also a Cilium CNCF maintainer. Previously he worked on various distributed systems in industries like e-commerce, fintech and high frequency trading. Apart from computers... Read More →
avatar for Maxime Visonneau

Maxime Visonneau

Engineering Manager, Datadog
Maxime is an experienced systems and software engineer known for his passion in building robust infrastructures for small to large businesses. Having successfully led his startup to acquisition by Twitter in 2021. He is currently leading teams at Datadog where he brings a wealth of... Read More →
Tuesday November 12, 2024 1:30pm - 1:55pm MST
Salt Palace | Level 1 | Grand Ballroom B
  Cilium + eBPF Day, Cilium Architecture

1:30pm MST

Observing the Future: Embracing OTEL in WebAssembly - Victor Adossi, Cosmonic
Tuesday November 12, 2024 1:30pm - 1:55pm MST
WebAssembly is the next platform for computing, and this time, we can have observability from day one. In building distributed WebAssembly on top of wasmCloud, we built in the full OpenTelemetry ("OTEL") trifecta: traces, metrics and logs.

**Along the way we found a new way to achieve the holy grail of observability — free application & backing service instrumentation.**

This talk will cover how we implemented OTEL in wasmCloud and the benefits and challenges we faced. In a live demo, you will learn how you can trace globally distributed applications written in different programming languages and connected to a variety of backend services.
Speakers
avatar for Victor Adossi

Victor Adossi

Backend Engineer, Cosmonic
Talk to me about Rust, WebAssembly, and building microscalers.
Tuesday November 12, 2024 1:30pm - 1:55pm MST
Salt Palace | Level 2 | 255 B

1:35pm MST

Accelerating Application Delivery with OpenTofu Controller and GitOps - Lucas Duarte & Tiago Reichert, AWS
Tuesday November 12, 2024 1:35pm - 2:00pm MST
Developers often face challenges in reusing code and infrastructure across different projects, leading to duplication of effort and slower time-to-market. Platform engineering aims to solve this by providing a shared, standardized platform with reusable assets, automation, and governance, enabling faster development and delivery of applications.

Explore how OpenTofu Controller, in conjunction with Fluxv2 and GitOps principles, enables a seamless integration of infrastructure and application resources within a Kubernetes environment. We will dive into a real-world use case, showcasing the deployment of a multi-tenant SaaS platform on Kubernetes using OpenTofu Controller and Helm to consistently package reusable assets and deploy both application definitions and infrastructure components through a Git-centric workflow.
Speakers
avatar for Lucas Duarte

Lucas Duarte

Sr. Specialist Containers SA, AWS
avatar for Tiago Reichert

Tiago Reichert

Sr. Specialist Containers SA, AWS
Tiago is a Solutions Architect for AWS Brazil focused on helping ISV partners on their cloud journey. His passion lies in Containers, DevOps and SaaS. Additionally, Tiago is one of the organizer for the KCD Brazil.
Tuesday November 12, 2024 1:35pm - 2:00pm MST
Salt Palace | Level 2 | 250 D
  OpenTofu Day, Community Insights

1:40pm MST

GitOps Safety: Rendering Accurate ArgoCD Diffs Directly on Pull Requests - Dag Bjerre Andersen, Doubble & Regina Voloshin, Codefresh by Octopus Deploy
Tuesday November 12, 2024 1:40pm - 2:05pm MST
As organizations increasingly adopt GitOps and infrastructure-as-code, accurately visualizing manifest changes before they merge has become crucial. Mentally parsing Helm templates and Kustomize patches is too unreliable for catching configuration errors. Join us as we review the current landscape of tools and methods used for visualizing code changes in Argo CD, highlight their limitations, and introduce a new method that leverages ephemeral clusters and Argo CD to render accurate diffs of Helm Charts and Kustomize overlays directly on pull requests. The presentation showcases a tool illustrating this new approach and discusses its overall design. We will demonstrate how the approach can be seamlessly integrated into CI/CD pipelines to prevent deployment errors and streamline code reviews, all without access to live infrastructure. Finally, we'll conclude with an honest assessment of the method's capabilities and limitations and discuss potential areas for future development.
Speakers
avatar for Regina Voloshin

Regina Voloshin

OSS Tech Lead, Codefresh by Octopus Deploy
Regina is a GitOps fan. She is also an ArgoCD contributor and a CNCF Ambassador.
avatar for Dag Bjerre Andersen

Dag Bjerre Andersen

Infrastructure Engineer, Doubble
Dag is an Infrastructure Engineer at Doubble. He is passionate about nearly everything related to Kubernetes and has worked extensively with Argo CD, Flux, and Kubernetes over the past few years
Tuesday November 12, 2024 1:40pm - 2:05pm MST
Salt Palace | Level 2 | 254 B
  ArgoCon, Software Delivery

2:05pm MST

Enhancing Asynchronous Communication Observability with OpenTelemetry - Liudmila Molkova, Microsoft & Shivanshu Raj Shrivastava, SigNoz
Tuesday November 12, 2024 2:05pm - 2:30pm MST
OTel community has been working on standardizing semantic conventions to correlate telemetry data from various systems. The Messaging SemConv aims to solve it for commonly used queues like Kafka, RabbitMQ, and others systems. OTel instrumentations are adopting these conventions, but the end users still face challenges with async messaging observability at scale. They struggle with questions like "how to trace message flow?", "how to correlate metrics with traces?", "how to do capacity planning and cost optimizations based on telemetry data?". The end-to-end visibility often remains a black box! In this session, through a demo, we'll delve deeper into async architecture to address these questions, demonstrate context propagation within queues, and show how to correlate traces and client or broker-side metrics. Participants will gain hands-on experience with messaging instrumentation, learning how to achieve observability in both simple and complex asynchronous messaging scenarios.
Speakers
avatar for Liudmila Molkova

Liudmila Molkova

Principal Software Engineer, Microsoft
Liudmila Molkova is a Principal Software Engineer at Microsoft working on observability and Azure client libraries. She is a co-author of distributed tracing implementations across the .NET ecosystem including HTTP client instrumentation and Azure Functions. Liudmila is an active... Read More →
avatar for Shivanshu Raj Shrivastava

Shivanshu Raj Shrivastava

Founding Engineer, SigNoz
Shivanshu is a Founding Engineer at SigNoz, working on building an OTeL native observability product. He has a keen interest in deep tech and OSS. He is a CNCF ambassador and a member of CNCF projects like OTeL, k8s, and Istio. He has got the opportunity to mentor contributors in... Read More →
Tuesday November 12, 2024 2:05pm - 2:30pm MST
Salt Palace | Level 2 | 255 B

2:05pm MST

Harnessing Crossplane and Dapr for DevOps: FICO’s Platform Engineering Journey to Increase Velocity - Hugo Smitter, FICO
Tuesday November 12, 2024 2:05pm - 2:30pm MST
FICO’s platform engineering team is constantly researching new tools to help accelerate delivery of solutions to our customers. We reviewed various tools enabling our teams to write sophisticated automation pipelines. Learn how we leverage Crossplane and Dapr to build Composition pipelines for increased deployment flexibility and velocity.

We’ll explore:
  • Overview of the FICO® Platform and strategic business objectives driving our tool selection. 
  • Explore Crossplane Functions for dynamic resource generation, database interactions, secret management and more.
  • Combine Dapr’s building blocks with Crossplane to streamline your DevOps workflows to increase deployment flexibility and velocity.
  •  Building Anything-as-Code (AaC) pipelines using Crossplane’s Composition Functions aided by Dapr to separate user logic from boilerplate code.
Gain technical insights and practical knowledge on new tools to increase velocity and flexibility of your team’s workflows and AaC deployments.
Speakers
avatar for Hugo Smitter

Hugo Smitter

Platform Architect at FICO, FICO
Senior Enterprise and Solutions Architect with international experience in multiple industries. Track record performing leadership roles as: chief architect, solution architect, platform architect, systems integrator, information architect, delivery excellence auditor, consultant... Read More →
Tuesday November 12, 2024 2:05pm - 2:30pm MST
Salt Palace | Level 1 | Grand Ballroom G

2:10pm MST

Argonauts of Data: Building Scalable and Effective Data Pipelines - Satabrata Paul & Nishchith Shetty, Atlan
Tuesday November 12, 2024 2:10pm - 2:35pm MST
Atlan is a collaborative workspace for data teams that offers functionality like metadata cataloging and data lineage amongst others. Atlan provides connector integrations which ingest metadata from various data sources. As the data estate volume hit a massive scale, the platform encountered performance drags with ETL pipelines impacting resiliency, processing runtimes and efficiency. The existing architecture suffered pipeline failures encompassing computation and storage exhaustion, and parallel and concurrent processing pit-falls with troubling spikes in workflow failure rates. In this talk, Satabrata and Nishchith will share how they leveraged Argo’s parallelization techniques with robust re-try mechanisms and effective artifactory loading to ingest 100 Million assets achieving a 450% reduction in processing time. This improvisation also helped them process 3 Million SQL Queries in just 2 hours reducing overall pipeline runtime by 50% and having Argo-powered horizontal scale-out.
Speakers
avatar for Satabrata Paul

Satabrata Paul

Software Engineer II, Atlan
Satabrata Paul is a seasoned Data Engineer specializing in Backend Systems and CI/CD methodologies to optimize connector integrations for robust data cataloging. At Atlan, he is a part of the Metadata Marketplace team crafting solutions for data asset discovery and lineage. Satabrata... Read More →
avatar for Nishchith Shetty

Nishchith Shetty

Software Engineer, Platform Team, Atlan
Nishchith Shetty is a Software Engineer, part of the Platform Engineering Team at Atlan. He currently lives in San Jose, California. In the past, he has contributed to several open-source projects like Numaflow, CLTK, ScanCode, and Linux Foundation. Nishchith recently graduated from... Read More →
Tuesday November 12, 2024 2:10pm - 2:35pm MST
Salt Palace | Level 2 | 254 B
  ArgoCon, Data Processing

2:10pm MST

Managing Application Dependencies in Argo CD - Christian Hernandez, Akuity
Tuesday November 12, 2024 2:10pm - 2:35pm MST
In this talk, we will explore how to effectively manage inter Applications dependencies in Argo CD. Argo CD's Application Custom Resource Definition allows you to group Kubernetes Manifests logically, treating them as a single entity. While Applications are designed to be autonomous, this can pose challenges in a microservices architecture where components are isolated into separate Applications. This session will discuss patterns, best practices, and lessons learned to manage dependencies using the current capabilities of Argo CD. The session will begin with an overview of the Application CRD and its role as the atomic unit in Argo CD, highlighting how it enables the logical grouping of Kubernetes objects. We will then discuss Application characteristics and how it impacts microservices, using real-world examples. The we'll go over the current limitations in setting up Application dependencies in Argo CD, followed by strategies and patterns for managing inter-Application dependencies
Speakers
avatar for Christian Hernandez

Christian Hernandez

Head of Community, Akuity, Inc
Christian is a well rounded technologist with experience in infrastructure engineering, systems administration, enterprise architecture, tech support, advocacy, and product management. Passionate about OpenSource and containerizing the world one application at a time. He is currently... Read More →
Tuesday November 12, 2024 2:10pm - 2:35pm MST
Salt Palace | Level 2 | 251 AD
  ArgoCon, Scalability

2:10pm MST

Confluent’s Service Mesh Journey - Building Security and Reliability One Sidecar at a Time! - Adam Sayah, Solo.io & Cody Ray, Confluent
Tuesday November 12, 2024 2:10pm - 2:35pm MST
Confluent Cloud leverages Istio service mesh in its control plane to secure, observe and route traffic between microservices at scale processing millions of requests per second. In this case study we will explore how Istio service mesh was incrementally adopted at Confluent providing a uniform identity model using SPIRE and advanced observability using OpenTelemetry. We will go over our Istio adoption journey - from securing traffic using mTLS to advanced multi cluster routing and share our experiences running Istio at scale in production along with learnings to operationalize Istio for Day 2 operations.
Speakers
avatar for Adam Sayah

Adam Sayah

Product Manager, Solo.io
Adam Sayah is Field Engineer at Solo.io, a company specializing in open source and enterprise software for application networking from the edge to service mesh. At Solo.io, Adam helps organizations build and operate robust cloud-native architecture. Prior to Solo.io, Adam held software... Read More →
avatar for Cody A. Ray

Cody A. Ray

Staff Software Engineer, Confluent, Inc.
REST/GRPC APIs, OpenAPI, schemas, API versioning and evolution, golang, Kubernetes, Docker, and so on.
Tuesday November 12, 2024 2:10pm - 2:35pm MST
Hyatt Regency | Level 2 | Salt Lake Ballroom C

2:40pm MST

Fake It to Make It! - TDD of gRPC Microservices - Ed Crewe, EnterpriseDB
Tuesday November 12, 2024 2:40pm - 3:05pm MST
For unit testing cloud native applications we mock a lot, dependencies are in the cloud, and unit tests should be isolated. For testing the full service delivered by k8s composed microservices, we can easily stand up ephemeral test deployments as part of CI/CD. K8s gives us replicable IaC out of the box. Add E2E tests to our pipeline and we are good, right? Unfortunately no, that means we not only have an hourglass of testing. But one whose bottom may barely test if your code actually works, whatever the coverage says. Leaving us relying on very slow and expensive E2E build and test feedback from our CI/CD requiring full k8s cluster deployments. This talk is about why developers need to turn that hourglass of testing back into a pyramid and how to do so. Enabling efficient test driven refactoring of your microservices by developing fast fake frameworks for functional testing. Driving up quality via rapid development and refactoring cycles.
Speakers
avatar for Ed Crewe

Ed Crewe

Developer EDB Postgres AI, EDB (www.enterprisedb.com)
I have been a cloud engineer for the last 8 years. Working mainly on k8s Golang micro-services for the last 5. Before that I was a web developer and have spoken at Djangocon.Eu and EuroPython as well as many times at local techie groups, including my local Golang meetup. Currently... Read More →
Tuesday November 12, 2024 2:40pm - 3:05pm MST
Salt Palace | Level 1 | 151 G

2:40pm MST

Live Migrating Production Clusters From Calico to Cilium - Moh Ahmed & Raymond Maika, SamsungAds
Tuesday November 12, 2024 2:40pm - 3:05pm MST
Engineers may be tasked with rolling out a new Container Networking Interface (CNI) to their environment. Sounds easy enough! Delete the old one, deploy the new one. Or maybe just deploy a brand new cluster! What if... there was another way? The talk will show how a live, in-place migration of the CNI plugin was performed in production clusters. It will highlight a few approaches that were considered, and what approach was eventually selected before proceeding with the migration process. Lastly, the procedure and steps taken to execute this migration will be shared, along with any lessons learned.
Speakers
avatar for Moh Ahmed

Moh Ahmed

Staff Developer, Site Reliability Engineer, SamsungAds
avatar for Raymond Maika

Raymond Maika

Manager, Platform Engineering, SamsungAds
Tuesday November 12, 2024 2:40pm - 3:05pm MST
Salt Palace | Level 1 | Grand Ballroom B
  Cilium + eBPF Day, Use Cases

2:40pm MST

Brag Your RAG with the MLOPS Swag - Madhav Sathe, Google & Jitender Kumar, publicissapient
Tuesday November 12, 2024 2:40pm - 3:05pm MST
Organizations are beginning to unlock significant value by integrating Large Language Models (LLMs) & Retrieval-Augmented Generation (RAG) into their business-critical processes. However, enterprises often face challenges in meeting the high expectations of GenAI-driven business outcomes. Bridging this gap requires meticulous planning in governance, continuous evaluation, seamless scaling, operational costs, and time-to-market. In this session, attendees will witness a live demonstration of a RAG application stack built with LangChain, Canopy, and a PostgreSQL Vector database, all deployed on Kubernetes. Additionally, we will discuss leveraging GPU and TPU accelerators to enhance computational efficiency. The audience will also gain insights into MLOps strategies for data splitting, embeddings, retrieval, and prompt engineering. Join us to explore how to effectively leverage MLOps with Kubernetes to achieve scalable and impactful GenAI solutions.
Speakers
avatar for Jitender Kumar

Jitender Kumar

Director Technology- Devops and Cloud, Publicis Sapient
20+ years of successful IT and Delivery management experience leading mission critical infrastructure, software development and implementation projects involving strategic business and technology change and providing measurable financial results for the organization. Worked with Financial... Read More →
avatar for Madhav Sathe

Madhav Sathe

Principal Architect, Google
Madhav helps major enterprises drive innovation using modern application architectures, containers and DevOps. Madhav has been a speaker at conferences such as SpringOne, Cloud Foundry Summit and Oracle OpenWorld. He has co-authored a white paper on container security. Madhav currently... Read More →
Tuesday November 12, 2024 2:40pm - 3:05pm MST
Salt Palace | Level 1 | Grand Ballroom A

2:40pm MST

Build-Time Auto-Instrumentation in Android - Jason Plumb, Splunk
Tuesday November 12, 2024 2:40pm - 3:05pm MST
This session provides an in-depth dissection and live demonstration of OpenTelemetry’s build-time auto-instrumentation for Android. Class-loading in the Android runtime poses significant challenges to instrumentation engineers looking for attach or injection points. Fortunately, developers can now use off-the-shelf OpenTelemetry tools to instrument their applications without the need to manually instrument or even alter their application code. We will cover the benefits of front-loading bytecode weaving to build time, the tools used by OpenTelemetry to accomplish this, and the types of rich client-side telemetry created through this approach.
Speakers
avatar for Jason Plumb

Jason Plumb

Software Engineer, Splunk
Jason Plumb (he/him) is a hacker, artist, experimenter, polyglot programmer, and dad from Portland, OR, USA. He is co-maintainer of OpenTelemetry Android and an approver in various OpenTelemetry java projects. When not at work, Jason volunteers with Futel to install and maintain a... Read More →
Tuesday November 12, 2024 2:40pm - 3:05pm MST
Salt Palace | Level 2 | 255 B

2:45pm MST

Taming the Chaos: Fine-Grained RBAC in Argo CD for Incident Avoidance - Katie Lamkin-Fulsher & Alexandre Gaudreault, Intuit
Tuesday November 12, 2024 2:45pm - 3:10pm MST
Incidents caused by accidental actions can have far-reaching consequences. At Intuit, developers encountered a series of such incidents due to unintended actions performed through the Argo CD UI, including deletion of Replica Sets and Argo Rollouts. To prevent these types of unintended actions, we extended the current RBAC system to implement fine-grained policies. With this policy model, developers can now make changes with confidence, free from the fear of inadvertently impacting production systems. In this talk, we will delve into the intricacies of our journey, the strategies we employed to accomplish our goals, the future of Argo CD RBAC and what is yet to come. Join us as we explore the transformative power of fine-grained permissioning in preventing incidents and cultivating a culture of secure development.
Speakers
avatar for Alexandre Gaudreault

Alexandre Gaudreault

Software Developer & Argo CD Maintainer, Intuit
Alexandre is a Senior Software Developer at Intuit working on the core Argo team. He is a maintainer of the CNCF-graduated project Argo CD. He thrives on building internal developer platforms using open-source technologies to increase development velocity. Outside of work, you may... Read More →
avatar for Katie Lamkin

Katie Lamkin

Sr Product Manager of Platform and Open Source, Intuit
Katie Lamkin is a Sr Product Manager of Platform and Open Source at Intuit, who works with application development teams to achieve operational excellence through CICD platforms and progressive delivery strategies. Katie has been a Cloud Architect and held Engineering Management positions... Read More →
Tuesday November 12, 2024 2:45pm - 3:10pm MST
Salt Palace | Level 2 | 251 AD
  ArgoCon, Software Delivery

3:00pm MST

⚡ Lightning Talk: Enhancing Security with Istio: Realtime JWT Access Revocation - Josh Oberdick, Rocket Companies
Tuesday November 12, 2024 3:00pm - 3:10pm MST
Do you use JSON Web Tokens in your Istio system? Do you like how you can't revoke any of the JWTs that you use? Wish you could make a more secure system that allows you revoke a token across all your clusters within seconds without compromising all that you love about JWTs? Join my talk to learn about how to build an event based system that uses native Istio functionality to build a zero trust layer on top of Istio that enables token revocation when: - a user account is disabled - a user account logs off - (insert your use case) We will walk through all the pieces needed accomplish this, and of course a live demo.
Speakers
avatar for Josh Oberdick

Josh Oberdick

Platform Architect, Rocket Companies
Josh Oberdick is a Platform Architect at Rocket Companies, the nation’s largest online mortgage lender. In his 14 years’ experience of infrastructure engineering, he has developed solutions for mission critical storage and processing systems. Most recently, he has been focused... Read More →
Tuesday November 12, 2024 3:00pm - 3:10pm MST
Hyatt Regency | Level 2 | Salt Lake Ballroom C

3:20pm MST

Why Does Continuous Profiling Matter to Developers? - Jonas Kunz, Elastic & Mauricio Salatino, Diagrid
Tuesday November 12, 2024 3:20pm - 3:45pm MST
OpenTelemetry has gained momentum in the observability space, with a wide range of vendors supporting the specification. In this presentation, we will look into how developers can benefit from having a clear observability strategy by also adding continuous profiling to the mix. Setting up and understanding distributed application traces and profiling information used to be a complex task, usually left for critical situations where things go wrong. Join us on a journey to combine traces and profiling data to understand how our applications perform and where the bottlenecks are. In this presentation, you will learn about: How Otel and profiling can work together Advantages of continuous profiling How to set up an observability stack that works for your developers
Speakers
avatar for Jonas Kunz

Jonas Kunz

Jonas Kunz, Elastic
I work as (primarily) Java Developer at Elastic, focusing on the Elastic APM Java-agent and our Java OpenTelemetry Distribution. While I love the safety of managed languages, I also enjoy occasionally visiting the native side of things. I'm an active contributor to the OpenTelemetry... Read More →
avatar for Mauricio Salatino

Mauricio Salatino

OSS Software Engineer, Diagrid
Mauricio works as an Open Source Software Engineer at @Diagrid, contributing to and driving initiatives for the Dapr OSS project. Mauricio also serves as a Steering Committee member for the Knative Project and Co-Leading the Knative Functions initiative. He published a book titled... Read More →
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Salt Palace | Level 1 | 151 G

3:20pm MST

Tuning Argo Rollouts for Thousands of Workloads - Carlos Sanchez & Roxana Balasoiu, Adobe
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Argo Rollouts makes Progressive Delivery easy to adopt, but some times things do not work as expected. Are the steps correctly set? are the analysis metrics right for the workloads? At Adobe Experience Manager we deploy over 10k customer services to Kubernetes. Changes can occur multiple times per day both internal and from code. A new feature can work fine for 99% of customers but still affect the other 1%, and detecting this just from tests is costly. Enter Argo Rollouts, which allows deploying new versions to a subset of users before rolling them to the totality of the users, and rolling them back if not matching some key metrics, using techniques like canary deployments. We will show our learnings deploying Argo Rollouts to manage over 10k workloads using canaries, how do we balance speed and safety for our customers, and some of the issues that we have faced when adopting it.
Speakers
avatar for Roxana Balasoiu

Roxana Balasoiu

Software Development Engineer, Adobe
Roxana Balasoiu is a Software Development Engineer at Adobe where she has been working for the last 5 years. She is currently working at Adobe Experience Manager and previously contributed to Adobe Analytics. Roxana focuses on enhancing cloud infrastructure and developing new features... Read More →
avatar for Carlos Sanchez

Carlos Sanchez

Principal Scientist, Adobe
Carlos Sanchez is a Principal Scientist at Adobe Experience Manager, specializing in software automation, from build tools to Continuous Delivery and Progressive Delivery. Involved in Open Source for over 20 years, he is the author of the Jenkins Kubernetes plugin and a member of... Read More →
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Salt Palace | Level 2 | 254 B
  ArgoCon, Progressive Delivery

3:20pm MST

DevEx and Productivity Metrics on Backstage - Do's and Don'ts - Yishai Beeri, LinearB
Tuesday November 12, 2024 3:20pm - 3:45pm MST
If you're already running a Backstage based IDP, it makes a lot of sense to use it for showing dev productivity metrics and developer experience signals as well. Before you start simply pushing DORA metrics to your IDP, you might want to look at some common gotchas. This talk with review some pitfalls around exposing dev productivity metrics in backstage, and how to avoid them. We will cover best practices for maximizing the value and actionability of what you're sending to the IDP. We'll talk about how to use benchmarks to anchor your data with context; leveraging a/b and control groups to make a point; and how to enable drilling down to learn more about a data point, and how to enable the various personas that will consume these metrics. Finally, we'll talk about where to get started and how to then take it to the next level.
Speakers
avatar for Yishai Beeri

Yishai Beeri

CTO, LinearB
Yishai Beeri loves to solve problems, and that’s why he was so fascinated with programming when he first encountered Logo back in the 80s. The possibilities seemed endless. In 2014 he joined a fast-moving cloud security startup, later acquired by a networking giant. That's where... Read More →
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Salt Palace | Level 1 | Grand Ballroom H

3:20pm MST

Hubble Beyond Cilium - Anubhab Majumdar & Mathew Merrick, Microsoft
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Hubble is a great solution for finding and fixing network problems in a Kubernetes cluster. However, we noticed that one of the main barriers for people to use Hubble is its dependency on Cilium as the dataplane. In this talk, we'll demonstrate how to decouple Hubble from Cilium, and use Hubble as a powerful Observability/metrics platform on top of any custom data plane. We will show you how to make Hubble work with any data source you want, without changing any code in Hubble. We'll show you an example of one such open source project called Retina and compare how key features work with both Cilium and custom CNI. In a live demo, we will show that you can get the same experience with Hubble regardless of what CNI you use.
Speakers
avatar for Anubhab Majumdar

Anubhab Majumdar

Senior Software Engineer, Microsoft
Software engineer in the Azure Container Networking team; previously with VMware Tanzu team
avatar for Mathew Merrick

Mathew Merrick

Software Engineer II, Microsoft
Software Engineer on the Azure Container Networking team.
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Salt Palace | Level 1 | Grand Ballroom B
  Cilium + eBPF Day, Benefits of eBPF

3:20pm MST

Reproducible AI with Kubeflow, lakeFS and Langchain - Oz Katz, Treeverse
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Langchain has become one of the most popular frameworks for anyone building custom, generative AI-driven apps powered by LLMs, that leverage RAG (Retrieval-Augmented Generation) for the most enhanced results.  But like all data products, these applications are really only as good as the organizational data fed into them––and we’ve all learned the hard way that the data is oftentimes far from perfect. In this hands-on tutorial you’ll learn how to build a reproducible AI application pipeline with Kubeflow, Langchain and lakeFS, widely adopted OSS tools in the ML & GenAI stack.  By learning how to build a RAG chatbot, while iteratively tuning it for best results leveraging lakeFS’s temporal versions, you’ll come away with improved methods for data reproducibility for your custom AI apps, that provide better data quality, alongside an improved user experience for your application users.
Speakers
avatar for Oz Katz

Oz Katz

Co-Founder, CTO, lakeFS
Oz Katz is the CTO and Co-Creator of the open source lakeFS Project, an open source platform that delivers resilience and manageability to object-storage based data lakes. Oz engineered and maintained petabyte-scale data infrastructure at analytics giant SmilarWeb, which he joined... Read More →
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Salt Palace | Level 1 | Grand Ballroom A

3:20pm MST

Unlock the Full Potential of Generative AI via Microservices and Istio Service Mesh - Lin Sun, Solo.io & Iris Ding, Intel
Tuesday November 12, 2024 3:20pm - 3:45pm MST
This talk dives into how microservices architectures and Istio service mesh in Kubernetes empower developers to build scalable, resilient, and future-proof GenAI applications. We'll explore the challenges of running GenAI Application such as selecting the most suitable Large Language Model (LLM), leveraging effective embedding models, and deploying a robust vector database (DB), etc. We'll demonstrate how to overcome them using advanced Kubernetes strategies.

Key topics include:
• Discover how breaking down GenAI Application into microservices enhances flexibility, scalability, and maintainability.
• Learn how service mesh facilitates dynamic updates and changes to GenAI components without impacting user traffic.
• Showcase practical strategies for integrating these technologies in Kubernetes, supported by real-world examples like how to dynamically compose a GenAI application using different microservices, and how to change models or pipelines dynamically on Kubernetes.
Speakers
avatar for Iris Ding

Iris Ding

Cloud software architect, Intel
Iris Ding is a cloud software architect at Intel and has a rich background in open source development, cloud computing, Generative AI(GenAI), middleware development and design. Her current focus is intersection of GenAI and cloud computing and is leading development for Open Platform... Read More →
avatar for Lin Sun

Lin Sun

Head of Open-Source; CNCF TOC member, Solo.io
Lin is the Head of Open Source at Solo.io, contributing to open source full time. She is a CNCF TOC member and ambassador, an Istio core maintainer and leader. She is an international speaker in various tech conferences and blogs frequently about her perspective of service mesh and... Read More →
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Hyatt Regency | Level 2 | Salt Lake Ballroom C

3:20pm MST

Turn the Volume Down on Noisy Neighbors! - Sándor Guba, Axoflow
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Using Kubernetes as a multi-tenant platform is becoming a general concept. However, the fact that different logs are mixed on the same nodes makes it difficult. Learn how the Telemetry Controller (an abstract layer on top of the OTel Collector) helps you tackle the first problem. Routing from the edges provides isolation for the tenants and simplifies filtering and routing on a tenant level. Sandor will present different scenarios to do intelligent routing and how to build a smart aggregator on top of the Telemetry Controller.
Speakers
avatar for Sándor Guba

Sándor Guba

CTO, Axoflow
Sandor is a software engineer, CTO, and founder at Axoflow. His main field has always been observability and logging. He is a former co-founder at Banzai Cloud. He was responsible for observability and founded open-source projects like the Logging Operator and Thanos Operator. He... Read More →
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Salt Palace | Level 2 | 255 B

3:20pm MST

Enabling OpenTofu for the Enterprise - Jordan Argueta and Douglas Flagg, Fidelity Investments
Tuesday November 12, 2024 3:20pm - 3:45pm MST
In this presentation, we will cover how we built an internal Infrastructure as Code (IaC) Platform to support a growing user base with over 2,070 applications managing 46,430 workspaces. It is our goal to explore what it means to enable an IaC tool like OpenTofu in a way that promotes security and compliance while also offering quick and easy self-service solutions. We will also discuss how we are enabling existing Terraform users to use OpenTofu, and the strategies implemented to facilitate a seamless transition. Finally, we will highlight what kind of applications, provider/module catalogs, and observability tools that platform teams should consider when creating a developer centric ecosystem for OpenTofu. Attendees will take away a concrete road map for enabling OpenTofu across many different teams within an enterprise.
Speakers
avatar for Douglas Flagg

Douglas Flagg

Principal Cloud Engineer, Fidelity Investments
Doug is a Principal Cloud Engineer at Fidelity Investments who collaborates daily with multiple internal business partners to deliver customer value through development of IaC tools and practices and applying them practically in various public clouds.Because of the visibility this... Read More →
avatar for Jordan Argueta

Jordan Argueta

Senior Cloud Engineer, Fidelity Investments
Jordan is Senior Cloud Engineer who has had the opportunity to be apart of the Cloud Platform and Engineering department at Fidelity Investments. His day-to-day task include contributing toward self-service tools, applications, and modules for OpenTofu and Terraform.
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Salt Palace | Level 2 | 250 D

3:55pm MST

Bridge the Gap Between Terraform and GitOps - Junze Bao & Alexander Matyushentsev, Akuity
Tuesday November 12, 2024 3:55pm - 4:20pm MST
In modern cloud-native environments, GitOps practitioners face a significant challenge: managing dynamically generated resource values across Infrastructure as Code (IaC) tools like Terraform and Kubernetes manifests. This disconnect often leads to manual interventions, hard-coded values, and cumbersome update processes that compromise the efficiency and security of GitOps workflows. This talk introduces the Terraform Bridge, an innovative solution to seamlessly integrate dynamically generated cloud resource attributes with Kubernetes resources. We present a novel controller that leverages Terraform's output feature to automatically update Kubernetes objects such as ConfigMaps and Deployments. This approach eliminates the need for manual updates and hard-coding of sensitive or randomly generated values, thereby enhancing both security and automation in GitOps pipelines.
Speakers
avatar for Junze Bao

Junze Bao

Site Reliability Engineer, Akuity
Junze is a site reliability engineer at akuity.io.
avatar for Alexander Matyushentsev

Alexander Matyushentsev

Co-founder and Chief Architect, Akuity
Argo Co-Creator, Argo CD Lead, and maintainer. Energetic and passionate software engineer with over a decade of software development experience. I'm an enthusiast of continuous integration, agile environments, and a huge open-source believer. Core contributor and maintainer of http://argoproj.io... Read More →
Tuesday November 12, 2024 3:55pm - 4:20pm MST
Salt Palace | Level 2 | 254 B
  ArgoCon, Software Delivery

3:55pm MST

Lessons Learned Migrating to Modern Multi-Platform eBPF Programs - Dave Tucker, Red Hat
Tuesday November 12, 2024 3:55pm - 4:20pm MST
Kepler needed to migrate its old eBPF probes developed with BCC to probes that were compiled ahead of time. Maybe you do too? While performing this migration we were able to use some modern features of eBPF, the cilium/ebpf Go library, and bpf2go to make our probes multi-platform. Kepler (Kubernetes-based Efficient Power Level Exporter) is a CNCF project focused on measuring the environmental impact of software. At its core, Kepler uses eBPF to gather metrics from the Linux Kernel, which feed into an ML model that estimates power consumption for processes, VMs, and Pods. By the end of this session, you’ll gain a deeper understanding of eBPF, practical insights into its application in power consumption monitoring, and strategies for modernizing existing eBPF programs. Join us to learn from our experience and take away actionable best practices for your own projects!
Speakers
avatar for Dave Tucker

Dave Tucker

Sr. Principal Software Engineer, Red Hat
Dave is a long-time networking nerd, turned software engineer at the dawn of Software Defined Networking (SDN). A passionate Rustacean who currently helps to maintain Aya - a pure Rust eBPF library - alongside the Rust Compiler's BPF target, which allows users to program in Rust as... Read More →
Tuesday November 12, 2024 3:55pm - 4:20pm MST
Salt Palace | Level 1 | Grand Ballroom B
  Cilium + eBPF Day, Benefits of eBPF

3:55pm MST

How We Streamlined Our SDLC with Observability - from GitHUB via Jenkins, Harbor, Argo to K8S - Michael Gläss & Andreas Grabner, Dynatrace
Tuesday November 12, 2024 3:55pm - 4:20pm MST
Like many enterprises, we at Dynatrace deal with multiple delivery streams and various tooling to be used to develop, build, test and deliver our product to our customers. To better scale and have a resilient delivery practice, we regularly need to streamline the processes. From a requirement till the running solution, it typically involves many steps, often various people and different systems. Distributed knowledge makes it hard to transparently identify issues and slows things down. Observability in SDLC (Software Delivery Lifecycle) is our key enabler that gives us insights into resources, tools, techniques, staging, quality, rollout, etc. In this session learn how we automatically trace every delivery value flow, identify bottlenecks, wait times, lead times and other characteristics with the help of cloud native observability. Observability tells us where to start with optimization(s) and how to apply a continuous improvement cycle to our delivery chain.
Speakers
avatar for Michael Gläss

Michael Gläss

Chief Product Architect, Dynatrace
Michael is overseeing the Dynatrace Product Architecture with a strong focus on Cloud technologies, Platform Engineering, enablement and continuous delivery. He joined Dynatrace in March 2022 with more than 25 years hands on software engineering including the last 10+ years leading... Read More →
avatar for Andi Grabner

Andi Grabner

CNCF Ambassador and DevRel, Dynatrace
Andreas Grabner (@grabnerandi) has 20+ years of experience as a software developer, tester and architect and is an advocate for high-performing cloud scale applications. He is a CNCF ambassador, contributor to the CNCF project keptn and a DevRel for Dynatrace. Andreas is also a regular... Read More →
Tuesday November 12, 2024 3:55pm - 4:20pm MST
Salt Palace | Level 2 | 255 B

3:55pm MST

If You Build It, They Will Come - a Platform Modernization Journey - Ken Heaslip & Chloe Clarke, Canada Life
Tuesday November 12, 2024 3:55pm - 4:20pm MST
The story of a shift from on an premise container orchestration platform to a cloud based, feature rich, platform that became the first of it's kind in the company. We will share the challenges we saw in the "current state" on-premise platform (infra scalability, observability gaps, inconsistent security design, usability for customers, etc). We will discuss the process used to design the new platform including inclusion of customer feedback, how tooling was selected, how each was used to addressed the gaps and significantly improve the platform, and strategies on ensuring success for consumers of all skill levels. A few tools used include Amazon EKS, Tetrate (observability, security, and access control patterns), Karpenter (automate node scaling, ensure 99.99% HA, control cost, and address infra DR needs), Velero (automate cluster configuration backups and restores), Cloudability (cost reporting), and integration with service now for automated CI creation and deletion of CI's.
Speakers
avatar for Chloe Clarke

Chloe Clarke

DevOps Engineer, Canada Life
A dedicated Devops Engineer specializing in containerization, kubernetes and cloud migrations. Passionate about transforming legacy systems into modern, scalable environments.
avatar for Ken Heaslip

Ken Heaslip

Mr, Canada Life
I've worked in the tech field for 18 years and had the opportunity to span many technologies in my time. My best work comes from researching and implementing new tech where the challenge to learn it well enough to build a great design is high. I enjoy mentoring others and helping... Read More →
Tuesday November 12, 2024 3:55pm - 4:20pm MST
Salt Palace | Level 1 | Grand Ballroom G

4:15pm MST

ArgoCD: Lessons in Spinning Out OSS from Intuit to Independence - Jesse Suen & Hong Wang, Akuity
Tuesday November 12, 2024 4:15pm - 4:45pm MST
Join Jesse Suen and Hong Wang, co-founders of Akuity, as they share their journey of transforming ArgoCD from an internal Intuit project to a thriving independent open-source software (OSS) business. This session will cover:
  • The origins of ArgoCD within Intuit and its evolution
  • Challenges and opportunities in spinning out an enterprise-born OSS project
  • Building a sustainable business model around ArgoCD
  • Maintaining community engagement while pursuing commercial growth
  • Key lessons learned during the transition to independence
Gain valuable insights into the complexities of OSS commercialization, corporate spin-outs, and building a startup around a popular open-source tool. This presentation is ideal for developers, OSS maintainers, entrepreneurs, and corporate innovation leaders interested in the intersection of open source and business strategy.

Speakers
avatar for Jesse Suen

Jesse Suen

CTO, Akuity
Jesse Suen is the CTO and co-founder of Akuity. He is a co-creator and a project lead on the Argo project. Prior to founding Akuity, Jesse was a Principal Software Engineer and lead for the Argo team at Intuit, leading the design and architecture for Workflows, CD, and Rollouts. Jesse... Read More →
avatar for Hong Wang

Hong Wang

CEO, Akuity
A founding member of Argo Project. Prior to founding Akuity, Hong was the Argo team manager at Intuit and built the control-plane used to manage hundreds of Kubernetes clusters and thousands of namespaces. Hong has extensive experience in distributed system projects ranging from storage... Read More →
Tuesday November 12, 2024 4:15pm - 4:45pm MST
Hyatt Regency | Level 2 | Salt Lake Ballroom A

4:30pm MST

⚡ Lightning Talk: Vote on Your Favorite--Standardize This!--10 PromQL Queries That Measure CPU & Memory Usage - Sarah Hudspeth, Chronosphere
Tuesday November 12, 2024 4:30pm - 4:40pm MST
In the course of several SaaS migrations to Otel and Prom-based exporters, the search for the perfect PromQL CPU Usage (and Mem Usage) query ended up with several candidates. This quick talk will walk through these queries, explain their differences, and offer the crowd a way to rate the queries and vote on which ones they think should be used. A quick search on Stack Overflow will also produce quite a few queries to use and prove the need for some crowdsourcing on what queries should be the standard. Let's let the people decide! Or at least let's understand the appropriateness of various queries versus other options; how container and various prometheus exporter metrics work; and talk about the challenges of observability migrations with regards to translating queries from difference vendors to Open Source telemetry. Queries include cadvisor metrics, kubestate metrics, and node exporter metrics -- which also probably adds to the chaos of why there are so many cpu/mem queries.
Speakers
avatar for Sarah Hudspeth

Sarah Hudspeth

Solutions Architect, Chronosphere
Sarah is a datanerd who dabbled in data science and full-stack engineering before entering the Observability space as a Sales Engineer. She is now a Solutions Architect at Chronosphere and enjoys talking PromQL and how best to use telemetry data to scale and optimize your cloudnative... Read More →
Tuesday November 12, 2024 4:30pm - 4:40pm MST
Salt Palace | Level 2 | 255 B

4:30pm MST

Build a ChatGPT RAG Data Pipeline with RisingWave Stream Processor and Vector Store - Mary Grygleski, Callibrity & Rayees Pasha, RisingWave Labs
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Enter the exciting brave new world of GenAI, by building a ChatGPT Data Pipeline that leverages on RisingWave's efficient stream processing jobs for real-time data that we draw from an X (or Twitter feed) that's been enriched with vector data and similarity search.

We'll explore the exciting ChatGPT world, building an efficient data pipeline that's enriched with vector embeddings as stored in a vector DB (PgVector) , and how it can pair with the performant RisingWave cloud-based stream processor for its write job. We will illustrate a sample use case with live coding, as follows:

* Simulate a streaming data feed from X (or Twitter), we'll be using Kafka as the message broker for data ingestion
* RisingWave will consume the data stream, and perform data analysis
* Construct prompts based on the top 3 hashtags identified by RisingWave
* Prompts will be used for inferencing against a RAG-based BOT built with PgVector
Speakers
avatar for Rayees Pasha

Rayees Pasha

Chief Product Officer, RisingWave Labs
Rayees Pasha is the Chief Product Officer at RisingWave Labs, a startup pioneering the development of Stream Processing Database. His expertise is in the areas of data management and data analytics. He has held product management roles delivering enterprise software in both traditional... Read More →
avatar for Mary Grygleski

Mary Grygleski

AI Practice Lead, Callibrity
Mary is a Java Champion, and the AI Practice Lead at Callibrity, She started as an engineer in Unix/C, then transitioned to Java around 2000. After 20+ years of being a software engineer and technical architect, she discovered her true passion in developer and customer advocacy. Most... Read More →
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Salt Palace | Level 1 | 151 G

4:30pm MST

Stop Deploying Blind! Using Observability and Argo Rollouts to Light the Way - Kostis Kapelonis, Codefresh by Octopus Deploy
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Are you tired of looking at metrics and logs after each deployment? Do you learn about failed deployments from unhappy customers? Did you always want to deploy on Friday afternoon and go straight to the pub? Many teams perform “blind” deployments without any real insight into what will be affected by the new application version. Consequently, they don’t have enough data to understand the blast radius of a release and whether to decide if it was successful or not. Even companies that have several metrics in place, don’t always use them in an automated manner. Wouldn’t it be great if you could see user behavior with new features in real time and identify performance bottlenecks before a full release?

In this talk, we will focus on common scenarios regarding Argo Rollouts and observability metrics, we will explain:
  • Minimum requirements in terms of tools and metrics/traces/logs 
  • Well-known observability use cases
  • Common automation pitfalls
  • RED/USE metrics tradeoffs
Speakers
avatar for Kostis Kapelonis

Kostis Kapelonis

Developer Advocate, Codefresh by Octopus Deploy
Kostis is a software engineer/technical-writer dual class character. He lives and breathes automation, good testing practices and stress-free deployments with GitOps.
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Salt Palace | Level 2 | 251 AD
  ArgoCon, Progressive Delivery

4:30pm MST

IDPs For The Rest of Us: Maximizing Impact When You Aren't a Tech Giant - Eric Irwin, Quantum Metric
Tuesday November 12, 2024 4:30pm - 4:55pm MST
In todays platform engineering landscape, conventional wisdom suggests that the justification for building and maintaining an internal developer platform (IDP) is closely tied to the size of the engineering team; However, this overlooks critical factors such as platform complexity, integration needs, and the efficiency gains that can be realized even within smaller teams. Smaller teams now have the ability to stand on the shoulder of giants and build on a strong foundation which reduces the overhead and cost for building and maintaining an IDP, regardless of the size of your organization. But how do you begin and what does the path to successful implementation look like? Drawing from our own experiences, we’ll discuss how adopting Backstage has streamlined our development processes, cultivated a self-service culture, and enabled us to manage the complexities inherent in platform development.
Speakers
avatar for Eric Irwin

Eric Irwin

Senior Director of Engineering, Quantum Metric
As Director of Engineering at Quantum Metric, Eric has managed multiple Platform Engineering teams focused on Developer Experience, Site Reliability Engineering, and Platform Quality Engineering. He spearheaded the adoption of a self-serve platform, streamlined delivery processes... Read More →
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Salt Palace | Level 1 | Grand Ballroom H

4:30pm MST

Incremental GPU Slicing in Action - Abhishek Malvankar & Olivier Tardieu, IBM Research
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Large language models are often released as families of models with varying parameter counts and quantization. To reduce cost, inference services increasingly rely on dynamic model selection, preferring smaller models when possible. GPU vendors are on a journey to enable dynamic GPU slicing, making it possible for a workload to request a fraction of the compute and memory units in a GPU, and for the slices to be created and destroyed on demand without disrupting existing workloads. The onus is now on Kubernetes. The Device Management Working Group is hard at work to expose these capabilities. While vendor-agnostic slicing APIs do not exist yet, this talk demonstrates that incremental GPU slicing is possible today. We replace the Multi-Instance GPU manager, which only permits partitioning GPUs in bulk, with an open-source incremental-slicing controller without needing new APIs or changes to the device plugin. Come learn how to achieve incremental slicing in your GPU clusters.
Speakers
avatar for Abhishek Malvankar

Abhishek Malvankar

Senior Software Engineer, IBM Research
Abhishek is a Senior Software Engineer and Master Inventor at IBM Research and Co-chairs CNCF Batch System Initiative. He focuses on resource management, performance, and distributed computing for AI workloads in the cloud. Abhishek enjoys designing easy-to-use solutions for the cloud... Read More →
avatar for Olivier Tardieu

Olivier Tardieu

Principal Research Scientist, Manager, IBM
Dr. Olivier Tardieu is a Principal Research Scientist and Manager at IBM T.J. Watson, NY, USA. He joined IBM Research in 2007. His current research focuses on cloud-related technologies, including Serverless Computing and Kubernetes, as well as their application to Machine Learning... Read More →
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Salt Palace | Level 1 | Grand Ballroom A

4:30pm MST

From Sensors to Servers: Efficient Edge Computing with Akri and WebAssembly - Kate Goldenring, Fermyon & Yu Jin Kim, Microsoft
Tuesday November 12, 2024 4:30pm - 4:55pm MST
At the edge, thousands of sensors publish data. Servers on the edge must discover these devices and then continually process new data from them. Akri and SpinKube are open source projects that address these challenges, providing a seamless experience for connecting IoT devices to Kubernetes clusters at the edge. Akri discovers these devices and automatically deploys Pods to consume their data. SpinKube runs serverless WebAssembly (Wasm) applications as Pods on Kubernetes. Since Wasm has small binary sizes and sub-millisecond start up times, it's the ideal application type for resource constrained servers at the edge. This presentation will conclude with a demo of using Akri to discover MQTT-based devices and automatically deploy Wasm applications to the cluster. The deployed applications will be event-driven and triggered by new MQTT messages, ensuring precise resource usage. By the end of the talk, you'll have the tools to run efficient data processing applications at the edge.
Speakers
avatar for Kate Goldenring

Kate Goldenring

Senior Software Engineer, Fermyon
Kate Goldenring is a senior software engineer at Fermyon and serves as co-chair of the Cloud Native Computing Foundation IoT Edge Working Group. She is an open-source developer who is drawn to building the best of what’s to come, maintaining projects focused on serverless WebAssembly... Read More →
avatar for Yu Jin Kim

Yu Jin Kim

Product Manager, Microsoft
Yu Jin is a product manager at Microsoft working on IoT and Kubernetes at the edge. She is currently a maintainer of the CNCF Sandbox project Akri.
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Salt Palace | Level 2 | 250 A

4:30pm MST

Enabling Intelligent Observability Volume Management - Priyanka Naik, IBM & Vaishnavi Hire, Red Hat
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Ensuring observability is critical, especially as operations expand into edge cloud environments. Effective observability data volume management is crucial to benefit from observability while controlling costs (storage, network, processing). Naive data reduction (e.g.,changing all metrics freq from 30s to 5min, sampling traces) can harm SLAs. With Observability Volume Manager(OVM), IBM and RedHat have co-developed an open-source intelligent framework with dynamic, fine-grained control of observability data volume. It works across modalities and integrates with open-source tools like Prometheus and OTel. OVM performs dynamic transformations at edge like filtering/aggregating data to reduce volume as well as adjust metric frequency/log levels for risk based zoom in/out and issue triaging. OVM identifies highly correlated/computed metrics and performs intelligent pruning and given an observability budget, identifies the subset of metrics and frequencies to minimize downstream task impact.
Speakers
avatar for Vaishnavi Hire

Vaishnavi Hire

Sr. Software Engineer, Red Hat OpenShift AI, Red Hat, Inc
Hi, I am a Senior Software engineer at Red Hat, contributing to OpenShift AI. My contributions include designing and developing the OpenShift AI Platform integrations. Additionally, I am also a co-maintainer of the open source project Open Data Hub.
avatar for Priyanka Naik

Priyanka Naik

Research Scientist, IBM
Priyanka Naik is a Ph.D. from IIT Bombay, India with experience in networked system. She is working on multi-cloud aspects around edge observability. She is a speaker at multiple tutorials and a co-author to a cloud networking book. 
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Salt Palace | Level 2 | 255 E

4:40pm MST

Policy-as-Code for Infrastructure-as-Code with OPA and OpenTofu - Colin Lacy, Cisco
Tuesday November 12, 2024 4:40pm - 5:05pm MST
The problem: when managing OpenTofu configurations, a lot of manual work is involved. Will this update delete something it shouldn't? Was that cloud VM over-provisioned? Does this new resource follow the required naming convention?

The solution: set policies for OpenTofu state changes with policy-as-code, using Open Policy Agent (OPA).

In this session, attendees will:
- see how OPA's Rego policy language can prevent unwanted changes to their infrastructure
- learn how OpenTofu can inform OPA policy bundles to create dynamic policies
- watch a demo of an IaC deployment pipeline with policy enforcement and policy updates.
Speakers
avatar for Colin Lacy

Colin Lacy

Lead Software Engineer, Cisco
Colin Lacy is a Lead Software Engineer in Cisco Meraki's Hardware Analytics team, and a life-long believer in leveraging automation to standardize and simplify everyday tasks. He is an active mentor to junior developers, regularly contributes to OPA, and has taught multiple software... Read More →
Tuesday November 12, 2024 4:40pm - 5:05pm MST
Salt Palace | Level 2 | 250 D

4:45pm MST

⚡ Lightning Talk: Deep Dive: How Fluent Bit Collects File Logs - Braydon Kains, Google Cloud
Tuesday November 12, 2024 4:45pm - 4:55pm MST
Reading logs from a file is a very simple concept to understand, but implementing it is a whole other story. This lightning talk will dive into the full process of how Fluent Bit file logging works: detecting file changes, reading new lines from files, tracking state, handling file rotations, and more. This talk will be a brief glimpse into just one of the many deep challenges of implementing efficient observability data collection.
Speakers
avatar for Braydon Kains

Braydon Kains

Software Developer, Google Cloud
Braydon is a software developer at Google Cloud working on the Ops Agent. Under the GitHub username @braydonk you can find his contributions in Fluent Bit, OpenTelemetry repos, and various auxiliary repos. He is also the creator and maintainer of the yamlfmt tool.
Tuesday November 12, 2024 4:45pm - 4:55pm MST
Salt Palace | Level 2 | 255 B

5:00pm MST

⚡ Lightning Talk: Don't Get Blown up! Avoiding Configuration Gotchas for Tetragon Newbies - Pratik Lotia, Reddit
Tuesday November 12, 2024 5:00pm - 5:10pm MST
This talk will dive into five common configuration pitfalls that beginners encounter when using Tetragon for runtime observability on their workloads. We'll explore the implications of each gotcha and provide clear steps to avoid them. The talk will also cover best practices for configuring Tetragon in a Kubernetes environment.
Speakers
avatar for Pratik Lotia

Pratik Lotia

Senior Cloud Security Engineer, Reddit
Pratik Lotia is an infrastructure security engineer at Reddit, where he is responsible for building tools and processes for implementing security best practices for cloud native environments. He has extensive experience working on security projects for public & private clouds and... Read More →
Tuesday November 12, 2024 5:00pm - 5:10pm MST
Salt Palace | Level 1 | Grand Ballroom B
  Cilium + eBPF Day, Use Cases

5:00pm MST

⚡ Lightning Talk: Transform Your Kubernetes Cluster Into a GenAI Platform: Get Ready-to-Use LLM APIs Today! - Kenji Kaneda, CloudNatix Inc.
Tuesday November 12, 2024 5:00pm - 5:10pm MST
Are you eager to fine-tune LLMs and run inference directly within your Kubernetes clusters? Do you want an API compatible with OpenAI to leverage the extensive GenAI ecosystem? If so, LLMariner (https://llmariner.ai) is what you need. It instantly builds a software stack that provides an OpenAI-compatible API for inference, fine-tuning, and model management. In this talk, we'll provide an overview of the LLMariner and showcase its capabilities through practical use cases. Join us to learn how you can leverage LLMariner to enhance Kubernetes for your Generative AI workflows.
Speakers
avatar for Kenji Kaneda

Kenji Kaneda

Chief Architect, CloudNatix Inc.
Kenji is a chief architect at CloudNatix and has been working on large-scale distributed systems - especially cluster management systems - for over ten years. Most recently, he was a Principal Engineer at Nvidia, responsible for developing their deep learning training platform and... Read More →
Tuesday November 12, 2024 5:00pm - 5:10pm MST
Salt Palace | Level 1 | Grand Ballroom A

5:00pm MST

⚡ Lightning Talk: Is OpenTelemetry Too Complicated to Get Started? - Pranay Prateek, SigNoz
Tuesday November 12, 2024 5:00pm - 5:10pm MST
In this talk I will highlight some of the key issues end users face getting started with OpenTelemetry, and some possible ways to solve for this. Some common questions we see are: 1. What is the right way to deploy otel collectors for my scale? 2. I have instrumented my applications but I don't see my telemetry data data ( esp. around tracing) 3. Questions around manual instrumentation or any use cases which are not covered by auto instrumentation are tough to get help and guidance on This is more of an introspective talk on what we can do better as a community and derives from our experience of helping 1000s of users get started with OpenTelemetry.
Speakers
avatar for Pranay Prateek

Pranay Prateek

Maintainer, SigNoz
Pranay is one of the maintainers at SigNoz, an open source APM. He loves working on open source and observability, and has deep interest in philosophy esp. around Existentialism He is one of the organisers of OpenTelemetry APAC discussion group meetings & has been speaker in events... Read More →
Tuesday November 12, 2024 5:00pm - 5:10pm MST
Salt Palace | Level 2 | 255 E

5:10pm MST

⚡ Lightning Talk: Optimizing Multi-Cluster Communication and Cost Efficiency with Admiral and Istio - Punakshi Chaand, Intuit Inc.
Tuesday November 12, 2024 5:10pm - 5:20pm MST
Running a multicluster Service Mesh with 1000s of microservices has several challenges. One is to keep the Istio sidecar configuration minimal and thereby improve pod density.Istio has several knobs to fine-tune this and& many of those are unexplored & underutilized. Another challenge,sharded apps,where every shard needs tuning or it gets a bloated superset of configurations.In this talk,we will share the insights & technical breakthroughs from Intuit’s Service Mesh journey.We'll dive into how the `exportTo` configuration, in conjunction with Admiral's advanced identity management,enabled us to efficiently manage Istio resources across 300 clusters with remarkable cost savings. We will discuss strategic use of identity sharding & discovery selectors in multi-tenant API GW,highlighting resource management & optimized sidecar configuration.If you want to run a resource & cost-effective multicluster multitenant Istio deployment, this session provides practical guidance & valuable lessons.
Speakers
avatar for Punakshi Chaand

Punakshi Chaand

Software Engineer II, Intuit Inc.
Punakshi specializes in Service Mesh at Intuit. She has developed deep expertise in Identity and Access Management through her roles at Red Hat and HSBC. At Intuit, she enhances service mesh capabilities by customizing various Golang-based control and data plane components to ensure... Read More →
Tuesday November 12, 2024 5:10pm - 5:20pm MST
Hyatt Regency | Level 2 | Salt Lake Ballroom C

5:15pm MST

⚡ Lightning Talk: How GitOps Changes Identity and RBAC Management - Alice Jones, Liatrio
Tuesday November 12, 2024 5:15pm - 5:25pm MST
ArgoCD pulls and applies resources from Git repositories, CI/CD pipelines act based on events emitted by repositories, and developers interact with and debug the resources managed by both. Unfortunately, GitOps controllers, CI/CD pipelines, and developers all have different identities and different RBAC models. How can we find harmony in these approaches and effectively manage permissions across them? We’ll show how we’re using OAuth2-Token Exchange (RFC 8693) with Dex and GitHub Teams with ArgoCD AppProjects to provide consistent permissions to repositories, people, and CI pipelines with just OIDC. We also show how we designed a just-in-time RBAC approach by programmatically managing the ArgoCD AppProjects based on the repository manifests.
Speakers
avatar for Alice Jones

Alice Jones

Lead DevOps Engineer, Liatrio
Alice Jones is a Lead DevOps Engineer at Liatrio, where she leads the company's internal Kubernetes platform team.
Tuesday November 12, 2024 5:15pm - 5:25pm MST
Salt Palace | Level 2 | 251 AD
  ArgoCon, Software Delivery

5:15pm MST

⚡ Lightning Talk: Applying Cilium at Edge with KubeEdge - Tomoya Fujita, Sony Corporation of America
Tuesday November 12, 2024 5:15pm - 5:25pm MST
Applications at edge environment can be platform dependent, complicated and distributed in regions, and the number of devices significantly increases. Our final goal is to create the infrastructure that can be applied to the entire environment crossing over the cloud and edge in common. Working with KubeEdge and Cilium, we are now successfully able to use Cilium with KubeEdge hosted nodes at edge environment. This means, enabling wireguard VPN with Cilium can provide the transparent network connectivity with the nodes running in the cloud infrastructure, so that edge nodes running at edge environment just appear to be a member of cluster system but with edge autonomy feature provided by KubeEdge. We would like to share our technical insights and experience with using Cilium at edge with KubeEdge, and what are the future development and contribution with Cilium community.
Speakers
avatar for Tomoya Fujita

Tomoya Fujita

Senior Staff Software Engineer, Sony Corporation of America
Software Engineer, Sony Corporation of America System software architect and developer in Sony Corporation R&D Center. A member of ROS(Robot Operating System) TSC(Technical Steering Committee): https://index.ros.org/doc/ros2/Governance/ Github: https://github.com/fujitatomoya
Tuesday November 12, 2024 5:15pm - 5:25pm MST
Salt Palace | Level 1 | Grand Ballroom B
  Cilium + eBPF Day, Use Cases

5:15pm MST

⚡ Lightning Talk: Cost Saving Strategies for Interactive AI Development - Shravan Achar, Apple
Tuesday November 12, 2024 5:15pm - 5:25pm MST
The interactive nature of Jupyter notebooks has made them indispensable tools for data scientists and AI researchers, facilitating exploratory data analysis, prototyping, and model development. However, managing the cost of resource-intensive computations at different stages of AI/ML lifecycle presents significant challenges. We leveraged Apache YuniKorn to design a resource management system tailored for notebook workloads, which incorporates fair sharing, user-specific policies and budget constraints to allocate computational resources efficiently while adapting for both data preparation and model training stages. And thanks to the extensibility of JupyterLab, we offer rich displays next to the Notebook enabling data scientists to introspect resource usage in real time. This session presents cost saving strategies for interactive development on Jupyter using Kubeflow for model training and Spark for data preparation with YuniKorn scheduler.
Speakers
avatar for Shravan Achar

Shravan Achar

Sr. Software Engineer, Apple
Shravan is a senior software engineer at Apple with a passion for open source technologies. With a background in Mathematics and Computer Science, their current interests include MLOps, Scheduling in AI and Jupyter Notebooks.
Tuesday November 12, 2024 5:15pm - 5:25pm MST
Salt Palace | Level 1 | Grand Ballroom A

5:15pm MST

⚡ Lightning Talk: The Merge Conflict - Conflict: Apply-Before-Merge vs. Traditional Continuous Deployment - Asaf Blubshtein, env0
Tuesday November 12, 2024 5:15pm - 5:25pm MST
In modern infrastructure management, Terraform and now it's open source fork, OpenTofu, has emerged as a leading tool for Infrastructure as Code (IaC), enabling efficient and reproducible infrastructure deployments.

This talk explores two distinct approaches to integrating OpenTofu into your deployment pipeline: the approve-before-merge approach using Atlantis and the traditional continuous deployment (CD) strategy where changes are applied post-merge.

We’ll delve into the workflows, benefits, and challenges of each method, examining how they handle collaboration, ensure compliance, and manage deployment risks. Attendees will gain insights into choosing the right strategy for their teams, balancing speed and safety in their Terraform deployments.
Speakers
avatar for Asaf Blubshtein

Asaf Blubshtein

DevOps Solution Architect, env0
Asaf Blubshtein is a Solution Architect at env0, helping DevOps teams optimize and scale their Infrastructure as Code operations. In previous roles he helped various customers adopt and integrate native cloud solutions. He is passionate about technology, technical design, and continuous... Read More →
Tuesday November 12, 2024 5:15pm - 5:25pm MST
Salt Palace | Level 2 | 250 D
 

Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.
  • AppDeveloperCon
  • ArgoCon
  • BackstageCon
  • Breaks
  • Cilium + eBPF Day
  • Cloud Native + Kubernetes AI Day
  • Cloud Native StartupFest
  • Cloud Native University
  • Data on Kubernetes Day
  • EnvoyCon
  • Istio Day
  • Kubernetes on Edge Day
  • Observability Day
  • OpenFeature Summit
  • OpenTofu Day
  • Platform Engineering Day
  • Registration
  • WasmCon