Loading…
strong>ArgoCon [clear filter]
Tuesday, November 12
 

11:15am MST

Breaking the 1.5MB Barrier: Running Large Metaflow Flows with Argo for AI/ML Workloads - Saurabh Garg, Outerbounds
Tuesday November 12, 2024 11:15am - 11:40am MST
Managing large-scale batch workflows efficiently is critical for AI/ML workloads. Data preparation for training or fine tuning models can involve a large number of steps. These make for excellent Argo workflows. But Argo faces the etcd limitation of the 1.5MB object size. This limitation restricts the ability of Argo to run truly large-scale workflows. This talk will delve into the intricacies of this limitation and its impact on AI/ML workflows. We will illustrate with examples how this has been a non-deterministic and frustrating bottleneck for users. To address this challenge, Argo introduced a feature that circumvents the etcd object size restriction. By offloading the bulk of the workflow status to an RDBMS and only storing the reference in etcd, Argo maintains its scaling capabilities still adhering to Kubernetes' limitations. This talk will provide a comprehensive guide on configuring and utilizing the Argo offloading feature in AWS using Aurora Postgres RDS and EKS.
Speakers
avatar for Saurabh Garg

Saurabh Garg

Senior Software Engineer, Outerbounds, Inc.
Tuesday November 12, 2024 11:15am - 11:40am MST
Salt Palace | Level 2 | 254 B
  ArgoCon, Data Processing

1:10pm MST

Data Science Workflows Made Easy: Python-Powered Argo for Your Organization - Elliot Gunton, Pipekit Inc. & Flaviu Vadan, Xaira Therapeutics
Tuesday November 12, 2024 1:10pm - 1:35pm MST
Get ready to supercharge your data science workflow development using the power of Hera, the versatile Python SDK for Argo Workflows.
  • Python for Everything: Learn how Hera lets you focus on your business logic and seamlessly integrate it into Argo Workflows – all within your favorite Python environment.
  • Effortless Argo with Hera: Use Hera to craft Argo Workflows with ease using simple Python code that handles common tasks such as template parameters, passing data, and fan-out.
  • Beyond the Basics: We'll explore how Hera provides a base for your organization to build on, using its advanced capabilities, including pre-build hooks, that empower you to configure Hera for your organization's specific needs.
  • Boost your CICD: Learn best practices to build Python workflows efficiently with Hera. Automate developer setups and recurring CICD tasks. 
By the end of this talk, you'll be equipped to supercharge your Argo Workflows with Hera to unlock a new level of automation and efficiency!
Speakers
avatar for Elliot Gunton

Elliot Gunton

Senior Software Engineer, Pipekit Inc
Elliot is a passionate maintainer of Hera, the Python SDK for Argo Workflows. At Pipekit, he is helping to bring scalable data pipelines to the Python world, unlocking the full potential of Argo Workflows for data scientists. Previously, at Bloomberg, Elliot supported Machine Learning... Read More →
avatar for Flaviu Vadan

Flaviu Vadan

Senior Software Engineer, Xaira Therapeutics
TBD
Tuesday November 12, 2024 1:10pm - 1:35pm MST
Salt Palace | Level 2 | 254 B
  ArgoCon, Data Processing

2:10pm MST

Argonauts of Data: Building Scalable and Effective Data Pipelines - Satabrata Paul & Nishchith Shetty, Atlan
Tuesday November 12, 2024 2:10pm - 2:35pm MST
Atlan is a collaborative workspace for data teams that offers functionality like metadata cataloging and data lineage amongst others. Atlan provides connector integrations which ingest metadata from various data sources. As the data estate volume hit a massive scale, the platform encountered performance drags with ETL pipelines impacting resiliency, processing runtimes and efficiency. The existing architecture suffered pipeline failures encompassing computation and storage exhaustion, and parallel and concurrent processing pit-falls with troubling spikes in workflow failure rates. In this talk, Satabrata and Nishchith will share how they leveraged Argo’s parallelization techniques with robust re-try mechanisms and effective artifactory loading to ingest 100 Million assets achieving a 450% reduction in processing time. This improvisation also helped them process 3 Million SQL Queries in just 2 hours reducing overall pipeline runtime by 50% and having Argo-powered horizontal scale-out.
Speakers
avatar for Satabrata Paul

Satabrata Paul

Software Engineer II, Atlan
Satabrata Paul is a seasoned Data Engineer specializing in Backend Systems and CI/CD methodologies to optimize connector integrations for robust data cataloging. At Atlan, he is a part of the Metadata Marketplace team crafting solutions for data asset discovery and lineage. Satabrata... Read More →
avatar for Nishchith Shetty

Nishchith Shetty

Software Engineer, Platform Team, Atlan
Nishchith Shetty is a Software Engineer, part of the Platform Engineering Team at Atlan. He currently lives in San Jose, California. In the past, he has contributed to several open-source projects like Numaflow, CLTK, ScanCode, and Linux Foundation. Nishchith recently graduated from... Read More →
Tuesday November 12, 2024 2:10pm - 2:35pm MST
Salt Palace | Level 2 | 254 B
  ArgoCon, Data Processing

4:30pm MST

Demystifying Argo Events: An Architectural Deep Dive - JP Zivalich, Pipekit Inc. & Becky Pauley, Venafi Jetstack Consult
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Are you new to Argo Events and trying to get your head around how everything works? Join us for an in-depth exploration of the Argo Events architecture, designed to transform you from a novice to a seasoned Argo Events maestro. In this session, we'll embark on a journey through the workings of Argo Events, unraveling the mysteries of the event bus, event source and sensors. We'll delve into their roles and responsibilities, and understand how they go about wrangling events within your Kubernetes cluster. This is the talk that we wished we had heard when we started on our Argo Events journey. Whether you're a seasoned Kubernetes practitioner or a newcomer to the Argo Events realm, this session will provide you with invaluable insights that will elevate your understanding and mastery of this powerful eventing engine.
Speakers
avatar for Becky Pauley

Becky Pauley

Solutions Engineer, Venafi Jetstack Consult
A self-taught engineer and career changer, I finally made the leap from teaching to tech several years ago. I’ve since worked in various Platform and Cloud Engineering roles, with a focus on Kubernetes best practices and Cost-Optimisation. As well as all things Cloud Native, I’m... Read More →
avatar for J.P. Zivalich

J.P. Zivalich

Cofounder, CTO, Pipekit Inc
Cofounder & CTO of Pipekit.
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Salt Palace | Level 2 | 254 B
  ArgoCon, Data Processing
 

Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.
  • AppDeveloperCon
  • ArgoCon
  • BackstageCon
  • Breaks
  • Cilium + eBPF Day
  • Cloud Native + Kubernetes AI Day
  • Cloud Native StartupFest
  • Cloud Native University
  • Data on Kubernetes Day
  • EnvoyCon
  • Istio Day
  • Kubernetes on Edge Day
  • Observability Day
  • OpenFeature Summit
  • OpenTofu Day
  • Platform Engineering Day
  • Registration
  • WasmCon