Loading…
Attending this event?
Observability Day clear filter
Tuesday, November 12
 

9:00am MST

Observability Day | Welcome + Opening Remarks
Tuesday November 12, 2024 9:00am - 9:05am MST
Tuesday November 12, 2024 9:00am - 9:05am MST
Salt Palace | Level 2 | 255 EF
  Observability Day

9:10am MST

Observability Day Project Updates
Tuesday November 12, 2024 9:10am - 9:35am MST
Tuesday November 12, 2024 9:10am - 9:35am MST
Salt Palace | Level 2 | 255 EF
  Observability Day

9:45am MST

Profiling Mastery: Decode Your Data and Elevate Observability - Frederic Branczyk, Polar Signals
Tuesday November 12, 2024 9:45am - 10:10am MST
Profiling offers powerful insights, but many developers and engineers find them intimidating. This session aims to demystify profiling data, turning novices into experts.

In this talk, Frederic will cover:
  • Fundamentals of Profiling: What is profiling, and what can you gain from it? 
  • Decoding Flamegraphs: Learn how to read flamegraphs, turning raw data into actionable insights.
  • Real-World Applications: Discover how continuous profiling is revolutionizing observability within the Observability ecosystem. 
  • Hands-On Examples: Practical demonstrations of open-source profiling tools and techniques, helping you apply what you learn immediately. 
By the end of this talk, you will have the skills to make full use of profiling data, ensuring your applications run efficiently. Whether you're new to profiling or looking to refine your skills, this session will provide you with the knowledge to elevate your observability practices and become a cloud-native performance expert.
Speakers
avatar for Frederic Branczyk

Frederic Branczyk

Founder, Polar Signals
Frederic is the founder of Polar Signals. Before founding Polar Signals he was a senior principal engineer and main architect for all things Observability at Red Hat, joining through the CoreOS acquisition. Frederic is a Prometheus & Thanos maintainer and tenured as a tech lead for... Read More →
Tuesday November 12, 2024 9:45am - 10:10am MST
Salt Palace | Level 2 | 255 EF

10:15am MST

Sponsored Keynote: Collecting and Processing OTEL Data at Scale - Rob Skillington, Chronosphere
Tuesday November 12, 2024 10:15am - 10:20am MST
With the move to microservices and distributed systems, the amount of telemetry data that organizations need to collect and analyze has skyrocketed. This session will include different methods and best practices for collecting telemetry data with OpenTelemetry and Fluent Bit. In particular, we will talk about how to leverage Fluent Bit for custom processing of OpenTelemetry data at scale and a real-world use case from LinkedIn. 
Speakers
avatar for Rob Skillington

Rob Skillington

CTO & Co-Founder, Chronosphere
Rob is co-founder and CTO of Chronosphere, an observability platform built for Cloud Native. He is a distinguished engineer and open source advocate who led the metrics and monitoring team at Uber. In that role, he built M3DB, an open source time series database. Rob is a frequent... Read More →
Tuesday November 12, 2024 10:15am - 10:20am MST
Salt Palace | Level 2 | 255 EF
  Observability Day

10:25am MST

Sponsored Keynote: TBA, IBM
Tuesday November 12, 2024 10:25am - 10:30am MST
Tuesday November 12, 2024 10:25am - 10:30am MST
Salt Palace | Level 2 | 255 EF
  Observability Day

10:40am MST

But Wait! There's...Still More‽ - Observability Data Volumes and Strategies for Managing Them - Éamon Ryan, Grafana Labs
Tuesday November 12, 2024 10:40am - 11:05am MST
As we add more observability signals and points of insight into the software we design, we create more and more data. Data that has to be processed. Data that has to be stored. Data that has to be iterated over and retrieved. All of this increases stress on the system that stores it which naturally increases costs to run the system both in infrastructure, people power and more. This talk takes people through the history of how these volumes have grown over time, then moves into the current strategies and tradeoffs for managing them. It also offers a glimpse into what might come into this space in the future.
Speakers
avatar for Éamon Ryan

Éamon Ryan

Senior Principal Field Engineer, Grafana Labs
Éamon is a Senior Principal Field Engineer at Grafana Labs, where he builds and maintains internal and external environments, builds out advanced workshops, provides input on product use-cases and acts as a subject matter expert in some specific areas. He has held many customer-facing... Read More →
Tuesday November 12, 2024 10:40am - 11:05am MST
Salt Palace | Level 2 | 255 EF

10:40am MST

When Things Go Sideways: Troubleshooting the OTel Operator - Adriana Villela, ServiceNow Cloud Observability & Reese Lee, New Relic
Tuesday November 12, 2024 10:40am - 11:05am MST
The OpenTelemetry (OTel) Operator is a great tool that helps make your life a little easier by managing OTel for you in your Kubernetes cluster, by: Managing the deployment of the OpenTelemetry Collector Managing the configuration of a fleet of OpenTelemetry Collectors via OpAMP integration Injecting and configuring auto-instrumentation into your pods But what happens when THINGS. DON’T. WORK??

In this talk, Adriana and Reese will cover:
  • An overview of the OTel Operator 
  • Common installation issues
  • Common auto-instrumentation issues
  • Common OTel Collector deployment issues
  • * …and how to tackle them all
Attendees will walk away from this session with a better understanding of how they can leverage the Operator, and be empowered to use it with confidence.
Speakers
avatar for Reese Lee

Reese Lee

Senior Developer Relations Engineer, New Relic
Reese Lee is a Senior Developer Relations Engineer at New Relic, where she is focused on enabling customers and colleagues on OSS via workshops, blog posts, and documentation. She enjoys figuring out solutions to technical problems, learning about interesting user stories and use... Read More →
avatar for Adriana Villela

Adriana Villela

Sr. Staff Developer Advocate, ServiceNow Cloud Observability
Adriana Villela is a Sr. Developer Advocate, helping companies achieve reliability greatness through Observability, SRE, & DevOps practices. Before her current role, she managed a Platform Engineering team & an Observability Practices team at Tucows. Adriana has worked at various... Read More →
Tuesday November 12, 2024 10:40am - 11:05am MST
Salt Palace | Level 2 | 255 BC

11:15am MST

Perfect Match: Correlating Continuous Profiling with Distributed Tracing for Stronger Observability - Jonas Kunz & Christos Kalkanis, Elastic
Tuesday November 12, 2024 11:15am - 11:40am MST
Continuous profiling is a technique to collect stack trace granularity insight into production resource usage. It is something SREs and other engineers can enable without changes to the app, or knowing how it was compiled. This year, this powerful signal and a polyglot eBPF profiling agent were added to OpenTelemetry. Our talk explores how an existing OpenTelemetry system is better with profiling, specifically how distributed tracing fits into the picture. You'll see both tools in action on Kubernetes, including cross-service requests and how correlation of distributed traces and profiles let you answer more questions, specifically code level causality. We'll show how to leverage this data for resource utilization and even monitoring your carbon footprint. You'll leave with a concrete understanding of continuous profiling, how it relates to OpenTelemetry and how these tools combine to reduce time while adding more understanding of your Kubernetes workloads, from kernel to code.
Speakers
avatar for Jonas Kunz

Jonas Kunz

Jonas Kunz, Elastic
I work as (primarily) Java Developer at Elastic, focusing on the Elastic APM Java-agent and our Java OpenTelemetry Distribution. While I love the safety of managed languages, I also enjoy occasionally visiting the native side of things. I'm an active contributor to the OpenTelemetry... Read More →
Tuesday November 12, 2024 11:15am - 11:40am MST
Salt Palace | Level 2 | 255 BC

11:15am MST

Simplifying OpenTelemetry with Configuration - Alex Boten, Honeycomb & Jack Berg, New Relic
Tuesday November 12, 2024 11:15am - 11:40am MST
Learning how to use a new tool, and using it well enough to get value out of it quickly is really important. Configuring OpenTelemetry SDKs and instrumentation is essential, but programmatic configuration is cumbersome and environment variables are limited. OpenTelemetry file configuration is the community’s next generation config tool. Building on the original flattened environment variable based scheme, file configuration introduces tooling for configuring SDKs and instrumentation which accommodates the complexities of how people use OpenTelemetry in practice.

Come learn about:
  • What the OpenTelemetry Configuration Schema is and why it exists. 
  • How language agnostic file configuration provides OpenTelemetry adopters the tools needed to support complex scenarios.
  • How various components in OpenTelemetry have adopted the configuration schema.
  • Using a single configuration to configure OpenTelemetry SDKs in multiple languages and the Collector.
Speakers
avatar for Jack Berg

Jack Berg

Software Engineer, New Relic
I'm a member of OpenTelemetry Technical Committee and maintainer of OpenTelemetry Java. I've worked on a lot of components in OpenTelemetry, with an emphasis in metrics, logging, and configuration.
avatar for Alex Boten

Alex Boten

Staff Software Engineer, Honeycomb
Alex Boten is a staff software engineer that has spent the last ten years helping organizations adapt to a cloud-native landscape by mashing keyboards. From building core network infrastructure to mobile client applications and everything in between, Alex has first-hand knowledge... Read More →
Tuesday November 12, 2024 11:15am - 11:40am MST
Salt Palace | Level 2 | 255 EF

12:45pm MST

Panel: OpenTelemetry: Bridging Platform and Enablement - Daniel Gomez Blanco, Skyscanner; Ariel Valentin, GitHub; Hazel Weakly, Hachyderm; Suman Karumuri, Airbnb; Vijay Samuel, eBay
Tuesday November 12, 2024 12:45pm - 1:20pm MST
OpenTelemetry is everywhere, used by engineers in all roles. For telemetry data to provide effective observability it must permeate all areas of a software system, all the way up the domain-specific aspects that matter the most to end users. As a cross-cutting concern, it should be used within business logic to describe application internals. However, engineers in charge of developing new features are not always empowered with the modern observability practices supported by OpenTelemetry and, in a distributed environment, this may damage the overall observability of the system. In this panel, leaders from organizations at the forefront of this field take us through their experiences building platforms, tooling, enablement materials, and team topologies that allow them to scale adoption of OpenTelemetry best practices with minimal friction, and ensure that the telemetry data produced by their systems is of the highest quality, provides value, and maximizes return-on-investment.
Speakers
avatar for Daniel Gomez Blanco

Daniel Gomez Blanco

Principal Software Engineer at Skyscanner, OpenTelemetry Governance Committee Member, Skyscanner
Observability lead at Skyscanner, member of the OpenTelemetry Governance Committee, and author of "Practical OpenTelemetry: Adopting Open Observability Standards Across Your Organization". Throughout my career, my main focus has been reducing the cognitive load required to operate... Read More →
avatar for Suman Karumuri

Suman Karumuri

Principal Engineer, Airbnb
Suman Karumuri is a Sr. Staff Software Engineer and the tech lead for Observability at Slack. Suman Karumuri is an expert in distributed tracing and was a tech lead of Zipkin and a co-author of OpenTracing standard, a Linux Foundation project via the CNCF. Previously, Suman Karumuri... Read More →
avatar for Vijay Samuel

Vijay Samuel

Principal MTS, Architect, eBay
Vijay Samuel works with eBay's observability platform as its architect. During his time at eBay Vijay has transformed eBay's observability platform into a cloud native offering that is primarily built on top of open source technologies. He loves to code in Go and play video games... Read More →
avatar for Ariel Valentin

Ariel Valentin

Staff Software Engineer, GitHub
Staff Software Engineer on the Observability Infrastructure Team at GitHub and OpenTelemetry Ruby Contrib maintainer. Ariel has been a champion for Open Standards his entire career and is leading the effort to adopt OpenTelemetry at GitHub since 2020.
avatar for Hazel Weakly

Hazel Weakly

Infrastructure Witch, hachyderm.io
Hazel spends her days working on building out teams of humans as well as the infrastructure, systems, and tooling to make life better for others. She’s worked at a variety of companies and knows that the hardest problems to solve are the social ones. One of her favorite things is... Read More →
Tuesday November 12, 2024 12:45pm - 1:20pm MST
Salt Palace | Level 2 | 255 EF

12:55pm MST

Optimizing LLM Efficiency One Trace at a Time on Kubernetes - Aditya Soni, Forrester & Seema Saharan, Autodesk
Tuesday November 12, 2024 12:55pm - 1:20pm MST
LLMs are huge to deploy and manage. Have you ever wondered if there's a way to pinpoint exactly which parts of your code are draining resources, causing latency, and hurting performance? In a world where efficiency is crucial, dynamically inspecting application behavior and performance at runtime can be transformative. Join this session to explore how to leverage OpenTelemetry’s profiling feature to optimize LLM code at a much deeper level.

We'll cover how to:
1. Identify specific pieces of code that consume excessive CPU and memory, or cause memory leaks and OOM errors.
2. Improve LLM performance by understanding model behavior, reducing latency, and meeting SLAs and SLOs.
3. Achieve efficient deployments on Kubernetes, ensuring optimal resource utilization and cost savings.
Speakers
avatar for Seema Saharan

Seema Saharan

Site Reliability Engineer, CNCF Ambassador, Autodesk
Meet Seema, the tech whiz at Autodesk. She's not just about fixing things – she loves sharing what she knows! Whether speaking at cool events like GitLab Commit, and GitHub Universe or breaking down tech on her YouTube channel, Seema makes the complicated stuff easy and fun. Join... Read More →
avatar for Aditya Soni

Aditya Soni

DevOps Engineer ll, CNCF Ambassador, Forrester
Aditya Soni is a DevOps/SRE tech professional He worked with Product and Service based companies including Red Hat, Searce, and is currently positioned at Forrester Research as a DevOps Engineer II. He holds AWS, GCP, Azure, RedHat, and Kubernetes Certifications.He is a CNCF Ambassador... Read More →
Tuesday November 12, 2024 12:55pm - 1:20pm MST
Salt Palace | Level 2 | 255 BC

1:30pm MST

Observing the Future: Embracing OTEL in WebAssembly - Victor Adossi, Cosmonic
Tuesday November 12, 2024 1:30pm - 1:55pm MST
WebAssembly is the next platform for computing, and this time, we can have observability from day one. In building distributed WebAssembly on top of wasmCloud, we built in the full OpenTelemetry ("OTEL") trifecta: traces, metrics and logs.

**Along the way we found a new way to achieve the holy grail of observability — free application & backing service instrumentation.**

This talk will cover how we implemented OTEL in wasmCloud and the benefits and challenges we faced. In a live demo, you will learn how you can trace globally distributed applications written in different programming languages and connected to a variety of backend services.
Speakers
avatar for Victor Adossi

Victor Adossi

Backend Engineer, Cosmonic
Talk to me about Rust, WebAssembly, and building microscalers.
Tuesday November 12, 2024 1:30pm - 1:55pm MST
Salt Palace | Level 2 | 255 BC

1:30pm MST

Where’s the Auto in Auto-Instrumentation? a Look at Current Automation Strategies with OTel - Joshua Lee, Altinity
Tuesday November 12, 2024 1:30pm - 1:55pm MST
“Automatic Instrumentation” can mean a lot of things depending on context. Whether we’re discussing the Instrumentation SDKs or full-kernel observability with eBPF, the promise is the same: end-to-end observability coverage with no custom code and minimal setup. First, I will review how the different mechanisms available for automatic instrumentation work within each of the 11 languages supported by OpenTelemetry. I’ll examine:
  • How code-path instrumentation works at the library level by diving into the Node.js OpenTelemetry Extension and the JavaScript libraries it supports 
  • Automatic instrumentation via attachment with Java and Python
  • Automatic instrumentation injection using the OTel Operator for DotNet, Java, and NodeJS
Finally, I’ll take a peek at the future of automatic instrumentation of compiled binaries with a look at the Go instrumentation library built using eBPF.
Speakers
avatar for Josh Lee

Josh Lee

Open Source Developer Advocate, Altinity
Joshua is a seasoned software developer with over a decade of experience, specializing in a broad range of topics including operations, observability, agile methodologies, and accessibility. Currently, Joshua serves as a Developer Advocate for Altinity, where he creates educational... Read More →
Tuesday November 12, 2024 1:30pm - 1:55pm MST
Salt Palace | Level 2 | 255 EF

2:05pm MST

Enhancing Asynchronous Communication Observability with OpenTelemetry - Liudmila Molkova, Microsoft & Shivanshu Raj Shrivastava, SigNoz
Tuesday November 12, 2024 2:05pm - 2:30pm MST
OTel community has been working on standardizing semantic conventions to correlate telemetry data from various systems. The Messaging SemConv aims to solve it for commonly used queues like Kafka, RabbitMQ, and others systems. OTel instrumentations are adopting these conventions, but the end users still face challenges with async messaging observability at scale. They struggle with questions like "how to trace message flow?", "how to correlate metrics with traces?", "how to do capacity planning and cost optimizations based on telemetry data?". The end-to-end visibility often remains a black box! In this session, through a demo, we'll delve deeper into async architecture to address these questions, demonstrate context propagation within queues, and show how to correlate traces and client or broker-side metrics. Participants will gain hands-on experience with messaging instrumentation, learning how to achieve observability in both simple and complex asynchronous messaging scenarios.
Speakers
avatar for Liudmila Molkova

Liudmila Molkova

Principal Software Engineer, Microsoft
Liudmila Molkova is a Principal Software Engineer at Microsoft working on observability and Azure client libraries. She is a co-author of distributed tracing implementations across the .NET ecosystem including HTTP client instrumentation and Azure Functions. Liudmila is an active... Read More →
avatar for Shivanshu Raj Shrivastava

Shivanshu Raj Shrivastava

Founding Engineer, SigNoz
Shivanshu is a Founding Engineer at SigNoz, working on building an OTeL native observability product. He has a keen interest in deep tech and OSS. He is a CNCF ambassador and a member of CNCF projects like OTeL, k8s, and Istio. He has got the opportunity to mentor contributors in... Read More →
Tuesday November 12, 2024 2:05pm - 2:30pm MST
Salt Palace | Level 2 | 255 BC

2:05pm MST

Measure What Matters - Jamie Danielson, Honeycomb
Tuesday November 12, 2024 2:05pm - 2:30pm MST
Have you ever had an alert go off that you immediately ignore? It’s a nuisance alert, not actionable, but you keep it around just in case. Have you ever looked at your trace waterfall and wondered what exactly happened during that gap that just doesn’t drill down deep enough to surface meaningful details? Do you know the feeling where you have just enough information to sort of monitor what’s going on in your systems, but not quite enough to put your mind at ease or feel like you know where to look when things go wrong? These experiences are almost universally known, something that everyone can relate to at some point in their observability journey. This session will help tackle common pitfalls, offering strategic ways to improve your observability. Armed with a mix of tailored custom instrumentation and well-reasoned SLOs, you’ll walk away able to help yourself and your team sleep better at night knowing you are measuring what matters and reducing the noise of unhelpful alerts.
Speakers
avatar for Jamie Danielson

Jamie Danielson

Senior Software Engineer, Honeycomb
Jamie is a Senior Software Engineer at Honeycomb where she works on instrumentation libraries. She is an active contributor to multiple OpenTelemetry projects, and is an approver for OpenTelemetry JavaScript. When she’s not working she’s playing dek hockey.
Tuesday November 12, 2024 2:05pm - 2:30pm MST
Salt Palace | Level 2 | 255 EF

2:40pm MST

Build-Time Auto-Instrumentation in Android - Jason Plumb, Splunk
Tuesday November 12, 2024 2:40pm - 3:05pm MST
This session provides an in-depth dissection and live demonstration of OpenTelemetry’s build-time auto-instrumentation for Android. Class-loading in the Android runtime poses significant challenges to instrumentation engineers looking for attach or injection points. Fortunately, developers can now use off-the-shelf OpenTelemetry tools to instrument their applications without the need to manually instrument or even alter their application code. We will cover the benefits of front-loading bytecode weaving to build time, the tools used by OpenTelemetry to accomplish this, and the types of rich client-side telemetry created through this approach.
Speakers
avatar for Jason Plumb

Jason Plumb

Software Engineer, Splunk
Jason Plumb (he/him) is a hacker, artist, experimenter, polyglot programmer, and dad from Portland, OR, USA. He is co-maintainer of OpenTelemetry Android and an approver in various OpenTelemetry java projects. When not at work, Jason volunteers with Futel to install and maintain a... Read More →
Tuesday November 12, 2024 2:40pm - 3:05pm MST
Salt Palace | Level 2 | 255 BC

2:40pm MST

Unlocking Advanced Processing Capabilities with OpenTelemetry - Aunsh Chaudhari, Splunk & Anthony Mirabella, AWS
Tuesday November 12, 2024 2:40pm - 3:05pm MST
OpenTelemetry stands at the forefront of modern observability, providing a robust framework for collecting, processing, and exporting telemetry data in complex distributed systems. In this session, we will go over how you can benefit from the processing capabilities of OpenTelemetry, presenting an exploration of different features and their potential impact on optimizing observability. We will talk through some real world examples and best practices allowing you to control costs, reduce data volume, balance overhead, and focus on critical telemetry data. By the end of this session, you will know how to transform, filter, sample and route data effectively at scale with OpenTelemetry.
Speakers
avatar for Anthony Mirabella

Anthony Mirabella

Senior Software Dev Engineer, AWS
Anthony is a Senior SDE at Amazon Web Services working on Open Source data collection tools and services. He has contributed to the OpenTelemetry project since 2019 in a variety of roles.
avatar for Aunsh Chaudhari

Aunsh Chaudhari

Senior Product Manager, Splunk
Aunsh is a Product Manager at Splunk focused on helping users adopt Observability with OpenTelemetry. He has a keen interest in cloud native technologies and open source projects. Prior to Splunk, Aunsh worked closely with engineering and product at Shutterfly on the User Media Assets... Read More →
Tuesday November 12, 2024 2:40pm - 3:05pm MST
Salt Palace | Level 2 | 255 EF

3:20pm MST

The Road to Observability Everywhere at Monday.Com - David Gohberg, monday.com
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Our team at monday.com has experienced explosive growth over the past four years. We prioritized enabling tracing and observability across our distributed app. However, we did not have robust end-to-end tests. Building reliable end-to-end tests for distributed apps is hard! We decided to use OpenTelemetry tracing for end-to-end testing as well and adhering to our “Observability Everywhere” mantra! In this talk we’ll showcase how we implemented trace-driven testing with Tracetest and OpenTelemetry. This improved our feedback cycle by 50%, uncovered issues that traditional tests missed and decreased test creation time by 90%! Attendees will also learn how to apply this new approach to various use-cases, such as API tests, browser end-to-end testing and even testing in production. Trace-driven testing works by leveraging the work already invested in instrumenting our systems via distributed tracing. Sounds fun? Join me to learn how we did it at monday.com!
Speakers
avatar for David Gohberg

David Gohberg

Senior Software Engineer, monday.com
David Gohberg is a Senior Software Engineer at monday.com where he works on distributed trace testing while building the best CRM on the planet. Before that, he worked at Dropbox where he improved tracing capabilities in high scale workloads. He has worked across the stack, from embedded... Read More →
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Salt Palace | Level 2 | 255 EF

3:20pm MST

Turn the Volume Down on Noisy Neighbors! - Sándor Guba, Axoflow
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Using Kubernetes as a multi-tenant platform is becoming a general concept. However, the fact that different logs are mixed on the same nodes makes it difficult. Learn how the Telemetry Controller (an abstract layer on top of the OTel Collector) helps you tackle the first problem. Routing from the edges provides isolation for the tenants and simplifies filtering and routing on a tenant level. Sandor will present different scenarios to do intelligent routing and how to build a smart aggregator on top of the Telemetry Controller.
Speakers
avatar for Sándor Guba

Sándor Guba

CTO, Axoflow
Sandor is a software engineer, CTO, and founder at Axoflow. His main field has always been observability and logging. He is a former co-founder at Banzai Cloud. He was responsible for observability and founded open-source projects like the Logging Operator and Thanos Operator. He... Read More →
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Salt Palace | Level 2 | 255 BC

3:55pm MST

Distributed Tracing: All the Warning Signs Were Out There!!! - Vijay Samuel, eBay
Tuesday November 12, 2024 3:55pm - 4:20pm MST
Distributed tracing is hard! That is a statement that most folks would agree too. At large organizations like eBay where there are 1000s of microservices and call chains that have 100s or even 1000s of spans, the complexity to adopt tracing is fairly high. Over the course of the last couple of years, we have traveled far in getting to a state where most applications are instrumented for tracing. However, the journey has numerous hurdles that we have had to cross. This talk discusses, a solid approach we took with the help of Open Telemetry, how it jump started our adoption but at the same time, how we had to do more to get much closer to the finish line.

Key points that we would discuss include:
  • Our strategy to deliver instrumentation to all applications 
  • Challenges in context propagation
  • Doing more than what open telemetry alone provides
  • Experiences required to make tracing useful
  • Focus for the future
Speakers
avatar for Vijay Samuel

Vijay Samuel

Principal MTS, Architect, eBay
Vijay Samuel works with eBay's observability platform as its architect. During his time at eBay Vijay has transformed eBay's observability platform into a cloud native offering that is primarily built on top of open source technologies. He loves to code in Go and play video games... Read More →
Tuesday November 12, 2024 3:55pm - 4:20pm MST
Salt Palace | Level 2 | 255 EF

3:55pm MST

How We Streamlined Our SDLC with Observability - from GitHUB via Jenkins, Harbor, Argo to K8S - Michael Gläss & Andreas Grabner, Dynatrace
Tuesday November 12, 2024 3:55pm - 4:20pm MST
Like many enterprises, we at Dynatrace deal with multiple delivery streams and various tooling to be used to develop, build, test and deliver our product to our customers. To better scale and have a resilient delivery practice, we regularly need to streamline the processes. From a requirement till the running solution, it typically involves many steps, often various people and different systems. Distributed knowledge makes it hard to transparently identify issues and slows things down. Observability in SDLC (Software Delivery Lifecycle) is our key enabler that gives us insights into resources, tools, techniques, staging, quality, rollout, etc. In this session learn how we automatically trace every delivery value flow, identify bottlenecks, wait times, lead times and other characteristics with the help of cloud native observability. Observability tells us where to start with optimization(s) and how to apply a continuous improvement cycle to our delivery chain.
Speakers
avatar for Michael Gläss

Michael Gläss

Chief Product Architect, Dynatrace
Michael is overseeing the Dynatrace Product Architecture with a strong focus on Cloud technologies, Platform Engineering, enablement and continuous delivery. He joined Dynatrace in March 2022 with more than 25 years hands on software engineering including the last 10+ years leading... Read More →
avatar for Andi Grabner

Andi Grabner

CNCF Ambassador and DevRel, Dynatrace
Andreas Grabner (@grabnerandi) has 20+ years of experience as a software developer, tester and architect and is an advocate for high-performing cloud scale applications. He is a CNCF ambassador, contributor to the CNCF project keptn and a DevRel for Dynatrace. Andreas is also a regular... Read More →
Tuesday November 12, 2024 3:55pm - 4:20pm MST
Salt Palace | Level 2 | 255 BC

4:30pm MST

Designing for Observability - George Hantzaras, MongoDB
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Observability is a key aspect of modern software design, allowing teams to monitor and understand their systems in real-time. But designing for observability can be challenging, with a wide range of tools and best practices to consider. In this presentation, we'll cover strategies for designing systems that are observable, meaning that they can be monitored and understood in real-time. We'll explore the best practices and tools for designing systems that are observable, with a focus on practical tips and techniques that teams can implement in their own projects. We'll also dive into topics such as instrumentation, data collection, and analysis, as well as emerging trends in observability tools and approaches. Whether you're a software engineer, architect, or DevOps professional, this presentation will provide valuable insights and practical takeaways on how a systems design and architecture can play a role in the system's observability and how to implement best practices.
Speakers
avatar for George Peter Hantzaras

George Peter Hantzaras

Director of Engineering, MongoDB
George is a hands-on engineering leader with a focus on delivering Enterprise cloud services at scale. He is a Director of Engineering at MongoDB, focusing on implementing cloud native technologies at enterprise scale. He is also and angel investor and has worked as a Venture Partner... Read More →
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Salt Palace | Level 2 | 255 BC

4:30pm MST

Enabling Intelligent Observability Volume Management - Priyanka Naik, IBM & Vaishnavi Hire, Red Hat
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Ensuring observability is critical, especially as operations expand into edge cloud environments. Effective observability data volume management is crucial to benefit from observability while controlling costs (storage, network, processing). Naive data reduction (e.g.,changing all metrics freq from 30s to 5min, sampling traces) can harm SLAs. With Observability Volume Manager(OVM), IBM and RedHat have co-developed an open-source intelligent framework with dynamic, fine-grained control of observability data volume. It works across modalities and integrates with open-source tools like Prometheus and OTel. OVM performs dynamic transformations at edge like filtering/aggregating data to reduce volume as well as adjust metric frequency/log levels for risk based zoom in/out and issue triaging. OVM identifies highly correlated/computed metrics and performs intelligent pruning and given an observability budget, identifies the subset of metrics and frequencies to minimize downstream task impact.
Speakers
avatar for Vaishnavi Hire

Vaishnavi Hire

Sr. Software Engineer, Red Hat OpenShift AI, Red Hat, Inc
Hi, I am a Senior Software engineer at Red Hat, contributing to OpenShift AI. My contributions include designing and developing the OpenShift AI Platform integrations. Additionally, I am also a co-maintainer of the open source project Open Data Hub.
avatar for Priyanka Naik

Priyanka Naik

Research Scientist, IBM
Priyanka Naik is a Ph.D. from IIT Bombay, India with experience in networked system. She is working on multi-cloud aspects around edge observability. She is a speaker at multiple tutorials and a co-author to a cloud networking book. 
Tuesday November 12, 2024 4:30pm - 4:55pm MST
Salt Palace | Level 2 | 255 EF

5:00pm MST

⚡ Lightning Talk: Is OpenTelemetry Too Complicated to Get Started? - Pranay Prateek, SigNoz
Tuesday November 12, 2024 5:00pm - 5:10pm MST
In this talk I will highlight some of the key issues end users face getting started with OpenTelemetry, and some possible ways to solve for this. Some common questions we see are: 1. What is the right way to deploy otel collectors for my scale? 2. I have instrumented my applications but I don't see my telemetry data data ( esp. around tracing) 3. Questions around manual instrumentation or any use cases which are not covered by auto instrumentation are tough to get help and guidance on This is more of an introspective talk on what we can do better as a community and derives from our experience of helping 1000s of users get started with OpenTelemetry.
Speakers
avatar for Pranay Prateek

Pranay Prateek

Co-founder, SigNoz
Pranay is one of the maintainers at SigNoz, an open source APM. He loves working on open source and observability, and has deep interest in philosophy esp. around Existentialism He is one of the organisers of OpenTelemetry APAC discussion group meetings & has been speaker in events... Read More →
Tuesday November 12, 2024 5:00pm - 5:10pm MST
Salt Palace | Level 2 | 255 EF

5:00pm MST

⚡ Lightning Talk: Vote on Your Favorite--Standardize This!--10 PromQL Queries That Measure CPU & Memory Usage - Sarah Hudspeth, Chronosphere
Tuesday November 12, 2024 5:00pm - 5:10pm MST
In the course of several SaaS migrations to Otel and Prom-based exporters, the search for the perfect PromQL CPU Usage (and Mem Usage) query ended up with several candidates. This quick talk will walk through these queries, explain their differences, and offer the crowd a way to rate the queries and vote on which ones they think should be used. A quick search on Stack Overflow will also produce quite a few queries to use and prove the need for some crowdsourcing on what queries should be the standard. Let's let the people decide! Or at least let's understand the appropriateness of various queries versus other options; how container and various prometheus exporter metrics work; and talk about the challenges of observability migrations with regards to translating queries from difference vendors to Open Source telemetry. Queries include cadvisor metrics, kubestate metrics, and node exporter metrics -- which also probably adds to the chaos of why there are so many cpu/mem queries.
Speakers
avatar for Sarah Hudspeth

Sarah Hudspeth

Solutions Architect, Chronosphere
Sarah is a datanerd who dabbled in data science and full-stack engineering before entering the Observability space as a Sales Engineer. She is now a Solutions Architect at Chronosphere and enjoys talking PromQL and how best to use telemetry data to scale and optimize your cloudnative... Read More →
Tuesday November 12, 2024 5:00pm - 5:10pm MST
Salt Palace | Level 2 | 255 BC

5:15pm MST

⚡ Lightning Talk: Deep Dive: How Fluent Bit Collects File Logs - Braydon Kains, Google Cloud
Tuesday November 12, 2024 5:15pm - 5:25pm MST
Reading logs from a file is a very simple concept to understand, but implementing it is a whole other story. This lightning talk will dive into the full process of how Fluent Bit file logging works: detecting file changes, reading new lines from files, tracking state, handling file rotations, and more. This talk will be a brief glimpse into just one of the many deep challenges of implementing efficient observability data collection.
Speakers
avatar for Braydon Kains

Braydon Kains

Software Developer, Google Cloud
Braydon is a software developer at Google Cloud working on the Ops Agent. Under the GitHub username @braydonk you can find his contributions in Fluent Bit, OpenTelemetry repos, and various auxiliary repos. He is also the creator and maintainer of the yamlfmt tool.
Tuesday November 12, 2024 5:15pm - 5:25pm MST
Salt Palace | Level 2 | 255 BC

5:15pm MST

⚡ Lightning Talk: Observability Migration Worst Case Scenarios - Paige Cruz, Chronosphere
Tuesday November 12, 2024 5:15pm - 5:25pm MST
Do you remember the first observability migration you shepherded? I do…and not because it went smoothly. Now that I have some distance from this disaster I can share the mistakes I made from hazy requirements to evaluate platforms to issues interpreting vendor pricing plans and underestimating the importance of developer enablement. The challenges were endless and while the migration did eventually finish I would do almost everything differently if I could go back in time. Let my mistakes guide you away from a messy migration by learning what NOT to do when migrating observability platforms.
Speakers
avatar for Paige Cruz

Paige Cruz

Principal Developer Advocate, Chronosphere
Paige Cruz is a Principal Developer Advocate at Chronosphere passionate about cultivating sustainable on-call practices and bringing folks their aha moment with observability. She started as a software engineer at New Relic before switching to Site Reliability Engineering holding... Read More →
Tuesday November 12, 2024 5:15pm - 5:25pm MST
Salt Palace | Level 2 | 255 EF

5:25pm MST

Observability Day | Closing Remarks
Tuesday November 12, 2024 5:25pm - 5:30pm MST
Tuesday November 12, 2024 5:25pm - 5:30pm MST
Salt Palace | Level 2 | 255 EF
  Observability Day
 

Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.
  • AppDeveloperCon
  • ArgoCon
  • BackstageCon
  • Breaks
  • Cilium + eBPF Day
  • Cloud Native AI + Kubernetes Day
  • Cloud Native StartupFest
  • Cloud Native University
  • Data on Kubernetes Day
  • EnvoyCon
  • Istio Day
  • Kubernetes on Edge Day
  • Observability Day
  • OpenFeature Summit
  • OpenTofu Day
  • Platform Engineering Day
  • WasmCon