Salt Palace | Level 2 | 255 B
Tuesday, November 12
 

10:40am MST

When Things Go Sideways: Troubleshooting the OTel Operator - Adriana Villela, ServiceNow Cloud Observability & Reese Lee, New Relic
Tuesday November 12, 2024 10:40am - 11:05am MST
The OpenTelemetry (OTel) Operator is a great tool that helps make your life a little easier by managing OTel for you in your Kubernetes cluster, by:
  • Managing the deployment of the OpenTelemetry Collector
  • Managing the configuration of a fleet of OpenTelemetry Collectors via OpAMP integration
  • Injecting and configuring auto-instrumentation into your pods
But what happens when THINGS. DON’T. WORK??

In this talk, Adriana and Reese will cover:
  • An overview of the OTel Operator 
  • Common installation issues
  • Common auto-instrumentation issues
  • Common OTel Collector deployment issues
  • …and how to tackle them all
Attendees will walk away from this session with a better understanding of how they can leverage the Operator, and be empowered to use it with confidence.
Speakers

Reese Lee

Senior Developer Relations Engineer, New Relic
Reese Lee is a Senior Developer Relations Engineer at New Relic, where she is focused on enabling customers and colleagues on OSS via workshops, blog posts, and documentation. She enjoys figuring out solutions to technical problems, learning about interesting user stories and use...

Adriana Villela

Principal Developer Advocate, Dynatrace
Adriana Villela is a Principal Developer Advocate, helping companies achieve reliability greatness through Observability, SRE, & DevOps practices. Previously, she managed a Platform Engineering team & an Observability Practices team at Tucows. Adriana has worked at various large-scale...

11:15am MST

Perfect Match: Correlating Continuous Profiling with Distributed Tracing for Stronger Observability - Jonas Kunz & Christos Kalkanis, Elastic
Tuesday November 12, 2024 11:15am - 11:40am MST
Continuous profiling is a technique for collecting insight into production resource usage at stack-trace granularity. It is something SREs and other engineers can enable without changes to the app, or knowing how it was compiled. This year, this powerful signal and a polyglot eBPF profiling agent were added to OpenTelemetry. Our talk explores how an existing OpenTelemetry system is better with profiling, specifically how distributed tracing fits into the picture. You'll see both tools in action on Kubernetes, including cross-service requests and how correlating distributed traces and profiles lets you answer more questions, specifically about code-level causality. We'll show how to leverage this data for resource utilization and even for monitoring your carbon footprint. You'll leave with a concrete understanding of continuous profiling, how it relates to OpenTelemetry and how these tools combine to reduce time while adding more understanding of your Kubernetes workloads, from kernel to code.
Speakers

Christos Kalkanis

Elastic
Christos is a principal engineer at Elastic, a maintainer for the OpenTelemetry Profiling SIG and a co-author of the donated OpenTelemetry profiling agent previously known as the Elastic Universal Profiling agent. After more than a decade of focusing on cybersecurity offense he moved...

Jonas Kunz

Jonas Kunz, Elastic
I work (primarily) as a Java developer at Elastic, focusing on the Elastic APM Java agent and our Java OpenTelemetry Distribution. While I love the safety of managed languages, I also enjoy occasionally visiting the native side of things. I'm an active contributor to the OpenTelemetry...

12:55pm MST

Optimizing LLM Efficiency One Trace at a Time on Kubernetes - Aditya Soni, Forrester & Seema Saharan, Autodesk
Tuesday November 12, 2024 12:55pm - 1:20pm MST
LLMs are huge to deploy and manage. Have you ever wondered if there's a way to pinpoint exactly which parts of your code are draining resources, causing latency, and hurting performance? In a world where efficiency is crucial, dynamically inspecting application behavior and performance at runtime can be transformative. Join this session to explore how to leverage OpenTelemetry’s profiling feature to optimize LLM code at a much deeper level.

We'll cover how to:
1. Identify specific pieces of code that consume excessive CPU and memory, or cause memory leaks and OOM errors.
2. Improve LLM performance by understanding model behavior, reducing latency, and meeting SLAs and SLOs.
3. Achieve efficient deployments on Kubernetes, ensuring optimal resource utilization and cost savings.
Speakers

Seema Saharan

Site Reliability Engineer, CNCF Ambassador, Autodesk
Meet Seema, the tech whiz at Autodesk. She's not just about fixing things – she loves sharing what she knows! Whether speaking at cool events like GitLab Commit and GitHub Universe or breaking down tech on her YouTube channel, Seema makes the complicated stuff easy and fun. Join...

Aditya Soni

CNCF Ambassador, DevOps Engineer II, Forrester
Aditya Soni is a DevOps/SRE tech professional. He has worked with product- and service-based companies including Red Hat and Searce, and is currently at Forrester Research as a DevOps Engineer II. He holds AWS, GCP, Azure, Red Hat, and Kubernetes certifications. He is a CNCF Ambassador...

1:30pm MST

Observing the Future: Embracing OTEL in WebAssembly - Victor Adossi, Cosmonic
Tuesday November 12, 2024 1:30pm - 1:55pm MST
WebAssembly is the next platform for computing, and this time, we can have observability from day one. In building distributed WebAssembly on top of wasmCloud, we built in the full OpenTelemetry ("OTEL") trifecta: traces, metrics and logs.

**Along the way we found a new way to achieve the holy grail of observability — free application & backing service instrumentation.**

This talk will cover how we implemented OTEL in wasmCloud and the benefits and challenges we faced. In a live demo, you will learn how you can trace globally distributed applications written in different programming languages and connected to a variety of backend services.
Speakers

Victor Adossi

Backend Engineer, Cosmonic
Talk to me about Rust, WebAssembly, and building microscalers.

2:05pm MST

Enhancing Asynchronous Communication Observability with OpenTelemetry - Liudmila Molkova, Microsoft & Shivanshu Raj Shrivastava, SigNoz
Tuesday November 12, 2024 2:05pm - 2:30pm MST
The OTel community has been working on standardizing semantic conventions to correlate telemetry data from various systems. The Messaging SemConv aims to do this for commonly used queues like Kafka, RabbitMQ, and other systems. OTel instrumentations are adopting these conventions, but end users still face challenges with async messaging observability at scale. They struggle with questions like "how to trace message flow?", "how to correlate metrics with traces?", and "how to do capacity planning and cost optimizations based on telemetry data?". End-to-end visibility often remains a black box! In this session, through a demo, we'll delve deeper into async architecture to address these questions, demonstrate context propagation within queues, and show how to correlate traces and client- or broker-side metrics. Participants will gain hands-on experience with messaging instrumentation, learning how to achieve observability in both simple and complex asynchronous messaging scenarios.
Speakers

Liudmila Molkova

Principal Software Engineer, Microsoft
Liudmila Molkova is a Principal Software Engineer at Microsoft working on observability and Azure client libraries. She is a co-author of distributed tracing implementations across the .NET ecosystem including HTTP client instrumentation and Azure Functions. Liudmila is an active...

Shivanshu Raj Shrivastava

Founding Engineer, SigNoz
Shivanshu is a Founding Engineer at SigNoz, working on building an OTel-native observability product. He has a keen interest in deep tech and OSS. He is a CNCF ambassador and a member of CNCF projects like OTel, k8s, and Istio. He has had the opportunity to mentor contributors in...

2:40pm MST

Build-Time Auto-Instrumentation in Android - Jason Plumb, Splunk
Tuesday November 12, 2024 2:40pm - 3:05pm MST
This session provides an in-depth dissection and live demonstration of OpenTelemetry’s build-time auto-instrumentation for Android. Class-loading in the Android runtime poses significant challenges to instrumentation engineers looking for attach or injection points. Fortunately, developers can now use off-the-shelf OpenTelemetry tools to instrument their applications without the need to manually instrument or even alter their application code. We will cover the benefits of front-loading bytecode weaving to build time, the tools used by OpenTelemetry to accomplish this, and the types of rich client-side telemetry created through this approach.
Speakers

Jason Plumb

Software Engineer, Splunk
Jason Plumb (he/him) is a hacker, artist, experimenter, polyglot programmer, and dad from Portland, OR, USA. He is co-maintainer of OpenTelemetry Android and an approver in various OpenTelemetry Java projects. When not at work, Jason volunteers with Futel to install and maintain a...

3:20pm MST

Turn the Volume Down on Noisy Neighbors! - Sándor Guba, Axoflow
Tuesday November 12, 2024 3:20pm - 3:45pm MST
Using Kubernetes as a multi-tenant platform is becoming common practice. However, the fact that logs from different tenants are mixed on the same nodes makes it difficult. Learn how the Telemetry Controller (an abstraction layer on top of the OTel Collector) helps you tackle this problem. Routing from the edges provides isolation for the tenants and simplifies filtering and routing on a tenant level. Sandor will present different scenarios for intelligent routing and show how to build a smart aggregator on top of the Telemetry Controller.
Speakers

Sándor Guba

CTO, Axoflow
Sandor is a software engineer, CTO, and founder at Axoflow. His main field has always been observability and logging. He is a former co-founder at Banzai Cloud. He was responsible for observability and founded open-source projects like the Logging Operator and Thanos Operator. He...

3:55pm MST

How We Streamlined Our SDLC with Observability - from GitHub via Jenkins, Harbor, Argo to K8s - Michael Gläss & Andreas Grabner, Dynatrace
Tuesday November 12, 2024 3:55pm - 4:20pm MST
Like many enterprises, we at Dynatrace deal with multiple delivery streams and a variety of tooling used to develop, build, test and deliver our product to our customers. To scale better and keep our delivery practice resilient, we regularly need to streamline our processes. Getting from a requirement to the running solution typically involves many steps, often with various people and different systems. Distributed knowledge makes it hard to identify issues transparently and slows things down. Observability in the SDLC (Software Delivery Lifecycle) is our key enabler, giving us insights into resources, tools, techniques, staging, quality, rollout, etc. In this session, learn how we automatically trace every delivery value flow and identify bottlenecks, wait times, lead times and other characteristics with the help of cloud native observability. Observability tells us where to start with optimization(s) and how to apply a continuous improvement cycle to our delivery chain.
Speakers

Michael Gläss

Chief Product Architect, Dynatrace
Michael oversees the Dynatrace Product Architecture with a strong focus on Cloud technologies, Platform Engineering, enablement and continuous delivery. He joined Dynatrace in March 2022 with more than 25 years of hands-on software engineering experience, including the last 10+ years leading...

Andi Grabner

CNCF Ambassador and DevRel, Dynatrace
Andreas Grabner (@grabnerandi) has 20+ years of experience as a software developer, tester and architect and is an advocate for high-performing cloud-scale applications. He is a CNCF ambassador, contributor to the CNCF project Keptn and a DevRel for Dynatrace. Andreas is also a regular...

4:30pm MST

⚡ Lightning Talk: Vote on Your Favorite--Standardize This!--10 PromQL Queries That Measure CPU & Memory Usage - Sarah Hudspeth, Chronosphere
Tuesday November 12, 2024 4:30pm - 4:40pm MST
In the course of several SaaS migrations to OTel and Prom-based exporters, the search for the perfect PromQL CPU usage (and memory usage) query ended up with several candidates. This quick talk will walk through these queries, explain their differences, and offer the crowd a way to rate the queries and vote on which ones they think should be used. A quick search on Stack Overflow will also produce quite a few queries to use and prove the need for some crowdsourcing on what queries should be the standard. Let's let the people decide! Or at least let's understand the appropriateness of various queries versus other options; how container and various Prometheus exporter metrics work; and talk about the challenges of observability migrations with regard to translating queries from different vendors to open source telemetry. Queries include cAdvisor metrics, kube-state-metrics, and node exporter metrics, which also probably adds to the chaos of why there are so many CPU/memory queries.
Speakers

Sarah Hudspeth

Solutions Architect, Chronosphere
Sarah is a data nerd who dabbled in data science and full-stack engineering before entering the Observability space as a Sales Engineer. She is now a Solutions Architect at Chronosphere and enjoys talking PromQL and how best to use telemetry data to scale and optimize your cloudnative...

4:45pm MST

⚡ Lightning Talk: Deep Dive: How Fluent Bit Collects File Logs - Braydon Kains, Google Cloud
Tuesday November 12, 2024 4:45pm - 4:55pm MST
Reading logs from a file is a very simple concept to understand, but implementing it is a whole other story. This lightning talk will dive into the full process of how Fluent Bit file logging works: detecting file changes, reading new lines from files, tracking state, handling file rotations, and more. This talk will be a brief glimpse into just one of the many deep challenges of implementing efficient observability data collection.
Speakers

Braydon Kains

Software Developer, Google Cloud
Braydon is a software developer at Google Cloud working on the Ops Agent. Under the GitHub username @braydonk you can find his contributions in Fluent Bit, OpenTelemetry repos, and various auxiliary repos. He is also the creator and maintainer of the yamlfmt tool.
 
