CNCF-hosted Co-located Events Europe 2025 takes place on 1 April. This event is happening in person at Excel London in London, England.

The Sched app allows you to build your schedule, but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon Europe 2025, and have an All-Access pass in order to participate in the sessions.

To view the full event schedule for a specific CNCF-hosted Co-located event, you can use the right-hand navigation bar to sort and filter.

The schedule is subject to change.
Tuesday, April 1
 

09:00 BST

Observability Day | Welcome + Opening Remarks
Tuesday April 1, 2025 09:00 - 09:05 BST
Level 1 | Hall Entrance S10 | Room B
  Observability Day

09:10 BST

Observability Day | Project Updates
Tuesday April 1, 2025 09:10 - 09:55 BST
Level 1 | Hall Entrance S10 | Room B
  Observability Day

10:00 BST

Sponsored Keynote: The Future of Observability: Trends, AI, and New Relic’s Vision for a Smarter Stack - Harry Kimpel, New Relic
Tuesday April 1, 2025 10:00 - 10:05 BST
As cloud-native development accelerates, observability is no longer a nice-to-have, but a necessity. This session explores key trends shaping the observability space, including the role of AI in transforming monitoring practices, the rise of open standards like OpenTelemetry, and how platforms like New Relic are adapting to meet the needs of developers and SREs who need to monitor traffic, microservices, cloud infrastructure, or AI/LLM integrations. Join us for a brief overview of the future of observability and how to stay ahead in a rapidly evolving industry.
Speakers
Harry Kimpel

Principal Developer Relations Engineer, New Relic
Passionate software craftsman with 25+ years experience in a broad spectrum of development technologies and platforms. Main focus on cloud-native software architectures and all major cloud environments. Passion for model-driven development, application modernization and Dapr. Observing...
Level 1 | Hall Entrance S10 | Room B
  Observability Day

10:10 BST

Sponsored Keynote: TBA, Chronosphere
Tuesday April 1, 2025 10:10 - 10:15 BST
Level 1 | Hall Entrance S10 | Room B
  Observability Day

10:30 BST

AM Break 3
Tuesday April 1, 2025 10:30 - 10:40 BST
TBA

10:45 BST

Beyond Root Cause Analysis for K8s: What Your Logs Should Be Telling You - Ronit Belson, Sawmills
Tuesday April 1, 2025 10:45 - 11:10 BST
Logs are a goldmine of information, which is why they have become the backbone of business-critical monitoring and observability systems. Yet like gold, mining this value requires significant effort – sifting through endless entries is time-consuming, tiring, and costly. To reduce operational overhead and minimize mean time to resolution (MTTR), this talk explores advanced techniques for log summarization, offering methods to reduce log volume without losing critical insights for modern cloud-native and K8s environments.

We'll discuss the use of key attributes and metrics to make logs more meaningful and enable more rapid root cause analysis (RCA). The presentation will demonstrate semantic log understanding using advanced AI and contextual log analysis powered by large language models (LLMs) to automatically extract actionable insights and understand the deeper context of system behaviors and application flows.
Speakers
Ronit Belson

CEO & Co-Founder, Sawmills
Ronit Belson is a seasoned tech executive and entrepreneur, currently serving as the Co-Founder and CEO of Sawmills. With over two decades of experience, Ronit has a proven track record of scaling startups and driving growth. She has held key leadership roles, including COO at Testim.io...
Level 1 | Hall Entrance S10 | Room C

10:45 BST

Operationalizing Observability: The Invisible Parts - Hazel Weakly, Nivenly Foundation
Tuesday April 1, 2025 10:45 - 11:10 BST
Observability is increasingly becoming a differentiator in developer experience and in sustaining system health. Despite this, vendor bills are higher than ever, infrastructure budgets are lower, and the ROI is increasingly harder to sell.

Join Hazel in this talk as she takes you through:
- What it takes to get buy-in, what that looks like, and when not to
- What happens after procurement: Going from zero to day one
- Achieving ROI beyond the engineering function

This talk will be packed full of actionable insights for employees wanting to implement observability, vendors wanting to sell, and OSS projects working to make the system better.
Speakers
Hazel Weakly

Fellow, Nivenly Foundation
Hazel spends her days working on building out teams of humans as well as the infrastructure, systems, and tooling to make life better for others. She’s worked at a variety of companies and knows that the hardest problems to solve are the social ones. One of her favorite things is...
Level 1 | Hall Entrance S10 | Room B

11:20 BST

From Logs To Insights: Real-time Conversational Troubleshooting for Kubernetes With GenAI - Tiago Reichert & Lucas Duarte, AWS
Tuesday April 1, 2025 11:20 - 11:45 BST
In today’s distributed microservices landscape, Kubernetes environments generate vast volumes of logs, making troubleshooting complex and time-consuming. Operators often sift through massive data to identify issues, leading to prolonged downtime, a challenge that intensifies with multiple clusters. Discover how GenAI optimizes troubleshooting by transforming traditional logs into conversational insights. This session covers building an AI-driven observability solution with Large Language Models (LLMs). We start by configuring Fluent Bit collectors to gather systemd logs, Kubernetes events, and application logs, which are then streamed to scalable object storage. By constructing a vector database, we enable users to query and interact with logs in natural language. We will provide a step-by-step guide that equips attendees with actionable knowledge to implement GenAI observability in their Kubernetes clusters.
Speakers
Tiago Reichert

Sr. Specialist SA, Containers, AWS
Tiago is a Solutions Architect at AWS, focused on helping startups across Latin America to optimize their container strategies. With a deep passion for Containers, DevOps, and SaaS, he collaborates with businesses to design scalable and efficient cloud solutions. Tiago also actively...
Lucas Duarte

Sr. Specialist SA, Containers, AWS
Lucas is a Sr. Containers Specialist SA at AWS, dedicated to supporting ISV customers in AMER through AWS Container services. Beyond his Solutions Architect role, Lucas brings extensive hands-on experience in Kubernetes and DevOps leadership. He's been a key contributor to multiple...
Level 1 | Hall Entrance S10 | Room C

11:20 BST

Observability by Design: Leveraging OpenTelemetry Weaver To Take Control of Your Observability - Josh Suereth, Google & Laurent Querel, F5
Tuesday April 1, 2025 11:20 - 11:45 BST
Ever curious what labels are on a metric or trace, but can't find the documentation? Ever upgrade a service and find all your observability dashboards and alerts broken? Ever wish you had a tool to prevent developers or OpenTelemetry Collector config from breaking the signals you rely on?

In this talk we'll show how OpenTelemetry Weaver solves these problems. OpenTelemetry Weaver is the tool which powers OpenTelemetry Semantic Conventions.

This session will explore how to use OpenTelemetry Weaver to:
- Define your Observability signals
- Automatically generate documentation, code and tests, etc.
- Enforce compatibility guarantees
- Extend it for your own needs
Speakers
Josh Suereth

Staff Software Engineer, Google
An enthusiastic supporter of Open-Source, Modern Application design and hipster programming languages. Previously, the author of Scala In Depth, I'm currently involved in OpenTelemetry technical committee, driving the Semantic Conventions effort. I spend my off hours on Scala and...

Laurent Querel

Distinguished Engineer, F5
Author of the OpenTelemetry Protocol with Apache Arrow Specification. Maintainer of the OTel Arrow repository. Author of the Application Telemetry Schema Specification and maintainer of the OTel Weaver project.
Level 1 | Hall Entrance S10 | Room B

11:40 BST

Lunch Break 2
Tuesday April 1, 2025 11:40 - 12:55 BST
TBA

12:55 BST

Encrypted Telemetry in Transit - Jason Plumb, Splunk
Tuesday April 1, 2025 12:55 - 13:20 BST
In some high-security or compliance-regulated environments, we sometimes need to guarantee that data is encrypted while in transit. In this talk, we will show you how to accomplish this using existing off-the-shelf OpenTelemetry components. We will walk you through the detailed technical process of generating certificates and configuring the Collector to receive encrypted telemetry from the OpenTelemetry Java Instrumentation agent.
Speakers
Jason Plumb

Software Engineer, Splunk
Jason Plumb (he/him) is a hacker, artist, experimenter, polyglot programmer, and dad from Portland, OR, USA. He is co-maintainer of OpenTelemetry Android and an approver in various OpenTelemetry java projects. When not at work, Jason volunteers with Futel to install and maintain a...
Level 1 | Hall Entrance S10 | Room B

12:55 BST

From Sampling To Full Visibility: Scaling Tracing To Trillions of Spans - Sonam Gupta, SigLens & Sudeep Kumar, Salesforce Inc
Tuesday April 1, 2025 12:55 - 13:20 BST
Salesforce handles billions of transactions daily, generating over 50 trillion spans. These transactions represent a complex ecosystem. Failure of even a single transaction can leave users frustrated while everything appears “green” in the system. Why? Sampling-based tracing often misses such edge cases.
In this talk, we unveil how Salesforce overcame this challenge by enabling 100% sampling for critical flows, all while keeping costs low. We’ll share our groundbreaking migration from Zipkin to OTel and the lessons learned along the way.
Discover how we equipped Salesforce developers with a 360-degree view of service and API performance. With OTel, they can now pinpoint RED metrics, diagnose issues faster, and achieve visibility beyond the limits of logs and metrics.
We’ll also dive into the backend challenges of scaling OTel, from managing high data volumes to optimizing storage and query performance. We’ll share the pros and cons of various approaches, and our experiences with open-source tools.
Speakers
Sonam Gupta

Software Engineer, SigLens
Sonam Gupta is a Software Engineer at SigLens. With a B.Tech in Information Technology, she works with the UI team to build great user experiences.

Sudeep Kumar

Principal Engineer, Salesforce Inc
A Principal Engineer at Salesforce, having 18+ years of experience in building scalable distributed systems managing petabytes of data daily. He has led architecture of cloud-native SaaS solutions across E-commerce, Embedded systems, & Telecom. A speaker at global conferences, Sudeep...
Level 1 | Hall Entrance S10 | Room C

13:30 BST

OpenTelemetry at Delivery Hero: The Good, the Bad and the Vendor-Agnostic - Elena Kovalenko, Delivery Hero
Tuesday April 1, 2025 13:30 - 13:55 BST
In this session I’m going to take you on a journey of Delivery Hero - an international food delivery company - towards the goal of becoming observability vendor-independent. This journey will be long and hard and will take our hero through the forest of the unstable OpenTelemetry contrib components and the swamp of high memory and CPU consumption. It will require them to find new allies to overcome the challenge of routing for stateful collector components in non-federated environments and fight the metric temporality conversion monsters. I am going to demonstrate what keeps our hero motivated after all these hurdles and why they are still convinced that OpenTelemetry is the right tool for them to accomplish this mission.
Speakers
Elena Kovalenko

Principal Software Engineer, Delivery Hero
Elena is a Swiss Army knife of an engineer. Whether backend or data engineering, MLOps or DevOps - she’s been there and she’s been there at scale. At the moment of writing she enjoys navigating technical and organisational complexity as a Principal Software Engineer at Delivery...
Level 1 | Hall Entrance S10 | Room C

13:30 BST

SLOs as an Organisational 'Check Engine' Light - Cail Young, Octopus Deploy
Tuesday April 1, 2025 13:30 - 13:55 BST
In this talk, Cail will explore a multi-year process of how the Octopus Deploy team tried to tame the reliability of the 18,000+ CI tests for their monolithic product. We begin with a company just starting to rapidly grow, and will end with a warts-and-all look at how SLOs not only tell you when action is required, but also when the right thing to do is sit back and watch things fail. We'll talk about goal conflicts, prioritisation, safety models, and professional growth.
Speakers
Cail Young

Senior SRE, Octopus Deploy
Cail has spent the last couple of decades working at the intersection of people and technology: in the performing arts, in the motion picture industry, and now in the field of software operations. He is fascinated by learning from incidents - large and small - and will gladly trade...
Level 1 | Hall Entrance S10 | Room B

14:05 BST

From Chaos To Clarity: Scaling Observability at Dropbox With Centralized Logging Solution - Alok Ranjan, Dropbox Inc
Tuesday April 1, 2025 14:05 - 14:30 BST
At Dropbox, managing observability for systems producing terabytes of logs daily posed a unique challenge. Initially, developers accessed logs by logging into individual servers, a process further complicated when we moved to containers. Containers are short-lived, which caused logs to disappear on termination. This highlighted the need for a scalable, persisted, centralized solution using open-source tools.

In this session, I’ll discuss our journey to build a robust observability framework centered on Loki as our logging solution. Scaling Loki to Dropbox’s data volume required extensive optimizations for reliable, efficient query performance. I’ll cover our deployment, challenges, and strategies for achieving high-performance logging.

We also integrated Grafana to unify logs and metrics in a single view, enhancing troubleshooting and security. Join us to learn how Dropbox scaled its observability with open-source solutions and key lessons from our experience.
Speakers
Alok Ranjan

Software Engineering Manager, Dropbox Inc
Hello, I’m Alok Ranjan, an Engineering Manager focused on observability and reliability in high-scale systems. Recently, I led the implementation of Dropbox’s first unstructured logging solution using Loki, centralizing log access and optimizing query performance for terabytes...
Level 1 | Hall Entrance S10 | Room C

14:05 BST

Putting the Experience in UX: The Importance of Making Data Accessible - Adriana Villela, Dynatrace & Tim Banks, Caylent
Tuesday April 1, 2025 14:05 - 14:30 BST
Every organization, regardless of what they do, collects and relies on data. Our focus on Observability has largely been relegated to software engineering, operations, and other IT-focused roles. As such, data needs for sales, marketing, finance, HR, support, and other roles and groups are often overlooked or feel inaccessible to them. It only takes a good experience with something to change your relationship with it.

In this talk, Adriana and Tim will use real-life experiences and everyday examples to show how giving people a good experience with something with which they are unfamiliar can ignite a curiosity and appreciation. They will also explain why giving these experiences to everyone who collects and utilizes data can allow them to leverage it in new ways, unlocking new insights, new innovations, new approaches to everything they do.
Speakers
Adriana Villela

Principal Developer Advocate, Dynatrace
Adriana Villela is a Principal Developer Advocate, helping companies achieve reliability greatness through Observability, SRE, & DevOps practices. Previously, she managed a Platform Engineering team & an Observability Practices team at Tucows. Adriana has worked at various large-scale...

Tim Banks

Staff Solutions Architect, Caylent
Tim’s tech career spans over 25 years through large corporate environments and in small startups, honing his skills in systems administration, automation, architecture, and operations for large cloud-based datastores. Today, Tim leverages his years in data, DevOps, and Site Reliability...
Level 1 | Hall Entrance S10 | Room B

14:40 BST

From Splunk To OTEL: Scaling Observability at MSCI With a Four-Person Team - Aftab Khan & Zach Arnold, MSCI
Tuesday April 1, 2025 14:40 - 15:05 BST
Migrating observability infrastructure for a 5,000-person financial services company is daunting enough - doing it with just four engineers might seem impossible. This session details MSCI's journey from a traditional Splunk infrastructure to a modern cloud-native observability stack built on OpenTelemetry, Prometheus, Jaeger, Grafana, and Elasticsearch. We'll share our architectural decisions, implementation strategy, and critical lessons learned while maintaining observability during the transition. Through real-world examples, we'll demonstrate how we overcame scaling challenges, managed the cultural shift, and achieved better visibility while significantly reducing costs. Learn practical strategies for planning your own observability migration, including how to phase the transition, train teams effectively, and avoid the pitfalls we encountered.
Speakers
Aftab Khan

Vice President - Shared Services Engg, MSCI
Aftab Khan is a Vice President at MSCI Inc. and a Certified Azure Solutions Architect Expert with over 10 years of experience in software development and cloud technologies. He specializes in Kubernetes, monitoring solutions, and DevOps practices, with deep expertise in tools like...

Zach Arnold

Executive Director Index Engineering, MSCI, Inc
Zach Arnold is Executive Director of Index Engineering at MSCI Inc., where he architects next-gen Kubernetes platforms in hybrid cloud environments. A Kubernetes contributor since 2018, he has transformed multiple organizations' engineering cultures through cloud-native practices...
Level 1 | Hall Entrance S10 | Room C

14:40 BST

Transmissions From the Tweet Factory: How Observability Transformed Mobile Performance at Twitter - Hanson Ho, Embrace
Tuesday April 1, 2025 14:40 - 15:05 BST
Twitter realized a decade ago that the biggest performance issues impeding usage occurred outside its data centers. But observability, taken for granted by backend devs and SREs, didn’t exist meaningfully on Android and iOS. Issues that don’t show up in profilers or end in crashes were practically invisible.

Stone by stone, the team built tooling that performantly and judiciously extracted telemetry on the client side. Using this newly discovered treasure trove of failure points and bottlenecks, performance was greatly improved – and in a verifiable way that shows how it can directly impact company KPIs like user growth and revenue.

Hanson Ho was there in 2015. For 7+ years, he helped build observability into the Android app and org. Listen as he describes how observability changed mobile at Twitter: what was recorded, how it was used, what results were achieved – and how the lessons learned can be applied by anyone that operates mobile apps, both in the tech and in the org.
Speakers
Hanson Ho

Android Architect, Embrace (this is not a talk about the company)
Hanson was the former Tech Lead of Android Performance and Stability at Twitter, where he spent a lot of time on collecting and interpreting performance data in order to improve the app experience for all Twitter users on all Android devices all around the world. He is now at Embrace...
Level 1 | Hall Entrance S10 | Room B

15:05 BST

PM Break 2
Tuesday April 1, 2025 15:05 - 15:20 BST
TBA

15:20 BST

Customize Your Own OpenTelemetry Collector: An Introduction To OCB - Evan Bradley, Dynatrace & Pablo Baeyens, Datadog
Tuesday April 1, 2025 15:20 - 15:45 BST
Do you enjoy using the OpenTelemetry Collector, but can’t find a distribution with the right set of components included? Do you want to write your own components for that niche use case that only you have? Or maybe you just want a Collector that has your name on it? Give the OpenTelemetry Collector Builder (OCB) a try!

OCB is developed by the Collector maintainers and is purpose-built for easily building your own Collector. This session will cover the basics of how OCB works, then will cover a wide range of use cases, including creating release pipelines using OCB, publishing Docker images, hotfixing upstream components when a change is needed immediately, and using your own components. To tie it all together, we’ll also show how OCB is used in the wild to publish popular Collector distributions.
Speakers
Evan Bradley

Senior Software Engineer, Dynatrace
Evan helps maintain the OpenTelemetry Collector, where he is a primary contributor to the OpenTelemetry Transformation Language (OTTL), and helps drive adoption of the OpenTelemetry Agent Management Protocol (OpAMP) to enable users to manage fleets of Collectors. Evan has a background...

Pablo Baeyens

Software Engineer, Datadog
Pablo Baeyens is a Senior Software Engineer working at Datadog. He lives in Granada, Spain and since late 2020 he has been involved in the OpenTelemetry project, where he is part of the OpenTelemetry Governance Committee and maintains the OpenTelemetry Collector. Outside of open source...
Level 1 | Hall Entrance S10 | Room C

15:20 BST

The Art and Craft of No-touch Instrumentation - Michele Mancioppi, Dash0
Tuesday April 1, 2025 15:20 - 15:45 BST
OpenTelemetry automatic instrumentations greatly help with achieving a high baseline of observability for applications. But applying them in containerised environments still requires manual, error-prone intervention by users. LD_PRELOAD-based injection is an advanced technique that has been used by commercial vendors for a decade or more, is by now very well understood, and does not require access to the Kernel like eBPF does (which is a huge challenge in managed compute environments), or any increase in permissions other than what the host runtime has.

In this talk, we go in depth into how this magic works across various runtimes, and how it could benefit the OpenTelemetry community at large.
Speakers
Michele Mancioppi

Head of Product, Dash0
Michele has been a product manager in the observability space for the best part of a decade. Former staff engineer turned PM, he likes to code tracing instrumentation, Kubernetes operators and Infrastructure-as-Code integrations for observability almost as much as asking "Why would...
Level 1 | Hall Entrance S10 | Room B

15:55 BST

Demystifying Monitoring and Debugging on Windows Containers - Mansi Kulkarni, Red Hat & Ritika Gupta, Microsoft
Tuesday April 1, 2025 15:55 - 16:20 BST
Struggling to monitor and debug Windows containers on Kubernetes? You’re not alone!
Unlike Linux, managing Windows workloads often feels like solving a puzzle with missing pieces. But it doesn’t have to be this way.
With Kubernetes now supporting HostProcess containers on Windows nodes, a lot more monitoring and troubleshooting is now possible.
We’ll show you how to implement a complete monitoring stack using Windows Exporter on a Windows node in Kubernetes. We’ll start by exploring the metric collectors in Windows Exporter, their functionalities, and practical use. Next, we’ll demonstrate deploying Windows Exporter as a HostProcess pod, configuring a ServiceMonitor, and setting up Prometheus to collect and visualize metrics.
Finally, we’ll elevate your debugging game by exploring the newly added kubectl debug support for Windows nodes, enabling you to diagnose and resolve issues faster at the node level. You'll be ready to troubleshoot Windows nodes in no time.
Speakers
Mansi Kulkarni

Senior Software Engineer, Red Hat
Mansi is a Senior software engineer at Red Hat, where she brings her expertise to the Windows Containers project on the OpenShift platform. As an active contributor to Kubernetes SIGs like SIG-Windows and SIG-Instrumentation, she is deeply involved in the ecosystem. She has also worked...

Ritika Gupta

Software Engineer, Microsoft
With a knack for transforming chaos into seamless solutions Ritika Gupta creates technologies to bind Kubernetes, Windows Containers and Azure ecosystem leveraging cloud native tooling. She actively contributes to Kubernetes as an sig-windows member. At Microsoft, Ritika works on...
Level 1 | Hall Entrance S10 | Room C

15:55 BST

OTel-y Oops: Learning From Our Observability Blunders - Joe Stephenson & Rodney Karemba, Akamai Technologies
Tuesday April 1, 2025 15:55 - 16:20 BST
Learn how, in our quest for observability, we accidentally sent 300 million time series to our stack, repeatedly crashed our OTel agents due to cascading failures, and generally made our Observability SREs' lives miserable. In this talk, we'll share the key lessons learned from our missteps, including best practices for scaling observability in complex systems, avoiding common pitfalls, and building resilient monitoring pipelines (using OTel, VictoriaMetrics, Loki and Tempo). Join us to understand how a combination of over-ambitious instrumentation and lack of foresight can lead to chaos, and how to prevent it in your own organization. By the end, you'll have actionable insights to optimize your observability strategy without breaking the system (or the team).
Speakers
Joe Stephenson

Principal Software Engineer, Akamai Technologies
Joe is a seasoned expert in cloud native technologies. They specialize in solving complex problems at scale, seamlessly navigating the realms of observability, developer experience and user-facing services

Rodney Karemba

Platform Engineer, Akamai Technologies
Rodney is a Platform Engineer at Akamai Technologies, focused on the internal developer platform for storage at scale. Rodney has background experience with Linux, container security, Kubernetes and working with enterprise customers to solve their cloud native challenges.
Level 1 | Hall Entrance S10 | Room B

16:30 BST

Beyond the Ephemeral: Mastering Serverless Metrics at Scale With Shopify - Pedro Tanaka & Filip Petkovski, Shopify
Tuesday April 1, 2025 16:30 - 16:55 BST
Explore a cutting-edge approach to enhancing observability for serverless applications on solutions like Google Cloud Run and Cloud Functions. This session delves into creating a scalable metrics pipeline using Shopify's internal app platform for seamless container configuration and an ingestion system capable of handling millions of datapoints per minute.

We'll dive into the architecture featuring OpenTelemetry collectors as sidecars and regional workloads to ingest and manage metrics with varying temporality models. Discover how we integrated OTLP with our ingestion layer, transforming exponential histograms into DD Sketches for optimal performance and accuracy.

Gain insights into the challenges and solutions in building this comprehensive observability pipeline. This talk provides valuable lessons for teams aiming to enhance serverless monitoring in Kubernetes environments, leveraging Shopify's philosophy of efficient, resilient, and cost-effective cloud utilization.
Speakers
Pedro Tanaka

Production Engineer, Shopify
Pedro is an engineer working in Production Engineering at Shopify. Currently working on Cloud Observability, he values upstream participation and contributes to open-source projects related to Kubernetes and Cloud Native technologies, like Thanos and KEDA. Outside of work he is a...

Filip Petkovski

Staff Production Engineer, Shopify
Staff Production Engineer, Thanos metrics maintainer.
Level 1 | Hall Entrance S10 | Room C

16:30 BST

The State of OpenTelemetry Profiling - Damien Mathieu, Elastic & Felix Geisendörfer, Datadog
Tuesday April 1, 2025 16:30 - 16:55 BST
Join us in exploring OpenTelemetry profiling's evolution and future. We'll examine the changes in the OTLP profiling protocol, highlighting how moving beyond pprof wire compatibility enables individual profiling events with timestamps and thread timeline visualizations.
Discover the practical implementation of profiling across the observability stack using the OTel Collector. Learn about its sophisticated capabilities in receiving, processing, and exporting profile data, along with data augmentation tools and community contribution opportunities.
This collaborative presentation from the Profiling SIG showcases achievements and future directions in profiling, offering practical insights for comprehensive profiling solutions.
Speakers
Damien Mathieu

Principal Engineer at Elastic and OpenTelemetry Go Maintainer, Elastic
Damien is a Principal Engineer at Elastic, working on profiling for OpenTelemetry. He is also a maintainer of the Go SDK.

Felix Geisendörfer

Senior Staff Engineer, Datadog
[Felix Geisendörfer](https://twitter.com/felixge) is a Senior Staff Engineer at Datadog where he works on Continuous Profiling for Go. Before that he was working on manufacturing systems for Apple, herding big PostgreSQL clusters. In his spare time he's usually working on [open source](https://github.com/felixge...
Level 1 | Hall Entrance S10 | Room B

17:00 BST

⚡ Lightning Talk: Unlocking Customer-Centric Observability: A Case Study of OpenTelemetry To Reduce MTTD < 3 Mins - Kokilavani Kathiresan, Intuit
Tuesday April 1, 2025 17:00 - 17:10 BST
At Intuit, we manage over 1500 web/mobile plugins serving our customer needs. However, detecting and quantifying real-time user impact during failures remains a significant challenge. Traditional approaches only highlight large-scale issues and don’t offer insights into the specific business workflows affected.

This talk covers,
1. Leveraging OpenTelemetry to develop a capability called “Failed Customer Interactions” (FCIs)
2. Computing real-time customer impact on business workflows
3. Reducing our Mean Time to Detect (MTTD) to less than 3 minutes, powered by anomaly detection
4. Designing a cost-effective, highly scalable system that handles 130,000 spans per second
Speakers
Kokilavani Kathiresan

Engineering Manager, Intuit
Kokila is an Engineering Manager at Intuit, leading an exceptional team of Observability experts. Specializing in Tracing and Real User Monitoring, her team effortlessly handles millions of spans per second. A proud member of Tech Women at Intuit, sharing her expertise and providing... Read More →
Tuesday April 1, 2025 17:00 - 17:10 BST
Level 1 | Hall Entrance S10 | Room C

17:00 BST

⚡ Lightning Talk: Who Are You? Solving the Container ID Resolution With Cgroupv2 - Vincent Boulineau, Datadog
Tuesday April 1, 2025 17:00 - 17:10 BST
In containerized environments, one key challenge when pushing observability data is identifying the sender so that the data can be enriched, i.e. by attaching attributes, tags, and other metadata derived from the container or Pod.
This problem has commonly been solved by asking the sender to provide its own container ID, which is easily available through /proc/self/cgroup.

When Datadog started evaluating cgroupv2, it quickly became apparent that this approach was no longer usable, prompting us to find new solutions, as the most common workaround (using mountinfo) is not actually reliable.
In this talk we'll explain why the container ID is no longer available (due to private cgroup namespaces) and present two solutions that we worked on.

We will go through the benefits of each one and will share how these solutions could be used in other observability projects, like OpenTelemetry.
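
As a rough illustration of the self-reported container ID approach described above, here is a minimal sketch. It is not Datadog's implementation: the function name, the 64-hex-character ID pattern, and both sample strings are assumptions made for this example. It also shows why the approach breaks down when a private cgroup namespace hides the container's cgroup path.

```python
import re
from typing import Optional

# Assumption: the container id appears as 64 lowercase hex characters,
# the format commonly used by Docker and containerd.
_CONTAINER_ID = re.compile(r"[0-9a-f]{64}")


def container_id_from_cgroup(text: str) -> Optional[str]:
    """Return the first container id found in /proc/self/cgroup content, or None."""
    for line in text.splitlines():
        # Each line has the form "hierarchy-ID:controller-list:cgroup-path".
        parts = line.split(":", 2)
        if len(parts) != 3:
            continue
        match = _CONTAINER_ID.search(parts[2])
        if match:
            return match.group(0)
    return None


# Hypothetical cgroup v1 content: the cgroup path embeds the container id.
V1_SAMPLE = "12:memory:/docker/" + "a" * 64 + "\n"

# Hypothetical cgroup v2 content inside a private cgroup namespace:
# the path is reported relative to the namespace root, so no id is visible.
V2_PRIVATE_NS_SAMPLE = "0::/\n"

if __name__ == "__main__":
    print(container_id_from_cgroup(V1_SAMPLE))
    print(container_id_from_cgroup(V2_PRIVATE_NS_SAMPLE))
```

In a real process the input would come from reading /proc/self/cgroup; the samples are inlined here so the sketch runs anywhere.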
Speakers
Vincent Boulineau

Staff Engineer, Datadog
Vincent began working with Kubernetes in 2016, migrating large applications from on-prem+custom orchestration to cloud+Kubernetes. Vincent is now a Staff Engineer in Datadog’s Container Monitoring group, working on making containerized environments easy to understand, monitor and... Read More →
Tuesday April 1, 2025 17:00 - 17:10 BST
Level 1 | Hall Entrance S10 | Room B

17:15 BST

⚡ Lightning Talk: Empowering OpenTelemetry Users With the OTTL Playground: Simplified Data Transformation and Testing - Edmo Vamerlatti Costa, Elastic
Tuesday April 1, 2025 17:15 - 17:25 BST
The OpenTelemetry Transformation Language (OTTL) is a powerful way to customize telemetry data transformation with the OpenTelemetry Collector, but it can be daunting for new and experienced users alike.
Enter the OTTL Playground (https://ottl.run), a user-friendly tool that lets users experiment with OTTL.

The playground provides a rich interface for users to create, modify, and test statements in real time, making it easier to understand how different configurations affect the transformation of OpenTelemetry data. Users can instantly validate OTTL transformations, from input to output, along with diffs. This allows new users to explore the nuances of OTTL without the risk of disrupting production environments.

This session provides a quick introduction to OTTL and a live demo of how the OTTL Playground can help users create, test, and troubleshoot OTTL statements. It also offers ideas for enhancements and community contributions.
Speakers
Edmo Vamerlatti Costa

Senior Software Engineer, Elastic
Edmo is an experienced software engineer with a passion for emerging technologies. He currently works at Elastic, where he helps develop robust data processing solutions. Proficient in various programming languages, he has a proven track record of designing and deploying scalable... Read More →
Tuesday April 1, 2025 17:15 - 17:25 BST
Level 1 | Hall Entrance S10 | Room C

17:15 BST

⚡ Lightning Talk: From HAR To OpenTelemetry Trace: Redefining Your Observability - Antonio Jimenez Martinez, Cisco ThousandEyes
Tuesday April 1, 2025 17:15 - 17:25 BST
Have you heard about HTTP Archive (HAR) files and wondered how you could leverage this data for deeper insights into your web applications?

Imagine analyzing your page load request data as OpenTelemetry traces in your favorite observability backend. In this talk, we will explore the lessons learned from transforming HAR into an OpenTelemetry trace and streaming it to Jaeger.

You'll gain insights into the process of converting HAR data into spans that follow OpenTelemetry semantic conventions, and learn about the architecture we used to send these traces to any observability backend via the OpenTelemetry Collector. This session is perfect for developers and observability engineers looking to enrich their tracing capabilities with detailed HTTP request data.
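
To make the HAR-to-span idea concrete, here is a minimal sketch of mapping one HAR entry to a span-like dictionary. It is not the speaker's implementation: the function name, the output dictionary shape, and the sample entry are assumptions for illustration. The attribute keys follow the stable OpenTelemetry HTTP semantic conventions, and the input fields (`startedDateTime`, `time`, `request`, `response`) come from the HAR format.

```python
from datetime import datetime, timedelta


def har_entry_to_span(entry: dict) -> dict:
    """Map a single HAR entry to a span-like dict (hypothetical shape)."""
    # HAR timestamps are ISO 8601; normalize a trailing "Z" for older Pythons.
    start = datetime.fromisoformat(entry["startedDateTime"].replace("Z", "+00:00"))
    # HAR "time" is the total elapsed time of the request in milliseconds.
    end = start + timedelta(milliseconds=entry["time"])
    request, response = entry["request"], entry["response"]
    return {
        # Using the HTTP method as the span name keeps cardinality low.
        "name": request["method"],
        "start": start,
        "end": end,
        "attributes": {
            "http.request.method": request["method"],
            "url.full": request["url"],
            "http.response.status_code": response["status"],
        },
    }


# Hypothetical HAR entry, reduced to the fields used above.
SAMPLE_ENTRY = {
    "startedDateTime": "2025-04-01T17:15:00.250+01:00",
    "time": 120,
    "request": {"method": "GET", "url": "https://example.com/index.html"},
    "response": {"status": 200},
}

if __name__ == "__main__":
    print(har_entry_to_span(SAMPLE_ENTRY))
```

A full pipeline would hand such data to an OpenTelemetry SDK tracer with explicit start and end timestamps and export it through the Collector; that part is omitted here.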
Speakers
Antonio Jimenez Martinez

Tech Lead Software Engineer, Cisco ThousandEyes
I am a Tech Lead Software Engineer at Cisco ThousandEyes, specializing in observability to ensure our customers can effectively monitor their products. My recent work involves using OpenTelemetry to stream telemetry data, enhancing network visibility and performance for our clients. I... Read More →
Tuesday April 1, 2025 17:15 - 17:25 BST
Level 1 | Hall Entrance S10 | Room B

17:25 BST

Observability Day | Closing Remarks
Tuesday April 1, 2025 17:25 - 17:30 BST
Level 1 | Hall Entrance S10 | Room B
 
