InfoQ Homepage Observability Content on InfoQ
-
From Confusion to Clarity: Advanced Observability Strategies for Media Workflows at Netflix
Naveen Mareddy and Sujana Sooreddy explain how Netflix monitors massive media encoding workflows. They discuss scaling to 1M+ trace spans and using Flink to unlock real-time business insights.
-
From Dashboard Soup to Observability Lasagna: Building Better Layers
Martha Lambert shares the "Observability Lasagna" framework, explaining how her team replaced "dashboard soup" with a user-centric stack to achieve high system reliability for their on-call product.
-
Scaling API Independence: Mocking, Contract Testing & Observability in Large Microservices Environments
Tom Akehurst discusses how API mocking and simulation can solve microservice decoupling and productivity problems at scale, using observability, contract testing, and GenAI as guardrails.
-
Why Observability Matters (More!) with AI Applications
Sally O'Malley shares how to build an AI observability stack with open-source tools (Prometheus, Grafana, OpenTelemetry, Tempo, vLLM/Llama Stack). Learn to track performance, quality and cost signals.
-
High-Resolution Platform Observability
Brian Martin discusses high-resolution platform observability, highlighting how traditional metrics can be misleading and how to gain deeper system insights.
-
Lessons Learned in the Financial Market about Performance and Observability in Front-End Projects
Jessica Felix discusses how to navigate the intricate balance between performance and observability, and the challenges of maintaining equilibrium.
-
Production Comes First - an Outside-In Approach to Building Microservices
Martin Thwaites introduces outside-in testing, how to use Observability techniques in a local development to build applications that are easier to debug locally and run as a first class citizen.
-
Survival Strategies for the Noisy Neighbor Apocalypse
Meenakshi Jindal discusses experience and lessons learned with a case study from the Asset Management Platform at Netflix about how they detected and survived a noisy neighbor.
-
Reliable Architectures through Observability
Kent Quirk shows an overview of observability tools and techniques, and specific recommendations for how to fit observability into their system designs and day-to-day development process.
-
Effective and Efficient Observability with OpenTelemetry
Daniel Gomez Blanco shares his experience leading a large-scale observability initiative at Skyscanner, based on the adoption of OpenTelemetry across hundreds of services.
-
Sprinkling eBPF onto Your Observability
Frederic Branczyk discusses the eBPF's capabilities. Beyond that, Branczyk will demonstrate the real-world use of eBPF in next-generation Observability tooling.
-
Chaos Engineering Observability with Visual Metaphors
Yury Niño Roa introduces a new actor: visual metaphors, discussing visualisation and how to use colours, textures, and shapes to create mental models for observability and chaos engineering.