InfoQ Homepage Observability Content on InfoQ
-
Observability for Speed & Flow
Jessica Kerr considers that we should be looking at the software as part of the team, and observability in the software becomes an asset to organizing teams.
-
The Mechanics of Metrics: Aggregation across Dimensions
Erin Schnabel discusses how application metrics align with other observability and monitoring methods, from profiling to tracing, and the limits of aggregation.
-
Panel: Observability and Understandability
Jason Yee, John Egan, and Ben Sigelman discuss their approaches and preferred methods to get impactful results in incident management, distributed tracing, and chaos engineering.
-
Resources & Transactions: a Fundamental Duality in Observability
Ben Sigelman explores resources and transactions, both theoretically and through some real-world examples, to develop an intuition for how to understand a system more completely.
-
Reduce ‘Unknown Unknowns’ across Your CI/CD Pipeline
The panelists discuss monitoring and observability methods that DevOps and SRE teams can employ to balance change and uncertainty without the need to constantly reconfigure monitoring systems.
-
Embracing Observability in Distributed Systems
Michael Hausenblas discusses good practices and current developments around CNCF open source projects and specifications including OpenTelemetry and FluentBit.
-
True Observability Needs High-Cardinality
Pierre Vincent discusses how high-cardinality observability helps the exploration and debugging power required to understand the reality of a production system.
-
Solving Mysteries Faster with Observability
Elizabeth Carretto discusses observability at Netflix and where and how their internal tool, Edgar, comes into play.
-
InfoQ Live Roundtable: Observability Patterns for Distributed Systems
The panelists explore how a sound observability strategy can help mitigate operational costs and avoid common pitfalls in monitoring distributed systems.
-
Observability in the Development Process: Not Just for Ops Anymore
Christine Yen explores what observability looks like in practice, so that production stops being just where the development code runs into issues.
-
Architectures That Scale Deep - Regaining Control in Deep Systems
Ben Sigelman talks about "Deep Systems" and their common properties: they are layered, distributed, concurrent, multi-tenant, change continuously, and are hard to manage with conventional tools.
-
Yes, I Test in Production (And So Do You)
Charity Majors talks about testing in production and the tools and principles of canarying software and gaining confidence in a build, also instrumentation and observability .