InfoQ Homepage Chaos Conf Content on InfoQ
-
Scaling Culture of Resiliency in the Enterprise
Nate Vogel shares how he grew the data engineering team with an emphasis on building a culture of reliability, discussing processes and tools used.
-
IBM’s Principles of Chaos Engineering
Haytham Elkhoja discusses the process of getting engineers from across to agree on a list of Chaos Engineering principles, adapting existing principles to customer requirements and internal services.
-
Top Five Things You Can Do to Reduce Operational Load
Rachel Obstler discusses the things one can do to make a big difference in reducing operational work from incidents, reducing duplicate efforts, surfacing issues, and improving response times.
-
Self-Service Chaos Engineering: Fitting Gremlin into a DevOps Culture
Doug Campbell shares how they rolled out Gremlin at Grubhub and how they educated and enabled all engineering teams to use it.
-
Culturing Resiliency with Data: a Taxonomy of Outages
Ranjib Dey overviews the categorization of outages that happened at Uber in the past few years based on root cause types.
-
Certainty among the Chaos
Marco Coulter discusses the capabilities of chaos engineering beyond resiliency to support capacity optimization.
-
The More You Know: a Guide to Understanding Your Systems
Tyler Wells shares how Twilio developed a template that enables them to understand their systems better, identify critical metrics to watch, and how to use Chaos Engineering to verify it all.
-
Convergence of Chaos Engineering and Revolutionized Technology Techniques
Yury Niño Roa explores how emerging paradigms can use Chaos Engineering to manage the pains in the path toward providing a solution, showing how Chaos Engineering can benefit from AI.
-
Let Devs Be Devs: Abstracting away Compliance and Reliability to Accelerate Modern Cloud Deployments
Rahul Arya shares how they built a platform to abstract away compliance, make reliability with Chaos Engineering completely self-serve, and enable developers to ship code faster.
-
Can Chaos Coerce Clarity from Compounding Complexity? Certainly
Matt Simons attempts to catch some Black Swans in a system’s architecture and infrastructure, hidden in increased complexity.
-
Lessons from Incident Management and Postmortems at Atlassian
Jim Severino shares what worked (and didn't work) in incident management and post-mortems for Atlassian.
-
Automating Chaos Attacks
Daniel Albuquerque and Nikos Katirtzis show how to run attacks in both manual and automated ways.