InfoQ Homepage Resilience Content on InfoQ
-
Using Chaos to Build Resilient Systems
Tammy Butow explains how to build resilient systems by focusing on the detection, mitigation, resolution and prevention of incidents.
-
Properties of Chaos
Nathan Aschbacher talks about how and why chaos engineering is being applied to autonomous vehicle safety, how property-based testing principles can influence chaos engineering goals, and more.
-
Heretical Resilience: To Repair is Human
Ryn Daniels describes the “Apache SNAFU”, shares their experiences as the instigator of that snafu and walks through the lessons that can be learned from such an event.
-
Chaos Engineering: Why the World Needs More Resilient Systems
Tammy Butow shares her experiences using chaos engineering to build resilient systems, when they couldn’t build their systems from scratch.
-
Pragmatic Resiliency: Super 6 & Sky Bet Evolution
Michael Maibaum talks about the reality of adapting a complex set of interacting, highly coupled applications to make them more resilient and better able to cope with failure.
-
Incident Management at Netflix Velocity
Dave Hahn talks about how Netflix engineering teams think about failure, why they believe chaos is their friend, failure is guaranteed, and why Netflix is better off having both.
-
Best Practices Building Resilient Systems
Pablo Jensen focuses on best practices and lessons learned in building resilient systems.
-
The Art of Chaos Engineering Panel
The panelists answer audience questions on the emerging field of chaos engineering including what chaos engineering is, how you get started with it, and pitfalls of adoption.
-
Chaos Architecture
Adrian Cockcroft takes a look at best practices and challenges in getting to a chaos architecture mindset.
-
Chaos Engineering on a Budget
Heather Nakama tells the story of implementing chaos testing on a small product, and how several small and targeted early investments in chaos engineering saved time and effort.
-
Chaos: The Last Stand against Our Robot Overlords
Nathan Äschbacher talks about Chaos Engineering and how to shift towards working with chaos instead of against it, in order to build safe, reliable, and increasingly deterministic complex systems.
-
Expedia’s Journey toward Site Resiliency
Sahar Samiei and Willie Wheeler share Expedia’s resiliency journey, starting with resiliency as an afterthought and progressing toward resiliency as a first-class concern.