InfoQ Homepage Presentations Incident Management at Netflix Velocity
Incident Management at Netflix Velocity
Summary
Dave Hahn talks about how Netflix engineering teams think about failure, the tools, techniques, and training they use to shorten the inevitable failures of their systems and impacts to their customers. He explains why they believe chaos is their friend, failure is guaranteed, and why Netflix is better off having both.
Bio
Dave Hahn is a Senior SRE in the Cloud Operations and Reliability Engineering organization at Netflix. He has many years of experience in distributed systems, failures, and mis-attribution of complex problems to human error.
About the conference
Software is changing the world. QCon empowers software development by facilitating the spread of knowledge and innovation in the developer community. A practitioner-driven conference, QCon is designed for technical team leads, architects, engineering directors, and project managers who influence innovation in their teams.