InfoQ Homepage Resilience Content on InfoQ
-
Designing Services for Resilience: Netflix Lessons
Nora Jones talks about designing microservices for enabling resiliency testing and the moving parts we need to consider when designing them from the get go, and along their lifetime.
-
“GameDay” – Achieving Resilience through Chaos Engineering
Pete Cohen and Matt Fellows discuss GameDay and chaos engineering, what they are, and how they were done successfully by some organizations.
-
Confusion in the Land of the Serverless
Sam Newman introduces serverless computing, discussing how security, resilience, patterns (circuit breaker), vendor lock-in, and microservices are addressed with this technology.
-
State of Chaos Engineering
Bruce Wong discusses the current state of Chaos Engineering, emerging patterns of success, and the future opportunity at hand.
-
Real-Time Metrics Computations to Power Video Quality of Delivery Resilience
Sudhi Nada shows how Conviva deals with video delivery issues, illustrating the business impact of video delivery failures and how to build resiliency into such systems based on real-time metrics.
-
Choose Your Own Adventure: Chaos Engineering
Nora Jones talks about different experiences on "Chaos Adventures" including both successes and failures introducing Chaos in an organization.
-
Drinking from the Elixir Fountain of Resilience
Jearvon Dharrie talks about the factors that contribute to Elixir's perfect match for fault tolerance and resiliency, besides the Open Telecom Platform (OTP).
-
Nonconformist Resilience: DB-backed Job Queues
John Mileham presents how they use, deploy Delayed::Job (a database-backed job queue) at Betterment for its transactional enqueue semantics, safe retry with exponential backoff, and its storage model.
-
Resilient Functional Service Design
Uwe Friedrichsen explores how much functional design affects the overall robustness of a solution to learn how to deliver a better "resilient functional service design".
-
The Walking Dead - A Survival Guide to Resilient Reactive Applications
Michael Nitschinger discusses how to build event-driven applications that are resilient from the bottom up, allowing to deal with remote services that are failing, slow or misbehaving.
-
When Streams Fail: Kafka Off the Shore
Anton Gorshkov discusses how to evaluate and architect a resilient streaming platform, focusing on Kafka and Spark streaming and sharing his experience of using them to process financial transactions.
-
From Microliths to Microsystems
Jonas Boner explores microservices from first principles, distilling their essence and putting them in their true context: distributed systems based on reactive principles.