Disaster Recovery Content on InfoQ

News

Cloud

AWS Announced General Availability of Elastic Disaster Recovery

Recently AWS announced the general availability (GA) of AWS Elastic Disaster Recovery (AWS DRS). With this new service, organizations can minimize downtime and data loss through the fast, reliable recovery of on-premises and cloud-based applications.

Steef-Jan Wiggers
on Nov 29, 2021
Cloud

Amazon Introduces AWS Resilience Hub to Monitor and Improve RPO and RTO

Amazon recently announced the availability of AWS Resilience Hub, a service designed to help customers define, measure, and manage the resilience of their applications on the cloud.

Renato Losio
on Nov 17, 2021
Cloud

AWS Releases Amazon Route 53 Application Recovery Controller into General Availability

Recently, AWS announced the general availability (GA) of Amazon Route 53 Application Recovery Controller, a new set of capabilities in Amazon Route 53. These capabilities make it easier for customers to continuously monitor their applications’ ability to recover from failures and to control recovery across AWS Regions, Availability Zones, and on-premises infrastructure.

Steef-Jan Wiggers
on Aug 10, 2021
Cloud

Microsoft Announces the Public Preview of Disk Pool for Azure VMware Solution

Microsoft recently announced the preview of disk pool, which enables Azure Disk Storage as a persistent storage option for Azure VMware Solution, a vSAN hyper-converged vSphere cluster. With this persistent storage option, customers have another choice for running VMware workloads on Azure.

Steef-Jan Wiggers
on Jul 20, 2021
Architecture & Design

Uber Implements Disaster Recovery for Multi-Region Kafka

In a recent blog post, Uber engineers highlight how they use a replication platform to implement disaster recovery at scale with a multi-region Kafka deployment. Uber has a large deployment of Apache Kafka, processing trillions of messages and multiple petabytes of data per day. Uber's engineers provided business resilience and continuity in the face of natural and human-made disasters.

Eran Stiller
on Jan 04, 2021
Cloud

Amazon Introduces a New Feature for ElastiCache for Redis: Global Datastore

Recently Amazon announced Global Datastore, a new feature of Amazon ElastiCache for Redis that provides fully managed, fast, reliable and secure cross-region replication.

Steef-Jan Wiggers
on Mar 28, 2020
DevOps

Summary of Chaos Community Day v4.0: Resilience, Observability, and Gamedays

Earlier in the year, the fourth edition of “Chaos Community Day” was held at Work-Bench in New York City. Key takeaways from the day included: the topic of chaos engineering draws heavily from other domains, which software engineers can also learn from; understanding systems, and communicating and exchanging the related mental models, is vital for establishing resilience.

Daniel Bryant
on Jun 07, 2019
DevOps

Building Production-Ready Applications: Michael Kehoe Shares Lessons Learned from LinkedIn

At QCon San Francisco, Michael Kehoe presented “Building Production-Ready Applications”. Drawing on his experience with site reliability engineering (SRE), he introduced the tenets of “production-readiness” that all engineers across the organisation should focus on: stability and reliability; scalability and performance; fault tolerance and disaster recovery; monitoring; and documentation.

Daniel Bryant
on Nov 12, 2018
DevOps

Why the World Needs More Resilient Systems: Tammy Butow Discusses Chaos Engineering at QCon London

At QCon London, Tammy Butow explained why the world needs more resilient systems, and how this can be achieved with the practice of chaos engineering. Three primary prerequisites for chaos engineering were provided -- high severity “SEV” incident management, monitoring, and measuring the impact -- and a series of guidelines, tools and practices were presented.

Daniel Bryant
on Mar 18, 2018
Cloud

Microsoft Introduces Azure Availability Zones, Completes MAREA Transatlantic Connection

In a recent blog post, Microsoft announced the expansion of High Availability (HA) and resiliency options for customers. The update comes in the form of Azure Availability Zones, which increase the availability of certain Azure services within a specific region by providing complete redundancy and isolation of the infrastructure. Azure Availability Zones include a financially-backed SLA of 99.99%.

Kent Weare
on Sep 29, 2017
Cloud

Public Preview of Azure IaaS Disaster Recovery Announced

In a recent announcement, Microsoft released details about its public preview for Infrastructure-as-a-Service (IaaS) disaster recovery using Azure Site Recovery (ASR). Using the ASR service, organizations can protect IaaS workloads in one Azure region and have them replicated to a different Azure region within a geographical cluster.

Kent Weare
on Aug 07, 2017
Development

GitLab.com Postmortem Digs into Root Causes of 18 Hour Outage

GitLab's postmortem into the root cause of their 18-hour site outage is a detailed look at how the incident began, how it got worse before it got better, and how they plan to learn from the mistakes and improve the service.

David Iffland
on Feb 21, 2017
Development

BitBucket Introduces Disaster Recovery and Merge Strategies

Recently released BitBucket Server and BitBucket Data Center 4.9 bring the possibility of defining a strategy for disaster recovery, setting a preferred merge strategy, and more.

Sergio De Simone
on Sep 11, 2016
Too Big To Fail: Lessons Learnt from Google and HealthCare.gov

At QCon New York 2015, Nori Heikkinen shared stories of failure and lessons learnt during her time working as a site reliability engineer (SRE) at Google and HealthCare.gov. The discussion of managing large-scale outages included recommendations for preparation, response, analysis and prevention.

Daniel Bryant
on Jun 14, 2015
CenturyLink Acquires DataGardens to Offer DR as a Service

CenturyLink, one of the largest telecommunications and cloud providers, has announced the acquisition of DataGardens, a Canada-based disaster recovery software company.

Janakiram MSV
on Dec 11, 2014
