Building Disaster Recovery with AWS Elastic Disaster Recovery - Continuous Replication and Recovery Testing

Continuously replicate on-premises servers to AWS and validate recovery procedures with recovery drills. Learn the end-to-end workflow through failback.

Overview of Elastic Disaster Recovery

Elastic Disaster Recovery (DRS) is a service that continuously replicates on-premises or other cloud servers to AWS for rapid recovery during disasters. After installing an agent on the source server, block-level changes are continuously replicated to a staging area in AWS.

Recovery Drills and Failover

A recovery drill is a test that launches EC2 instances from replicated data and verifies application functionality. It can be executed without affecting production replication, and provides actual RTO measurements. Failover follows the same procedure as a drill, launching EC2 instances and switching DNS to direct production traffic to AWS. Failback is the operation of returning data from AWS to the original on-premises environment, with DRS managing the reverse replication.

Network Design and Recovery Automation

DRS replication servers are placed in a staging subnet to receive data from source servers. Instances launched during recovery are placed in a separate subnet (recovery subnet), with production network settings defined in advance. Launch templates configure instance type, security groups, subnets, and IAM roles to minimize manual work during recovery. Post-launch actions automatically execute scripts after instance startup, automating DNS switching and application configuration changes. For understanding DRP design best practices, related books (Amazon) are a helpful reference.

DRS Pricing

DRS pricing consists of EC2 instances and EBS volumes for replication servers. Replication servers run on small instances such as t3.small, costing approximately $15-20 per server per month. Instances launched during recovery drills or failover are billed only for the time they run. EBS snapshot storage costs (approximately $0.05 per GB per month) are incurred based on data volume. Compared to traditional DR solutions (always-on standby environments), costs are significantly reduced.

Summary

Elastic Disaster Recovery is a DR service that reduces RPO to seconds through continuous replication and enables recovery in minutes. Pre-define instance settings for recovery with launch templates and automate DNS switching with post-launch actions. Validate RTO/RPO targets through regular recovery drills, and achieve significant cost savings compared to traditional always-on standby environments.