The WoodWing Cloud platform has built-in monitoring, failover, and backup strategies and uses redundant storage and multiple Availability Zones to quickly recover from failures within the region.
Info: An Availability Zone consists of one or more discrete data centers with redundant power, networking, and connectivity within the region. All zones in a region are interconnected with high-bandwidth, low-latency networking, over fully redundant, dedicated metro fiber providing high-throughput, low-latency networking between zones. All traffic between zones is encrypted. Availability Zones are physically separated by a meaningful distance, many kilometers, from any other zone, although all are within 100 km (60 miles) of each other.
In case of a complete regional failure it is however not ensured that WoodWing has direct access to your data in the production region, and if so, access to this data needs to be moved to the disaster recovery region which - depending on the size and nature of the regional failure - can take a considerable amount of time.
To overcome this problem, the backups are duplicated to the disaster recovery region and the Assets filestore is replicated to the disaster recovery region in near-real time.
Backup
A daily backup (Recovery Point Objective, RPO) of the customer data is created with a retention period of 30 days and stored on highly durability S3 storage (designed for durability of 99.999999999%).
The backup is used to restore the customer's environment in a geographically distinct recovery region in the very unlikely event of a complete regional failure.
Non-replicated storage
In some cases it might be desirable that the data is stored in the primary region only, for example because of the sensitive nature of your content. For such cases, non-replicated storage is available.
Info: The backups and Assets file storage location (filestore) are not replicated.
Your data is still stored on high-durability storage but WoodWing might not have immediate access to it when a full regional failure occurs. The Recovery Time Objective (RTO) is therefore best effort only.
Disaster recovery process in the case of a full regional failure
The disaster recovery process follows the following steps
- Full regional failure occurs, which is detected by the monitoring or via the critical support procedure
- WoodWing investigates the problem and informs the partner and customers affected about the advised action plan
- The disaster recovery process is started up to 8 hours after the regional failure
- The Recovery Time Objective (RTO) depends on the size of the customer data.
The fail-back scenario is similar to the fail-over scenario, the timing is coordinated with the partner & customer.
Comment
Do you have corrections or additional information about this article? Leave a comment! Do you have a question about what is described in this article? Please contact Support.
0 comments
Please sign in to leave a comment.