Example Disaster Recovery Plan

Index

  1. Introduction
  2. Disaster Threat Analysis
  3. Organizational Responsibilities
  4. Disaster Avoidance
  5. Disaster Preparation
  6. Disaster Recovery
  7. Appendices

Introduction

Definition

Objectives

Definition

Reference

Disaster Threat Analysis

A disaster is any incident or event that results in a major (multi-day) interruption of operations at one or more of the contact or data centers. For disruptions in service that affect only a portion of systems or operations at any one location, a subset of the full recovery procedures will likely be used to restore normal operations. A catastrophic disaster, however, would render the centers incapable of conducting critical functions for an extended period of time. The impact of such a disruption would require that notification and periodic updates be circulated throughout the system, until normal operations were restored. The appropriate authorities, depending on the nature of the disaster (fire, flood, etc.), would also have to be contacted. Personnel at each center, organized into emergency management teams, would coordinate the initial response to the disaster, assess the damage, and determine the extent to which all or part of the disaster recovery plan should be deployed. Designated team members would have responsibility for maintaining the necessary sequence of notifications to senior management, to users, to public emergency personnel, and to utility contractors, as appropriate and as the need arises.

Disaster Scope

Disaster Levels

This disaster recovery plan will be invoked if one of the following disasters occur:

  1. Limited Disaster
    A limited disaster is characterized by limited or isolated damage to a part of a contact or data center that is sufficient that has disabled or will disable it, partially or completely, for a period of 24 hours.
  2. Moderate Disaster
    A moderate disaster is characterized by severe damage to the entire contact or data center, thereby temporarily prohibiting the performance of all user support or operations tasks. It requires either temporarily allocation of the workload to other existing sites or else temporarily transfer to a hot-backup site until the facility can be repaired. However, no cold backup site is required because of the limited time required to put the affected site into full operation.
  3. Catastrophic Disaster
    A catastrophic disaster is characterized by complete destruction of a contact or data center. Because the center is a total loss and needs to be completely rebuilt or replaced, it requires either temporarily allocation of the workload to other existing sites or else temporarily transfer to either a hot or cold-backup site.

Incidents Requiring Plan Invocation

Natural Disasters

Man-Made Disasters

Organizational Responsibilities

Disaster Avoidance

Disaster Preparation

To properly prepare for the occurance of disasters, the following steps will be taken:

Disaster Recovery

Restoration of Normal Operations

Once the back-up content and/or data sites are functioning on a full production schedule, priority return to the permanent centers. Initial assessments of damage would be refined, and reconstruction plans developed. If major facilities/site damage had been incurred, the full reconstruction plans would extend well beyond the operations staff. However, once the time schedule for facilities reconstruction were known, at least approximately, plans could be made for permanent replacement equipment. Unless arrangements had been made to continue long-term lease (or purchase) of the temporary replacement equipment, this undertaking would entail issuance of a competitive solicitation for the replacement hardware. Award/delivery would have to be timed to coincide with availability of reconstructed centers. With the permanent centers restored, operations are transferred from the temporary facility by following the same sequence of steps as were used to set up the back-up site. The re-establishment of normal operations should proceed under far less duress than the establishment of emergency operations, and the logs kept during disaster recovery should help highlight and troubleshoot/resolve any problems that may have arisen during earlier system transfers.

Disaster Recovery Checklist

Immediately

When a disaster occurs and time and safety permits, the local management team will:

When a disaster occurs and time and safety permits, the security personnel will collaborate with the management team to:

When a disaster occurs and time and safety permits, the operations team will:

Within 2 Hours

If possible, the primary (or secondary) disaster recovery coordinator will:

Within 4 Hours

The emergency response team will:

Within 8 Hours

The emergency response team will:

Within 1 Day

If replacement equipment is not yet available, the disaster recovery coordinator, in concert with operations and engineering team captains will: initiate an alternate production schedule to share the resources of the remaining site to support operational requirements for both sites test and verify communications capabilities

Within 2 Days

The disaster recovery team will:

Within 5 Days

The disaster recovery team will:

For each affected contact and data center and upon delivery of replacement equipment, the environments team will:

For each affected contact and data center, the operations team will:

The disaster recovery team will:

re-assess status of equipment ( necessity of bidding permanent replacement equipment, while continuing El Camino lease, etc.) re-assess any other physical/facilities requirements before considering restoration complete confirm status of hardware/software with vendors/service-providers

Appendices

A. Contact Information

B. Team Membership

C. Emergency Supply Sources

D. Support Agreements

E. Forms

F. Glossary

G. Major Issues

H. TBDs

J. Assumptions

This plan is based on the following assumptions:

LETTER FROM MANAGEMENT ENDORSING PROGRAM
MISSION STATEMENT."

CONTENTS:
SECTION I - GENERAL POLICIES
1.1 Introduction
1.1.1 Background
1.1.2 Scope and Objectives
1.2 Plan General
1.3 Identification of Key Personnel
1.4 Initial Response and Recovery Actions
1.5 Responsibilities
1.5.1 General Responsibilities
1.5.2 Specific Responsibilities
1.6 Recovery/Restoration Activities
1.6.1 ACD Damage Assessment Activities
1.6.2 ACD Recovery Activities
1.6.3 ACD Salvage Activities
1.6.4 ACD Restoration Activities
1.6.5 Supporting Checklists
1.7 Plan Documentation
1.8 Plan Distribution
1.8.1 Distribution List
1.8.2 Distribution Procedure
1.9 Plan Testing
1.10 Plan Maintenance
1.11 Disaster Scenarios
1.11.1 Types of Disasters
1.11.2 Where Disasters Occur
1.12 Critical Call Center Assets
1.13 Emergency Declaration Guidelines
1.13.1 Five Basic Levels of a Disaster
1.13.2 Decision Criteria
1.14 Recovery and Restoration Time Frames
1.14.1 1-6 Hours After Being Notified
1.14.2 6-12 Hours After Being Notified
1.14.3 12-24 Hours After Being Notified
1.14.4 24+ Hours After Being Notified
1.15 Plan Format
1.16 Budgeting/Funding
SECTION II - CONTINGENCY AND RECOVERY PLANS
2.1 Baseline Plan Organization and Structure
2.2 Introduction
2.3 Pre-Planning Activities
2.4 Plan Distribution
2.5 Security and Disaster Prevention
2.6 Disaster Preparedness/Security
2.7 Disaster Recovery Action Plans
2.8 Training Activities
2.9 Plan Documentation
2.10 Plan Implementation
2.11 Plan Testing
2.12 Plan Maintenance
2.13 Plan Training
2.14 Summary of Activities - Disaster Response
2.15 Summary of Activities - Disaster Recovery
2.16 Summary of Activities - Disaster Restoration
2.17 Detailed Activities - Disaster Response
2.18 Detailed Activities - Disaster Recovery
2.19 Detailed Activities - Disaster Restoration
2.20 ACD Recovery Considerations - General
2.21 Risk Analysis - External Risks
2.22 Risk Analysis - Internal Risks
2.23 Risk Analysis - Security
2.24 ACD Hardware Asset Recovery Activities
2.25 Site Recovery Plan

SECTION III - PLAN MAINTENANCE, TESTING, MISCELLANEOUS
3.1 Plan Maintenance
3.2 Plan Testing
3.3 Plan Documentation
3.4 Plan Distribution
3.5 Training
3.6 Service Prioritization