Emergency Response Procedures, Emergency Drills and Incident Management
Energy management firm Schneider Electric has mapped out emergency response considerations for data centres.
The firm warns that data centres must be prepared for exceptional events or emergencies and have plans in place to maintain continuous operation. For example it says that despite best practices being observed during site selection, extreme weather events such as Hurricane Sandy have shown that facilities may at any time be exposed to risks above and beyond hazards those from which they are normally protected.
Emergency Response Procedures include: operational measures to deploy in the event of a crisis that safely isolate faults and restore service; a crisis management plan (CMP) which is a detailed step-by-step procedure to follow when emergency strikes; and Escalation Procedures, which are documented, prioritised contact lists outlining internal contact requirements for specific situations related to data centre operations.
Emergency Drills, which should be developed in advance and scheduled to occur regularly, guide operators through what should be done to counter the top 10 identifiable operational risks.
Incident Management includes: incident notification, a process to inform the appropriate people about any safety of mission-critical event; incident identification and reporting ensures that all incidents are reported as soon as the situation is stabilised and a brief summary of the incident prepared for circulation to appropriate staff; and failure analysis, which is a comprehensive program to determine the root cause of any incident that has resulted in system downtime or personal injury, or had the likelihood of doing so.
Communication and speed of response to emergency situations can greatly reduce the risk to operations caused by incidents.
The firm’s White Paper’s recommendations focus strongly on the need for preparedness to anticipate problems before they arise, take appropriate speedy action when they do and communicate the lessons learned from each incident so that the worst effects are not repeated.
To effectively respond to different kinds of risks and crises in data centres, organisations must act quickly and know what to expect in unexpected situation. Proper operational methodology will avoid common mistakes and a good Emergency Preparedness plan, encompassing people processes and systems, will help operators run their facilities in a more predictable and effective way.
White Paper, #217 entitled "How to Prepare and Respond to Data Center Emergencies" is available for free download from the Schneider Electric website.