When your data centre is running 24/7, the operation uses three shift supervisors. Making sure theyโre on the same page is critical to smooth operations and to prevent surprises. Whatโs the best way to hand over the wheel? A little bit of MBWA โ management by walking around.
โYou canโt manage a data centre from a cubicle,โ says Darin Stahl, research lead with Info-Tech Research Groupโs data centre practice. A former data centre supervisor himself, Stahl says looking at monitoring tools isnโt enough. โI want to walk the floor,โ he says.
Stahl and the data centre group recently published the Data Centre Shift Turnover Checklist, a template companies can customize for their own operations to make the handover smoother. โItโs meant to facilitate an orderly turnover from shift to shift,โ he says.
The checklist covers three broad areas. First, โwe wanted to hit the perimeter and the facilities,โ says Stahl, checking the status of HVAC, fire protection and power and UPS systems. The second section deals with pending and completed changes for the shift. โI want to know what was completed on the last shift, so if 45 minutes in something goes awry, I know where to start,โ he says.
โIโd rather be on top of the situation from the get-go.โ
The third section deals with about a dozen operational activities โ backups, tape deliveries, server and network status, etc. And the checklist uses a much-overlooked technology โ pen and paper. While the group discussed doing the checklist online, the list requires a lot of visual inspection inside and outside the building, which makes an online version less than ideal, he says.
Keeping on top of ops
Stahl advocates a formal shift turnover meeting between the supervisors involved โ their hours should overlap to allow this. Depending on variables like whether there are tape or print rooms in the facility and how much outdoor inspection is required, the checklist should take less than 45 minutes to go through, he says.
The walkaround ensures that the oncoming supervisor knows, for example, whether there are vendors inhouse working on equipment, whether there are any potential health and safety issues, and whether someone has left a door unsecured sneaking out for a cigarette. โItโs amazing what some people will do,โ he says.
Itโs also a good practice from an audit perspective, Stahl says. Thereโs a written record of the state of the operation every eight hours. Not that data centres need hang onto the checklists for seven years like financial data, but โitโs demonstrating that you do what you say you do.โ
Gary Baker, the partner leading Deloitteโs enterprise risk practice in Toronto, agrees that a โstructured conversation toolโ can help with continuity.
โIโm not a big fan of checklists generally,โ Baker says. โAs an auditor, with a checklist, you forget to think.
โThey are not a substitute for critical thinking.โ
But from the perspective of a data centre supervisor, he says, itโs a very useful tool.
โOne of the key elements of IT is integrity of operations,โ he says. Once the status of facilities and operations is documents, it becomes something auditors can use as evidence of operational integrity. But the primary advantage is having the exchange itself.
โJust the simple act of writing it down seems to crystallize what it is and what was said,โ Baker says.
In terms of retention, Baker says, companies should hold onto the documents as long as thereโs business value, not just to satisfy auditors. That would be a window of โmonths, rather than years.โ
While companies can make their own checklists from scratch, โthis kind of gives you a jumpstart,โ Stahl says. In the shoes of a data centre supervisor, Stahl says, โIโd rather edit than create.โ