[164365] in North American Network Operators' Group

home help back first fref pref prev next nref lref last post

Re: What to expect after a cooling failure

daemon@ATHENA.MIT.EDU (Mikael Abrahamsson)
Wed Jul 10 01:47:50 2013

Date: Wed, 10 Jul 2013 07:47:20 +0200 (CEST)
From: Mikael Abrahamsson <swmike@swm.pp.se>
To: NANOG mailing list <nanog@nanog.org>
In-Reply-To: <1373426894.69598008@apps.rackspace.com>
Errors-To: nanog-bounces+nanog.discuss=bloom-picayune.mit.edu@nanog.org

On Tue, 9 Jul 2013, Erik Levinson wrote:

> For those who have gone through such events in the past, what can one 
> expect in terms of long-term impact...should we expect some premature 
> component failures? Does anyone have any stats to share?

I have experience with a different kind of event that might be of interest 
to a wider audience.

When the fire suppression system went off in a site, we had a lot of 
instant harddrive failures. I don't have any numbers, but let's say 5-10% 
of all hdd:s in the room died more or less instantly. Supposedly this was 
because of the air pressure shock when the inert fire suppression gas was 
released and the vents weren't big enough to release the overpressurised 
air outside.

I did some research and there are forum posts etc about these kinds of 
events happening in other places.

So, takeaway from this was RAID is an uptime tool, not a substitute for 
backups, and also, get a qualified ventilation/fire supression systems 
engineer to inspect your sites from this aspect.

-- 
Mikael Abrahamsson    email: swmike@swm.pp.se


home help back first fref pref prev next nref lref last post