[20841] in Hotline Meeting
Hockney (afs server) is back in service
daemon@ATHENA.MIT.EDU (salemme@MIT.EDU)
Fri Jan 28 18:31:47 1994
From: salemme@MIT.EDU
Date: Fri, 28 Jan 94 18:30:54 -0500
To: athena-outage@MIT.EDU
All volumes on hockney are finally back in service. The summary is there
were about thirty "backup" volumes on hockney:/vicepd that had to be manually
salvaged, and that after a lot of aggravation we were able to do this without
any loss of data. We do not know the direct cause of today's problems with
hockney, but since today's are similar to failures we've had recently on other
servers, I'm looking into whether these may have been caused by insufficient
space on a partition during the nightly volume cloning/releasing.
Details about today's outage on hockney:
11am: problem became apparent when the tape backup system complained
about offline volumes on hockney d
3:15pm: all partitions back in service after salvaging all, checking
for hardware problems (none found); approx 30 volumes were
offline
6pm: all volumes that needed salvaging were back in service (after
a 'bos salvage' of the read-write volume, 'vos remove' of the
backup volume, and 'vos backup' to create a new backup volume)
There was no detectable loss of data, and at this time we do not believe there
is a hardware problem on hockney. If you have any questions, please send mail
to 'op'. Thanks!
Anne