[1638] in SIPB-AFS-requests
sipb cell outage this afternoon
daemon@ATHENA.MIT.EDU (ghudson@MIT.EDU)
Tue Nov 29 13:38:37 1994
From: ghudson@MIT.EDU
Date: Tue, 29 Nov 1994 13:37:57 +0500
To: sipb-afsreq@MIT.EDU
At about 12:15, I took down rosebud2 to put the 4GB disk on it. While
I was doing this, ronald-ann went comatose. (There is currently a core
file from the file server process, so it may have dumped core.) I have every
reason to believe that I did not disturb ronald-ann's cables while in
the machine room.
I booted ronald-ann after about twenty minutes, and it came back up
and salvaged. After salvaging, client machines were very slow.
During this entire process, client machines never appeared to time
out while trying to access ronald-ann, regardless of whether there
were one or two other AFS servers up, and regardless of whether
ronald-ann was actually running its kernel or at the boot prompt.
I have no explanation for these events, and I find it very disturbing
that unexplainable and (evidently) minor problems with the sipb
servers have such a drastic effect on the Athena environment.