[8003] in testers
Re: m12-182-? is sad
daemon@ATHENA.MIT.EDU (Evan Broder)
Thu Apr 16 22:54:10 2009
Message-ID: <49E7EF1E.9020501@mit.edu>
Date: Thu, 16 Apr 2009 22:53:18 -0400
From: Evan Broder <broder@MIT.EDU>
MIME-Version: 1.0
To: Adam Seering <aseering@mit.edu>
CC: Mitchell E Berger <mitchb@mit.edu>, testers@mit.edu
In-Reply-To: <49E7DF16.9000103@mit.edu>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
In the future, I'm a lot more interested in keeping the machines in
clusters functional than I am in capturing debugging information; it's a
better idea to reboot than to leave it for a post-mortem, especially in
clusters that the developers don't frequent.
As for your specific failure, how long did you wait before killing X? It
sounds like you might have been running into the standard
AFS-hangs-when-network-goes-away failure to me. I really disbelieve that
you wouldn't see similar failures on Athena 9.
- Evan
Adam Seering wrote:
> Should X hang/die in such a situation, though? My experience with
> Athena 9 is that it generally recovers (enough to log in, at least) in
> a matter of seconds after being unplugged for an extended period.
> Admittedly, though, I've only tried this a few times.
>
> Adam
>
>
> On 4/16/09 9:41 PM, Mitchell E Berger wrote:
>> You didn't try rebooting it? If the network cable has been out for
>> a length of time, a whole bunch of things on the machine are going
>> to have noticed (among them, AFS, zhm, syslogd, aptitude, etc.), and
>> while
>> they may recover given time, assuming that the machine will immediately
>> be fine upon reinserting the cable is generally not accurate.
>>
>> Mitch
>>
>>> Hey,
>>> The DebAthena computer adjacent to M12-182-4 (it doesn't have a
>>> label
>>> and I can't log in to check) is currently sad. Its Ethernet cable was
>>> unplugged when I walked up to it. I plugged it back in, and tried to
>>> log in; the login hung while trying to render my applications bar. I
>>> killed X (ctrl-alt-bksp); the machine is now sitting at a text console.
>>>
>>> Adam
>>>
>>
>