[30434] in Hotline Meeting
Hardware fault with BULB.MIT.EDU in W-91
daemon@ATHENA.MIT.EDU (Bill Cattey)
Wed Oct 18 12:04:42 1995
Date: Wed, 18 Oct 1995 12:04:45 -0400 (EDT)
From: Bill Cattey <wdc@MIT.EDU>
To: hotline@MIT.EDU
Cc: kelley@MIT.EDU, fcs@MIT.EDU
I have an important job to run on my server host BULB.MIT.EDU that
cannot run to completion because, apparently, half the memory in the
machine went offline on Oct 17 at 10:10 AM. The machine crashed, and
brought itself back up with very little indication that anything was
wrong. IF one types "errpt" (the IBM-unique command that means, "tell
me what errors happened that you never bothered syslogging) one gets
(among other things):
bulb# errpt
ERROR_ID TIMESTAMP T CL RESOURCE_NAME ERROR_DESCRIPTION
8EA094FF 1017101295 T H sysplanar0 Checkstop
2F24221A 1017101095 T H ent0 ADAPTER ERROR
77E0148A 1017101095 P H memory Memory failure
9DBCFDEE 1017101295 T O errdemon Error logging turned on
ABB81CD5 1016170195 T H ent0 COMMUNICATION PROTOCOL ERROR
A service call needs to be scheduled as soon as possible to bring the
full 32 meg of memory back online.
The IBM service person should be able to get further detail by using the
obscure options to errpt, and determine a proper fix.
Please advise me via email as SOON as this service call is scheduled.
-wdc