[3520] in Athena Bugs

home help back first fref pref prev next nref lref last post

Continuing problems

daemon@ATHENA.MIT.EDU (irbusch@ATHENA.MIT.EDU)
Sat Oct 28 14:13:53 1989

From: irbusch@ATHENA.MIT.EDU
To: hotline@ATHENA.MIT.EDU
Cc: bugs@ATHENA.MIT.EDU, nhdoerry@ATHENA.MIT.EDU, irbusch@ATHENA.MIT.EDU
Date: Sat, 28 Oct 89 14:13:26 EDT

Howdy!

I am resending this message to reemphasize the continueing problems we are
having up in 7-321.  In addition, we are continuing to have the problem that
Norbert told you about earlier.  This is now occuring on several terminals:

	-1,2,3,4,5,8

Please E-mail back to either irbusch or nhdoerry.

Thanks

		-Ian

Received: by ATHENA-PO-2.MIT.EDU (5.45/4.7) id AA26298; Wed, 18 Oct 89 17:54:06 EDT
Received: from M7-321-1.MIT.EDU by ATHENA.MIT.EDU with SMTP
	id AA23996; Wed, 18 Oct 89 17:53:17 EDT
From: irbusch@ATHENA.MIT.EDU
Received: by M7-321-1.MIT.EDU (5.61/4.7) id AA00439; Wed, 18 Oct 89 17:52:59 -0400
Message-Id: <8910182152.AA00439@M7-321-1.MIT.EDU>
To: hotline@ATHENA.MIT.EDU
Cc: irbusch@ATHENA.MIT.EDU, nhdoerry@ATHENA.MIT.EDU, cc@ATHENA.MIT.EDU
Subject: Problems with terminals 1, 6, and 7
Date: Wed, 18 Oct 89 17:52:57 EDT


Howdy,

We continue to be having problems with terminals number 6 and 7 rebooting on a whim.  Next, people have been reporting continuing problems on terminal #5.  In addition to this problem (all the more aggrievating considering that digital just did a service call to deal with this very fact),  We are starting to have a variety of problems with terminl #1.  I have enclosed exerpts of the /usr/messages file from this machine.  At this time in the year usage of the terminals is increasing and we are continuing to lack more than 50% of our terminals running at full capability.

Oct  6 15:03:31 M7-321-1 xlogin: Couldn't open pty, looking for a good pty/tty
Oct  6 15:09:22 M7-321-1 xlogin: I/O error
Oct  6 15:09:22 M7-321-1 xlogin: Couldn't open pty, looking for a good pty/tty
Oct  6 17:32:50 M7-321-1 xlogin: I/O error
Oct  6 17:32:50 M7-321-1 xlogin: Couldn't open pty, looking for a good pty/tty
Oct  6 21:27:54 M7-321-1 vmunix: afs: Lost contact with server 2005012
Oct  6 22:01:36 M7-321-1 vmunix: afs: Lost contact with server 2b004812
Oct  8 14:45:23 M7-321-1 vmunix: Athena 4.3BSD UNIX #6-3-20 (probe@paris:VS2) Tu
e Jul 18 15:19:14 EDT 1989
Oct  8 14:45:23 M7-321-1 vmunix: real mem  = 6287360 (0x5ff000)
Oct  8 14:45:23 M7-321-1 vmunix: avail mem = 4924416 (0x4b2400)
Oct  8 14:45:24 M7-321-1 vmunix: using 204 buffers containing 418816 bytes of me
mory
Oct  8 14:45:24 M7-321-1 vmunix: uba0 at tr0
Oct  8 14:45:25 M7-321-1 vmunix: uda0 at uba0 csr 172150 vec 774, ipl 15
Oct  8 14:45:25 M7-321-1 vmunix: ra0 at uda0 slave 0
Oct  8 14:45:25 M7-321-1 vmunix: ra1 at uda0 slave 1
Oct  8 14:45:25 M7-321-1 vmunix: ra2 at uda0 slave 2
Oct  8 14:45:25 M7-321-1 vmunix: qe0 at uba0 csr 174440 vec 770, ipl 17
Oct  8 14:45:25 M7-321-1 vmunix: qe0: hardware address 08:00:2b:03:05:94
Oct  8 14:45:25 M7-321-1 vmunix: dz0 at uba0 csr 160100 vec 300, ipl 17
Oct  8 14:45:25 M7-321-1 vmunix: qv0 at uba0 csr 177200 vec 360, ipl 17
Oct  8 14:45:26 M7-321-1 vmunix: Starting afs cache scan...found 45 cache files.
Oct  8 14:45:26 M7-321-1 vmunix: afs: Lost contact with server 2005012
Oct  8 14:45:27 M7-321-1 vmunix: afs: Lost contact with server 2b004812
Oct  8 14:49:55 M7-321-1 syslog: No servers or no hesiod
Oct  8 14:50:53 M7-321-1 syslog: No servers or no hesiod
Oct  8 14:52:50 M7-321-1 last message repeated 2 times

repeated about a billion times
until...

Oct  9 18:04:25 M7-321-1 vmunix: afs: Server 2005012 back up
Oct  9 18:04:26 M7-321-1 vmunix: afs: Server 2b004812 back up
Oct  9 18:39:01 M7-321-1 login: ROOT LOGIN console
Oct 10 02:30:12 M7-321-1 inetd[9637]: execv /etc/fingerd: No such file or directory

This may be due to that power outage in building 1

Then all of this stuff:

Oct 10 16:41:37 M7-321-1 vmunix: NFS getattr failed for server MNEMOSYNE.MIT.EDU
: TIMED OUT
Oct 11 01:15:03 M7-321-1 vmunix: afs: Lost contact with server 4004812
Oct 11 01:16:03 M7-321-1 vmunix: afs: Server 4004812 back up
Oct 11 13:10:03 M7-321-1 inetd[16992]: execv /etc/fingerd: No such file or direc
tory
Oct 11 23:46:21 M7-321-1 inetd[18846]: execv /etc/tftpd: No such file or directo
ry
Oct 11 23:46:21 M7-321-1 inetd[162]: /etc/tftpd: exit status 0x100
Oct 11 23:46:48 M7-321-1 last message repeated 3 times
Oct 11 23:48:24 M7-321-1 last message repeated 3 times
Oct 11 23:53:44 M7-321-1 last message repeated 6 times
Oct 12 12:31:53 M7-321-1 xlogin: I/O error
Oct 12 12:31:53 M7-321-1 xlogin: Couldn't open pty, looking for a good pty/tty
Oct 12 14:50:44 M7-321-1 xlogin: I/O error

and many repetitions, and this stuff happened once:

Oct 13 10:15:28 M7-321-1 vmunix: NFS write error: on host CLIO.MIT.EDU remote file system full
Oct 13 10:29:58 M7-321-1 vmunix: NFS server TALOS.MIT.EDU not responding, giving up
Oct 13 10:29:59 M7-321-1 vmunix: NFS getattr failed for server TALOS.MIT.EDU: TIMED OUT

Then, the previous repetitive error was ended by:

Oct 16 15:51:35 M7-321-1 xlogin: Couldn't open pty, looking for a good pty/tty
Oct 16 16:14:37 M7-321-1 vmunix: afs: Lost contact with server 2005012
Oct 16 16:18:40 M7-321-1 vmunix: afs: Lost contact with server b005012
Oct 16 16:30:45 M7-321-1 vmunix: afs: Server b005012 back up
Oct 16 16:30:45 M7-321-1 vmunix: afs: Server 2005012 back up
Oct 16 20:39:26 M7-321-1 inetd[162]: /etc/tftpd: exit status 0x100
Oct 16 20:39:53 M7-321-1 last message repeated 3 times

and ended with: 

Oct 17 11:46:45 M7-321-1 vmunix: afs: Lost contact with server 4004812
Oct 17 11:51:49 M7-321-1 vmunix: afs: Server 4004812 back up
Oct 17 12:17:32 M7-321-1 inetd[162]: /etc/tftpd: exit status 0x100
Oct 17 12:17:44 M7-321-1 last message repeated 2 times

But the pty error was not forgotten, now:

Oct 17 12:33:05 M7-321-1 xlogin: I/O error
Oct 17 12:33:05 M7-321-1 xlogin: Couldn't open pty, looking for a good pty/tty
Oct 17 12:51:12 M7-321-1 xlogin: I/O error
Oct 17 12:51:12 M7-321-1 xlogin: Couldn't open pty, looking for a good pty/tty
Oct 17 12:58:29 M7-321-1 xlogin: I/O error
Oct 17 12:58:29 M7-321-1 xlogin: Couldn't open pty, looking for a good pty/tty
Oct 17 14:44:40 M7-321-1 xlogin: I/O error
Oct 17 14:44:40 M7-321-1 xlogin: Couldn't open pty, looking for a good pty/tty
Oct 17 15:10:14 M7-321-1 xlogin: I/O error
Oct 17 15:10:14 M7-321-1 xlogin: Couldn't open pty, looking for a good pty/tty
Oct 17 16:15:44 M7-321-1 inetd[162]: /etc/tftpd: exit status 0x100
Oct 17 16:16:11 M7-321-1 last message repeated 3 times
Oct 17 16:16:44 M7-321-1 last message repeated 2 times
Oct 17 16:27:22 M7-321-1 last message repeated 14 times
Oct 17 16:38:03 M7-321-1 last message repeated 13 times
Oct 17 16:42:20 M7-321-1 last message repeated 5 times
Oct 17 16:43:40 M7-321-1 xlogin: I/O error
Oct 17 16:43:40 M7-321-1 xlogin: Couldn't open pty, looking for a good pty/tty
Oct 17 16:44:28 M7-321-1 vmunix: inode: table is full
Oct 17 16:45:01 M7-321-1 last message repeated 4 times
Oct 17 16:46:31 M7-321-1 last message repeated 10 times
Oct 17 16:54:05 M7-321-1 last message repeated 5 times

now, some more strangeness:

Oct 17 16:54:05 M7-321-1 inetd[162]: /etc/tftpd: exit status 0x100
Oct 17 16:54:06 M7-321-1 vmunix: inode: table is full
Oct 17 16:54:21 M7-321-1 last message repeated 3 times
Oct 17 16:54:58 M7-321-1 last message repeated 4 times
Oct 17 17:36:55 M7-321-1 xlogin: I/O error
Oct 17 17:36:55 M7-321-1 xlogin: Couldn't open pty, looking for a good pty/tty
Oct 17 17:37:50 M7-321-1 vmunix: inode: table is full
Oct 17 19:00:34 M7-321-1 vmunix: afs: Lost contact with server b005012
Oct 17 19:08:42 M7-321-1 vmunix: afs: Server b005012 back up
Oct 17 22:40:13 M7-321-1 vmunix: inode: table is full
Oct 17 22:40:18 M7-321-1 last message repeated 134 times
Oct 18 04:40:02 M7-321-1 vmunix: inode: table is full
Oct 18 04:40:07 M7-321-1 last message repeated 134 times
Oct 18 04:49:24 M7-321-1 vmunix: inode: table is full
Oct 18 04:49:25 M7-321-1 vmunix: inode: table is full
Oct 18 14:31:30 M7-321-1 vmunix: inode: table is full
Oct 18 14:32:45 M7-321-1 last message repeated 36 times

and then back to the pty error, intersperced with the "table is full error"
Now we get this (really strange):  During the following times, the terminal would not even give a login window (to type USERNAME, PASSWORD).

Oct 18 15:30:03 M7-321-1 xlogin: File table overflow
Oct 18 15:30:03 M7-321-1 xlogin: Could not destroy ticket file: /tmp/tkt_ttyp4.
Oct 18 15:30:05 M7-321-1 vmunix: inode: table is full
Oct 18 15:30:28 M7-321-1 last message repeated 4 times
Oct 18 15:30:28 M7-321-1 xlogin: Cannot create console log file.
Oct 18 15:30:28 M7-321-1 xlogin: File table overflow
Oct 18 15:30:33 M7-321-1 vmunix: inode: table is full
Oct 18 15:31:04 M7-321-1 last message repeated 11 times
Oct 18 15:33:07 M7-321-1 last message repeated 31 times
Oct 18 15:34:37 M7-321-1 last message repeated 20 times
Oct 18 15:34:37 M7-321-1 xlogin: Cannot create console log file.
Oct 18 15:34:37 M7-321-1 xlogin: File table overflow
Oct 18 15:55:58 M7-321-1 vmunix: inode: table is full
Oct 18 16:01:05 M7-321-1 vmunix: inode: table is full
Oct 18 16:06:12 M7-321-1 vmunix: inode: table is full
Oct 18 16:16:13 M7-321-1 last message repeated 259 times
Oct 18 16:26:14 M7-321-1 last message repeated 731 times
Oct 18 16:36:15 M7-321-1 last message repeated 731 times
Oct 18 16:46:16 M7-321-1 last message repeated 731 times
Oct 18 16:56:17 M7-321-1 last message repeated 731 times

Until I finally hard booted the damb thing:

Oct 18 17:06:16 M7-321-1 vmunix: Athena 4.3BSD UNIX #6-3-20 (probe@paris:VS2) Tu
e Jul 18 15:19:14 EDT 1989
Oct 18 17:06:17 M7-321-1 vmunix: real mem  = 6287360 (0x5ff000)
Oct 18 17:06:17 M7-321-1 vmunix: avail mem = 4924416 (0x4b2400)
Oct 18 17:06:17 M7-321-1 vmunix: using 204 buffers containing 418816 bytes of me
mory
Oct 18 17:06:17 M7-321-1 vmunix: uba0 at tr0
Oct 18 17:06:17 M7-321-1 vmunix: uda0 at uba0 csr 172150 vec 774, ipl 15
Oct 18 17:06:17 M7-321-1 vmunix: ra0 at uda0 slave 0
Oct 18 17:06:17 M7-321-1 vmunix: ra1 at uda0 slave 1
Oct 18 17:06:18 M7-321-1 vmunix: ra2 at uda0 slave 2
Oct 18 17:06:18 M7-321-1 vmunix: qe0 at uba0 csr 174440 vec 770, ipl 17
Oct 18 17:06:18 M7-321-1 vmunix: qe0: hardware address 08:00:2b:03:05:94
Oct 18 17:06:18 M7-321-1 vmunix: dz0 at uba0 csr 160100 vec 300, ipl 17
Oct 18 17:06:18 M7-321-1 vmunix: qv0 at uba0 csr 177200 vec 360, ipl 17
Oct 18 17:06:18 M7-321-1 vmunix: Starting afs cache scan...found 62 cache files.
Oct 18 17:06:28 M7-321-1 timed[177]: date changed by oath from: Wed Oct 18 17:06
:28 1989
Oct 18 17:15:34 M7-321-1 inetd[271]: execv /etc/fingerd: No such file or directory

Note the number of cache files.  62 seems a bit high!

Talk later

			-Ian Busch
			Ocean Engineering Athena Cluster Manager
			irbusch@athena.mit.edu


home help back first fref pref prev next nref lref last post