[14890] in Athena Bugs

home help back first fref pref prev next nref lref last post

Lots of sparcs in a funny state?

daemon@ATHENA.MIT.EDU (John Hawkinson)
Sun Dec 22 20:30:31 1996

Date: Sun, 22 Dec 1996 20:30:26 -0500 (EST)
To: bugs@MIT.EDU
Cc: hotline@MIT.EDU, Kevin Fu <fubob@MIT.EDU>
From: John Hawkinson <jhawk@MIT.EDU>

Kevin reported that after the E40 router outage, a number of sparcs in
W20 and M11 were found at the "ok" prompt, right after running
gettime.

I asked him to get a crash dump, and it looked like this:

m11-113-5# adb -k unix.0 vm*
$c
physmem 1e6b
complete_panic(0xf0048bb0,0xf03e5a4c,0xf03e58d8,0x3,0x0,0x1) + f8
do_panic(?) + 1c
vcmn_err(0xf015f848,0xf03e5a4c,0xf03e5a4c,0x0,0x0,0x3)
cmn_err(0x3,0xf015f848,0xf015fc00,0x29,0x29,0xf0152400) + 1c
die(0x1,0xf03e5b54,0x0,0x164,0x3,0xf015f848) + 78
trap(0x1,0xf03e5b54,0xf017d36c,0x164,0x3,0x0) + 31c
fault(0xf00401a0,0xffeff000,0x0,0x71e1000c,0x80000000,0x0) + 84
prom_enter_mon(0x44000e3,0x44000e3,0xf0152b78,0x44000e3,0x4,0x10) + 60
debug_enter(?) + d8
abort_sequence_enter(0x0,0xfc0fd72c,0xfc0fd72c,0x0,0x29,0xf0152400)
kbdinput(0xfc01d018,0x4d,0x4d,0x0,0x1,0xfc01d040) + 2cc
kbdrput(0xfc1c6380,0xfc1c6338,0xfc1c6338,0xfc01d018,0xfc1c6348,0xfc18acc0) + 130
putnext(0x0,0x40000000,0xffffffff,0xfc1c6338,0x0,0x10) + 70
callout_execute(0xf01744c0,0xf0175aa0,0xc19d0,0x80000000,0xfc14788c,0x900111d0)+ 90
callout_thread(0x0,0x0,0xf01878c8,0xf01878c8,0xf01744d4,0xf01744c0) + 24

Everything between complete_panic() and fault() is caused by
Kevin forcing the dump. The rest strongly implies that someone
paraded around various clusters pressing STOP-A, but
it's conceivable there's some more subtle bug at work here. Especially
since some machines were found like this that were in alarmed 
locked rooms (eg: gaston).

Crash dump for this one is sitting on m11-113-5 in /var/crash.

--jhawk



m11-113-5# crash vm* u*
dumpfile = vmcore.0, namelist = unix.0, outfile = stdout
> p
PROC TABLE SIZE = 490
SLOT ST  PID  PPID  PGID   SID   UID PRI CPU   NAME        FLAGS
   0 r     0     0     0     0     0  98  69 sched          load sys lock
   1 r     1     0     0     0     0  98  17 init           load
   2 r     2     0     0     0     0  98   0 pageout        load sys lock nowait
   3 r     3     0     0     0     0  98  80 fsflush        load sys lock nowait
   4 r    47     1     0     0     0  98  13 rc2            load
   5 r   125    47     0     0     0  98  10 jsh            load
   6 r   130   125     0     0     0  98  12 gettime        load
   7 r    85     1    85    85     0  98  16 rpcbind        load
   8 r    94     1    94    94     0  98   2 in.named       load
   9 r    98     1    98    98     0  98  24 inetd          load
  10 r   107     1   107   107     0  98  18 syslogd        load nowait
  12 r   114     1   114   114     0  98  12 cron           load
  13 r   124     1   124   124     0  98   3 utmpd          load
> status
system name:    SunOS
release:        5.4
node name:      m11-113-5
version:        Generic_101945-37
machine name:   sun4m
time of crash:  Sun Dec 22 19:58:16 1996
age of system:  2 hr., 12 min.
panicstr:       Text fault
panic registers:
        pc: f0048bb0      sp: f03e58d8
> user
PER PROCESS USER AREA FOR PROCESS 0
PROCESS MISC:
        command: sched, psargs: sched
        start: Sun Dec 22 17:46:07 1996
        mem: 0, type: exec
        vnode of current directory: fc107e40
OPEN FILES, POFILE FLAGS, AND THREAD REFCNT:
        [0]: F 0xfc187fb0, 0, 0 [1]: F 0xfc187f88, 0, 0
        [2]: F 0xfc187f60, 0, 0 [3]: F 0xfc187f38, 0, 0
        [4]: F 0xfc187f10, 0, 0
 cmask: 0000
RESOURCE LIMITS:
        cpu time: unlimited/unlimited
        file size: unlimited/unlimited
        swap size: 2147479552/2147479552
        stack size: 8388608/2147479552
        coredump size: unlimited/unlimited
        file descriptors: 64/1024
        address space: unlimited/unlimited
SIGNAL DISPOSITION:
           1:  default   2:  default   3:  default   4:  default
           5:  default   6:  default   7:  default   8:  default
           9:  default  10:  default  11:  default  12:  default
          13:  default  14:  default  15:  default  16:  default
          17:  default  18:  default  19:  default  20:  default
          21:  default  22:  default  23:  default  24:  default
          25:  default  26:  default  27:  default  28:  default
          29:  default  30:  default  31:  default  32:  default
          33:  default  34:  default  35:  default  36:  default
          37:  default  38:  default  39:  default  40:  default
          41:  default  42:  default  43:  default

> mount
 FSTYP  BSZ  MAJ/MIN      FSID    VNCOVERED   PDATA      BCOUNT  FLAGS
   ufs 8192   32,24      800018          0  fc106ef8          0   notr
   ufs 8192   32,27      80001b   fc0e3be8  fc106b78          0   notr
   ufs 8192   32,30      80001e   fc0e3468  fc106c58          0   notr
  proc 1024  147,0      24c0000   fc234e98         0          0  
   ufs 8192   32,29      80001d   fc197d10  fc106d38          0   notr
> pty
ptms_tty TABLE SIZE = 48
SLOT   MWQPTR   SWQPTR  PT_BUFP  TTYPID STATE
> v
v_buf: 100
v_call:   0
v_proc: 490
v_nglobpris: 110
v_maxsyspri:  99
v_clist:   0
v_maxup: 485
v_hbuf:  64
v_hmask:  63
v_pbuf:   0
v_sptmap:   0
v_maxpmem: 0
v_autoup: 30
v_bufhwm: 620

home help back first fref pref prev next nref lref last post