[16442] in Athena Bugs
Re: Sun Ultra-10 crash
daemon@ATHENA.MIT.EDU (John Hawkinson)
Sat Oct 24 13:41:42 1998
Date: Sat, 24 Oct 1998 13:41:41 -0400
To: tb@MIT.EDU (Thomas Bushnell, BSG)
Cc: bugs@MIT.EDU
In-Reply-To: "[16440] in Athena Bugs"
From: John Hawkinson <jhawk@MIT.EDU>
| I have the errors it printed on the console written out; I'll provide
| them on request. The core dump may be found in
| /mit/bitbucket/tb-core.
I think they're probably more valuable than the core file, or at least,
they should be kept in the discuss meeting before bitbucket goes away.
Nonetheless, they're extractable from the core file. The message
buffer run through "unsym" is attached after this message.
Anyhow, here's the stack. Note that you need to indirect through the trap to get it:
$c
complete_panic(?) + 24
do_panic(0x1,0x301af26c,0x1040c3e8,0x0,0x0,0x60404000)
vcmn_err(0x3,0x1040d408,0x3,0x301af26c,0x301afe80,0x4) + 190
cmn_err(0x3,0x1040d408,0x0,0x14a,0x14a,0x10406400) + 1c
die(0x31,0x301af450,0x60ebe000,0x0,0x1040d408,0x0) + a0
trap(0x301af450,0x0,0x60ebe000,0x1,0x0,0x6) + 830
0x301af450$<regs
0x301af450: tstate pc npc
99 f0001e06 1000b5e0 1000b5e4
0x301af4d8: y g1
0 0 604ad8d0
0x301af460: g1 g2
0 603eb960 0 603eb960
0x301af470: g3 g4
0 1 0 0
0x301af480: g6 g7
0 0 0 301afe80
0x301af490: o0 o1
0 44030 0 80004
0x301af4a0: o2 o3
0 48 ffffffff fffffff8
0x301af4b0: o4 o5
0 1ffc4090 0 10417390
0x301af4c0: o6 o7
0 301af4e0 0 100a5fd0
100a5fd0$c
?()
data address not found
301af4e0$c
?(?) + 0
bcopy(0x60ebe000,0x604adcd8,0x10,0x0,0xfffffff8,0x2)
xdrmblk_getbytes(0x601895b8,0x60ebdc00,0xe7a,0x603eb960,0x0,0x10000010) + 44
xdr_opaque(?) + 44
xdr_bytes(0x601895b8,0x60ebdc00,0xe7a,0x2,0x60ebdc00,0x60e71140)
xdr_rrok(0x601895b8,0x301afaf4,0xffffffff,0x6076f850,0x60134db0,0x603eb960) + 100
clnt_clts_kcallit_addr(0x60189570,0x60136828,0x2,0x60189598,0x5,0x60189594) + 784
clnt_clts_kcallit(0x60189570,0x6,0x6075e648,0x301afb58,0x6075e974,0x301afaf0) +38
rfscall(0x0,0x10408d08,0x6075e648,0x60138f60,0x57,0x301afaf0) + 378
rfs2call(0x601ba3d0,0x6,0x6075e648,0x301afb58,0x6075e974,0x301afaf0) + 48
nfsread(0x60e87ccc,0x60ebdc00,0x6075e800,0x6075e400,0x400,0x60e87cc0) + f8
nfs_bio(0x18000,0x600bbc20,0x60b344bc,0x0,0x10595680,0x60e87d14) + 88
nfs_readahead(0x60e87ccc,0x18000,0x10411000,0x10595680,0x60e87cc0,0x60e87d14) +164
nfs_async_start(0x601ba3d0,0x6076e804,0x601ba49a,0x601ba494,0x159a,0x601ba450) + 2b0
The trap type is 0x31, which /usr/include/v9/sys/machtrap.h defines as:
#define T_DATA_MMU_MISS 0x031
That might mean I'm looking at the wrong trap decoding file, since I
have a history of getting it wrong ;-)
Nothing looks obviously wrong with the arguments to bcopy:
0x60ebe000/10b
0x60ebe000: 0 0 0 0 0 0 0 0
0 0
0x604adcd8/10b
0x604adcd8: 0 0 0 0 0 0 010 0320
0 0
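One way to sanity-check these addresses against the fault address reported in the message buffer is sketched below. It is only an illustration: the names describe_fault and PAGE_SIZE are made up for this example, the 8 KB base page size is an assumption about this machine, and it assumes that the second and third values shown for xdrmblk_getbytes (0x60ebdc00 and 0xe7a) are the destination buffer and byte count, which the register dump alone does not prove.

```python
# Hypothetical helper: all addresses are copied from the adb output and the
# console messages in this report; nothing here is read from the core file.
PAGE_SIZE = 0x2000  # assumption: 8 KB base page size on this UltraSPARC


def describe_fault(fault_addr, buf_base, buf_len, page_size=PAGE_SIZE):
    """Relate a faulting address to the buffer it was (presumably) copied into."""
    offset = fault_addr - buf_base
    first_page = buf_base // page_size
    last_page = (buf_base + buf_len - 1) // page_size
    return {
        "offset_into_buffer": offset,
        "inside_buffer": 0 <= offset < buf_len,
        "page_aligned": fault_addr % page_size == 0,
        "buffer_crosses_page": first_page != last_page,
    }


# (fault address, assumed buffer base, assumed byte count) for both panics
crashes = {
    "this crash": (0x60EBE000, 0x60EBDC00, 0xE7A),
    "earlier crash in the message buffer": (0x60F0A000, 0x60F09C00, 0xE7A),
}

for name, (fault, base, length) in crashes.items():
    info = describe_fault(fault, base, length)
    print(name, hex(info["offset_into_buffer"]), info)
```

Under those assumptions, both panics fault exactly 0x400 bytes into the buffer, at the first byte of the next page. That would be consistent with the store running onto a page that has no valid mapping, though it does not by itself say why.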
Here's the faulting routine:
blkleft/14i
blkleft:
blkleft: ldd [%l7], %f2
add %l7, 0x8, %l7
subcc %i4, 0x8, %i4
faligndata %f0, %f2, %f8
std %f8, [%i0]
bl,pn %icc,blkdone
add %i0, 0x8, %i0
ldd [%l7], %f0
add %l7, 0x8, %l7
subcc %i4, 0x8, %i4
faligndata %f2, %f0, %f8
--> std %f8, [%i0]
bge,pt %icc,blkleft
add %i0, 0x8, %i0
with the --> marking the faulting instruction.
Looks like %f8 is some kind of floating point register.
I'm now confused, since I don't know why bcopy() would
be doing floating point operations, nor do I know
enough about floating point operations on the SPARC
to attempt to explain what's going on.
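As a small aid for reading the register dump, the tstate value can be split into its fields. The sketch below follows the SPARC V9 TSTATE layout (CCR in bits 39:32, ASI in bits 31:24, PSTATE in bits 19:8, CWP in bits 4:0) and the V9 PSTATE bit positions for PRIV and PEF. The function name decode_tstate is made up for this example, and the value is the tstate printed in the BAD TRAP message for this crash.

```python
# Hypothetical helper for splitting a SPARC V9 TSTATE value into its fields.
PSTATE_PRIV = 0x04  # privileged mode
PSTATE_PEF = 0x10   # floating-point unit enabled


def decode_tstate(tstate):
    """Return the CCR, ASI, PSTATE and CWP fields of a 40-bit TSTATE value."""
    pstate = (tstate >> 8) & 0xFFF
    return {
        "ccr": (tstate >> 32) & 0xFF,
        "asi": (tstate >> 24) & 0xFF,
        "pstate": pstate,
        "cwp": tstate & 0x1F,
        "priv": bool(pstate & PSTATE_PRIV),
        "pef": bool(pstate & PSTATE_PEF),
    }


fields = decode_tstate(0x99F0001E06)  # tstate from the BAD TRAP message
for name, value in fields.items():
    print(name, hex(value) if isinstance(value, int) and not isinstance(value, bool) else value)
```

For this value the PEF bit comes out set, meaning the floating-point unit was enabled at the time of the trap. That fits a copy loop that moves data through the %f registers with ldd, faligndata and std, but it is only a reading of the saved state, not an explanation of the fault.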
--jhawk
Ahp@1/su@14,3083f8> major <37> minor <0>
SUNW,afb0 at root: UPA 0x1e 0x0
SUNW,afb0 is /SUNW,afb@1e,0
stdout is </SUNW,afb@1e,0> major <9> minor <0>
se0 at ebus0: offset 14,400000
se0 is /pci@1f,0/pci@1,1/ebus@1/se@14,400000
SUNW,hme0: CheerIO 2.0 (Rev Id = c1) Found
SUNW,hme0 is /pci@1f,0/pci@1,1/network@1,1
SUNW,hme0: Using Internal Transceiver
SUNW,hme0: 10 Mbps half-duplex Link Up
dump on /dev/dsk/c0t0d0s1 size 524448K
fdthree0 at ebus0: offset 14,3023f0
fdthree0 is /pci@1f,0/pci@1,1/ebus@1/fdthree@14,3023f0
pseudo-device: winlock0
winlock0 is /pseudo/winlock@0
pseudo-device: lockstat0
lockstat0 is /pseudo/lockstat@0
pseudo-device: vol0
vol0 is /pseudo/vol@0
pseudo-device: llc10
llc10 is /pseudo/llc1@0
SUNW,CS42310 at ebus0: offset 14,200000
SUNW,CS42310 is /pci@1f,0/pci@1,1/ebus@1/SUNW,CS4231@14,200000
SUNW,m64B0 is /pci@1f,0/pci@1,1/SUNW,m64B@2
m64#0: 1152x900, 2M mappable, rev 4754.9a
pseudo-device: pm0
pm0 is /pseudo/pm@0
pseudo-device: tod0
tod0 is /pseudo/tod@0
atapicd2 at pci1095,6460 target 2 lun 0
atapicd2 is /pci@1f,0/pci@1,1/ide@3/atapicd@2,0
pem: no event handler
ecpp0 at ebus0: offset 14,3043bc
ecpp0 is /pci@1f,0/pci@1,1/ebus@1/ecpp@14,3043bc
power0 at ebus0: offset 14,724000
power0 is /pci@1f,0/pci@1,1/ebus@1/power@14,724000
Starting AFS cache scan...found 2699 non-empty cache files (37%)
.
BAD TRAP: cpu=0 type=0x31 rp=0x302d5450 addr=0x60f0a000 mmu_fsr=0x0
sched: trap type = 0x31
addr=0x60f0a000
pid=0, pc=0x[1000b5e0=blkleft+2c], sp=0x302d54e0, tstate=0x99f0001e00, context=0x0
g1-g7: 604b30d0, 603eb7a0, 603eb7a0, 1, 0, 0, 302d5e80
Begin traceback... sp = 302d54e0
Called from [601358d0=xdrmblk_getbytes+44], fp=302d5688, args=60f0a000 604b34d8 10 0 [fffffff8=-8] 2
Called from [601351d4=xdr_opaque+44], fp=302d56e8, args=[601895b8=ata_global_mutex+15fc] 60f09c00 e7a 603eb7a0 0 [10000010=trap_table0+10]
Called from [6076093c=expand_format_string+b8], fp=302d5748, args=[601895b8=ata_global_mutex+15fc] 60f09c00 e7a 260f09c00 60e93500
Called from [6012fe58=clnt_clts_kcallit_addr+784], fp=302d57a8, args=[601895b8=ata_global_mutex+15fc] 302d5af4 [ffffffff=VADDR_MASK_DEBUG] [60771850=nfs3fstyp+2e8] [60134db0=xdr_void] 603eb7a0
Called from [60130178=clnt_clts_kcallit+38], fp=302d5878, args=[60189570=ata_global_mutex+15b4] [60136828=rcstat+20] 2 [60189598=ata_global_mutex+15dc] 5 [60189594=ata_global_mutex+15d8]
Called from [6075538c=purge_rtable+60], fp=302d58e8, args=[60189570=ata_global_mutex+15b4] 6 [60760648=xdr_statfs+1c] 302d5b58 [60760974=expand_format_string+f0] 302d5af0
Called from [60754e10=make_rnode+3bc], fp=302d5990, args=0 [10408d08=hz] [60760648=xdr_statfs+1c] [601399e0=svc_head+1dd0] 57 302d5af0
Called from [6075b214=nfsrename+304], fp=302d5a20, args=[601ba3d0=symbol+251b] 6 [60760648=xdr_statfs+1c] 302d5b58 [60760974=expand_format_string+f0] 302d5af0
Called from [6075e070=xdr_writeargs+1d8], fp=302d5b88, args=60eaaa5c 60f09c00 [60760800=nfssys+140] [60760400=xdr_fastslargs+1c4] 400 60eaaa50
Called from [6075ed94=xdr_rddirargs+124], fp=302d5be8, args=20000 [600bbb60=tcp_time_wait_lock+41a4] 60e4fa440 105b5780 60eaaaa4
Called from [607524b4=nd_get_reply+25c], fp=302d5c68, args=60eaaa5c 20000 [10411000=segkmap]105b5780 60eaaa50 60eaaaa4
Called from [100264ec=thread_start+4], fp=302d5ce0, args=[601ba3d0=symbol+251b] [60770804=acl_timer_type_v3+7da] [601ba49a=symbol+25e5] [601ba494=symbol+25df] 1595 [601ba450=symbol+259b]
Called from [60752204=nd_send_data+438], fp=0, args=[60892380=pcfs_minor+1c0c] 0 0 0 0 0
End traceback...
panic[cpu0]/thread=0x302d5e80: trap
syncing file systems...BAD TRAP: cpu=0 type=0x31 rp=0x302d4ca8 addr=0xc mmu_fsr=0x0
sched: trap type = 0x31
addr=0xc
pid=0, pc=0x[10059034=getblk+17c], sp=0x302d4d38, tstate=0x4480001e00, context=0x0
g1-g7: da0, [1043241c=sleepq_head], 8100000012882636, [6055e6f8=ipd_mutex+3d0c], 0, 0, 302d5e80
panic[cpu0]/thread=0x302d5e80: trap
2964 static and sysmap kernel pages
4, fp=dynamic kernel data pages
173 kernel-pageable pages
1 segklled from [60762070=nfs3_vget+7c], fp=302850f0, args=[605bb2a4=config_rt_dptbl+755c] 6049fc0
0 [60764800=nfs3create+64]cess pages
3167 total pages (3167 chunks)
dumping to vp 603b67fc, offset 998230
Cpu0: SUNW,UltraSPARC-IIi (upaid 0 impl 0x12 ver 0x12 clock 333 MHz)
SunOS Release 5.6 Version Generic_105181-05 [UNIX(R) System V Release 4.0]
Copyright (c) 1983-1997, Sun Microsystems, Inc.
mem = 131072K (0x8000000)
avail mem = 126263296
Ethernet address = 8:0:20:93:c1:a9
root nexus = Sun Ultra 5/10 UPA/PCI (UltraSPARC-IIi 333MHz)
pci0 at root: UPA 0x1f 0x0
PCI-device: pci@1,1, simba #0
PCI-device: pci@1, simba #1
dad0 at pci1095,6460 target 0 lun 0
dad0 is /pci@1f,0/pci@1,1/ide@3/dad@0,0
<Seagate Medalist 34342A cyl 8892 alt 2 hd 15 sec 63>
root on /pci@1f,0/pci@1,1/ide@3/disk@0,0:a fstype ufs
obpsym: symbolic debugging is available.
su0 at ebus0: offset 14,3083f8
su0 is /pci@1f,0/pci@1,1/ebus@1/su@14,3083f8
su1 at ebus0: offset 14,3062f8
su1 is /pci@1f,0/pci@1,1/ebus@1/su@14,3062f8
keyboard is </pci@1f,0/pci@1,1/ebus@1/su@14,3083f8> major <37> minor <0>
mouse is </pci@1f,0/pci@1,1/ebus@1/su@14,3062f8> major <37> minor <1>
stdin is </pci@1f,0/pci@1,1/ebus@1/su@14,3083f8> major <37> minor <0>
SUNW,afb0 at root: UPA 0x1e 0x0
SUNW,afb0 is /SUNW,afb@1e,0
stdout is </SUNW,afb@1e,0> major <9> minor <0>
se0 at ebus0: offset 14,400000
se0 is /pci@1f,0/pci@1,1/ebus@1/se@14,400000
SUNW,hme0: CheerIO 2.0 (Rev Id = c1) Found
SUNW,hme0 is /pci@1f,0/pci@1,1/network@1,1
SUNW,hme0: Using Internal Transceiver
SUNW,hme0: 10 Mbps half-duplex Link Up
dump on /dev/dsk/c0t0d0s1 size 524448K
fdthree0 at ebus0: offset 14,3023f0
fdthree0 is /pci@1f,0/pci@1,1/ebus@1/fdthree@14,3023f0
pseudo-device: winlock0
winlock0 is /pseudo/winlock@0
pseudo-device: lockstat0
lockstat0 is /pseudo/lockstat@0
pseudo-device: vol0
vol0 is /pseudo/vol@0
pseudo-device: llc10
llc10 is /pseudo/llc1@0
SUNW,CS42310 at ebus0: offset 14,200000
SUNW,CS42310 is /pci@1f,0/pci@1,1/ebus@1/SUNW,CS4231@14,200000
SUNW,m64B0 is /pci@1f,0/pci@1,1/SUNW,m64B@2
m64#0: 1152x900, 2M mappable, rev 4754.9a
pseudo-device: pm0
pm0 is /pseudo/pm@0
pseudo-device: tod0
tod0 is /pseudo/tod@0
atapicd2 at pci1095,6460 target 2 lun 0
atapicd2 is /pci@1f,0/pci@1,1/ide@3/atapicd@2,0
pem: no event handler
ecpp0 at ebus0: offset 14,3043bc
ecpp0 is /pci@1f,0/pci@1,1/ebus@1/ecpp@14,3043bc
power0 at ebus0: offset 14,724000
power0 is /pci@1f,0/pci@1,1/ebus@1/power@14,724000
Starting AFS cache scan...found 2662 non-empty cache files (36%)
.
BAD TRAP: cpu=0 type=0x31 rp=0x301af450 addr=0x60ebe000 mmu_fsr=0x0
sched: trap type = 0x31
addr=0x60ebe000
pid=0, pc=0x[1000b5e0=blkleft+2c], sp=0x301af4e0, tstate=0x99f0001e06, context=0x0
g1-g7: 604ad8d0, 603eb960, 603eb960, 1, 0, 0, 301afe80
Begin traceback... sp = 301af4e0
Called from [601358d0=xdrmblk_getbytes+44], fp=301af688, args=60ebe000 604adcd8 10 0 [fffffff8=-8] 2
Called from [601351d4=xdr_opaque+44], fp=301af6e8, args=[601895b8=ata_global_mutex+15fc] 60ebdc00 e7a 603eb960 0 [10000010=trap_table0+10]
Called from [6075e93c=xdr_rrok+100], fp=301af748, args=[601895b8=ata_global_mutex+15fc] 60ebdc00 e7a 260ebdc00 60e71140
Called from [6012fe58=clnt_clts_kcallit_addr+784], fp=301af7a8, args=[601895b8=ata_global_mutex+15fc] 301afaf4 [ffffffff=VADDR_MASK_DEBUG] [6076f850=rdres_discrim] [60134db0=xdr_void] 603eb960
Called from [60130178=clnt_clts_kcallit+38], fp=301af878, args=[60189570=ata_global_mutex+15b4] [60136828=rcstat+20] 2 [60189598=ata_global_mutex+15dc] 5 [60189594=ata_global_mutex+15d8]
Called from [6075338c=rfscall+378], fp=301af8e8, args=[60189570=ata_global_mutex+15b4] 6 [6075e648=xdr_readargs] 301afb58 [6075e974=xdr_rdresult] 301afaf0
Called from [60752e10=rfs2call+48], fp=301af990, args=0 [10408d08=hz] [6075e648=xdr_readargs] [60138f60=svc_head+1350] 57 301afaf0
Called from [60759214=nfsread+f8], fp=301afa20, args=[601ba3d0=symbol+251b] 6 [6075e648=xdr_readargs] 301afb58 [6075e974=xdr_rdresult] 301afaf0
Called from [6075c070=nfs_bio+88], fp=301afb88, args=60e87ccc 60ebdc00 [6075e800=xdr_fastreadargs+24] [6075e400=xdr_fattr+74] 400 60e87cc0
Called from [6075cd94=nfs_readahead+164], fp=301afbe8, args=18000 [600bbc20=tcp_time_wait_lock+4264] 60b344bc0 10595680 60e87d14
Called from [607504b4=nfs_async_start+2b0], fp=301afc68, args=60e87ccc 18000 [10411000=segkmap]10595680 60e87cc0 60e87d14
Called from [100264ec=thread_start+4], fp=301afce0, args=[601ba3d0=symbol+251b] [6076e804=nfs_async_timeout] [601ba49a=symbol+25e5] [601ba494=symbol+25df] 159a [601ba450=symbol+259b]
Called from [60750204=nfs_async_start], fp=0, args=60663d48 0 0 0 0 0
End traceback...
panic[cpu0]/thread=0x301afe80: trap
syncing file systems... [5] 1 done
2962 static and sysmap kernel pages
26 dynamic kernel data pages
176 kernel-pageable pages
1 segkmap kernel pages
0 segvn kernel pages
0 current user process pages
3165 total pages (3165 chunks)
dumping to vp 603b67fc, offset