[21185] in Athena Bugs

home help back first fref pref prev next nref lref last post

Sporadic laptop AFS crash

daemon@ATHENA.MIT.EDU (Tom Cavin)
Wed Dec 11 15:56:07 2002

MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Message-ID: <15863.42568.661222.424507@lap1-wccf.mit.edu>
Date: Wed, 11 Dec 2002 15:55:36 -0500
From: Tom Cavin <cavin@MIT.EDU>
To: Athena Bugs list <bugs@MIT.EDU>
CC: SIPB Linux Help <linux-help@MIT.EDU>


Hi,

I'm getting sporadic crashes on my Linux Athena laptop and I'm not sure
whether this is a hardware problem or a software problem.  I'd like some
help in figuring out what's wrong so I can determine what I need to do
about it.

I'm including a section from the /var/log/messages file with the error
trace.  The setup for this is as follows.  The system was running without
the network (run level 4, where the run level has been configured to not
start the network and all the AFS and Athena stuff) at home.  I had run
sync and then suspended the laptop.  When I brought the system back to MIT,
I woke it up from the suspension and ran a script to reconfigure it for run
level 5, then rebooted.

After the system shutdown and before it came back up I plugged in my pcmcia
network card.  This restart did _not_ involve turning off the power.  The
system started coming back up and got as far as the AFS cache scan before
it errored.  The details of the errors are in the messages log below.
Following the messages quote, I'm including some additional text that
appeared on the console.  (This text is a mixture of some of the message
lines and the script output to the consolve.)

The system was responsive to Ctrl-Alt-Delete, and when the system finished
its shutdown -- during the POST -- I turned off the power, waited a bit,
then started it up again.  This time it came up without problem.

The problem is inconsistant, but most frequently happens in the situation
I've described above.  It has also happened while running, and has occured
about 10 times over the last month.

The hardware involved is an IBM ThinkPad A21m that has been running in this
configuration for over a year.  It's running Linux Athena and is up-to-date.

Can anyone provide me with any clues as to what is going on and what I can
do about it?

Thanks,

	--Tom

**** Exerpt from /var/log/messages

Dec 11 12:09:01 lap1-wccf network: Setting network parameters:  succeeded 
Dec 11 12:09:01 lap1-wccf network: Bringing up loopback interface:  succeeded 
Dec 11 12:09:02 lap1-wccf ifup: Determining IP information for eth0... 
Dec 11 12:09:02 lap1-wccf pumpd[506]: starting at (uptime 0 days, 0:00:38) Wed Dec 11 12:09:02 2002   
Dec 11 12:09:06 lap1-wccf ifup:  done. 
Dec 11 12:09:06 lap1-wccf network: Bringing up interface eth0:  succeeded 
Dec 11 12:09:06 lap1-wccf named[573]: starting BIND 9.2.1 
Dec 11 12:09:06 lap1-wccf athena-bind: named startup succeeded 
Dec 11 12:09:07 lap1-wccf named[573]: loading configuration from '/etc/named.conf' 
Dec 11 12:09:12 lap1-wccf kernel: Starting AFS cache scan...found 9290 non-empty cache files (94%%).
Dec 11 12:09:12 lap1-wccf kernel: Unable to handle kernel paging request at virtual address 85f3001c
Dec 11 12:09:12 lap1-wccf kernel:  printing eip:
Dec 11 12:09:12 lap1-wccf kernel: c012f4a1
Dec 11 12:09:12 lap1-wccf kernel: *pde = 00000000
Dec 11 12:09:12 lap1-wccf kernel: Oops: 0000
Dec 11 12:09:12 lap1-wccf kernel: libafs-2.4.18-17.7.x.i686 xircom_cb ds yenta_socket pcmcia_core ide-cd cdrom e
Dec 11 12:09:12 lap1-wccf kernel: CPU:    0
Dec 11 12:09:12 lap1-wccf kernel: EIP:    0010:[kmalloc+161/256]    Tainted: PF
Dec 11 12:09:12 lap1-wccf kernel: EIP:    0010:[<c012f4a1>]    Tainted: PF
Dec 11 12:09:12 lap1-wccf kernel: EFLAGS: 00010002
Dec 11 12:09:12 lap1-wccf kernel: 
Dec 11 12:09:12 lap1-wccf kernel: EIP is at kmalloc [kernel] 0xa1 (2.4.18-17.7.x)
Dec 11 12:09:12 lap1-wccf kernel: eax: aa010001   ebx: c25a7180   ecx: ddef0000   edx: 00000040
Dec 11 12:09:12 lap1-wccf kernel: esi: aa010001   edi: 00000246   ebp: ddef0100   esp: ddeadae8
Dec 11 12:09:12 lap1-wccf kernel: ds: 0018   es: 0018   ss: 0018
Dec 11 12:09:12 lap1-wccf kernel: Process cp (pid: 829, stackpage=ddead000)
Dec 11 12:09:12 lap1-wccf kernel: Stack: 00000000 df2178a0 dee19420 00000001 00000040 00000000 00000040 00000001 
Dec 11 12:09:12 lap1-wccf kernel:        e08a5d3f 00000040 000000f0 ded57448 e087be5a e08cbe84 00000000 00000040 
Dec 11 12:09:12 lap1-wccf kernel:        e08a6053 00000040 dee19420 00000000 00000002 00000000 00000001 ded57448 
Dec 11 12:09:12 lap1-wccf kernel: Call Trace: [<e08a5d3f>] linux_alloc [libafs-2.4.18-17.7.x.i686] 0x1f (0xddeadb08))
Dec 11 12:09:12 lap1-wccf kernel: [<e087be5a>] afs_FindServer [libafs-2.4.18-17.7.x.i686] 0x26 (0xddeadb18))
Dec 11 12:09:12 lap1-wccf kernel: [<e08cbe84>] afs_linux_alloc_sem [libafs-2.4.18-17.7.x.i686] 0x0 (0xddeadb1c))
Dec 11 12:09:12 lap1-wccf kernel: [<e08a6053>] osi_linux_alloc [libafs-2.4.18-17.7.x.i686] 0x53 (0xddeadb28))
Dec 11 12:09:12 lap1-wccf kernel: [<e087c5c1>] afs_GetServer [libafs-2.4.18-17.7.x.i686] 0x261 (0xddeadb48))
Dec 11 12:09:12 lap1-wccf kernel: [<e08cbc60>] xdrrx_ops [libafs-2.4.18-17.7.x.i686] 0x0 (0xddeadb60))
Dec 11 12:09:12 lap1-wccf kernel: [<e0891092>] InstallUVolumeEntry [libafs-2.4.18-17.7.x.i686] 0x3fa (0xddeadb88))
Dec 11 12:09:12 lap1-wccf kernel: [<e08a0e85>] xdrrx_getint32 [libafs-2.4.18-17.7.x.i686] 0x15 (0xddeadbe8))
Dec 11 12:09:12 lap1-wccf kernel: [<e0890370>] afs_SetupVolume [libafs-2.4.18-17.7.x.i686] 0x344 (0xddeadc38))
Dec 11 12:09:12 lap1-wccf kernel: [<e08b9f4d>] .rodata.str1.1 [libafs-2.4.18-17.7.x.i686] 0x80d (0xddeadc50))
Dec 11 12:09:12 lap1-wccf kernel: [<e0890183>] afs_SetupVolume [libafs-2.4.18-17.7.x.i686] 0x157 (0xddeadc58))
Dec 11 12:09:12 lap1-wccf kernel: [<e08908e5>] afs_NewVolumeByName [libafs-2.4.18-17.7.x.i686] 0x2a9 (0xddeadc88))
Dec 11 12:09:12 lap1-wccf kernel: [<e088744d>] EvalMountPoint [libafs-2.4.18-17.7.x.i686] 0x1d1 (0xddeadcf8))
Dec 11 12:09:12 lap1-wccf kernel: [<e0882f2f>] afs_CopyOutAttrs [libafs-2.4.18-17.7.x.i686] 0x1e7 (0xddeadd48))
Dec 11 12:09:12 lap1-wccf kernel: [<e08a80e2>] vcache2inode [libafs-2.4.18-17.7.x.i686] 0x22 (0xddeadd68))
Dec 11 12:09:12 lap1-wccf kernel: [<e087818f>] osi_dnlc_enter [libafs-2.4.18-17.7.x.i686] 0x1c7 (0xddeadd74))
Dec 11 12:09:12 lap1-wccf kernel: [<e08d08e8>] nameCache [libafs-2.4.18-17.7.x.i686] 0x4168 (0xddeadd78))
Dec 11 12:09:12 lap1-wccf kernel: [<e0878154>] osi_dnlc_enter [libafs-2.4.18-17.7.x.i686] 0x18c (0xddeadd84))
Dec 11 12:09:12 lap1-wccf kernel: [<e0887758>] afs_EvalFakeStat_int [libafs-2.4.18-17.7.x.i686] 0x114 (0xddeadde8))
Dec 11 12:09:12 lap1-wccf kernel: [<e0887a19>] afs_EvalFakeStat [libafs-2.4.18-17.7.x.i686] 0x19 (0xddeade18))
Dec 11 12:09:12 lap1-wccf kernel: [<e0882a66>] afs_access [libafs-2.4.18-17.7.x.i686] 0x7e (0xddeade38))
Dec 11 12:09:12 lap1-wccf kernel: [<e08a9506>] afs_linux_lookup [libafs-2.4.18-17.7.x.i686] 0x102 (0xddeade64))
Dec 11 12:09:12 lap1-wccf kernel: [d_alloc+28/368] d_alloc [kernel] 0x1c (0xddeade78))
Dec 11 12:09:12 lap1-wccf kernel: [<c014ae2c>] d_alloc [kernel] 0x1c (0xddeade78))
Dec 11 12:09:12 lap1-wccf kernel: [<e08a9a95>] afs_linux_permission [libafs-2.4.18-17.7.x.i686] 0x41 (0xddeade98))
Dec 11 12:09:12 lap1-wccf kernel: [<e08a9404>] afs_linux_lookup [libafs-2.4.18-17.7.x.i686] 0x0 (0xddeadeac))
Dec 11 12:09:12 lap1-wccf kernel: [<e08a9404>] afs_linux_lookup [libafs-2.4.18-17.7.x.i686] 0x0 (0xddeadeb4))
Dec 11 12:09:12 lap1-wccf kernel: [permission+29/48] permission [kernel] 0x1d (0xddeadeb8))
Dec 11 12:09:12 lap1-wccf kernel: [<c01427ad>] permission [kernel] 0x1d (0xddeadeb8))
Dec 11 12:09:12 lap1-wccf kernel: [link_path_walk+2216/2256] link_path_walk [kernel] 0x8a8 (0xddeadec4))
Dec 11 12:09:12 lap1-wccf kernel: [<c01432f8>] link_path_walk [kernel] 0x8a8 (0xddeadec4))
Dec 11 12:09:12 lap1-wccf kernel: [do_zap_page_range+399/592] do_zap_page_range [kernel] 0x18f (0xddeadee0))
Dec 11 12:09:12 lap1-wccf kernel: [<c012625f>] do_zap_page_range [kernel] 0x18f (0xddeadee0))
Dec 11 12:09:12 lap1-wccf kernel: [getname+94/160] getname [kernel] 0x5e (0xddeadf0c))
Dec 11 12:09:12 lap1-wccf kernel: [<c014262e>] getname [kernel] 0x5e (0xddeadf0c))
Dec 11 12:09:12 lap1-wccf kernel: [path_lookup+27/48] path_lookup [kernel] 0x1b (0xddeadf20))
Dec 11 12:09:12 lap1-wccf kernel: [<c01435ab>] path_lookup [kernel] 0x1b (0xddeadf20))
Dec 11 12:09:12 lap1-wccf kernel: [__user_walk+36/64] __user_walk [kernel] 0x24 (0xddeadf30))
Dec 11 12:09:12 lap1-wccf kernel: [<c01437f4>] __user_walk [kernel] 0x24 (0xddeadf30))
Dec 11 12:09:12 lap1-wccf kernel: [vfs_stat+23/80] vfs_stat [kernel] 0x17 (0xddeadf44))
Dec 11 12:09:12 lap1-wccf kernel: [<c013ffe7>] vfs_stat [kernel] 0x17 (0xddeadf44))
Dec 11 12:09:12 lap1-wccf kernel: [sys_stat64+17/48] sys_stat64 [kernel] 0x11 (0xddeadf70))
Dec 11 12:09:12 lap1-wccf kernel: [<c0140591>] sys_stat64 [kernel] 0x11 (0xddeadf70))
Dec 11 12:09:12 lap1-wccf kernel: [filp_close+77/96] filp_close [kernel] 0x4d (0xddeadf90))
Dec 11 12:09:12 lap1-wccf kernel: [<c013952d>] filp_close [kernel] 0x4d (0xddeadf90))
Dec 11 12:09:12 lap1-wccf kernel: [do_page_fault+0/1115] do_page_fault [kernel] 0x0 (0xddeadfb0))
Dec 11 12:09:12 lap1-wccf kernel: [<c0114410>] do_page_fault [kernel] 0x0 (0xddeadfb0))
Dec 11 12:09:12 lap1-wccf kernel: [error_code+52/60] error_code [kernel] 0x34 (0xddeadfb8))
Dec 11 12:09:12 lap1-wccf kernel: [<c0108a34>] error_code [kernel] 0x34 (0xddeadfb8))
Dec 11 12:09:12 lap1-wccf kernel: [system_call+51/56] system_call [kernel] 0x33 (0xddeadfc0))
Dec 11 12:09:12 lap1-wccf kernel: [<c0108943>] system_call [kernel] 0x33 (0xddeadfc0))
Dec 11 12:09:12 lap1-wccf kernel: 
Dec 11 12:09:12 lap1-wccf kernel: 
Dec 11 12:09:12 lap1-wccf kernel: Code: 8b 44 81 18 0f af f2 89 41 14 01 ee 40 75 16 8b 41 04 8b 11 
Dec 11 12:24:17 lap1-wccf shutdown: shutting down for system reboot

**** Text from Console

/etc/athena/config_afs: line 22:  829 Segmentation fault    cp /afs/athena.mit.edu/service/CellServDB ${VICEDIR}/Ctmp
Updating setuid cell information
Dec 11 12:09:12 lap1-wccf kernel: Unable to handle kernel paging request at virtual address 85f3001c
Dec 11 12:09:12 lap1-wccf kernel: *pde = 00000000

-- 
Tom Cavin                                  Phone:  (617) 258 - 7806
Computer Operations Manager                Email:     cavin@mit.edu
MIT - Whitaker College Computer Facility          or tec@ai.mit.edu

home help back first fref pref prev next nref lref last post