[225] in arla-drinkers


Linux/SMP/arla followup

daemon@ATHENA.MIT.EDU (Dave Morrison)
Fri Aug 21 17:42:12 1998

From owner-arla-drinkers@stacken.kth.se Fri Aug 21 21:42:11 1998
Return-Path: <owner-arla-drinkers@stacken.kth.se>
Delivered-To: arla-drinkers-mtg@bloom-picayune.mit.edu
Received: (qmail 24582 invoked from network); 21 Aug 1998 21:42:10 -0000
Received: from unknown (HELO sundance.stacken.kth.se) (130.237.234.41)
  by bloom-picayune.mit.edu with SMTP; 21 Aug 1998 21:42:10 -0000
Received: (from majordom@localhost)
	by sundance.stacken.kth.se (8.8.8/8.8.8) id XAA06202
	for arla-drinkers-list; Fri, 21 Aug 1998 23:37:18 +0200 (MET DST)
Received: from bnl.gov (bnl.gov [130.199.128.163])
	by sundance.stacken.kth.se (8.8.8/8.8.8) with ESMTP id XAA06197
	for <arla-drinkers@stacken.kth.se>; Fri, 21 Aug 1998 23:37:13 +0200 (MET DST)
Received: from bnl.gov (morrison.rhic.bnl.gov [130.199.80.17])
	by bnl.gov (8.8.8/8.8.8) with ESMTP id RAA26659;
	Fri, 21 Aug 1998 17:37:12 -0400 (EDT)
Message-ID: <35DDE877.F9F82187@bnl.gov>
Date: Fri, 21 Aug 1998 17:36:55 -0400
From: Dave Morrison <dave@bnl.gov>
X-Mailer: Mozilla 4.5b1 [en] (X11; I; Linux 2.1.116 i686)
X-Accept-Language: en
MIME-Version: 1.0
To: arla-drinkers <arla-drinkers@stacken.kth.se>
Subject: Linux/SMP/arla followup
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Sender: owner-arla-drinkers@stacken.kth.se
Precedence: bulk

Dear arla-drinkers,

I think I've finally narrowed things down to a setup that reproducibly hangs
a Linux SMP box running arla.  I'm running Linux 2.1.117 and arla 0.9 on a dual
Dell PII, which is also an NFS client.  If I tar a directory tree from AFS to
the NFS-mounted disk and interrupt the tar, the machine hangs and has to be
rebooted.  The same procedure seems to work without problems on a UP machine.
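
For anyone who wants to script the procedure, here is a rough sketch in
Python.  It is only an illustration: the exact tar invocation is not given
above, so the command line, the paths, and the helper names (tar_command,
run_and_interrupt) are assumptions, as is the delay before the interrupt.

```python
import signal
import subprocess
import time


def tar_command(afs_dir, nfs_tarfile):
    """Build a tar invocation that archives afs_dir into nfs_tarfile.

    Both arguments are placeholders; the real paths are not known.
    """
    return ["tar", "-cf", nfs_tarfile, "-C", afs_dir, "."]


def run_and_interrupt(argv, delay_seconds):
    """Start argv, send SIGINT after delay_seconds, return the exit status."""
    proc = subprocess.Popen(argv)
    time.sleep(delay_seconds)
    if proc.poll() is None:
        # Still running: interrupt it the way Ctrl-C would.
        proc.send_signal(signal.SIGINT)
    return proc.wait()
```

A call such as run_and_interrupt(tar_command("/afs/some/tree",
"/nfs/scratch/tree.tar"), 5) would then stand in for the manual interrupt;
both paths are made up.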

The last entries in the xfs log look like:

xfs_message_rpc opcode this_process->error_or_size = -4
xfs_message_rpc opcode ((xfs_message_wakeup*)(this_process->message))->error = 1349
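
If the -4 in the first log line is a negated Linux errno, which is the usual
kernel convention but an assumption about the xfs module here, it can be
looked up as in the small sketch below.  On Linux, errno 4 is EINTR, which
would fit an interrupted tar.  The helper name describe_errno is made up,
and the sketch makes no attempt to interpret the 1349 value.

```python
import errno
import os


def describe_errno(value):
    """Map a (possibly negated) errno value to its symbolic name and message."""
    code = abs(value)
    name = errno.errorcode.get(code, "unknown")
    return name, os.strerror(code)


# Look up the value from the first log line.
name, message = describe_errno(-4)
print(name, "-", message)
```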

I repeated the experiment after installing the latest Alan Cox NFS-related
patches I'm aware of, but the behavior remained the same.  Any help would be
appreciated.

Dave

-- 
David Morrison  Brookhaven National Laboratory  phone: 516-344-5840
                Physics Department, Bldg 510 C    fax: 516-344-3253
		          Upton, NY 11973-5000  email: dave@bnl.gov
