[226] in arla-drinkers

linux/SMP/arla difficulties

daemon@ATHENA.MIT.EDU (Dave Morrison)
Sun Aug 23 21:57:09 1998

From owner-arla-drinkers@stacken.kth.se Mon Aug 24 01:57:08 1998
Return-Path: <owner-arla-drinkers@stacken.kth.se>
Delivered-To: arla-drinkers-mtg@bloom-picayune.mit.edu
Received: (qmail 29013 invoked from network); 24 Aug 1998 01:57:07 -0000
Received: from unknown (HELO sundance.stacken.kth.se) (130.237.234.41)
  by bloom-picayune.mit.edu with SMTP; 24 Aug 1998 01:57:07 -0000
Received: (from majordom@localhost)
	by sundance.stacken.kth.se (8.8.8/8.8.8) id DAA04011
	for arla-drinkers-list; Mon, 24 Aug 1998 03:51:23 +0200 (MET DST)
Received: from assaris.pdc.kth.se (assaris.pdc.kth.se [193.10.159.45])
	by sundance.stacken.kth.se (8.8.8/8.8.8) with ESMTP id DAA04007
	for <arla-drinkers@stacken.kth.se>; Mon, 24 Aug 1998 03:51:20 +0200 (MET DST)
Received: (from assar@localhost) by assaris.pdc.kth.se (8.8.5/8.7.3) id DAA04343; Mon, 24 Aug 1998 03:53:28 +0200 (MET DST)
Original-Sender: dave@bnl.gov
Message-ID: <35DC5C49.79E6D3C0@bnl.gov>
Date: Thu, 20 Aug 1998 13:26:33 -0400
From: Dave Morrison <dave@bnl.gov>
X-Mailer: Mozilla 4.5b1 [en] (X11; I; Linux 2.1.116 i686)
X-Accept-Language: en
MIME-Version: 1.0
To: arla-drinkers <arla-drinkers@stacken.kth.se>
Subject: linux/SMP/arla difficulties
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Lines: 39
Sender: owner-arla-drinkers@stacken.kth.se
Precedence: bulk

Dear arla-drinkers,

We have a bunch of Dell dual PII boxes that are running linux-2.1.116 (yes, I
know, yesterday's technology) and arla-0.9.  Well, they're sort of running -
they're a bit unstable and the instability seems to be correlated with arla's
presence.  Here are a few symptoms that may or may not be related:

o once arla is up and running, the system crashes after anywhere from a few
minutes to a few hours - no obvious correlation with AFS activity. 

o before the machine finally hangs, tons of messages of the form "sending
wakeup: ..." appear (these are apparently generated in xfs).

o during this same time, arla goes nonlinear and chews up all the CPU

o when arlad is first started (using the startarla script), the sysname can't be
changed successfully for at least a minute.

o this same machine appears to be stable if arla is not started

Other oddments:

o arla was built with -D__SMP__ (same behavior when built without this)

o the same kernel and arla versions run happily together on a bunch of quad
PPro boxes we also have

Is anyone else experiencing similar problems?  Does anyone have suggestions
that'd help with this situation?  Is there some additional info that I could
gather that could help diagnose things?  

Thanks,
Dave

-- 
David Morrison  Brookhaven National Laboratory  phone: 516-344-5840
                Physics Department, Bldg 510 C    fax: 516-344-3253
                          Upton, NY 11973-5000  email: dave@bnl.gov

