[774] in arla-drinkers
severe cache coherency problem
daemon@ATHENA.MIT.EDU (Mattias Engdegård)
Wed Apr 21 21:56:16 1999
Date: Thu, 22 Apr 1999 01:05:52 +0200 (MET DST)
Message-Id: <199904212305.BAA15016@orion.nada.kth.se>
From: "Mattias Engdegård" <f91-men@nada.kth.se>
To: arla-drinkers@stacken.kth.se
Subject: severe cache coherency problem
I'm creating two files, a and b, on host X (Solaris, Transarc AFS client).
They are visible and readable from host Y (Linux 2.2.6, arla 0.23, libc 5.4.46).
When X removes file a, arlad on Y says:
Thu Apr 22 00:45:31 1999: arlad: callback (130.237.42.231)
Thu Apr 22 00:45:31 1999: arlad: -1: (536880467, 23, 517742)
Thu Apr 22 00:45:31 1999: arlad: callback for non-existing file (-1, 536880467, 23, 517742)
but ls on Y still lists both files. When X then removes file b, arlad reports
nothing (running with debug=almost-all).
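For comparing these callback lines across runs, here is a small Python sketch that pulls the four numbers out of the "callback for non-existing file" message. Reading them as (cell, volume, vnode, unique) is an assumption based on the usual AFS file-identifier layout; the log itself does not label the fields. The name `parse_callback` and the regular expression are hypothetical helpers, not part of arla.

```python
import re

# Matches the tuple at the end of the arlad message quoted above.
# Treating the fields as (cell, volume, vnode, unique) is an assumption.
CALLBACK_RE = re.compile(
    r"callback for non-existing file \((-?\d+), (\d+), (\d+), (\d+)\)"
)


def parse_callback(line):
    """Return the four integers from a callback log line, or None."""
    m = CALLBACK_RE.search(line)
    if m is None:
        return None
    return tuple(int(g) for g in m.groups())


line = ("Thu Apr 22 00:45:31 1999: arlad: callback for non-existing file "
        "(-1, 536880467, 23, 517742)")
print(parse_callback(line))
```

Running this over a full arlad log would show whether the same identifier comes back for every removal or only for the first one.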
Trying to open the files on Y produces:
Thu Apr 22 00:46:46 1999: arlad: worker 0: processing
Thu Apr 22 00:46:46 1999: arlad: Rec message: opcode = 12 (open), size = 40
Thu Apr 22 00:46:46 1999: arlad: read_data
Thu Apr 22 00:46:46 1999: arlad: Error reading length: Network dropped connection because of reset
Thu Apr 22 00:46:46 1999: arlad: multi-sending wakeup: seq = 28, error = 102
Thu Apr 22 00:46:46 1999: arlad: worker 0: done
Thu Apr 22 00:46:46 1999: arlad: worker 0 waiting
Thu Apr 22 00:46:46 1999: arlad: worker 0: processing
Thu Apr 22 00:46:46 1999: arlad: Rec message: opcode = 10 (inactivenode), size = 32
Thu Apr 22 00:46:46 1999: arlad: worker 0: done
Thu Apr 22 00:46:46 1999: arlad: worker 0 waiting
cat: b: Network dropped connection because of reset
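The `error = 102` in the wakeup line and the message printed by cat appear to be the same error: on Linux/x86, errno 102 is ENETRESET. A minimal Python sketch for checking that mapping on a given machine follows. The numeric value is platform-specific, so the comparison with 102 is only expected to hold on Linux, and the message text depends on the C library version.

```python
import errno
import os

# Look up the symbolic name and local message for ENETRESET.
code = errno.ENETRESET
print(errno.errorcode[code], code, os.strerror(code))

# Only expected to be True on Linux (assumption: x86-style errno numbering).
print("matches arlad's error = 102:", code == 102)
```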
When I (needlessly) re-authenticate with kauth, arla seems to be re-synced with
reality again.
What can I do to help debug this mess?
(I don't subscribe to the list, so please CC any answers back to me.)