[615] in arla-drinkers


Re: broken cache problem for arla

daemon@ATHENA.MIT.EDU (Takashi Ichihara)
Fri Feb 19 06:37:46 1999

From owner-arla-drinkers@stacken.kth.se Fri Feb 19 11:37:45 1999
Return-Path: <owner-arla-drinkers@stacken.kth.se>
Delivered-To: arla-drinkers-mtg@bloom-picayune.mit.edu
Received: (qmail 3412 invoked from network); 19 Feb 1999 11:37:44 -0000
Received: from unknown (HELO sundance.stacken.kth.se) (130.237.234.41)
  by bloom-picayune.mit.edu with SMTP; 19 Feb 1999 11:37:44 -0000
Received: (from majordom@localhost)
	by sundance.stacken.kth.se (8.8.8/8.8.8) id MAA08058
	for arla-drinkers-list; Fri, 19 Feb 1999 12:30:06 +0100 (MET)
Received: from rarfaxp.riken.go.jp (rarfaxp.riken.go.jp [134.160.227.2])
	by sundance.stacken.kth.se (8.8.8/8.8.8) with ESMTP id MAA08044;
	Fri, 19 Feb 1999 12:29:55 +0100 (MET)
Received: from rarfaxp.riken.go.jp (localhost [127.0.0.1])
	by rarfaxp.riken.go.jp (8.9.2/8.9.2) with ESMTP id UAA04150;
	Fri, 19 Feb 1999 20:29:53 +0900 (JST)
Message-Id: <199902191129.UAA04150@rarfaxp.riken.go.jp>
To: Love <lha@stacken.kth.se>
cc: Takashi Ichihara <ichihara@rarfaxp.riken.go.jp>,
        arla-drinkers@stacken.kth.se
Subject: Re: broken cache problem for arla 
In-reply-to: Your message of "19 Feb 1999 11:22:24 +0100."
             <amr9rmn15r.fsf@hummel.e.kth.se> 
Date: Fri, 19 Feb 1999 20:29:53 +0900
From: Takashi Ichihara <ichihara@rarfaxp.riken.go.jp>
Sender: owner-arla-drinkers@stacken.kth.se
Precedence: bulk

>Takashi Ichihara <ichihara@rarfaxp.riken.go.jp> writes:
>
>>   Following problems occurs frequently around our environment: When 
>> a linux system "crash" with panic, some parts of cache files of the 
>> arla seems to be destroyed.  After this crash, arla does not work 
>> correctly even rebooting the system. Access to the specific files on 
>> /afs tree, which are related to the broken cache files, has problem.  
>
>
>Starting arla with the -z flag should solve the problem, as of 0.21
>the linux startarla does that by default. That is a better way to fix
>the problem.
>
>You have the -z flag in your startarla ?
>
>Love

  Yes.  We are using arla 0.21, and I have confirmed that startarla in 
arla 0.21 starts arlad with the -z flag. But we still have this 
problem.  We have a small farm of 8 Linux machines (kernel 2.2.0, arla 
0.21), and arla on 3 of the nodes stopped working because of this 
broken cache problem. Manually cleaning the cache directory fixed 
the problem.

    To check whether the -z flag works correctly, I rebooted the 
system and looked at the arla cache directory. The system was booted at
20:23 (arlad was started with the -z flag just after 20:23). Even so, 
the cache directory contains many files which were created before 
arlad (and the system) was started.

[system booting started at 20:23:15]

  taken from /var/log/messages
  Feb 19 20:23:15 p6spin kernel: Linux version 2.2.0 (root@xx.riken.go.jp)
 (gcc version 2.7.2.3) #1 Wed Jan 27 10:36:09 JST 1999

[arlad is started with the -z flag]
  # ps ax | grep ar
    330  ?  S    0:00 /usr/arla/bin/arlad -z

[There are some old cache files in arla cache directory]

  # ls -l /usr/arla/cache

  -rw-------   1 root     root         2048 Feb 19 20:23 0001
  -rw-------   1 root     root           11 Feb 19 20:23 0002
  -rw-------   1 root     root         2048 Feb 19 20:23 0003
  -rw-------   1 root     root           15 Feb 19 20:23 0004
  -rw-------   1 root     root         2048 Feb 19 20:23 0005
     :
  -rw-------   1 root     root            0 Feb 19 20:24 0F9F
  -rw-------   1 root     root            0 Feb 19 20:24 0FA0
  -rw-------   1 root     root         1024 Feb 19 19:49 @001    
  -rw-------   1 root     root         1024 Feb 19 19:49 @003
  -rw-------   1 root     root         1024 Feb 19 19:49 @00D
  -rw-------   1 root     root         1024 Feb 19 19:49 @010
  -rw-------   1 root     root         1024 Feb 19 19:49 @027
  -rw-------   1 root     root        15308 Feb 19 19:50 fcache
  -rw-------   1 root     root            0 Feb 19 16:54 log
  -rw-------   1 root     root         4256 Feb 19 19:50 volcache
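
The same check can be done without reading the listing by eye. The following is only a minimal sketch in Python: the helper names stale_cache_files and boot_time are made up for illustration and are not part of arla, and reading /proc/uptime to estimate the boot time is an assumption that holds on Linux only. It lists the files in a cache directory whose modification time is earlier than a given start time.

```python
import os
import time


def stale_cache_files(cache_dir, started_at):
    """Return the names of regular files in cache_dir that were last
    modified before started_at (a Unix timestamp in seconds).

    Hypothetical helper for illustration; not part of arla.
    """
    stale = []
    for name in sorted(os.listdir(cache_dir)):
        path = os.path.join(cache_dir, name)
        # Compare modification time, which is what "ls -l" displays.
        if os.path.isfile(path) and os.path.getmtime(path) < started_at:
            stale.append(name)
    return stale


def boot_time():
    """Approximate the boot time on Linux (assumption: /proc/uptime
    exists) as the current time minus the reported uptime."""
    with open("/proc/uptime") as f:
        uptime_seconds = float(f.read().split()[0])
    return time.time() - uptime_seconds
```

Calling stale_cache_files("/usr/arla/cache", boot_time()) as root would report any file that survived the restart. Using the boot time rather than the exact arlad start time is a simplification; it is enough here because arlad is started just after boot.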


   Therefore starting arlad with the -z flag (the default in the arla 
0.21 startarla) does not clean the cache files completely, at least 
in our environment.
					Takashi Ichihara

