[476] in athena10

home help back first fref pref prev next nref lref last post

Simulating AFS outages with pyhesiodfs

daemon@ATHENA.MIT.EDU (Jonathan D Reed)
Wed Sep 3 14:29:10 2008

Date: Wed, 3 Sep 2008 14:28:24 -0400 (EDT)
From: Jonathan D Reed <jdreed@MIT.EDU>
To: athena10@MIT.EDU
Message-ID: <Pine.LNX.4.64L.0809031401100.32734@infinite-loop.mit.edu>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed

pyhesiodfs seems to behave slightly better in an AFS outage than afuse 
did.   Here is what I encountered:

-If the locker had already been accessed (say, by a successful login), 
then the Failsafe Terminal option will work, but attempts to access 
other directories will fail with "Timed out" errors.  Presumably this is 
due to AFS caching, and might eventually fail if enough time elapses.

-Failsafe GNOME just plain doesn't work (it attempts to create the 
Nautilus directories, presumably because it can't read them, and fails)

-If the locker has not already been accessed (say, after a reboot), it 
hangs at the login screen, and eventually asks the user if they want to 
log in with / as their home directory.  If you say yes, it eventually 
presents the GNOME background, but just sits there (I gave up after 10 
minutes).   If you attempt to Ctrl-Alt-Bksp out of the situation, X does 
not respawn.  ps shows that X and gdmgreeter are defunct processes, but 
nothing I tried was able to cause them to completely die or to respawn X.

However, pyhesiodfs seems to recover from server outages much faster than 
afuse did.

The situation is still pretty much unusuable, though.  Rather than try and 
let Ubuntu figure out what's up with the user's homedir, is it worth 
adding code to the login sequence that explicitly checks if a user's 
homedir is available and punts if it isn't?

home help back first fref pref prev next nref lref last post