[17816] in Athena Bugs

home help back first fref pref prev next nref lref last post

SGIs unhappy when zip disk forcibly removed

daemon@ATHENA.MIT.EDU (Emil Sit)
Mon May 1 22:11:55 2000

Message-Id: <200005020211.WAA28739@department-of-alchemy.mit.edu>
To: bugs@MIT.EDU
cc: dtpaul@MIT.EDU
Date: Mon, 01 May 2000 22:11:44 -0400
From: Emil Sit <sit@MIT.EDU>

System name:		w20-575-62.mit.edu
Type and version:	SGI....

What were you trying to do?
	Debug a user problem sending mail

What's wrong:
    The machine wouldn't run it's mailq. On examination of the syslog, the load
    was too high, verified by athinfo:

 10:05pm  up 62 days,  4:45,  3 users,  load average: 62.00, 62.00, 62.00

    Based on the output of ps (in ~dtpaul/Public/ps.out for the moment) and:

athena% grep zip /etc/mtab
/dev/rdsk/dks0d5vol /zip dos rw,partition=4,nosuid 0 0

    I believe someone forcibly ejected a zip disk and the machine still
    thinks its mounted. Then when the find cron job runs to
    clean out dead.letter on local disks, it's getting stuck
    in disk wait and driving up the load, thus causing sendmail to
    decide the load is too high to warrant running the queue.

    After asking on -c consult, we decided rebooting the machine
    would be the simplest fix.

What should have happened:
    The machine should've dealt properly.
    
    One solution might be to specifically exclude /floppy and /zip from
    the find jobs. Or just punt the "remove core and dead.letter" cron job.

    This might be something worth considering on Suns as well?
--
Emil Sit / Bronx Science '95, MIT '99 -- SIPB, Athena Consulting, LCS/PDOS

home help back first fref pref prev next nref lref last post