[8318] in athena10
[Debathena] #1053: cluster reboots sometimes hang (2)
daemon@ATHENA.MIT.EDU (Debathena Trac)
Tue Aug 23 12:28:41 2011
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
From: "Debathena Trac" <debathena@MIT.EDU>
Cc: debathena@mit.edu
To: kaduk@mit.edu
Date: Tue, 23 Aug 2011 16:28:34 -0000
Reply-To:
Message-ID: <042.bb3249ccc6e2b723b9cca8a1f6ca8772@mit.edu>
Content-Transfer-Encoding: 8bit
#1053: cluster reboots sometimes hang (2)
--------------------+-------------------------------------------------------
 Reporter:  kaduk   |       Owner:
     Type:  defect  |      Status:  new
 Priority:  normal  |   Milestone:  The Distant Future
Component:  --      |    Keywords:
 See_also:          |
--------------------+-------------------------------------------------------
Sometimes, when a cluster machine decides it wants to reboot as a user is
logging out, it fails to actually reboot, and ends up not at the Ubuntu
splash screen, but at the text console with a bunch of things like:
{{{
INFO: task [task]:NN blocked for more than 120 seconds
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message
}}}
where [task] is a name such as fsnotify_mark, dbus-daemon, gdm-binar, or
polkitd, and NN is perhaps the corresponding pid?
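For triage, a minimal sketch (not part of the original report) of pulling the task name and number out of such console lines, so that hangs can be tallied per task across machines. It assumes the lines have the shape shown above, `INFO: task <name>:<NN> blocked for more than <secs> seconds`, and it treats NN as a pid. The function names `parse_hung_tasks` and `count_by_task` are hypothetical.

```python
import re
from collections import Counter

# Assumed message shape, taken from the console excerpt in this ticket:
#   INFO: task <name>:<NN> blocked for more than <secs> seconds
# <NN> is treated as the pid here; that is an assumption of this sketch.
HUNG_TASK_RE = re.compile(
    r"INFO: task (?P<name>.+):(?P<pid>\d+) "
    r"blocked for more than (?P<secs>\d+) seconds"
)


def parse_hung_tasks(lines):
    """Return a list of (name, pid, secs) tuples, one per matching line."""
    results = []
    for line in lines:
        match = HUNG_TASK_RE.search(line)
        if match:
            results.append(
                (match.group("name"),
                 int(match.group("pid")),
                 int(match.group("secs")))
            )
    return results


def count_by_task(lines):
    """Count how many hung-task reports each task name produced."""
    return Counter(name for name, _pid, _secs in parse_hung_tasks(lines))
```

Feeding it a transcribed or captured console log (any iterable of lines) and printing `count_by_task(...)` would show which tasks block most often. Lines that do not match, such as the `hung_task_timeout_secs` hint, are skipped.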
It is possible to soft-reboot machines in this state, though I don't
remember whether ctrl-alt-del works or sysrq-b was needed.
--
Ticket URL: <http://debathena.mit.edu/trac/ticket/1053>
Debathena <http://debathena.mit.edu/>
MIT Debathena Project