[issue1298] system hang due to hammer problem?
Matthew Dillon
dillon at apollo.backplane.com
Wed Feb 25 09:28:43 PST 2009
:New submission from Patrick Georgi <patrick at georgi-clan.de>:
:
:This morning, my server was unresponsive (no network traffic, no console
:response beyond switching terminals and entering ddb. Regular keypresses,
:eg. at the login prompt weren't registered).
:The last messages in the log were
:
:Feb 25 03:05:23 t-stueck kernel: HAMMER(backup): Critical error
:inode=24949148432 while syncing inode
:Feb 25 03:05:23 t-stueck kernel: HAMMER(backup): Forcing read-only mode
:Feb 25 03:05:29 t-stueck kernel: pid 532 (hammer), uid 0: exited on signal 11
:(core dumped)
:
:The HAMMER(backup) lines were also on the console.
:
:I don't have a crash dump, and the hammer coredump isn't very useful either
:(no binary with symbols around).
:The system ran a "2.3.0-development" version from ~1 week ago or so. I think no
:hammer related changes appeared since then.
:
:The problem seems to have happened during a "hammer prune", as started by the
:nightly "hammer cleanup", as the daily run mail reported:
:
:cleanup /backup/root/home - handle PFS #2 using /backup/pfs-snapshots/home
: snapshots - run
: prune - Segmentation fault (core dumped)
:
:The security mail reported that PFS to be read-only, but it's not critical (it's
:a pfs-slave to /home, on a different hammer partition than / and /home)
:
:----------
:messages: 6248
:nosy: pgeorgi
:priority: bug
:status: unread
:title: system hang due to hammer problem?
I think the seg-fault in the utility is a different issue, and not
the cause. A critical error can only occur if either the I/O subsystem
reports an error or the filesystem fills up unexpectedly.
If your HAMMER filesystem is nowhere near full then it was probably
an I/O error. We can't really tell without more of the console history
and/or a core dump.
We'll keep an eye out for other related bug reports.
What kind of storage system is connected to the machine? Internal
hard drive? SATA or PATA? You also might as well post the dmesg
output (post reboot), just so the information is recorded in the
bug report.
-Matt
Matthew Dillon
<dillon at backplane.com>