[issue1729] Hammer REDO recovery panic
Matthew Dillon
dillon at apollo.backplane.com
Tue Apr 13 21:30:31 PDT 2010
:New submission from Venkatesh Srinivas <me at acm.jhu.edu>:
:
:Restarting my system after an earlier panic from running fsstress on HAMMER,
:fsync_mode=2. Running DragonFly 2.6.0-gfa1ae. Will try to have a dump tomorrow.
:
:Mounting root from hammer:serno/G3H3DSKC.s1d
:tryroot serno/G3H2DSKC.s1d
:HAMMER(ROOT) recovery check seqno=023f0d60
:HAMMER(ROOT) recovery range 3000000003838438-3000000003fee500
:HAMMER(ROOT) recovery nexto 3000000003fee500 endseqno=023f6204
:HAMMER(ROOT) recovery undo 3000000003838438-3000000003fee500 (8085704 bytes)(RW)
:HAMMER(ROOT) Continuing recovery
:HAMMER(ROOT) Continuing recovery
:HAMMER(ROOT) Continuing recovery
:HAMMER(ROOT) Continuing recovery
:HAMMER(ROOT) Found REDO_SYNC 30000000025c4a38
:HAMMER(ROOT) recovery complete
:HAMMER(ROOT) recovery redo 3000000003838438-3000000003fee500 (8085704 bytes)(RW)
:HAMMER(ROOT) Find extended redo 30000000025c4a38, 19347968 extbytes
:HAMMER(ROOT) Find extended redo failed 34, unable to run REDO
:HAMMER(ROOT) End redo recovery
:panic: hammer_ref_interlock_true: bad lock 0xc758a688 00000001
:
:Trace beginning at frame 0xc05d4af4
:panic(c05d4b18,50000001,c758a680,c02ed226,c05d4b24) at panic+0x8c
:...

Ok, I found the bug related to the panic. I have committed a
fix to head and will MFC it to 2.6.x.

The REDO error itself is a separate problem. That error is not supposed
to happen. Please run the command 'hammer -f <device> show-undo' and
put the output on your leaf account. How large is the HAMMER filesystem?
(Hopefully the data hasn't been lost since that time; I'm crossing my
fingers.)

You may have issues booting. You need to boot with a fixed kernel to
get past the panic, and that may require booting from a USB stick or
something.

-Matt
Matthew Dillon
<dillon at backplane.com>
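
A quick sanity check on the quoted recovery log: the byte count printed
on the "recovery undo" line should presumably equal the difference
between the two hex offsets on the "recovery range" line. The sketch
below checks that arithmetic. It assumes the range does not wrap around
the end of the UNDO FIFO, and span_bytes is a hypothetical helper name,
not anything from the HAMMER sources.

```python
def span_bytes(range_text: str) -> int:
    """Return end - start for a 'START-END' pair of hex offsets.

    Assumption: the range is linear (no wrap-around), so a plain
    subtraction is enough.
    """
    start_hex, end_hex = range_text.split("-")
    return int(end_hex, 16) - int(start_hex, 16)


# Offsets copied from the "recovery range" line of the log.
recovery_range = "3000000003838438-3000000003fee500"

print(span_bytes(recovery_range))  # prints 8085704
```

The result matches the 8085704 bytes reported for the undo pass, which
suggests the 16-digit offsets on the "recovery range" line are the ones
to trust when reading the rest of the log.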