failing disk, or not?
George Georgalis
george at galis.org
Mon Mar 28 13:01:41 PST 2005
I'm seeing some disk errors in dfly that I cannot reproduce with other
OS checking the partition:
ad4: UDMA ICRC error writing fsbn 249842603 of 110387664-110387679 (ad4 bn 249842603; cn 15551 tn 250 sn 38) retrying
ad4: UDMA ICRC error writing fsbn 249842603 of 110387664-110387679 (ad4 bn 249842603; cn 15551 tn 250 sn 38) retrying
ad4: UDMA ICRC error writing fsbn 249842603 of 110387664-110387679 (ad4 bn 249842603; cn 15551 tn 250 sn 38) retrying
ad4: UDMA ICRC error writing fsbn 488278315 of 229605520-229605535 (ad4 bn 488278315; cn 30393 tn 234 sn 28) retrying
ad4: UDMA ICRC error writing fsbn 488278315 of 229605520-229605535 (ad4 bn 488278315; cn 30393 tn 234 sn 28) retrying
ad4: UDMA ICRC error writing fsbn 488278443 of 229605584-229605599 (ad4 bn 488278443; cn 30393 tn 236 sn 30) retrying
This happened while running dvdbackup and I reproduced it running
a dd read from the partition. However, after several attempts I cannot
reproduce it from Linux badblocks (read or non-distructive write) check
or linux dd read from the partition. I know failures can be intermetint
But not getting any errors at all yet, from Linux, seems odd at this
point, if the disk is really failing.
# df -h
Filesystem Size Used Avail Capacity Mounted on
/dev/ad4s3a 248M 122M 106M 54% /
/dev/ad4s3d 248M 1.3M 227M 1% /var
/dev/ad4s3e 124G 94G 20G 83% /usr
procfs 4.0K 4.0K 0B 100% /proc
A bit of history, I did have a system lockup -- I could switch virtural
terminals but no keyboard input was accepted -- a week or two ago,
didn't file bug because I was half-hazard experimenting (in user space)
and couldn't explain well enough, at the time all I was doing, now I
don't even remember. A fsck was required, and with a 95Gb /usr, that
took quite a while. (welcome comments on why softupdates didn't help
here), also the /usr partition was near or over 100% capacity, but I
never got disk full errors, ie didn't *completely* run out of space.
At this point can I be sure my disk is failing or could there be some
driver instability? The full dmesg is below.
Don't see it in dmesg, but ad4 is a 200Gb Seagate drive, on a nvidia
sata controler. Disk Product Number ST3200822AS, Part Number 9W2854-301
Thanks,
// George
Copyright (c) 2003, 2004, 2005 The DragonFly Project.
Copyright (c) 1992-2003 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
DragonFly 1.1-Stable #2: Tue Mar 22 04:04:13 GMT 2005
root at .:/usr/obj/usr/src/sys/MYKERNEL
TSC clock: 2210895908 Hz, i8254 clock: 1193256 Hz
CPU: AMD Athlon(tm) 64 Processor 3400+ (2210.77-MHz 686-class CPU)
Origin = "AuthenticAMD" Id = 0xf4a Stepping = 10
Features=0x78bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2>
AMD Features=0xe0500000<<b20>,AMIE,<b29>,DSP,3DNow!>
real memory = 1073676288 (1048512K bytes)
avail memory = 1033658368 (1009432K bytes)
Preloaded elf kernel "/kernel" at 0xc066e000.
Preloaded elf module "/modules/acpi.ko" at 0xc066e260.
Pentium Pro MTRR support enabled
md0: Malloc disk
pcibios: BIOS version 2.10
Using $PIR table, 5 entries at 0xc00fdf10
npx0: <math processor> on motherboard
npx0: INT 16 interface
Using XMM optimized bcopy/copyin/copyout
acpi0: <XPC AWRDACPI> on motherboard
installed MI handler for int 9
acpi0: Power Button (fixed)
cpu0: <ACPI CPU> on acpi0
acpi_tz0: <Thermal Zone> on acpi0
acpi_button0: <Power Button> on acpi0
pcib0: <Host to PCI bridge> at pcibus 0 on motherboard
pci0: <PCI bus> on pcib0
agp0: <NVIDIA Generic AGP Controller> mem 0xe0000000-0xe3ffffff at device 0.0 on pci0
agp0: Unable to find NVIDIA Memory Controller 1.
device_probe_and_attach: agp0 attach returned 19
isab0: <PCI to ISA bridge (vendor=10de device=00e0)> at device 1.0 on pci0
isa0: <ISA bus> on isab0
pci0: <unknown card> (vendor=0x10de, dev=0x00e4) at 1.1 irq 10
ohci0: <OHCI (generic) USB controller> mem 0xe8002000-0xe8002fff irq 10 at device 2.0 on pci0
installed MI handler for int 10
usb0: OHCI version 1.0, legacy support
usb0: SMM does not respond, resetting
usb0: <OHCI (generic) USB controller> on ohci0
usb0: USB revision 1.0
uhub0: nVidia OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 4 ports with 4 removable, self powered
ums0: Logitech USB Receiver, rev 1.10/9.10, addr 2, iclass 3/1
ums0: 5 buttons and Z dir.
ohci1: <OHCI (generic) USB controller> mem 0xe8003000-0xe8003fff irq 10 at device 2.1 on pci0
usb1: OHCI version 1.0, legacy support
usb1: SMM does not respond, resetting
usb1: <OHCI (generic) USB controller> on ohci1
usb1: USB revision 1.0
uhub1: nVidia OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 4 ports with 4 removable, self powered
umass0: USB2.0 CardReader, rev 2.00/91.38, addr 2
pci0: <USB controller> at 2.2 irq 10
pci0: <PCI to Other bridge (vendor=10de device=00df)> at 5.0 irq 10
pci0: <unknown card> (vendor=0x10de, dev=0x00ea) at 6.0 irq 3
atapci0: <Generic PCI ATA controller> port 0xf000-0xf00f at device 8.0 on pci0
ata0: at 0x1f0 irq 14 on atapci0
installed MI handler for int 14
ata1: at 0x170 irq 15 on atapci0
installed MI handler for int 15
atapci1: <Generic PCI ATA controller> port 0xec00-0xec7f,0xeb00-0xeb0f,0xb70-0xb73,0x970-0x977,0xbf0-0xbf3,0x9f0-0x9f7 irq 11 at device 10.0 on pci0
ata2: at 0x9f0 on atapci1
installed MI handler for int 11
ata3: at 0x970 on atapci1
pcib1: <PCI to PCI bridge (vendor=10de device=00e2)> at device 11.0 on pci0
pci1: <PCI bus> on pcib1
pci1: <NVidia model 0110 graphics accelerator> at 0.0 irq 5
pcib2: <PCI to PCI bridge (vendor=10de device=00ed)> at device 14.0 on pci0
pci2: <PCI bus> on pcib2
fxp0: <Intel 82557 Pro/100 Ethernet> port 0xd000-0xd01f mem 0xe6000000-0xe60fffff,0xe7000000-0xe7000fff irq 12 at device 6.0 on pci2
installed MI handler for int 12
miibus0: <MII bus> on fxp0
inphy0: <i82555 10/100 media interface> on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: MAC address: 00:a0:c9:d1:b5:5e
fwohci0: <VIA Fire II (VT6306)> port 0xd100-0xd17f mem 0xe6200000-0xe62007ff irq 11 at device 7.0 on pci2
fwohci0: OHCI version 1.0 (ROM=1)
fwohci0: No. of Isochronous channel is 4.
fwohci0: EUI64 00:30:1b:b7:00:00:15:a8
fwohci0: Phy 1394a available S400, 3 ports.
fwohci0: Link S400, max_rec 2048 bytes.
firewire0: <IEEE1394(FireWire) bus> on fwohci0
fwe0: <Ethernet over FireWire> on firewire0
if_fwe0: Fake Ethernet address: 02:30:1b:00:15:a8
fwe0: MAC address: 02:30:1b:00:15:a8
sbp0: <SBP-2/SCSI over FireWire> on firewire0
fwohci0: Initiate bus reset
fwohci0: node_id=0xc800ffc0, gen=1, CYCLEMASTER mode
firewire0: 1 nodes, maxhop <= 0, cable IRM = 0 (me)
firewire0: bus manager 0 (me)
orm0: <Option ROMs> at iomem 0xc0000-0xcbfff,0xcc000-0xcffff,0xd0000-0xd17ff on isa0
pmtimer0 on isa0
fdc0: ready for input in output
fdc0: cmd 3 failed at out byte 1 of 3
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
installed MI handler for int 1
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
installed MI handler for int 4
sio1: can't drain, serial port might not exist, disabling
ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0
ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
ppbus0: <Parallel port bus> on ppc0
plip0: <PLIP network interface> on ppbus0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
ppi0: <Parallel I/O> on ppbus0
installed MI handler for int 7
installed MI handler for int 0
ad0: 58644MB <Maxtor 6Y060L0> [119150/16/63] at ata0-master BIOSDMA
ad4: DMA limited to UDMA33, non-ATA66 cable or device
ad4: 190782MB <ST3200822AS> [387621/16/63] at ata2-master BIOSDMA
acd0: DVD-R <TOSHIBA DVD-ROM SD-R5002> at ata1-master PIO4
Mounting root from ufs:/dev/ad4s3a
cd0 at ata1 bus 0 target 0 lun 0
cd0: <TOSHIBA DVD-ROM SD-R5002 1031> Removable CD-ROM SCSI-0 device
cd0: 16.000MB/s transfers
cd0: Attempt to query device size failed: NOT READY, Medium not present
da0 at umass-sim0 bus 0 target 0 lun 0
da0: <USB2.0 CF CardReader > Removable Direct Access SCSI-0 device
da0: 1.000MB/s transfers
da0: Attempt to query device size failed: NOT READY, Medium not present
da1 at umass-sim0 bus 0 target 0 lun 1
da1: <USB2.0 CBO CardReader > Removable Direct Access SCSI-0 device
da1: 1.000MB/s transfers
da1: Attempt to query device size failed: NOT READY, Medium not present
ad4: UDMA ICRC error writing fsbn 249842603 of 110387664-110387679 (ad4 bn 249842603; cn 15551 tn 250 sn 38) retrying
ad4: UDMA ICRC error writing fsbn 249842603 of 110387664-110387679 (ad4 bn 249842603; cn 15551 tn 250 sn 38) retrying
ad4: UDMA ICRC error writing fsbn 249842603 of 110387664-110387679 (ad4 bn 249842603; cn 15551 tn 250 sn 38) retrying
ad4: UDMA ICRC error writing fsbn 488278315 of 229605520-229605535 (ad4 bn 488278315; cn 30393 tn 234 sn 28) retrying
ad4: UDMA ICRC error writing fsbn 488278315 of 229605520-229605535 (ad4 bn 488278315; cn 30393 tn 234 sn 28) retrying
ad4: UDMA ICRC error writing fsbn 488278443 of 229605584-229605599 (ad4 bn 488278443; cn 30393 tn 236 sn 30) retrying
--
George Georgalis, systems architect, administrator Linux BSD IXOYE
http://galis.org/george/ cell:646-331-2027 mailto:george at xxxxxxxxx
More information about the Bugs
mailing list