Deadlock on NFS server

Matthew Dillon dillon at apollo.backplane.com
Mon Sep 27 07:47:48 PDT 2010


:Hi.
:I've been experiencing similar lock-ups on DragonFly NFS server since
:the recent changes to the network code.  Similar, but without objcache
:warnings as in the previous messages.  I can reproduce it on multiple
:machines and it occurs even when mounted via loop-back interface:
:
:  $ sudo mkdir -m1777 /test
:  $ sudo mount -tnfs 127.0.0.1:/test /mnt
:  $ env MAKEOBJDIRPREFIX=/mnt make -sj300 buildworld
:  (leave it for a couple of hours to find many processes stuck
:   in ZOMB state)
:
:Probably it can reproduce with much lower -j number to make command.
:
:Best Regards.

    Hmm.  I was able to get a bunch of processes stuck in 'clock'
    using that setup.  That in turn caused their children to get
    stuck as zombies but it looks like the primary issues is the
    parent's getting stuck in clock.

    This particular deadlock is probably related to the localhost
    mount vs a remote mount.  I'll see if I can track it down today.

    Have you had any issues with remote NFS mounts?  Note that the
    most recent fixes went in on Friday and are server-side.

					-Matt
					Matthew Dillon 
					<dillon at backplane.com>





More information about the Bugs mailing list