Postfix suddenly stopped working
Matthew Dillon
dillon at apollo.backplane.com
Thu Jul 27 19:19:29 PDT 2006
:I had what may be a similar issue; I found that postfix's "local" daemon
:would stick. In my case, "local" is configured to deliver through procmail,
:but it would occasionally just get stuck (it wouldn't get as far as calling
:procmail in this case). Sometimes a postfix stop and restart would work,
:sometimes not. Sometimes, a postmaster -h <first entry in queue> followed
:by postmaster -H <same entry> would unclog things (that is postfix-speak for
:"put it on hold" and "take it off hold" (moves it to a different queue
:directory and back). Sometimes a reboot was required.
:
:I never did get to the bottom of it. I rebuilt postfix and all of its
:dependencies, with no change in behavior. I did not ktrace the process,
:but gdb told me that it was sticking on an flock - I never did figure out
:which file though.
:
:This was all after upgrading from 1.2.x to 1.4 (and then 1.6). In the end,
:I ran out of time to monkey with it and just switched to sendmail. I've still
:..
:--
: Jon Hamilton
: hamilton at xxxxxxxxx
If it is stuck on an flock then the vnodeinfo program at
/usr/src/test/debug/vnodeinfo.c may help:
cc /usr/src/test/debug/vnodeinfo.c -o /usr/local/bin/vnodeinfo -lkvm
(NOTE! you need the latest HEAD for the program to compile properly,
with the latest /usr/src/test/debug/vnodeinfo.c file and the latest
/usr/src/sys/sys/fcntl.h copied to /usr/include/sys).
With postfix stuck, do:
/usr/local/bin/vnodeinfo -a > /tmp/outfile
Then look for vnode information structures containing LOCKS or BLKED
entries that might be related to the problem.
If there are no blocked locks then it could be a race in our POSIX
locking sleep/wakeup code.
-Matt
Matthew Dillon
<dillon at xxxxxxxxxxxxx>
More information about the Users
mailing list