VM idle page zeroing
Matthew Dillon
dillon at apollo.backplane.com
Mon May 17 17:41:50 PDT 2010
Here are buildworld times on my test box with and without pre-zeroing,
single threaded (no -j option used):
/usr/bin/time -l make buildworld >& /tmp/bw.out
2306.94 real 1740.54 user 393.90 sys nocache=1, enable=1
2305.50 real 1738.80 user 397.38 sys nocache=1, enable=1
2306.85 real 1736.72 user 399.62 sys nocache=1, enable=1
2307.96 real 1737.63 user 398.24 sys nocache=1, enable=1
2306.22 real 1741.29 user 395.15 sys nocache=1, enable=1
2320.21 real 1739.50 user 396.49 sys nocache=1, enable=1
2311.48 real 1739.04 user 396.29 sys nocache=1, enable=1
2323.78 real 1735.54 user 399.61 sys nocache=1, enable=1
2331.38 real 1733.84 user 417.91 sys enable=0
2327.75 real 1733.74 user 415.24 sys enable=0
2322.29 real 1731.73 user 419.57 sys enable=0
2327.53 real 1729.05 user 416.27 sys enable=0
2335.71 real 1731.32 user 417.93 sys enable=0
As you can see it does seem to make a small difference. The system
time is about 20 seconds (4.8%) faster with pre-zeroing and the real
time is around ~18 seconds (0.7%) or so faster.
What this says, basically, is that the pre-zeroing helps out serialized
operations (such as shell scripts) which have a lot of zfod faults but
probably doesn't make much of a difference in the grand scheme of things.
I also tried some bursting tests but it turns out that once the system
is in a steady state the VM free page queue doesn't have enough pages
to really be able to handle the zfod fault rate. So I bumped
vm.v_free_target to an absurdly high number like 100000 (400MB) and
ran a little shell script which ran /bin/echo in a big loop and I
got:
#!/bin/csh
#
set i = 0
while ( $i < 10000 )
/bin/echo fubar > /dev/null
@ i = $i + 1
end
0.437u 1.148s 0:01.59 98.7% 342+164k 0+0io 0pf+0w idlezero enabled
0.460u 1.257s 0:01.71 100.0% 313+155k 0+0io 0pf+0w idlezero disabled
Or around a 7% execution time difference. The problem, though, is that
the system really needs a large number of free pages so the zfod zero
fault burst doesn't drain the pre-zerod page cache during the test.
With the default vm.v_free_target of 20000 on this box (40MB) the zfod
faults strip out the pre-zerod pages almost instantly and the pgzero
thread can't keep up. Run times wind up being 1.69 or only 1.2%
faster.
I then tried bumping up vm.idlezero_rate to infinity (well, 999999)
with a normal vm.v_free_target and that helped some. I got numbers
in the 1.61 to 1.65 range.
It may well be that the correct answer here is to remove the rate
limiting entirely.
-Matt
More information about the Kernel
mailing list