The Clustering and Userland VFS transport protocol - summary

Dmitri Nikulin dnikulin at gmail.com
Thu May 11 02:26:02 PDT 2006


On 5/11/06, Jason Smethers <jason at xxxxxxxxxxxx> wrote:
Matthew Dillon wrote:
>     In a clustered environment the execution context (what 'cp' is actually
>     running on) can be anywhere. But there is absolutely no reason for the
>     file data to physically pass through that machine if 'cp' itself does
>     not need to know what the file contains.  If done properly, the actual
>     file data would be transported directly from machine A to machine B,
>     or stay strictly within machine A in the second example.
Are such operations going to be exposed through system calls? In other
words, does this mean that userland utilities will need to be modified
to fully support (efficiently) this type of copy by reference?
cp can probably get away with using sendfile() for files anyway (not
so for more complex copies), but I don't know if DragonFly's does. At
any rate this kind of "control kernel IO via userland" thing is not
new, but using it in this way for clustering seems to be.
Even so, most code bases these days seem very happy to support
specific performance improvements on some systems as long as they can
fit it in a more compatible (i.e. POSIX or ANSI C) way of doing things
via some preprocessor work. For instance, people don't seem to object
to Linux adding system calls (vmsplice, etc.) which can offer some
minor performance gains with their own tragic penalties (i.e. having
to allocate and track N buffers where N is something awful). Compared
to that, an addition which offers much more economical behavior with
no penalty is a great bonus. And if it can be made as part of
sendfile(), that much better.
 -- Dmitri Nikulin






More information about the Kernel mailing list