VFS ROADMAP (and vfs01.patch stage 1 available for testing)
Matthew Dillon
dillon at apollo.backplane.com
Fri Aug 13 13:07:02 PDT 2004
:On 13.08.2004, at 20:09, Matthew Dillon wrote:
:> One would be able to export raw disk partitions as block devices,
:> or
:> file systems (fully cache coherent within the cluster, managed by
:> the kernel), cpu, memory, etc.
:
:Can this also be used to replicate filesystems across boxes? At the
:moment I'm missing a way to run redundant systems with redundant data
:storage, i.e. have a fallback machine that holds the same production
:data and is permanently synced with the master (or even completely
:distributed?)
:
:cheers
: simon
Real time replication is a different problem entirely. For that what
you need is a high level journaling stream. I believe we will have the
the ability to hook in a journaling stream with the new vop_*() API.
The requirement here is that the vop_*() wrappers have some sort of
management structure to hold the 'I want to journal this filesytem'
flag. This structure is looking more and more like it ought to be
a mount point (a struct mount). I can't think of a better structure
to hold information about journaling a filesystem, after all!
Part of my plan is to pass some sort of management structure into the
vop_*() call that will hold the operations vectors (instead of hanging
them off the vnode)... this is needed because in our future many
namespace related VOP's, like open and remove, will not have a vnode
passed in any more. The VOP wrapper routines need a common point of
reference to tell them what to do.
The mount structure pointer would suit both needs BUT there is a
medium sized programming problem that needs to be resolved before
that can happen.... right now a standard filesystem like UFS will
store one of *three* different operations vectors into a vnode depending
on whether the vnode represents a file/dir, a device, or a pipe.
We have to figure out what to do about that before we can move the
operations vector from the vnode to the mount structure (or construct
some sort of governing structure, maybe not 'mount' but something new).
vop_wrapper(mount_p, other args...)
|
V
MOUNT STRUCT
|
+-> Journaling/Replication hooks (kernel managed)
|
+-> Cache coherency hooks (kernel managed)
|
+-> Range locking hooks (kernel managed)
|
+-> [Vnode operations vector] (VFS managed)
I also want to consolidate the struct fileops and device ops functions
into the same management structure. Most of the subsystems listed
above apply equally to fileops AND devices (at least block storage
devices). It would be utterly cool to not only be able to journal
high level FS calls, but to also journal lower level block I/O
calls.
-Matt
More information about the Kernel
mailing list