Berkeley CSUA MOTD:Entry 36454
Berkeley CSUA MOTD
 
WIKI | FAQ | Tech FAQ
http://csua.com/feed/
2025/05/24 [General] UID:1000 Activity:popular
5/24    

2005/2/28-3/2 [Computer/SW/Unix] UID:36454 Activity:kinda low
2/28    I have a a few hundred GB of data that exists in two locations
        connected by a slow network. When a change is made in one place,
        I want it to update the second location (one way only with a 'master'
        copy and a 'mirror'). I use rsync for this now, but it is just too
        slow. Products like SnapMirror exist if you want to spend money,
        but they are also hardware dependent (Netapp). Is there some better
        way to solve this problem? I am not opposed to coding my own solution,
        but I'm not sure where the innovation needs to be made. --dim
        \_ we are currently considering a product by availl that does this.
           we are a huge netapp shop and use snapmirror but needed it to be
           r/w on both ends and platform independant.
        \_ how much data is changing?  Which part of rsync is the bottleneck?
           rsync only transfers the parts that have changed, afaik... are you
           using compression (if text-ish data)?
        \_ the problem with rsync is that it .. at run time ... scans the entire
           set of files, on the local and remote, and looks for differences
           between them.  So it is reading every file and checksumming every
           file, regardless of whether they changed or not, and communicating
           this information across the link between source and destination.
           What you need is something that hooks into the file/operating
           system and only notifies of changes when they happen, and
           propagates them over.  This is what SnapMirror does, and does quite
           well.  Maybe you need to look for coding a similar solution.  As a
           quick hack, you could just have rsync go over the files/directories
           that have had timestamp change since last run, instead of scan
           the whole directory tree. -ERicM
           \_ actually, i think if the files are the same size and same
              modification date, it'll skip them, even if the files are
              actually different.
              \_ There are options in rsync to do this, but in general usage
                 you're wrong.
                 \_ Actually, I'm pretty sure (s)he's right; look at the docs
                    for the --size-only option and what it says about the
                    Normal behavior.  --dbushong
              \_ even if you tell rsync to only check file sizes/timestamps,
                 it *still*, at runtime, it still has to communicate with
                 remote rsyncd to determine if local and remote size+timestamp
                 are different, for each file.  This will eat time and
                 bandwidth galore, unless you're trying to only sync a few
                 large files. -EricM
        \_ If you are willing to do a bit of hacking this isn't so hard. You
           want a custom NFS server that journals modifications and ships them
           to the slave. The slave then applies the journal. If the master is
           a linux box you should look into FUSE (File system in User SpacE or
           something like that), as this would be pretty easy with FUSE + clue
           --twohey
        \_ Veritas has a tool, called Veritas Volume Replicator that does
           this. I have never used it, but considering my previous experience
           with Veritas products, I would expect it to work fine. It is
           also cross platform. -ausman
           \_ We used VVR (as well as Cluster FS) at Walmart. It works very
              well and I'd definitely use it again. I love Veritas' file
              system and volume management products. (Not cheap!) -- Marco
        \_ How about giving rsync a list of files that have been modified
           since the last rsync?  First find recently modified stuff on the
           master, then rsync only that stuff to the mirror.  You might
           be able to hack rsync to do this for you--, like:
           rsync --more-recent-than 24 hours
                --PeterM
           \_ This is a good idea.
        \_ How about OpenAFS or Coda?  They both have mirrored modes, IIRC.
2025/05/24 [General] UID:1000 Activity:popular
5/24    

You may also be interested in these entries...
2009/8/21-9/1 [Computer/SW/Unix] UID:53297 Activity:nil
8/20    When I use rsync to backup, it's pretty cool except in cases where
        I rename a directory name from the source. Rsync will just do
        a plain copy. Is there a program that'll detect renaming of
        directories (by checking for children files), or at least
        move them to a dated directory?
        \_ Not related but beware of using rsync as a backup tool.
	...
2009/7/17-24 [Computer/SW/OS/OsX] UID:53156 Activity:kinda low
7/17    -rw-r--r--@
        What does the "at sign" mean? This is on Mac OS. VMWare disk file.
        \_ The file has metadata attributes
           \_ How do I add/delete attributes to files? What about
              -rw-r--r--+ <-- what is the "+" sign? Also how do you make
              tar preserve these attributes?
	...
2009/2/26-3/5 [Computer/SW/Unix] UID:52653 Activity:nil
2/26    If I want to rsync a perforce or svn directory while users
        are checking things in, could I wind up with a corrupted copy?
        \_ sure why not.  if you are really cool, your copy of the svn
           repo is on LVM, and you snap the LVM to copy it.
           \_ It'll seem less cool when you realize you've taken a snapshot
              of a block level device, which only guarantees that this set of
	...
2008/8/27-9/3 [Computer/HW/Laptop, Computer/SW/OS/Linux] UID:50980 Activity:nil
8/27    I have a rather large linux partition.  I just got a new laptop
        and want to move all my settings and customization to that new
        computer.  how to do this?  I tried remastersys but it seems that it
        get stuck somewhere, and I am hoping it is not really trying to create
        a 26GB iso file.
        any ideas?  is there anyway i can back up my debian package database
	...
2008/6/10-13 [Computer/SW/OS/VM, Computer/SW/Unix] UID:50210 Activity:nil
6/10    Is there a handy guide to virtualizing an already running
        physical linux box into an instance of Vmware?
        \_ this probably isn't the "right" way, but I have many times
        just run rsync. ("rsync -vpa root@oldbox:/ /")  on a fresh virtual
        image.  Just make sure the partitions are the same on the virt disk
        as on the real disk and if you are using a new udev, kill the info
	...
2008/5/29-31 [Computer/HW/Drives] UID:50086 Activity:nil
5/29    Considering abandoning tape backups in favor of something like this:
http://www.thermaltakeusa.com/product/Storage/hdd_station/blacx/st0005u.asp
        Please convince me that I'm crazy.
        \_ why?
           \_ Why do I want you to convince me I'm crazy?
              \_ what are you trying to accomplish?  What is your context?
	...
2008/4/3-9 [Computer/SW/Mail, Computer/SW/RevisionControl] UID:49661 Activity:nil
4/3     In "p4 client", modtime/nomodtime sets the file timestamp to
        either the modification time from submitting client or the time that
        the file is fetched.  Is there a way to set the file timestamp to
        the time that the file was submitted, like what ClearCase does?  Thx.
        \_ How is this different from the modtime option? Are you concerned
           about clock skew or timezone differences?
	...