Ext3cow Versioning File System Released For 2.6 241

Posted by kdawson on Wednesday May 02, 2007 @07:02AM from the have-a-cow-man dept.

Zachary Peterson writes "Ext3cow, an open-source versioning file system based on ext3, has been released for the 2.6 Linux kernel. Ext3cow allows users to view their file system as it appeared at any point in time through a natural, time-shifting interface. This can be very useful for revision control, intrusion detection, preventing data loss, and meeting the requirements of data retention legislation. See the link for kernel patches and details."

So which is it? (Score:3, Interesting)

by EveryNickIsTaken ( 1054794 ) writes: on Wednesday May 02, 2007 @07:06AM (#18954749)

Ext3cow, an open-source versioning file system based on ext3, has been released for the 2.6 Linux kernel. Ext3cow allows users to view their file system...
Well, is it the file system, or the file system manager?

- Re:So which is it? (Score:5, Informative)
  
  by Bob54321 ( 911744 ) writes: on Wednesday May 02, 2007 @07:10AM (#18954789)
  
  From the example screenshot [www.ext3cow.com] it appears it is a file system. You take a snapshot of your system at some point in time and it stores this data even when files change. Of course, with any file system it is important to have functionality that allows you to view the files as well...
  
  - Re: (Score:3)
    
    by hpavc ( 129350 ) writes:
    
    You don't take a snapshot; that's the big deal with it.
- Can't tell, it's slashdotted (Score:3, Informative)
  
  by tinkertim ( 918832 ) * writes:
  
  Well, is it the file system, or the file system manager?
  I can't tell, the site is experiencing the /. effect.
  
  Mirror of the patch (I grabbed it when I saw this in the firehose) can be grabbed here [echoreply.us] until my server gets sluggish too.
  
  in /usr/src type : patch -p1 < linux-2.6.20.3-ext3cow.patch
  
  The site said it's not been tested with other kernel versions, but if you feel brave just s/linux-2\.6\.20\.3/your-version/g. Haven't tried it, but should work.
  
  It went dark just around the time I was getting the docs and uti
What a name (Score:3, Funny)

by Anonymous Coward writes: on Wednesday May 02, 2007 @07:07AM (#18954761)

So is it EXT or is it just a FAT cow?

- Re: (Score:2)
  
  by morgan_greywolf ( 835522 ) * writes:
  
  Well, they were originally going to call it "Rosie O'Donnell Versioning File System" but the name was too long and the acronym ROVFS just conjured images of that awful rap [youtube.com] by "MC Rove" at the awards dinner.
Overhead? (Score:3, Interesting)

by HateBreeder ( 656491 ) writes: on Wednesday May 02, 2007 @07:07AM (#18954765)

Couldn't find real-world information about space and performance overhead.

Does it store many copies of each file? or only the differences between the old and the new version?

- Re:Overhead? (Score:4, Informative)
  
  by JoeD ( 12073 ) writes: on Wednesday May 02, 2007 @07:31AM (#18954963) Homepage
  
  Check the "Publications" link. The first one is an article in "ACM Transactions on Storage".
  
  It's a bit dry, but there is an explanation of how it stores the versions, plus some performance benchmarks.
  
- Re: (Score:3, Informative)
  
  by DaveCar ( 189300 ) writes:
  
  Couldn't read TFA (slashdotted), but I would *imagine* that 'cow' is copy on write and that it just uses new blocks for the changes - so only the differences, but not minimal differences.
  - Re: (Score:3, Informative)
    
    by anilg ( 961244 ) writes:
    
    COW has been present for a long time in ZFS [opensolaris.org] since Solaris 10. The overhead there is negligible and it's quite stable. Last I heard, it was being ported to FUSE on Linux. Upcoming in the next releases of FreeBSD and OSX. Wiki has a pretty good article [wikipedia.org].
- - Re: (Score:2)
    
    by init100 ( 915886 ) writes:
    
    Generally speaking - when you write out files to the drive they spread out all over the place and each chunk has an i-node or information node that tells a little about what file it is from, and points to the next and last inodes,
    Umm, no. At least for ext3 and similar filesystems, each file or directory corresponds to exactly one inode. The inode contains information about its owner, group, filetype (plain file, directory, symbolic link, FIFO, device file, etc), as well as permission information and extended attributes (such as for ACLs, SELinux security contexts, etc). It also contains pointers to blocklists, but each block does not have a separate inode.
Comment removed (Score:5, Interesting)

by account_deleted ( 4530225 ) writes: on Wednesday May 02, 2007 @07:08AM (#18954769)

Comment removed based on user account deletion

- The C in CVS. (Score:5, Informative)
  
  by SharpFang ( 651121 ) writes: on Wednesday May 02, 2007 @07:28AM (#18954929) Homepage Journal
  
  Concurrent...
  
  Sure you can "go back in time", but two users working on the same file at the same time would be a pain. Networking would require additional layers - even plain SAMBA/NFS, but still. Plus a bunch of userspace utilities as UI to access it easily.
  
  It's not bad as a backend for such a system, just like MySQL is good as a backend for a website, but by itself it's pretty much worthless.
  
- Re: (Score:2)
  
  by Bacon Bits ( 926911 ) writes:
  
  It's a bad idea to use this kind of thing for version control, IMX. The documentation through TFA is very... sparse.
  
  Q: What happens to old snapshots when the disk begins to fill up?
  Q: How do I manage snapshots?
  Q: Are snapshots atomic?
  Q: What happens when a snapshot fails? What can cause a snapshot to fail?
  
  Windows Server 2003's Shadow Copies works in much the same way, AFAICT, and MS goes out of their way to caution against using Shadow Copies as a replacement for backup or version control. I expect this
- Re: (Score:2)
  
  by hey! ( 33014 ) writes:
  
  Version control is to this thing as keeping your vehicle under control while you drive is to having an airbag.
  
  The point of version control is embodied in the name -- it gives you control. Not only does it give you the power to time travel to specific dates, it gives you the ability to find specific versions, to branch and merge, to mediate cooperation between developers.
  
  This sort of thing would be useful in certain version control scenarios, e.g. the guy who checked out the software and has been modifying i
- Re: (Score:2)
  
  by cduffy ( 652 ) writes:
  
  Subversion's backend is a transactional filesystem (though it sits on top of a BDB interface or a separate FS), and many of the tools it provides work by describing a set of changes as filesystem operations (go down this directory, now go down that directory, now open this file, now seek to this position, now write this text...)
  
  That said, revision control is about much, much more than just storing snapshots that can be retrieved later. Think about branching and merging -- particularly intelligent merge algo
- Re: (Score:2)
  
  by 644bd346996 ( 1012333 ) writes:
  
  The first thing I thought when I saw the headline was this: Don't we already have GIT?
  
  Take a look at this: http://kerneltrap.org/node/4982 [kerneltrap.org] Note particularly the bit where Linus says
  In many ways you can just see git as a filesystem - it's content-addressable, and it has a notion of versioning, but I really really designed it coming at the problem from the viewpoint of a _filesystem_ person (hey, kernels is what I do), and I actually have absolutely _zero_ interest in creating a traditional SCM system.
- Re: (Score:2)
  
  by ciggieposeur ( 715798 ) writes:
  
  This might be far fetched but how far off is it to use these filesystems as a revision control system replacement ?
  
  We should probably ask some VMS users about that. They had a versioned filesystem 20 years ago.
  - Re: (Score:3, Interesting)
    
    by scottv67 ( 731709 ) writes:
    
    We should probably ask some VMS users about that. They had a versioned filesystem 20 years ago.
    
    It's actually closer to 30 years ago. I can't believe VMS is celebrating its thirtieth birthday this year.
    
    http://h71000.www7.hp.com/openvms/25th/index.html [hp.com]
    
    Having multiple versions of a file is *extremely* handy. That feature saved me bacon many-a-time. For those of you who have never been fortunate enough to login to a VMS system, the file versioning looks like this to the user: scott_file.txt;5 s
- - Re: (Score:2)
    
    by herrlich_98 ( 267669 ) writes:
    
    Clearcase does have a higher admin overhead than CVS. Clearcase also does not work particularly well over a WAN and I suspect ext3cow or something similar would have the same issue. The use model for a SCM tool based on ext3cow would be similar to NFS which you usually do not use on a WAN.
True undelete (Score:5, Insightful)

by ex-geek ( 847495 ) writes: on Wednesday May 02, 2007 @07:13AM (#18954827)

Undelete, not half-assed, desktop based trash can implementations, is something I've always been missing on Linux. And yes, I generally know what I'm doing, but I'm also human and do make mistakes.

- Re: (Score:2)
  
  by 19thNervousBreakdown ( 768619 ) writes:
  
  I've always wondered about this. Aren't files always eventually deleted with an unlink() call? What reason is there that unlink() can't be modified to instead move the link to a .Trash/ which is then scrounged when more space is needed? You could either auto-delete the oldest files, or if you wanted to not affect FS fragmentation delete a file whenever you needed to clobber one of its sectors. Sure, performance will drop when you get a drive full of deleted files that have to be cleared every time you write
  - Re:True undelete (Score:4, Informative)
    
    by xenocide2 ( 231786 ) writes: on Wednesday May 02, 2007 @08:07AM (#18955373) Homepage
    
    There's a couple reasons for it not being in the kernel. First, it misleads users who expect some degree of data security. The good news is that sort of person likely follows kernel patches to the FS and would likely be aware of the problem, possibly even writing a script that replaces rm with a real-rm.
    
    The second argument is that it's better handled in user space, so the OS doesn't have to make that sort of policy. There's no reason you can't just alias rm to some .Trash, or configure your Desktop Environment to do so (GNOME does, for example). There's all sorts of things you have to decide that might not suit everyone. For example, if I delete a file on a USB drive, does it go in a .Trash storage in the USB drive, or do we copy it over to a main .Trash folder? Many people don't realize they have to empty the trash to reclaim space on their thumbdrive in GNOME.
    
    The final argument I can come up with is security problems. We can't have one global .Trash bin in a multiuser system. And quotas. And permissions.
    
    Reading historic archives of the LKML [iu.edu] suggests it's at least come up once. I guess Torvalds's opinion is that anything that CAN go in userspace SHOULD. Can't explain the webserver in kernel though. Perhaps that opinion has changed some time in the last 10 years?
    
    - Re: (Score:2)
      
      by Cyberax ( 705495 ) writes:
      
      Kernel-level webserver has real performance benefits. And it is not enabled by default.
    - Re: (Score:2)
      
      by aaarrrgggh ( 9205 ) writes:
      
      I still miss the "salvage" from Netware-- the ability to restore any revision to a file as disk space permits. Just hacking rm doesn't fix someone overwriting a file.
      
      As for security, you could disable salvage for sensitive volumes or directories, or have firm policy based wipes of deleted files on a scheduled basis. Often times, Salvage was most useful when a problem was discovered 20 minutes after it occurred.
      
      It sounds like ZFS will do a better job of allowing snapshots to support something like this, bu
    - Re: (Score:2)
      
      by Ant P. ( 974313 ) writes:
      
      Can't explain the webserver in kernel though.
      
      The Tux server has never been a part of the official tree. What's there to explain?
- Re: (Score:2, Interesting)
  
  by jonadab ( 583620 ) writes:
  
  Undelete isn't what makes this really cool, IMO. I don't generally delete stuff I still want, so that isn't really a big issue.
  
  What I want, that a versioning filesystem can deliver, is the ability to revert a file back to an earlier version, after I've saved changes that turn out to be undesirable. This is a mistake I *do* make from time to time, often enough that I have been really hoping for a versioning filesystem in modern operating systems. This, to me, is a killer feature. I'm currently using Free
  - Re: (Score:2)
    
    by TheNetAvenger ( 624455 ) writes:
    
    and I've been waiting, waiting, hoping, wondering why we don't have it in modern operating systems. I *want* this
    
    Look up Windows 2003 Server, WindowsXP, Vista...
  - Re: (Score:2)
    
    by Gordonjcp ( 186804 ) writes:
    
    I have been wanting it ever since I saw the automatic versioning on OpenVMS, and I've been waiting, waiting, hoping, wondering why we don't have it in modern operating systems.
    
    Someone I know has the email signature "DIGITAL had it *then*. Don't you wish you could buy it *now*?"
- Re: (Score:2)
  
  by gmack ( 197796 ) writes:
  
  Undelete in windows is also desktop based. Ever notice that uninstallers don't delete to the "Recycle bin"? You can also try opening a cmd window and deleting something with del and notice that it does not appear in the "recycle bin"
- Re: (Score:2)
  
  by fireboy1919 ( 257783 ) writes:
  
  You mean, for example, remapping the unlink calls to libc to actually move things to ~/Trash?
  
  Surely you wouldn't want that on all accounts. Only for users. It'd be chaos if every single script on your computer that generated temp files had them moved rather than deleted.
  How about putting it into the .bashrc (or .zshrc, or whatever) file to be loaded using the preload trick?
  
  That way, all users that have that .bashrc file can have it on, and everything else won't.
  
  There is a library for this. [nyu.edu]
  
  something I've a
- - Re: (Score:2)
    
    by ex-geek ( 847495 ) writes:
    
    guess what, if you had enough energy to type www.google.com the first 8 links are all great projects and ways to do exactly what you said for the keywords "linux undelete"
    
    but then, typing those letters into a web browser is simply way too much effort.
    These options went out of the window with the introduction of journaling in ext3. But even with ext2, they barely worked, especially for large files. They didn't work for me anyway.
    you must be either management or incredibly lazy.
    I guess you are the 18-year-old i
Well, congratulations. (Score:2)

by jimicus ( 737525 ) writes:

Well done to all who worked on this patch. Guess this means you've almost caught up with OpenVMS [wikipedia.org] now, then? [throws another log of karma on the fire].

All joking aside, I never really liked VMS much. It was extremely good at being very verbose whilst being extremely bad at clear English.
- Re: (Score:2)
  
  by MichaelSmith ( 789609 ) writes:
  
  Well done to all who worked on this patch. Guess this means you've almost caught up with OpenVMS now, then?
  In the sense that you had multiple versions of every file? Well yeah but it is on a per file basis rather than a per volume basis so you can't ask it to give you the entire volume (or even a directory) as it was at a particular time.
  And I remember being caught by the 32000 version number limit, with a batch job which maintained a status file and purged the file after every run. The version number sti
VMS file versions someone? (Score:4, Interesting)

by ntufar ( 712060 ) writes: on Wednesday May 02, 2007 @07:22AM (#18954881) Homepage Journal

It reminds me of VMS file versions.

In VMS if you had a file named article.txt, each time you modified and saved it in editor, a new version was created named article.txt;1 article.txt;2 article.txt;3 and so forth. So after a long session of edit and saves you could end up with a hundred copies of file in your directory. A lot of clutter in the directory but easy access to older versions of the files.

With Ext3cow you basically get the same functionality in a slightly different way. By default you see only article.txt file. If you need to access a previous version of the file you need to specify a cryptic code like this: article.txt@10233745. A bit cumbersome but, hey, how often you access older version of your file anyways. Looks better than VMS' approach.

This filesystem seems like a perfect solution for me as I am writing my Ph.D thesis. Currently I take a backup every day and name it thesis20070420.tar.bz2, thesis20070421.tar.bz2, thesis20070422.tar.bz2 and so forth in case I need to go back and see how it looked some time ago.

However, in my home directory I have a lot of large audio and video files that I would never want to be versioned. I wonder if Ext3cow keeps extra copies of the files if I move them around or change file names but do not modify the content. Probably I would have to make a new partition and put my text files I am working on there under Ext3cow and leave my media files on ext3.
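
For anyone who wants to script the name@timestamp lookup instead of typing the number by hand, here is a minimal Python sketch. It assumes the number after '@' is a Unix epoch in seconds, which is how the examples in this thread read but is not confirmed against the ext3cow docs; versioned_name is a made-up helper, not part of any ext3cow tool.

```python
from datetime import datetime, timezone


def versioned_name(path: str, when: datetime) -> str:
    """Build an ext3cow-style 'name@epoch' path (assumed syntax)."""
    # Assumption: the suffix after '@' is seconds since the Unix epoch.
    epoch = int(when.timestamp())
    return f"{path}@{epoch}"


if __name__ == "__main__":
    # Hypothetical example: the thesis as it was on 20 April 2007, noon UTC.
    moment = datetime(2007, 4, 20, 12, 0, tzinfo=timezone.utc)
    print(versioned_name("thesis.odt", moment))
```

The resulting string could then be handed to cat, cp or an editor like any other path, if the filesystem resolves it the way the submitter describes.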

- Re: (Score:2)
  
  by JohnFluxx ( 413620 ) writes:
  
  Why don't you use svn?
  - Re: (Score:2)
    
    by osgeek ( 239988 ) writes:
    
    Or better yet, SVK [bestpractical.com].
  - - Re: (Score:2)
      
      by Yaztromo ( 655250 ) writes:
      
      Frankly, using SVN would be just too much effort for me: I may forget to commit the changes after a day of work; the files are binary .odt files; I need to teach my wife to use it.
      
      Why not just extract your ODF file before committing? Other than graphic figures it's all text data inside a ZIP wrapper.
      Why is your wife working on your thesis?
      Why would you be any more likely to forget to run "svn commit" than you would be to tar your files up every day? And if you're likely to forget either, why not ju
    - Re: (Score:2)
      
      by karmatic ( 776420 ) writes:
      
      So mount SVN over webdav, and turn on auto-versioning. Whenever you make a change, it gets committed as a new revision.
    - Re: (Score:2)
      
      by radarsat1 ( 786772 ) writes:
      
      Not to repeat too much what other people have said, but I _highly_ recommend using subversion to keep track of your school work. I am currently writing my Master's thesis, and I'm using subversion to track my .lyx and .tex files. Not only does it allow me to go back to old versions, but it helps me to keep everything synchronized when I switch from working on my laptop to my desktop or to the computer in my lab. Version control is not only for code, it is for _source files_. That includes anything that
- Security, backups (Score:3, Interesting)
  
  by Midnight Thunder ( 17205 ) writes:
  
  This solution certainly helps if you accidentally delete something or need to go back to an older version. SVN is one solution, but it is a bit more explicit, while solutions like this and Apple's Time Machine help avoid needing to remember to update your repository. It should be noted that this doesn't replace backups, since this does not protect against hard-drive corruption. I do have a few of questions though:
  - what are the security considerations here?
  - can you delete the
- Re: (Score:2)
  
  by GauteL ( 29207 ) writes:
  
  "If you need to access a previous version of the file you need to specify a cryptic code like this: article.txt@10233745. A bit cumbersome but, hey, how often you access older version of your file anyways. Looks better than VMS' approach."
  
  This is exactly what a graphical file manager should abstract away through concepts such as time machine [apple.com].
  
  This announcement is just Linux file systems starting to catch up with features from file systems such as ZFS. Very good news.
  - Re: (Score:2)
    
    by TheNetAvenger ( 624455 ) writes:
    
    This is exactly what a graphical file manager should abstract away through concepts such as time machine [apple.com].
    
    I know this is SlashDot, but why reference a non-shipping product as the GUI standard example for this feature when it has been being used in Windows 2003 and WindowsXP for over 4 years?
    
    Vista even goes a few steps beyond previous Windows versions and Time Machine.
    
    PS from the last Beta I played with, Time Machine's UI has a ways to go to catch up to the simplicity of right click -> previous
- Re: (Score:2)
  
  by arivanov ( 12034 ) writes:
  
  Not quite.
  
  This is more like NetApp and other high-end NAS and SAN systems where a facility like this is used for backup. The backup system looks at a snapshot taken at X:00 and backs it up at leisure while the users continue to read/write to the filesystem on top of it. Once the backup is complete you obsolete the checkpoint on which the backup was operating. As a result you have a true backup of the filesystem at point X, not something that spread from X to X+N hours.
  
  This is a killer feature as far as any
- Re: (Score:2)
  
  by cortana ( 588495 ) writes:
  
  Interesting... but tracking the revisions of a file by name has some limitations. What happens if I rename a file (also to another directory)? What happens if I rename a directory itself? Is the file metadata (owner, access permissions, modification times, extended attributes (including selinux labels, ACLs and user extended attributes)) versioned?
  
  I guess some of this info is on the project's home page, which is down at the moment...
- Re: (Score:2)
  
  by rbanffy ( 584143 ) writes:
  
  If I got it correctly, you would only have a new copy of the directory when you rename or move the file. The file will only be copied if it is changed.
  
  And it is only necessary if you are doing it based on files. If you do it based on blocks, then only the blocks that were changed get copied.
  
  It seems quite cool. Too bad all servers even remotely related to it appear to have been slashdotted.
- VMS file versioning was lame (Score:2)
  
  by mkcmkc ( 197982 ) writes:
  
  VMS was my first real OS, and I don't miss it at all. Its versioning was fairly useless--one of the first commands everyone learned was PURGE, to get rid of all of the clutter. In order to be useful, other versions have to be out of view during normal operation...
- Re: (Score:2)
  
  by caseih ( 160668 ) writes:
  
  ext3 and ext3cow are inode file systems. So if you rename the file or move it anywhere on the disk, the inodes allocated to the file stay the same. With ext3cow, the inodes that make up the versions would stay the same too.
- Re:VMS file versions someone? (Score:4, Interesting)
  
  by physicsnick ( 1031656 ) writes: on Wednesday May 02, 2007 @10:33AM (#18957503)
  
  Hmm, when I read your post I thought I'd come here and suggest Subversion. Seems everyone else has done the same.
  
  You really should use it. It's much easier to set up than you'd think, especially if you're on a Debian/Ubuntu box. If you use the file:/// syntax, you don't even need any kind of daemon or http server running; the client can do everything on its own. Say your thesis is currently sitting in ~/thesis, it's this easy to set up:
  
  sudo apt-get install subversion
  svnadmin create ~/thesisrepo
  svn import ~/thesis file:///home/${USER}/thesisrepo -m "Initial import"
  mv thesis thesisbackup
  svn co file:///home/${USER}/thesisrepo thesis
  
  That's it, you're done. ~/thesis is now a working copy of your repository, the repository itself (which will hold all versions of your files) is contained in ~/thesisrepo, and your original folder is backed up as ~/thesisbackup.
  
  To work on your thesis, go into ~/thesis and start writing as you've always done. When you want to save a snapshot of the current state of your thesis (i.e. commit your changes), open a bash terminal, go into ~/thesis and type svn ci -m "some message". That's it. Much easier than running a backup; you can just stick it in a daily (even hourly) cron job. To back up all versions of the thesis on removable media, tar up the ~/thesisrepo folder and put it somewhere safe.
  
  There's a bit more to know about it; namely you need to tell subversion when you add, remove, move or rename files. A good source for that is the Subversion Book [red-bean.com], specifically Chapter 2.
  
Smells like dirvish (Score:2, Interesting)

by Zekat ( 596172 ) writes:

This sounds like http://www.dirvish.org/ [dirvish.org], which is nearly as nice as the automatic file snapshots done by the "Network Appliance" fileserver boxes I've used at the last 2 out of 3 workplaces.
Ze First Step (ZFS) (Score:2)

by udippel ( 562132 ) writes:

Done it, been there.
Guess, this is the first step to approach ZFS, which for some stupid licence reason doesn't seem to have an easy path into the Linux kernel.
ZFS does a few, actually a lot, more. But why not write a different solution, for a plurality of choice.
May the best win !
- Re: (Score:2)
  
  by OverlordQ ( 264228 ) writes:
  
  IIRC the main reason ZFS wont make it into the kernel is that a non-trivial amount of the filesystem kernel code would need to be re-written.
some background (Score:5, Informative)

by pikine ( 771084 ) writes: on Wednesday May 02, 2007 @07:59AM (#18955281) Journal

I'm answering questions that people posted so far altogether.

Is it a file system or a file manager?

It is a file system. You access an old snapshot by appending '@timestamp' to your file name. You first have to instruct ext3cow to take a snapshot before you can retrieve old copies; otherwise it simply behaves like ext3. It appears that a snapshot is always performed on a directory and applies to all inodes (files and subdirectories) under it.

My complaint is its use of '@' to access snapshot. Why not use '?' and make it look like a url query? Better yet, use a special prefix '.snapshot/' like NetApp file servers.

Does it store many copies of each file? or only the differences between the old and the new version?

How far off is it to use these filesystems as a revision control system replacement?

ext3cow takes its name from "copy on write," and it does this on the block level. When you modify a file, it appears to the file system that you're modifying a block of e.g. 4096 bytes. COW preserves the old block while constructing a new file using the blocks you modified plus the blocks you didn't modify.

You can think about it as block-level version control. However, when you save a file, most programs simply write a whole new file (I'm only aware of mailbox programs that try to append or modify in-place). Block-level copy on write is unlikely to buy you anything in practical use.
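
To make the block-sharing idea concrete, here is a toy Python model. It is only a sketch of copy on write in general, not ext3cow's on-disk format: CowFile, write_block and read are invented names, blocks live in a dict, and every write creates a new version (the real thing, per the above, only versions at snapshots).

```python
from typing import Dict, List

BLOCK_SIZE = 4096  # bytes per block, matching the example size above


class CowFile:
    """Toy copy-on-write file: each version is a list of block ids."""

    def __init__(self) -> None:
        self.blocks: Dict[int, bytes] = {}      # block id -> contents, never overwritten
        self.versions: List[List[int]] = [[]]   # version 0 is the empty file
        self._next_id = 0

    def _store(self, data: bytes) -> int:
        # Allocate a fresh block instead of modifying an existing one.
        block_id = self._next_id
        self._next_id += 1
        self.blocks[block_id] = data
        return block_id

    def write_block(self, index: int, data: bytes) -> int:
        """Write one block; return the number of the new version."""
        if len(data) > BLOCK_SIZE:
            raise ValueError("data does not fit in one block")
        layout = list(self.versions[-1])        # copy the block map, not the data
        if index > len(layout):
            raise IndexError("this sketch does not support sparse files")
        new_id = self._store(data)              # the old block stays untouched
        if index == len(layout):
            layout.append(new_id)
        else:
            layout[index] = new_id
        self.versions.append(layout)
        return len(self.versions) - 1

    def read(self, version: int) -> bytes:
        # Reassemble a version from whichever blocks its map points at.
        return b"".join(self.blocks[b] for b in self.versions[version])


if __name__ == "__main__":
    f = CowFile()
    f.write_block(0, b"hello ")
    v2 = f.write_block(1, b"world")
    v3 = f.write_block(0, b"HELLO ")
    print(f.read(v2), f.read(v3))
    # Block 1 is shared between both versions; only block 0 was duplicated.
    print(f.versions[v2][1] == f.versions[v3][1])
```

In this model a program that rewrites the whole file replaces every block id, so nothing ends up shared between versions, which is the caveat raised above.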

Does it provide undelete?

Only when you remember to make a snapshot of your whole directory. An hourly cron-job would do, maybe. There is always the possibility you delete a file before a snapshot is made.

- Compatibility and copy on write... (Score:2)
  
  by argent ( 18001 ) writes:
  
  My first thought was the same as yours, why not use the ".snapshot" prefix from NetApp, so that scripts and tools written for NetApp servers will continue to work.
  
  Second, I have hundreds of mail folders saved in files with names like "user@example.com". Oops.
  
  Block-level copy on write is unlikely to buy you anything in practical use.
  
  For binary files (eg, databases) it will. And it's pretty cheap to implement... for a whole-file write operation where the file is first truncated the cost is the same as if the
- - Re: (Score:2)
    
    by pikine ( 771084 ) writes:
    
    The 'real' solution is new system calls, new shells that know about them--a top to bottom extension of POSIX filesystems.
    Tools augmented with snapshot support won't save you any typing. You would have to specify additional command line, which is likely going to be longer than namespace hacks. If you're concerned about number of characters to type, you should prefer namespace hack.
    Why not use '?'? Perhaps you are not yourself a Unix/Linux user--that one's a shell wildcard
    In Unix, you can escape both '*'
No Data (Score:2)

by wild_berry ( 448019 ) writes:

I can't see anything linked from the ext3cow.com site, save for the near-silent mailing lists. I'm tagging this 'slashdotted'. There's not even a huge amount on the Wayback Machine: http://web.archive.org/web/*/http://ext3cow.com [archive.org]

I guess that this is a fork of the ext3 code with Copy On Write functionality and userland tools to make snapshots and time-travel the snapshots. Wikipedia's article on Ext3cow [wikipedia.org] names Zachary Peterson, the submitter of the article, and links to an ACM Transactions on Storage paper
Linux is catching up to BSD... (Score:2)

by mi ( 197448 ) writes:

BSD operating systems had filesystem snapshots [wikipedia.org] functionality for several years now... Linux is catching up — in a usual Linux way with patches, which one has to collect from all over...
Or am I misreading the write-up and this new ext3cow thingy is much more than that?
- Re: (Score:2)
  
  by Jokkey ( 555838 ) writes:
  
  Linux has had filesystem snapshots (via LVM) for quite a while too. Ext3cow, as I understand it, differs in that it lets users access previous versions of individual files from within the current filesystem, rather than creating a snapshot of an entire filesystem or disk. As far as I know, it takes space out of the existing ext3 filesystem to do this, rather than using previously unallocated space within the disk volume group.
  - Re: (Score:2)
    
    by emj ( 15659 ) writes:
    
    The BSD feature is the same as ext3cow, and it's been there for a while. The LVM snapshots were very cumbersome some years ago, and they are block snapshots, not filesystem snapshots.
    - Re: (Score:2)
      
      by darrylo ( 97569 ) writes:
      
      ZFS [wikipedia.org] has recently been added to FreeBSD [freebsd.org]. ZFS is also rumored to be added to OS/X [insanelymac.com].
      So, yes, Linux does have some catching up to do. ;-)
- Fanboy (Score:2)
  
  by Slashdot Parent ( 995749 ) writes:
  
  This is not even close to the same thing that is a BSD filesystem snapshot [freebsd.org], but don't let that interrupt your furious fanboy wankfest.
  
  BSD snapshots are a lot like LVM snapshots (that have been available in Linux since 1998), except that under Linux, you are not limited to 20 snapshots.
  
  What ext3cow does, which you would realize if you would have opened your ears before your mouth, is give you true point in time recovery. In other words, without ever manually "taking a snapshot", like you'd have to under BSD, you ca
  - Re: (Score:2)
    
    by mi ( 197448 ) writes:
    
    you can simply revert your filesystem to where it was at any arbitrary point in time.
    No, you can't. According to this example [www.ext3cow.com] you need to issue an explicit "snapshot" command — I checked my facts before posting, as well as I could, anyway. There is no word yet on the maximum number of snapshots — they may well be limited to 20 as well.
    What a major oopsie, I might add... I mean, you could've come up humbly with something "As far as I know, ext3cow is better, because it requires no explicit sn
- Re: (Score:2)
  
  by Mysticalfruit ( 533341 ) writes:
  
  Actually Linux has supported snapshots through the LVM layer for several years as well. This isn't a filesystem snapshot, it's a per file snapshot system.
Ubuntu? (Score:2)

by wile_e_wonka ( 934864 ) writes:

I heard Ubuntu was planning to upgrade to Ext4 for Feisty, and then it fell through, and instead they were planning on Ext4 being available as a patch at approximately the same time Feisty was released. Is Ext3cow the change that Ubuntu was planning to implement? (I realize Ext4 is different from Ext3cow, but I'm wondering if Ubuntu's getting this as an automatic update.)
NILFS? (Score:2)

by stu42j ( 304634 ) writes:

Anybody use the similarly featured NILFS?
NILFS is a log-structured file system developed for the Linux, and it is downloadable on this site as open-source software.

http://www.nilfs.org/en/index.html [nilfs.org]
it's NOT a versioning filesystem (Score:2)

by sloth jr ( 88200 ) writes:

It's simply a filesystem with snapshots. Big deal. It'll only do cool stuff when you tell it to make a snapshot, not every time a file changes.
Interesting - I have a couple of questions (Score:3, Interesting)

by ratboy666 ( 104074 ) writes: <fred_weigel@hotm ... inus threevowels> on Wednesday May 02, 2007 @09:59AM (#18956985) Journal

No flaming -- I don't have the time to research this, so I'll just post the questions!

1 - What happens to large databases? I am assuming a delta storage method, but that might slow down the database (specifically, I use mysql).

2 - Large files? Specifically, deletion (I store lots of videos)

3 - Usenet spools? (Lots of small files, deleted regularly).

I suspect that I would have to segregate my files...

- Re: (Score:2, Interesting)
  
  by Anonymous Coward writes:
  
  So because it was a good idea 20 years ago, it somehow isn't good that it's been implemented now? Sure, in an ideal world we'd all have been using versioned filesystems since the advent of VMS, but we haven't.
  
  Actually, I tell a lie; the ISO9660 spec copies the VMS design and also allows files to have a version number, using the exact same scheme, i.e. the version # is appended to the file name following a semicolon. So "FOO.BAR;1" is a valid ISO9660 filename.
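
The "name;version" scheme described above can be sketched in a few lines of Python. This is only an illustration of the naming convention, not code from VMS or any ISO9660 implementation; the helper names split_version and latest are made up for the example.

```python
from typing import Dict, Iterable, Optional, Tuple


def split_version(filename: str) -> Tuple[str, Optional[int]]:
    """Split 'FOO.BAR;1' into ('FOO.BAR', 1); version is None if absent."""
    base, sep, suffix = filename.rpartition(";")
    if sep and suffix.isdigit():
        return base, int(suffix)
    # No semicolon, or the text after it is not a number.
    return filename, None


def latest(filenames: Iterable[str]) -> Dict[str, int]:
    """Map each base name to its highest version number (hypothetical helper)."""
    newest: Dict[str, int] = {}
    for name in filenames:
        base, version = split_version(name)
        if version is None:
            continue
        if base not in newest or version > newest[base]:
            newest[base] = version
    return newest
```

Opening a file "by name" would then mean looking up latest(...)[name] and opening that version, while an explicit ";N" suffix selects an older one.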
- Re: (Score:2)
  
  by ajs318 ( 655362 ) writes:
  
  This is one of the things I missed when I moved from VMS on a VAX 11/750 {or it might have been a 780?} to MS-DOS on a '286. The commands were kind of similar between the two OSes, though DOS didn't have EVE -- which for me was the killer app. Version numbers, EVE, case-insensitivity and commands that were not "telegraphese" {there not being such a word as "txtspk" in those days when mobile phones were analogue, half-duplex [you squoze a switch in the handset when you wanted to speak, and let go to li
- Re: (Score:2)
  
  by TractorBarry ( 788340 ) writes:
  
  And Fujitsu's (ex-ICL) VME (Virtual Machine Environment) also has generation numbers. And they're a brilliant idea.
  
  E.g. access a file by name, e.g. "OPEN_FILE(FOO)", and that will open the highest generation (latest version) of the file. Want to access an old version? Simply specify something like "OPEN_FILE(FOO(23))". Obviously there are tools for tidying up old generations etc.
  
  And when you edit a file it is always saved to a new, higher, generation so you can always go back to previous version after yo
- Re: (Score:3, Interesting)
  
  by psbrogna ( 611644 ) writes:
  
  I don't think it's supposed to be new (it's one of the things I miss most about VMS). It's outstanding functionality to have both for end users and sysgeeks/devs; built right into the file system level (i.e. LOW). I prefer this approach to the hacks that other O/S's have implemented at a higher level. It's always harder to do something like this down deep at the roots rather than add it on as superficial gloss later. Granted, the end users don't usually notice or appreciate the diff but over time it keeps co
- Re: (Score:3, Informative)
  
  by TodMinuit ( 1026042 ) writes:
  
  It's more like Plan 9's Fossil [bell-labs.com], only without the extremely cool Venti [bell-labs.com].
- Re: (Score:2)
  
  by delire ( 809063 ) writes:
  
  Not trolling but just somebody enlight me, what is new here?
  It is for Linux. That is what is new. The two examples you give are for other operating systems. Raising your eyes to the top of the page will reveal this article is in the section "Linux". It's a bit tricky I know.
  
  Psst: it's not a race.
  - Re: (Score:2)
    
    by init100 ( 915886 ) writes:
    
    It is for Linux. That is what is new.
    
    Actually, snapshots with copy-on-write functionality are not new in Linux, but they haven't been available in the filesystem itself. The Logical Volume Manager is able to create and use COW snapshots, and has been for some time.
    - Re: (Score:2)
      
      by delire ( 809063 ) writes:
      
      Actually, snapshots with copy-on-write functionality are not new in Linux, but they haven't been available in the filesystem itself.
      Precisely.
  - Re: (Score:2)
    
    by spitzak ( 4019 ) writes:
    
    There are already versions of this on Linux. One I use at work does it instead by making a new directory with the older versions of the files in it. To see yesterday's version of ./foo.txt you look at ./.snapshots/yesterday/foo.txt. This seems a lot nicer as you can more easily see when files are created and deleted as well as modified, though it is possible that technical limitations prevented ext3cow from using this naming scheme. I'm not sure what the system is, as it is a large file server, it is running Linu
- Re: (Score:3, Informative)
  
  by samkass ( 174571 ) writes:
  
  Apple's Time Machine isn't just a *file* backup system. It's a *record* recovery system. Neither MS Shadow Copies nor this provides an API for software to search records back through time and pull a single record back to the present (ie. a single address book entry or photo). It's frustrating having people equate them so closely when it misses half the point of Time Machine.
- Re: (Score:2, Insightful)
  
  by heffrey ( 229704 ) writes:
  
  What evidence do you have that this is reverse engineering?
  
  Or do you mean that they are re-implementing Time Machine?
  - - Re: (Score:2)
      
      by cortana ( 588495 ) writes:
      
      "Theft" how?
      
      And of what IP?
      
      Make a specific allegation or stop trolling, please.
    - Re: (Score:2)
      
      by init100 ( 915886 ) writes:
      
      Do you actually mean that it is "IP theft" to take functionality from the Linux Logical Volume Manager and implement it per file in the file system instead? Hardly.
    - Re: (Score:2)
      
      by ajs318 ( 655362 ) writes:
      
      "Theft", you say. Theft is unlawfully taking something that belongs to another person, with intent permanently to deprive them of it {so it's generally a defence to theft that you believed the former owner intended to destroy the article, since you can argue that you intended only temporarily to deprive them of it [for however long it would have taken them to destroy it]; though if the article derives value from the manner of its destruction [for example, a cream cake that they intended to destroy by e
- Re:Can No One Else INNOVATE? (Score:4, Insightful)
  
  by beezly ( 197427 ) writes: on Wednesday May 02, 2007 @07:59AM (#18955285)
  
  Go away MacTroll...
  
  Veritas VxFS has had this for years. Snapshotting has been implemented in the Linux LVM layer for ages. This is just another way to do it.
  
  I don't know anything about the technical implementation of Vista Shadow Copies or Apple's Time Machine, but if it's anything like ZFS [wikipedia.org] then I'll be impressed. I believe there are rumours about the next release of OS X using ZFS (which was developed by Sun), but I'll believe it when I see it.
  
  - time machine (Score:2)
    
    by douthat ( 568842 ) writes:
    
    I don't know anything about shadow copy, but Apple's Time Machine is all userland. There is a process that looks for file system events and logs the files that have been changed. Every x time units (e.g. 1 hour) a heavily hardlinked copy of your most recent backup is copied to a new tree and the newly modified files are copied over there. Every y time units (e.g. 1 day), all but the day's newest backup are deleted. If you run out of space, old trees are also deleted.
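
The hardlink trick described above can be sketched roughly as follows. This is not Apple's code and makes assumptions of its own: the function name snapshot and its parameters are invented here, and instead of watching file system events it simply compares each file against the previous backup tree. Unchanged files become hard links into the old tree, changed or new files are copied. Pruning old trees is left out.

```python
import filecmp
import os
import shutil
from typing import Optional


def snapshot(source: str, previous: Optional[str], target: str) -> None:
    """Create a new backup tree at `target` (hypothetical helper).

    Files identical to the copy in `previous` are hard-linked, so they
    take no extra space; everything else is copied from `source`.
    """
    for dirpath, _dirnames, filenames in os.walk(source):
        rel = os.path.relpath(dirpath, source)
        target_dir = os.path.normpath(os.path.join(target, rel))
        os.makedirs(target_dir, exist_ok=True)
        for name in filenames:
            src = os.path.join(dirpath, name)
            dst = os.path.join(target_dir, name)
            old = os.path.join(previous, rel, name) if previous else None
            if old and os.path.isfile(old) and filecmp.cmp(src, old, shallow=False):
                os.link(old, dst)       # unchanged: share the old copy
            else:
                shutil.copy2(src, dst)  # new or modified: real copy
```

Because every tree looks like a full copy, deleting an old tree only frees the files that no newer tree still links to. Hard links require both trees to live on the same file system.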
  - - Re: (Score:2)
      
      by shani ( 1674 ) writes:
      
      As for me being a troll: When does debate end and trolling begin?
      
      Good question.
      
      I was simply pointing out that this "smelled" much like Time Machine, albeit a clumsy, wholly unintuitive version of the underlying technology.
      
      Here, for instance, the trolling begins at the word "clumsy".
    - Re: (Score:2)
      
      by rbanffy ( 584143 ) writes:
      
      You say it's unintuitive just because it has no GUI.
      
      And it shouldn't have one - it's a file system, not a userland application. The userland applications will come and may even look like Time Machine (I was once impressed, but it got less and less impressive over time, as I learned more about ZFS and LVM snapshots). I hope not - it's cool but not that functional.
      
      OSX is a nice piece of software and sure solves a lot of problems for its users, but claiming this is in any shape or form inspired on Time Ma
- Re: (Score:2)
  
  by jonadab ( 583620 ) writes:
  
  Actually, filesystem versioning is older than Apple as a company, much less OS X. ITS had it in the sixties, and VMS has had it since the late seventies. Nonetheless, it's an undeniably useful feature, and I'm glad it's finally making its way into the major OSes.
- - - - Re: (Score:2, Informative)
        
        by siride ( 974284 ) writes:
        
        Because it wasn't REVEALED until 2006, so even if Apple was working on it in 2002 (not likely, since Open Source projects generally have longer cycles than proprietary ones due to manpower issues), the ext3cow people would not have been aware of it. Why do you think people are stealing this from Apple? It's a good idea that follows logically from ideas found in revision control software such as Subversion and its predecessors. And as others have pointed out, VMS had this 20 years ago. The idea certainly
  - - Re: (Score:2)
      
      by init100 ( 915886 ) writes:
      
      As I replied to another post: It's IP theft, direct or indirect; but IP theft just the same.
      
      As I replied to another of your posts: It isn't.
- - - Re: (Score:2)
      
      by frogstar_robot ( 926792 ) writes:
      
      There is a huge difference between reverse engineering and reimplementing. To reverse engineer a thing, it has to be possible to study it in detail. Seeing a cool demo and making something that works like it isn't reverse engineering, that is re-implementing. Also, neither reverse engineering nor reimplementation is automatically stealing. Apple would be in a pretty piss poor spot if they themselves could not re-implement. It surely isn't as though only Apple has the right to make accessible tec
- Re: (Score:2)
  
  by sofar ( 317980 ) writes:
  
  It appears that this is the long-term goal for ext3cow. If you look at the formatting of the patch it's obvious that they have worked towards fully integrating their code into the kernel tree from the start (instead of building a separate set of code that compiles outside of it just like -e.g.- the fglrx installer does).
  
  I would assume that their next step is to submit it to Andrew Morton and stage it for later merging into the mainline tree. From how the code looks I can see that that might go quite fast.
- - Re: (Score:3, Insightful)
    
    by ajs318 ( 655362 ) writes:
    
    The Linux kernel will never, ever have a stable ABI. Compatibility across versions is guaranteed only at the Source Code level, not the binary level. This is 100% intentional, and the only people it really hurts are those who would deny us access to the Source Code. And they deserve it.
    - Re:Excellent work but... (Score:4, Insightful)
      
      by oliverthered ( 187439 ) writes: <oliverthered@hot ... minus physicist> on Wednesday May 02, 2007 @08:13AM (#18955471) Journal
      
      You're wrong, it also hurts those people who write drivers that aren't accepted into the kernel. And it also hurts end users, or haven't you noticed the lack of Linux drivers for a lot of hardware?
      
      - Re: (Score:3, Informative)
        
        by cyclop ( 780354 ) writes:
        
        If someone writes kernel drivers correctly, those drivers will end up in the mainline kernel. Linux supports more hardware out of the box than any other OS, no matter how obsolete and obscure. If you don't have your drivers accepted, AFAIK it's a problem with your code not being of good enough quality, nothing else.
    - Re:Excellent work but... (Score:4, Insightful)
      
      by Toffins ( 1069136 ) writes: on Wednesday May 02, 2007 @08:30AM (#18955733)
      
      Compatibility across versions is guaranteed only at the Source Code level
      
      (Disclaimer: Linux is excellent) But is compatibility even guaranteed at source code level?
      Here are some specific examples where source level API changes have occurred:
      1. Consider that up to linux-2.6.6 all SATA disks were treated as IDE PATA disks accessible via /dev/hd*, but in linux-2.6.7 they started to be treated as SATA disks only accessible via /dev/sd*. This changeover caused existing SATA disk systems to become unbootable after upgrading to linux-2.6.7 because the boot device at /dev/hd* was no longer accessible. Never documented in kernel/Documentation/*
      2. And between linux-2.6.15 and linux-2.6.20 the way the usb subsystem handled usb devices was changed so that usermode usb drivers like the usermode speedtouch driver were broken, due to the kernel returning EINVAL from each USBDEVFS_SUBMITURB command which is required after a USBDEVFS_CONTROL command issued by the modem_run ADSL line monitoring process. This generates thousands of error messages per second via syslogd. No news of this particular aspect of the usb changes was ever documented in kernel/Documentation/*.
      
    - Re: (Score:2)
      
      by diamondsw ( 685967 ) writes:
      
      Hi, this is the real world calling. We've been leaving messages for several years as Linux has failed to work on the desktop. We wanted to let you know that we've found the problem, and it's not going to be cheap to fix. Essentially, users want to be able to download and install software or install it from a CD and just have a binary work. "Package management" and dependency hell confuse them and remind them of DLLs on Win95.
      
      You're going to have to decide if you want every last thing to be GPL and zealou
      - Re: (Score:2)
        
        by Ant P. ( 974313 ) writes:
        
        Hi, this is the real world calling. The last software distributed on CDs that wanted kernel access in the way you're condoning, instead of behaving itself and using system libraries, was the Sony Rootkit.
        
        In summary, fuck off.
      - Re: (Score:3, Insightful)
        
        by Doug Neal ( 195160 ) writes:
        
        A huge number of problems in Windows can be attributed to its lack of package management. Every installer is pretty much allowed to do whatever it wants, put files where it wants, change registry keys, whatever.. and when was the last time you saw a Windows program with an uninstaller that worked? I mean really worked? They all leave crap lying around afterwards that they "couldn't" remove for some vague/unspecified reason. Sometimes you don't even get an uninstaller at all. There's no version tracking, and
    - - Re: (Score:2, Insightful)
        
        by ajs318 ( 655362 ) writes:
        
        How about us who don't want to recompile everything whenever a new kernel release comes out? It is a freaking pain in the butt.
        No it isn't. That's a filthy lie made up by people who want to sell you pre-compiled binaries and stop you mucking about with the Source Code, and nobody who can spell 'make clean && make install' believes it. (Or you could use Gentoo, which automates the recompilation; or a distribution using pre-compiled .rpm or .deb binary packages, which will have been recompiled for
        
        Re: (Score:2)
        
        by smenor ( 905244 ) writes:
        
        Perhaps what the GP meant wasn't that recompiling everything is difficult so much as slow, tedious, and annoying?
- Re: (Score:2)
  
  by 644bd346996 ( 1012333 ) writes:
  
  You seem to think that the name of a filesystem matters. It does not. In particular, desktop users should never have to know the name of their filesystem. If "ext3cow" keeps anybody from switching to Linux, it will be because they needed to learn the name of the filesystem, not because the name is pathetic.
