Fedora 11 To Default To the Ext4 File System 161
ffs writes "The next release of Fedora, 11, will default to the ext4 file system unless serious regressions are seen, as reported by heise online. The LWN story has a few comments extolling the virtues of the file system. Some benchmarks have shown ext4 to be much faster than the current default ext3. Some of the new features that matter for desktop users are a faster file system check, extents support (for efficiently storing large files and reducing fragmentation), multiblock allocation (faster writes), delayed block allocation, journal checksumming (saving against power / hardware failures), and others. The KernelNewbies page has more information on each feature. As is the extfs tradition, mounting a current ext3 filesystem as ext4 will work seamlessly; however, most new features will not be available with the same on-disk format, meaning a fresh format with ext4 or converting the disk layout to ext4 will offer the best experience."
EXT4 in Clusters? (Score:1, Informative)
No (Score:5, Informative)
Ext4 is not a SAN or distributed filesystem. GPFS/lustre/GFS remain a good choice for that.
Re: (Score:2, Interesting)
Re:EXT4 in Clusters? (Score:5, Insightful)
What I would like to see are clustered FSs which are easier to set up. I.e. You go to the 1st machine and start up the cluster config program and it asks: "Is this the 1st machine in your cluster?" Once you say yes there, you go to the other machines in turn, fire up the same program and say no to that question and enter the IP of the 1st machine.
Once all those machines are added, the next step is to select. "Add Local disk to cluster pool" and then you select partitions on your local hard drive that should be in the pool. They don't have to all be the same size either.
Once you have done that for each machine (either by going from one to the next or using the the tool on the primary node to add disks from each one (or a whole group of them if they are already partitioned in the same way).
Then you just start mounting this virtual disk and dumping files to it.
The technology exists to do this. The problem is that each time it's done' its a manual process tantamount to a programing job. Who want's to take up the task of tying all the pieces together to make the setup feel this simple for the user.
Additional functionality (like tuning the FS for Database or Email usage and failover hierarchy) would be added over time and in a way that dose not detract from the simplicity of that basic setup.
Re: (Score:3, Funny)
Re:EXT4 in Clusters? (Score:5, Insightful)
I have bartered PC repair and System admin services for competent legal advise, accounting service and even medical care on one occasion (Every desktop in my dentist's office had the "worm of the month").
Sensible people do what they are good at and wherever possible get others to do the other things.
This little project may take a day or a few months for a pearl wizard. I'm not sure. I do know it would take me years, if it got done at all.
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
It needs specifying. If you had example configuration files and how they would be changed with various operations, examples of what the dialogs should be like, and other details planned out you could probably get someone to program it pretty easily. It doesn't sound like it'd be a lot of code for a CLI app that helped with some of the discovery/etc.
Re: (Score:2)
Re: (Score:3, Informative)
Red Hat ship some web based tools called Luci and Ricci which basically do all of this, with a pointy-clicky interface.
Rich.
Re:EXT4 in Clusters? (Score:4, Informative)
1. The work for sharing a SAN but are not so useful for clustering local disks.
2. Even doing what they are good at, setup is a bit more tedious than I would like.
BTW: A nice setup menu dose not need to be GUI. Many of the console tools in the system-config-whatever series are nice.
Re: (Score:2)
Darn. I really like your
How does it compare to ext2? (Score:4, Interesting)
So where can I see some benchmarks showing just how much of a slowdown I can expect after switching from ext2 to ext4? All the benchmarks I see around here compare it to ext3 and to ReiserFS only. Also, is it possible to run ext4 without the journal? Any benchmarks on that? (Oh, and please, don't bother with the reliability lectures. I couldn't care less.)
Re:How does it compare to ext2? (Score:5, Informative)
is it possible to run ext4 without the journal?
Yes, it is [kernel.org]. And, as you can see in the link, ext4 is faster than ext2. Even with journaling.
Re: (Score:2)
You may be able to make one:
ext4_noj = {
features = extents,huge_file,flex_bg,uninit_bg,dir_nlink,extra_isize
inode_size = 256
}
# mke2fs -T ext4_noj ext4image.iso
mke2fs 1.41.3 (12-Oct-2008)
ext4image.iso is not a block special device.
Proceed anyway? (y,n) y
Filesystem label=
OS type: Linux
Block size=4096 (log=2)
Fragment size=4096 (log=2)
31296 inodes, 125000 blocks
6250 blocks (5.00%) reserved for the super user
First data block=0
Maximum filesystem blocks=130023424
4 block groups
32768 blocks per group, 32768 fragments per group
7824 inodes per group
Superblock backups stored on blocks:
32768, 98304
Writing inode tables: done
Writing superblocks and filesystem accounting information: done
But you might not beable to actually use it:
# mount -t ext4 -o loop ext4image.iso /mnt/loop1/ /dev/loop/0,
mount: wrong fs type, bad option, bad superblock on
missing codepage or helper program, or other error
In some cases useful info is found in syslog - try
dmesg | tail or so
# dmesg | tail
ext4: No journal on filesystem on loop0
I use ext4 on my media partition with no problems.
Re: (Score:2)
ext4 vs reiserfs & jfs would be particularly interesting tbh, its hard NOT to be faster than ext3/2.
Ext4 small files performance? (Score:5, Interesting)
I still haven't seen sensible benchmarks for ext4 with respect to how large directories scale, interleaved small file read and create, and small-file write with one fsync() at the very end (the only real world case.)
At this point, I have to wonder if the emporer has no clothes, or if the people posting benchmarks are just idiots.
Re: (Score:2, Interesting)
For those who are not filesystem wizzes, could you expand or provide a link on why this is important? I'm wanting to improve the performance of my boxes like everyone else but understanding the ins and outs of the filesystems is a weak point of mine. Thanks.
Re:Ext4 small files performance? (Score:5, Insightful)
Because disks are buffered, and fsyncing after every call (or forgetting to do so entirely) is silly.
I suppose somebody cares about how well they can expect their 124GB file to stream to disk, but for the rest of us mortals, we care about journalling support (check), a toolset (mostly check), and common-case performance, which in the *nix world involves a lot of reading and writing of small files.
I'd also like to see how these things perform under load, or when multiple benchmarks are running simultaneously.
Re: (Score:2)
maybe a silly question, but wouldn't running multiple benchmarks simultaneously cause spurious results?
Re: (Score:3, Interesting)
What he really means is having a new benchmark that has a combination of loads from other benchmarks -- this is closer to a real-world case than any one individual benchmark, which is some kind of extreme case.
Re:Ext4 small files performance? (Score:5, Interesting)
I suppose somebody cares about how well they can expect their 124GB file to stream to disk
I know for certain that I care about big-file performance in almost only these ways:
Can I write the file faster than the network sends it to me?
Can I read the file faster than the application (typically mplayer) needs to consume it?
When I know I shouldn't sit and wait for a larger task to continue, I really don't care how long it takes as long as I can do interactive stuff with good performance and the disk won't still be rattling when I go to sleep. Five minutes? An hour?
I'd rather have effort put into usability of disk management tools: four-way on-line resizing (left/right end moving left/right), on-line repacking (defragmentation) and on-disk format conversion, on-line repartitioning [which goes beyond the scope of ext4, of course] and things like that. A versioning file system would be cool, and btrfs snapshots sound like they'd be nice as well .
But that's the desires for my usage pattern, and I acknowledge that there are others.
Re: (Score:2)
I know for certain that I care about big-file performance in almost only these ways:
Can I write the file faster than the network sends it to me?
Whether or not that's at all difficult depends on the network in question. You could just about carve the data into stone tablets with a hammer and a chisel at the speed my cable modem delivers it, but when I'm moving files between machines on the LAN (Gig-E), it's a very different story.
Re: (Score:3, Funny)
Re: (Score:2)
Although, 50 Mbps is only a little over 5 MBps, which is an I/O rate even an ancient laptop drive with a lousy file system can handle.
Do they give you a decent upstream with that?
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
I'd like my filesystem to come with a fast way to backup incrementally without having to read all the metadata. Like a lightweight journal, a changelog only. I know Veritas and Tivoli have had journalling services that poll the filesystem for changes for a while. Would it be so hard to break the journals up into a few parts so I only have to look at changed files when I'm backing up incrementally. Or do they already do this?
Re: (Score:3, Informative)
You can see some of those benchmarks in this paper [fedoraproject.org] which explains the block allocator improvements that have been done in ext4.
Re: (Score:2)
Alright! And NOW I feel somewhat excited about ext4. I just wish for a change /. were posting things like this instead of the article in the summary.
Re: (Score:2)
It's not that important to me. But for once, I'd love to see fairly exhaustive benchmarks like I used to see for Reiser3. At least, at the time, they felt pretty good.
I imagine Ext4 is actually quite good. It would be nice to be able to see some quantification of how good that was not obviously flawed.
Fresh format vs conversion (Score:3, Informative)
I do not propose offering migration from ext3 by default, at this point, due to bugs in that process, and extra risk involved. Perhaps an "ext4migrate" boot option could be used to expose it for further testing.
Re:Fresh format vs conversion (Score:5, Interesting)
Good to know. Personally I'll be happy to use ext4 on new disks or when I'm really doing a complete re-install, but I'm in no hurry to "upgrade", seeing as my current ext3 disks are working just fine. I played with different filesystems once until I got some corruption and realized that one of the advantages of ext3 is that it's been around long enough that there are lots of tools to help with recovery and checking. So I'll probably stick with what I know until I have an opportunity to try out ext4, but I'm not going to go and reformat my disks right away.
Re: (Score:2)
It's not even that.
ext4 functions as an enabled flag on ext3.
So you can transition without formatting, or anything.
Re: (Score:3, Informative)
Re: (Score:3, Insightful)
Re: (Score:2)
So... you upgraded to a brand-new filesystem without first making a backup? Some glitches are to be expected at this point, just as when ext3 first came out. If everyone followed your advice and avoided ext4 "for at least five years," however, those glitches would never be found, much less fixed.
Re: (Score:3, Insightful)
So Ext4 in RH7 ? (Score:2)
Re:So Ext4 in RH7 ? (Score:5, Informative)
Thank you Red Hat (Score:5, Interesting)
Re: (Score:2)
And thanks to the Fedora users, apparently the first large user base that will (hopefully in full knowledge) be testing this thing for the benefit of the rest of the community (nothing against RedHat, somebody have to do the first step.)
Re:Thank you Red Hat (Score:5, Funny)
There is a saying in Spanish, which translated says:
"They are braver than the first men to try oysters!"
Re:Thank you Red Hat (Score:4, Interesting)
Stuff like: century eggs, tofu, lutefisk, casu marzu (not sure if the last is really that edible
Re: (Score:2)
Or extreme starvation: "we dropped our fish in the fireplace, but we'll surely die if we don't eat it anyway!"
Re: (Score:2)
According to the natives of many tropical jungle tribes, they watch what the animals eat, and adjust their diets accordingly.
Re: (Score:2)
How did the animals learn? Check out these parrots that eat clay so they can eat poisonous nuts and seeds:
http://www.highlightskids.com/Science/Stories/SS1201_parrotseatDirt.asp [highlightskids.com]
Re: (Score:2)
Animals learn from watching each other. Perhaps these parrots were able to determine that eating seeds that had fallen in this layer of clay were more edible that seeds from other areas. Then when they went to other areas, they found out that they could eat the seeds there.
Maybe they have a sense of taste/smell that can detect alkaloids and suitable antidotes. Mammals can smell salt/humidity and know instinctively that if they eat something salty, they should drink water.
Re: (Score:2)
Maybe a parrot that accidentally ate clay one day figured it out, and it eventually became a common tradition amongst parrots (who are intelligent enough to copy each other).
Re: (Score:2)
So that more mainstream distributions like Ubuntu can implement the technologies after they've been bug-tested.
</rimshot>
Re: (Score:2)
Thank you Fedora for both the freedom and the constant drive to be on the leading edge of technology.
Thank you, Fedora users, for not abandonding Red Hat when they started demanding money for their operating system and only giving away their alpha test version. You truly provide the earliest bug reports and suffer the most damage. The whole team is truly grateful.
Fedora 10 bugs (Score:2)
Fedora is my favorite distro, but this fakeraid bug is ridiculous -- keeping me from running F10 on my desktop. Sure runs nicely on my Samsung NC10, though.
You Can Always Tell The Pioneers ... (Score:2)
... They're the ones with the arrows in their backs! It's changes like this that underscore treating new distro versions as a public beta. Chances are, this or some other new feature will cause someone real pain. It's always a good idea to make sure that that someone is *not* you. Whether it's Fedora or OpenSuse, or Ubuntu, oftentimes features are added that aren't really ready for prime time. Trust no one.
Re: (Score:3, Informative)
Re: (Score:2)
Re: (Score:2)
I'm using them regularly. (Score:2)
Not bleeding edge use, though.
fs-driver (Score:2)
Ext4 in Ubuntu jaunty jackalope (Score:4, Interesting)
Apparently there is a serious risk of data loss at this time in case of power loss (at least in ubuntu). http://ubuntuforums.org/showthread.php?t=1040199 [ubuntuforums.org]
FS choices in the Datacenter (Score:4, Interesting)
One of my biggest beefs with ext3 in the data center is the required fsck periodically. Redhat won't support jfs or xfs (which I can get from CentOs) but some vendors won't support anything that isn't on their supported platform list (IBM Clearcase for one).
So is ext4 going to force a fsck at boot, which takes 1/2 a day with ext3 on some of my multi-Tb systems? Will Redhat finally adopt a better server filesystem? These are the questions that some of us doing professional Redhat support are asking.
Re: (Score:3, Informative)
So turn it off the periodical fsck then:
tune2fs -c 0 -i 0 /dev/foo
It's perfectly safe as long as the underlying blockdevice is safe (RAID).
Re: (Score:2)
It's perfectly safe as long as the underlying blockdevice is safe (RAID).
I'd rather have a filesystem that's perfectly safe period, thankyouverymuch.
Re: (Score:2)
No filesystem in current Linux is going to save you from silent bitflips. The only way to be protected from that is to use checksumming and parity calculation. Either you implement that in the block device (classic RAID) or in the filesystem (ZFS Z-RAID or similar) or you have to live with the possibility of corrupted data.
Re: (Score:2)
It's perfectly safe as long as the underlying blockdevice is safe (RAID).
And the filesystem driver is bug-free.
It WILL Help (Score:4, Informative)
Ext4 is orders of magnitude faster than Ext3 regarding fsck time. Your half-day checks will almost certainly be reduced to minutes. The developers rewrote the algorithm to not require as intensive of a search in phase 1.
If it's really important to get the machines up in minimal time (even at risk of some data loss) then you can turn off the auto checks entirely.
Re: (Score:2)
Take LVM snapshots and do periodic background fsck's, if they pass reset the last check time. If they fail set it to some time in the distance past and raise an alert.
However I have to agree, my biggest beef with Redhat is their boneheaded sticking with ext3, and the utter waste of effort that is ext4. It would be far more sensible to have picked either JFS or XFS (I don't care which) and used that instead.
Re: (Score:2)
The Ext guys need to take a lesson from UFS2 (FreeBSD 5.0, circa 2003) and perform the fsck in the background, at low priority, while the system is up and fully functional.
I hear Btrfs is going to eventually get similar capabilities, so maybe the answer is to just keep waiting.
Re: (Score:2)
I've been using a trick since the ext2 days to reduce fsck times by a lot.. Reduce the inode count. I seem to recall EXT3 allocates 1 inode for every 16kb of diskspace by default. This means 20 million inodes on a 300gb partitian. 2.5gb scanned, on every fsck! A lot of the time, this is overkill. I generally run my parititians with 1/5 to 1/20 this number of inodes. I don't have any partitians formatted under the defaults to compare this to, but a mostly full 300gb partitian, with 1M inodes and 50k files, f
Re: (Score:2)
And if there is a risk of corruption due to power outage (or hard crash), then why not write a
But I guess I'm missi
Lunatic Fringe (Score:2)
This is good news. All those Fedora folks can be beta testers. In five years or so I'll consider going from ext3 to ext4. It's only about a year since I went to ext3. I figured it must be OK by now since there haven't been any scare stories. I used to use Reiser before ext3 was stable.
xfs is really over-rated. I used to work on an "Enterprise" storage appliance that used xfs. It was scary. Don't go there. Also, avoid anything from IBM.
What's Best for SSDs? (Score:2)
Re: (Score:2)
Probably a silly worry. Every SSD out there provides it's own wear-leveling hardware, so the FS you use is fairly immaterial. Heck, most (all?) gear out there doesn't even provide a mechanism for direct access to the underlying storage, so you *can't* do the leveling in software (ie, with something like JFFS, etc) even if you wanted to.
Re: (Score:1, Funny)
Re:Why not ReiserFS? (Score:5, Funny)
Some of the minor features are just... less than desirable. [wikipedia.org]
Re: (Score:2)
Re:Why not ReiserFS? (Score:5, Informative)
ReiserFS isn't actively maintained. In addition to that ext3 and now ext4 have learned quite a few new tricks since ReiserFS first appeared, you can now online resize an ext3 filesystem, it supports hashed b-trees, which should speed up directory handling, it is getting an online defrag tool and a bunch of other goodies. So many of the benefits that ReiserFS originally brought to the table can now be have with ext3 or ext4.
Re: (Score:2, Informative)
In fact suse changed their defaults from reiser to ext3 long time ago [opensuse.org]. The main reasons were: scalability on SMP, xattrs performance, lack of mainteinance, lack of a upgrade path...
Re:Why not ReiserFS? (Score:5, Insightful)
Not to mention ext3 doesn't lose random OS files in the wake of a sudden power failure like reiserfs does. I might be alone on this event but after it happening on 3 different systems at 3 different times I'd think it would be documented more. I've always thought reiserfs was overrated and even if there is a speed increase data integrity is a lot more important to me.
It doesn't bother me that the creator is a convicted murderer, it does bother me that the file system gets away with murder all the time.
Re: (Score:2)
In the very very early days of ReiserFS, it had the tendency to trash files and it happened to me a couple of times, but it got fixed at some point and I didn't had any problem with that after that and have used ReiserFS3 happily for years.
Then on a new system (Ubuntu8.10) I tried XFS just for a change and I had file loss on day one, followed by continual file loss after each and every crash of the system (buggy Nvidia driver). The recommend solution to this was switching of hard drive cache, which however
Re: (Score:2)
ext3 fscks were just too slow for me, reiser is may be a bit unstable (my laptop's hardware is screwed the HDD bales out and the pci-bus lock up randomly but ive only needed to do a whole disk fsck once (200G in 20min wasnt too bad), so im yet to see it)
Because anything valuable is backed up and given how slow ext has generally been i have no intention of using ext for anything that doesn't contain critical data, and ill stick with reiserfs/jfs until a better alternative btfs, et al, comes out.
Re: (Score:2)
Not to mention ext3 doesn't lose random OS files in the wake of a sudden power failure like reiserfs does. I might be alone on this event but after it happening on 3 different systems at 3 different times I'd think it would be documented more.
How long ago was that?
I've used reiser almost exclusively for over five years now, including on my multi-TB home file server which has suffered lots of crashes due to power and -- for a time -- hardware issues (bad mobo).
In all that time, I've abused the heck out of the file system, including using one reiserfs that was on a failed RAID array that was forcibly reassembled and had missing data. I did lose a few files in that case, but that's the only time, and I was impressed reiser was able to rebuild
Re: (Score:2)
Critical systems should be on a UPS and have frequent backups... it makes those session much less drastic. That said, ext3 has always been an easier repair.
Re: (Score:2)
a filesystem should help insulate the data during sudden power loss, not pretty much guarantee that it will thrash all of your data. Sometimes, shit just happens despite the best planning.
Re:Why not ReiserFS? (Score:5, Funny)
Vendor lock-in?
ReiserFS is good... (Score:5, Funny)
for when you need to partition your wife.
Re: (Score:2)
Re: (Score:3, Interesting)
Shouldn't that be the other way around? If you mount an ext4 filesystem as ext3 the system will work, using only the ext3 subset of ext4. However, if you mount an ext3 as ext4 the system will try to use ext4 data structures which are simply not on the disk.
... or am I missing something?
Re:A few answers (Score:5, Informative)
Re: (Score:3, Informative)
Actually, ext4 can be mounted as ext3 provided extents are disabled when creating the ext4 fs. This may be useful for someone who wants some of the new features of ext4 but needs to mount the media on a machine that can't be upgraded to support it.
Re: (Score:2)
As is the extfs tradition, mounting a current ext3 filesystem as ext4 will work seamlessly; Shouldn't that be the other way around? If you mount an ext4 filesystem as ext3 the system will work, using only the ext3 subset of ext4. However, if you mount an ext3 as ext4 the system will try to use ext4 data structures which are simply not on the disk. ... or am I missing something?
Your missing something. Geeks are only worried about upgrading to the latest, so mount an ext3 and it will create the extra data for you. The other way round might work, but testing fallback is so unexciting
Re: (Score:2)
Re: (Score:3, Insightful)
Re: (Score:2)
For my own purposes, I can't use ReiserFS because I can't stop thinking about that guy when I install it. Just...ick.
For me, I can't use ReiserFS because I prefer to keep my data. I've had ReiserFS screw me over twice now. Once where the data was completely lost, and once where the data was damn-near lost but luckily recoverable with only a few files getting corrupted.
Re: (Score:2)
*has been using Reiser since 2002 and has yet to lose a byte*
(Having said that, I'mma switch my XFS and Reiser partitions to EXT4, just as soon as .28 is marked as stable in the Portage tree.)
BTW: Anecdotal evidence sucks. :(
Re: (Score:2)
Re: (Score:2)
Is Reiser the only FS you've used that fails under those loads?
Re: (Score:2)
Re: (Score:2, Insightful)
People are looking forward to ext4 because ext3 has performed so poorly compared to ReiserFS. No one wants to touch ReiserFS, regardless of its performance advantages over anything out today, because Hans was a psycho.
In a previous job, we tried everything we could to use ext3 for one of our server-based products, but the performance was never good enough compared to ReiserFS v3.
Hopefully the ext4 folks have learned how to implement all the excellent ideas Hans Reiser came up with between ReiserFS v3 and R