Linux Kernel Gets Fully Automated Test

Follow Slashdot stories on Twitter

Linux Kernel Gets Fully Automated Test 159

Posted by CmdrTaco on Sunday June 05, 2005 @11:45AM from the just-like-a-real-project dept.

An anonymous reader writes "The Linux Kernel is now getting automatically tested within 15 minutes of a new version being released, across a variety of hardware and the results are being published for all to see. Martin Bligh announced this yesterday, running on top of IBM's internal test automation system. Maybe this will enable the kernel developers to keep up with the 2.6 kernel's rapid pace of change. Looks like it caught one new problem with last night's build already ..."

This discussion has been archived. No new comments can be posted.

Linux Kernel Gets Fully Automated Test

Load All Comments

Search 159 Comments Log In/Create an Account

Comments Filter:

now all we need is automated.... (Score:5, Funny)

by 3seas ( 184403 ) writes: on Sunday June 05, 2005 @11:48AM (#12729387) Homepage Journal

code generation...

Share
twitter facebook
- Re:now all we need is automated.... (Score:2)
  
  by caluml ( 551744 ) writes:
  
  Actually, that could be done, could it not? Throw in some random functions, if/while/do loops in, return random variables, etc. It could create some funky new software. :)
  - Re:now all we need is automated.... (Score:3, Funny)
    
    by Baal Sebub ( 797455 ) writes:
    
    I already got 1 million monkeys in my basement working on it.
  - Re:now all we need is automated.... (Score:5, Interesting)
    
    by Curtman ( 556920 ) writes: on Sunday June 05, 2005 @12:26PM (#12729592)
    
    Actually, that could be done, could it not?
    
    Apparently it works for Samba [samba.org]. :)
    
    Parent Share
    twitter facebook
    - Re:now all we need is automated.... (Score:2)
      
      by petermgreen ( 876956 ) writes:
      
      code generation is good for repetitive stuff especially if your language doesn't have much in the way of a built in preprocessor
      
      say for example producing similar load on demand wrappers for a load of functions in a dynamic library.
      
      p.s. /. seems to be restricting me to one post every 15 mins right now dunno why (the error says Slashdot requires you to wait 2 minutes between each successful posting of a comment to allow everyone a fair chance at posting a comment.
      
      It's been 14 minutes since you last success
      - Re:now all we need is automated.... (Score:2)
        
        by Curtman ( 556920 ) writes:
        
        code generation is good for repetitive stuff especially if your language doesn't have much in the way of a built in preprocessor
        
        There's a fair bit of repetitive code in the kernel. I had to do some hacking to make some RS-422 cards we had work properly, and found that a lot of the char drivers especially contain very similar code, and structure. Code generation might help with older drivers that nobody cares about until they break. They tend to rot from the looks of things.
- Re:now all we need is automated.... (Score:3, Insightful)
  
  by maxwell demon ( 590494 ) writes:
  
  No problem. The following is an automated code generator. It generates a hello world program in C and writes it to stdout. (untested)
  
  #include <stdio.h> int main() { char const* program_pattern = "%s%s"; char const* include_pattern = "#include <%s>\n"; char const* function_declaration_pattern = "int %s(%s)"; char const* function_definition_pattern = "%s\n{\n %s;\n}\n"; char const* print_pattern = "printf(%s)\n"; char const* string_pattern = "\"%s\""; char const* stdio_header_name = "stdi
  - Re:now all we need is automated.... (Score:1)
    
    by kurzweilfreak ( 829276 ) writes:
    
    Beautiful. Scale it up to output a new Linux kernel.
  - Re:now all we need is automated.... (Score:2)
    
    by pchan- ( 118053 ) writes:
    
    holy crap, remind me to never hire you. if you know exactly what the string is going to look like, why didn't you just write it? if you expect it to change, why did you hard code the length values into the buffers? #include <stdio.h> int main(void) { puts("#include<stdio.h>\n" "int main(void)\n" "{\n" " puts(\"hello world!\");\n" " return 0;\n" "}"); return 0; }
    - Re:now all we need is automated.... (Score:3, Insightful)
      
      by jrockway ( 229604 ) writes:
      
      It was a joke, dumbass.
      
      If you're going to used fixed-length buffers, though, at least use sNprintf!
- Re:now all we need is automated.... (Score:1, Insightful)
  
  by Anonymous Coward writes:
  
  Its called lisp ;-)
- Re:now all we need is automated.... (Score:2)
  
  by MemoryDragon ( 544441 ) writes:
  
  Whats so funny about it, code generation is used left and right in modern projects, this stuff is great to shift the grundwork away from the developers and not having to go into outsourcing hell.
- Re:now all we need is automated.... (Score:1)
  
  by northcat ( 827059 ) writes:
  
  Why hasn't anyone mentioned lex or yacc yet?
- Re:now all we need is automated.... (Score:2)
  
  by 3seas ( 184403 ) writes:
  
  I think the kick in teh butt humor found in this is that of the computer auto generating the code, auto compiling it and auto testing it and regenerating code for improvement based on test results, ... loop it Johnny Mnemonic... uhhh ertrrr Neo..
  
  Being one fully aware of the possiblities of auto coding or using code generators, both of which exist today in one form or another, just not so completely available wide scope on much of any user/consumer platform..
  
  I was being serious but certainly found the hum
- IBM's Rational Software (Score:2)
  
  by erik umenhofer ( 782 ) writes:
  
  May not be 100% or even hardcore, but you can go from use case to code if you put in some time. It will also write Java code using your UML diagrams.
  
  It's based off of Eclipse. Check it out if you can.
- - - Re:NOT FUNNY: Chinese Military Software (Score:1, Offtopic)
      
      by dhakbar ( 783117 ) writes:
      
      Um, he's the well known "phrusa troll" that usually only posts in China-related stories.
      
      Sometimes, though, he interjects his posts into unrelated articles.
Why has it taken so long? (Score:1, Redundant)

by Beatlebum ( 213957 ) writes:

Why has it taken so long?
- Re:Why has it taken so long? (Score:2, Insightful)
  
  by Anonymous Coward writes:
  
  Bitkeeper.
- Re:Why has it taken so long? (Score:3, Informative)
  
  by teh_cn ( 887491 ) writes:
  
  mod me troll, but (free)bsd had this for years and not only for the kernel, but for world, too.
  - Re:Why has it taken so long? (Score:2, Funny)
    
    by VStrider ( 787148 ) writes:
    
    They had to. There isn't anyone left to do the testing.
- Re:Why has it taken so long? (Score:1)
  
  by Uerige ( 206572 ) writes:
  
  Because you didn't do it.
  - Stop playing the "then do it yourself" card, guys (Score:2)
    
    by CarpetShark ( 865376 ) writes:
    
    Because you didn't do it.
    Please don't play this card all the time. We hear it way too often in the Free Software/Open Source communities, and it's really quite silly.
    The grandparent post asked if it would make more sense to do it another way. That's a perfectly valid and logical question. Either he's right, and it does make more sense, or he's wrong (for a variety of reasons), and it's best to keep it the way it is. None of these require one person to do it incorrectly, and another to do it proper
- Re:Why has it taken so long? (Score:2)
  
  by ikewillis ( 586793 ) writes:
  
  Good question, especially considering FreeBSD Tinderbox [sentex.ca] has been doing this sort of thing for years, and not just with the kernel but with the entire base system.
- Re:Why has it taken so long? (Score:2)
  
  by putaro ( 235078 ) writes:
  
  Real men don't test their code - they just post it on the Internet and let everyone else do it for them.
- - Re:Why has it taken so long? (Score:1, Funny)
    
    by Anonymous Coward writes:
    
    How many tests has his what written?
Within 15 Minutes? WTF (Score:1, Insightful)

by LCookie ( 685814 ) writes:

"The Linux Kernel is now getting automatically tested within 15 minutes of a new version being released"

Would be much better to test it BEFORE a new version is being released, otherwise this is completely useless...
- Re:Within 15 Minutes? WTF (Score:2)
  
  by Thing 1 ( 178996 ) writes:
  
  Great idea. You should ask IBM to integrate their test platform into Linus' processes. He might be dubious after BitKeeper (that idiot) about another company helping him, but in this case I think it's a great idea.
  There may be (and probably are) other test beds out there, testing releases. It would be better for Linus (and the world) if he could release already-tested code to the world, instead of having the world duplicate all the testing effort, and IBM seems like a perfect solution.
  - Re:Within 15 Minutes? WTF (Score:5, Informative)
    
    by oxfletch ( 108699 ) writes: on Sunday June 05, 2005 @12:02PM (#12729463)
    
    I automatically test every nightly -git snapshot release, so it's fairly well tied in anyway. This also means my heaviest usage of our machines is at night, when most of the (US) developers are asleep.
    
    So it's fairly well tied in already ... and the whole -rc cycle should enable us to catch a lot of stuff.
    
    Parent Share
    twitter facebook
    - Re:Within 15 Minutes? WTF (Score:1)
      
      by netdur ( 816698 ) writes:
      
      > when most of the (US) developers are asleep
      as far as I know (US) developers sleeping during the night time in... China
      - Re:Within 15 Minutes? WTF (Score:2)
        
        by fbjon ( 692006 ) writes:
        
        Aer you a (US) developer, then? And what the hell is a (US) developer anyway, that's a weird-ass smiley if I ever saw one.
  - Re:Within 15 Minutes? WTF (Score:1)
    
    by DegeneratePR ( 889051 ) writes:
    
    In any case, most people, especially in mission-critical processes, don't compile a new kernel as soon as it's released. Myself, I try kernels after a while, when no major issues are found. Even then, I test them out first in different test machines. So 15 minutes before, 15 minutes after, it's all the same.
    - Re:Within 15 Minutes? WTF (Score:2)
      
      by Thing 1 ( 178996 ) writes:
      
      But it's not all the same, though. Once it's "blessed" by Linus, it's released. If he had access to the test machines prior to releasing it, he could release higher-quality code.
      And since the entire test run only takes 15 minutes, IBM (and the world) would benefit from allowing him multiple tests per release.
- Comment removed (Score:4, Insightful)
  
  by account_deleted ( 4530225 ) writes: on Sunday June 05, 2005 @11:57AM (#12729432)
  
  Comment removed based on user account deletion
  
  Parent Share
  twitter facebook
  - Re:Within 15 Minutes? WTF (Score:4, Insightful)
    
    by digitalunity ( 19107 ) writes: <digitalunityNO@SPAMyahoo.com> on Sunday June 05, 2005 @03:10PM (#12730431) Homepage
    
    Ummm...
    
    If everyone did this, the newest kernels would never get tested. I think it is important that we have a diverse range of users using new, almost new, and older but well tested kernels.
    
    Parent Share
    twitter facebook
- Re:Within 15 Minutes? WTF (Score:3, Insightful)
  
  by doshell ( 757915 ) writes:
  
  "Release" in the open source world has a broader sense than in commercial software. In open source not all "released" versions are meant for general public consumption; they include unstable versions targeted mostly at developers, so that severe isues can be detected and patched quickly.
  
  Taking this into account, I believe this is meant to catch bugs mainly in nightly (unstable) builds and release candidates, not in "final" versions (those should, at least in theory, have no serious bugs left around as th
- Re:Within 15 Minutes? WTF (Score:5, Informative)
  
  by Metteyya ( 790458 ) writes: on Sunday June 05, 2005 @12:06PM (#12729489)
  
  because they are nightly builds, that is - versions with applied patch, but untested yet.
  
  Parent Share
  twitter facebook
- Wait a minute... (Score:3)
  
  by RoLi ( 141856 ) writes:
  
  So let me summarize wether I understood it right:
  You say it's "completely useless" because you have to wait 15 minutes when a kernel is released.
  
  And this is modded "insightful".
Question: (Score:5, Interesting)

by bogaboga ( 793279 ) writes: on Sunday June 05, 2005 @11:50AM (#12729396)

How were the previous kernels being tested? Were sources for improvement/change/modification, bugs and areas requiring refactoring being discovered by chance?

Share
twitter facebook
- Re:Question: (Score:3, Informative)
  
  by Anonymous Coward writes:
  
  " How were the previous kernels being tested?"
  
  Hey guys, new kernel is out, bang away at it and let me know what you think.
  - Re:Question: (Score:3, Funny)
    
    by steve_l ( 109732 ) writes:
    
    I thought it was "hello, here is a new release of fedora for you to install..."
  - Re:Question: (Score:2)
    
    by xtracto ( 837672 ) writes:
    
    So they where using the thousand moonkeys approach uh?
    - - Re:That would be good, except (Score:2)
        
        by HishamMuhammad ( 553916 ) writes:
        
        The moon keys are hidden in the Star level 3, after you pass the warp zone in the Chocolate forest. You need a cheat to get a thousand, though.
- Re:Question: (Score:1)
  
  by ignorant_coward ( 883188 ) writes:
  
  After a new kernel was released, power meters on mothers' basements everywhere saw a little blip. Add up all these blips, and you get a (somewhat) tested kernel.
- Re:Question: (Score:2)
  
  by ImaLamer ( 260199 ) writes:
  
  In Soviet Union Kernel Tests You!!!
How much testing? (Score:2, Interesting)

by anthony_dipierro ( 543308 ) writes:

This is good, and long overdue (I'm surprised it hasn't been around for years), but just how much testing is being done? Compiling? Booting? Or are there actual functional and reliability tests which are being performed?
- Re:How much testing? (Score:5, Informative)
  
  by oxfletch ( 108699 ) writes: on Sunday June 05, 2005 @12:06PM (#12729483)
  
  Compiles, boots, runs dbench, tbench, kernbench, reaim, fsx. If one test fails, it'll highlight it
  in yellow, rather than green or red. I have a few of those in the internal tests, but not the external set.
  
  This is only the tip of the iceberg as to what can be done. We're already running LTP, etc internally, and several other tests. Some have licensing restrictions on results release (SPEC) ... LTP is a pain because some tests always fail, and I have to work out the differential against baseline. Will come later.
  
  Parent Share
  twitter facebook
What took so long (Score:4, Interesting)

by Timesprout ( 579035 ) writes: on Sunday June 05, 2005 @11:53AM (#12729405)

Most projects of any complexity use automated continuous build and testing as a standard development practise.

Share
twitter facebook
- Presumably... (Score:5, Insightful)
  
  by Kjella ( 173770 ) writes: on Sunday June 05, 2005 @11:57AM (#12729433) Homepage
  
  ...the cross-platform, cross-hardware part? Setting up one machine to build automatically is easy. Setting up a whole bunch of them (and all unique, read administration nightmare) and tie them together to a system, that's quite a bit of work.
  
  Kjella
  
  Parent Share
  twitter facebook
  - Re:Presumably... (Score:5, Informative)
    
    by oxfletch ( 108699 ) writes: on Sunday June 05, 2005 @12:10PM (#12729514)
    
    Indeed. The automation system I wrote is just a wrapper around an internal harness called ABAT that has a massive amount of work behind it. If systems crash it can detect that, power cycle them, etc.
    
    Going from 90% working to 99.9% working is frigging hard. I had all this working 3-6 months ago, but the results weren't good enough quality to be published. Several people internally put a massive amount of work into improving the quality and stability of the harness.
    
    Parent Share
    twitter facebook
    - Re:Presumably... (Score:3, Insightful)
      
      by Bob_Robertson ( 454888 ) writes:
      
      I don't remember who said it first:
      
      The first 90% takes 10% of the time.
      
      The last 10% takes 90% of the time.
      
      I expect one could substitute "money", "labor", "effort" for "time" in the above.
      
      Bob-
      - Re:Presumably... (Score:2)
        
        by Knetzar ( 698216 ) writes:
        
        It's generally known as the 80/20 rule. 80% takes 20% of the effort, while the other 20% takes 80% of the effort.
        
        The idea is the same though.
  - Re:Presumably... (Score:2)
    
    by TCM ( 130219 ) writes:
    
    ...the cross-platform, cross-hardware part?
    
    It's magic [netbsd.org]! A single script and I can build a complete operating system for a big-endian 64bit architecture on a 32bit little-endian architecture, or any of the other 48 supported archs. More than that, I can build a complete NetBSD for any arch on any halfway POSIXish system.
    
    build.sh bootstraps its own contained build utils (compiler, binutils et al) and builds the system with that. You can even build the complete system as non-root and get tarballs that you ca
    - Re:Presumably... (Score:2)
      
      by Nutria ( 679911 ) writes:
      
      How long did it take you to create+debug+tweak that script?
      
      How much testing does it do, other than "compile +link"?
      - Re:Presumably... (Score:2)
        
        by TCM ( 130219 ) writes:
        
        1. The script is part of NetBSD.
        2. http://cvsweb.netbsd.org/bsdweb.cgi/src/regress/ [netbsd.org] is supposedly used for regression testing. Ask a developer, I'm just a user :)
  - Re:Presumably... (Score:2)
    
    by duffbeer703 ( 177751 ) writes:
    
    We've been playing with some IBM tools at work that automate server setup and provisioning... its pretty amazing stuff.
    
    You can basically retask servers in something like 10-60 minutes depending on what you are doing, and its a completely automatic process.
  - That is what aegis does (Score:3, Interesting)
    
    by nietsch ( 112711 ) writes:
    
    http://aegis.sf.net/ [sf.net]aegis.sf.net
    and it can do a lot of other things too, like making sure that each change has an accompagning test and that all tests pass before anybody else is bothered with that change.
    
    The biggest downside for aegis (as I see it) is that it needs to run on a central development server, it is not server based like CVS or the others(it has a cvs-like interface for reading). But OTOH, would it be so hare to have the kernel developers log into a central compile farm where the linux kernel i
- Re:What took so long (Score:2)
  
  by Rakshasa Taisab ( 244699 ) writes:
  
  They've been doing that the whole time, they call them "users".
Maybe... (Score:2, Interesting)

by ratta ( 760424 ) writes:

automated performance regression tests may be useful too.
- Re:Maybe... (Score:5, Informative)
  
  by oxfletch ( 108699 ) writes: on Sunday June 05, 2005 @12:19PM (#12729552)
  
  The results are all there if anyone wants to play with them. Go to the results matrix, and click on the numerical part of the green box. Pick a test, and drill down to the results directory.
  
  The numbers are there, it's just a question of drawing graphs, etc. I have some for kernbench already, but I'm not finished automating them. If anyone wants to email me code to generate them from the directory structure published there, feel free ;-) Preferably python or perl into gnuplot.
  
  Parent Share
  twitter facebook
  - Re:Maybe... (Score:2)
    
    by Nutria ( 679911 ) writes:
    
    Instead of just reading a bunch of complaints, let me be 1 Slashdotter to thank you for your efforts.
    
    It's too bad the Stanford Checker can't be integrated into your system.
This is awesome (Score:5, Insightful)

by jnelson4765 ( 845296 ) writes: on Sunday June 05, 2005 @11:54AM (#12729410) Journal

But it can't catch everything - the 1394 bus was screwed in 2.6.11. There are a lot of regressions that show up - and even that healthy cluster of systems will not show every problem.

Sound issues? Older network and SCSI cards? There are a lot of drivers that break, and no one notices it because there is nobody with the hardware testing the -rc or -mm kernels.

Wouldn't it make more sense to package these tools for someone to install on their collection of oddball equipment, and assist in the debugging/testing?

Where's the ARM, MIPS, and SH?

Share
twitter facebook
- Re:This is awesome (Score:5, Insightful)
  
  by Meshach ( 578918 ) writes: on Sunday June 05, 2005 @12:12PM (#12729520)
  
  But it can't catch everything...
  
  But that is not the point of automated testing. As a member of a qa team who is developing automated tests I get comments like that every day
  
  Automated tests are not intended to catch everything or test strange permutations of pre-conditions. There purpose is to provide a mechanism for verifying that a build satisfies the basic requirements of the project.
  
  More exotic configs need to be tested manually as usual but automated tests can provide a "failsafe" just in case a basic part of the build is broken.
  
  Parent Share
  twitter facebook
  - Furthermore, it prevents regressions (Score:4, Insightful)
    
    by xant ( 99438 ) writes: on Sunday June 05, 2005 @03:01PM (#12730389) Homepage
    
    Reliable, repeatable testing is a great way to prevent fixes in one area from causing bugs in another. When I fix A, I generally only test A manually. I don't test every other conceivable code path, even though my fix for A might well impact them.
    
    An automated test for B will catch regressions caused by my fix in A, making it harder to backslide. Backsliding is very expensive because bugs are far removed from their cause. If an automated test sees that changes in A caused a regression in B, the cause is immediately obvious.
    
    Parent Share
    twitter facebook
  - Re:This is awesome (Score:2)
    
    by Mr. Underbridge ( 666784 ) writes:
    
    Automated tests are not intended to catch everything or test strange permutations of pre-conditions. There purpose is to provide a mechanism for verifying that a build satisfies the basic requirements of the project.
    Isn't that what a compiler is for? ;)
    - - Re:This is awesome (Score:2)
        
        by Mr. Underbridge ( 666784 ) writes:
        
        Meshach, meet joke. Joke, Meshach.
- Re:This is awesome (Score:2)
  
  by zappepcs ( 820751 ) writes:
  
  I agree with jnelson4765, new buids would be well served to be tested on a great many machines with a wide variety of hardware setups.
  
  Who should map the hardware testing platforms? I don't know, but I do know that if the new kernel builds are tested for a generic group of hardware and released, then other testers report on their tests using hardware X, you would end up with a relatively quick listing of a new build against many variants of hardware. Published correctly, it would allow people to search for
- Re:This is awesome (Score:2)
  
  by Cylix ( 55374 ) * writes:
  
  Unfortunately, organizing that kind of odd ball testing would be a management nightmare unless you want to go out and collect all of the hardware. Remember, some people do post patches and whole driver releases without stepping inside of the kernel team's realm.
  
  The only real way to automate something like that would be a dummy load facility. Some software which would emulate the hardware being in place. Something conceptually similar to that effect anyway.
  
  So then, for every driver for a device, you have a
- Re:This is awesome (Score:2)
  
  by Nutria ( 679911 ) writes:
  
  Where's the ARM, MIPS, and SH?
  
  IBM doesn't sell any ARM, MIPS or SH-based systems. So, they don't test them.
  
  The Debian buildd system is an automatic building and semi-testing system for, of course, all the archs that Debian supports, and that includes ARM, MIPS, and SH.
- Re:This is awesome (Score:2)
  
  by team99parody ( 880782 ) writes:
  
  Wouldn't it make more sense to package these tools for someone to install on their collection of oddball equipment, and assist in the debugging/testing?
  That's how the PostgreSQL build farm [pgbuildfarm.org] works. People with wierd hardware [onlamp.com] apply to be added to the automated test farm. ARM, MIPS, PARISC, Alpha, PowerPC, Sparc, etc. are all represented well in the postgresql automated tests.
ARM Linux has something similar (Score:5, Informative)

by kyllikki ( 88559 ) writes: on Sunday June 05, 2005 @11:54AM (#12729411) Homepage

ARM Linux has had something similar in Kautobuild [simtec.co.uk] for some time.

Although the testing and building is limited to the ARM platform.

The site also has a whos who thats worh looking at ;-)

Share
twitter facebook
Related projects at OSDL (Score:2, Informative)

by anandpur ( 303114 ) writes:

Related projects at OSDL
http://osdl.org/projects/26lnxstblztn/results/ [osdl.org]
http://developer.osdl.org/cherry/compile/ [osdl.org]
News Flash (Score:5, Informative)

by sirReal.83. ( 671912 ) writes: on Sunday June 05, 2005 @12:02PM (#12729464) Homepage

Red Hat (and probably Novell/SuSe, since they use over one thousand kernel patches) runs a myriad of tests on each of its own kernel builds nightly - and has been doing so for years. On more than just the 3 architectures covered by this test.

That said, pushing tests upstream is a great idea. Just not revolutionary or anything.

Share
twitter facebook
- Re:News Flash (Score:2)
  
  by bruthasj ( 175228 ) writes:
  
  News Flash #2:
  
  Redhat has several engineers that *are* upstream.
- Re:News Flash (Score:2)
  
  by bill_mcgonigle ( 4333 ) * writes:
  
  Man, I wish they'd test Fedora kernel releases on their test farm. Of a dozen different machines I've run 2.6 Fedora kernel releases on, I've lost 1394 on one, USB on another, the hardware clock, on a third, parallel port probing on the third, serial ports on a fifth, and the Compaq Smart Array on the sixth.
  
  The other six machines seem OK. But that's a 50% buggered rate from various flavors of 2.6 upgrades, mostly from nightly 'yum update's. These are all IBM, Compaq, HP, and Dell machines, so somebody's
Long uptimes (Score:5, Interesting)

by rice_burners_suck ( 243660 ) writes: on Sunday June 05, 2005 @12:02PM (#12729465)

This is a very smart system. The Samba team uses something very similar. The key to finding regressions with this method is to create tests for every piece of functionality, and to integrate it with the rest of the testing suite, so that each function of the kernel will be continuously tested. For new features, it is preferable to create these tests as the features are being coded. For existing millions of lines of code, it is necessary for some brave souls to go in and create these tests.
I hope they are using code from the Linux testing suite. That piece of work has already formed a nice set of tests. Also, I hope that the kernel is automatically built with many different combinations of options. And with time, I hope this will become better. The more tests, with the more hardware configurations, with the more kernel configurations, with the more types of input data (including many imaginative forms of incorrect input data to test that the kernel handles it gracefully and thwarts attacks based on such methods), the better quality we will have in the kernel, and it is likely that Linux will be unmatched in quality, stability, efficiency (well, maybe not efficiency necessarily), and long uptimes.

Share
twitter facebook
through the looking glass... (Score:4, Funny)

by moviepig.com ( 745183 ) writes: on Sunday June 05, 2005 @12:06PM (#12729485)

With an automated test suite, what happens when a class of bug is discovered to be untested-for? Presumably, the suite is modified to detect it. Then, is the resulting new suite itself subjected to an automated test suite? And, then...[divide-by-zero error...]

Share
twitter facebook
- Re:through the looking glass... (Score:4, Informative)
  
  by oxfletch ( 108699 ) writes: on Sunday June 05, 2005 @12:15PM (#12729538)
  
  There is indeed an internal self-test suite on the harness. It's not desperately sophisticated, and I wouldn't dare show it to anyone ;-) However, it does catch a lot of stupid bugs. It requires some manual intervention/inspection to work.
  
  Plus, there's a separate development grid where we test new test-harness code before it's put onto the
  production grid.
  
  Parent Share
  twitter facebook
- Re:through the looking glass... (Score:1)
  
  by sirReal.83. ( 671912 ) writes:
  
  divide by zero? what kind of crazy hacks did you put in that algorithm, boy? ;)
- Re:through the looking glass... (Score:1)
  
  by EvanED ( 569694 ) writes:
  
  You're not looking at a divide-by-zero error, but a stack overflow from the infinite recursion.
  - Re:through the looking glass... (Score:3, Funny)
    
    by moviepig.com ( 745183 ) writes:
    
    You're not looking at a divide-by-zero error, but a stack overflow from the infinite recursion.
    You're right, I made a mistake. I shall modify my test suite forthwith... [divide-by-zero error]
Does this mean... (Score:3)

by blixel ( 158224 ) writes: on Sunday June 05, 2005 @12:09PM (#12729506)

Does this mean we'll get back to 2.6.x releases? Instead of new version of 2.6.x being released as 2.6.x.x every third day?

Share
twitter facebook
Safety issues (Score:5, Funny)

by DruggedBunny ( 703795 ) writes: on Sunday June 05, 2005 @12:23PM (#12729569) Homepage

Martin Bligh announced this yesterday, running on top of IBM's internal test automation system.

Hope he doesn't fall off and hurt himself.

Share
twitter facebook
- Re:Safety issues (Score:2)
  
  by rbarreira ( 836272 ) writes:
  
  Geez man, you made my day... Or minute, at least :)
cool to see this publicly announced (Score:1)

by emmastrange ( 768051 ) writes:

I got to work on part of this system, which IBM calls Autobench, for my senior project at PSU. The system is a highly configurable framework which can download, compile, and run various benchmarks and profilers (for example while compiling a kernel). Its all centrally administered, so IBM can run a battery of tests on a variety of different machines at once.

I think Martin Bligh said that IBM has been using this for a while now, automatically downloading kernels upon release and testing them. The new thin
2.6.12 on amd64 (Score:2)

by scharkalvin ( 72228 ) writes:

needs work! The latest builds all failed!
- Re:2.6.12 on amd64 (Score:1)
  
  by StupidKatz ( 467476 ) writes:
  
  Considering that 2.6.12 hasn't been released yet, it just might be the case that they are still, oh, I don't know, working on it?
Linux enters the world of QA 101! (Score:1)

by mrkitty ( 584915 ) writes:

Years later and finally it is getting some *basic* QA testing done! What will they think of next!
- Re:Linux enters the world of QA 101! (Score:1)
  
  by SlashMaster ( 62630 ) writes:
  
  I'd expect the community to start advocating unit testing, an agile development practice, at some point to increase the reliabilty of code before it is even merged into the nightly builds.
  I realize that this is not the same as testing the entire package on dissimilar hardware like he is doing here; For instance, there are bound to be a few issues when developers of code and its underlying code base both submit updates the same evening. IMHO, it'd especially help new developers if there existed unit tests
- Re:Linux enters the world of QA 101! (Score:1)
  
  by LnxAddct ( 679316 ) writes:
  
  Individual distros have been doing this for years. Red Hat is one company that is known for its extensive testing of the kernel (as well as many other OSS projects). Don't use a vanilla kernel if you're running a production environment.
  Regards,
  Steve
- - Re:Linux enters the world of QA 101! (Score:2)
    
    by oxfletch ( 108699 ) writes:
    
    Firstly, because both RHEL and SLES pull their base from mainline kernel. I'm damned if I'm going to fix bugs 3 times - RHEL, SLES, and back in mainline. Let's fix it once, before it spreads.
    
    Secondly, it's MUCH, MUCH easier to fix a bug the night after it went in, not 3 months later. Everyone has context as to what's goin on fresh in their minds, and the change hasn't been buried under 7 tons of other crap.
Is this even worth anything? (Score:2)

by xenocide2 ( 231786 ) writes:

One of the main goals appears to be whether the kernel builds or not. I shouldn't have to tell slashdot that build errors are among the most trivial of OS programming errors. They certainly exist, as the chart shows, but whoever is in charge of this project has a long way to go, by adding real tests of functionality. Consider it job security ;)
- Re:Is this even worth anything? (Score:2)
  
  by oxfletch ( 108699 ) writes:
  
  For one, did you actually bother to look at the results at all, and what tests are being run, and
  published?
  
  For another, this is only the tip of the iceberg as to what can be done, but I'm not going to lock whatever I have now in some dingy dungeon until it's "finished". What's there is useful, ableit incomplete. Testing is *never* complete.
  
  The main goal, as you put it, is to improve the quality of the linux kernel. If we can ensure the kernel builds, boots, and runs basic tests ... in a fully automated wa
  - - Re:Is this even worth anything? (Score:2)
      
      by xenocide2 ( 231786 ) writes:
      
      Well, there is testing. It's just done by firms like RedHat and it's not made publicly available. There's also a very public and fairly thourough testing procedure done by Debian, but they don't specifically target the kernel, the way this particular system is.
Any other Open Source projects have similar? (Score:2)

by team99parody ( 880782 ) writes:

I think the PostgreSQL buildfarm [pgbuildfarm.org] is one of the coolest ones I've seen. It's distributed across a bunch of volunteer-run machines representing a broader selection of architectures than most any other automated-test projects I'm aware of. A nice article on it can be found here [onlamp.com]
Any other projects out there with similar transparency in their automated testing?
The same thing for NetBSD (Score:2)

by hubertf ( 124995 ) writes:

NetBSD has about the same thing - compiling of the whole operating system (kernel, userland, X) for ~50 platforms. Logs are available [netbsd.org] for developers to fix things.
- Hubert
- Re:Well, this time I am really unhappy! (Score:2, Insightful)
  
  by posternutbaguk ( 637765 ) writes:
  
  Current 2.6x very kernels unstable? Linux does not have any stable version? Obviously you havn't even used Linux in the last year or so.
  
  Testing a product to make it better doesn't mean the product is bad to start with. Some code has higher aspirations than that.
  - Re:Well, this time I am really unhappy! (Score:2)
    
    by kompiluj ( 677438 ) writes:
    
    The problem is that I have extensively and exclusively used Linux 2.6 the whole last year since I migrated from 2.4 because it lacked the features I needed. Back in the good old days it was that Linux had two versions: the development one - uneven numbers - like 2.1 and stable - even numbers - like 2.0. Now the development is in the "stable" branch - 2.6. And this results in big problems. You can get used to them on the desktop, you get mad having to drive to the server room because your 2.6.x server has li
- Re:Well, this time I am really unhappy! (Score:2)
  
  by toddestan ( 632714 ) writes:
  
  1) the very need for such tests means that current 2.6.x kernels are very unstable - this means that Linux currently does not have any stable version - not good
  
  If you think that the 2.6.x kernels are unstable, you can use the 2.0, 2.2, or 2.4 kernels. All those versions are still being maintained, and they are definently stable.
  - Re:Well, this time I am really unhappy! (Score:2)
    
    by kompiluj ( 677438 ) writes:
    
    Now really dear sir, how am I to use 2.0, 2.2 or 2.4, since all new features, which I need are only in 2.6? Because everybody believes that 2.6 is a stable release (which in theory is indicated by the even release-subnumber) features got added to 2.6 and nobody bothers with backporting them to 2.4. And this is really a big problem. If there was a development version, then 2.6 would be rock-solid stable, I would get no freezes, panics and other nasty events and the features would go to the stable kernel - li
- - Re:Well, this time I am really unhappy! (Score:1)
    
    by unleashedgamers ( 855464 ) * writes:
    
    2) Riiiiight. I'm really sure 2000 and XP aren't an improvement over Windows 95.
    
    2000 and XP are way diffrent than 95.
    
    Windows '95, '98 and ME are descended from DOS and Windows 3.x, and contain significant portions of old 16-bit legacy code. These Windows versions are essentially DOS-based, with 32-bit extensions. Process and resource management, memory protection and security were added as an afterthought and are rudimentary at best. This Windows product line is totally unsuited for applications where

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

now all we need is automated.... (Score:5, Funny)

Re:now all we need is automated.... (Score:2)

Re:now all we need is automated.... (Score:3, Funny)

Re:now all we need is automated.... (Score:5, Interesting)

Re:now all we need is automated.... (Score:2)

Re:now all we need is automated.... (Score:2)

Re:now all we need is automated.... (Score:3, Insightful)

Re:now all we need is automated.... (Score:1)

Re:now all we need is automated.... (Score:2)

Re:now all we need is automated.... (Score:3, Insightful)

Re:now all we need is automated.... (Score:1, Insightful)

Re:now all we need is automated.... (Score:2)

Re:now all we need is automated.... (Score:1)

Re:now all we need is automated.... (Score:2)

IBM's Rational Software (Score:2)

Re:NOT FUNNY: Chinese Military Software (Score:1, Offtopic)

Why has it taken so long? (Score:1, Redundant)

Re:Why has it taken so long? (Score:2, Insightful)

Re:Why has it taken so long? (Score:3, Informative)

Re:Why has it taken so long? (Score:2, Funny)

Re:Why has it taken so long? (Score:1)

Stop playing the "then do it yourself" card, guys (Score:2)

Re:Why has it taken so long? (Score:2)

Re:Why has it taken so long? (Score:2)

Re:Why has it taken so long? (Score:1, Funny)

Within 15 Minutes? WTF (Score:1, Insightful)

Re:Within 15 Minutes? WTF (Score:2)

Re:Within 15 Minutes? WTF (Score:5, Informative)

Re:Within 15 Minutes? WTF (Score:1)

Re:Within 15 Minutes? WTF (Score:2)

Re:Within 15 Minutes? WTF (Score:1)

Re:Within 15 Minutes? WTF (Score:2)

Comment removed (Score:4, Insightful)

Re:Within 15 Minutes? WTF (Score:4, Insightful)

Re:Within 15 Minutes? WTF (Score:3, Insightful)

Re:Within 15 Minutes? WTF (Score:5, Informative)

Wait a minute... (Score:3)

Question: (Score:5, Interesting)

Re:Question: (Score:3, Informative)

Re:Question: (Score:3, Funny)

Re:Question: (Score:2)

Re:That would be good, except (Score:2)

Re:Question: (Score:1)

Re:Question: (Score:2)

How much testing? (Score:2, Interesting)

Re:How much testing? (Score:5, Informative)

What took so long (Score:4, Interesting)

Presumably... (Score:5, Insightful)

Re:Presumably... (Score:5, Informative)

Re:Presumably... (Score:3, Insightful)

Re:Presumably... (Score:2)

Re:Presumably... (Score:2)

Re:Presumably... (Score:2)

Re:Presumably... (Score:2)

Re:Presumably... (Score:2)

That is what aegis does (Score:3, Interesting)

Re:What took so long (Score:2)

Maybe... (Score:2, Interesting)

Re:Maybe... (Score:5, Informative)

Re:Maybe... (Score:2)

This is awesome (Score:5, Insightful)

Re:This is awesome (Score:5, Insightful)

Furthermore, it prevents regressions (Score:4, Insightful)

Re:This is awesome (Score:2)

Re:This is awesome (Score:2)

Re:This is awesome (Score:2)

Re:This is awesome (Score:2)

Re:This is awesome (Score:2)

Re:This is awesome (Score:2)

ARM Linux has something similar (Score:5, Informative)

Related projects at OSDL (Score:2, Informative)

News Flash (Score:5, Informative)

Re:News Flash (Score:2)

Re:News Flash (Score:2)

Long uptimes (Score:5, Interesting)

through the looking glass... (Score:4, Funny)

Re:through the looking glass... (Score:4, Informative)

Re:through the looking glass... (Score:1)

Re:through the looking glass... (Score:1)

Re:through the looking glass... (Score:3, Funny)