Tuning The Kernel With A Genetic Algorithm 251

Posted by michael on Saturday January 08, 2005 @08:01AM from the self-modifying-code dept.

fsck! writes "Jake Moilanen provided a series of four patches against the 2.6.9 Linux kernel that introduce a simple genetic algorithm used for automatic tuning. The patches update the anticipatory IO scheduler and the zaphod CPU scheduler to both use the new in-kernel library, theoretically allowing them to automatically tune themselves for the best possible performance for any given workload. Jake says, 'using these patches, there are small gains (1-3%) in Unixbench & SpecJBB. I am hoping a scheduler guru will be able to rework them to give higher gains.'"

Innovation and open source (Score:4, Insightful)

by Anonymous Coward writes: on Saturday January 08, 2005 @08:09AM (#11296207)

A common criticism of Open Source is the accusation that it lacks innovation.

I mean, common. Just look at this. Amazing.

And even if it turns out to be not that good, it was still a good read :-)

- CPU's and compilers (Score:2)
  
  by cybrthng ( 22291 ) writes:
  
  have been doing this kind of logic for a while. Itanium is built almost entirely on this type of logic, combined with Intel compilers and code technology.
  
  I think a hardware design that does more logic controls would be best...
- Re:Innovation and open source (Score:2)
  
  by LarsWestergren ( 9033 ) writes:
  
  >> [...]theoretically allowing them to automatically tune themselves for the best possible performance for any given workload.
  
  > A common criticism of Open Source is the accusation that it lacks innovation.
  
  > I mean, common. Just look at this. Amazing.
  
  Well, not to denigrate the efforts of the people working on the Linux kernel, but Java has had Hotspot and other Just in Time compiling tricks since 2001, and I'm sure there are others before it.
  
  Java Hotspot does not use the same technology of c
- - Re:Innovation and open source (Score:2)
    
    by Flaming Foobar ( 597181 ) writes:
    
    What's innovative about a genetic algorithm?
    Innovative != invention
    This is very, very, innovative. It's easily the most innovative thing to hit a kernel since the 70's.
- - Re:Innovation and open source (Score:2)
    
    by gaj ( 1933 ) writes:
    
    Troll.
    Just repeating a lie over and over doesn't make it true.
    Cut/Copy/Paste works just ducky, and has for years. In fact, both methods of cut/paste work just ducky -- highlight/middle-click and highlight/shift-ctrl-x/shift-ctrl-v. And copy with highlight/shift-ctrl-c.
    - Re:Innovation and open source (Score:4, Insightful)
      
      by ColaMan ( 37550 ) writes: on Saturday January 08, 2005 @02:40PM (#11298506) Journal
      
      Sooo...... can I copy and paste an image from gPhoto into The Gimp yet? No? Call me when you guys manage to do what windows 3.0 could.
      
      - Re:Innovation and open source (Score:3, Informative)
        
        by AstroDrabb ( 534369 ) writes:
        
        gPhoto is an image _viewer_ not an editor. In WinXP the default image viewer is "Windows Picture and Fax Viewer" and it doesn't allow you to copy an image from it and paste it into Paint or Photoshop, so it is no different than gPhoto. Have you ever tried to drag an image into Gimp? It works quite well.
        Oh, and X supports many formats on the clipboard [jwz.org]
        One of the really cool, yet rarely used, features of the selection mechanism is that it can negotiate what data formats to use. It's not just about text.
    - - Re:Moderators on drugs? (Score:2, Informative)
        
        by JFitzsimmons ( 764599 ) writes:
        
        Have you??
        
        Proof [fitzsimmons.ca]
Complexity? (Score:5, Insightful)

by BurntNickel ( 841511 ) writes: on Saturday January 08, 2005 @08:20AM (#11296244)

So how much additional complexity is added for a 1-3% performance improvement? I'm all for more speed, but keeping things simple can often be more important when it comes to maintainability and adding additional features.

- Re:Complexity? (Score:3, Insightful)
  
  by bersl2 ( 689221 ) writes:
  
  It's only one guy (I think; didn't RTFA), and it's nowhere near being included in the mainline kernel. Your observation may be correct in general, but what's the problem here? If an experiment pans out, the internals will be changed down the line to incorporate the new idea; if this idea were to have yielded a 5-10% increase in performance, would you have made that comment?
  - - Re:Complexity? (Score:3, Interesting)
      
      by Morosoph ( 693565 ) writes:
      
      Genetic algorithms are pretty simple, compared to the bulk of what the kernel is doing. Furthermore, the technology is a known quantity, and probably won't be running at run time. Given the existing size of the kernel (6 million lines), I don't think that it'll add a lot to complexity.
      - Re:Complexity? (Score:3, Interesting)
        
        by xenocide2 ( 231786 ) writes:
        
        Genetic programming is a known quantity, but if it "won't be running at run time" then it's borderline useless. The whole idea is that the genetic algorithm adapts to a workload. Currently, scheduling is about fine tuning at the expense of flexibility. You tune the scheduler for a desktop setup, or for a database server, or whatever you have. Most tweaks yield a few percentage point gains in theory and maybe a single percent in practice for the given problem set, at the expense of about a ten percent penalty
      - Re:Complexity? (Score:2)
        
        by sketerpot ( 454020 ) * writes:
        
        To give some idea of how simple genetic algorithms are, here's a fun fact: if you know what you're doing, you can write one in a few hours. It'll work, too. Maybe not that well, or quickly, but it can work.
      - Re:Complexity? (Score:2)
        
        by Dan Ost ( 415913 ) writes:
        
        If you don't count device drivers and other modules, how big is the kernel nowadays?
- Re:Complexity? (Score:2)
  
  by arivanov ( 12034 ) writes:
  
  I hate to sound like Victor Meldrew, but I seriously dislike the way 2.6 is going. It should have settled by now, but instead of that you see major changes to the scheduler and network drivers in every changelog. The only thing that is being left alone for now is the VM (thank god for that).
  
  While I have to admit that none of them is anything as scary as some of the stuff that happened to 2.4 around 2.4.7-2.4.13 when the VM got changed, it is not right.
  
  The supposedly ready and released kernel 2.6 by now from min
  - Re:Complexity? (Score:3, Insightful)
    
    by m50d ( 797211 ) writes:
    
    Like I've said before, kernel 2.6 is simply not stable yet. Wait until they fork off 2.7, then with luck it will settle down.
    - Re:Complexity? (Score:2)
      
      by Carewolf ( 581105 ) writes:
      
      It is stable. And if you check out the new development model, you can see there will not be a 2.7 anytime soon.
      - Re:Complexity? (Score:2)
        
        by Billly Gates ( 198444 ) writes:
        
        Then why does Debian still include kernel 2.4?
        
        2.6 is still beta.
        
        BSD looks a lot better.
        
        Re:Complexity? (Score:2)
        
        by Billly Gates ( 198444 ) writes:
        
        Debian is not bleeding edge because they count bugs and only something bug free on all platforms is added into stable.
        
        If 2.6 was production ready the debian folks would include it.
      - Re:Complexity? (Score:2)
        
        by m50d ( 797211 ) writes:
        
        It's not stable. When it runs on my system, and everyone's system, without crashing more than, say, hourly, then it's stable. 2.6 is nowhere near that.
        
        Re:Complexity? (Score:2)
        
        by RedWizzard ( 192002 ) writes:
        
        It's not stable.
        
        What you meant to say is "it's not stable for me". I'm having no problems with it.
        
        Re:Complexity? (Score:2)
        
        by m50d ( 797211 ) writes:
        
        No. That's why I mentioned "everyone's system". A program which is stable on some of the systems it runs on is not a stable program. Doubly so for something as critical as a kernel.
        
        Re:Complexity? (Score:2)
        
        by RedWizzard ( 192002 ) writes:
        
        No. That's why I mentioned "everyone's system". A program which is stable on some of the systems it runs on is not a stable program. Doubly so for something as critical as a kernel.
        
        Unless the problem is your system: i.e. hardware.
  - - Re:Complexity? (Score:2)
      
      by Apro+im ( 241275 ) writes:
      
      Yeah, we see how well that worked out with 2.6.8's CD burning.
      
      Not that I agree with the grandparent, but it ticks me off that there's no 2.7 branch, and I mean like awful. I'm all for developing the stable branch, but to your average user (like me!), you'd rather know your kernel is going to work.
      
      How about instead of 2.7 being the groundwork for 2.8, why don't we put changes into 2.7, let the early adopters and kernel hackers use it, and when we find bugs like the CD-burning problem, we can fix them befo
- Re:Complexity? (Score:2, Insightful)
  
  by devillion ( 831115 ) writes:
  
  Understanding (the basics of) GAs is easy and so is implementation. They also work quite well. That's why they are so popular.
  IMHO, if GA implementation can be made really reliable it could maybe replace other code which may be (I don't know) even more complicated.
  - Re:Complexity? (Score:5, Informative)
    
    by grammar fascist ( 239789 ) writes: on Saturday January 08, 2005 @02:58PM (#11298652) Homepage
    
    Understanding (the basics of) GAs is easy and so is implementation.
    
    I'm a machine learning researcher, and I'll second this. Also, the code for it will be quite self-contained (if done right), and shouldn't affect any parts of the kernel except where it's called to run an iteration.
    
    They also work quite well. That's why they are so popular.
    
    Sure they do. For a lot of problems, though, they're not so hot compared to other optimization methods like hill climbing and empirical gradient descent - they tend to run slowly - and I would like to ask Mr. Moilanen why he didn't use one of those. GAs are especially good with nominal parameters (discrete, unordered). But I would expect tuning parameters to be ordered, whether discrete or continuous.
    
    GAs are theoretically capable of finding global optima, except that's kind of a red herring. The only reason you can prove that is that, theoretically, no matter how small the probability, you'll eventually get exactly the right sequence of mutations to produce a global optimum. In practice, they tend to produce local optima like most other optimization algorithms - especially as Moilanen describes it:
    
    All of the tunings are then ordered from best performance to worst, and the worst half of the performers are replaced with new possible tunings derived from the best half of the performers.
    
    You generally have to be a little less selective (more random) than this.
    
- Re:Complexity? (Score:2)
  
  by Sophrosyne ( 630428 ) writes:
  
  don't worry, the new Kernel also comes on DVD.
- Re:Complexity? (Score:2, Insightful)
  
  by gnuLNX ( 410742 ) writes:
  
  GAs are pretty trivial to code. So very little complexity is added.
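For readers who want to see how small such a loop can be, here is a minimal Python sketch of the selection scheme quoted in grammar fascist's comment above: rank the tunings, keep the best half, and refill the worst half with tunings derived from it. The crossover and mutation operators, the parameter ranges, and `toy_fitness` are illustrative assumptions. Nothing here is taken from Moilanen's patches, which work on real scheduler tunables inside the kernel.

```python
import random


def next_generation(population, fitness, rng, mutation_rate=0.1, step=1):
    """One generation: rank tunings, keep the best half, refill from it."""
    ranked = sorted(population, key=fitness, reverse=True)  # best first
    survivors = ranked[: len(ranked) // 2]
    children = []
    while len(survivors) + len(children) < len(population):
        mother, father = rng.sample(survivors, 2)
        # Uniform crossover: each parameter comes from one parent or the other.
        child = [rng.choice(pair) for pair in zip(mother, father)]
        # Occasional small mutation keeps some randomness in the population.
        child = [
            g + rng.choice((-step, step)) if rng.random() < mutation_rate else g
            for g in child
        ]
        children.append(child)
    return survivors + children


def toy_fitness(tuning):
    # Hypothetical stand-in for a benchmark score, peaking at (40, 8).
    target = (40, 8)
    return -sum((g - t) ** 2 for g, t in zip(tuning, target))


rng = random.Random(0)  # fixed seed so the run is repeatable
population = [[rng.randint(0, 100), rng.randint(0, 20)] for _ in range(8)]
start_best = max(map(toy_fitness, population))
for _ in range(50):
    population = next_generation(population, toy_fitness, rng)
end_best = max(map(toy_fitness, population))
```

Because the best half always survives, the best score in the population cannot get worse from one generation to the next. Raising `mutation_rate`, or occasionally keeping a poorly ranked tuning, is one way to make the search "a little less selective (more random)", at the cost of spending more evaluations on bad tunings.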
good luck with that (Score:5, Insightful)

by Illserve ( 56215 ) writes: on Saturday January 08, 2005 @08:21AM (#11296248)

If a parameter space is complex enough that you can use a genetic algorithm to tune it, the solutions it finds may have all sorts of unexpected potholes, bugs, etc.

In other words, non-competitive genetic algorithms are only as smart as the fitness function you give them. If your fitness criteria aren't smart enough to cover all the bases, your solutions will have unpredictable deficiencies.

- GA + Hill Climbing... (Score:4, Interesting)
  
  by Corpus_Callosum ( 617295 ) writes: on Saturday January 08, 2005 @08:56AM (#11296365) Homepage
  
  First thing: A GA is only truly effective if you let it exhaustively search the search space - which is why GAs are run against simulations rather than in operational systems. Imagine trying to tune a kernel at runtime by occasionally switching to random tuning parameters. I think this is extremely non-optimal. Of course, if most of the heavy lifting is done before-hand and the GA is simply examining pre-defined parameter sets on your machine, it could work. But it's not really much of a GA anymore.
  
  As an alternative, perhaps using some form of pseudo-GA that tries to find pre-tuned parameters that most closely match your operating environment and then letting a Hill-Climbing algorithm hit it would be a better solution.
  
  Hill climbing can also be used in a GA type manner by letting the GA determine which parameters to climb and in what order. The climbing itself is pretty straightforward: allow vectors to interact with individual parameters. If the result is worse, reverse the vectors or switch to new parameters. Rinse, repeat.
  
  Yes, GA can produce odd bugs and potholes. Yes, it is the fitness test that determines if that will be true. But a good GA will generally find solutions that are as good or better than hand tuning for search spaces that are very complex. Overall, this is a good idea but is probably more complex than advertised.
  
  - Re:GA + Hill Climbing... (Score:5, Insightful)
    
    by Illserve ( 56215 ) writes: on Saturday January 08, 2005 @12:53PM (#11297695)
    
    First thing: A GA is only truly effective if you let it exhaustively search the search space
    
    If you have the resources to exhaustively search the space... you don't need a GA.
    
    A GA is generally used when the search space is hopelessly huge and you need to chart a course from A to(wards) B but you don't know the way.
    
    And in finding this solution, which is "grown", not engineered, it's much easier for unintended weirdnesses to creep in. A GA might solve problem X by ruining performance on problem Y, something that you, as a software engineer, would never even consider viable, and hence you forgot to force the fitness function to check problem Y along with problem X.
    
    - Re:GA + Hill Climbing... (Score:2)
      
      by Corpus_Callosum ( 617295 ) writes:
      
      If you have the resources to exhaustively search the space... you don't need a GA.
      
      Ahh, semantics. "Allow the GA to exhaustively search vs. The GA searches exhaustively"
      
      Of course the GA will not do an exhaustive search, but it must have the ability to do so (e.g. explore all the really bad solutions as well as the good ones - the solution space should not be artificially constrained by eliminating random solutions from testing).
      
      It is pretty obvious that I was trying to say this if you actually look
      - Re:GA + Hill Climbing... (Score:2)
        
        by Illserve ( 56215 ) writes:
        
        Nothing about properly implemented GA's can be considered "exhaustive". They are, by definition, a means of effectively navigating a hopeless search space. For example, it is impossible for a GA to "explore all the really bad solutions", as there are far too many in any interesting problem space.
        
        Re:GA + Hill Climbing... (Score:2)
        
        by Illserve ( 56215 ) writes:
        
        Having the ability to "exhaustively search" implies not only access to the entire problem domain, but also the computational power to try them all (or at least a significant percentage).
        
        GA's, IMHO, are designed for problems in which the amount of computational firepower is practically insignificant in comparison to the size of the domain.
        
        I think you'd do better finding a different term than "exhaustive" for what you are trying to say.
        
        Re:GA + Hill Climbing... (Score:2)
        
        by Illserve ( 56215 ) writes:
        
        No, it's just that the word exhaustive has a very particular meaning in the context of search algorithms.
        
        Re:GA + Hill Climbing... (Score:2)
        
        by Corpus_Callosum ( 617295 ) writes:
        
        A GA does exhaustively search a search space, given enough time. It just so happens that the solutions it finds while searching are generally very useful. These solutions may be used immediately and in some cases, you can stop searching further. But if you let the algorithm continue long enough, it WILL SEARCH THE ENTIRE SPACE.
        
        If the GA is not able to reach the entire space or if it is biased in such a way as to make portions of the space hard to reach, then it is not an effective GA. If I wanted to
        
        Re:GA + Hill Climbing... (Score:2)
        
        by Illserve ( 56215 ) writes:
        
        These statements apply to all search algorithms for parameterized problem domains. No search algorithm is very effective if it is bottled off from significant portions of the domain.
        
        Re:GA + Hill Climbing... (Score:2)
        
        by Corpus_Callosum ( 617295 ) writes:
        
        These statements apply to all search algorithms for parameterized problem domains. No search algorithm is very effective if it is bottled off from significant portions of the domain.
        
        Thank you. We agree. That was exactly my point - the implications are that in a scenario like this, if you allow the search to progress as it should, you will constantly be testing very sub-optimal solutions which will result in negative side-effects to system performance. For that reason, it would be necessary to place h
  - - Re:GA + Hill Climbing... (Score:2)
      
      by Corpus_Callosum ( 617295 ) writes:
      
      Now if you have killed off the GA and are using a straight forward hill climber (a la Simplex) you won't find the optimal solution. Either you need the GA to continue or you need a really good classical minimizer to move from your pre-defined seed points.
      
      I agree. However, we have a problem: Running a GA on an operational system will (MUST) test more highly sub-optimal solutions than it will near-optimal solutions. The performance penalty for searching the space will overwhelm the performance gains f
      - Re:GA + Hill Climbing... (Score:2)
        
        by Corpus_Callosum ( 617295 ) writes:
        
        That solution would mean only tuning the performance for whatever the system happens to be doing during that 1 minute. Perhaps a better solution would be to keep track of the current performance (via the fitness function) and begin genetic tuning only when it falls below a certain benchmark (ie: the average of the last 50 fitness function evaluations).
        
        Yes, you are right about that. I suppose I am imagining that this would be most useful in a data-center rather than the desktop. The desktop scenario is
        
        Re:GA + Hill Climbing... (Score:2, Interesting)
        
        by TapeCutter ( 624760 ) writes:
        
        Moment preezz, I think the both of you are missing something. I have had some experience writing commercial software that collects performance stats for things like capacity planning, tuning, etc. As you can imagine it would be bad press for an application like that to be a performance hog, but (in my experience) when used to "collect all" the machine will take ~7% performance hit (many competitors were worse for less data).
        
        Granted it was a user level app and stored its data in an sql db, but roughly half
        
        Re:GA + Hill Climbing... (Score:2)
        
        by Corpus_Callosum ( 617295 ) writes:
        
        Moment preezz, I think the both of you are missing something. I have had some experience writing commercial software that collects performance stats for things like capacity planning, tuning, etc. As you can imagine it would be bad press for an application like that to be a performance hog, but (in my experience) when used to "collect all" the machine will take ~7% performance hit (many competitors were worse for less data).
        
        Actually, that is precisely what I am talking about. The fitness test that pro
  - - Re:GA + Hill Climbing... (Score:2)
      
      by Corpus_Callosum ( 617295 ) writes:
      
      "A GA is only truly effective if you let it exhaustively search the search space"
      
      I don't think that's true. What do you consider effective? I consider the algorithm to be effective if it finds a better solution, which this does.
      
      Effective: Sure, any gain is welcome. That is correct. But for the GA to be effective at what it does, it needs to be able to search the search-space. If the GA starts with 20 predetermined tuning parameter sets and mates and mutates those, then all we have is a fancy hi
      - Re:GA + Hill Climbing... (Score:3, Insightful)
        
        by Corpus_Callosum ( 617295 ) writes:
        
        "If the GA starts with 20 predetermined tuning parameter sets and mates and mutates those, then all we have is a fancy hill-climber."
        
        This isn't true. The mutation alone differentiates it from a hill-climber.
        
        Mutations in Genetic Algorithms are supposed to act as hill-climbers *most of the time*. Mutations are not supposed to make drastic changes - that is the job of random individuals inserted into populations and recombination (mating). Mutation (of the bit flipping variety) is mostly there to provi
- Parent is overrated FUD ! (Score:2)
  
  by hernick ( 63550 ) writes:
  
  Mr. Illserve, you imply that problems best solved by genetic algorithms may well be full of potholes and bugs. Your comment is overrated FUD; you are trying to scare unwary slashdotters away from genetic algorithms !
  
  First of all, the possible parameters that the genetic scheduler can affect can all be changed safely. The genetic scheduler does not try to invent new scheduling algorithms; rather, it tests different parameter sets and existing algorithms to find which works best with the current workload.
  
  Yo
  - Re:Parent is overrated FUD ! (Score:2)
    
    by Illserve ( 56215 ) writes:
    
    Admittedly, I don't understand this problem space very well. My point was just that I am wary of GA's in any situation in which reliability is a concern. But it could be that in this case the worst a GA can do is come up with a solution that is 10% slower for some unexpected type of problem.
    
    There's just this sense among lay people though that GA's are some kind of magical cure-all, and I think it's because this type of search is a bit of a black box. Put something in, turn the crank, and wait for a solu
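The hill climbing that Corpus_Callosum describes earlier in this thread (nudge one parameter at a time; if the result is worse, reverse direction or move on to another parameter) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the integer parameters, the unit step, and `toy_score` are hypothetical, and a real tuner would have to measure a noisy workload rather than call a clean function.

```python
def hill_climb(params, score, step=1, max_rounds=100):
    """Coordinate-wise hill climbing over a list of integer parameters."""
    params = list(params)
    best = score(params)
    directions = [step] * len(params)  # current nudge direction per parameter
    for _ in range(max_rounds):
        improved = False
        for i in range(len(params)):
            for _attempt in range(2):  # try current direction, then the reverse
                trial = list(params)
                trial[i] += directions[i]
                trial_score = score(trial)
                if trial_score > best:
                    params, best = trial, trial_score
                    improved = True
                    break
                directions[i] = -directions[i]  # result was not better: reverse
        if not improved:
            break  # no single-parameter move helps: we are at a local optimum
    return params, best


def toy_score(p):
    # Hypothetical smooth score with a single peak at (40, 8).
    return -((p[0] - 40) ** 2) - (p[1] - 8) ** 2


tuned, tuned_score = hill_climb([10, 2], toy_score)
```

On this single-peaked toy function the climber walks straight to the peak. On a bumpy landscape it would stop at whichever local optimum is nearest its starting point, which is why the thread discusses seeding it from a GA or from pre-tuned parameter sets.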
not a panacea (Score:3, Interesting)

by DrSkwid ( 118965 ) writes: on Saturday January 08, 2005 @08:22AM (#11296255) Journal

They might converge on a point of attraction that is not the highest possible.

Surely the only way is to exhaustively search the "chromosome" space for every possible combination, and computers are good at brute force!

- Simulated Annealing (Score:3, Interesting)
  
  by Mark_MF-WN ( 678030 ) writes:
  
  That's where 'Simulated Annealing' comes in. It can often avoid local maxima that aren't optimal.
- Re:not a panacea (Score:2, Insightful)
  
  by John Little John ( 842934 ) writes:
  
  They might converge on a point of attraction that is not the highest possible.
  
  Mutation helps with convergence issues.
  
  Surely the only way is to exhaustively search the "chromosome" space for every possible combination, and computers are good at brute force!
  
  Um, I don't even know how many parameters are involved to be adjusted, but it does not take that many to make exhaustive search useless, even for a computer.
- Re: not a panacea (Score:2)
  
  by Black Parrot ( 19622 ) writes:
  
  > Surely the only way is to exhaustively search the "chromosome" space for every possible combination, and computers are good at brute force!
  
  Problem is, the search space grows exponentially with the size of the chromosome. No problem if you have a short chromosome, but brute force is intractable for long chromosomes.
  
  For example, if you are trying to optimize a set of 100 binary parameters for some problem, brute force requires you to evaluate 2^100 combinations.
- - Re:not a panacea (Score:2)
    
    by DrSkwid ( 118965 ) writes:
    
    randomizing the variables doesn't guarantee anything.
    
    It might even mean that no local maxima are *ever* found
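To put a number on Black Parrot's example of 100 binary parameters, the short Python calculation below estimates how long a brute-force sweep would take. The rate of one billion evaluations per second is an assumption chosen to be generous; evaluating a kernel tuning against a real workload would take seconds or minutes per candidate, not nanoseconds.

```python
combinations = 2 ** 100                 # every setting of 100 on/off parameters
evals_per_second = 10 ** 9              # assumed (very optimistic) evaluation rate
seconds_per_year = 365 * 24 * 3600

years = combinations / (evals_per_second * seconds_per_year)
print(f"{combinations:.3e} combinations, about {years:.1e} years")
```

Even at that rate the sweep comes out to tens of trillions of years, which is the practical reason heuristic searches such as GAs, hill climbing, or simulated annealing get used at all.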
Other kernel parameters? (Score:5, Interesting)

by Feint ( 135854 ) writes: on Saturday January 08, 2005 @08:24AM (#11296258) Homepage

Could this be extended to include other kernel parameters as well? Depending on your app, things like TCP timeouts and other muck can have a large impact. Tuning this stuff is currently somewhat of a black art. Then as the user community of the app becomes familiar after rollout, a lot of the usage patterns change. In a few cases, this means we end up having to re-tune the kernel.

If this package could be extended to the other parameters, it would save my customers a *lot* of time and money.

If nothing else, this could be a deciding factor for some of our clients to use linux instead of windows.

- Re:Other kernel parameters? (Score:3, Interesting)
  
  by Corpus_Callosum ( 617295 ) writes:
  
  Now you're talking. Adaptive tuning is definitely the future. While I disagree that a general purpose GA is useful here (you should not let a GA hit an operational system, you need to let it hit a simulation first to build up a certain amount of fitness in its solution space), many adaptive techniques would be useful and could eliminate the need to hand tune in many environments.
- Re:Other kernel parameters? (Score:3, Interesting)
  
  by burns210 ( 572621 ) writes:
  
  That is a great idea. Now here is a dumb one:
  
  What about adding hooks for applications to send/receive performance changes after tweaks? Services, daemons, etc, need to communicate how the GA's latest tweak adjusted performance, right?
So.... (Score:2, Funny)

by mstefanus ( 705346 ) writes:

So does this mean that if I install this kernel on my computer I will have baby Pentiums or baby Athlons soon?
- Re:So.... (Score:2, Funny)
  
  by LiquidCoooled ( 634315 ) writes:
  
  If you nurture your computer, and occasionally sit it next to another computer then maybe, just maybe, when you wake up one morning, you will have little PDAs running around the place :)
Not worth it... (Score:2, Insightful)

by mikelang ( 674146 ) writes:

A 1-2% gain is within the margin of statistical error. Definitely not worth the increased complexity of the solution.
- Re:Not worth it... (Score:5, Insightful)
  
  by Corfe ( 246685 ) writes: on Saturday January 08, 2005 @09:03AM (#11296386)
  
  It's a unique idea - what's wrong with running it for a while with your typical load (say, for a fileserver), finding some better-than-average parameters for the kernel, then running an unpatched kernel with those parameters manually entered?
  
  What is "on the borders of statistical error" depends on how many times the test was run, and how much variation there had been in his results before. I think it's pretty safe to assume that if he knows how to implement a genetic algorithm into the linux kernel, he knows how to handle statistics properly.
  
- Re:Not worth it... (Score:2)
  
  by gatkinso ( 15975 ) writes:
  
  Slightly disagree - I think it is worthy of evaluation... probably better place for this is on the 2.7 branch.
- Re:Not worth it... (Score:2)
  
  by gl4ss ( 559668 ) writes:
  
  ..which would be why he's asking for scheduler gurus to work in it, no?
  
  being mostly just a proof of concept at this stage.
- Re:Not worth it... (Score:3, Insightful)
  
  by xenocide2 ( 231786 ) writes:
  
  Tom's Hardware consistently favors computer hardware that only pushes above the competition by 1 percent or less. People spend an extra 40 dollars for this performance, and you're not willing to consider that people might like a FREE performance boost of a percent?
- Re:Not worth it... (Score:3, Interesting)
  
  by jelle ( 14827 ) writes:
  
  How do you know the margin of error? I've seen systems/measurements where 50% difference is a statistical error, and systems where it needs to be less than 0.2% to be a statistical error.
  
  Pragmatism and statistics are _not_ a good mix.
  
  Note that, for example, many hosting providers host hundreds of web sites per system. Adding a couple of percent in performance then adds a couple of percent to the bottom line of the cost picture for those companies. The same is true for supercomputer clusters used by many c
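Corfe's and jelle's point above, that whether a gain of a few percent is "statistical error" depends on the number of runs and the run-to-run variation, can be illustrated with a small Python sketch using Welch's t statistic. The benchmark scores below are made up for illustration and are not measurements from the patches; only the standard library `statistics` and `math` modules are used.

```python
from math import sqrt
from statistics import mean, stdev


def welch_t(a, b):
    """Welch's t statistic for the difference in means of two samples."""
    standard_error = sqrt(stdev(a) ** 2 / len(a) + stdev(b) ** 2 / len(b))
    return (mean(b) - mean(a)) / standard_error


# Made-up benchmark scores (higher is better), five runs each.
baseline = [100.0, 101.0, 99.0, 100.5, 99.5]
patched_quiet = [102.0, 103.0, 101.0, 102.5, 101.5]  # 2% gain, low noise
patched_noisy = [102.0, 112.0, 92.0, 107.0, 97.0]    # 2% gain, high noise

t_quiet = welch_t(baseline, patched_quiet)
t_noisy = welch_t(baseline, patched_noisy)
```

Both patched samples average exactly 2% above the baseline, yet `t_quiet` comes out at about 4 while `t_noisy` is well below 1. With quiet runs the same 2% stands clearly apart from the noise, and with noisy runs it does not, so the percentage alone says little without the spread and the run count.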
This has been done before (Score:2, Funny)

by drsmack1 ( 698392 ) * writes:

http://tinyurl.com/6pkzc
"Daystrom felt that such an act was an offense against the laws of God and man, and the computer that carried his engrams also believed it."
--Kirk
One question remains: (Score:3, Funny)

by Qbertino ( 265505 ) writes: <moiraNO@SPAMmodparlor.com> on Saturday January 08, 2005 @08:52AM (#11296350)

Did SCO allow him to modify their kernel?

Genetic packet scheduler (Score:3, Interesting)

by City Jim 3000 ( 726294 ) writes: on Saturday January 08, 2005 @09:07AM (#11296402)

Would it be possible to apply a genetic algorithm on a packet scheduler? IMO the packet schedulers available today need too much manual tweaking.

The problem: Determining Performance (Score:4, Insightful)

by Corpus_Callosum ( 617295 ) writes: on Saturday January 08, 2005 @09:23AM (#11296448) Homepage

The main problem with this or any other adaptive tuning mechanism is actually acquiring performance metrics.

What is the system using to decide if a new parameter set is better than a previous? What is the fitness test?

Some applications are disk-bound, others are CPU-bound, others are network bound. The performance dance is often non-obvious (e.g. some applications will achieve generally higher performance by allowing I/O higher priority than context switching, while others that appear to perform in a similar manner will achieve higher performance by reversing that order).

The kernel does not have any mechanism to determine if a particular application is performing better or worse, it can only really get a gauge of throughput and load. While this MAY be enough to get small systemwide performance gains, in order to really achieve significant application-specific performance gains, I think that applications would need to explicitly add support for adaptive tuning by logging relevant performance metrics for the kernel to tune around.

Thoughts?

- Re:The problem: Determining Performance (Score:2)
  
  by angel'o'sphere ( 80593 ) writes:
  
  You are very right with this: The kernel does not have any mechanism to determine if a particular application is performing better or worse, it can only really get a gauge of throughput and load.
  
  I thought for a long time about how the "fitness" part of that GA might work. Ultimately the only "fitness" the kernel can measure with high precision is "load", isn't it?
  
  Any better ideas?
  
  angel'o'sphere
  - Re:The problem: Determining Performance (Score:2)
    
    by Corpus_Callosum ( 617295 ) writes:
    
    I thought for a long time about how the "fitness" part of that GA might work. Ultimately the only "fitness" the kernel can measure with high precision is "load", isn't it?
    
    Any better ideas?
    
    I suppose the only way would be to have some "tuning" hooks that applications could use to announce their own performance metrics. The GA could optimize around load when these metrics are not available and take them into account when they are.
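A minimal sketch of that fallback idea, in Python rather than kernel C for brevity. Everything here is an assumption made for illustration: the function name `fitness`, the normalisation of application scores to the range 0..1, and the 50/50 blend are not taken from the patches.

```python
from typing import Optional, Sequence


def fitness(load_avg: float,
            app_scores: Optional[Sequence[float]] = None,
            app_weight: float = 0.5) -> float:
    """Score a parameter set; higher is better (hypothetical scheme)."""
    # Fallback signal: lower system load maps to a higher score in (0, 1].
    load_score = 1.0 / (1.0 + load_avg)
    if not app_scores:
        # No application announced metrics, so optimize around load alone.
        return load_score
    # Applications reported their own scores (assumed normalised to 0..1);
    # blend their mean with the load-based score.
    app_score = sum(app_scores) / len(app_scores)
    return (1.0 - app_weight) * load_score + app_weight * app_score
```

With no reported metrics the score depends on load only; once an application reports, its numbers pull the score up or down by the chosen weight.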
- - Re:Also: why tune a startup routine? (Score:2)
    
    by HolyCoitus ( 658601 ) writes:
    
    Startup code or a program initializing would not have an effect on the genetic portion. The effects from code being run are very slow, and tuning things in that code will help. The idea is that you would not have an adverse effect from a piece of code being run for a minute. It would take time for the scheduler to change over. This is really designed for a computer that would be doing database work for a time on a website then at night may do some compiling. A desktop would gain little from this in
Oh no! (Score:2, Funny)

by brainnolo ( 688900 ) writes:

Does this mean Linux will be affected by genetic diseases sooner or later?
Monte Carlo w Bayesian Stats (Score:2)

by G3ckoG33k ( 647276 ) writes:

Monte Carlo simulations with Bayesian statistics can explore very large, otherwise intractable parameter spaces. Perhaps an alternative path?
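For comparison, here is a sketch of the plainest Monte Carlo approach: uniform random sampling of the parameter space, keeping the best candidate seen. It deliberately leaves out the Bayesian part, and the names `random_search`, `score`, and `bounds` are hypothetical.

```python
import random


def random_search(score, bounds, samples=1000, rng=None):
    """Sample parameter vectors uniformly within bounds; return the best one."""
    rng = rng or random.Random()
    best, best_score = None, float("-inf")
    for _ in range(samples):
        # Draw one candidate: an independent uniform value per parameter.
        candidate = [rng.uniform(lo, hi) for lo, hi in bounds]
        s = score(candidate)
        if s > best_score:
            best, best_score = candidate, s
    return best, best_score
```

A Bayesian variant would use the scores seen so far to decide where to sample next instead of drawing every candidate blindly.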
GAs aren't rocket science (Score:5, Insightful)

by Earlybird ( 56426 ) writes: <slashdot@pureficti o n .net> on Saturday January 08, 2005 @09:33AM (#11296486) Homepage

Because most people aren't intimately familiar with genetic algorithms, and because GAs are associated with machine learning/artificial intelligence, GAs are seen as somewhat mysterious and magical, and are therefore either accepted with "whoa, cool!" or rejected with "whoa, complex!" While GAs are indeed novel compared to many long-established algorithms, both mentalities are overreactions.

In reality, the basic GA framework is "just" another efficient search algorithm, no cooler or more complex than, say, a hash table or a binary search tree; at its simplest, a GA is a way to find an optimal configuration of components without looking at all possible (potentially explosively exponential) combinations; instead, you look at just some permutations, and as you iterate through generations, applying breeding and mutation, you arrive at a generation which is statistically close to optimal.

GAs are also in no way new or unproven technology; a nice example of mainstream use is PostgreSQL [postgresql.org]'s query planner, which uses a genetic algorithm (GEQO) to optimize plans for queries with many joins.
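The loop described above (rank a generation, breed, mutate, repeat) fits in a few lines. This is a generic illustrative sketch, not the in-kernel library from the patches; the gene encoding (floats in 0..1), the keep-the-better-half selection, and the toy fitness function are all assumptions.

```python
import random

TARGET = [0.2, 0.8]  # made-up "ideal" parameter values for the toy problem


def toy_fitness(genes):
    """Higher is better: negative squared distance to TARGET."""
    return -sum((g - t) ** 2 for g, t in zip(genes, TARGET))


def evolve(fitness, n_genes, pop_size=20, generations=50,
           mutation_rate=0.1, rng=None):
    rng = rng or random.Random()
    # Start from a random population of candidate parameter sets.
    population = [[rng.random() for _ in range(n_genes)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        # Selection: rank by fitness and keep the better half as parents.
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            # Breeding (uniform crossover): each gene comes from either parent.
            child = [rng.choice(pair) for pair in zip(a, b)]
            # Mutation: occasionally replace a gene with a fresh random value.
            child = [rng.random() if rng.random() < mutation_rate else g
                     for g in child]
            children.append(child)
        population = parents + children
    return max(population, key=fitness)
```

Calling `evolve(toy_fitness, n_genes=2)` returns a two-element list near `TARGET`. Because the parents survive each generation, the best score never gets worse from one generation to the next.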

- Re:GAs aren't rocket science (Score:2)
  
  by zhiwenchong ( 155773 ) writes:
  
  GAs are not typically efficient on their own... and while their solutions are usually better than the nominal case (with no optimization), those solutions don't always qualify as "optimal".
  
  quote:
  
  When to Use GAs
  
  GAs treat the optimization problem as a black box, and are therefore very flexible. Because GAs use very little problem structure they will inevitably perform poorly relative to an algorithm which is designed for a specific problem type. GAs are therefore best suited to messy problems which do n
- Re:GAs aren't rocket science (Score:2)
  
  by Illserve ( 56215 ) writes:
  
  Actually, some flavors of GAs are fundamentally different from other types of search, particularly those with an open-ended problem space.
  
  But in the case of simple parameter-fitting problems (i.e. this one), you're right.
practical applications (Score:2, Interesting)

by memoryband ( 847617 ) writes:

While performance gains of 1-3% in a well-defined set of tasks (in this case the benchmarks) are a marginal improvement, well inside statistical error...

this is a really interesting idea.

Moving the genetic algorithm processing to another machine may be warranted. If you had a good idea of what you were going to be doing (heavy database work for instance), a dedicated machine could be used to find an optimal scheduling solution and then that could be implemented on the production machine.

or maintain a list
- Re:practical applications (Score:2)
  
  by Corpus_Callosum ( 617295 ) writes:
  
  Moving the genetic algorithm processing to another machine may be warranted. If you had a good idea of what you were going to be doing (heavy database work for instance), a dedicated machine could be used to find an optimal scheduling solution and then that could be implemented on the production machine.
  
  Ahh.. interesting idea..
  
  If I am running in a cluster environment, I could dedicate one or more machines in the cluster to evolve tuning parameters. That machine could publish "discoveries" to the oth
- Re:practical applications (Score:2)
  
  by HolyCoitus ( 658601 ) writes:
  
  The genetic algorithm is supposed to create a situation where you no longer have to select a scheduler. What you are proposing is using it to find the best scheduler for the task. That is already known, and the scheduler can be designed to take advantage of the environment if it is known. The GA is best thrown into a mixed environment where you do not know the types of processes.
  
  The implementation is horribly useful and needs improvement on the scheduler end as is said. The schedulers that it can u
- Re:practical applications (Score:2)
  
  by tdelaney ( 458893 ) writes:
  
  There's a severe flaw with this methodology though.
  
  The two machines are different (even if their specs are theoretically the same). Minor differences in characteristics can result in highly sub-optimal results if you move the determined parameters from one machine to another.
  
  To obtain *useful* parameter sets, you will need to do it on the machine in question. Your second option points towards this.
Shouldn't this be done in userspace? (Score:2)

by dpilot ( 134227 ) writes:

I can see a kernel patch to export some extra information and/or extra tuning hooks via proc or sysfs, but IMHO the algorithms themselves should be outside the kernel, running in a daemon.
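A rough sketch of what one iteration of such a daemon could look like. It assumes the kernel exposes one integer tunable and one numeric performance metric as plain files; the two paths are hypothetical parameters, not real proc or sysfs entries, and simple hill climbing stands in for the GA to keep the example short.

```python
from pathlib import Path
from typing import Tuple


def read_number(path: Path) -> float:
    """Read a single numeric value from a proc/sysfs-style text file."""
    return float(path.read_text().strip())


def tune_step(tunable: Path, metric: Path, step: int,
              last_metric: float) -> Tuple[int, float]:
    """One hill-climbing step; returns the (possibly reversed) step and the metric."""
    current_metric = read_number(metric)
    if current_metric < last_metric:
        # The previous change made things worse, so go the other way.
        step = -step
    # Apply the step, never letting the tunable drop below 1.
    new_value = max(1, int(read_number(tunable)) + step)
    tunable.write_text(f"{new_value}\n")
    return step, current_metric
```

A daemon would call `tune_step` in a loop with a sleep between iterations, feeding the returned step and metric back in. The kernel would then only need to export the metric and accept writes to the tunable.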
Tweaking the algorithm or using other ones? (Score:2)

by moz25 ( 262020 ) writes:

Having worked with a variety of optimization algorithms, I wonder whether it would make sense to consider the optimization itself as the 'innovation', with the actual algorithm used being secondary. Perhaps this can be optimized further with other algorithms, or by using the most appropriate one for different cases?
Does anyone use this one? (Score:2, Interesting)

by oops.sgw ( 831993 ) writes:

Just compiled this stuff on an old testbox, now running it for about 100 generations. At first it was feeling very slow, ok, it's a Pentium2 ;-) but it was much slower than running vanilla 2.6.9 or 2.6.10, for example.

But it is getting much better now. I don't know how many generations will be needed to get things right. It feels pretty much the same as with the vanilla kernels, let's see where this leads ...

Anyone else with experiences? AFAIK this thingy can only be tweaked by editing the code and
- Re:Futurology (Score:2, Interesting)
  
  by Moonbird ( 625445 ) writes:
  
  You know, in the German translation of Terminator 2 Arnold talks about how his brain is made of "Neutrale Netze", or "neutral nets" re-translated.
  
  It's pretty funny considering he is talking in his grave and totally serious robo-voice...
  - Re:Futurology (Score:4, Funny)
    
    by wertarbyte ( 811674 ) writes: on Saturday January 08, 2005 @09:08AM (#11296407) Homepage
    
    It's pretty funny considering he is talking in his grave and totally serious robo-voice...
    
    You mean there is another voice?
    
- Re:Futurology (Score:2)
  
  by Eric Giguere ( 42863 ) writes:
  
  If one of Linus' kids takes over from his father then that could be considered a kind of tuning of the kernel using genetic algorithms!
  Eric
  Why the Vioxx recall reduced spam (well, maybe temporarily) [ericgiguere.com] (see also my William Shatner All-Bran humor [ericgiguere.com])
- Re:Dear Kernel Coders (Score:3, Insightful)
  
  by gclef ( 96311 ) writes:
  
  Nice troll, but your sarcasm presents a common fallacy: that work on one issue (adding features like this) means that less work is being done on some other issue (cleaning up security problems). The fact is, throwing more people at a problem does not always make it better, especially if the people you throw at it don't know the subject (which the author of this algorithm may not, can't speak for him).
  
  In other words: if you have someone who's good at writing Genetic Algorithms, but not so good at searching
- Re:Dear Kernel Coders (Score:3, Informative)
  
  by kneeless ( 837507 ) writes:
  
  As mentioned previously on Slashdot, uselib() comes from Linux 0.13. It was kept for the a.out to ELF transition. Someone recently noticed it and said, "What is _that_ doing in my system?" This is new code that's being looked at by hundreds of developers. It's pretty hard to get root kernel exploits when it's like that. Plus, this code doesn't introduce any calls with user-level privileges. (Read: no exploit)
- Re:Dear Kernel Coders (Score:5, Informative)
  
  by Xpilot ( 117961 ) writes: on Saturday January 08, 2005 @08:59AM (#11296377) Homepage
  
  Go grab the patches. They're committed into the BK repositories already. Sheesh.
  
  Patches for 2.4 can be found in this changeset [bkbits.net].
  
  Patches for 2.6 can be found in this changeset [bkbits.net].
  
  Click on the little "diff -Nur style" link for an actual usable patch.
  
  In the course of a few hours, you have the fixes already. Yay for open source.
  
  Btw, nice troll :p
  
  - Re:Dear Kernel Coders (Score:2)
    
    by thinkninja ( 606538 ) writes:
    
    Also 5 linux kernel advisories [seclists.org] from the grsecurity team.
  - Re:Dear Kernel Coders (Score:2)
    
    by simpleguy ( 5686 ) writes:
    
    My beef was about "avoiding" security issues like this, not about fixing them quickly.
    
    Please take time to read next time.
- Re:Dear Kernel Coders (Score:2)
  
  by Trillan ( 597339 ) writes:
  
  See Brooks' Law [wikipedia.org].
- Re:Oh, Oh (Score:2)
  
  by Sweetshark ( 696449 ) writes:
  
  that is so sad.
  kudos to SpanKY, who found the right words for this poor soul ....
- Re:Oh, Oh (Score:2)
  
  by bcmm ( 768152 ) writes:
  
  WTF are those use flags near the bottom of the bugzilla page?
  
  I think you are allowed to just say USE="*"
  
  If you want the binaries to be slower than RPMs...
- Re:Oh, Oh (Score:2)
  
  by m50d ( 797211 ) writes:
  
  -funroll-all-loops usually slows things down compared to -funroll-loops.
- No kidding? (Score:2)
  
  by Shazow ( 263582 ) writes:
  
  ...I'm using gentoo myself and love it.
  
  Wow, I never would have guessed...
  
  That is sarcasm, btw. I, too, am using gentoo and love it. :-)
- - Re:Oh, Oh (Score:2)
    
    by Stevyn ( 691306 ) writes:
    
    actually, that would be a USE flag
- - Re:Oh, Oh (Score:2)
    
    by Stevyn ( 691306 ) writes:
    
    I think that's why ricers are made fun of so much. They add in flags without any knowledge of what they do.
    
    Mine are somewhat conservative:
    CFLAGS="-O2 -march=pentium4 -fomit-frame-pointer -pipe -ftracer -mmmx -msse -msse2 -mfpmath=sse"
    
    I don't use -O3, as some people claim it can make things slower. On some systems any -O level already turns on -fomit-frame-pointer, even though every how-to recommends adding it explicitly.
    
    I do use gcc-3.4.3 which isn't marked stable in portage for x86, but I haven't noticed any trouble with it and th
    - Re:Oh, Oh (Score:2)
      
      by MarcQuadra ( 129430 ) * writes:
      
      um, you should turn OFF -mmmx -msse -msse2 when using the -march=pentium4 option, they're already set properly.
      
      To see what flags are set 'behind the scenes' you can run 'gcc -Q -v -O3 -march=pentium4 helloworld.c' on a 'hello world' file. Here's an example:
      
      #gcc -Q -v -O3 -march=pentium4 helloworld.c ...
      options enabled: -feliminate-unused-debug-types -fdefer-pop
      -foptimize-sibling-calls -funit-at-a-time -fcse-follow-jumps
      -fcse-skip-blocks -fexpensive-optimizations -fthread-jumps
      -fstrength-reduce -fu
- - - Re:ooooh man (Score:2)
      
      by johannesg ( 664142 ) writes:
      
      I was making a joke. Gee, learn about sarcasm people...
- Intelligent people consider a GA (Score:2)
  
  by tallbill ( 819601 ) writes:
  
  Obviously any tuning must involve data collection on what is going on in a process. This means that some kind of logging must go on in the software.
  
  Anyone who has ever created a logging interface understands that a log will slow down a system.
  And so, even if one were to run the GA and have it show things, it will still mean that later that GA should be removed or switched off. Otherwise the workings of the GA and its requisite log will slow down the system.
  
  So, use a GA, and then intelligently remove it a
- - Re:as someone who knows about GA (Score:2)
    
    by Carewolf ( 581105 ) writes:
    
    His code shows that adapting tuning parameters to the current workflow is beneficial even when doing it with something as rough as a genetic algorithm. A new, well-thought-out design could use and improve on the results of the genetic algorithm, even if a GA is not going to be used in the final result.
