Google's 4000 Node Linux Cluster 158
Check out the Red Hat press release running at LWN, or the news article at techweb about Google's 4000 Node Linux Box. Both articles are basically Red Hat commercials, but there's some interesting bits like the fact that they have a terebyte index of 300 million Web pages, and that they might expand their cluster to 6000 nodes in the future.
Re:Good comparison (Score:1)
Google is closed source... (Score:1)
What are they doing for Linux? Exploiting it.
Re: Oracle of Bacon (was: hey) (Score:1)
That's me [oracleofbacon.org].
Building a graph is quite straightforward
If you've taken a algorithms course (and passed) you, too, could probably write an Oracle of Bacon [oracleofbacon.org].
I believe that this was done on a single computer. Pretty sure it wasn't a cluster of 4,000 ;)
The Oracle takes up about 10% of the CPU time on a single Sun Ultra 5/300. (I didn't pick the machine. The Oracle also runs on my Linux 2xP2/350 at home.) It takes around 80 MB of memory -- 25 for the actors and movies and the rest for a cache of recent queries [oracleofbacon.org]. Each query consumes 0.6 seconds of CPU time, or 0.02 seconds if it comes from the cache. 90-95% of queries get served from the cache, so the Oracle [oracleofbacon.org] should withstand 10+ queries per second, sustained.
The task is trivially parallelizable across big clusters (UVA has a 256-node cluster [virginia.edu] that would do the trick), but the need for that has never arisen... :)
--Patrick
Re:Why x86? (Score:1)
ttyl
Farrell
Why x86? (Score:1)
ttyl
Farrell
Do your maths (Score:1)
Hello? Where did you buy your brain? We are talking about 4000 PCs here. Do the maths.
If a PC fails, on average, more frequently than once every 10 years and 11 months, then you are going to be replacing one machine each day, and three on Mondays, unless you also work Saturdays and Sundays.
Talk about job satisfaction.
Re:Why x86? (Score:1)
Now why would you want to do a thing like that?
Re:Do your maths (Score:1)
Of course not.
Then again, I have yet to see my first dead Sun.
On the other hand, every place I've ever worked has had a room set aside named the cemetary, where old PCs are left to rot.
Article suggestion (Score:1)
Re:hey (Score:1)
I guess there's something to be said for "headroom."
Re:Why x86 Linux? (Score:1)
There is more to a computer than the CPU. I have never needed to test or specify a system for a specific use, I thought that Suns had better memory system bandwidth, something that would seem to me to be a bottleneck here than computation speed.
For the price, x86 is probably better, though I sure hope they selected solid quality components, I would hate to be on a crew trying to maintain 4000 computers. Other things to consider are power use and the need for climate control or at minimum A/C. So, in your support, there are many factors that need to be taken into account when considering these things, maybe x86 won out on a serious shootout.
Re:google uses RAIS technology (Score:1)
SO WHERE ARE THE PICTURES? (Score:1)
Re:hey (Score:1)
Re:hey (Score:1)
~luge(I'm asking the friend who owns oracleofbacon.org what kind of hardware they used... no answer yet)
Re:FAST (http://www.alltheweb.com) runs FreeBSD! (Score:1)
Offtopic, but that is possible. When you post to an article where you have moderated posts, your moderations are canceled. You won't get any points back, though.
Why not a wildfire? (Score:1)
You can look at the specs at:
Benchmark performance of GS320 [compaq.com]
Which says Suns EV10000 (64 processor) is not as fast as the 32 processor GS320....
And the price of the GS320 is estimated arround
$600,000 from:
The Register [theregister.co.uk]
So one would have thought this system to be a real contender!!! Considering the PCs are $1000 each and if we use 6000 of them, we would be able to afford 10 of these GS320 beasts (with a total of 320 alpha processors )...
Re:Good comparison (Score:1)
--
Re:Very Smart *NOT* (Score:1)
There are very clean, and tested, methods to install large cabling installations, to handle large power requirements, etc... Certainly, the setup is complex, but that's the biz.
Think telco...go check out one of UUnet, Globalcenter, or Exodus' datacenters when you have a chance.
-Buffy
Re:a more technical article anywhere? -- Yes (Score:1)
RedHat needs more publicity (Score:1)
First, I thought as another user that it was obvious that Google was using Linux. Also, the whole clustering capability has been known for awhile.
Do we need every site to have a THIS SITE RUNS ON
I can hear the slashdotters now:
"You can't be a real geek site you run RedHat and not Debian!"
"Well little boy you aren't a real man till you have gotten Slackware working on a 486 33mhz machine with a bad BIOS."
"You are all full of crap because I run BSD and it has REAL security."
Yeah, yeah, yeah. Whatever. They can run a cluster and that is really neat and I love to Quake from their server and I bet my projects would compile really quick and wouldn't be neat if..... I think you know the rest.
Re:Why x86 Linux? (Score:1)
Use to have a link but lost it...
Re:Off the shelf server farms (Score:1)
God forbid that Sun should have competent people getting in the way of product shipping dates...
Re:Good comparison (Score:1)
Re:Why x86 Linux? (Score:1)
Of course there are several manufacturers of x86 hardware, so a comparason between a high-quality Sun box and a low-end x86 box is meaningless. How about a comparason of an x86 server, like VA or Compaq, that has much higher quality components against a Sun. Apples-to-apples.
Administration doesn't have to be too difficult either, there are several tools to help in managing large numbers of UNIX type systems, like PIKT and rdist, that can replicate files and configurations througout a mass of machines.
Maybe it would be more efficient to have a few very large boxes (E10K, S80, S/390, SGI O2K) but I don't think they started with the megabucks to burn, this gives them a cheap, scalable paradigm (solution, I meant solution!) that seems to work for them.
Re:Why? The Arabs are terrorists (Score:1)
How powerful are the machines in the cluster? (Score:1)
Tiny PC cluster (Score:1)
Re:Here's my immature post (Score:1)
Re:Why I use Google (Score:1)
Even if the actual search is slower than other engines, it's a user interface design that makes the overall searching much quicker. The only thing that I would like to see changed on Google is to be able to display 100 or more results from the front page. Then again, that would take away from the "streamlined interface" that I just got done praising, so I'll just shut up now.
-sk
Why I use Google (Score:1)
What database? (Score:1)
Re:Wow (Score:1)
Re:Good comparison (Score:1)
Any OS can get bloated, it's just a matter of what you consider excessive software. There's no question that the 300 little tools that come with Unix systems are useful for scripting, but to someone that only wants to run a web browser they're bloat. However, Unix is modular enough that you can run without many pieces -- it's the monolithic systems where bloat becomes really painful.
Re:Good comparison (Score:1)
Re:Very Smart *NOT* (Score:1)
comes down to managing differences between
them. If they're the same then handling
200 is no worse than handling 2000.
E10000's are far more tricky beasts than a
simple linux box, especially if you're wanting
to do domaining (the only reason you would choose
an e10000 over an e6500).
nothing new... (Score:1)
What does this article really offer new? It's been known from right the beginning that they run a Linux cluster. See here: [slashdot.org]
I find it more interesting that infact they use python [deja.com].
Perensdot rewls (Score:1)
I think Google is making an enormous mistake by using Linux as their OS-of-choice. Something more robust, such as Java or even Python,
Sure, Java already does many OS-like things. Python doesn't, tho. Strangely enough, google is widely known for using Python already....
Google has ads. (Score:1)
Sure they do. Try searching for "linux server" [google.com] and you get back a text-only ad for DigitalNation at the top.
Re:Google and that much data (Score:1)
--
Michael Sims-michael at slashdot.org
Favorite Searches (Score:1)
My favorite searches: Out of curiosity, I typed in Onion, and the top site was "The Onion- America's finest news source". My all time favorite happened the other day when I was searching for information of the statistics program "R". I typed in R, and the R FAQ was number 4 on the list! Rock on guys!
"He looks like he got in a fight with the 70's and got his ass kicked"
-Sherman Alexie
Re:Cluser? (Score:1)
Molly.
Underpowered? (Score:1)
"Well, it just got posted to
My $.01
Re:Very Smart *NOT* (Score:1)
There's no end to the possibilities!
Re:Google is driven by python not by perl (Score:1)
Re:Search engine = Government tool (Score:1)
Re:Why I use Google (Score:1)
Though even this is rather about the tech specs of google, I wonder how they can cache websites and still have not been sued yet.
I remember that in the Slashdot-FAQ there is a remark about caching websites to circumvent the slashdot-effect. Among other (more technical) reasons there was also a point about possible copyright-infringement.
Re:Clues you can lose (Score:1)
But has anyone consider that, those 6000 Linux boxes are actually a VM in an big s/390??
Just a tough.
Ok, I'm starting my own cluster now !! Let me see, a 386/4M, 2 486/16M, Pentium 75/32M, Pentium II 400/128M and 1 AMD 750/256M. Sure i can beat them. I have my own cluster!! What can i crunch now? I will test SETI@Home. May be i can find an alien life form sending email to Bill Gates:
BORG:Resistence is Futile!
BG: Hey! That's my MODO I'm gonna sue you!
Regards
Re:we already know what MIPS and a TB are... (Score:1)
Nothing wrong with that, I'd say. If you know it, fine, if you don't and accidentilly end up on one of their pages, it's only of help. It's not like you have to actively do anything with them.
Mad.
Re:Clues you can lose (Score:1)
I can assure, as one with experience, that Suns most certainly *DO* fall over. The E10K is a nice box with quite a few redundant feature. The ability to remove a system board on the fly ranks pretty high up there. Assuming you lose a processor or memory and assuming (this is a big one) that your system doesn't fall over immediatly, you can most likely replace that processor hot. However... it's been my experience that failling processors or memory bring the box (or domain) down more often than not. And let me assure, sun processors *DO* fail periodically. In a large shop, you can exect to replace at least one or two processors per year.
Re:Google is driven by python not by perl (Score:1)
Google Nodes (Score:1)
10^100 nodes...
Re:Very Smart (Score:1)
None the less, I see your point. I just wasn't refering to such a small setup...
Re:Why x86 Linux? (Score:1)
Re:Google is driven by python not by perl (Score:1)
http://www7.scu.edu.au/programme/fullpapers/192
Goto Section 4.1 and read it! Here is what you missed:
"Most of Google is implemented in C or C++ for efficiency and can run in either Solaris or Linux."
Only the web crawlers are implemented in python. (See Section 4.3)
Re:Why not a wildfire? (Score:1)
Google makes a nice search engine.
good for them, and RedHat.
Fook
Re:hey (Score:2)
...phil
why this doesn't make any sense (Score:2)
--
Re:FAST (http://www.alltheweb.com) runs FreeBSD! (Score:2)
You could still be right though... this only means they are running their frontend web servers on BSD. As to what powers their database is anyone's guess, since you'll never see that server from the outside world if they have half a brain. So, they could be running NT for their backend. This would be the totally wrong thing to do though. Usually companies use NT to serve the HTML because it has better applications for interactivity available than Unix, and a *nix for the real meaty, hardcore database queries, etc. I believe this is what Ebay does but I could be mistaken.
--
Re:Google and that much data (Score:2)
This presented some unique problems, tho. Using 300 nodes meant that, potentially, you could have 300 connections to EACH CLIENT. We needed to make a transparent single point of entry and use a 10.X.X.X ->legal NAT translation. Problem with that, of course, is that NAT often breaks apps like Real Audio or Napster or anything that embeds source/destination within the packet to be router through the routing level of the requestor.
Using NAT as a front-end to a server farm returning straight HTML documents won't cause any problems.
Go away!
CmdrTaco Speaks. (Score:2)
I forwarded a copy of the above post to Rob Malda, and he sent me a concise reply describing his view of how it's supposed to work. I think it's worthwhile to share his insights with the whole crew. With his permission, here's what he had to say:
So there you have it. Of course, that does raise the question of why we have the Overrated and Underrated moderation categories, but otherwise, I think I see his point.
--Joe--
[OT] Moderator Collision Problems: Proposal (Score:2)
At this time, it appears to have been rated back down to a 3. I think what happens is that moderators scan / read through posts, selecting particular posts to be moderated up or down. When they finally get to the end of the page, they click [Moderate]. When several moderators are actively viewing a story, you end up with multiple moderations pending for the same article. So, what should've received a +1 might get +2 or more of multiple moderators agreed that it deserved +1.
The problem is that the moderators don't get to see the other moderations being performed in parallel to their own moderation. Perhaps there's a solution. Slashdot could ask for confirmation in cases of "moderator collision."
For example, consider the following sequence of events:
Currently, Slashdot will apply both moderations immediately. This results in article #42 receiving +2, when it may only deserve +1. It's neither Moderator's fault -- they've moderated past each other. Alternately, I propose that Slashdot, in this case, only apply the unique moderation immediately, and then ask for confirmation on Moderator B's moderation of #42. This is because Moderator B had no way of knowing that Moderator A moderated #42 up while he was still reading the posts. Let's assume all moderations are applied, and continue the example:
At this point, Slashdot will apply the moderation. Under my proposal, this would not change, as Moderator C did already see that #69 was moderated up before he selected it for moderation.
What I'm guessing would be necessary is an additional bit of state which says "This was the score that the post was viewed with at the time the Moderator selected it for moderation." If the article's current score is different than the score it was viewed with, ask for confirmation that the moderation be applied for that specific moderation. A series of radio buttons could be displayed for the affected articles: "Apply Moderation? [_] Yes [X] No".
Thoughts?
--Joe--
Re:Clues you can lose (Score:2)
I pictured myself framing a house in the Texas summer heat, and repairing a barbed wire fence in a snowstorm.
Re:Bit of a unique personality? The guy's a prick! (Score:2)
If I had moderator points, I'd deal you down accordingly. Since I don't, I'll mention this:
I think that "fucken" is becoming a word. I'm glad it is, because it rhymes with "Turducken" [gumbopages.com]. I also think it would work in a subjunctive mood usage context.
Re:[OT] Moderator Collision Problems: Proposal (Score:2)
The overhead is an extra piece of state for each article - but since the score for each article is already in the web page, the only real impact is on the CGI script that does the update.
Re:Very Smart (Score:2)
Ah-HA! So that's what powers the Mentalplex! (Score:2)
"Must... Concentrate!
Re:Good comparison (Score:2)
It is so damn fast, that it just keeps amazing me.
If you haven't tried it - you should!
Here is an example on a seach for "linux":
"3810249 documents found - 0.0051 seconds search time".
Go ahead and try. (Score:2)
It's usually 6 steps to kevin bacon, and it's an NP complete problem. If you do find a way to solve it in polynomial time, please share your algorithm. You'll probably get a Nobel prize.
--Shoeboy
(former microserf)
Depends completely on the application (Score:2)
That's complete bunk. Whether a centralized multiprocessor machine or a massively-parallel distributed cluster would be faster depends completely on the task at hand. Specifically: How parallel is the task?
If the task can be broken up into many completely self-contained pieces, then a cluster will generally win. You can buy lots of low-end hardware cheaper then you can buy even very good high-end hardware.
If the task contains contention points or data access is very random, then you're better off with a single multiprocessor machine. An example of a contention point would be the locks in a database. An example of random data access would be logins to Slashdot.
Finally, it is worth pointing out that, after a certain point, most large machines have to move to a NUMA design, at which point you start to resemble a massively parallel cluster anyway.
Re:[OT] Moderator Collision Problems: Proposal (Score:2)
The problem is that if the moderator takes a long time to read the post and two people moderate it up, from a starting point of 1, to 3, and this moderator had selected to mod it down to 0, the cgi needs to be smart enough to use that as a relative -1, instead of moving the post to an absolute score of zero.
Otherwise someone could start reading immediately, mark one of Signal11's (for an example of someone with a +5 in nearly every thread) posts as a -1 (to 1) comment, then wait till he's been modded to 5 in the initial rush (by viewing the thread from a non-logged in browser) and then submitting, effectively making their -1 worth -4....
But, otherwise, your method seems the easiest and the least error prone.
The only problem is that without overlapping simultaneous moderation, the scores likely wouldn't be so high anymore, so people browsing at +4 and +5 would see less messages... But I always browse at 0 anyways, so it wouldn't bother me.
It isn't working (Score:2)
Others might respect your trolling, but the only thing that matters in the end is high-karma--and you ain't got it.
BTW, don't bother responding with a "what are you talking about, I'm not a troll" response: I don't intend to read it.
--
Have Exchange users? Want to run Linux? Can't afford OpenMail?
Re:Very Smart (Score:2)
FAST (http://www.alltheweb.com) runs FreeBSD! (Score:2)
Alltheweb.com is running "Apache/1.3.6 (Unix) PHP/3.0.11" on FreeBSD...
When I first saw the "powered by Dell Poweredge" sticker on their page, I briefly worried that it was going to be an NT site. Nope!
a more technical article anywhere? (Score:2)
As someone who is building a large portal with Redhat, it'd be nice to have some kind of technical reference as to how they've built it. What are they using to handle the clustering? Are they using the Piranha stuff that comes with Redhat 6.2, or are they using hardware, or maybe something they've written themselves? Are they using sessions, and if so how are they handling them?
Are any parts of the cluster sharing processing power, or they all just individual boxes clustered to appear as one?
I think it's great that they're getting press, I'm just hoping that one of these days there will be something published on how it all went down.
Re:Good comparison (Score:2)
I would be interested to hear more about "The troubles that go on at Google behind the scenes are bound to become public knowledge very very soon.", without further information I'd like to think that linux would not get a 'black eye' over any problems within google, but you seem to know more about this than me.
Re:Google is driven by python not by perl (Score:2)
Re:FAST (http://www.alltheweb.com) runs FreeBSD! (Score:2)
BTW, if this post goes through, it means i've managed to moderate and post to the same thread...whoops.
Re:A google is 10^100 (Score:2)
No it's not. A google is the verb form of googly (a cricket term) - an off-breaking ball with an apparent leg-break action on the part of a right-arm bowler to a right-handed batsman, or conversely for a left-arm bowler.
A googol is 1 followed by a hundred zeros, 10^100.
A googolplex is 1 followed by a googol of zeros, 10^googol.
This is why you use Raging (Score:2)
(seems to come up with slightly diff hits than Altavista itself, but works plenty good for me!)
Re:Google has ads. (Score:2)
Google advertisers will benefit from marketing to a web audience with these distinct demographics:
Male (65%), female (35%)
High education (65% have at least a BA/BS)
Professional (73%)
High income (average income is $71,000)
Highly technical (71% report high/very high computer skills)
Online experience of 4+ years (58%)
Accessing the Internet from work (48%)
Using the web for work purposes (31%)
Imagine (Score:2)
Bit of a unique personality? The guy's a prick! (Score:2)
John Saul Montoya (Yeah, Wosten, thatJohnny Montoya, the guy who kicked your fucken ass over the KKW second-stage funding. Don't fuck with Wall Street).
4000 nerd cluster (Score:2)
BTW, have you looked at the http://www.hotsheet.com/ [hotsheet.com] portal? It's a portal, yeah, but it's really "clean" looking and has a ton of useful links. That's why they host my email. (no, I'm not affiliated)
----
Clues you can lose (Score:3)
Did you say that with a straight face?
Assuming you depreciate a machine over three years (and that's really stretching things in the Real World), you're replacing a machine every just over every six and a half hours. Plus all the effort gets skewed down the the end of the three years. It would almost be economical to throw the door-key away and start afresh.
When you buy a Sun the damned thing just doesn't fall down unless you have a system mangler who keeps dicking around with it. And if a single Sun could not address the problem, then maybe it's time to buy some real iron, like a maxed out S/390. When you have a terabyte of data to process, you have to start paying a little more attention to things like I/O.
4000 PCs cannot be a viable economic replacement. That amount of hardware would require as highly a specialised environment as that of a mainframe (cooling and electricity), and certainly much more real estate. And they have really shitty I/O. If Google has money and space to piss away, well good for them, but it's hardly a wise business practice that anyone over 30 would recommend.
If you want to play with Linux, by all means invent some statistics that show that your MIPS/$ is better than the competition. Statistics can say anything you want them to. I, however, would like to know how they derived such figures. Ignorant readers of the article might otherwise be mislead into pursuing foolish choices in computing platforms.
Oh, and BTW, your regex is suboptimal, the split is entirely redundant and you shouldn't use double-quoted strings in Perl if you're not interpolating anything.
Off the shelf server farms (Score:3)
An interesting quote is this:
Re:Clues you can lose (Score:3)
This leaves the management questions. Presumably most of these PCs are configured exactly identically, apart from the ethernet card numbers, and the work is controlled by some central servers (for which big Suns might well be appropriate). So, if I was setting this up, how would I handle hardware failures:
1. a PC blows up
2. the central server notices some timeout on a
parcel of work or a heart-beat and takes that node out of the active list.
3. the central server (or another one specialized for the job) makes a more intensive effort to sort out the problem. If it can get in, it can probably trigger a reboot, or even a re-install, remotely.
4. If it can't get in at all then human assistance is needed. Add a task "reset node 1234" to the next hourly jobs printout for the operator
5. On the next pass through that part of the warehouse, the operator hits reset. The node tries to reboot, goes through health tests, possibly does an auto reinstall.
6. If no life then add it to the daily list for the operator with the electric handcart to pull and replace, send it in the daily shipment to the supplier.
I don't know for sure that this is how they do it, but it's how I would do it. Failure is a nuisance when it happens every few weeks. If it happens every few hours, then you can make it routine and pain-free. In a cluster of 4000 identical machines, hardware failures are part of life.
You mention other things: power -- a bare PC processor mobo and hard drive draws about 90W. So the whole cluster is about 360KW. This is a lot of power to get in, and heat to get out, but well within the normal range of, for instance, small factories, and the people who supply kit for that should be able to cope easily. PCs will work OK in any heat and humidity that people will, so ordinary office-grade air-conditioning will be fine.
So, in their very unusual circumstances, this probably is the right call for Google. They can routinize hardware failures to the point where they just cause a statistically predictable amount of work that must be budgetted for. The central servers that control all this, store the TB database, etc. are another story. There, the more conventional rules apply, and I would bet that those are normal server hardware -- Sun, IBM or high-end Intel servers.
Re:Why x86 Linux? (Score:3)
Try substituting Sun 6500's with 20 CPU's for each set of 20 Intel boxes and see what that does to the pricing.
Very Smart (Score:3)
But in this case I think Google is on the right track. MIPS/$ ratio is definately in the favor of the PC. And with sooo many PC's if one goes down it really wouldn't make a huge difference. If it were just a 2 or 4 node cluster then I would lean towards a RISC based architechture for reliability. But in this case the cost is just to staggering to imagine a Sun cluster for this.
Koodoos to Google, my new search engine of choice! Long live Linux!
Re:Good comparison (Score:3)
Re:Clues you can lose (Score:3)
I heard the CTO of Inktomi talk on this issue. Their basic approach to cluster buying is to buy midrange PCs in units of 100. Each cluster then consists of 100 identical PCs. Clusters are replaced as a unit, never upgraded. A site may have multiple clusters of different hardware. Every few months, they do evaluations to pick the machine with the best price/performance, which is usually a machine in the middle of the pack, not a top-end machine.
Re:Very Smart *NOT* (Score:3)
Good comparison (Score:4)
Probable source of their inspiration... (Score:4)
Now all that's needed is for thinkgeek to claim responsibility for this action.
Re:Good comparison (Score:4)
Yeah, all the other popular search engines nowadays seem to be ridden with banner ads, promotions, and all kinds of useless fluff on their pages. Google is nice and simple, doesn't clutter the screen, and in general makes everything easier on the eyes. I think this is part of the attractiveness of Google -- you're not flooded with irrelevant info and pictures, but just the stuff you're looking for.
One thing I have against Google though -- I wish they had an advanced search where you can specify to search for exact phrases, etc., or perhaps even a full boolean search. I don't know how Google works, so I can't tell if these features are left out because of design issues. But, being the "hacker's search engine" and everything, it really should support more advanced searches. If they can find a way to implement this well, it may even become a deciding factor against other search engines. (I hardly know any search engine out there that can handle full boolean search, and certainly Google's speed will be a great advantage.)
---
hey (Score:4)
So this "super computer" will be used for Total World Domination? Oh, can we use it atleast to take over some small thrid world countries? I promise to have it back by six tonight.
The Google crew must have some killer Seti@home stats.
I would like to put one of these in my basement and finally disprove the "7 steps to Kevin Bacon" theory everyone seems to buy into.
google uses RAIP technology (Score:4)
Re:Very Smart *NOT* (Score:5)
But damn, that takes a staff of 200 people to manage the security/connectivity/accounts/space and other duties just for the cluster.
The Power bill has to be outrageous!
The Cabling/switching/routing mess has to be totally unmanageable
What happens when you reach a buck in the hardware or have to patch the system or replace a kernel because of a hack that came about? It is costly and hellish to work on 4-6,000 pcs
I would have thought it to be wiser to setup Sun E10000's or something like that.. having 4 32 proc e 100000's in a cluster is a hell of alot easier to manage and cheaper. Sure your upfront bill may be more, but only have to worry about 8-16 power connections (redudancy) is alot easier then 6,000 power cords/strips/racks/floor space/cooling/maintenance.
Sure it is one hell of a beast to be proud of, but one hellova costly beast to work with.
Just my 2 cents
Google is driven by python not by perl (Score:5)
Re:Very Smart *NOT* (Score:5)
Last I checked (this was about a year or so ago) a fully loaded (64/64) E10K ran around $12M and the base (2psr) system was running around $800,000. Even if that's off by a factor of 3 or 4, you're still talking $3-$4M a piece... at three of them, you're looking at between $12-$48M. On the other hand, the typical white box PC will run between $800-$1500. That amounts to $3.68M-$6.9M for 4600 nodes. This doesn't include the network infrastructure or administration costs, however, as someone who has administered large clusters (largest was an 80 node SP/2), it actually becomes easier to administer that many nodes in a cluster than it would that many servers. Keep in mind that there most certainly are groupings of nodes where they are kept identical except for IP.
Another significant expense is that hardware support costs associated with such systems. If you have 4600 nodes, it's trivial to simply keep (MANY) spare systems floating around. Also, you can disable a node with negligible impact. Even if you're subdomaining an E10K, there are (a small few) single points of failure on the platform (regardless of what Suns documentation says). If you're not subdomaining it, you're simply talking a 32way SMP box (might as well just use a 6500 for that configuration). If you were to lose the backplane for whatever reason, you've lost a singificant portion of your compute resources.