NSF awards $500,000 grant for Beowulf Cluster 100
ragnar! writes "National Science Foundation (NSF) awarded $500,000 to support a new parallel computing facility for Bartol. The "major research infrastructure" (MRI) grant will support a parallel system based on 100 linked processors, each of which will run at speeds up to 600 megahertz, connected by fast Ethernet hardware - very similar to the Avalon-Beowulf Cluster, developed by the Los Alamos Center for Nonlinear Studies and Goddard Space Flight Center. "
Re:Isn't that a bit pricey? (Score:1)
MRI grants can't include personnel money or electricity.
Just to clarify.... (Score:2)
Re:First post! (Score:1)
Re:Isn't that a bit pricey? (Score:1)
Re:Linux close but no cigar (Score:2)
I imagine a BSD variant would be best - still open source, but the TCP/IP stack is faster, so you'd probably lose less in inter-processor communication.
If you're running a private gigabit-class network (GigE, Myrinet, Giganet, etc.) and have a separate control network (typically Fast Ethernet), there's no reason to run TCP/IP over the high-speed network. In tht case, you could bypass the TCP/IP stack entirely and have the message passing system (typically an MPI implementation) talk directly to the hardware -- the "user space"/"OS bypass" approach. This is what Myricom's GM and the various VIA implementations let you do. Most of the larger Beowulf cluster installations are going with something like this.
I must admit that I find it very surprising that they're going to the trouble of buying fast DEC Alphas and then connecting them with something as pokey as Fast Ethernet. I hope their RMHD and other calculations are pretty close to embarassingly parallel (i.e. almost no IPC), or the network will definitely end up being a performance bottleneck.
Re:Similar projects (Score:1)
the story is here [nasa.gov]
Re:ummmm.... (Score:1)
But who needs Karma whores anyhow? The only difference it makes is when you get over the +2 threshold, and then it doesn't matter.
--
grappler
Re:What software? (Score:2)
If you want to check out more, check out http://www.capsl.udel.edu [udel.edu]. The information on the web page isn't too organized, and all the info on the EARTH page is bound to be pretty old, but you might be able to get a decent idea.
Re:Grant Funding Realities (Score:1)
I'm currently a grad student at UD (in Mechanical Engineering, not Physics) and am really stoked that I might be able to run code on this system if I can talk to the right people.
I also work at one of the top composites research centers in the world (also at UD). It takes in $5 million a year, but in the end only between 8-10% gets spent on actual material and equipment. My boss has over $1 million in grants himself, but I still get a kick out of how much of our equipment is still held together with duct tape. A grant that large will be nibbled away to fix problems the physics department or research group has and any fix will cost at least a thousand bucks because it probably won't be something that can come off the shelf. I know my school, it will happen that way.
When all is said and done $500,000 over 3 years ain't that much really, especially since what defines necessary hardware can be kind of shaky. Also you have to consider that about 10% of the money research groups spend on equpiment gets siphoned off the back end by the UD purchasing department.
Incidentally personnel means grad students who don't get benefits and get paid a pittance. Thats probably what the other 20% of mathcing funds will pay for.
Re:cheap pc crap vs quality pc crap (Score:1)
You can get a sufficient quality PC for a scientific cluster for far less than $20k. I'd bet that they don't want 100 raid controllers.
Re:A Tiny Little Error in Judgment (Score:1)
dont get me wrong, I LIKE the fact that women go undiless..I just wish I knew when.
Re:Linux close but no cigar (Score:1)
Re:Linux close but no cigar (Score:2)
It came pre-made, with a slightly-modified version of RH 5.2 installed - basically just an SMP kernel and some utilities and libraries. You don't really need any special software, except for PVM or MPI (we use MPI). The MPI distribution we use is LAM 6.2 - I'm sure you can hunt it down if you look around a bit (try google.com/linux).
I'm going to eventually set up another small cluster for testing and development purposes, once we get the first piece of software into operation (or maybe before, if I don't have a whole lot of stuff to do) - I'm planning on setting it up with Debian and a much more sensible layout (share
I wouldn't recommend RedHat, tho - adminning on it is not much fun. (no apt!)
BALDRIC.. (Score:3)
My question is: How can we get this kind of support??
Re:The moderators... (Score:1)
Bye, bye, karma! (Score:1)
could a Beowulf cluster run an interactive application? Say, wine for instance?
I just had this vision of Windows apps actually running fast... On second thought, nahhh.
Some things are just not meant for mankind.
Clusters vs. supercomputers (Score:2)
The main result of this is that only the Government buys supercomputers, and nowadays they're mostly a boondoggle. SGI is currently trying to sell Cray, with limited success. Even Deep Blue is a cluster, made of stock CPUs on custom boards with additional custom hardware. The era of the classic supercomputer, with its huge mat of hand-wired connections, is over.
MODERATORS TO THE QUERY (Score:1)
Personally I thought this was kinda funny. not the funniest, but certianly has merrit.
Anyway, off to find a good post to moderate up (I wish more people would log in, or not post AC... doubtful you'll get moderated down for a minority opinion, only a stupid one... That and people seem to make more sense when they have a name)
Re:Linux close but no cigar (Score:2)
First post! (Score:3)
Sorry, just couldn't resist... bye-bye karma
"Software is like sex- the best is for free"
-Linus Torvalds
Similar projects (Score:2)
This source of funding isn't that unusual -- the University of Virginia Centurion cluster [virginia.edu] was funded by two $450,000 MRI grants.
Re:What software? (Score:2)
Almost no one uses Linda -- what would you think UDel does?
Most people with systems like this use a batch queue system like PBS and message passing libraries like MPI.
Linux close but no cigar (Score:3)
often used, but it have to compete with the large
and good old Unix suppliers. Take a look at:
http://www.fysik.dtu.dk/CAMP/valhal.html
Here you find a similar project, and even an
explanation why they didn't choose linux.
Seems like the commercial unices are running
out of time.
What software? (Score:2)
... (Score:3)
It's too difficult. (Score:2)
Re:Linux close but no cigar (Score:3)
Linux does have to compete with other Unixes, but people often decide in Linux's favor. For example, this cluster [hpti.com] is 277 nodes with better networking, and we chose Linux over Tru64, due to Linux's super system administration capabilities.
BTW, you can get Compaq's great Alpha compilers for Linux.
What happens next? (Score:1)
Dan
Nifty (Score:1)
Re:Nifty (Score:2)
... these machines ARE massively parallel supercomputers, if you build them big enough and you use the best commodity networking (like myrinet [myri.com]).
Re:BALDRIC.. (Score:1)
Yay for our $20000 cluster (Score:1)
Grant Funding Realities (Score:3)
People making coments about the amount of hardware/support that can be had for $500,00 should remember the realities of grant funding at a University in this country:
So, a $500k grant is about $250k after ICR. Then say you fund 2 peole at $35k/year to help build and run it. Now you're down to just $110k for hardware. Even with a "best case" run of the numbers and cheap people, you're still not going to have more than $150k for hardware in this grant.
Also keep in mind that this grant's funding is spread over 3 years.
100 600MHz PCs is going to run about $100k even before you start buying networking equipment, backup equipment and power supply/protection equipment.
In all likelyhood, Bartol is going to need additional funding (possibly x% matching money from the state or other similar grants) to make this a realitiy.
Just thought people should know that when you get a $500,000 grant, you don't just get a check for $500,000 to blow on hardware.
----
Re:explain me something (Score:2)
The article wasn't specific as to hardware, but since they said it was "much like the Avalon cluster" they might well be using Alphas, not Pentia. $5k/box would be a good price if they are using the newer Alpha boxes based on the 21264 chip (which is better than twice as fast, on average, than the 21164's used in Avalon, even at the same MHz).
Re:I liked one comment... (Score:1)
The Beowulf issue is management. (Score:1)
By chance, some projects answer this issue, as BLD [nersc.gov] (free, Berkeley Lab) or ALINKA [alinka.com] RAISIN [alinka.com] (commercial) and ALINKA LCM [alinka.com] (GPL), but there are still things to be done. Moreover, once you have overcome the software management, you still have to deal with the hardware (of these 1000 fans, one _has_ to fail...).
Still, the hardest job is not for the administrators: users have to actually write good parallel code... and this is no piece of cake.
Re:The Beowulf issue is management. (Score:1)
You also want streamlined software configs, automatic integrity checks of OS and software installs and a host of other stuff to keep the software side of the house healthy and under control.
This all adds up and makes a middle to large-sized Beowulf a bit pricier than expected. You will find vast differences between the various Beowulf integrators when it comes to management issues. VA is one that impressed me.
Re:Clusters vs. supercomputers (Score:1)
Not true! While clusters and in particular Linux clusters are coming on strong, there are things that they just can't do.
The bandwidth of the interconnects on a Beowulf and worse still their latency are just not there yet to go head to head with a traditional supercomputer. They are getting very close, I admit, but haven't consistently beaten a Cray T3E-type interconnect yet.
Also, some codes/algorithms just don't lend themselves well to massively parallel implementations. They might be much happier on SMP-type machines or perhaps on vector machines.
Finally, some of the management issues for very large Linux clusters aren't fully resolved yet, but they have been in place on traditional supercomputers for quite some time.
As a result many institutions, including government, but also research sites and large financial institutions continue to buy Cray, SGI and SUN supercomputers, all of which aren't clusters. Just check the latest Top500 [top500.org] list, in particular the slides and statistics.
Remember, we're talking large systems here, and I would define this as more than 16 nodes and more than 32 CPUs minimum.
Re:døød! (Score:1)
I dunno why I'm writing this... no one's still lookin' at this discussion.
Re:Clusters vs. supercomputers (Score:1)
I wonder what a discount broker is doing with the twelfth biggest number-cruncher in the world.
Re:First post! (Score:3)
Re:NSF knows only LINUX/BEOWULF can handle this! (Score:1)
I enjoy linux, but saying that a quad xeon outperforms a cray is rediculous. Also, saying that linux outscales Solaris is a bit far fetched...
Isn't that a bit pricey? (Score:1)
Re:Grant Funding Realities (Score:3)
MRI grants do not allow universities to charge overhead, and is 100% hardware money. You also have to get at least 20% matching funds.
In general, equipment over $500 isn't assessed overhead by any university.
SLASHDOT (Score:1)
ummmm.... (Score:2)
--
grappler
Crypto (Score:1)
In any case, the IRS might think about using AI Beowulf clusters to check tax returns. Ha. That would be the day. (I want my refund!)
What Crypto applications might there be for a Beowulf cluster out there? Genetic algorithms for new ciphers?
Jon
Re:explain me something (Score:1)
A 100-box Alpha setup would of course be much more powerful, but also much more costly; I wonder if the added expense would be better used by just buying more, but cheaper x86 machines?
Now, there are so many possibilities for people looking for power setups. We've got Athalons, G4s, Pentium IIIs, Alphas, Transmeta mystery chips, and those are just the CPU choices. I can't wait to see what will win the price/performance war once multiprocessor, multicore and cluster technology really go mainstream.
Re:Linux close but no cigar (Score:1)
I liked one comment... (Score:1)
The moderators... (Score:3)
Their first instinct is, "Oh God, it's a beowulf post - moderate down, moderate down." It must be a hard itch for them not to scratch in this case
explain me something (Score:3)
If they are purchasing hardware for that amount, they're getting ripped, because I'm thinking all the needed hardware, including the boxes and the networking equipment, can be had for under $150,000 (they could get a nice bulk order discount).
My figure wouldn't include costs like assembly/setup labour and the OS (heh) but half the work is opening the boxes...
Seriously, once the system is going and the scientists have their apps setup, all you need to do is make sure it doesn't overheat. (We are talking about a massive number of x86 systems, here).
Disclaimer: I really don't know what the hell I'm talking about in this post. If someone could inform us what it costs to maintain a project like this, please post.