



How to get 1.5 TeraFlops from Linux 280
Oak Ridge National
Lab has purchased from SGI an Altix
3000 (flash movie). This
article claims that:
SGI Altix 3000 is recognized as the first Linux cluster that scales up to 64 processors within each node and the first cluster ever to allow global shared memory access across nodes.
There is more here,
here,
and here.
Look Out! (Score:4, Funny)
SCO will be all over your ass now!
Re:Look Out! (Score:4, Funny)
Beowulf cluster jokes... (Score:5, Interesting)
Better than Beowulf for normal use... (Score:5, Informative)
Re:Better than Beowulf for normal use... (Score:3, Informative)
Re:Better than Beowulf for normal use... (Score:5, Informative)
However, this also can cause problems. Most threaded programs are written assuming that all the threads have high speed (i.e. system bus / cpu cache) access to shared information. When we introduce the latency incurred by a network, this can cause programs to run alot slower then they would if they simply had all the threads on a single box. Obviously, it all depends on how the program was written, and what it does.
If you are writting a program specifically for a cluster, I would suggest instead looking at something like LAM-MPI [lam-mpi.org]. This allows for a much more controlling approach to be taken. It is more work (you have to decide how the work will be split) but it allows for much better control of where and what is being done and how to optimize it.
Re:Better than Beowulf for normal use... (Score:3, Informative)
http://howto.ipng.be/openMosixWiki/index.php/Ap
You have to write your application as a bunch of processes to take advantage of a mosix cluster.
Joe
Mosix... (Score:3, Informative)
First, you have to understand process migration. In a mosix cluster, a running process can be moved, lock stock and barrel, from one CPU to another. All that is left behind is a "stub" process that forwards all file I/O across the network to the new location. So, if the program was a 3D raytracer that had the source description file and the output file open, after migration all file accesses to those files would be forwarded over the network to the st
Re:Beowulf cluster jokes... (Score:3, Informative)
Re:Beowulf cluster jokes... (Score:2)
Well maybe you're looking in the wrong place [google.com].
Good luck.
Re:Beowulf cluster jokes... (Score:5, Informative)
http://bofh.be/clusterknoppix/
Or, Try Quantix, which comes with some apps (Score:2, Informative)
Re:Beowulf cluster jokes... (Score:5, Informative)
At that time they were working on a Java implementation, but I don't know what happened with that.
Re:Beowulf cluster jokes... (Score:3, Interesting)
int a;
void doSomeCalculations(int i) {
a = doSomethingElse(a + i);
}
Would fail (multiple threaded access to a). It is extremely difficult to detect sideffects in C. I've never seen a "smart" compiler as you put it, though there are systems where the programmers can explicitely parallelize a loop.
Re:Beowulf cluster jokes... (Score:2)
Furthermore my mention that C is very hard to parallelize is also mentioned in the same chapter.
Re:Beowulf cluster jokes... (Score:2)
Re:Beowulf cluster jokes... (Score:3, Informative)
Re:Beowulf cluster jokes... (Score:2)
Re:Beowulf cluster jokes... (Score:2, Informative)
As far as languages go, you'll need an MPI library (like MPICH [anl.gov], or LAM/MPI [lam-mpi.org] (which is also a runtime environment), but the actual code used is usually C, C++, or Fortran. BTW, OSCAR comes with MPICH and LAM/MPI.
Re:Beowulf cluster jokes... (Score:2, Funny)
Re:Beowulf cluster jokes... (Score:2, Informative)
Re:Beowulf cluster jokes... (Score:2)
LOTS of monkeys.
Apple (Score:5, Funny)
"Now how am I going to make the G5's look faster than THIS?"
Re:Apple (Score:2, Funny)
For his next trick, Jobs is going to walk on water.
Re:Apple (Score:2, Interesting)
You can already compile programs with LAM-MPI support, so in reality there is nada stopping you from building a Beowulf cluster of XServes. There may even be a compelling reason to use XServes over x86 boxes after XServers are updat
Re:Apple (Score:2)
Re:Apple (Score:2)
Back in the day, on the old black NeXT hardware running NeXTStep 3.3, there was an app called Zilla that could be used to distribute compute intensive jobs around a network of NeXT machines. They said that 100 NeXT Turbos was an even
One Word: (Score:2)
(more info: http://slashdot.org/comments.pl?sid=45647&cid=472
kernel sources? (Score:5, Interesting)
Re:kernel sources? (Score:2, Informative)
Rocket Haid... (Score:5, Funny)
(Inside joke for my ol' friends at ORNL...)
Re:Rocket Haid... (Score:1)
(from an employee of PNNL)
Re:Rocket Haid... (Score:2)
Re:Rocket Haid... (Score:2)
And I'm a network admin at their credit union [ornlfcu.com]...
Re:Rocket Haid... (Score:2)
SCO and Microsoft reactions? (Score:5, Interesting)
It's funny that Microsoft always tries to downplay Linux's enterprise capabilities, when Linux has been scaled to far more power then Microsoft's best offering for years now. Windows 2003 is a clumsy, bloated, closed source chunk of green crap.
Re:SCO and Microsoft reactions? (Score:2, Interesting)
RTFA. They are using this machine for research in the "sciences, clean energy management and production, environmental protection, and homeland security."
It's not a web server, and it isn't demonstrating "enterprise capabilities." Windows has never been intended for, or used for, scientific computing on a large scale.
Re:SCO and Microsoft reactions? (Score:2)
Re:SCO and Microsoft reactions? (Score:2)
Hey, at least it's not running IRIX (Score:1)
Re:Hey, at least it's not running IRIX (Score:4, Informative)
I always liked Irix, and everyone I ever talked to who used Irix liked it. The GUI is about 500x more usable than the horrors of OpenWindows or CDE on Solaris.. bleugh.
Re:Hey, at least it's not running IRIX (Score:4, Funny)
Re:Hey, at least it's not running IRIX (Score:3, Interesting)
I vastly prefer 4DWM to GNOME or KDE as well. I'm helping a coworker set up a Dell inspiron 7500 (P3-700) with Linux, and he immediately complained that KDE was far too slow. I switched to WindowMaker, and he immediately noticed the difference. This is a three-year-old machine, with tons of memory and a reasonable processor, and it crawl
Re:Hey, at least it's not running IRIX (Score:2)
4Dwm is snappy --even on a 150Mhz machine.
IRIX is a nice place to work. Now that I am getting more into Linux, the little differences annoy me. SGI does do its part though. freeware.sgi.com is a very nice resource. Get any old SGI, point it there and download most of the good OSS built and ready for your machine.
Re:Hey, at least it's not running IRIX (Score:2)
IRIX is the most stable unix I have used. It hardly ever crashes. OS updates don't break the system. The desktop is quite nice although not as nice as gnome. But that's not a problem because IRIX can also run gnome!
I know with a 64 processor machine I would prefer to use IRIX if I could. Linux
Best of both worlds... (Score:2, Funny)
Yanking from my journal entry of 6/30/03 (Score:5, Informative)
HPC Wire [tgc.com] had an article [tgc.com] that I referenced in my journal on 6/30.
It's an interesting machine. I'd love to get one to play with. I'm sure our benchmarkers will have some even more interesting comments once they're done. Expect teething problems, folks. Systems of this size and complexity take time to break in.
lites (Score:3, Funny)
Re:lites (Score:5, Interesting)
Suddenly the nobreak was all out, and the billion dollar machine went *poof* - down. Damage? A couple of SCSI disks, but of course everything was mirrored and had parity so even with the damaged disks, there was no data loss.
Then (after a few hours) the powerfaillure ended, the lights went back on in the building, but the lights on the big cluster were still off. The other way round than you'd like to see. Although, when the building power was out, and the nobreak for the machine was active, it sure was a pretty sight. Although, with the impending doom, I didn't really have time to appreciate it.
Re:lites (Score:5, Informative)
What the hell kind of Origin 3800 do YOU have? ISTR ours (512-proc) was roughly $10M.
Re:lites (Score:2)
I was actually surprised that the thing managed to run
Re:lites (Score:3, Informative)
There are two 1,024-processor Origin 3000's in the world. One is in Eagan, Minnesota. The other is at NASA. The NASA machine is called chapman. It has 256 GB of RAM. Not terabytes.
How do I know this? Because I'm sitting here looking at lomax right now.
You're a... whaddya call it. Liar.
Re:lites (Score:2)
I have an account on turing.
Re:lites (Score:2)
Re:lites (Score:3, Informative)
Re:lites (Score:2)
Re:lites (Score:2)
Re:lites (Score:4, Informative)
Just to nitpick... most Origins are not clusters but rather one large single machine. It is possible to partition the machine in firmware and have each partition talk to others over the existing (and now unused) numalink interconnects... but it's much faster (even for plain MPI code) to just run the beast as one large single machine.
Obligatory: Mods on Crack! (Score:5, Funny)
Guess it's time to meta-moderate!
Re:Obligatory: Mods on Crack! (Score:5, Funny)
Only today?
uh huh (Score:2, Insightful)
Oops (RTFA) (Score:5, Informative)
So... (Score:3, Funny)
Re:So... (Score:2)
Re:So... (Score:3, Insightful)
Great.. (Score:2)
Conversion scale? (Score:5, Funny)
Benchmarks I can Understand? (Score:3, Funny)
Kernel compile time with make -j?
Hmm, what are some other good ones?
SGI: Unsung coorporate heros ? (Score:5, Interesting)
In the meantime their additions have contributed quite a bit to open en free thinking in software, take OpenGL and open Inventor, or even to the kernel directly as with the XFS filesystem.
I always liked this approach more than the hyping others have done with linux, but unfortunately this has kept them unadorned within the community. With the Altix cluster (as with their GNU/Linux workstations,which unfortunately failed) I think they have shown that they put their money where their mouth isn't.
I think it's only fair that when we are talking about the large coorporate players in the OSS field SGI at least deserves a footnote for their efforts instead of just hammering exclusively on IBM,Sun etc..as the great backers.
I know, I know. It's a coorporation, so they inherently put money over freedom, it's just something I noticed because of the lack of their name in any high-profile discussions, which I think is unfair.
Re:SGI: Unsung coorporate heros ? (Score:3, Interesting)
The table would record the number of packages released, the number of patches, and the licenses used for each. Originally, I was going to make a four-way split - open-source packages or patches, and packages/patches for open-source OS'.
From this, you could create some kind of scoring system, and thus compare the "open-sourceness" of companies. (From the above, it should be obvious that I consider
And just how did they accomplish this... (Score:5, Funny)
Re:And just how did they accomplish this... (Score:2)
It's really 99% SCO Xenix inserted with IBM's help.
Setting one up now (Score:5, Informative)
It is the first one that the regional SGI reps had actually installed, but since it is almost exactly the same as the MIPS-based origin 3000 servers (with the exception of the obviously different Itanium 2 cpus and supporting chipsets), they ran into almost no problems getting it online. I have also been suprised as to how many commercial codes have already been ported to the platform.
The main reasons we purchased this machine is for the ease in parallelizing code and the floating point performance of the Itaniam 2 cpus. We're computational materials engineers and the less time we have to spend optimizing codes so that the nodes of a cluster are always kept busy and minimizing I/O bottlenecks gives us more time to concentrate on the theoretical issues.
It runs RedHat 7.2 with some tweaks by SGI called SGI ProPack. The Propack modifications come on separate CDs, with the proprietary software on separate CDs from the open source software. So far, from the command line, everything works just like my PC. It's kind of strange running Linux on a >$100K machine, but it sure beats dealing with the annoying differences between IRIX and Linux. Now to see if it performs as well as we expect...
Re:Setting one up now (Score:2)
Re:Mod parent up (Score:2)
Re:Setting one up now (Score:2, Funny)
Non-Flash Links (Score:3, Informative)
Main product page: http://www.sgi.com/servers/altix/ [sgi.com]
and here there are bunch of PDFs to download: http://www.sgi.com/servers/altix/datasheets.html [sgi.com]
for example:
SGI Altix 3000 Family of Servers and Superclusters [sgi.com] (172K)
Linux Software for the SGI Altix 3000 Family (50K) [sgi.com]
SGI Technology Solutions for Linux (48K) [sgi.com]
They must be running their web server on it. (Score:3, Funny)
So how fast (Score:2)
What IS a cluster, anyway? (Score:2)
My gut reaction is that this isn't a cluster. A cluster is a network of independent computers collaborating on delivering some service.
This is a parallell (super) computer. Key difference: all the processors share a single memory space with each other. Programs will run exactly as if this was a single (multitasking) computer.
Most clusters I've worked on are just a bunch of computers with a fast network, using various protocols to synchronize their behavior ("Hey, node 19 isn't pinging, he must have died.
Re:What IS a cluster, anyway? (Score:2)
Do you (or anyone else here) know how a 1 bit processor works? I have read a bit about the CM, and always come across the "1-bit processor" thing - but never have found enough information on how a 1-bit processor really works?
I have a feeling it is something like a single-instruction cpu (subtract then branch or something like that), but even more esoteric in practice. Furthermore, I tend to wonder how you write programs for such a mac
Re:What IS a cluster, anyway? (Score:2)
No, no, it's the data bus that's one-bit. Basically, you stream data to the processor over a serial line, either from memory or from another processor. There are a bunch of instructions. All the normal boolean operations, operations for streaming data aka NEWS, sleep-if, wake... No arithmetic operations, obviously.
No links handy, sorry.
Re:What IS a cluster, anyway? (Score:2)
Yes, granted, but they still share a single memory space (and IMHO are one parallell computer):
In other words, 64 processors in each node (read box), but globally shared memory between all the nodes.
Sure, memory access won't be super fast across the borders (like maybe fibre channel
So what? (Score:2)
I'm offering a Bounty to all posters (Score:4, Funny)
Re:I'm offering a Bounty to all posters (Score:2)
distributed shared memory (Score:2)
Note also that there are several high speed interconnects for Linux clusters available from many different vendors, including InfiniBand, Gigabit Ethernet, FireWire, and Myrinet.
Re:distributed shared memory (Score:3, Informative)
Note also that there are several high speed interconnects for Linux clusters available from many different vendors, including InfiniBand, Gigabit Ethernet, FireWire, and Myrinet.
SGI systems (Origin and Altix) have massive interconnects that hold together the single-system architecture. They're fast for shmem-type shared memory apps, but also for MPI. In fact, SGI
Re:distributed shared memory (Score:2)
I think SGI is playing with the numbers there: while that is the limit for how much data you can push through a link, it seems unlikely that you get that kind of performance for arbitrary processor-to-processor communications or as aggregate bandwidth.
In any case, very fast interconnects are not cost-efficient when they are not needed. 3.2GB is the memory bandwidth of a 400MHz bus motherboard. Mos
How to get 2+ TeraFlops from Linux (Score:2, Informative)
Niflheim at Danish University of Technology [fysik.dtu.dk]
Re:yup (Score:2)
Re:First one... (Score:1)
-flyneye from 2 threads down-
Even better! (Score:1)
Re:First one... (Score:2)
Re:RETARDED MODERATORS (Score:2)
Thanks for the concern, but don't worry about my karma. I've said it before, I have plenty to burn. I just wish moderators would actually read the comment before going "Look, it's really long, therefore it must be at least interesting"
Re:RETARDED MODERATORS (Score:2)
<voice char="Forrest Gump">Stupid is as stupid does</voice>
Seriously, I understand how the trolls have multiple accounts, get points in one, post crap in another, and use the first to mod up the second. However, the editors have unlimited points, along with the rest of the "legit" moderators, and comments like that shouldn't stay at +2 informative for that long.
Re:64 processors = 1.5 Cells (Score:2, Informative)
thats 64 processors per node
Re:beowulf cluster problem! (Score:2)
It knows you're charging by the hour!
Re:...scales up to 64 processors (Score:2)
Re:Great but does it run (Score:2)