Debian Cluster Replaces Supercomputer For Weather Forecasting 160
wazza brings us a story about the Philippine government's weather service (PAGASA), which has recently used an eight-PC Debian cluster to replace an SGI supercomputer. The system processes data from local sources and the Global Telecommunication System, and it has reduced monthly operational costs by a factor of 20. Quoting:
"'We tried several Linux flavours, including Red Hat, Mandrake, Fedora etc,' said Alan Pineda, head of ICT and flood forecasting at PAGASA. 'It doesn't make a dent in our budget; it's very negligible.' Pineda said PAGASA also wanted to implement a system which is very scalable. All of the equipment used for PICWIN's data gathering comes off-the-shelf, including laptops and mobile phones to transmit weather data such as temperature, humidity, rainfall, cloud formation and atmospheric pressure from field stations via SMS into PAGASA's central database."
I don't understand the difference (Score:3, Interesting)
Re:I don't understand the difference (Score:5, Informative)
Also the core os ( most minimal installation ) has many different tools and libs.
Also at time of release they can pick from many different versions of a single package.
That in combination with what version of GCC and compile flags can and does make a huge differance.
And at least with Debian you really do know how the systems was build, with RedHat I still wonder...
Marcel
Re:I don't understand the difference (Score:4, Informative)
Re: (Score:2)
Re:I don't understand the difference (Score:4, Informative)
Re:I don't understand the difference (Score:5, Informative)
Once you got it configured the way you want it, there's little intervention involved to maintaining it. It'll just keep chugging along. The keyword there is "correctly". Follow the readmes, howtos, and best practices, and you're golden.
It's also one of the oldest distributions which always kept to the spirit of GNU/Linux in general: community development and enrichment. Debian developers pride themselves on that spirit. To make the best software for humans. (At least that's what I gather from hanging out with Debian folk) These people are not only passionate in the software that they write, they do it without wanting anything in return, being humble in the way they do it, and wanting no reward for doing it. To them, their reward is in other people using their software and loving it! In my opinion they're not recognized enough.
But what do I know? I just use the software.
Re: (Score:1)
Re: (Score:3, Insightful)
So what you have is a list of the 500 biggest operations who think it is important to brag about what platform you are running. That's quite different from the 500 biggest operations (I'd be surprised if the two lists overlap at all).
I know that if I were running one of the largest, most sophisticated computer systems in the world, I wouldn't be going around telling my competitors how I did it.
Re: (Score:2)
Also the stable version of Debian is very stable, as in it doesn't change. Security fixes are almost always backported so you don't wind up with new features or changed behaviours, etc. I don't follow Red Hat so I don't know much they differ in that regard, but when you have a server that's configured how you want it and working fine it's really nice to know that if you install a security update it's not going to change any of the functionality.
In addition, packages go from unstable through testing and si
Re:I don't understand the difference (Score:4, Informative)
This is, of course, a nightmare. Worse, the kernels on all of these have the notorious RH patchsets, so as far as anyone knows, each and every one has a mish-mash of backported "features" from newer kernels, but few of the bugs fixed that those newer ones have. In fact, several are still at 2.4.x kernels that, even years later, suffer from a kswapd problem that makes the machines unusable after a while. And we're getting in newer hardware that the 2.6 kernels that ARE supplied don't support. Everyone here has given up trying to build a plain vanilla kernel from the latest sources, because there are so many RH-applied patches to their kernels that may or may not even be applicable or available for the vanilla Linus kernels, that no one can say with any degree of certainty that the system will even boot. With Debian, I gave up on vanilla kernels because I was just tired of sitting through recompiles for the "advantage" of just removing a few things that were modules that I knew I would never use, customized to each of a half-dozen machines.
With Debian, which I've run for years without a "reinstall", updates are significantly simpler to perform, and if you want to throw backports.org into your sources.list (which may or may not be a fair thing to do), even *stable* has 2.6.22, or 2.6.18 withouth bpo in the mix. Remote updates? No problem, Debian was *designed* to be updatable like that from the start. The dpkg/apt* tools are still worlds ahead of the RH (and SUSE) equivalents. Dependencies are easier to deal with, as are conflicts, and security.d.o seems to be a lot more on the ball about patches than anyone else.
In fact, I'm often telling our security guys about updates that fix vulnerabilities that their higher-up counterparts haven't started riding them about yet, so they can get a head start on going through the major hassle of begging/cajoling/threatening the RH admins to grab the latest sources, rebuild, and re-install so we don't get slammed on the next security scan for having vulnerable servers. Not that it's ever "the latest", but always "the least amount of upgrade needed to avoid being flagged", which means that next month we go through this all again. With Debian, "aptitude update ; aptitude upgrade" (or just "aptitude"/whatever and then manually select packages, though in stable, it's only going to be fixes anyway most of the time), and the handful of systems I've gotten in under the radar are up-to-date security-wise with significantly less effort.
Even the "you get what you pay for in terms of support" canard has been proven to be false. We had a couple brand-new freshly-patched RHEL5 systems that just would not stay up. First thing RH Support has us do is run some custom tool of theirs before they'll even attempt to help us. A tool that, I should add, is rather difficult to run on a machine that goes down within minutes of coming up. Finally we re-installed and didn't apply the RH-sanctioned updates. Machine...stays up. Same thing with some RH-specific clustering software. Another RH-approved release resulted in...no clustering at all. For whatever reason, re-installing the prior RPM wasn't possible, but the
One thing always missing from such stories... (Score:5, Interesting)
What was the age and the specs of the SGI being replaced?
Going by Moore's law, a factor of 20 performance improvement takes about 6 to 8 years. If the SGI was at least that old, this isn't news -- it's just the state of the art these days. In other words, small clusters capable of weather forcasting are relatively run-of-the-mill.
Of course, props to linux for being the enabler in this case.
Re:One thing always missing from such stories... (Score:4, Funny)
6 to 8 years, you say? Well, then, they'll be ready to upgrade about the time the next version of Debian is released.
Re: (Score:2)
Debian "testing" is upgraded several times a month.
Debian "stable" is upgraded every one or two years.
Take your pick.
I chose "unstable" which is stable enough to be on my home machine. I have never had any serious issues, so far, after one year of usage.
For a production server I would use "stable" but for a research machine the "unstable" looks like a good choice. I guess the people who built it would know what to do.
The only on
Re:One thing always missing from such stories... (Score:5, Informative)
> Debian "unstable" Sid is upgraded every day, or at least several times per week.
True.
> Debian "testing" is upgraded several times a month.
Wrong. Debian testing is updated automatically from packages in Debian unstable. The difference is simply that a package has to sit in Debian unstable for a few days, and no significant bugs can be introduced by the new package, before it is updated. Since the process is automatic, Debian testing is updated just slightly less continuously as unstable (it depends on the robot to check the package dependencies and bug reports rather than the maintainer to upload a new version).
The only time when the update rate is seen as low as less than that is when testing is in deep freeze, i.e., a new stable is about to be created.
> Debian "stable" is upgraded every one or two years.
It usually takes slightly longer than two years.
> The only one I have avoided is "Debian experimental"...
You cannot have a pure "Debian experimental" system. Debian experimental are subsystems that could have profound effect on the rest of the system, and so is provided for trial in isolation. E.g., back in the Gnome 1 to Gnome 2 transition days, or XFree 3 to XFree 4 days, these subsystems are tested in experimental before moving to unstable. These packages are supposed to be used on top of or to replace some unstable packages. Since they affects one particular subsystem, experienced testers can try one particular one based on their needs.
Re: (Score:3, Interesting)
Re: (Score:3, Informative)
> automatic as you describe. At least, the most important packages (like linux, gcc, glibc,
> dpkg, python, xorg, gnome, kde,
> made only when the maintainers think they're ready to be included in the next stable
> debian release and when they're sure that they don't break anything.
The process is automatic. There is even a script to tell you why a particu
Re: (Score:1)
It was a cost reduction of a factor of 20, not a performance improvement.
Re: (Score:2)
It was a cost reduction of a factor of 20, not a performance improvement.
Err.. if the old SGI had 400 CPUs and the new cluster needs 20 CPUs to match its performance, then new cluster performs the same at 1/20th the cost.
I know this is an over-simplification, the SGI probably used vector processors and relied a lot less on parallel-processing, cost-per-processor and number of processors will be completely skewed from my example if that's the case, yada yada yada.. but you get the point I hope..
Re: (Score:1)
Well, RTFA:
"Previously, for almost a decade PAGASA used an SGI Irix supercomputer...."
So they were probably using an O2k. Somebody wake me when there is some real news.....
Re: (Score:3, Informative)
This Philipine newspaper story [manilatimes.net] fills in some important details missing from the Australian PC News article: the age of the SGI system (10 years) and the reason it was costing s
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
http://slashdot.org/comments.pl?sid=487034&threshold=1&commentsort=0&mode=thread&pid=22748784#22749162 [slashdot.org]
This isn't anything new... (Score:5, Interesting)
Re:This isn't anything new... Global Telecom. Sys. (Score:2)
how they get their GTS data, probably use a satellite downlink. There is a GPL GTS switch that's developed for Debian:
http://metpx.sourceforge.net/ [sourceforge.net]
Re: (Score:2)
GTS was until recently largely an X.25 [wikipedia.org] PSTN [wikipedia.org]. I learned X.25 helping maintain message-switching software at a military weather forecasting center; we were a subsidiary node of GTS in that capacity.
I know that many nodes in the GTS have gone to FTP or TCP socket streaming over the Internet (or VLANs running under the Internet). Old-sk00l by 'net standards, but Très moderne in the WMO timescale.
Re: (Score:2)
Many think of the GTS as an X.25 network, but X.25 is going
Re: (Score:2)
Obligatory (Score:3, Funny)
oh, wait...
Re: (Score:1)
Where is... (Score:5, Funny)
apt-get -f -y install gweather
But it failed with something about "ldconfig:
Is libearthquake in unstable?
Re:Where is... (Score:5, Funny)
more weather - For when you need a new update.
less weather - Got too much weather? Reduce it!
vi weather - When you want to change the weather.
emacs weather - When you want to change the weather on 15 separate planets at once.
cat weather - It's raining... oh, never mind.
Re: (Score:3, Funny)
Re: (Score:2, Funny)
Re: (Score:2)
Only problem is that less is more:-/
Re:Where is... (Score:4, Funny)
Scalable ? (Score:1)
"an eight-PC Debian cluster"
"[we] wanted to implement a system which is very scalable"
8 PCS ? 64 cores at most. And they call that scalable ? Come on, today's top500 top machines scale on 10'000 cores. They're 15 years late.
Re: (Score:3, Informative)
Re: (Score:2)
They're not paying Intel either (Score:2, Interesting)
How do they actually supercompute? (Score:1)
What I want to know is: Do they have a big 64 bit addressable RAM image spread over all nodes, communicating with pthreads, like I prefer? Or perhaps they have several 32bit RAM images communicating with some special message protocol. Or perhaps they just have lots of quite independent but equal programs running, as an ensemble. Or perhaps some kind of pipeline where the different parts of the calculatio
Re: (Score:2)
As a weather geek as well as a GNU/Linux geek... (Score:1)
Debian rocks (Score:2)
FYI, I am a little biased, but Debian is the distro that constantly gives me the least trouble.
Re: (Score:2)
Re:hmm (Score:5, Funny)
Re: (Score:2)
Re: (Score:2, Funny)
Re: (Score:2, Funny)
Re: (Score:2)
Re:hmm (Score:5, Funny)
Re: (Score:3, Funny)
Re:Debian? (Score:4, Insightful)
-- from a debian user... who actually started quite late with potato....
Re:Debian? (Score:4, Interesting)
Stability is a major thing with Debian, and my experience has been that this is quite true.
Re: (Score:1)
Re:Debian? (Score:4, Funny)
Re:Debian? (Score:5, Informative)
Wow - how many performance clusters do you run again?
Not that I run a "performance cluster" as such - but I do run a bunch of machines that are very busy, all on Debian.
You know what? We compile the couple of programs where CPU is the bottleneck from source. We also compile Cyrus IMAP from source because we apply a pile of patches, but if someone else was packaging up all those patches in upstream, I'd be happy for them to be compiled there. Disk IO is the issue with Cyrus, and a custom compile won't help with that.
Yeah, we build our own kernels as well - that's another point that's worth the effort to customise.
Re:Debian? (Score:5, Insightful)
The things I (and my co-workers) put a lot of optimization effort into is the kernel and our apps. You're exactly right.. 99.9% of our CPU cycles go into getting work done, and that 0.1% used by
Re: (Score:3, Informative)
I agree. I used to run some clusters for the UCLA Chemistry department and the only real customizations we did was to install a custom kernel in Redhat 9 to handle the huge amount of memory we had installed. And even that wasn't in all the clusters. But yeah, the code the clusters was actually running was pretty much always compiled by hand.
Re:Debian? (Score:4, Funny)
You really oughta use a compiler for that.
Re: (Score:2)
There may or may not be that much performance gain from compiling from source, but depending on what your sysadmins cost, having the ability to run
and have your system be automatically updated could end up being more cost effective than additional support even if that means that you have to buy more/bigger hardware.
Personally I use the package management system for
Re: (Score:3, Insightful)
I know people who know a fair amount about running clusters. None of them want the headache of dealing with the random-ass unexpected conflicts that arise out of having the explosion of possibilities for custom compiling for each server. Also, nobody wants to use their precious "performance cluster" cycles compiling every update. If you really need to compile tweaks (for the important stuff only), you do it offline, once, and then build a *binary*
Re:Debian? (Score:5, Funny)
Re: (Score:2, Insightful)
How fortunate that apt/dpkg handles source packages so well then... Punching in 'apt-get -b source <whatever>' is not a whole lot harder than 'port install <whatever>' or whatever you prefer, is it? I know, I know... Don't feed the troll... Sorry.
Re:Debian? (Score:5, Informative)
I hated Debian at first because it wasn't friendly, but I looked into it more and realized it was the best choice I could make for my production servers. I can set them up and check once a week or so and they're still chugging along without need of intervention.
I wouldn't use Debian on my desktop (I use Kubuntu), but it can't be beat for servers.
It's NOT a desktop distro. Especially compared to Mandr* or Ubuntu or many others out there.
Re:Debian? (Score:5, Informative)
Re: (Score:3, Informative)
Every time I want to install Ubuntu on some random machine, it fails. I always have to go back to Debian.
I have Debian currently installed for my father and my sister. Spares me the headaches of Windows problems. The only support I need to deliver to them is giving information about performing tasks.
Re: (Score:2)
Re: (Score:2)
Fixed it.
Re: (Score:2)
Re: (Score:2)
Did try it with Potato, Woody, and Sarge.
So yes, you're right, I don't know a damn thing about Debian or Debian on the desktop.
Seriously, Debian can be used on the desktop. So can Slack or just about any distro out there, but Debian was designed for "set it up and forget it" server usage.
By the same token, my 1986 Mercedes 560SL convertible has a kick-ass engine and I can use it to haul trailers full of wood or bricks, but that's not what it's designed for. There are wor
Re:Debian? (Score:4, Insightful)
I doubt they have X installed on these machines.
Re: (Score:2)
That being said, I've had very few problems getting X running in Debian. At least 80% of that is researching before I buy a video card... If X doesn't support the card/chipset, I don't care if it's the latest and greatest card out there with 2G of FTL SDRAM, 2 quintillion colors, and able to support 300 fps @ 16000x9000 resolutio
Re:Debian? (Score:4, Insightful)
Re: (Score:2)
so I can test things locally, but has slight differences in defaults.
Re: (Score:2)
Re: (Score:3, Informative)
Re:Debian? (Score:4, Insightful)
Re: (Score:3, Insightful)
Re: (Score:3, Insightful)
Re: (Score:2)
Re:Debian? (Score:5, Insightful)
Not sure why you call Debian a desktop distro, it's much more useful as a server.
Re:Debian? (Score:5, Insightful)
Why debian when you can have Slackware? (Score:2)
Re:Why debian when you can have Slackware? (Score:4, Insightful)
Re:Debian? (Score:5, Interesting)
Debian isn't - and never has been - a desktop distro. If you want a desktop distro built on Debian architecture, you get Ubuntu, or Knoppix, or one of a dozen others. Debian's unique selling proposition is a combination of stability, which is very important to production servers, and a rigorous commitment to free software. Packages don't make it into Debian Stable until they have been thoroughly tested. Debian also has the best package management system in the industry.
Frankly, I wouldn't run a server with anything else.
Re: (Score:3, Insightful)
It just takes a little more effort if you do something pointless like start out with just the min install.
Re: (Score:2)
Seven Damn Small Linux for the Dwarf-lords in their halls of stone,
Nine Linspire for Mortal Men doomed to die,
One Debian for the Dark Lord on his dark throne
In the Land of Mordor where the Shadows lie.
One Debian to rule them all, One slocate to find them,
One distro to bring them all and in the darkness bind them
In the Land of Debian where the apt repositories lie.
http://www.debian.org/misc/children-distros [debian.org]
disturbance in the force (Score:2)
Re: (Score:2)
Re:dent in the budget (Score:4, Informative)
Re: (Score:3, Informative)
Re: (Score:3, Insightful)
Sure. Add in paying for tech support or the cost in man-hours it takes to keep it running. Both can make a serious dent where nobody expected to see one.
That's the big attraction for Debian. For a production system, support tasks drop to almost nothing. It's there. It runs. If and when a patch is needed, it is just that - a patch - and not any weird licensing changes or mutations in functionality.
Of the linux distros, it's an excellent choice for servers, perhaps the best. Given the rock-solid nature, it can be good for enterprise desktops, if you are willing to plan. However, Kubuntu LTS meets that need.
Re: (Score:2)
I'd think that there might be a learning curve as the staff adapts to Debian, but after that, yes, the support drops to almost nothing. That's why I use it on my servers -- hardly any time ever spent on support and if something goes wrong, there's a 99.9% chance it's hardware relate
Re: (Score:3, Insightful)
Also, their supercomputer may just be outdated, not necessarily because of bloated
Re: (Score:3, Informative)
Also, their supercomputer may just be outdated, not necessarily because of bloated software. I don't know how well SGI's products and support survived their recent bankrupcy, but I'd imagine not too well (though they seem to have built the Xeon-based #3 from the Top 500 recently).
AFAIK, SGI still supports IRIX computers, but it doesn't do new ones - hasn't for several years. They do Linix of course - looks [sgi.com] like you can have SuSe or RedHat - and they also do some Microsoft thing too. I'm sure SGI could have serviced this customer, but they clearly didn't want to pay for support. You get what you pay for, and SGI still have very smart/clever people....
Re: (Score:2, Informative)
Re: (Score:2, Informative)
And consistency? Like how the entire Debian repository is cross checked every day to ensure consistency?
I'm also intrigued by your reference to updates destroying servers. Do you get this behaviour with Red Hat? Makes me glad I'm not using it, then.
And managing an arbitrary nu
Re: (Score:1)