Linux Clustering 162
Linux Clustering. Building and Maintaining Linux Clusters | |
author | Charles Bookman |
pages | xv + 265 |
publisher | New Riders |
rating | 2/10 |
reviewer | Steve Krause |
ISBN | 1578702747 |
summary | A guide to clustering software, networking, and journaling filesystems |
Bookman emphasizes a central piece of wisdom that no system administrator should ignore: redundancy. In the case of high availability clusters, parts redundancy is the name of the game, but one should not forget the human component; no administrator should be caught with only a cell phone -- keep a pager just in case. However, in a post-modern turn that might seem brilliant if it were applied in a work of fiction rather than a technical book, the author seems to apply the concept of redundancy to the text itself.
That the book began not as a book but rather as a collection of talks or presentations, or some other smaller format, is evidenced by the repetition of information between chapters and sections. Such nearly poetic repetitions also occurs within sentences and paragraphs (e.g. "nightly backups each night" on page 25).
An editor never looked at Linux Clustering; the book had two "technical reviewers" but their contributions seemingly didn't include fixing mangled syntax and strained style. On page 14 in the second paragraph a large segment of a sentence from the previous page is pasted into another sentence, resulting in a nonsensical block of text. The number of hyphenation, syntax, word choice, and subject-verb agreement errors is atrocious and makes the book difficult to read.
Some of the misinformation in the text appears to be unintentional (but ignorance is no excuse for a UNIX systems administrator); some is due to the fact that the author deals only with old (2.2) kernels (though the book came out 18 months after the 2.4 kernel release), old versions of journaling filesystems, and old distributions; and yet other misinformation is the result of misplaced attempts at humor (such as stating that GNU stands for the Gateway Naming Utility; one can only hope that this was intended to be funny). Other jokes often misfire, but do point to the intended audience (consider, for example, the section heading "Space: The Final Frontier").
In the Introduction, the author indicates that the book should be read by "Linux enthusiasts and users who want to get a Linux cluster up and running with the least amount of fuss." The organization of the book will not, however, aid this enterprise, for there is little "how to" information provided, but rather a great deal of background information on compiling kernels, various types of journaling file systems, and RedHat's Kickstart (perhaps inappropriate considering that the book specifically states that basic information will not be covered). Another section or two deal with basic networking and security. Various types of clusters are discussed, as are a few of the types of clustering software (e.g. Condor and Mosix) available.
The book, however, is clearly intended for administrators of clustering systems; a special emphasis is high-availability and load-balancing clusters. Parallel computing and the types of applications end users would wish to run receive far too little discussion.
Almost all technical books regurgitate the contents of freely available FAQs and HOWTOs to some degree, yet the good ones summarize the relevant points, make dry documentation more accessible, and give the reader some new insights. Because Bookman's Linux Clustering suffers from heinous spelling, grammar, and style errors; deals primarily with outdated software; contributes little new to the discussion; and doesn't speak to non-admins, I can only recommend that those interested in Linux clustering stick to online FAQs and HOWTOs.
You can purchase the Linux Clustering: Building and Maintaining Linux Clusters from bn.com. Slashdot welcomes readers' book reviews -- to see your own review here, read the book review guidelines, then visit the submission page.
MySQL & mSQL (Score:4, Funny)
MySQL & mSQL: Worst. Book. Ever. (Score:2, Informative)
Here are the books I've found most helpful on MySQL (and using MySQL with other things):
MySQL- Paul DuBois
MySQL and Perl for the Web- Paul DuBois
PHP & MySQL- Welling & Thompson
Re:MySQL & mSQL: Worst. Book. Ever. (Score:3, Informative)
Re:MySQL & mSQL: Worst. Book. Ever. (Score:2)
Re:MySQL & mSQL (Score:2)
Using it as a reference, and my knowledge of SQL, I did my first paid perl/php/mysql work while still in earnt University, and earnt close to $10,000 working from home, while studying. Having this work on my resume helped get me a $53,000/year job at age 21.
Ok, maybe other books could have helped too, but this was 1999, and like I said, the book did me well. I keep it at work for qui
No good books? (Score:5, Interesting)
Re:No good books? (Score:5, Interesting)
My friend worked at a lab in Princeton modeling the inside of a reactor. He worked with a 32 node linux cluster and did all the graphics modeling using a modified version of Unreal Tournament.
Re:No good books? (Score:2)
Perhaps it's because theses authors write books about clusters but never had to build one for any particular reason.
Re:No good books? (Score:1)
Sweet. So did the reactor use the shock rifle alt-fire to start up or something?
But more on topic, I think you're right about the size of the market. About how many big clusters are there out there anyway?
Re:No good books? (Score:2)
Re:No good books? (Score:1)
Re:No good books? (Score:5, Informative)
Re:No good books? (Score:3, Informative)
Re:No good books? (Score:5, Informative)
granularity, I'm tired of explaining this on
There are two main typesd of cluster, High Performance and High Availability.
HPC tries to increase the power of the cluster by spreading jobs out over the whole cluster. HPC breaks the work down into blocks and farms this block out to the nodes. In the worst case a single node failure could cause the whole cluster to fail.
HAC tries to increase the uptime of the cluster by running the same job on more than one node. If a node fails then the job on the mirror node takes over. It's worth noting that no Linux cluster has 100% HA.
If a single node is going to fail 5% of the time what is the up time of a 100 node cluster?
I work in the commercial not the scietific world. HPCs bore me. HACs could be a god send.
Imagine if you will a cluster that automatically deals with node addition/subtraction. I have 1,000 users connected to this cluster using Xterms. I need more power, add more nodes. If any nodes fail the user never loses anyuptime as their work is switched to a mirrored node or nodes.
Centralised computing rocks.
Re:No good books? (Score:2)
The problem with your idea for HA is who watches the watchers? The process comparing the results must have a backup, so who checks that the checking process and it's back up are in sync? Quite quickly becomes a nightmare.
FYI, Tandem are the only company I know of with a fault tolerant intel server. They had to design their own mother boards as I recall. Lockstep of the processors was the way they went.
Re:No good books? (Score:1)
Re:No good books? (Score:4, Insightful)
However, there are good uses for modification of vehicles, like say air bags. I don't call air bags, gimmicks, just because I think that guys who put dual exhaust systems on a 20 year old car seem like they are wasting money to me.
However, in terms of redundancy, your far, far better off with 10 P3 500's, then with one P4 5Ghz machine. One of the PIII's is having problems, shut if off, run the diagnostics. The P4 has problems, you shut if off, you are in deep shit.
If I had my choice, I'd rather have a cluster of 5-10 well built, redundant machines then one machine 10 times as fast for any problem that can easily be distributed (think websites, DNS, mail servers). No, I don't want to use 10, 3 year old Dell workstations to serve up my enterprise website, but I wouldn't have any objections to 10 Dell Servers that were bleeding edge 3 years ago assuming it uses parts that are still commonly available.
Kirby
Re:No good books? (Score:2, Informative)
Re:No good books? (Score:2)
I'm sure Brin and Page would love to hear more about your unique insights into the cost-effectiveness of Linux clusters.
http://www.internetwk.com/lead/lead060100.htm [internetwk.com]
http://www.redhat.com/about/presscenter/2000/press _google.html [redhat.com]
[each link opens in new window]
Re:No good books? (Score:2)
Re:No good books? (Score:2)
If anyone would like to send me on the righteous path towards clustering enlightenment, here are the details of what I have, and what I'd like to do... I haven't looked into clustering too much, so I don't really even know if this is possible.
I have 18 Linux workstations, and 1 server. I'd like all the workstations to work together to run all the processes on them, and also ha
Re:No good books? (Score:1)
Paul DuBois, MySQL (Score:5, Funny)
http://www.pgro.uk7.net/qu092902.htm [uk7.net]
Wow. (Score:1)
Re:Paul DuBois, MySQL (Score:5, Interesting)
It seems that a large majority of MySQL users also know little or nothing about relational theory. The MySQL core developers fought a long time against including support for foreign key constraints. Thus not knowing anything about relational theory may not be a drawback in writing a book about MySQL; it matches the target audience, even if it fails to educate.
Re:Paul DuBois, MySQL (Score:2, Insightful)
Re:Paul DuBois, MySQL (Score:2)
This is text file stuff -- comma delimited stuff. And it doesn't help you learn SQL any more than writing a clever perl script helps you learn software engineering.
O'Reilly's worst dud was also about Linux clusters (Score:5, Interesting)
How bad was it? It came with a CD-ROM that was supposed to automate the process of setting up a Beowulf cluster. None of the software on the CD-ROM worked. Running the install script printed out a message telling you to go to a Web site and download the newest beta version of the software. No such software was available ... ever. O'Reilly shortly withdrew the book ... and, reportedly at least, fired the editor who approved it for publication.
Want more details? Here you go. [everything2.com] Waiting for this book, and then discovering slowly just how awful it was, set back a clustering project at my workplace by several months, by the way.
Re:O'Reilly's worst dud was also about Linux clust (Score:2)
Re:O'Reilly's worst dud was also about Linux clust (Score:1, Funny)
Re:O'Reilly's worst dud was also about Linux clust (Score:2)
Re:O'Reilly's worst dud was also about Linux clust (Score:2)
Waited a while, didn't you? That book was withdrawn from sale two years ago. People who wrote to O'Reilly customer service at the time did get free books (an up-to-date copy of Linux in a Nutshell in my case, twice as thick and more useful for clubbing lusers^W^Winforming coworkers with) ... but I don't know if they'd do anything for it now.
Re:O'Reilly's worst dud was also about Linux clust (Score:2)
So much for trusting reviews.........
Re:O'Reilly's worst dud was also about Linux clust (Score:2)
Re:O'Reilly's worst dud was also about Linux clust (Score:2)
Re:O'Reilly's worst dud was also about Linux clust (Score:4, Insightful)
Re:O'Reilly's worst dud was also about Linux clust (Score:1)
OK, so they fired the editor, but not the author? (Score:1, Interesting)
Yet, funny enough, the author still writes a Linux column for O'Reilly. (His most recent entry elicited an endless stream of angry reader responses, calling it "crap," etc.) The whole thing just makes me wonder who is BS'ing who. He almost sounds like one of those guys that magically gets his way out of anything. Can't you just hear this guy promising his great program to automate the clu
O'Reilly's Worst Failure (Score:3, Informative)
In my mind, this is simple -- I have never read a worse O'Reilly book than Building Linux Clusters [oreilly.com]. There is a reason that ORA pulled this book out of print after only 6 months, and haven't even bothered to try to fix it and reprint a new edition. It was basically a commercial for the company the author ran, it read as if it hadn't been edited (spelling and grammar mistakes everywhere, included pictures were of the wrong thing that the text referred to), and the code included was so buggy it wouldn't work at all without a lot of fixing.
This was the first book on Linux Clustering I read, and I was hugely disappointed
Re:O'Reilly's Worst Failure (Score:2)
all the above is true, bought the book, read the book, tossed the book.
O'Reily's Msql and Mysql (Score:3, Insightful)
And of course, it's been a long time since i first read the book, but i don't remember it being unpleasant. Why all the disdain?
Re:O'Reily's Msql and Mysql (Score:3, Insightful)
Re:O'Reily's Msql and Mysql (Score:2)
MySQL & mSQL (Score:1)
Re:MySQL & mSQL (Score:1)
Re:O'Reily's Msql and Mysql (Score:1)
Re:O'Reily's Msql and Mysql (Score:5, Informative)
I own maybe 30 O'Reilly Titles and "MySQL and mSQL" is easily the worst one of the bunch. Here's my take as to why.
I came from an Oracle background (i.e. already understood SQL, relational databases, etc.) and I was interested in 2 things; how to administer a MySQL database and how to do simple access from PHP/Perl.
Now go and pick that book and try and find that information. The description of the MySQL security model is muddled and confusing. No good details on how to do backup and restore. The examples for using PHP/Perl are horrible. The book has several chapters of filler.
A year or two later I bought the New Riders title mentioned in the writeup. It is a massive improvement over "MySQL and mSQL" - read them side by side and you'll see.
One thing that book taught me - just because it is a publisher you trust, don't assume the book will be good. Read it or read a review first!
Re:O'Reily's Msql and Mysql (Score:2)
Ah, so you don't have "Linux Network Administration", then? Otherwise known as "10% of the information of TCP/IP Network Administration for 90% of the price".
TWW
O'Reilly Quality (Score:2)
They must have pretty good editors, because they do avoid the mistakes most computer publishers make, like sloppy revisions that leaves Windows 95 material in a book on Windows NT. And they seem to have put a lot of work into their publication workflow [oreilly.com]. But it's pretty obvious that nobody at
Re:O'Reily's Msql and Mysql (Score:2)
I think the book was great for that audience.
No book is for everyone. In the end, you can only judge it by its success communicating to its intended audience.
Fortunately, with Managing and Using MySQL, we did tackle the needs of people with dba backgrounds and dba needs.
Re:O'Reily's Msql and Mysql (Score:2)
The book was not really aimed at you. It was aimed at open source programmers and hackers getting their first taste of database development.
Ok, I can buy that. Like I said, my bad for not reading the book first or reading a review. I just wanted a book on MySQL and I automatically grabbed the O'Reilly title.
I'm not a DBA, just a programmer that is familiar with databases. I guess for a more "advanced" user MySQL and mSQL is just not a good book might be a fairer statement.
So...what books DO you all recommend? (Score:5, Interesting)
Re:So...what books DO you all recommend? (Score:3, Informative)
Some of the more popular that are used in the bio-sciences are.
http://www.platform.com/ (demo only, but most powerful)
http://www.openpbs.com/ (open source)
http://gridengine.sunsource.net/ (open source)
All these run on linux.
jpc
Other stinkers from O'Reilly (Score:3, Informative)
And I hate it when O'Reilly comes out with a bad book, because generally their books are great.
Easy way to get a cluster up and running? (Score:2, Funny)
Re:Easy way to get a cluster up and running? (Score:3, Informative)
Head over to LinuxNetworx [linuxnetworx.com] now for a LinuxBIOS-ready Evolocity II cluster!
Note: I'm not an employee of LinuxNetworx, but they still kick ass.
Re:Easy way to get a cluster up and running? (Score:1)
Head over to RLX [rlx.com] for a rack of 70 dual Xeon 3.0 GHz server blades all set up with the necessary clustering software and completely managed from a single web browser.
jpc
OK, so who's got a GOOD book on this topic? (Score:5, Insightful)
1. Clustering (not just replicating) MySQL databases.
2. Network attached storage.
3. Load balancing and failover.
4. Probably six other things I'm not thinking of right now.
Anyone got any suggestions?
Re: mysql replication and high availability howto (Score:4, Informative)
Re:OK, so who's got a GOOD book on this topic? (Score:5, Informative)
Eddie: Load Balancing Software
http://eddie.sourceforge.net/
Linux Virtual Server Project: Clustering Tools
http://www.linuxvirtualserver.org/
OpenAFS: Efficient Distributed Storage
http://www.openafs.org/
Load-balancing and failover are tough nuts. You can do some stupid things like Round Robin DNS or Rotary NAT, but to be actual balancing, you need a balancer box. You can either make your own (using proprietary software or the stuff above) or buy a piece of hardware to do the job for you. I've heard Cisco makes some good ones.
NAS units usually operate using CIFS, AFP, or NFS, all of which are pretty lame options for a modern cluster. SANs are pretty cool, but you need some big-ass hardware to support them. Personally, I'm working on an OpenAFS cluster, which is pretty easy if you look into the capabilites of the software. Coda is another option of which I'm not using because it doesn't play as well with Windows.
As for clustering MySQL: If you read the Slashdot interview log they had a couple days ago, you'd see that the setup here is a master writer that replicates to a couple of reader databases. This is about as effective as it gets with MySQL. If you need higher power, I've read that commercial versions of Postgres support clustering/synchronization. More powerful than that and you're into Oracle territory.
Unevenly distributed? (Score:5, Funny)
From the review:
The book is divided into eleven chapters, unevenly distributed among three sections:
That's good news: I would hate to read a fractional chapter.
Re:Unevenly distributed? (Score:1)
mSQL (Score:2)
Re:mSQL (Score:2)
And if something works with mSQL, it is pretty certain that it will run with anything else.
Anyway, its off-topic, but I hope it helps.
If you need socket bindings for mSQL, email me at
fred_weigel at h o t m a i l dot c o m.
Ratboy
Re:mSQL (Score:1)
I agree (Score:3, Informative)
New Riders (Score:4, Informative)
"Digital Texturing and Painting"
"Digital Lighting & Rendering"
New Rider's focus is more on the artist / animator / illustrator side of things - and at that they excel (the above two are those I'm most familar with, and they are excellent).
I'm sure they'll gradually improve their hardcore technical books, but it's stupid to dismiss "all" their books as being bad. Just like O'Reilly has a reputation in some circles for being overly dry and out of date - *some* people find their books useful.
Re:New Riders (Score:1)
Re:New Riders (Score:1)
Don't assume... (Score:3, Insightful)
When you see a book in print, don't assume that the suggestions of the technical editors/reviewers have been heeded. The author basically has final say over the content of the book -- meaning that a tech reviewer/editor can be completely ignored no matter how much they complain about the content of the book or how much it doesn't address what it should.
And, the tech reviewers/editors are explicitly asked not to try to fix grammar and so forth -- that's supposed to be the job of a different editor.
Also... I'm surprised to see a review of this book popping up now, it was published about a year ago.
Re:Don't assume... (Score:2)
Re:Don't assume... (Score:2)
Tech publishing is pretty slipshod with some of the major publishers. Their project editors and acquisitions editors aren't experts in the field, so they have no way of knowing whether the author knows what they're talking about or not. The authors sometimes recommend their own te
huh? (Score:2)
From the review: (but ignorance is no excuse for a UNIX systems administrator)
That's right. Also,
Ignorance is no excuse for the law.
why are negative reviews necessary? (Score:2)
I think negative reviews can be helpful... (Score:1)
However, I do agree that it should be a requirement that if you leave a negative review, you should also show an alternative title.
Imagine... (Score:3, Funny)
Interesting timing (Score:1)
On the funny side, my director started the conversation with "have you ever heard of a Beowulf Cluster"? He didn't understand why I couldn't stop laughing. I obviously spend waaaaaay to muc
Linux Magazine June Issue is Clustering (Score:2, Informative)
"O'Reilly's worst dud: MySQL & mSQL" (Score:4, Interesting)
MySQL&mSQL was my first O'Reilly book, back in my old days in school. I spend many nights reading it and many classes trying out the things I read.
I still like it although it has become completely outdated now (at least my edition).
Maybe I should have a look at the
Re:"O'Reilly's worst dud: MySQL & mSQL" (Score:2)
In actuality, the book got a very good /. review. In fact, reviews on it tended to be fairly schizo. People either loved the book or they did not like it at all. I think this reflects partly the fact that we tried to deal with MySQL and mSQL together (a mistake, though it does reflect that at the time the market is was in transition from mSQL to MySQL) and that the book was aimed at MySQL pr
Re:"O'Reilly's worst dud: MySQL & mSQL" (Score:2)
You're right that tackling both mSQL and MySQL was a bad idea in retrospect, but again, like you said, at the time it reflected market conditions.
Thanks!
O'Reilly's MySQL book isn't so bad (Score:1)
In other news... (Score:1)
---
A) A linux cloister.
chaos-five
There is no good linux cluster book... (Score:3, Interesting)
So I read online, whatever I found that was up to date and settled on the satisfying OpenMosix [sourceforge.net] and... it works ! :-)
A solution for simple hight availability services (Score:2)
Pen [siag.nu], a load balancer for "simple" tcp based protocols such as http or smtp. It allows several servers to appear as one to the outside and automatically detects servers that are down and distributes clients among the available servers. This gives high availability and scalable performance.
Re:Would you prefer RIAA math? (Score:3, Funny)
Gah! (Score:1)
Do you "get it" now, or do we need to lock you in a room with RMS?
Re:Gah! (Score:1)
No, I believe the replies have proven that the original poster was nit-picking just for the sake of nit-picking.
I read "unevenly distributed among three sections" to mean there's probably not an equal amount of coverage for each section. The statement was not necessarily redundant...it depends on how you read it.
I mean, I appreciate a clever jackass comment as much as the next geek...but you
WOW, what a cool sig... (Score:1)
Can I barrow it?
Re:Would you prefer RIAA math? (Score:2)
Re:Would you prefer RIAA math? (Score:2)
Oh well, I guess he's going to be buying the first round of drinks on Friday evening then.
Re:Oh please! (Score:3, Insightful)
His complaint was the lack of editing made it almost unreadable. I.e., re-reading a sentence several times to figure out what it said.
Good grammar and spelling aren't "windowdressing". They are essential for easy reading.
Re:Oh please! (Score:2)
Re: (Score:1)
Re:Oh please! (Score:3, Insightful)
Is this true? (Score:4, Insightful)
Re:FUNNY MODERATION EXPOSED! (Score:1)
Sheesh, Buddy, it's in the FAQ [slashdot.org].
Re:Fault tolerant Linux clusters? (Score:1)
Obligatory disclaimer: I have no connection to veritas other thanusing their stuff.
Re:What did you say? (Score:1)
Exactly. I can't believe this guy is complaining about a murky writing style while at the same time f-ing up a simple introductory paragraph.
Bah!
Thanks for the lesson (Score:2)
1. don't mention Beowulf again, even in irony
2. don't feed the trolls
3. don't mention moderation
4. don't comment on anticipated future negative comments
5. don't annoy
6. Last and most important: don't detract from the story or conversation
It was unintentional and I wasn't looking for replies, but I can see I made quite a number of mistakes in one post. It was a first and a last post of its nature from me. Not to be repeated.