Are There Large RDBMS Using Linux?
Jason Perlow of Linux Magazine writes:"
With all of the recent computer press coverage of Amazon and Intel converting their
web servers and other front end application servers to Linux, many of these stories
neglect to mention that the back end systems these companies use still rely on
commercial Unixes like Solaris, AIX and HPUX to host their RDBMSes (Oracle, DB2,
Sybase, Informix) for their mission critical transactional applications and data
mining.
Are there any companies out there actively using Linux to host a mission-critical
RDBMS, or looking to replace UNIX with Linux for this purpose?"
Shareholders... (Score:1, Insightful)
Plus, senior IT execs need reliable support and assurance that they got the best software in the market for the job, just in case things go wrong. It's a liabilities thing.
Re:Shareholders... (Score:2, Insightful)
Since when has using financial resources intelligently 'look[ed] bad on shareholders'?
Your second point could point out a strong market opportunity for consultants.
Of course, that consultant market diminishes the cost savings of using open source applications.
However, when a particular open source database is as ubiquitous as, say, TCP/IP, it strikes me that _savvy_ shareholders would view its use as a strength, as the company reduces the heroin-addiction-like lock-in of, say, SQL Server.
Re:Shareholders... (Score:2, Interesting)
Just as with the DB, if things run smoothly then everything is ok
Re:Shareholders... (Score:2)
Not at all. Since the use of a consultant comes first and the solution follows, the consultant fees were a given regardless of the recommended solution so the savings are still there. However, consultants implementing Linux solutions may find themselves at a disadvantage when their proposal comes in significantly lower and is viewed with skepticism since everyone else was in a certain (higher) range. One way to combat that is to propose high and then come in well under budget for the first job or two.
Re:Shareholders... (Score:2, Insightful)
you have never read any of the licenses have you? shame on you.
Re:Shareholders... (Score:2)
Only if you've bought RHAT or LNUX
Plus, senior IT execs need reliable support and assurance that they got the best software in the market for the job, just in case things go wrong. It's a liabilities thing.
It's not about things going wrong, and don't kid yourself that it is. After all, there are plenty of organizations offering support on Linux; even IBM will do so (if you pay them enough). It's a matter of the right tool for the job, and at the high end Linux trails behind commercial Unix implementations like Solaris and AIX, which are tightly integrated with their hardware and have solid high-performance capability, for example Solaris' threading and logical domains.
Re:Shareholders... (Score:3, Insightful)
Re:Shareholders... (Score:2)
I certainly would not run MY company's mission-critical systems on www and newsgroup support offerings. These are great and all, but when something goes wrong and you cannot figure it out, you NEED to have someone to call to get your system up as quickly as possible. As expensive as Tech Support is, it is not nearly as expensive as leaving the servers down while you research the problem. What you do instead is get a contract with a firm that will be available 24x7 in case you have problems and will come out and work on it.
Enterprise != Consumer (Score:2)
The web will answer any common question about common software. But if you're doing something even a little off the beaten path, you can run into problems that haven't been solved yet. This can be very dismaying when you're under time pressure. Commercial Unix vendors will actually investigate the problem and issue a patch if necessary.
Re:Shareholders... (Score:2)
I used to agree with you until I discovered that proprietary software documentation is often less usable than the documentation for major OSS projects.
I do agree with you about www support BTW. It is NOT for mission-critical applications, but good support is available even on open source projects for a fee. Microsoft says it's a very high fee (they don't even pull the no-support card because they know it's BS), but compare Red Hat's enterprise support to Oracle's... And their support reputation...
Re:Shareholders... (Score:2)
because the documentation I get with my proprietary apps is so smegging useful.
Well, I'm using a Linux solution completely. (Score:2, Interesting)
Re:Well, I'm using a Linux solution completely. (Score:2)
Prada uses Linux (Score:5, Informative)
I don't know what size the database is, but the Clariions had 400GB each worth of disks.
Re:Prada uses Linux (Score:2, Informative)
(I'm a former DGer.) I was under the impression that EMC bought DG exactly so they could get their hands on that "moldy old DG crap". Certainly the Aviion line wasn't that attractive. If the EMC salespeople don't want to sell a purchased company's products, that might be a post-acquisition integration (or higher-commission) issue rather than any reflection on the quality of the product.
Plus, I think it's still under active development, so "moldy old" doesn't really apply, either.
John
It'll change (Score:5, Informative)
As distributions like SuSE continue pushing ahead with high-end features (like logical volume managers, which SuSE already has), usage of these products on Linux will undoubtedly increase. Part of the situation here is cost. When Oracle Enterprise Edition costs $40,000 per CPU, plus another $8,000 or so per year for support, who cares about spending a little more for high end Sun or IBM systems?
Also, Oracle 8i, while supported on Linux, did not offer a couple of features found in Oracle 8i for other systems. In particular, full interMedia support for full-text searches of all sorts of documents (especially from software made in Redmond) was not available in the 8i Linux version. The new 9i does support this feature under Linux.
Re:It'll change (Score:2, Informative)
Re:Informative? (Score:3, Insightful)
Clustering dozens of separate boxes doesn't help for most applications. If you have some high-volume (w.r.t. transactions) tables you can really forget distributing those over separate boxes.
Have you really ever tried running large databases (in size and transaction volume) on any kind of x86 configuration?
You forgot a question... (Score:4, Insightful)
I like Linux, but on the scalability front, it's still got a ways to go. Moreover, since most Linux used by corps (at least here) is Intel based, you've got to deal with less mature hardware (backplanes, redundancy, etc.). Plus the enterprise management tools required are only starting to appear for Linux.
*climbs into his asbestos underwear to wait for the inevitable jihad*
Real Unix is a lot better for mission Critical. (Score:4, Insightful)
Re:Real Unix is a lot better for mission Critical. (Score:2)
I find it interesting, however, that a general purpose implementation of UNIX such as Linux is even playing in the same ballpark. The stability, utility, affordability, ubiquity and availability of Linux make for a strong argument for its use in many situations.
IBM DB2 case studies (Score:1)
Re:IBM DB2 case studies (Score:2, Funny)
Case studies on ibm.com that favourably argue the use of Linux and DB2 on S/390? I'm going to buy one right away! If IBM sez it is good for me to buy their stuff then who am I to argue!
;-)
</FLAMEOFF>
only the question or the answer can be known (Score:1)
What are the largest Free Software Database sites? (Score:4, Insightful)
Any war stories?
How about building Redundant Arrays of Inexpensive Database Hosts?
Re:What are the largest Free Software Database sit (Score:2, Informative)
Re:What are the largest Free Software Database sit (Score:3, Informative)
Running MySQL 3.23.40 on a 1.4GHz Athlon with 1GB of RAM and an 18GB 15krpm SCSI drive, the system is doing ok, but it's starting to feel the load peaks. I'll be upgrading to RAID fairly soon, which should help things.
All in all, I'm very happy with MySQL, but I'm strongly considering a move to Postgres, because the lack of row-level locking is starting to become a problem. Stability has been no problem... no crashes, no data corruption, nothing.
I'm sure this is in no way one of the largest installations of free software databases, but I thought I'd post my experiences anyway.
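To make the row-level locking complaint above concrete, here is a toy model, not a description of how MySQL or PostgreSQL actually implement locking. It assumes a burst of writers all arriving at the same instant and simply counts how many have to wait under each lock granularity. The function name and the sample row ids are made up for illustration.

```python
from collections import Counter


def blocked_writers(row_ids, granularity):
    """Count writers that must wait when all requests arrive at once.

    row_ids: the row each concurrent writer wants to update.
    granularity: "table" (one lock for the whole table) or "row".
    """
    if not row_ids:
        return 0
    if granularity == "table":
        # Only one writer holds the table lock; everyone else queues.
        return len(row_ids) - 1
    if granularity == "row":
        # One writer per distinct row proceeds; only same-row writers queue.
        return sum(count - 1 for count in Counter(row_ids).values())
    raise ValueError(f"unknown granularity: {granularity!r}")


# Hypothetical burst: five writers, two of them hitting row 7.
burst = [3, 7, 7, 12, 40]
print("table-level waiters:", blocked_writers(burst, "table"))
print("row-level waiters:  ", blocked_writers(burst, "row"))
```

Under these assumptions the table lock makes every writer but one wait, while the row lock only delays writers that collide on the same row, which is why the problem tends to show up at load peaks first.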
Re:What are the largest Free Software Database sit (Score:2)
Re:What are the largest Free Software Database sit (Score:2)
I don't want to upgrade to 4.0 (which is still in alpha) just yet, but I believe I'll compile 3.23.44 with InnoDB support and give it a shot. Hopefully, the upgrade is as easy as you say. Any hints/tips/caveats or possible problems you've run into would be helpful.
Why not start with PostgreSQL? (Score:2)
Why? Surely one of the point-and-click interfaces to PostgreSQL would make more sense?
OK, good reasoning (Score:2)
Give it time (Score:2, Informative)
I think we're going to see things change gradually as acceptance grows. Don't rush things. People will move when they're ready and the trust is there. Red Hat's worth watching. And it doesn't have to be big vendors, as so much less functionality is needed in the DBMS in these days of N-tier and appserver-based infrastructures.
And how about designing FOR failure and using commodity boxes (running a free OS?) at the same time? Check out Clustra [clustra.com] for a RDBMS that runs on Linux & Solaris, runs over LOTS of small, cheap commodity boxes, and is as a result, very reliable (yes, I do use it). Ok, so it's not free in any sense, but it's good and solid, and used by some big players in the telecoms industry.
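A rough back-of-the-envelope for the "lots of small, cheap boxes" argument, as a sketch only. It assumes boxes fail independently and that any single surviving box can keep the service up, which real clusters only approximate (shared switches, power and software bugs break the independence assumption). The 99% per-box figure is a made-up input, not a measurement of Clustra or any other product.

```python
def cluster_availability(node_availability, nodes):
    """Probability that at least one of `nodes` independent boxes is up."""
    if not 0.0 <= node_availability <= 1.0:
        raise ValueError("node_availability must be between 0 and 1")
    if nodes < 1:
        raise ValueError("need at least one node")
    # The cluster is down only if every node is down at the same time.
    return 1.0 - (1.0 - node_availability) ** nodes


# Hypothetical commodity box that is up 99% of the time.
for n in (1, 2, 3, 4):
    print(f"{n} node(s): {cluster_availability(0.99, n):.8f}")
```

The point of the sketch is the shape of the curve: each extra box multiplies the remaining downtime by the per-box failure probability, so cheap hardware can add up to a reliable whole if the software handles failover.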
This wouldn't make sense (Score:5, Insightful)
Furthermore, there are no potential cost savings. Solaris essentially 'comes with' an E1000, and all your administrators are trained in Solaris.
Re:This wouldn't make sense (Score:2, Insightful)
I also keep in mind that when I apply a kernel patch to Solaris, I don't have to worry about getting little surprises like a completely redesigned and poorly tested VM subsystem, because the Solaris development team were stuck in a mailing list flamewar.
Re:This wouldn't make sense (Score:2, Funny)
Or because a team member was actually getting laid that week...
Re:This wouldn't make sense (Score:3, Interesting)
And the completely redesigned VM in Linux 2.4.10 meant that my cluster had a speedup of 1-2 orders of magnitude (heavy swap vs virtually none). And all of my systems running 2.4.10 (12 of them) have been up since the first boot. I'd rather have them fix things in the kernel than use a band-aid approach.
>because the Solaris development team were stuck in a mailing list flamewar.
Are you in elementary school? You seem to have no grasp of corporate politics. First, I wouldn't call it a real flame war. I read the mailing list every day, so I'm familiar with what is going on. Second, one difference between open source projects like Linux and proprietary ones is that the discussions are public; proprietary projects can have real warfare going on, but the conflict might not be known to the public. I bet you that the fight at Sun going from SunOS to Solaris would make any lkml flamewar look quite tame.
-asb
Oracle's on Linux... (Score:5, Insightful)
That has since changed. They are now using Suse Linux for all of their appliances. They work fairly well for what they are designed to do, which is to provide an administratively simple appliance... you don't deal with the OS, only the Oracle admin interfaces.
Looking at my client list, 4 out of 12 of them are running various Oracle instances in Production on Linux, both Suse (the only officially Oracle supported Linux distro, if I'm not mistaken) and Red Hat. 9 of those 12 run Linux in development environments.
While the Linux deployment has usually been in a development environment, I've seen the trend start to move into Production environments. I think this can be attributed to a number of factors; the maturity/stability of Linux, the cost (hardware and software), the feature set (journalling file systems without having to pay through the nose for Veritas), and the hardware availability.
That and the fact that Oracle offers support for Suse. That is HUGE.
While the bigger companies are still using Solaris and HP-UX for their Oracle needs due to the hardware involved (I have yet to see an E10K run Linux, never mind in production), most of the smaller companies I deal with are running Oracle on Linux in some part of their company.
Also, a number of Oracle's newer integrated development tools (JDeveloper, Enterprise Manager, etc.) are being ported to be 100% Java so that they will (and do) run on Linux.
Re:Oracle's on Linux... (Score:2, Informative)
Re:Oracle's on Linux... (Score:2)
Didn't know about it... I stand corrected.
:)
Re:Oracle's on Linux... (Score:2, Informative)
Momentum... (Score:5, Insightful)
Aside from this, many of the main databases (including almost all the mission critical stuff) here are on HP systems. Despite HP's uncertain future (having ditched PA-RISC), I doubt they'll move from HP in the near future.
Now take this reluctance to move between mainstream Unix vendors and apply this to linux, the upstart on the block. Quite aside from the "free" nature of linux and perceived lack of accountability, there's a further issue. Even when sticking with mainstream vendors, there's a reluctance to mix vendors; i.e. there's a desire to use IBM software on an AIX box, simply to avoid the finger pointing that can ensue. IBM have even had ad campaigns based on this. There's a certain comfort factor in knowing that you can go to one vendor and say "fix this" which you don't get with linux on Intel. IBM, HP and Sun all make the hardware and OS; you don't get that with linux (with the potential exception of some IBM kit like the S/390).
To get over this, there need to be vendors willing to support the software and hardware side of a linux solution. Hopefully IBM will pave the way with things like S/390 and the zSeries server.
Re:Momentum... (Score:2, Insightful)
Well, that's not true, it is just different. Why not go to the core developers and offer them some money to fix something or add a feature you would like? I think this system would be far better than complaining through 20 levels of incompetent tech support to finally get the message that it "will be fixed in the next release". I've never heard of anyone getting some software bug fixed by going to Oracle or Microsoft or whoever else and saying "fix this". Hell, I remember a recent article (sorry I can't find it now) where a CIO was relating all the massive problems he was having with Oracle (the company) fixing his software - and they were a multi-million dollar client.
It would be nice if sites like SourceForge were set up so that the development group could accept donations or payments for bug fixes or add-ons. This was a great oversight.
Re:Momentum... (follow up) (Score:2, Informative)
Note that the UCITA and DMCA make it even more difficult - actually almost impossible - to sue your software vendor.
So WHY does everyone keep repeating this mantra that you can "at least sue your vendor" with proprietary software? YOU CAN'T. And how is a contract with a closed source vendor any more legitimate than a contract with an open source one?
Re:Momentum... (follow up) (Score:3, Interesting)
Would we have had this if the software package was from Sun? Well, Sun might have blamed IBM, IBM might have blamed Sun and we'd be left with something which doesn't work. We've been lucky in that IBM want this to work to secure future business, and that is the carrot you can use to 'bribe' vendors to fix bugs.
While open source allows you to track down the bugs and fix them yourself, it relies on you hiring programmers and/or smart admins. Many companies don't want to do that, particularly when you can get the people who wrote the code to fix it (whether you can get them to fix it or not is a different matter; managers' perception is that you can and that's what affects buying decisions).
As for suing, it depends on the terms of the contract. A large enough business should be able to negotiate special terms with vendors to secure business (don't play ball with us, you don't get our money). If a company wants to be bullish enough, it can negotiate terms that do allow it to sue the company, even with UCITA and DMCA. Unless I'm mistaken, those acts mean that vendors are allowed to put horrible restrictions on sale of software etc. They don't say that individual purchasers can't negotiate a better deal.
One final point. I'm not saying this to say "linux is doomed, it's never going to make it". I have great hopes for linux (in my last job, I made a lot of use of open source software to good effect), but there are still a few things to be ironed out before big companies are going to adopt it in a large scale. Half of what I'm doing here is playing devil's advocate because I like a good argument (NB: argument != flame-fest!).
Re:Momentum... (follow up) (Score:3, Funny)
A friend of mine was consulting many years ago with a large financial firm, helping them to maintain their Netware 3.x servers (as you can see, it was several years ago). They had a tape backup system in house from one of the really large vendors that was not working.
They went for like a month where they could not get good reliable tape backups on the servers, and played phone tag with the vendor trying to figure out the problem. Just wasn't working.
Anyway towards the end of the month, my friend griped to the CIO about the problems they were having and his frustration with dealing with the vendor. The CIO brought up the issue at the board meeting and how it was a risk to the company.
At this point the VP of trading piped up... "You know, we own several million shares of that company in our portfolio... let me see what I can do"
VP of trading calls up the President of the vendor company, tells him that if they don't fix the problem with the tape backup software he's going to issue a warning about the company's product quality and dump every single share of their stock on the market.
The next morning a team of developers were flown in and working on the problems. They had to recompile several modules, but they had the issues resolved within two days.
I guess the point is, there are many ways you can leverage a vendor. It doesn't have to be a lawsuit.
As larien said, usually you just threaten to not pay the contract, or not renew. Or add stipulations as part of the negotiation. I've been involved in many an instance where that has played a huge part in getting better support.
Once I had some issues with a GIS package we had purchased. I tried to work with support, and they ignored me. So when the $5k yearly maintenance agreement came up, I told my boss not to pay it because it didn't gain us anything. I also posted a note to a usenet group explaining my problem.
Next day I got a phone call from the development manager.
Financial incentives are the strongest leverage you can have with a software vendor. Like it's been pointed out... that doesn't work with Open Source in quite the same way.
Re:Momentum... (follow up) (Score:2)
Yeah, this is basically why Sun and Oracle have a special support thing - basically Sun will support both its stuff and Oracle's (and Veritas too if you're using that) with just one number to call for all of it. "One throat to choke" as Scott McNealy calls it.
However, I guess DB2 on Sun hardware is too small to do the same thing... (they'll push you to migrate to Oracle instead I guess)
Re:Momentum... (Score:4, Insightful)
We did it at my old job (Score:2, Informative)
We were also an AIX shop, but decided to go with Linux for this application because of the overall price of hardware and supporting applications.
Yes, we use Oracle 9i (Score:5, Informative)
At $5,250 for just a 2-year, single-processor Standard Edition license, 9i is not cheap, and most companies that already have an infrastructure built on it will not always realize a significant cost savings by moving to a Linux platform. 9i Enterprise Edition is a cool $45K per processor, so it is easy to see how the difference between $20K and $100K for an 8-way Intel versus an 8-way Sun machine may not always be the determining factor in a platform decision for a system with a 5+ year time horizon.
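The arithmetic behind that last point can be worked through with the figures quoted in the post. This is only a sketch: it assumes all 8 processors are licensed at the quoted Enterprise Edition list price with no discount, and it leaves out support contracts, storage and staff costs, none of which the post gives numbers for.

```python
LICENSE_PER_CPU = 45_000   # 9i Enterprise Edition per processor, as quoted in the post
CPUS = 8                   # both machines in the comparison are 8-way
HARDWARE = {"8-way Intel": 20_000, "8-way Sun": 100_000}  # prices quoted in the post


def total_cost(hardware_price, cpus=CPUS, license_per_cpu=LICENSE_PER_CPU):
    """Hardware plus per-processor licenses; nothing else is modelled."""
    return hardware_price + cpus * license_per_cpu


totals = {name: total_cost(price) for name, price in HARDWARE.items()}
for name, total in totals.items():
    print(f"{name}: ${total:,}")

saving = totals["8-way Sun"] - totals["8-way Intel"]
print(f"saving: ${saving:,} ({saving / totals['8-way Sun']:.0%} of the Sun total)")
```

With these inputs the license bill is $360,000 either way, so the $80,000 hardware gap comes to roughly 17% of the Sun total: noticeable, but not the dominant line item.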
not a company, but.. (Score:4, Interesting)
hmm (Score:5, Interesting)
i myself am in the datawarehouse of a large international company, our DWH is run off IBM as400's with DB2 + essbase/hyperion.
there are several factors why there will be no change in this.
IBM offers complete integrated solutions (HW+SW) that you dont get with opensource solutions.
the opensource rdbms cant compete with the likes of DB2 and Oracle in terms of scalability and features.
3rd party integration. (Essbase/Hyperion) database cube solutions dont exist for linux/freebsd. (man 3d cube db's are funky)
stable cross platform ODBC drivers, (winnt drivers for ASP, JODBC java+websphere, AS400 + RS6000 drivers)
support. (who gives 24/7 support on postgres, and sends out tech support guys for consultations who will come on site on a sunday at 4am?)
what OpenSource rdbms provide true multi language support (we have records in cyrillic, japanese, american, german, etc)?
high availability (i dont know the current state of HA functionality in the linux kernel)
Linux on the AS400 is not seen as coming anywhere near the requirements at present, and its opensource database solutions are the same.
(and i dont even think there is any cube database products in the opensource area... ???)
Re:hmm (Score:4, Informative)
RedHat either already does or will soon.
"what OpenSource rdbms provide true multi language support (we have records in cyrillic, japanese, american, german, etc)?"
PostgreSQL [postgresql.org].
"high availability (i dont know the current state of HA functionality in the linux kernel)"
Why not look it up? [redhat.com]
Re:hmm (Score:2, Insightful)
// dijit
a few success stories (Score:3, Informative)
In addition to the links above, most of the big database systems have active Linux ports. Any Oracle [oracle.com], Sybase [sybase.com], Informix or DB2 [ibm.com], InterSystems [intersys.com], Poet [fastobjects.com], or Versant [versant.com] customer is a potential Linux customer.
Linux and Oracle 9i RAC (can't beat the price) (Score:5, Informative)
We have had success using Redhat 7.1 (upgraded kernel to use LVM) and Suse 7.2 (comes w/LVM) for the linux distribution. Do not attempt RAC or OPS without an LVM of some sort. It can be done, but it shouldn't.
The biggest expense you will have is the disk array, and you should not skimp on this. Buy fast reliable maintained disk.
The Linux solution beats out Sun solutions in price hands down. You are talking $30,000 per box for the minimal Sun allowed hardware requirement for the Sun Cluster software with the Oracle Parallel DB runtime licenses (this has changed with v3 and so have the hw requirements). The Sun Cluster software requires an extensive review process by Sun which basically ensures your company has two extra of everything and can be onsite to help Sun with their software and hardware in 4 hours. If your company doesn't have its shit together, Sun and the few vendors that even know what Sun Cluster is aren't even going to bother talking to you about it.
This Linux solution beats out a Windows NT solution in reliability for the simple reason that disk and volume management on Windows is clumsy. There is no easy way to create labeled raw devices on a Windows machine. The process, as I remember it, was creating unlabeled logical partitions for each disk space and then maintaining a file pointing to the value of the related registry key to map out the tablespaces. As soon as you added a partition, modified a partition, or even used another node to look at the partition table, you and the database were screwed (i.e. restore). This problem with managing shared disk may have been fixed in 2000.
The weakest point in the entire Oracle 9i RAC is the cluster software layer, whether you are using Sun's Cluster Software, the Oracle supplied cluster manager for Linux, or the hardware vendor supplied OSD layer for Windows. Be prepared to spend serious time in monitoring and getting it under control with appropriate patches.
Once you have fought your way through all of this you can reap the rewards that multiple nodes with shared data give you. The greatest benefit is the ability to partition your data and your application, which allows you more opportunities to scale. If your data does not partition by some logical means (date, timezone, city, planet, etc) forget about it. Just get a big honking database machine (especially you SAP/Peoplesoft poor SOBs).
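As an illustration of what "partition by some logical means" can look like on the application side, here is a minimal sketch that routes work to a node by the month of a record's date. It is not Oracle RAC configuration or any Oracle API; the node names and the month-based rule are hypothetical, chosen only to show that the same key always lands on the same node.

```python
from datetime import date

NODES = ["node-a", "node-b", "node-c"]  # hypothetical node names


def node_for(record_date, nodes=NODES):
    """Pick a node from the record's month so one month's data stays on one node."""
    # Number the months consecutively, then spread them round-robin over the nodes.
    month_index = record_date.year * 12 + (record_date.month - 1)
    return nodes[month_index % len(nodes)]


for d in (date(2001, 1, 15), date(2001, 1, 31), date(2001, 2, 1)):
    print(d.isoformat(), "->", node_for(d))
```

If no key like this exists, requests for the same rows end up scattered over every node, which is the case where the post's advice to buy one big machine applies.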
Re:Linux and Oracle 9i RAC (can't beat the price) (Score:2)
With the possible exception of IBM you'll probably not see Linux running DB clusters on those platforms, as the companies have put so much time and money into optimizing their own cluster technology. The "big iron" is still proprietary space for now, and the big databases need big iron unless they're the digital equivalent of scrapyards (where the most common activity is rusting).
That's not to say Linux isn't ideal for the space you're in -- as a better alternative to NT on commodity boxes. Many of today's "small-to-average" databases run in the 100-500GB range, and dwarf the "large databases" of not-so-long-ago. It's just that today's VLDB are measured in terabytes, headed for petabytes, and expectations for response times are shrinking. Our ambition grows with our grasp.
Re:Linux and Oracle 9i RAC (can't beat the price) (Score:2)
But who says you need a Sun cluster anyway? Couldn't one just get a single Sun box? The Sun model seems to be that you get a big machine, and scale it up by pouring CPU juice into it.
Of course, the big-iron machines from Sun are fairly pricy though -- I agree with your main point, that Sun is expensive.
Oh, just the UK's largest ISP... (Score:2, Interesting)
Re:Oh, just the UK's largest ISP... (Score:3, Interesting)
The mail system used to be on Linux (presumably still is), DNS on Solaris and a heap of NT boxes for customers' websites. Things might have changed since I last worked there, but I don't remember a large Linux database anywhere.
Large? (Score:5, Insightful)
The much more difficult problems are availability (i.e. 7x24, runs for years with no interruption) and throughput.
When you combine these constraints to specify the problem of a large, highly available and highly active database that meets ACID test criteria, you have an enormously difficult problem. Until recently, with the advent of Linux on mainframes, Linux couldn't even dream of playing in this space simply because of the hardware it ran on. Sure, lots of people have Linux boxes that have uptimes for years, but some people have had to reboot because of a bad hard disk or other component. It doesn't happen very often, but it does happen. And the I/O bandwidth hasn't been there to support the kind of throughput needed at the high end.
Linux on mainframes doesn't really change this at all in the short term, even if you have a proven DBMS like Oracle (forget MySQL or Postgres), because the system as a whole hasn't proven itself. Question: How much money does an airline lose if its reservation system is down for a few hours, even if it happens once every several years? How much money does a financial institution lose by being unable to execute transactions for even an hour? Answer: enough to buy plenty of proprietary software. People who run these kinds of applications are willing to pay the price for systems with a track record of success in this demanding area. They are often willing to sacrifice certain kinds of sophistication to ensure the safety of their company's critical operations.
I think that once Linux is established on the kind of iron that is needed for these applications, it will take as much as a decade before people will trust it for these kinds of missions. Phrases like "mission critical" are bandied about so they have little meaning; Linux is ready to support many applications that are important to businesses today, but can't be entrusted with other ones yet.
Nobody with a working application of the type I describe here is going to migrate to Linux. Nobody starting such an application from scratch will give more than a moment's consideration to Linux. The most likely entree into this space will be evolution of an application from something that is reasonable to host on Linux on small to midrange computers. If the company doesn't have the resources or the time to migrate to something more reasonable, then Linux will begin to get its shot at proving itself.
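The airline and bank questions above come down to simple arithmetic, sketched here under loud assumptions: availability is treated as a plain fraction of the year, and the loss-per-hour figure is a hypothetical placeholder, since the post gives no real numbers.

```python
HOURS_PER_YEAR = 365 * 24


def downtime_hours_per_year(availability):
    """Expected hours of outage per year for an availability fraction such as 0.999."""
    if not 0.0 <= availability <= 1.0:
        raise ValueError("availability must be between 0 and 1")
    return (1.0 - availability) * HOURS_PER_YEAR


def annual_outage_cost(availability, loss_per_hour):
    """Expected yearly loss, assuming every hour of downtime costs the same."""
    return downtime_hours_per_year(availability) * loss_per_hour


HYPOTHETICAL_LOSS_PER_HOUR = 1_000_000  # placeholder, not a figure from the thread

for availability in (0.99, 0.999, 0.9999):
    hours = downtime_hours_per_year(availability)
    cost = annual_outage_cost(availability, HYPOTHETICAL_LOSS_PER_HOUR)
    print(f"{availability:.2%} available: {hours:.2f} h down, ${cost:,.0f} lost per year")
```

Plugging in whatever hourly loss applies to a given business shows quickly whether the gap between two platforms' track records is worth more than the license savings.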
You can use Access (Score:3, Funny)
ODBC Socket Server is an open source database access toolkit that exposes Windows ODBC data sources with an XML-based TCP/IP interface.
It has clients for PHP, Perl, C (in Windows, Mac, and Linux), Java.
Re:You can use Access (Score:2, Informative)
Re:You can use Access +1 Funny (Score:2)
Two real world examples involving IBM and DB2 (Score:3, Insightful)
"At the Wimbledon Tennis Championships, Linux, dB2, and Netfinity servers make it possible to offer real-time information on scores to fans around the globe. Last year, over 914 million web hits occurred during the games, requesting scores and statistics."
"ERP Central is a portal for ERP consultants. They offer ERP news, job postings, and other information, but their big 'traffic builder' is a free time and expense tracking program which users can access to maintain their schedule information and submit it back to their offices from the site. Linux hosted and built on top of Websphere and dB2, the application can scale to handle over 100,000 users and organizations whose consultants use the software estimate that it saves them 75% in time savings, an average value of $500,000 per organization per year."
JK
No sales force for Linux. (Score:3, Insightful)
We have a need for a new DB system
What systems are available?
Schedule meetings with the sales people from the various vendors, so that we can compare what's out there.
Boink! That's where Linux bounces up against the wall of established companies... except for a smattering of VARs, nobody is there to "attend the meeting" to tout Linux's praises to the big boss... except for the internal sysadmin and/or program managers, who then have to plug the stuff as a better alternative to the established vendors. So, IMHO, for corporate usage, it's not about what the OS can do, it's all in the selling of it.
Now if you'll pardon me, I have to go to a meeting where a big storage vendor will be showing us their wares. Really. ;)
Consider what defines a "large" DB (Score:3, Insightful)
A high number of simultaneous users will require more processor/CPU capacity. A high number of concurrent users (with a low number of simultaneous users) might not require much processor capacity but will likely require more memory capacity due to the number of concurrent connections (each connection having some amount of its own memory).
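The distinction above can be put into a small sizing sketch. Everything numeric here is a made-up input: the per-connection memory figure and the two example workloads are assumptions for illustration, not measurements of any particular database.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    concurrent: int     # connections held open at the same time
    simultaneous: int   # connections actually running a query at one instant

    def connection_memory_mb(self, mb_per_connection):
        """Memory held by open connections, busy or idle."""
        return self.concurrent * mb_per_connection

    def active_ratio(self):
        """Share of open connections that are doing work right now."""
        return self.simultaneous / self.concurrent


# Two hypothetical shapes of a "large" database.
mostly_idle = Workload(concurrent=2000, simultaneous=20)
always_busy = Workload(concurrent=200, simultaneous=180)

for name, w in (("mostly idle", mostly_idle), ("always busy", always_busy)):
    print(f"{name}: {w.connection_memory_mb(2)} MB for connections, "
          f"{w.active_ratio():.0%} of connections active")
```

The first workload is memory-heavy and CPU-light, the second the reverse, so "large" needs to say which of the two is meant before a platform can be judged against it.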
yes, we use Oracle 8i on Linux (Score:4, Informative)
None of the databases are gigantic - 80GB is the largest, but we haven't had any problems at all. If anything, most of these databases used to be on Tru64 (Digital Unix before that) and we had a lot of problems (although they were probably hardware related). Also - users have reported that performance is better (not that it was a real issue before) but we've never bothered/attempted to document that.
I can't say that the main factor for the move was money (although it was a factor) - after all, if you can afford the Oracle licenses you probably should not be cheap with the hardware/OS but we've had a whole lot of RH Linux for other applications and it just made sense to consolidate.
Re:yes, we use Oracle 8i on Linux (Score:2)
And it's been running smoothly ever since, so we have not gotten back to upgrade the kernel. If it ain't broke, why fix it I guess? Next time the box's hardware will need to be sent back (we tend to lease our hardware rather than buy it) it will get an upgraded configuration all around...
Progress Database on Red Hat (Score:3, Interesting)
This is not the right question to ask (Score:4, Interesting)
Instead of just asking a question that is almost guaranteed to pat ourselves on the back, we need to also ask for descriptions of the conditions under which people are using Linux for RDBMSes. That is, before the answer to "are you using Linux" can be properly interpreted, we also need to know answers to questions like: How many connections? How many users? What size of a database? What kind of availability do you demand? What kind of information is being stored? How big is your staff? How big is your budget?
After all, knowing that a company uses Linux to host Postgres/MySQL tells us nothing if the company can't afford to buy a Sun box/Oracle license in the first place.
Re:This is not the right question to ask (Score:2)
Excellent point!
we also need to know ... How many connections? How many users?
Also: How much money in financial transactions does your company stand to lose if the server is offline for an hour? What other implications are there of such an outage? How much money would be lost, and what other implications are there, if an hour's worth of data is lost completely? How much is at risk, and what sensitive data is compromised, if that data is exposed to malicious hackers?
This kind of question defines just how mission critical "mission critical" really is.
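A back-of-the-envelope way to answer the first of those questions is sketched below. The helper name and the dollar figures are hypothetical, and a real estimate would also have to cover lost data, recovery labor, and reputation.

```python
def outage_cost(revenue_per_minute, outage_minutes, fixed_penalty=0.0):
    """Direct revenue lost during an outage, plus any fixed penalty (e.g. SLA fines)."""
    return revenue_per_minute * outage_minutes + fixed_penalty


# Hypothetical shop losing $1000 of revenue per minute of downtime.
one_hour = outage_cost(1000, 60)
print("one-hour outage costs roughly $%d" % one_hour)
```

The point of the exercise is that the answer sets the budget: a system that loses tens of thousands of dollars per hour justifies very different hardware than one where an hour offline costs nothing.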
Don't need to bow down to Oracle (Score:2, Informative)
Amazon uses... (Score:2, Informative)
ObjectStore. An object-oriented database (see www.objectdesign.com ) that's known for its speed.
Who knows why we didn't say that.
PICK Database (Score:2, Interesting)
db2 on linux (Score:2)
Linux on rs/6000, as/400 (iSeries), and system
Now, buy that nice hardware (better than plain ole Intel boxes) and either run Linux on em with DB2, or AIX, os/400, or z/OS
We're in the process of moving... (Score:2, Informative)
Anyway, we're shopping replacements for our 3500's and we've found that bang for the buck, Linux for databases is the way to go. Most of these servers are one-task anyway, and Oracle runs like a champ so far. There are some issues with glibc that require some manipulation of libraries to get around if you want to use any other dist. than SuSE tho, which sux. That said, we're testing with Mandrake 8.1 and it runs fine (post patch).
Wrong question to ask (Score:4, Redundant)
Linux can run an RDBMS just fine; it's all the other stuff that is lagging. Manufacturers of fiber storage and other high-end products tend to focus on Solaris more than Linux. A large RDBMS includes a lot of other important details that need constant management and attention. Building a PC box with redundant power supplies, fans, backup CPUs and motherboards gets you close to Solaris prices, so enterprise projects tend to choose Solaris or mainframes.
You are in the wrong league (Score:5, Insightful)
A large database (in this context) is an enterprise-sized system: multiple platforms serving many millions of records in short periods of time.
I have customers fielding databases on multiple Enterprise 10000 servers...single tables of more than 35 million rows. This is actually a "medium" system in my mind.
I love Linux, I hacked around the pre 1.0 kernels many years ago. BUT, it does not scale up too well. Even the little things in Linux make it hard to do a good (maintainable) job: shifting device names (pull one of your HDs and see what happens), inability to modify hardware subsystems (storage in particular) while running live, etc. Even EMC, NetApp and XIOtech hardware can't fix these issues.
If the Linux crowd wants to be accepted by Big Business, they must learn the needs of Big Business.
Running a few 4-proc Intel servers with Oracle or Sybase does not put you in the same league. Nor does storing 10,000 articles in MySQL.
If you can imagine doing it yourself, if you can even imagine the amount of data to store, then you are almost surely below the thresholds I need to work in every day.
Moderate this up, please! (Score:2)
Too bad the guy posted as an AC...
we're definitely in different leagues (was you are (Score:2, Informative)
I am the lead dba for a company that processes 15-20 million us dollars worth of transactions per day. My backend database is solaris/oracle, it does 3000-4000 sql statements per second, and my company would lose maybe $1000 in revenue for each minute it is down. The two largest tables in this database have in excess of 300 million rows, and are accessed by 100k customers per day. We have over 11 million customers.
It's running on an E4500, which is saving us a lot of money *not* buying E10000s. I like to think it's tuned well, but a big part of the reason it works (fast) is also that it is on an EMC with over 90 disk drives in it. It's all about IO bandwidth and serviceability in my world, and on those points you are correct in saying sun is a hands-down winner over linux.
Now, I work with a sysadmin who is a whiz at making lots of linux boxes work reliably as a web frontend, and is also good at keeping our backend solaris based database up 24/7. Neither of us is anxious to put the backend on linux, but we did put a significantly large, high performance, but *relatively* low availability database up on linux.
It's a 6x800MHz intel box with 4G ram and 16 disks on mylex caching raid 5 controllers. Raid-5 sucks in general, but the point of this system was to get a lot of bang for the buck, so as a big league dba, I took the challenge of making data loads fast in spite of raid-5, in order to get a crack at de-installing windows from this box. If I spent some bucks on more disks, we could get a much faster system, but then that was never the point of this system.
The system is about 200G worth of partitioned tables (copies of the same 300M row tables mentioned above) with partitioned rollup tables off the sides, for business analysis. The real trick is the partitioning. Because of the partitioning, this system is able to do many types of analyses that can't be done on our other analysis system, which happens to be solaris with 60 disk drives.
The linux box was a leftover from a failed windows project, so in some sense it was free, but I believe it woulda cost about $80k new. Gig ethernet and controller was about 10 or 15k of it.
It's working well for DSS, since the 2 times it's crashed in the last few months didn't really hurt anything.
I'm rambling on now, but I'll talk to the DBAs out there, who speak my language.
If you're gonna do Linux oracle:
- reiserfs sucked performance wise on top of raid 5. Don't know if I did something wrong, but I abandoned it in favor of ext2. I don't care if fsck takes a long time on this system, and ext2 creamed it for database io perf on raid5. I also couldn't get perf out of reiser on simple stripes without the added hurt of raid5, so go figure. fsck times are irrelevant if you use raw partitions, so this is the way to go in most cases.
- Max out the memory (of course) on an intel box. I think the most you can do is 4G on intel platforms. this is sufficient for me, but I kept the SGA down to about 500m, so I could have 10 way parallel processes with 200-200M of sort area size.
- Watch out for linux caching. I've turned it off for my filesystems. It's easy to get into "writeback debt" by pushing a lot of dirty blocks out of oracle cache into ext2fs cache. Add raid5 suckiness at random writeback, and you've got serious constipation problems on your hands.
- I've used some raw partitions for this system. They seem to be worth it to avoid ext2fs caching hassles, but I haven't migrated completely yet. The "raw" command must be used to "bind" a name to a disk partition before it can be used by oracle as a raw partition, so it makes for a few extra hassles, but no big deal.
- I got a mylex caching controller, which apparently has hot swapping capability in the hardware, mitigating the absence of veritas volume manager and hot plug capabilities at the linux level. It also makes raid5 tolerable. Haven't proven hot swapping by testing yet tho.
- Ext2 fs has some raid5 aware stuff. This helped on the raid5 mylex vols I have, based on cursory throughput tests, but I'm not sure I'm getting the block alignment proper at the oracle level (don't know, after all the oracle/ext2/controller layers, if oracle's 16k blocks are aligned with the stripes on the mylex. sigh.)
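For anyone wondering what the alignment worry in the last point amounts to, here is a minimal sketch that counts how many database blocks straddle a stripe-unit boundary. The 16k block size is from the post; the 64k stripe unit and the 4k starting offset are assumptions picked for illustration, not the actual mylex settings.

```python
def blocks_straddling_stripes(block_size, stripe_unit, start_offset, n_blocks):
    """Count blocks whose first and last byte fall in different stripe units."""
    straddling = 0
    for i in range(n_blocks):
        first = start_offset + i * block_size
        last = first + block_size - 1
        if first // stripe_unit != last // stripe_unit:
            straddling += 1
    return straddling


KB = 1024
# Aligned case: data starts on a stripe-unit boundary.
aligned = blocks_straddling_stripes(16 * KB, 64 * KB, 0, 8)
# Misaligned case: some layer (partition table, file header) shifts data by 4k.
shifted = blocks_straddling_stripes(16 * KB, 64 * KB, 4 * KB, 8)
print("aligned:", aligned, "shifted:", shifted)
```

Every straddling block touches two stripe units instead of one, so a single block write can turn into two parity updates on raid5. That is why an unknown offset somewhere in the oracle/ext2/controller stack is worth worrying about.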
FWIW, back in the dot-com heyday, I also had clients doing modest high availability (to them) databases on oracle/linux. Even then, on relatively small (in gigabytes) databases the biggest tuning hassle was writeback caching of linux getting in the way of oracle, and the biggest hassle of scalability was managing many many disks. Raw partitions can get around the former; intelligent controllers (mylex etc) or intelligent disk arrays (clariion, sun t3 etc.) get around the latter.
Supported over 2.5 BILLION record database (Score:2, Informative)
on thursday jan 4, 2001 steve miano, ed bierly, keith mason and i
loaded 2.5 billion trades and quotes on a 50cpu linux cluster.
simple table scans on one billion trades, e.g.
select distinct sym from trade
select max price from trade
take 1 second
multi-dimensional aggregations, e.g.
/ 100 top traded stocks
100 first desc select sum size*price by sym from trade
/ daily high and close
select high:max price, close:last price by sym, date from trade
take 10 to 20 seconds
translating the data from TAQ to kdb took about 5 hours.
(steve had loaded the 200 TAQ cd's onto several disk drives.)
distributing the 100gigabytes over the 100Mbit ethernet took 3 hours.
(this cluster should probably have Gbit ethernet)
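A quick sanity check on that transfer time, as a minimal sketch. It assumes "100gigabytes" means 100e9 bytes and that a single 100 Mbit/s link is the bottleneck; the function name is made up for illustration.

```python
def transfer_hours(data_bytes, link_bits_per_second, efficiency=1.0):
    """Hours needed to push data_bytes through a link at the given efficiency."""
    seconds = data_bytes * 8 / (link_bits_per_second * efficiency)
    return seconds / 3600


fast_ethernet = transfer_hours(100e9, 100e6)   # ideal case, 100 Mbit/s
gigabit = transfer_hours(100e9, 1e9)           # ideal case, 1 Gbit/s
print("100 Mbit: %.2f h, Gbit: %.2f h" % (fast_ethernet, gigabit))
```

Under those assumptions the ideal 100 Mbit figure comes out a little over 2 hours, so the reported 3 hours is in the range you would expect with protocol overhead, and Gbit ethernet would cut the ideal figure to well under half an hour.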
loading the database (k db taq.m -P 2080), starting 50 slaves,
connecting, mapping shared indicative tables over nfs, building
parallel partitions, etc. took
Re:You are in the wrong league (Score:2)
Linux a better option than Solaris (Score:2, Informative)
Re:Linux a better option than Solaris (Score:2)
Weather.com (Score:2, Informative)
Linux is mission critical, but are the people (Score:2, Insightful)
But, and this is a big but, it has to be set up by the right person. I have seen Unix and MVS systems set up and hosed up for mission critical situations. We lost a lot of money while the systems were down. The higher ups would blame the people (as they should have) because the systems work in other situations just fine so it must be the people.
Based on perceptions, if it were Linux set up by the wrong guy and things went belly up they would blame Linux because it's untested. It would end up the scapegoat instead of the lazy implementation group. That's what Linux has to overcome.
I remember a quote I think was from the Red Baron, "It's not the crate, but the man in it that counts".
Re:Linux is mission critical, but are the people (Score:2)
That said, it seems as if your argument bears more weight on the manufacturer of said crate than the man in it.
PostgreSQL 8GB (Score:2, Informative)
We run a large auditing system (OLAP-oriented rather than OLTP-oriented) on PostgreSQL (v7.1.3) on Linux (RH 7.1), using Tomcat (v4.0.1) as the front-end. We're running it on a Dell PowerEdge 2400 (2x PIII-866) with their PERC RAID controller with a RAID 1 and a RAID 0+1 volume.
Our database is currently a bit over 8 GB, with many of the tables exceeding one million records. Queries typically join > 5 tables.
We moved from an MS Access/SQL Server environment and are much happier with the functionality , performance, and stability we now have.
Not to slam DB2, as I think it's a great product and have successfully used it for some really big projects, but for this application I found that PostgreSQL delivered ~4x the performance on many of our key queries. The lower cost and lower administrative overhead sealed the deal in favor of PostgreSQL.
As always, though, your mileage may vary.
Gordon.
It's not just 'commercial vs Linux' (Score:2)
I do know there are a *lot* of large-scale BSD RDBMS systems out there.
It seems a little skewed to put Linux against 'commercial OSes' when BSD isn't a commercial OS, and is arguably better suited to the tasks at hand than Linux.
Use a hammer for a nail, and a screwdriver for screws.
Oracle on DEC Alpha (Score:3)
With MS's abysmal support for NT on the Alpha these days, we've considered moving the Oracle database to another OS. I don't think we want to trash the DEC Alphaservers yet though - since they're still respectable machines. Linux for Alpha is definitely an interesting option for us - but I'm wondering if anyone has had experiences with Oracle for Linux on the DEC Alpha? How does it compare, performance-wise, to running Oracle on the Alpha version of NT?
Last time I checked, Oracle wasn't really giving a high level of support to Oracle for Linux unless you used it on Intel hardware?
Fairly big PostgreSQL db... (Score:2)
Anyways, the system basically handled a few gigs a day or so of data from GPS satellites and such. It basically crunched numbers and stored results in an effort to figure out how much the earth's tectonic plates were moving from day to day. I would imagine that this system handled many, many rows and transactions daily. I'm pretty sure they moved away from Ingres to PgSQL, which they're probably still using now.
It's not exactly a commercial application, but it is an RDBMS that handles a lot of data, and apparently worked quite well.
J
Yes, we do (Score:2)
Postgres 7.1.3 on Mandrake (Score:2)
Cyclopatra
google (Score:2, Interesting)
Google has huge databases (caching the web). It is run on tons of linux boxes. Their entire business depends on speed and accurate information.
an article about Google [nwfusion.com]
This doesn't make sense; Linux Oracle isn't 64-bit (Score:5, Informative)
If you are running a VLDB on Oracle, you want a 64-bit system; otherwise the SGA is limited to 2 GB.
Oracle only supports Linux x86, with all of its 32-bit memory constraints. Does Linux implement memory windows like 32-bit HP-UX?
Also, at linux.sybase.com, you can download for free the Alpha-axp version of Sybase ASE 11.0.3.3 - this is probably the most available commercial 64-bit database for Linux.
Really, the Linux and WinNT versions of Oracle are at the low end of the food chain.
Re:Largest Oracle implementation on Linux (Score:2)
Not that I believe the graph that much, anyway.
Re:Financial Institutions (Score:2)
We at DrKW (Dresdner Kleinwort Wasserstein) do not use Linux of any flavour for our database servers.
Re:Hmmmm.....SAP R/3 installations? (Score:2, Funny)
Re:"Mission Critical?" (Score:2, Funny)
"What legitimate member of the
Spend time in enough meetings with Admin management and director types and you too can speak like this.
Re:What Yahoo! doesn't want you to know... (Score:2)
Oracle is only supported for Linux on Intel.