How Google Uses Linux
postfail writes 'lwn.net coverage of the 2009 Linux Kernel Summit includes a recap of a presentation by Google engineers on how they use Linux. According to the article, a team of 30 Google engineers rebases onto a newer mainline kernel roughly every 17 months; they are presently carrying 1208 patches to 2.6.26, inserting almost 300,000 lines of code, and roughly 25% of those patches are backports of newer features.'
Release the patches already (Score:5, Interesting)
I. Want. This.
Re:Togh (Score:3, Interesting)
TFA does suggest, though, that Google has gotten itself into a horrible mess with its local changes and would be better off offloading its stuff to the community and taking properly integrated releases.
Is it worth it? (Score:2, Interesting)
The whole article sounds so painful, what do they actually get out of it?
Google started with the 2.4.18 kernel - but they patched over 2000 files, inserting 492,000 lines of code. Among other things, they backported 64-bit support into that kernel. Eventually they moved to 2.6.11, primarily because they needed SATA support. A 2.6.18-based kernel followed, and they are now working on preparing a 2.6.26-based kernel for deployment in the near future. They are currently carrying 1208 patches to 2.6.26, inserting almost 300,000 lines of code. Roughly 25% of those patches, Mike estimates, are backports of newer features.
In the area of CPU scheduling, Google found the move to the completely fair scheduler to be painful. In fact, it was such a problem that they finally forward-ported the old O(1) scheduler and can run it in 2.6.26. Changes in the semantics of sched_yield() created grief, especially with the user-space locking that Google uses. High-priority threads can make a mess of load balancing, even if they run for very short periods of time. And load balancing matters: Google runs something like 5000 threads on systems with 16-32 cores.
Google makes a lot of use of the out-of-memory (OOM) killer to pare back overloaded systems. That can create trouble, though, when processes holding mutexes encounter the OOM killer. Mike wonders why the kernel tries so hard, rather than just failing allocation requests when memory gets too tight.
Ooooh... efficiency.. I'm curious what the net savings is.. compared to buying more cheap hardware.
So what is Google doing with all that code in the kernel? They try very hard to get the most out of every machine they have, so they cram a lot of work onto each.
(30 * kernel engineer salary) / (generic x86 server + cooling + power) = ?
Low memory conditions (Score:5, Interesting)
This is something I have been wondering about too. Doesn't it just lead to applications crashing more often, rather than reporting normally that they cannot allocate more memory?
Re:Is it worth it? (Score:5, Interesting)
Ooooh... efficiency.. I'm curious what the net savings is.. compared to buying more cheap hardware.
We're talking about Google here. They have dozens of datacenters all over the globe, filled with hundreds of thousands of servers. Some estimate even a million servers or more.
So let's assume they do have a million servers and need 5% more efficiency out of their server farms. Following your logic, it would be better to add 50,000 (!) cheap servers, which consume space and power and require cooling and maintenance. I'll bet that paying a handful of engineers to tweak your software is *a lot* cheaper. Especially since Google isn't "a project" or something: they're here for the long run, and to make that happen they need to get as much from their platform as possible.
Re:Is it worth it? (Score:4, Interesting)
A while back I got an invitation to work for Google as a kernel developer. I declined to interview, because I already had a job doing just that. This article makes me glad I never accepted that offer. I feel sorry for those kernel developers at Google. Porting all that code back-and-forth over and over again. Now *that's* a crappy job.
Re:Does Google give code back (Score:5, Interesting)
Andrew Morton, Google employee and maintainer of the -mm tree, contributed the vast majority of the changes filed under "Google" (and most of those changes aren't Google-specific; Andrew has been doing this since before he was employed there). If you subtract Andrew, Google was responsible for only a tiny share of kernel development, last I heard, unfortunately.
Re:The Win32 Way (Score:2, Interesting)
In Unix, if malloc returns NULL, the allocation failed and you don't have the memory. A well-written program should check for that. Overcommitting memory can have efficiency advantages, but it can also turn out badly. Linux has heuristics to determine how much to overcommit, and overcommit can be disabled entirely.
http://utcc.utoronto.ca/~cks/space/blog/unix/MemoryOvercommit
http://utcc.utoronto.ca/~cks/space/blog/linux/LinuxVMOvercommit
Re:Togh (Score:3, Interesting)
Indeed:
"The Linux dev model is the worst form of development, except for all those other forms that have been tried from time to time." - Winston Churchill
... Oh wait, no. That was me, actually.
Holy humour-impaired down-modding, Batman! How is the above a troll?
For those too dense to get the joke: I actually agree that the Linux development model has significant weaknesses. It's just that, despite those shortcomings, it has proven workable for many years now.
I'm not implying that there aren't better community-driven coding projects in existence, nor do I want to suggest that critiquing the community is unwarranted (or even unwanted). It's just that, for all its warts, the model has produced consistent results over the years.
Real example... (Score:5, Interesting)
Back in the '90s, we had a customized patch to Apache that made it forward tickets within our intranet, as supplied by our (also customized) Kerberos libraries, for our (also customized) build of Lynx. It was all part of a very robust system for managing customer contacts that ran with virtually no maintenance from 1999 to 2007--and I was the only person who understood it, because I wrote it as the SA--until it was scrapped for a "modern" and "supportable" solution that (of course) requires a dozen full-time developers and crashes all the time.
Not really bitching too much, because that platform was a product of the go-go 90's, and IT doctrine has changed for the better. No way should a product be out there with all your customer information that only one person understands. But it was a sweet solution that did its job and did its job well for a LONG time. Better living through the UNIX way of doing things!
But, anyway, I never bothered to contribute any of the patches from that back to the Apache tree (or the other trees) because they really only made sense in that particular context and as a group. If you weren't doing EXACTLY what we were doing, there was no point in the patches, and NOBODY was doing exactly what we were doing.
Re:Does Google give code back (Score:1, Interesting)
Hiring someone to keep doing what they were already doing doesn't make you a kernel contributor.
I disagree with this statement. He's being paid to work on the kernel. What's the difference?