Supercomputing

Submission: Apache Hadoop Wins Terabyte Sort Benchmark (yahoo.com)

owenomalley writes: "One of Yahoo's Hadoop clusters sorted 1 terabyte of data in 209 seconds, beating the previous record of 297 seconds in the annual general-purpose (Daytona) terabyte sort benchmark. Apache Hadoop is an open-source Java framework for writing data-intensive distributed programs; it includes both a distributed file system and a map/reduce implementation. The sort benchmark, created in 1998 by Jim Gray, specifies the input data (10 billion 100-byte records), which must be completely sorted and written to disk. This is the first time that either a Java program or an open-source program has won. Yahoo is both the largest user of Hadoop, with 13,000+ nodes running hundreds of thousands of jobs a month, and the largest contributor, although non-Yahoo usage and contributions are increasing rapidly."
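For a rough sense of scale, the figures in the summary can be turned into throughput numbers. The following is only a back-of-the-envelope sketch in Python: the function names are made up for illustration, it assumes decimal units (1 GB = 10^9 bytes), and it says nothing about how the Hadoop job itself was implemented.

```python
# Back-of-the-envelope arithmetic using only the figures quoted in the summary.
# Function names are hypothetical; decimal units (1 GB = 1e9 bytes) are assumed.

RECORDS = 10_000_000_000   # 10 billion records, per the benchmark definition
RECORD_BYTES = 100         # each record is 100 bytes
NEW_SECONDS = 209          # Hadoop result
OLD_SECONDS = 297          # previous record


def total_bytes(records=RECORDS, record_bytes=RECORD_BYTES):
    """Size of the benchmark input in bytes."""
    return records * record_bytes


def throughput_gb_per_s(seconds, nbytes=None):
    """Aggregate sort throughput in decimal gigabytes per second."""
    if nbytes is None:
        nbytes = total_bytes()
    return nbytes / seconds / 1e9


def speedup(old_seconds, new_seconds):
    """How many times faster the new run is than the old one."""
    return old_seconds / new_seconds


if __name__ == "__main__":
    print(f"input size : {total_bytes():,} bytes")
    print(f"new record : {throughput_gb_per_s(NEW_SECONDS):.2f} GB/s")
    print(f"old record : {throughput_gb_per_s(OLD_SECONDS):.2f} GB/s")
    print(f"speedup    : {speedup(OLD_SECONDS, NEW_SECONDS):.2f}x")
```

The input works out to exactly 10^12 bytes, which is where the "terabyte" in the benchmark name comes from. The throughput is an aggregate across the whole cluster, not a per-node figure; the summary does not say how many nodes took part in the winning run.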
This discussion was created for logged-in users only, but now has been archived. No new comments can be posted.

