Browsing All posts tagged under »CaseStudy«

LinkedIn’s Data Infrastructure

August 4, 2010


Jay Kreps of LinkedIn presented some informative details of how they process data at the recent Hadoop Summit. Kreps described how LinkedIn crunches 120 billion relationships per day and blends large scale data computation with high volume, low latency site serving. Much of LinkedIn’s important data is offline – it moves fairly slowly. So they […]

Facebook on Hadoop, Hive, HBase, and A/B Testing

July 14, 2010


The Hadoop Summit of 2010 included presentations from a number of large scale users of Hadoop and related technologies. Notably, Facebook presented a keynote and details information about their use of Hive for analytics. Mike Schroepfer, Facebook’s VP of Engineering delivered a keynote describing the scale of their data processing with Hadoop. Schroepfer gave an […]

Yahoo! Updates from Hadoop Summit 2010

July 12, 2010


The Hadoop Summit of 2010 started off with a vuvuzela blast from Blake Irving, Chief Product Officer for Yahoo. Yahoo delivered keynote addresses that outlined the scale of their use, technical directions for their contributions, and architectural patterns in how they apply the technology. The increasing interest in Hadoop was evident: this year’s conference had […]