MongoSV Conference

December 7, 2010


It was great attending the MongoSV conference last week. The conference was packed with attendees from a mix of backgrounds, showing lots of interest in scaling with MongoDB. There were some good technical talks from 10gen about their database. I especially liked the hands-on intro by CEO Dwight Merriman, and the hand-drawn slides […]

Posted in: Uncategorized

Foursquare’s MongoDB Outage

October 15, 2010


Foursquare recently suffered a total site outage for eleven hours. The outage was caused by unexpected uneven growth in their MongoDB database that their monitoring didn’t detect. The system outage was prolonged when an attempt to add a partition didn’t work due to fragmentation, and required taking the database offline to compact it. This article […]

Posted in: Big Data

LinkedIn’s Data Infrastructure

August 4, 2010


Jay Kreps of LinkedIn presented some informative details of how they process data at the recent Hadoop Summit. Kreps described how LinkedIn crunches 120 billion relationships per day and blends large scale data computation with high volume, low latency site serving. Much of LinkedIn’s important data is offline – it moves fairly slowly. So they […]

Facebook on Hadoop, Hive, HBase, and A/B Testing

July 14, 2010


The Hadoop Summit of 2010 included presentations from a number of large-scale users of Hadoop and related technologies. Notably, Facebook presented a keynote and detailed information about their use of Hive for analytics. Mike Schroepfer, Facebook’s VP of Engineering, delivered a keynote describing the scale of their data processing with Hadoop. Schroepfer gave an […]

Yahoo! Updates from Hadoop Summit 2010

July 12, 2010


The Hadoop Summit of 2010 started off with a vuvuzela blast from Blake Irving, Chief Product Officer for Yahoo. Yahoo delivered keynote addresses that outlined the scale of their use, technical directions for their contributions, and architectural patterns in how they apply the technology. The increasing interest in Hadoop was evident: this year’s conference had […]

Posted in: Big Data, Hadoop

GigaOM Structure Highlights

July 9, 2010


The GigaOM Structure conference a couple of weeks ago addressed many areas of cloud computing. One of the key themes of the event was the emergence of new data architectures. Throughout the panels, interviews, and presentations, many speakers identified significant coming changes in how data gets handled. Paul Maritz, CEO of VMware, […]

Moving to WordPress

May 16, 2007


I’m starting the process of moving my blog to WordPress, so please watch this space for future posts.

Posted in: Uncategorized