Apache Hadoop - Papers

Papers

Some papers influenced the birth and growth of Hadoop and big data processing. Here is a partial list:

2004 MapReduce: Simplified Data Processing on Large Clusters by Jeffrey Dean and Sanjay Ghemawat from Google Lab. This paper inspired Doug Cutting to develop an open-source implementation of the Map-Reduce framework. He named it Hadoop, after his son's toy elephant.
2005 From Databases to Dataspaces: A New Abstraction for Information Management, the authors highlight the need for storage systems to accept all data formats and to provide APIs for data access that evolve based on the storage system’s understanding of the data.
2006 Bigtable: A Distributed Storage System for Structured Data from Google Lab.
2008 H-store: a high-performance, distributed main memory transaction processing system
2009 MAD Skills: New Analysis Practices for Big Data
2011 Apache Hadoop Goes Realtime at Facebook

Read more about this topic: Apache Hadoop

Famous quotes containing the word papers:

““The papers are delivered every day;
I am alone and never shed a tear.””
—Stanley Jasspon Kunitz (b. 1905)

“Yesterday the Electoral Commission decided not to go behind the papers filed with the Vice-President in the case of Florida.... I read the arguments in the Congressional Record and can’t see how lawyers can differ on the question. But the decision is by a strictly party vote—eight Republicans against seven Democrats! It shows the strength of party ties.”
—Rutherford Birchard Hayes (1822–1893)

Related Phrases

Related Words