Apache Hadoop - Papers

Papers

Some papers influenced the birth and growth of Hadoop and big data processing. Here is a partial list:

  • 2004 MapReduce: Simplified Data Processing on Large Clusters by Jeffrey Dean and Sanjay Ghemawat from Google Lab. This paper inspired Doug Cutting to develop an open-source implementation of the Map-Reduce framework. He named it Hadoop, after his son's toy elephant.
  • 2005 From Databases to Dataspaces: A New Abstraction for Information Management, the authors highlight the need for storage systems to accept all data formats and to provide APIs for data access that evolve based on the storage system’s understanding of the data.
  • 2006 Bigtable: A Distributed Storage System for Structured Data from Google Lab.
  • 2008 H-store: a high-performance, distributed main memory transaction processing system
  • 2009 MAD Skills: New Analysis Practices for Big Data
  • 2011 Apache Hadoop Goes Realtime at Facebook

Read more about this topic:  Apache Hadoop

Famous quotes containing the word papers:

    You had such a vision of the street
    As the street hardly understands;
    Sitting along the bed’s edge, where
    You curled the papers from your hair,
    Or clasped the yellow soles of feet
    In the palms of both soiled hands.
    —T.S. (Thomas Stearns)

    The Madcap Heiress, isn’t that what the papers usually call her? Millions of dollars and no sense.
    Vina Delmar, U.S. novelist, playwright. Lucy (Irene Dunne)

    Sitting along the bed’s edge, where
    You curled the papers from your hair,
    Or clasped the yellow soles of feet
    In the palms of both soiled hands.
    —T.S. (Thomas Stearns)