Apache Hadoop - Papers

Papers

Several papers influenced the birth and growth of Hadoop and big data processing. A partial list:

  • 2004 MapReduce: Simplified Data Processing on Large Clusters by Jeffrey Dean and Sanjay Ghemawat of Google. This paper inspired Doug Cutting to develop an open-source implementation of the MapReduce framework. He named it Hadoop, after his son's toy elephant.
  • 2005 From Databases to Dataspaces: A New Abstraction for Information Management. The authors highlight the need for storage systems to accept all data formats and to provide data-access APIs that evolve with the storage system’s understanding of the data.
  • 2006 Bigtable: A Distributed Storage System for Structured Data, from Google.
  • 2008 H-Store: A High-Performance, Distributed Main Memory Transaction Processing System
  • 2009 MAD Skills: New Analysis Practices for Big Data
  • 2011 Apache Hadoop Goes Realtime at Facebook
