Apache Hadoop - Papers

Papers

Some papers influenced the birth and growth of Hadoop and big data processing. Here is a partial list:

  • 2004 MapReduce: Simplified Data Processing on Large Clusters by Jeffrey Dean and Sanjay Ghemawat from Google Lab. This paper inspired Doug Cutting to develop an open-source implementation of the Map-Reduce framework. He named it Hadoop, after his son's toy elephant.
  • 2005 From Databases to Dataspaces: A New Abstraction for Information Management, the authors highlight the need for storage systems to accept all data formats and to provide APIs for data access that evolve based on the storage system’s understanding of the data.
  • 2006 Bigtable: A Distributed Storage System for Structured Data from Google Lab.
  • 2008 H-store: a high-performance, distributed main memory transaction processing system
  • 2009 MAD Skills: New Analysis Practices for Big Data
  • 2011 Apache Hadoop Goes Realtime at Facebook

Read more about this topic:  Apache Hadoop

Famous quotes containing the word papers:

    “The papers are delivered every day;
    I am alone and never shed a tear.”
    Stanley Jasspon Kunitz (b. 1905)

    To a historian libraries are food, shelter, and even muse. They are of two kinds: the library of published material, books, pamphlets, periodicals, and the archive of unpublished papers and documents.
    Barbara Tuchman (1912–1989)

    The Madcap Heiress, isn’t that what the papers usually call her? Millions of dollars and no sense.
    Vina Delmar, U.S. novelist, playwright. Lucy (Irene Dunne)