Project Gutenberg - Scope of Collection

Scope of Collection

As of November 2011, Project Gutenberg claimed over 40,000 items in its collection, with an average of over fifty new e-books being added each week. These are primarily works of literature from the Western cultural tradition. In addition to literature such as novels, poetry, short stories and drama, Project Gutenberg also has cookbooks, reference works and issues of periodicals. The Project Gutenberg collection also has a few non-text items such as audio files and music notation files.

Most releases are in English, but there are also significant numbers in many other languages. As of November 2010, the non-English languages most represented are: French, German, Finnish, Dutch, Portuguese, and Chinese.

Whenever possible, Gutenberg releases are available in plain text, mainly using US-ASCII character encoding but frequently extended to ISO-8859-1 (needed to represent accented characters in French and Scharfes s in German, for example). Besides being copyright-free, the requirement for a Latin (character set) text version of the release has been a criterion of Michael Hart's since the founding of Project Gutenberg, as he believes this is the format most likely to be readable in the extended future. Out of necessity, this criterion has had to be extended further for the sizable collection of texts in East Asian languages such as Chinese and Japanese now in the collection, where UTF-8 is used instead.

Other formats may be released as well when submitted by volunteers. The most common non-ASCII format is HTML, which allows markup and illustrations to be included. Some project members and users have requested more advanced formats, believing them to be much easier to read. But some formats that are not easily editable, such as PDF, are generally not considered to fit in with the goals of Project Gutenberg, although many are being introduced to the collection in PDF format so that illustrations can be added to downloadable documents. For years, there has been discussion of using some type of XML, although progress on that has been slow.

Beginning in 2009 the Project Gutenberg catalog began offering auto-generated alternate file formats, including html, EPUB and plucker.

Read more about this topic:  Project Gutenberg

Famous quotes containing the words scope of, scope and/or collection:

    For it is not the bare words but the scope of the writer that gives the true light, by which any writing is to be interpreted; and they that insist upon single texts, without considering the main design, can derive no thing from them clearly.
    Thomas Hobbes (1579–1688)

    The scope of modern government in what it can and ought to accomplish for its people has been widened far beyond the principles laid down by the old “laissez faire” school of political rights, and the widening has met popular approval.
    William Howard Taft (1857–1930)

    What is all wisdom save a collection of platitudes? Take fifty of our current proverbial sayings—they are so trite, so threadbare, that we can hardly bring our lips to utter them. None the less they embody the concentrated experience of the race and the man who orders his life according to their teaching cannot go far wrong.
    Norman Douglas (1868–1952)