Aspects of Web Curation
Web curation, like any digital curation, entails:
- Certification of the trustworthiness and integrity of the collection content
- Collecting verifiable Web assets
- Providing Web asset search and retrieval
- Semantic and ontological continuity and comparability of the collection content
Thus, besides the discussion on methods of collecting the Web, those of providing access, certification, and organizing must be included. There are a set of popular tools that addresses these curation steps:
A suite of tools for Web Curation by International Internet Preservation Consortium:
- Heritrix - official website - collecting Web asset
- NutchWAX - search Web archive collections
- Wayback (Open source Wayback Machine) - search and navigate Web archive collections using NutchWax
- Web Curator Tool - Selection and Management of Web Collection
Other open source tools for manipulating web archives:
- WARC Tools - for creating, reading, parsing and manipulating, web archives programmatically
- Search Tools - for indexing and searching full-text and metadata within web archives
Read more about this topic: Web Archiving
Famous quotes containing the words aspects of, aspects and/or web:
“Grammar is a tricky, inconsistent thing. Being the backbone of speech and writing, it should, we think, be eminently logical, make perfect sense, like the human skeleton. But, of course, the skeleton is arbitrary, too. Why twelve pairs of ribs rather than eleven or thirteen? Why thirty-two teeth? It has something to do with evolution and functionalismbut only sometimes, not always. So there are aspects of grammar that make good, logical sense, and others that do not.”
—John Simon (b. 1925)
“It is always a sign of an unproductive time when it concerns itself with petty and technical aspects [in philology], and likewise it is a sign of an unproductive person to pursue such trifles.”
—Johann Wolfgang Von Goethe (17491832)
“With as little a web as this will I ensnare as great a fly as Cassio.”
—William Shakespeare (15641616)