Document Conversion - Paper Documents Conversion

Converting scanned paper documents to useful electronic formats is one of the most important applications of document conversion. Documents scanned to image formats have several limitations: file sizes are large, the text cannot be searched, and the content cannot be reused. It is therefore worth considering conversion to more useful formats, such as:

  • Searchable: PDF
  • Archive: PDF/A – for long-term storage
  • Compressed: MRC-PDF
  • Editable: TXT, RTF, DOC, XLS, PPT
  • Structured: XML, HTML

Extracting content from a document image is the task of Optical Character Recognition (OCR) or Intelligent Character Recognition (ICR) technologies. Modern OCR applications convert image files to a range of document formats, preserving not just the content but also the structure of the document (ADRT).
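
The difference between keeping only the content and keeping the structure as well can be sketched in a few lines of Python. This is an illustration only, not how any particular OCR product works: the Word record, the pixel coordinates, the line_tolerance parameter and the XML element names (page, line, word) are all assumptions made for the example. It takes words that an OCR engine is assumed to have already recognized, groups them into text lines by vertical position, and writes them out once as plain text (editable) and once as XML (structured).

```python
from dataclasses import dataclass
from xml.etree import ElementTree as ET


@dataclass
class Word:
    """One recognized word and its position (hypothetical OCR output)."""
    text: str
    x: int  # left edge of the word, in pixels
    y: int  # top edge of the word, in pixels


def group_into_lines(words, line_tolerance=10):
    """Group words into text lines.

    A word joins the current line when its y coordinate is within
    line_tolerance pixels of the first word placed on that line.
    """
    lines = []
    for word in sorted(words, key=lambda w: (w.y, w.x)):
        if lines and abs(word.y - lines[-1][0].y) <= line_tolerance:
            lines[-1].append(word)
        else:
            lines.append([word])
    # Put the words of each line back into left-to-right reading order.
    return [sorted(line, key=lambda w: w.x) for line in lines]


def to_plain_text(lines):
    """Editable output: the content survives, the positions are discarded."""
    return "\n".join(" ".join(w.text for w in line) for line in lines)


def to_xml(lines):
    """Structured output: content plus line grouping and word positions."""
    page = ET.Element("page")
    for line in lines:
        line_el = ET.SubElement(page, "line")
        for w in line:
            word_el = ET.SubElement(line_el, "word", x=str(w.x), y=str(w.y))
            word_el.text = w.text
    return ET.tostring(page, encoding="unicode")


if __name__ == "__main__":
    # Made-up sample data standing in for real recognition results.
    sample = [
        Word("Invoice", 40, 20), Word("Total:", 40, 62),
        Word("No.", 130, 22), Word("42", 170, 21), Word("$10", 120, 60),
    ]
    lines = group_into_lines(sample)
    print(to_plain_text(lines))
    print(to_xml(lines))
```

The plain-text output can be edited but has lost where each word sat on the page; the XML output keeps that information, which a later step could use to rebuild the layout. Real converters recover far more structure (columns, tables, headings) than this line-grouping heuristic does.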
