Document Conversion - Paper Documents Conversion

Converting scanned paper documents to useful electronic formats is one of the most important applications of document conversion. Documents scanned to image formats have serious limitations: the files are large, their text cannot be searched, and their content cannot be reused. It is therefore worth converting them to more useful formats, such as:

  • Searchable: PDF
  • Archive: PDF/A – for long-term storage
  • Compressed: MRC-PDF
  • Editable: TXT, RTF, DOC, XLS, PPT
  • Structured: XML, HTML
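
To give a rough sense of the file-size limitation mentioned above, the sketch below estimates the uncompressed size of one scanned page and compares it with the same page stored as plain text. The page size (US Letter), the 300 dpi resolution, and the figure of 3,000 characters per page are illustrative assumptions, not values from any standard, and real image files are usually compressed and therefore smaller.

```python
def raw_scan_bytes(width_in, height_in, dpi, bits_per_pixel):
    """Return the uncompressed size in bytes of one scanned page."""
    width_px = round(width_in * dpi)
    height_px = round(height_in * dpi)
    return width_px * height_px * bits_per_pixel // 8


# Assumed page: US Letter (8.5 x 11 inches) scanned at 300 dpi.
colour_scan = raw_scan_bytes(8.5, 11, 300, 24)   # 24-bit colour
bitonal_scan = raw_scan_bytes(8.5, 11, 300, 1)   # 1-bit black and white

# Assumed text content: about 3,000 single-byte characters per page.
text_bytes = 3000

print(colour_scan)                 # 25245000
print(bitonal_scan)                # 1051875
print(colour_scan // text_bytes)   # 8415
```

Under these assumptions the raw colour scan is several thousand times larger than the text it contains, which is why compressed and text-based formats are attractive targets.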

Extracting content from a document image is the task of Optical Character Recognition (OCR) or Intelligent Character Recognition (ICR) technologies. Modern OCR applications convert image files to a variety of document formats, preserving not just the content but also the structure of the document (ADRT).
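
The difference between keeping only the content and keeping the structure as well can be sketched as follows. This is a minimal illustration, not the output model of any real OCR product: the `Block` type, its `kind` labels, and the two export functions are hypothetical names introduced here, and the recognised page is written by hand rather than produced by a recognition engine.

```python
from dataclasses import dataclass
from html import escape


@dataclass
class Block:
    """One recognised region of a page (hypothetical result type)."""
    kind: str   # "heading" or "paragraph"
    text: str


def to_txt(blocks):
    """Editable output: the content survives, the structure is lost."""
    return "\n\n".join(b.text for b in blocks)


def to_html(blocks):
    """Structured output: each block keeps its role as a tag."""
    tags = {"heading": "h1", "paragraph": "p"}
    parts = []
    for b in blocks:
        tag = tags.get(b.kind, "p")  # unknown kinds fall back to a paragraph
        parts.append(f"<{tag}>{escape(b.text)}</{tag}>")
    return "\n".join(parts)


# Hand-written stand-in for what a recognition step might return.
page = [
    Block("heading", "Invoice"),
    Block("paragraph", "Total: 40 < 50"),
]

print(to_txt(page))
print(to_html(page))
```

The TXT export flattens the heading into ordinary text, while the HTML export records that "Invoice" was a heading, which is the kind of information a structure-preserving conversion retains.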
