Optical Character Recognition

Optical character recognition, usually abbreviated to OCR, is the mechanical or electronic conversion of scanned images of handwritten, typewritten or printed text into machine-encoded text. It is widely used as a form of data entry from some sort of original paper data source, whether documents, sales receipts, mail, or any number of printed records. It is a common method of digitizing printed texts so that they can be electronically searched, stored more compactly, displayed on-line, and used in machine processes such as machine translation, text-to-speech and text mining. OCR is a field of research in pattern recognition, artificial intelligence and computer vision.

Early versions needed to be programmed with images of each character, and worked on one font at a time. "Intelligent" systems with a high degree of recognition accuracy for most fonts are now common. Some systems are capable of reproducing formatted output that closely approximates the original scanned page including images, columns and other non-textual components.

Read more about Optical Character Recognition:  History, Importance of OCR To The Blind, OCR Software, Current State of OCR Technology

Famous quotes containing the words optical, character and/or recognition:

    There is an optical illusion about every person we meet.
    Ralph Waldo Emerson (1803–1882)

    When one walks, one is brought into touch first of all with the essential relations between one’s physical powers and the character of the country; one is compelled to see it as its natives do. Then every man one meets is an individual. One is no longer regarded by the whole population as an unapproachable and uninteresting animal to be cheated and robbed.
    Aleister Crowley (1875–1947)

    The recognition of Russia on November 16, 1933, started forces which were to have considerable influence in the attempt to collectivize the United States.
    Herbert Hoover (1874–1964)