Automatic image annotation (also known as automatic image tagging or linguistic indexing) is the process by which a computer system automatically assigns metadata in the form of captioning or keywords to a digital image. This application of computer vision techniques is used in image retrieval systems to organize and locate images of interest from a database.
This method can be regarded as a type of multi-class image classification with a very large number of classes - as large as the vocabulary size. Typically, image analysis in the form of extracted feature vectors and the training annotation words are used by machine learning techniques to attempt to automatically apply annotations to new images. The first methods learned the correlations between image features and training annotations, then techniques were developed using machine translation to try to translate the textual vocabulary with the 'visual vocabulary', or clustered regions known as blobs. Work following these efforts have included classification approaches, relevance models and so on.
The advantages of automatic image annotation versus content-based image retrieval are that queries can be more naturally specified by the user . CBIR generally (at present) requires users to search by image concepts such as color and texture, or finding example queries. Certain image features in example images may override the concept that the user is really focusing on. The traditional methods of image retrieval such as those used by libraries have relied on manually annotated images, which is expensive and time-consuming, especially given the large and constantly growing image databases in existence.
Some annotation engines are online, including the ALIPR.com real-time tagging engine developed by Pennsylvania State University researchers, and Behold - an image search engine that indexes over 1 million Flickr images using automatically generated tags.
Other articles related to "automatic image annotation, image, images, annotation":
... "Image-to-word transformation based on dividing and vector quantizing images with words." ... Annotation as machine translation P Duygulu, K Barnard, N de Fretias, and D Forsyth (2002) ... "Object recognition as machine translation Learning a lexicon for a fixed image vocabulary" ...
Famous quotes containing the words automatic and/or image:
“Natural selection, the blind, unconscious, automatic process which Darwin discovered, and which we now know is the explanation for the existence and apparently purposeful form of all life, has no purpose in mind. It has no mind and no minds eye. It does not plan for the future. It has no vision, no foresight, no sight at all. If it can be said to play the role of the watchmaker in nature, it is the blind watchmaker.”
—Richard Dawkins (b. 1941)
“Some days your hats off to the full-time mothers for being able to endure the relentless routine and incessant policing seven days a week instead of two. But on other days, merely the image of this woman crafting a brontosaurus out of sugar paste and sheet cake for her two-year-olds birthday drives a stake through your heart.”
—Melinda M. Marshall (20th century)