Automatic Image Annotation - Some Major Work

  • Word co-occurrence model
Y Mori, H Takahashi, and R Oka (1999). "Image-to-word transformation based on dividing and vector quantizing images with words". Proceedings of the International Workshop on Multimedia Intelligent Storage and Retrieval Management.
  • Annotation as machine translation
P Duygulu, K Barnard, N de Freitas, and D Forsyth (2002). "Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary". Proceedings of the European Conference on Computer Vision. pp. 97–112. http://vision.cs.arizona.edu/kobus/research/publications/ECCV-02-1/.
  • Statistical models
J Li and J Z Wang (2006). "Real-time Computerized Annotation of Pictures". Proc. ACM Multimedia. pp. 911–920. http://www-db.stanford.edu/~wangz/project/imsearch/ALIP/ACMMM06/.
J Z Wang and J Li (2002). "Learning-Based Linguistic Indexing of Pictures with 2-D MHMMs". Proc. ACM Multimedia. pp. 436–445. http://www-db.stanford.edu/~wangz/project/imsearch/ALIP/ACM02/.
  • Automatic linguistic indexing of pictures
J Li and J Z Wang (2008). "Real-time Computerized Annotation of Pictures". IEEE Trans. on Pattern Analysis and Machine Intelligence. http://infolab.stanford.edu/~wangz/project/imsearch/ALIP/PAMI08/.
J Li and J Z Wang (2003). "Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach". IEEE Trans. on Pattern Analysis and Machine Intelligence. pp. 1075–1088. http://www-db.stanford.edu/~wangz/project/imsearch/ALIP/PAMI03/.
  • Hierarchical Aspect Cluster Model
K Barnard and D A Forsyth (2001). "Learning the Semantics of Words and Pictures". Proceedings of International Conference on Computer Vision. pp. 408–415. http://kobus.ca/research/publications/ICCV-01/.
  • Latent Dirichlet Allocation model
D Blei, A Ng, and M Jordan (2003). "Latent Dirichlet allocation". Journal of Machine Learning Research. pp. 3:993–1022. http://www.ics.uci.edu/~liang/seminars/win05/papers/blei03-latent-dirichlet.pdf.
  • Supervised multiclass labeling
G Carneiro, A B Chan, P Moreno, and N Vasconcelos (2007). "Supervised Learning of Semantic Classes for Image Annotation and Retrieval". IEEE Trans. on Pattern Analysis and Machine Intelligence. pp. 394–410. http://www.svcl.ucsd.edu/publications/journal/2007/pami/pami07-semantics.pdf.
  • Texture similarity
R W Picard and T P Minka (1995). "Vision Texture for Annotation". Multimedia Systems. http://citeseer.ist.psu.edu/picard95vision.html.
  • Support Vector Machines
C Cusano, G Ciocca, and R Schettini (2004). "Image Annotation Using SVM". Proceedings of Internet Imaging IV. http://adsabs.harvard.edu/cgi-bin/nph-bib_query?bibcode=2003SPIE.5304..330C&db_key=INST.
  • Ensemble of Decision Trees and Random Subwindows
R Maree, P Geurts, J Piater, and L Wehenkel (2005). "Random Subwindows for Robust Image Classification". Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition. pp. 1:34–40. http://www.montefiore.ulg.ac.be/~maree/#publications.
  • Maximum Entropy
J Jeon and R Manmatha (2004). "Using Maximum Entropy for Automatic Image Annotation". Int'l Conf on Image and Video Retrieval (CIVR 2004). pp. 24–32. http://ciir.cs.umass.edu/pubfiles/mm-355.pdf.
  • Relevance models
J Jeon, V Lavrenko, and R Manmatha (2003). "Automatic image annotation and retrieval using cross-media relevance models". Proceedings of the ACM SIGIR Conference on Research and Development in Information Retrieval. pp. 119–126. http://ciir.cs.umass.edu/pubfiles/mm-41.pdf.
  • Relevance models using continuous probability density functions
V Lavrenko, R Manmatha, and J Jeon (2003). "A model for learning the semantics of pictures". Proceedings of the 16th Conference on Advances in Neural Information Processing Systems NIPS. http://ciir.cs.umass.edu/pubfiles/mm-46.pdf.
  • Coherent Language Model
R Jin, J Y Chai, and L Si (2004). "Effective Automatic Image Annotation via A Coherent Language Model and Active Learning". Proceedings of MM'04. http://www.cse.msu.edu/~rongjin/publications/acmmm04.jin.pdf.
  • Inference networks
D Metzler and R Manmatha (2004). "An inference network approach to image retrieval". Proceedings of the International Conference on Image and Video Retrieval. pp. 42–50. http://ciir.cs.umass.edu/pubfiles/mm-346.pdf.
  • Multiple Bernoulli distribution
S Feng, R Manmatha, and V Lavrenko (2004). "Multiple Bernoulli relevance models for image and video annotation". IEEE Conference on Computer Vision and Pattern Recognition. pp. 1002–1009. http://ciir.cs.umass.edu/pubfiles/mm-333.pdf.
  • Multiple design alternatives
J Y Pan, H-J Yang, P Duygulu and C Faloutsos (2004). "Automatic Image Captioning". Proceedings of the 2004 IEEE International Conference on Multimedia and Expo (ICME'04). http://www.informedia.cs.cmu.edu/documents/ICME04AutoICap.pdf.
  • Natural scene annotation
J Fan, Y Gao, H Luo and G Xu (2004). "Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation". Proceedings of the 27th annual international conference on Research and development in information retrieval. pp. 361–368. http://portal.acm.org/ft_gateway.cfm?id=1009055&type=pdf&coll=GUIDE&dl=GUIDE&CFID=1581830&CFTOKEN=99651762.
  • Relevant low-level global filters
A Oliva and A Torralba (2001). "Modeling the shape of the scene: a holistic representation of the spatial envelope". International Journal of Computer Vision. pp. 42:145–175. http://cvcl.mit.edu/Papers/IJCV01-Oliva-Torralba.pdf.
  • Global image features and nonparametric density estimation
A Yavlinsky, E Schofield and S Rüger (2005). "Automated Image Annotation Using Global Features and Robust Nonparametric Density Estimation". Int'l Conf on Image and Video Retrieval (CIVR, Singapore, Jul 2005). http://km.doc.ic.ac.uk/www-pub/civr05-annotation.pdf.
  • Video semantics
N Vasconcelos and A Lippman (2001). "Statistical Models of Video Structure for Content Analysis and Characterization". IEEE Transactions on Image Processing. pp. 1–17. http://www.svcl.ucsd.edu/publications/journal/2000/ip/ip00.pdf.
Ilaria Bartolini, Marco Patella, and Corrado Romani (2010). "Shiatsu: Semantic-based Hierarchical Automatic Tagging of Videos by Segmentation Using Cuts". 3rd ACM International Multimedia Workshop on Automated Information Extraction in Media Production (AIEMPro10). http://dl.acm.org/citation.cfm?doid=1862344.1862364.
  • Image Annotation Refinement
Yohan Jin, Latifur Khan, Lei Wang, and Mamoun Awad (2005). "Image annotations by combining multiple evidence & wordNet". 13th Annual ACM International Conference on Multimedia (MM 05). pp. 706–715. http://portal.acm.org/citation.cfm?id=1101305&dl=GUIDE.
Changhu Wang, Feng Jing, Lei Zhang, and Hong-Jiang Zhang (2006). "Image annotation refinement using random walk with restarts". 14th Annual ACM International Conference on Multimedia (MM 06). http://portal.acm.org/citation.cfm?id=1180639.1180774#.
Changhu Wang, Feng Jing, Lei Zhang, and Hong-Jiang Zhang (2007). "Content-based image annotation refinement". IEEE Conference on Computer Vision and Pattern Recognition (CVPR 07). http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=4270246.
Ilaria Bartolini and Paolo Ciaccia (2007). "Imagination: Exploiting Link Analysis for Accurate Image Annotation". Springer Adaptive Multimedia Retrieval. http://www.springerlink.com/content/p45754gl2502u852/.
Ilaria Bartolini and Paolo Ciaccia (2010). "Multi-dimensional Keyword-based Image Annotation and Search". 2nd ACM International Workshop on Keyword Search on Structured Data (KEYS 2010). http://dl.acm.org/citation.cfm?doid=1868366.1868371.
  • Automatic Image Annotation by Ensemble of Visual Descriptors
Emre Akbas and Fatos Y. Vural (2007). "Automatic Image Annotation by Ensemble of Visual Descriptors". IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) 2007, Workshop on Semantic Learning Applications in Multimedia. http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4270482.
  • Simultaneous Image Classification and Annotation
Chong Wang, David Blei, and Li Fei-Fei (2009). "Simultaneous Image Classification and Annotation". IEEE Conf. on Computer Vision and Pattern Recognition (CVPR). http://cs.stanford.edu/groups/vision/documents/WangBleiFei-Fei_CVPR2009.pdf.
