Publication

PLSA on Large-scale Image Databases

Source:

Proceedings of the 2007 International Conference on Acoustics, Speech and Signal Processing, IEEE, Honolulu, Hawaii (2007)

URL:

http://www.informatik.uni-augsburg.de/lehrstuehle/mmc/publications/reports/MMC20/TR2006-33.pdf

Abstract:

The web and image repositories such as Fickr are the largest image databases in the world. There are billions of images on the web, and hundreds of million high-quality images in image repositories. Currently, these images are indexed based on manually-entered tags and individual and group usage patterns. In this work we explore a third information dimension: image features. We explore probabilistic latent semantic analysis (pLSA) in order to infer which visual patterns describe each object. We build models that connect words and image features, and use content features and tags to find similar images. We demonstrate that image features using gray-scale salient points and an aspect model based on pLSA outperforms a conventional word-frequency model as well as refined color-histrogram approach on an image-similarity task.

Download:

© 2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.