Boosting image retrieval through aggregating search results based on visual annotations
Source:
ACM Multimedia, ACM, Vancouver, Canada (2008)
Abstract:
Online photo sharing systems, such as Flickr and Picasa, provide a
valuable source of human-annotated photos. Textual annotations are used
not only to describe the visual content of an image, but also subjective,
spatial, temporal and social dimensions, complicating the task of
keyword-based search. In this paper we investigate a method
that exploits visual annotations, e.g. notes in Flickr, to enhance
keyword-based systems retrieval performance. For this purpose we
adopt the bag-of-visual-words approach for content-based image
retrieval as our baseline. We then apply rank aggregation
of the top 25 results obtained with a set of visual annotations that
match the keyword-based query.
The results on retrieval experiments show significant
improvements in retrieval performance when comparing the aggregated
approach with our baseline, which also slightly outperforms text-only
search. When using a textual filter on the search space
in combination with the aggregated approach an additional boost in retrieval
performance is observed, which underlines the need for large scale
content-based image retrieval techniques to complement the text-based
search.
Download:
ACM COPYRIGHT NOTICE. Copyright © 2009 by the Association for Computing Machinery, Inc. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Publications Dept., ACM, Inc., fax +1 (212) 869-0481, or
permissions@acm.org.