Copy the page URI to the clipboard
Magalhães, João and Rüger, Stefan
(2010).
DOI: https://doi.org/10.1145/1852102.1852105
Abstract
This article is set in the context of searching text and image repositories by keyword. We develop a unified probabilistic framework for text, image, and combined text and image retrieval that is based on the detection of keywords (concepts) using automated image annotation technology. Our framework is deeply rooted in information theory and lends itself to use with other media types.
We estimate a statistical model in a multimodal feature space for each possible query keyword. The key element of our framework is to identify feature space transformations that make them comparable in complexity and density. We select the optimal multimodal feature space with a minimum description length criterion from a set of candidate feature spaces that are computed with the average-mutual-information criterion for the text part and hierarchical expectation maximization for the visual part of the data. We evaluate our approach in three retrieval experiments (only text retrieval, only image retrieval, and text combined with image retrieval), verify the framework’s low computational complexity, and compare with existing state-of-the-art ad-hoc models.
Viewing alternatives
Download history
Metrics
Public Attention
Altmetrics from AltmetricNumber of Citations
Citations from DimensionsItem Actions
Export
About
- Item ORO ID
- 28193
- Item Type
- Journal Item
- ISSN
- 1558-2868
- Extra Information
- This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution.
- Keywords
- algorithms; measurement; experimentation; indexing; search, multimedia; automated keyword annotation
- Academic Unit or School
-
Faculty of Science, Technology, Engineering and Mathematics (STEM) > Knowledge Media Institute (KMi)
Faculty of Science, Technology, Engineering and Mathematics (STEM) - Copyright Holders
- © 2010 ACM
- Depositing User
- Stefan Rüger