Magalhães, João and Rüger, Stefan
An information-theoretic framework for semantic-multimedia retrieval.
ACM Transactions on Information Systems (TOIS), 28(4),
This article is set in the context of searching text and image repositories by keyword. We develop a unified probabilistic framework for text, image, and combined text and image retrieval that is based on the detection of keywords (concepts) using automated image annotation technology. Our framework is deeply rooted in information theory and lends itself to use with other media types.
We estimate a statistical model in a multimodal feature space for each possible query keyword. The key element of our framework is to identify feature space transformations that make the text and visual feature spaces comparable in complexity and density. We select the optimal multimodal feature space with a minimum description length criterion from a set of candidate feature spaces, which are computed with the average-mutual-information criterion for the text part and hierarchical expectation maximization for the visual part of the data. We evaluate our approach in three retrieval experiments (text-only retrieval, image-only retrieval, and combined text and image retrieval), verify the framework’s low computational complexity, and compare it with existing state-of-the-art ad hoc models.
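As a rough illustration of selecting among candidate feature spaces with a minimum description length criterion, the sketch below scores each candidate with a common two-part approximation: the negative log-likelihood of the data plus (k/2) log n for a model with k free parameters fitted on n samples. This is an assumption made for illustration and not necessarily the formulation used in the article; the `Candidate` container, the function names, and all numbers are hypothetical.

```python
import math
from typing import Dict, NamedTuple


class Candidate(NamedTuple):
    """A model fitted in one candidate feature space (hypothetical container)."""
    neg_log_likelihood: float  # data cost in nats: -log p(data | model)
    num_params: int            # number of free parameters in the fitted model


def description_length(c: Candidate, num_samples: int) -> float:
    """Two-part code length: data cost plus a (k/2) * log(n) model cost."""
    return c.neg_log_likelihood + 0.5 * c.num_params * math.log(num_samples)


def select_by_mdl(candidates: Dict[str, Candidate], num_samples: int) -> str:
    """Return the name of the candidate with the shortest description length."""
    return min(
        candidates,
        key=lambda name: description_length(candidates[name], num_samples),
    )


# Made-up values for illustration only: larger feature spaces fit the data
# better but cost more to describe.
candidates = {
    "small": Candidate(neg_log_likelihood=1200.0, num_params=10),
    "medium": Candidate(neg_log_likelihood=1050.0, num_params=40),
    "large": Candidate(neg_log_likelihood=1040.0, num_params=200),
}
best = select_by_mdl(candidates, num_samples=1000)
```

With these made-up values the penalty term outweighs the small gain in fit that the largest candidate offers, so the criterion settles on the intermediate one. That trade-off between model complexity and fit is the general idea behind MDL-based selection.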
This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution.
Keywords: algorithms; measurement; experimentation; indexing; search; multimedia; automated keyword annotation
Knowledge Media Institute
16 Feb 2011 12:18
25 Oct 2012 17:21