Copy the page URI to the clipboard
Yang, Hui and Zhang, Minjie
(2003).
DOI: https://doi.org/10.1007/b94701
URL: http://www.informatik.uni-trier.de/~ley/db/conf/au...
Abstract
As the number and diversity of distributed information sources on the Internet exponentially increase, it is difficult for the user to know which databases are appropriate to search. Given database language models that describe the content of each database, database selection services can provide assistance in locate relevant databases of the users information need. In this paper, we propose a database selection approach based on statistical language modeling. The basic idea behind the approach is that, for the databases that are categorized into a topic hierarchy, individual language models are estimated at different search stages, and then the databases are ranked by the similarity to the query according to the estimated language model. Two-stage smoothed language models are presented to circumvent the inaccuracy due to word sparseness. Experimental results demonstrate such a language modeling approach is competitive with current state-of-the-art database selection approaches.
Viewing alternatives
Metrics
Public Attention
Altmetrics from AltmetricNumber of Citations
Citations from DimensionsItem Actions
Export
About
- Item ORO ID
- 12997
- Item Type
- Book Section
- ISBN
- 3-540-20646-9, 978-3-540-20646-0
- Extra Information
- 16th Australian Conference on AI, Perth, Australia, December 3-5, 2003. Proceedings
- Academic Unit or School
-
Faculty of Science, Technology, Engineering and Mathematics (STEM) > Computing and Communications
Faculty of Science, Technology, Engineering and Mathematics (STEM) - Research Group
- Centre for Research in Computing (CRC)
- Depositing User
- Hui Yang