Yan, Tingxu; Maxwell, Tamsin; Song, Dawei; Hou, Yuexian and Zhang, Peng
(2010).
|
PDF (Accepted Manuscript)
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Download (136Kb) |
| URL: | http://dl.acm.org/citation.cfm?id=1858864 |
|---|---|
| Google Scholar: | Look up in Google Scholar |
Abstract
Bag-of-words approaches to information retrieval (IR) are effective but assume independence between words. The Hyperspace Analogue to Language (HAL) is a cognitively motivated and validated semantic space model that captures statistical dependencies between words by considering their co-occurrences in a surrounding window of text. HAL has been successfully applied to query expansion in IR, but has several limitations, including high processing cost and use of distributional statistics that do not exploit syntax. In this paper, we pursue two methods for incorporating syntactic-semantic information from textual ‘events’ into HAL. We build the HAL space directly from events to investigate whether processing costs can be reduced through more careful definition of word co-occurrence, and improve the quality of the pseudo-relevance feedback by applying event information as a constraint during HAL construction. Both methods significantly improve performance results in comparison with original HAL, and interpolation of HAL and relevance model expansion outperforms either method alone.
| Item Type: | Conference Item |
|---|---|
| Copyright Holders: | 2010 Association for Computational Linguistics |
| Extra Information: | ACLShort '10
Proceedings of the ACL 2010 Conference Short Papers Association for Computational Linguistics, Stroudsburg, PA, USA ©2010 pp.120-125 |
| Academic Unit/Department: | Mathematics, Computing and Technology > Computing |
| Item ID: | 33902 |
| Depositing User: | Dawei Song |
| Date Deposited: | 21 Jun 2012 09:27 |
| Last Modified: | 23 Oct 2012 17:28 |
| URI: | http://oro.open.ac.uk/id/eprint/33902 |
Actions (login may be required)
| View Item | |
| Public: Report issue / request change |




