The Open UniversitySkip to content
 

Classifying document titles based on information inference

Song, Dawei; Bruza, Peter; Huang, Helen and Lau, Raymond (2003). Classifying document titles based on information inference. In: Not Set ed. Foundations of Intelligent Systems. Springer Berlin / Heidelberg, Volume 2871/2003 (Volume 2871/2003). Springer: Berlin, pp. 297–306.

DOI (Digital Object Identifier) Link: http://dx.doi.org/10.1007/b14019
Google Scholar: Look up in Google Scholar

Abstract

We propose an intelligent document title classification agent based on a theory of information inference. The information is represented as vectorial spaces computed by a cognitively motivated model, namely Hyperspace Analogue to Language (HAL). A combination heuristic is used to combine a group of concepts into one single combination vector. Information inference can be performed on the HAL spaces via computing information flow between vectors or combination vectors. Based on this theory, a document title is treated as a combination vector by applying the combination heuristic to all the non-stop terms in the title. Two methodologies for learning and assigning categories to document titles are addressed. Experimental results on Reuters-21578 corpus show that our framework is promising and its performance achieves 71% of the upper bound (which is approximated by using whole documents).

Item Type: Book Chapter
ISBN: 3-540-20256-0, 978-3-540-20256-1
Academic Unit/Department: Mathematics, Computing and Technology > Computing & Communications
Knowledge Media Institute
Item ID: 9331
Depositing User: Users 6898 not found.
Date Deposited: 28 Sep 2007
Last Modified: 22 Jun 2012 11:35
URI: http://oro.open.ac.uk/id/eprint/9331
Share this page:

Actions (login may be required)

View Item
Report issue / request change

Policies | Disclaimer

© The Open University   + 44 (0)870 333 4340   general-enquiries@open.ac.uk