The Open University

Mining a web citation database for document clustering

He, Y. ; Hui, S. C. and Fong, A. C. M. (2002). Mining a web citation database for document clustering. Applied Artificial Intelligence, 16(4) pp. 283–302.

DOI (Digital Object Identifier) Link: http://dx.doi.org/10.1080/08839510252906462

Abstract

The World Wide Web has become an important medium for disseminating scientific publications, and many publications are now made available over the Web. However, existing search engines are ineffective at searching these publications, as they do not index Web publications that normally appear in PDF (Portable Document Format) or PostScript formats. One way to index Web publications is through citation indices, which contain the references that the publications cite. The Web Citation Database is a data warehouse that stores these citation indices. In this paper, we propose a mining process to extract document cluster knowledge from the Web Citation Database to support the retrieval of Web publications. The mining techniques used for document cluster generation are based on Kohonen's Self-Organizing Map (KSOM) and Fuzzy Adaptive Resonance Theory (Fuzzy ART). The proposed techniques have been incorporated into PubSearch, a citation-based retrieval system for Web scientific publications.
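
To make the idea of clustering documents by their citations concrete, the following is a minimal sketch of a generic single-pass Fuzzy ART clusterer. It is not the implementation used in the paper or in PubSearch. The toy data is an assumption: each document is a hypothetical binary vector marking which of four references it cites. The names `complement_code`, `fuzzy_and`, and `fuzzy_art`, and the parameter values (vigilance `rho`, choice parameter `alpha`, learning rate `beta`), are also illustrative choices.

```python
from typing import List


def complement_code(a: List[float]) -> List[float]:
    """Complement coding: append (1 - x) for every feature x."""
    return a + [1.0 - x for x in a]


def fuzzy_and(x: List[float], w: List[float]) -> List[float]:
    """Fuzzy AND: component-wise minimum of two vectors."""
    return [min(xi, wi) for xi, wi in zip(x, w)]


def fuzzy_art(inputs: List[List[float]], rho: float = 0.6,
              alpha: float = 0.001, beta: float = 1.0) -> List[int]:
    """Single-pass Fuzzy ART; returns one category index per input."""
    weights: List[List[float]] = []  # one weight vector per category
    labels: List[int] = []
    for a in inputs:
        i_vec = complement_code(a)
        norm_i = sum(i_vec)  # equals len(a) under complement coding

        # Rank existing categories by the choice function, best first.
        def choice(j: int) -> float:
            return sum(fuzzy_and(i_vec, weights[j])) / (alpha + sum(weights[j]))

        order = sorted(range(len(weights)), key=choice, reverse=True)

        chosen = None
        for j in order:
            # Vigilance test: accept the category only if it matches well enough.
            if sum(fuzzy_and(i_vec, weights[j])) / norm_i >= rho:
                chosen = j
                break

        if chosen is None:
            # No category passed the vigilance test: create a new one.
            weights.append(list(i_vec))
            chosen = len(weights) - 1
        else:
            # Move the winning weights toward the overlap with the input.
            overlap = fuzzy_and(i_vec, weights[chosen])
            weights[chosen] = [beta * o + (1.0 - beta) * w
                               for o, w in zip(overlap, weights[chosen])]
        labels.append(chosen)
    return labels


# Hypothetical toy data: rows are documents, columns are cited references.
docs = [
    [1.0, 1.0, 0.0, 0.0],
    [1.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 1.0],
    [0.0, 1.0, 1.0, 1.0],
]
labels = fuzzy_art(docs)
print(labels)  # prints [0, 0, 1, 1]
```

Raising the vigilance `rho` makes the clusters tighter. With `rho=0.9` the fourth toy document no longer matches the second cluster closely enough and is placed in a cluster of its own.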

Item Type: Journal Article
Copyright Holders: 2002 Taylor & Francis
ISSN: 0883-9514
Academic Unit/Department: Knowledge Media Institute
Interdisciplinary Research Centre: Centre for Research in Computing (CRC)
Item ID: 28566
Depositing User: Kay Dave
Date Deposited: 20 Apr 2011 13:40
Last Modified: 22 Oct 2012 09:36
URI: http://oro.open.ac.uk/id/eprint/28566