The Open UniversitySkip to content

Mining cross-document relationships from text

Knoth, Petr and Zdrahal, Zdenek (2011). Mining cross-document relationships from text. In: The First International Conference on Advances in Information Mining and Management (IMMM 2011), 23-28 Oct 2011, Barcelona, Spain.

Full text available as:
PDF (Accepted Manuscript) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Download (218kB)
Google Scholar: Look up in Google Scholar


The paper argues that automatic link generation and typing methods are needed to find and maintain cross document links in large and growing textual collections. Such links are important to organise information and to support search and navigation. We present an experimental study on mining cross document links from a collection of 5000 documents. We identify a set of link types and show that the value of semantic similarity is a good distinguishing indicator.

Item Type: Conference or Workshop Item
Copyright Holders: 2011 IARIA
Keywords: text mining; automatic link generation and typing; semantic similarity; digital libraries; CORE
Academic Unit/School: Faculty of Science, Technology, Engineering and Mathematics (STEM) > Knowledge Media Institute (KMi)
Faculty of Science, Technology, Engineering and Mathematics (STEM)
Research Group: Centre for Research in Computing (CRC)
Big Scientific Data and Text Analytics Group (BSDTAG)
Related URLs:
Item ID: 29302
Depositing User: Kay Dave
Date Deposited: 18 Nov 2011 10:22
Last Modified: 12 Jun 2020 08:30
Share this page:

Download history for this item

These details should be considered as only a guide to the number of downloads performed manually. Algorithmic methods have been applied in an attempt to remove automated downloads from the displayed statistics but no guarantee can be made as to the accuracy of the figures.

Actions (login may be required)

Policies | Disclaimer

© The Open University   contact the OU