Mining cross-document relationships from text

Knoth, Petr and Zdrahal, Zdenek (2011). Mining cross-document relationships from text. In: The First International Conference on Advances in Information Mining and Management (IMMM 2011), 23-28 Oct 2011, Barcelona, Spain.



The paper argues that automatic link generation and typing methods are needed to find and maintain cross document links in large and growing textual collections. Such links are important to organise information and to support search and navigation. We present an experimental study on mining cross document links from a collection of 5000 documents. We identify a set of link types and show that the value of semantic similarity is a good distinguishing indicator.

Viewing alternatives

Download history

Item Actions