Knoth, Petr; Zilka, Lukas and Zdrahal, Zdenek
PDF (Version of Record)
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
|Google Scholar:||Look up in Google Scholar|
This paper describes the methods used in the submission of Knowledge Media institute (KMI), The Open University to the NTCIR-9 Cross-Lingual Link Discovery (CLLD)task entitled CrossLink. KMI submitted four runs for link discovery from English to Chinese; however, the developed methods, which utilise Explicit Semantic Analysis (ESA), are applicable also to other language combinations. Three of the runs are based on exploiting the existing cross-lingual mapping between different versions of Wikipedia articles. In the fourth run, we assume information about the mapping is not available. Our methods achieved encouraging results and we describe in detail how their performance can be further improved. Finally, we discuss two important issues in link discovery: the evaluation methodology and the applicability of the developed methods across dfferent textual collections.
|Item Type:||Conference Item|
|Copyright Holders:||The Authors|
|Keywords:||Cross-lingual Link Discovery; Link Discovery; Semantic Similarity; Explicit Semantic Analysis; NTCIR; Wikipedia|
|Academic Unit/Department:||Knowledge Media Institute|
|Interdisciplinary Research Centre:||Centre for Research in Computing (CRC)|
|Depositing User:||Kay Dave|
|Date Deposited:||25 Jan 2012 15:16|
|Last Modified:||24 Feb 2016 06:25|
|Share this page:|
Download history for this item
These details should be considered as only a guide to the number of downloads performed manually. Algorithmic methods have been applied in an attempt to remove automated downloads from the displayed statistics but no guarantee can be made as to the accuracy of the figures.