KMI, The Open University at NTCIR-9 CrossLink: Cross-Lingual Link Discovery in Wikipedia using explicit semantic analysis

Knoth, Petr; Zilka, Lukas and Zdrahal, Zdenek (2011). KMI, The Open University at NTCIR-9 CrossLink: Cross-Lingual Link Discovery in Wikipedia using explicit semantic analysis. In: NTCIR-9: The 9th NTCIR Workshop Meeting: Evaluation of Information Access Technologies: Information Retrieval, Question Answering, and Cross-Lingual Information Access, 6-9 Dec 2011, Tokyo, Japan.

URL: http://research.nii.ac.jp/ntcir/ntcir-9/

Abstract

This paper describes the methods used in the submission of the Knowledge Media Institute (KMI), The Open University, to the NTCIR-9 Cross-Lingual Link Discovery (CLLD) task entitled CrossLink. KMI submitted four runs for link discovery from English to Chinese; however, the developed methods, which utilise Explicit Semantic Analysis (ESA), are also applicable to other language combinations. Three of the runs are based on exploiting the existing cross-lingual mapping between different versions of Wikipedia articles. In the fourth run, we assume information about the mapping is not available. Our methods achieved encouraging results and we describe in detail how their performance can be further improved. Finally, we discuss two important issues in link discovery: the evaluation methodology and the applicability of the developed methods across different textual collections.
