Knoth, Petr; Zilka, Lukas and Zdrahal, Zdenek
KMI, The Open University at NTCIR-9 CrossLink: Cross-Lingual Link Discovery in Wikipedia using explicit semantic analysis.
In: NTCIR-9: The 9th NTCIR Workshop Meeting: Evaluation of Information Access Technologies: Information Retrieval, Question Answering, and Cross-Lingual Information Access, 6-9 December 2011, Tokyo, Japan.
Full text available as:
This paper describes the methods used in the submission of Knowledge Media institute (KMI), The Open University to the NTCIR-9 Cross-Lingual Link Discovery (CLLD)task entitled CrossLink. KMI submitted four runs for link discovery from English to Chinese; however, the developed methods, which utilise Explicit Semantic Analysis (ESA), are applicable also to other language combinations. Three of the runs are based on exploiting the existing cross-lingual mapping between different versions of Wikipedia articles. In the fourth run, we assume information about the mapping is not available. Our methods achieved encouraging results and we describe in detail how their performance can be further improved. Finally, we discuss two important issues in link discovery: the evaluation methodology and the applicability of the developed methods across dfferent textual collections.
Actions (login may be required)