Copy the page URI to the clipboard
Knoth, Petr and Herrmannova, Drahomira
(2013).
URL: http://research.nii.ac.jp/ntcir/workshop/OnlinePro...
Abstract
Cross-Lingual Link Discovery (CLLD) aims to automatically find links between documents written in different languages. In this paper, we first present a relatively simple yet effective methods for CLLD in Wiki collections, explaining the fndings that motivated their design. Our methods (team KMI) achieved in the NTCIR-10 CrossLink-2 evaluation the best overall results in the English to Chinese, Japanese and Korean (E2CJK) task and were the top performers in the Chinese, Japanese, Korean to English task (CJK2E)1 [Tang et al.,2013]. Though tested on these language combinations, the methods are language agnostic and can be easily applied to any other language combination with sufficient corpora and available pre-processing tools. In the second part of the paper, we provide an in depth analysis of the nature of the task, the evaluation metrics and the impact of the system components on the overall CLLD performance. We believe a good understanding of these aspects is the key to improving CLLD systems in the future.
Viewing alternatives
Download history
Item Actions
Export
About
- Item ORO ID
- 37825
- Item Type
- Conference or Workshop Item
- Extra Information
-
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access
Technologies, June 18-21, 2013 Tokyo Japan
Edited by Noriko Kando, Kazuaki Kishida
2013 National Institute of Informatics
ISBN: ISBN 978-4-86049-062-1 - Keywords
- cross-lingual link discovery; link discovery; semantic similarity; explicit semantic analysis; NTCIR; Wikipedia; CORE
- Academic Unit or School
-
Faculty of Science, Technology, Engineering and Mathematics (STEM) > Knowledge Media Institute (KMi)
Faculty of Science, Technology, Engineering and Mathematics (STEM) - Research Group
-
Centre for Research in Computing (CRC)
Big Scientific Data and Text Analytics Group (BSDTAG) - Copyright Holders
- © 2013 National Institute of Informatics
- Related URLs
- Depositing User
- Kay Dave