Knoth, Petr; Zilka, Lukas and Zdrahal, Zdenek (2011).
URL: http://www.cfilt.iitb.ac.in/~clia2011/
Abstract
This paper explores how to automatically generate cross-language links between resources in large document collections. The paper presents new methods for Cross-Lingual Link Discovery (CLLD) based on Explicit Semantic Analysis (ESA). The methods are applicable to any multilingual document collection. In this report, we present a comparative study of these methods on the Wikipedia corpus and provide new insights into the evaluation of link discovery systems. In particular, we measure the agreement of human annotators in linking articles in different language versions of Wikipedia, and compare it to the results achieved by the presented methods.
About
- Item ORO ID: 29584
- Item Type: Conference or Workshop Item
- Keywords: CORE
- Academic Unit or School: Faculty of Science, Technology, Engineering and Mathematics (STEM) > Knowledge Media Institute (KMi); Faculty of Science, Technology, Engineering and Mathematics (STEM)
- Research Group: Centre for Research in Computing (CRC); Big Scientific Data and Text Analytics Group (BSDTAG)
- Copyright Holders: © Unknown
- Depositing User: Kay Dave