The Open UniversitySkip to content
 

Interlingual annotation of parallel text corpora: a new framework for annotation and evaluation

Dorr, Bonnie J.; Passonneau, Rebecca J.; Farwell, David; Green, Rebecca; Habash, Nizar; Helmreich, Stephen; Hovy, Eduard; Levin, Lori; Miller, Keith J.; Mitamura, Teruko; Rambow, Owen and Siddharthan, Advaith (2010). Interlingual annotation of parallel text corpora: a new framework for annotation and evaluation. Natural Language Engineering, 16(3) pp. 197–243.

DOI (Digital Object Identifier) Link: https://doi.org/10.1017/S1351324910000070
Google Scholar: Look up in Google Scholar

Abstract

This paper focuses on an important step in the creation of a system of meaning representation and the development of semantically annotated parallel corpora, for use in applications such as machine translation, question answering, text summarization, and information retrieval. The work described below constitutes the first effort of any kind to annotate multiple translations of foreign-language texts with interlingual content. Three levels of representation are introduced: deep syntactic dependencies (IL0), intermediate semantic representations (IL1), and a normalized representation that unifies conversives, nonliteral language, and paraphrase (IL2). The resulting annotated, multilingually induced, parallel corpora will be useful as an empirical basis for a wide range of research, including the development and evaluation of interlingual NLP systems and paraphrase-extraction systems as well as a host of other research and development efforts in theoretical and applied linguistics, foreign language pedagogy, translation studies, and other related disciplines.

Item Type: Journal Item
Copyright Holders: 2010 Cambridge University Press
ISSN: 1351-3249
Project Funding Details:
Funded Project NameProject IDFunding Body
Not SetIIS0326553National Science Foundation (NSF)
Not SetIIS-0705832National Science Foundation (NSF)
Not SetIIS-0531176National Science Foundation (NSF)
Not SetNot SetJohns Hopkins Human Language Technology Center of Excellence
Academic Unit/School: Faculty of Science, Technology, Engineering and Mathematics (STEM) > Knowledge Media Institute (KMi)
Faculty of Science, Technology, Engineering and Mathematics (STEM)
Item ID: 58889
Depositing User: Advaith Siddharthan
Date Deposited: 28 Jan 2019 15:38
Last Modified: 29 Mar 2019 11:14
URI: http://oro.open.ac.uk/id/eprint/58889
Share this page:

Metrics

Altmetrics from Altmetric

Citations from Dimensions

Actions (login may be required)

Policies | Disclaimer

© The Open University   contact the OU