The Open UniversitySkip to content
 

EUROGENE: multilingual retrieval and machine translation applied to human genetics

Knoth, Petr; Collins, Trevor; Sklavounou, Elsa and Zdrahal, Zdenek (2010). EUROGENE: multilingual retrieval and machine translation applied to human genetics. In: The 32nd European Conference on Information Retrieval (ECIR 2010), 28-31 Mar 2010, Milton Keynes, UK, Springer, pp. 670–671.

Full text available as:
Full text not publicly available
Due to copyright restrictions, this file is not available for public download
Click here to request a copy from the OU Author.
DOI (Digital Object Identifier) Link: http://dx.doi.org/10.1007/978-3-642-12275-0_74
Google Scholar: Look up in Google Scholar

Abstract

The objective of Eurogene is to collect a critical mass of educational content in the field of human genetics in nine European languages and to build a platform that will support the retrieval, sharing and navigation over the learning content. The Eurogene platform is already operational and is being used by the genetics community. In this paper, a part of the Eurogene platform related to the retrieval and machine translation of domain specific content is described. Our contribution lies in an approach for domain-specific adaption of cross-language information retrieval (CLIR) and machine translation (MT). The CLIR system is based on a multilingual domain ontology which is also used as a synchronization component between CLIR and MT. The MT system is adapted to the target domain using the terminology represented in the ontology and using statistical training performed on a collection of parallel texts. In the statistical training phase, new translations of a term can be discovered and used for ontology updating. The paper is organized as follows. First, we describe the motivation for our approach and the multilingual domain ontology. Later, the CLIR and MT components and their domain adaption and synchronization are discussed.

Item Type: Conference Item
Copyright Holders: 2010 Springer-Verlag Berlin Heidelberg
ISBN: 3-642-12274-4, 978-3-642-12274-3
ISSN: 0302-9743
Project Funding Details:
Funded Project NameProject IDFunding Body
Not SetNot SetEurogene (The First Pan-European Learning Service in the Field of Genetics, Contract no. ECP-2006-EDU-410018)
Extra Information: The original publication is available at www.springerlink.com
Academic Unit/Department: Knowledge Media Institute
Interdisciplinary Research Centre: Centre for Research in Computing (CRC)
Item ID: 23402
Depositing User: Kay Dave
Date Deposited: 05 Oct 2010 11:45
Last Modified: 12 May 2013 13:06
URI: http://oro.open.ac.uk/id/eprint/23402
Share this page:

Actions (login may be required)

View Item
Report issue / request change

Policies | Disclaimer

© The Open University   + 44 (0)870 333 4340   general-enquiries@open.ac.uk