The Open UniversitySkip to content
 

A corpus analysis of discourse relations for Natural Language Generation

Williams, Sandra and Reiter, Ehud (2003). A corpus analysis of discourse relations for Natural Language Generation. In: Proceedings of the Corpus Linguistics 2003 conference, 28 - 31 March, 2003, Lancaster University, UK, pp. 899–908.

Full text available as:
[img]
Preview
PDF (Not Set) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Download (277Kb)
URL: http://ucrel.lancs.ac.uk/cl2003/#proceedings
Google Scholar: Look up in Google Scholar

Abstract

We are developing a Natural Language Generation (NLG) system that generates texts tailored for the reading ability of individual readers. As part of building the system, GIRL (Generator for Individual Reading Levels), we carried out an analysis of the RST Discourse Treebank Corpus to find out how human writers linguistically realise discourse relations. The goal of the analysis was (a) to create a model of the choices that need to be made when realising discourse relations, and (b) to understand how these choices were typically made for “normal” readers, for a variety of discourse relations. We present our results for discourse relations: concession, condition, elaboration additional, evaluation, example, reason and restatement. We discuss the results and how they were used in GIRL.

Item Type: Conference Item
Extra Information: ISBN 1 86220 131 5
Academic Unit/Department: Mathematics, Computing and Technology > Computing & Communications
Interdisciplinary Research Centre: Centre for Research in Computing (CRC)
Item ID: 12456
Depositing User: Sandra Williams
Date Deposited: 26 Nov 2008 07:23
Last Modified: 05 Dec 2010 05:38
URI: http://oro.open.ac.uk/id/eprint/12456
Share this page:

Actions (login may be required)

View Item
Report issue / request change

Policies | Disclaimer

© The Open University   + 44 (0)870 333 4340   general-enquiries@open.ac.uk