The Open UniversitySkip to content
 

A corpus analysis of discourse relations for Natural Language Generation

Williams, Sandra and Reiter, Ehud (2003). A corpus analysis of discourse relations for Natural Language Generation. In: Proceedings of the Corpus Linguistics 2003 conference, 28 - 31 March, 2003, Lancaster University, UK.
Full text available as:
[img]
Preview
PDF (Not Set) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Download (277Kb) | Preview
    URL: http://ucrel.lancs.ac.uk/cl2003/#proceedings
    Google Scholar Look up in Google Scholar

    Abstract

    We are developing a Natural Language Generation (NLG) system that generates texts tailored for the reading ability of individual readers. As part of building the system, GIRL (Generator for Individual Reading Levels), we carried out an analysis of the RST Discourse Treebank Corpus to find out how human writers linguistically realise discourse relations. The goal of the analysis was (a) to create a model of the choices that need to be made when realising discourse relations, and (b) to understand how these choices were typically made for “normal” readers, for a variety of discourse relations. We present our results for discourse relations: concession, condition, elaboration additional, evaluation, example, reason and restatement. We discuss the results and how they were used in GIRL.

    Item Type: Conference or Workshop Item
    Extra Information: ISBN 1 86220 131 5
    Academic Unit/Department: Mathematics, Computing and Technology > Computing
    Interdisciplinary Research Centre: Centre for Research in Computing (CRC)
    Item ID: 12456
    Depositing User: Sandra Williams
    Date Deposited: 26 Nov 2008 07:23
    Last Modified: 05 Dec 2010 05:38
    URI: http://oro.open.ac.uk/id/eprint/12456
    Repository Staff Only: edit this item
    Public: Report issue/request change

    Policies | Disclaimer

    © The Open University   + 44 (0)870 333 4340   general-enquiries@open.ac.uk