The Open University

Document structure

Power, Richard; Scott, Donia and Bouayad-Agha, Nadjet (2003). Document structure. Computational Linguistics, 29(2) pp. 211–260.

We argue the case for abstract document structure as a separate descriptive level in the analysis and generation of written texts. The purpose of this representation is to mediate between the message of a text (i.e., its discourse structure) and its physical presentation (i.e., its organization into graphical constituents like sections, paragraphs, sentences, bulleted lists, figures, and footnotes). Abstract document structure can be seen as an extension of Nunberg's "text-grammar"; it is also closely related to "logical" markup in languages like HTML and LaTeX. We show that by using this intermediate representation, several subtasks in language generation and language understanding can be defined more cleanly.

Item Type: Journal Item
Copyright Holders: 2003 Association for Computational Linguistics
ISSN: 1530-9312
Keywords: document structure; discourse connectives; natural language generation
Academic Unit/School: Faculty of Science, Technology, Engineering and Mathematics (STEM) > Computing and Communications; Faculty of Science, Technology, Engineering and Mathematics (STEM)
Research Group: Centre for Research in Computing (CRC)
Item ID: 4841
Depositing User: Richard Power
Date Deposited: 11 Jul 2006
Last Modified: 07 Dec 2018 08:57