The Open UniversitySkip to content
 

Categorizing children: automated text classification of CHILDES files

Opsomer, Rob; Knoth, Petr; van Polen, Freek; Trapman, Jantine and Wiering, Marco (2008). Categorizing children: automated text classification of CHILDES files. In: The 20th Belgian-Netherlands Conference on Artificial Intelligence (BNAIC 2008), 30 - 31 Oct 2008, Enchede, The Netherlands.

Full text available as:
Full text not publicly available
Due to copyright restrictions, this file is not available for public download
Google Scholar: Look up in Google Scholar

Abstract

In this paper we present the application of machine learning text classification methods to two tasks: categorization of children's speech in the CHILDES Database according to gender and age. Both tasks are binary. For age, we distinguish two age groups between the age of 1.9 and 3.0 years old. The boundary between the groups lies at the age of 2.4 which is both the mean and the median of the age in our data set. We show that the machine learning approach, based on a bag of words, can achieve much better results than features such as average utterance length or Type-Token Ratio, which are methods traditionally used by linguists. We have achieved 80.5% and 70.5% classification accuracy for the age and gender task respectively.

Item Type: Conference Item
Copyright Holders: 2008 Universiteit Twente, Enschede
ISSN: 1568-7805
Extra Information: Proceedings of the twentieth Belgian-Dutch Conference on
Artificial Intelligence.
Enschede, October 30-31, 2008.
Anton Nijholt, Maja Pantic, Mannes Poel and Hendri Hondorp (eds.)
Academic Unit/Department: Knowledge Media Institute
Interdisciplinary Research Centre: Centre for Research in Computing (CRC)
Related URLs:
Item ID: 24749
Depositing User: Kay Dave
Date Deposited: 19 Nov 2010 09:24
Last Modified: 25 Oct 2012 01:53
URI: http://oro.open.ac.uk/id/eprint/24749
Share this page:

Actions (login may be required)

View Item
Report issue / request change

Policies | Disclaimer

© The Open University   + 44 (0)870 333 4340   general-enquiries@open.ac.uk