Copy the page URI to the clipboard
Sarkar, Avik; De Roeck, Anne and Garthwaite, Paul
(2005).
URL: http://citeseerx.ist.psu.edu/viewdoc/download?doi=...
Abstract
In this paper, we propose to investigate style through modeling burstiness in the occurrence patterns of terms in different collections. We set out a fine grained model that looks at gaps between the successive occurrence of the term using a mixture of exponential distributions. A Bayesian framework allows flexibility in fitting the model. The parameter estimates are then studied to understand the distributional properties of a term in various collections. We investigate the behaviour of a range of terms and conclude that the model brings out useful features that may be deployed in the analysis of style.
Viewing alternatives
Item Actions
Export
About
- Item ORO ID
- 22562
- Item Type
- Conference or Workshop Item
- Keywords
- term burstiness, term re-occurrence, Bayesian analysis, mixture models, stylistic analysis, frequent terms
- Academic Unit or School
-
Faculty of Science, Technology, Engineering and Mathematics (STEM) > Computing and Communications
Faculty of Science, Technology, Engineering and Mathematics (STEM)
Faculty of Science, Technology, Engineering and Mathematics (STEM) > Mathematics and Statistics - Research Group
- Centre for Research in Computing (CRC)
- Copyright Holders
- © 2005 The Authors
- Depositing User
- Sarah Frain