Copy the page URI to the clipboard
Cano Basave, Amparo Elizabeth; He, Yulan and Xu, Ruifeng
(2014).
URL: http://acl2014.org/
Abstract
Latent topics derived by topic models such as Latent Dirichlet Allocation (LDA) are the result of hidden thematic structures which provide further insights into the data. The automatic labelling of such topics derived from social media poses however new challenges since topics may characterise novel events happening in the real world. Existing automatic topic labelling approaches which depend on external knowledge sources become less applicable here since relevant articles/concepts of the extracted topics may not exist in external sources. In this paper we propose to address the problem of automatic labelling of latent topics learned from Twitter as a summarisation problem. We introduce a framework which apply summarisation algorithms to generate topic labels. These algorithms are independent of external sources and only rely on the identification of dominant terms in documents related to the latent topic. We compare the efficiency of existing state of the art summarisation algorithms. Our results suggest that summarisation algorithms generate better topic labels which capture event-related context compared to the top-n terms returned by LDA.
Viewing alternatives
Download history
Item Actions
Export
About
- Item ORO ID
- 41413
- Item Type
- Conference or Workshop Item
- ISBN
- 1-937284-73-5, 978-1-937284-73-2
- Project Funding Details
-
Funded Project Name Project ID Funding Body Not Set EP/J020427/1 EPRSC EU-FP7 project SENSE4US 611242 EU Not Set GJHZ20120613110641217 Shenzhen International Cooperation Research Funding - Keywords
- topic models, automatic labelling
- Academic Unit or School
-
Faculty of Science, Technology, Engineering and Mathematics (STEM) > Knowledge Media Institute (KMi)
Faculty of Science, Technology, Engineering and Mathematics (STEM) - Research Group
- Centre for Research in Computing (CRC)
- Copyright Holders
- © 2014 Association for Computational Linguistics
- Depositing User
- Amparo Cano Basave