The Open UniversitySkip to content
 

Weakly-supervised joint sentiment-topic detection from text

Lin, Chenghua; He, Yulan; Everson, Richard and Rüger, Stefan (2012). Weakly-supervised joint sentiment-topic detection from text. IEEE Transactions on Knowledge and Data Engineering, 24(6) pp. 1134–1145.

Full text available as:
[img]
Preview
PDF (Accepted Manuscript) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Download (436Kb)
DOI (Digital Object Identifier) Link: http://dx.doi.org/10.1109/TKDE.2011.48
Google Scholar: Look up in Google Scholar

Abstract

Sentiment analysis or opinion mining aims to use automated tools to detect subjective information such as opinions, attitudes, and feelings expressed in text. This paper proposes a novel probabilistic modeling framework called joint sentiment-topic (JST) model based on latent Dirichlet allocation (LDA), which detects sentiment and topic simultaneously from text. A reparameterized version of the JST model called Reverse-JST, by reversing the sequence of sentiment and topic generation in the modelling process, is also studied. Although JST is equivalent to Reverse-JST without hierarchical prior, extensive experiments show that when sentiment priors are added, JST performs consistently better than Reverse-JST. Besides, unlike supervised approaches to sentiment classification which often fail to produce satisfactory performance when shifting to other domains, the weakly-supervised nature of JST makes it highly portable to other domains. This is verified by the experimental results on datasets from five different domains where the JST model even outperforms existing semi-supervised approaches in some of the datasets despite using no labelled documents. Moreover, the topics and topic sentiment detected by JST are indeed coherent and informative. We hypothesize that the JST model can readily meet the demand of large-scale sentiment analysis from the web in an open-ended fashion.

Item Type: Journal Article
Copyright Holders: 2012 IEEE
ISSN: 1041-4347
Keywords: machine learning; sentiment analysis; text analysis; joint sentiment-topic model; latent Dirichlet allocation; opinion mining
Academic Unit/Department: Knowledge Media Institute
Interdisciplinary Research Centre: Centre for Research in Computing (CRC)
Item ID: 31503
Depositing User: Yulan He
Date Deposited: 25 Jan 2012 15:19
Last Modified: 24 Oct 2012 14:32
URI: http://oro.open.ac.uk/id/eprint/31503
Share this page:

Actions (login may be required)

View Item
Report issue / request change

Policies | Disclaimer

© The Open University   + 44 (0)870 333 4340   general-enquiries@open.ac.uk